Google launches “implicit caching” to make accessing its latest AI models cheaper

By support, May 8, 2025

Google is rolling out a feature in its Gemini API that the company claims will make its latest AI models cheaper for third-party developers.

Google calls the feature “implicit caching” and says it can deliver 75% savings on “repetitive context” passed to models via the Gemini API. It supports Google’s Gemini 2.5 Pro and 2.5 Flash models.

That could be welcome news for developers as the cost of using frontier models continues to grow.

We just shipped implicit caching in the Gemini API, automatically enabling a 75% cost savings with the Gemini 2.5 models when your request hits a cache.

We also lowered the minimum tokens required to hit the cache to 1K on 2.5 Flash and 2K on 2.5 Pro!

– Logan Kilpatrick (@officiallogank) May 8, 2025

Caching, a widely adopted practice in the AI industry, reduces computing requirements and cost by reusing frequently accessed or pre-computed data from models. For example, a cache can store answers to questions that users often ask of a model, eliminating the need for the model to regenerate answers to the same request.

Google previously offered model prompt caching, but only explicit prompt caching, meaning developers had to define their highest-frequency prompts themselves. While cost savings were supposed to be guaranteed, explicit prompt caching typically involved a lot of manual work.

Some developers were not happy with how Google’s explicit caching implementation worked for Gemini 2.5 Pro. Complaints reached a fever pitch over the past week, prompting the Gemini team to apologize and pledge to make changes.

In contrast to explicit caching, implicit caching is automatic. It is enabled by default for Gemini 2.5 models, and it passes on cost savings whenever a Gemini API request to a model hits the cache.
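As a rough illustration of what a 75% saving on cached tokens could mean for a bill, here is a minimal Python sketch. The per-token price and the function name are hypothetical, chosen only to make the arithmetic concrete; they are not Google's published rates. The sketch also assumes the discount applies only to the cached portion of the input.

```python
def estimate_input_cost(total_tokens: int, cached_tokens: int,
                        price_per_million: float,
                        cache_discount: float = 0.75) -> float:
    """Estimate input cost when part of a prompt is served from cache.

    Assumption: cached tokens are billed at (1 - cache_discount) of the
    normal rate. price_per_million is a hypothetical placeholder price.
    """
    if not 0 <= cached_tokens <= total_tokens:
        raise ValueError("cached_tokens must be between 0 and total_tokens")
    uncached_tokens = total_tokens - cached_tokens
    billable_tokens = uncached_tokens + cached_tokens * (1 - cache_discount)
    return billable_tokens * price_per_million / 1_000_000


# Hypothetical request: 10,000 input tokens at $1.00 per million tokens.
full_price = estimate_input_cost(10_000, 0, 1.00)       # no cache hit
with_cache = estimate_input_cost(10_000, 8_000, 1.00)   # 8,000 tokens cached
```

Under these assumptions the overall bill falls by less than 75%, because only the cached part of the request is discounted.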


“[W]hen you send a request to one of the Gemini 2.5 models, if the request shares a common prefix as one of previous requests, then it’s eligible for a cache hit,” Google explained in a blog post. “We will dynamically pass cost savings back to you.”
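The idea of a shared prefix can be sketched in a few lines of Python. This is only an illustration of the concept, not Google's implementation: the function name is hypothetical, and whitespace splitting stands in for a real tokenizer.

```python
def shared_prefix_length(previous: list[str], current: list[str]) -> int:
    """Return the number of leading tokens the two requests have in common."""
    length = 0
    for prev_token, cur_token in zip(previous, current):
        if prev_token != cur_token:
            break
        length += 1
    return length


# Whitespace splitting is a stand-in for a real tokenizer (an assumption).
earlier = "You are a support bot. Policy text here. Question: refund?".split()
later = "You are a support bot. Policy text here. Question: shipping?".split()
shared = shared_prefix_length(earlier, later)
```

Here the two requests differ only in their final token, so almost the entire earlier request counts as shared prefix.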

According to Google’s developer documentation, the minimum prompt token count for implicit caching is 1,024 for 2.5 Flash and 2,048 for 2.5 Pro. Tokens are the raw bits of data that models work with; 1,000 tokens is equivalent to about 750 words.
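To put those thresholds in perspective, the following Python sketch converts them to approximate word counts using the article's ratio of about 750 words per 1,000 tokens. The dictionary keys and function names are illustrative labels, not official model identifiers or API calls.

```python
# Minimum prompt sizes quoted above; the keys are informal labels,
# not official model identifiers.
MIN_PROMPT_TOKENS = {"2.5 Flash": 1024, "2.5 Pro": 2048}

# Rule of thumb from the article: 1,000 tokens is about 750 words.
WORDS_PER_TOKEN = 0.75


def meets_minimum(model: str, prompt_tokens: int) -> bool:
    """Check whether a prompt is long enough to be considered for caching."""
    return prompt_tokens >= MIN_PROMPT_TOKENS[model]


def approx_words(tokens: int) -> int:
    """Convert a token count to an approximate English word count."""
    return round(tokens * WORDS_PER_TOKEN)


flash_words = approx_words(MIN_PROMPT_TOKENS["2.5 Flash"])
pro_words = approx_words(MIN_PROMPT_TOKENS["2.5 Pro"])
```

By this estimate, the minimums correspond to prompts of roughly 770 words for 2.5 Flash and roughly 1,540 words for 2.5 Pro.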

Given that Google’s last claims of cost savings from caching fell short, there are some buyer-beware areas in this new feature. For one, Google recommends that developers keep repetitive context at the beginning of requests to increase the chances of implicit cache hits. Context that might change from request to request should be appended at the end, the company says.
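In practice, that recommendation amounts to a prompt layout like the one in this Python sketch. The context string, the `build_prompt` helper and the "Question:" label are all hypothetical; the point is only that every request starts with the same text and varies at the end.

```python
# Hypothetical shared context; in a real application this would be the
# long, repeated material such as instructions or reference documents.
STABLE_CONTEXT = (
    "You are a support assistant.\n"
    "Reference manual: (long shared document text)"
)


def build_prompt(stable_context: str, user_question: str) -> str:
    """Put the repeated context first and the per-request text last."""
    return f"{stable_context}\n\nQuestion: {user_question}"


first = build_prompt(STABLE_CONTEXT, "How do I reset my password?")
second = build_prompt(STABLE_CONTEXT, "Where can I download invoices?")
```

Both prompts begin with identical text, which is the property Google says makes a later request eligible for a cache hit. Placing the question first would give each request a different prefix.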

For another, Google did not offer any third-party verification that the new implicit caching system will deliver the promised automatic savings. So we will have to see what early adopters say.


