Google is releasing a new artificial intelligence model designed to deliver high performance with a focus on efficiency.
The model, Gemini 2.5 Flash, will soon be launched in Vertex AI, Google’s artificial intelligence development platform. The company claims that it offers “dynamic and manageable” computing, allowing developers to adjust processing times depending on the complexity of queries.
“[You can] customize the speed, accuracy, and cost balance to meet your specific needs,” Google writes in a blog post provided to TechCrunch. “This flexibility is key to optimizing flash performance in high-volume, cost-sensitive applications.”
Gemini 2.5 Flash comes at a time when the cost of flagship AI models continues to rise. Lower-cost, higher-performance models such as 2.5 Flash are an attractive alternative to expensive top-of-the-line options at the expense of some accuracy.
Gemini 2.5 Flash is a “reasoning” model, similar to OpenAI’s o3-mini and DeepSeek’s R1. This means that it takes a little longer to answer questions and validate itself.
Google claims that 2.5 Flash is ideal for “high-volume” and “real-time” applications such as customer service and document parsing.
“This workhorse is optimized specifically for low latency and low cost,” Google said in a blog post. “It’s the perfect engine for responsive virtual assistants and real-time summarization tools where efficiency at scale is key.”
Google has not published a security report or technical report for Gemini 2.5 Flash, making it difficult to understand what the model has advantages and disadvantages. The company previously told TechCrunch that it does not publish reports for models that it considers “experimental.”
On Wednesday, Google also announced that it plans to bring Gemini models, such as 2.5 Flash, to the on-premises environment starting in the third quarter. Gemini models will be available in Google Distributed Cloud (GDC), Google’s on-premises solution for customers with strict data management requirements. Google says it is working with Nvidia to bring Gemini models to GDC-compatible Nvidia Blackwell systems, which customers can purchase through Google or other channels.