Google Gemini-1.5-Pro-002 and Gemini-1.5-Flash-002 Released with Faster Output

Google Gemini on Tuesday released newer versions of its 1.5 Pro artificial intelligence (AI) model. These come just a few months after the Mountain View-based tech giant released its last version Geminiextended the context window to 2 million tokens. Dubbed Gemini-1.5-Pro-002 and Gemini-1.5-Flash-002, the company said these AI models will not only offer higher output and lower costs, but will also provide users with a higher rate limit. Additionally, the filter settings have also been updated to ensure that AI models follow instructions more closely.

New Gemini AI Models Released

In one blog postThe company has detailed the Gemini-1.5-Pro-002 and Gemini-1.5-Flash-002 AI models. These models are currently available as experimental model versions and build on the Gemini 1.5 Pro, which was first released at Google I/O in May. These are currently available to the company’s developers and enterprise customers. Developers can access them for free from Google AI Studio and the Gemini API. Enterprises can access them through Vertex AI.

According to internal tests conducted by Google, the latest Gemini 1.5 Pro and Flash models also outperformed the previous-generation Gemini model. The company claims that the new models have achieved a seven percent improvement in the Massive Multitask Language Understanding Pro (MMLU-Pro) benchmark. Furthermore, the AI models are said to have achieved around 20 percent improvement in the MATH and HiddenMath benchmarks when compared to the Gemini 1.5 Pro.

The Gemini-1.5-Pro-002 and Gemini-1.5-Flash-002 AI models will also offer an increased speed limit. The speed limits are daily usage limits for users. With the 1.5 Flash model, users will get 2,000 requests per minute (RPM), and the 1.5 Pro model will offer 1,000 RPM. Google said these limits were increased to allow developers to build with new versions of Gemini.

It’s not just the speed limit that’s getting an upgrade. The company has also increased the throughput tokens per second with Gemini-1.5-Pro-002 and Flash-002, making the models more responsive and faster at producing long blocks of text.

A major upgrade with these AI models is the improvements made to the filters. Google says that the new Gemini AI models will be better able to comply with prompts and follow instructions thanks to these updated filters. Google is also improving its suite of safeguards to ensure that the AI models do not produce anything harmful. Notably, the default filters will not be implemented in the new AI models to allow developers to choose their preferred configuration.