Gemini 1.5 Flash price drop with tuning rollout complete, and more

Logan Kilpatrick, Shrestha Basu Mallick

Gemini 1.5 Flash is now available to developers at more than 70% lower prices. Set up billing for Gemini API in Google AI Studio and access other new features like 1.5 Flash tuning.

Google

•

Logan Kilpatrick, Shrestha Basu Mallick

•3 min read•beginner•

--

•View Original

Gemini

Overview

The article discusses the recent updates to Gemini 1.5 Flash, including a significant price drop and the completion of the tuning rollout. It highlights improvements to the Gemini API and Google AI Studio, expanding language support and enhancing developer documentation.

What You'll Learn

1

How to utilize the Gemini 1.5 Flash model for cost-effective AI applications

2

Why tuning is essential for optimizing model performance in AI development

3

When to leverage expanded language support in the Gemini API for global applications

4

How to access Google AI Studio as a Google Workspace user

Prerequisites & Requirements

Understanding of AI model tuning and API integration
Familiarity with Google AI Studio and Gemini API(optional)

Key Questions Answered

What are the new pricing details for Gemini 1.5 Flash?

As of August 12, 2024, the input price for Gemini 1.5 Flash has decreased by 78% to $0.075 per million tokens, while the output price has decreased by 71% to $0.3 per million tokens for prompts under 128K tokens. This significant reduction aims to make the model more accessible for developers.

How does the Gemini API support multiple languages?

The Gemini API now supports queries in over 100 additional languages, allowing developers to prompt and receive outputs in their preferred languages. This expansion aims to eliminate language-related block finish reasons, enhancing usability for global applications.

What improvements have been made to Google AI Studio?

Recent improvements to Google AI Studio include overhauled keyboard shortcuts, a 50% decrease in loading time, and the addition of prompt suggestions. These enhancements aim to streamline the user experience for developers working with AI models.

What is the significance of the tuning rollout for Gemini 1.5 Flash?

The tuning rollout for Gemini 1.5 Flash allows developers to customize base models, improving performance for specific tasks by providing additional data. This can reduce context size, latency, and costs while increasing accuracy in task execution.

Key Statistics & Figures

Input token cost reduction

78%

Effective August 12, 2024, the input price is reduced to $0.075 per million tokens.

Output token cost reduction

71%

Effective August 12, 2024, the output price is reduced to $0.3 per million tokens.

Language support expansion

100+ additional languages

The Gemini API now supports queries in over 100 languages, enhancing global usability.

Technologies & Tools

Backend

Gemini API

Used for AI model integration and interaction.

Frontend

Google AI Studio

Provides a platform for developers to build and tune AI models.

Key Actionable Insights

1
Take advantage of the significant cost reductions in Gemini 1.5 Flash to build high-volume applications.
With input and output costs drastically lowered, developers can implement more extensive AI solutions without the burden of high operational costs, making it an ideal time to innovate.

2
Utilize the tuning capabilities of Gemini 1.5 Flash to enhance model performance.
By customizing models through tuning, developers can achieve better accuracy and efficiency, particularly for specific tasks, leading to improved user satisfaction and application effectiveness.

3
Leverage the expanded language support in the Gemini API for international projects.
This feature allows developers to create applications that cater to a global audience, reducing barriers and enhancing user engagement across different languages.

Common Pitfalls

1

Neglecting to utilize the tuning capabilities of Gemini 1.5 Flash can lead to suboptimal model performance.

Without tuning, developers may miss out on improving accuracy and efficiency for specific tasks, which can result in less effective applications.

Related Concepts

AI Model Tuning

API Integration

Multi-modal Understanding

Global Application Development