Gemini 2.5 Flash

Approved Data Classifications

Description

Gemini 2.5 Flash is an efficient multimodal AI model developed by Google, designed for automated tasks and low-latency scenarios. Released on March 20, 2025, it has a context window of over 1 million tokens and is the first Google Flash model with thinking capabilities. The large context window lets the model process very large inputs and work on complex problems that draw on different information sources, including text, audio, images, video, and entire code repositories.

Capabilities

| Model | Release Date | Input | Output | Context Length | Cost (per 1 million tokens) |
|---|---|---|---|---|---|
| gemini-2.5-flash | Mar 20 2025 | Text, Image, Audio, Video, PDF | Text | 1,048,576 | $2.50/1M input, $1.25/1M output |
  • 1M represents 1 Million Tokens
  • All prices listed are based on 1 Million Tokens
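
Because prices are quoted per 1 million tokens, the cost of a single request is a simple proportion of its input and output token counts. The following is a minimal sketch of that arithmetic, using the prices listed above; the function name `estimate_cost` is a hypothetical helper and not part of any API.

```python
# Prices copied from the table above (USD per 1 million tokens).
INPUT_PRICE_PER_M = 2.50
OUTPUT_PRICE_PER_M = 1.25
TOKENS_PER_M = 1_000_000


def estimate_cost(input_tokens: int, output_tokens: int) -> float:
    """Return the estimated USD cost of one request (hypothetical helper)."""
    input_cost = input_tokens / TOKENS_PER_M * INPUT_PRICE_PER_M
    output_cost = output_tokens / TOKENS_PER_M * OUTPUT_PRICE_PER_M
    return input_cost + output_cost


if __name__ == "__main__":
    # Example: a 200,000-token prompt with a 4,000-token reply.
    print(f"${estimate_cost(200_000, 4_000):.4f}")  # prints $0.5050
```

The estimate covers only the listed per-token prices; any other billing rules are outside what this page states.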

Availability

Cloud Provider

Usage

curl -X POST https://api.ai.it.ufl.edu/v1/chat/completions \
  -H "Content-Type: application/json" \
  -H "Authorization: Bearer <API_TOKEN>" \
  -d '{
    "model": "gemini-2.5-flash",
    "messages": [
      {
        "role": "system",
        "content": "You are a helpful assistant."
      },
      {
        "role": "user",
        "content": "Write a haiku about an Alligator."
      }
    ]
  }'
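
The same request can be issued from Python. The sketch below mirrors the curl command using only the standard library; the helper names `build_request` and `send` are hypothetical, and the shape of the response body is not described on this page, so the code returns the parsed JSON without interpreting it.

```python
import json
import urllib.request

API_URL = "https://api.ai.it.ufl.edu/v1/chat/completions"


def build_request(prompt: str, api_token: str) -> urllib.request.Request:
    """Build the same POST request as the curl command (hypothetical helper)."""
    payload = {
        "model": "gemini-2.5-flash",
        "messages": [
            {"role": "system", "content": "You are a helpful assistant."},
            {"role": "user", "content": prompt},
        ],
    }
    return urllib.request.Request(
        API_URL,
        data=json.dumps(payload).encode("utf-8"),
        headers={
            "Content-Type": "application/json",
            "Authorization": f"Bearer {api_token}",
        },
        method="POST",
    )


def send(request: urllib.request.Request) -> dict:
    """Send the request and return the parsed JSON body."""
    with urllib.request.urlopen(request, timeout=60) as response:
        return json.load(response)
```

A call would look like `send(build_request("Write a haiku about an Alligator.", token))`, where `token` holds your API token. Keeping request construction separate from sending makes it possible to inspect the payload without touching the network.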

When to Use

  • Premium reasoning and coding capabilities
  • Handling extremely long contexts
  • Best-in-class coding model
  • Autonomous, multi-step workflows
  • Multimodal input support
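
As an illustration of multimodal input, the sketch below builds a user message that pairs text with an image. It assumes the endpoint accepts the OpenAI-style Chat Completions content-part format (`text` and `image_url` parts with a base64 data URL); this page does not confirm that, so treat the message shape as an assumption to verify. The helper name `image_message` is hypothetical.

```python
import base64


def image_message(prompt: str, image_bytes: bytes, mime_type: str = "image/png") -> dict:
    """Build a user message with text plus an inline image (hypothetical helper).

    Assumption: the gateway accepts OpenAI-style content parts.
    """
    encoded = base64.b64encode(image_bytes).decode("ascii")
    return {
        "role": "user",
        "content": [
            {"type": "text", "text": prompt},
            {
                "type": "image_url",
                "image_url": {"url": f"data:{mime_type};base64,{encoded}"},
            },
        ],
    }


def build_payload(prompt: str, image_bytes: bytes) -> dict:
    """Wrap the message in a request body for gemini-2.5-flash."""
    return {
        "model": "gemini-2.5-flash",
        "messages": [image_message(prompt, image_bytes)],
    }
```

The resulting dictionary would be serialized as the JSON body of the same POST request shown under Usage.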

References

  1. Google Gemini 2.5 Flash Report Card
    https://cloud.google.com/vertex-ai/generative-ai/docs/models/gemini/2-5-flash
  2. Google Gemini 2.5 Report
    https://storage.googleapis.com/deepmind-media/gemini/gemini_v2_5_report.pdf