Skip to main content

Gemini 2.0 Flash

Approved Data Classifications

Description

Gemini 2.0 Flash is a highly efficient multimodal AI model developed by Google DeepMind, designed to excel in automated tasks and low latency scenarios. Launched in December 2024, this model features a context window of up to 1 million tokens, enabling it to process extensive inputs such as long documents, images, and videos effectively. Gemini 2.0 Flash is optimized for power agentic experiences making it ideal for applications that require rapid responses, such as summarization, data extraction, and real-time chat interactions.

Capabilities

ModelTraining DataInputOutputContext LengthCost (per 1 million tokens)
gemini-2.0-flashJune 2024Image, Text, Audio, VideoText1,000,000$.10/1M input
$0.70/1M output
info
  • 1M represents 1 Million Tokens
  • All prices listed are based on 1 Million Tokens

Availability

Cloud Provider

Usage

curl -X POST https://api.ai.it.ufl.edu/v1/chat/completions \
-H "Content-Type: application/json" \
-H "Authorization: Bearer <API_TOKEN>" \
-d '{
"model": "gemini-2.0-flash",
"messages": [
{
"role": "system",
"content": "You are a helpful assistant."
},
{
"role": "user",
"content": "Write a haiku about an Alligator."
}
]
}'