Gemini 1.5 Flash

Approved Data Classifications

Description

Gemini 1.5 Flash is a lightweight multimodal AI model developed by Google DeepMind for high-volume, low-latency tasks. Launched in May 2024, it has a context window of up to 1 million tokens, which lets it process long inputs such as lengthy documents, images, and videos in a single request. The model is optimized for speed and cost-efficiency, making it a good fit for applications that need rapid responses, such as summarization, data extraction, and real-time chat. It was produced by distillation from its larger counterpart, Gemini 1.5 Pro, so it retains much of that model's multimodal reasoning quality at a lower cost, which suits developers deploying AI at scale.

Capabilities

Model | Training Data | Input | Output | Context Length | Cost (per 1 million tokens)
gemini-1.5-flash | February 2024 | Image, Text | Text | 1,000,000 | $0.15/1M input, $0.60/1M output
  • 1M represents 1 Million Tokens
  • All prices listed are based on 1 Million Tokens
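
Since pricing is quoted per 1 million tokens, the cost of a single request follows from simple proportional arithmetic. The sketch below assumes the input and output rates listed in the table above; the function name and the token counts in the example are hypothetical and chosen only for illustration.

```python
# Rates taken from the pricing table above (USD per 1 million tokens).
INPUT_USD_PER_MILLION = 0.15
OUTPUT_USD_PER_MILLION = 0.60


def estimate_cost_usd(input_tokens: int, output_tokens: int) -> float:
    """Estimate the cost of one request from its token counts.

    Hypothetical helper: it only applies the listed per-million rates
    and ignores any discounts or tiering the provider may apply.
    """
    total = (
        input_tokens * INPUT_USD_PER_MILLION
        + output_tokens * OUTPUT_USD_PER_MILLION
    )
    return total / 1_000_000


# Example: a 200,000-token document summarized into 5,000 tokens.
print(f"${estimate_cost_usd(200_000, 5_000):.4f}")  # prints $0.0330
```

Actual billing may differ from this estimate, so treat it as a rough planning figure rather than an invoice calculation.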

Availability

Cloud Provider

Usage

curl -X POST https://api.ai.it.ufl.edu/v1/chat/completions \
  -H "Content-Type: application/json" \
  -H "Authorization: Bearer <API_TOKEN>" \
  -d '{
    "model": "gemini-1.5-flash",
    "messages": [
      {
        "role": "system",
        "content": "You are a helpful assistant."
      },
      {
        "role": "user",
        "content": "Write a haiku about an Alligator."
      }
    ]
  }'
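
The same request can be issued from Python. The following is a minimal sketch using only the standard library; it mirrors the URL, headers, and JSON body of the curl command above. The helper names `build_request` and `send_chat` are hypothetical, and because the response schema is not documented here, the sketch simply returns the decoded JSON without assuming any particular fields.

```python
import json
import urllib.request

# Endpoint and model name copied from the curl example above.
API_URL = "https://api.ai.it.ufl.edu/v1/chat/completions"
MODEL = "gemini-1.5-flash"


def build_request(prompt: str, api_token: str) -> urllib.request.Request:
    """Build (but do not send) the POST request for a single user prompt."""
    payload = {
        "model": MODEL,
        "messages": [
            {"role": "system", "content": "You are a helpful assistant."},
            {"role": "user", "content": prompt},
        ],
    }
    return urllib.request.Request(
        API_URL,
        data=json.dumps(payload).encode("utf-8"),
        headers={
            "Content-Type": "application/json",
            "Authorization": f"Bearer {api_token}",
        },
        method="POST",
    )


def send_chat(prompt: str, api_token: str, timeout: float = 30.0) -> dict:
    """Send the request and return the decoded JSON response as a dict.

    The shape of the response is an assumption left to the caller;
    HTTP errors surface as urllib.error.HTTPError.
    """
    request = build_request(prompt, api_token)
    with urllib.request.urlopen(request, timeout=timeout) as response:
        return json.load(response)
```

Calling `send_chat("Write a haiku about an Alligator.", token)` with a valid token should return the service's JSON reply. Keeping request construction separate from sending makes it possible to inspect or log the payload without making a network call.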