Skip to main content

Gemini 1.5 Flash

Approved Data Classifications

Description

Gemini 1.5 Flash is Google's lower-latency multimodal model for high-volume workloads. It was released on May 14, 2024 and supports a 1 million-token context window.

Capabilities

ModelRelease DateInputOutputContext LengthCost (per 1 million tokens)
gemini-1.5-flashMay 14 2024Text, Image, Audio, VideoText1,000,000$.15/1M input
$0.60/1M output
info
  • 1M represents 1 Million Tokens
  • All prices listed are based on 1 Million Tokens

Availability

Cloud Provider

Usage

curl -X POST https://api.ai.it.ufl.edu/v1/chat/completions \
-H "Content-Type: application/json" \
-H "Authorization: Bearer <API_TOKEN>" \
-d '{
"model": "gemini-1.5-flash",
"messages": [
{
"role": "system",
"content": "You are a helpful assistant."
},
{
"role": "user",
"content": "Write a haiku about an Alligator."
}
]
}'

References

  1. Google
    https://ai.google/
  2. LLM Stats
    https://llm-stats.com
  3. Artificial Analysis
    https://artificialanalysis.ai