Skip to main content

Gemini 3.0 Flash

Approved Data Classifications

Description

Gemini 3 Flash is Google's balanced 3-series model built for speed and scale, offering Pro-level intelligence at Flash speed and pricing. It supports multimodal inputs and a 1,048,576-token input window with up to 65,536 output tokens, and the Gemini 3 series is currently in preview.

Capabilities

ModelTraining DataInputOutputContext LengthCost (per 1 million tokens)
gemini-3-flash-previewJanuary 2025Text, Image, Video, Audio, PDFText1,048,576 (in) / 65,536 (out)$0.50/1M input (text/image/video)
$1.00/1M input (audio)
$3.00/1M output
info
  • 1M represents 1 Million Tokens
  • All prices listed are based on 1 Million Tokens

Availability

Cloud Provider

Usage

curl -X POST https://api.ai.it.ufl.edu/v1/chat/completions \
-H "Content-Type: application/json" \
-H "Authorization: Bearer <API_TOKEN>" \
-d '{
"model": "gemini-3-flash-preview",
"messages": [
{
"role": "system",
"content": "You are a helpful assistant."
},
{
"role": "user",
"content": "Write a haiku about an Alligator."
}
]
}'

When to Use

  • Low-latency, high-throughput tasks
  • Cost-efficient multimodal analysis
  • Agentic workflows at scale
  • Large-context summarization and extraction
  • Real-time chat and support experiences

References

  1. Gemini 3 Developer Guide
    https://ai.google.dev/gemini-api/docs/gemini-3
  2. Gemini Models (Gemini API)
    https://ai.google.dev/gemini-api/docs/models
  3. Gemini API Pricing
    https://ai.google.dev/gemini-api/docs/pricing