Skip to main content

Kokoro

Approved Data Classifications

Description

Kokoro is an open-weight TTS model with 82 million parameters. Despite its lightweight architecture, it delivers comparable quality to larger models while being significantly faster and more cost-efficient. With Apache-licensed weights, Kokoro can be deployed anywhere from production environments to personal projects.

For a list of voices and languages available for this model, please see: VOICES

The above was sourced from the following model card on hugging face: https://huggingface.co/hexgrad/Kokoro-82M

Capabilities

ModelTraining DataInputOutputContext LengthCost (per minute of audio)
kokoroJan 2025TextAudion/a$0.006/minute
info
  • Pricing is based on one minute of audio
  • All prices listed are based on 1 minute of audio

Availability

Cloud Provider

Usage

curl https://api.ai.it.ufl.edu/v1/audio/speech \
-H "Authorization: Bearer <API_TOKEN>" \
-H 'Content-Type: application/json' \
-d "{ \
"model": "kokoro", \
"input": "I am an AI assistant here to help", \
"voice": "af_heart", \
"response_format": "mp3", \
"speed": 1.0
}" \
--output output.mp3