Skip to main content

Llama 3.1 8B

Approved Data Classifications

Description

Llama-3.1-8B is a compact yet powerful language model developed by Meta, designed to deliver efficient performance in multilingual dialogue applications. With 8 billion parameters, this model is optimized for scenarios where computational resources are limited, making it ideal for startups and small businesses seeking to integrate AI capabilities without incurring significant costs. It features a context length of up to 128,000 tokens, allowing it to process extensive text inputs while maintaining coherence and relevance in its outputs. Llama-3.1-8B excels in tasks such as content generation, summarization, and natural language understanding, leveraging advanced training techniques like supervised fine-tuning and reinforcement learning with human feedback to ensure high-quality responses. Its adaptability and efficiency make it a versatile tool for developers looking to implement AI solutions across various industries while balancing performance with resource constraints.

Capabilities

ModelTraining DataInputOutputContext LengthCost (per 1 million tokens)
llama-3.1-8b-instructJuly 2024TextText128,000$0.22/1M input
$0.22/1M output
info
  • 1M represents 1 Million Tokens
  • All prices listed are based on 1 Million Tokens

Availability

Cloud Provider

Usage

curl -X POST https://api.ai.it.ufl.edu/v1/chat/completions \
-H "Content-Type: application/json" \
-H "Authorization: Bearer <API_TOKEN>" \
-d '{
"model": "llama-3.1-8b-instruct",
"messages": [
{
"role": "system",
"content": "You are a helpful assistant."
},
{
"role": "user",
"content": "Write a haiku about an Alligator."
}
]
}'