Perplexity
Sonar
Default
Search-grounded model with real-time reasoning and built-in web search
Intelligence
Speed
Price
$1.00 • $5.00
Input
Output

Sonar is search-grounded model with real-time reasoning and built-in web search. Learn more in our Sonar usage guide.

128,000 context window
8,192 max output tokens
Aug 2025 knowledge cutoff
Pricing
Pricing is based on the number of tokens used. For tool-specific models, like search and computer use, there's a fee per tool call. See details in the pricing page.
Text tokens
Per 1M tokens
Batch API price
Input
$1.00
Cached input
$0.25
Output
$5.00
Quick comparison
Input
Cached input
Output
Sonar
$1.00
Modalities
Text
Input and output
Image
Not supported
Audio
Not supported
Endpoints
Chat Completions
v1/chat/completions
Responses
v1/responses
Batch
v1/batch
Realtime
v1/realtime
Assistants
v1/assistants
Fine-tuning
v1/fine-tuning
Embeddings
v1/embeddings
Image Generation
v1/images/generations
Image Edit
v1/images/edits
Speech Generation
v1/audio/speech
Transcription
v1/audio/transcriptions
Translation
v1/translations
Moderation
v1/moderations
Completions (legacy)
v1/completions
Features
Streaming
Supported
Function calling
Supported
Structured outputs
Not supported
Fine-tuning
Not supported
Distillation
Not supported
Fast response
Not supported
Cost efficient
Not supported
Tools
Tools supported by this model when using the Responses API.
Web search
Supported
Code interpreter
Not supported
File search
Not supported
Image generation
Not supported
MCP
Not supported
Snapshots
Snapshots let you lock in a specific version of the model so that performance and behavior remain consistent. Below is a list of all available snapshots and aliases for Sonar.
Perplexity
sonar
Sonar
Perplexity
sonar-large
Sonar Large
Rate limits
Rate limits ensure fair and reliable access to the API by placing specific caps on requests or tokens used within a given time period. Your usage tier determines how high these limits are set and automatically increases as you send more requests and spend more on the API.
TierRPMTPMBatch queue limit
Free100100,000N/A
Tier 110002,000,000
Tier 220004,000,000
Tier 3500010,000,000
Tier 41000020,000,000
Tier 52000040,000,000