Skip to main content
Model APIs are often more economical, with access to a wide range of models. Pricing varies by model type—text models are charged by processed tokens or compute time of your request, while audio models are charged by the duration of processed audio.

Tier-based API rate limits

Tiers are based on lifetime spending and update automatically. As your usage grows, your tier increases. Or you can move up instantly by purchasing additional credits.
Adaptive Rate Limits: Rate limits are applied dynamically based on overall platform conditions.
‘Output Token Length’ is how much the model can write in response. It’s different from ‘Context Length’, which is the sum of the input and output tokens.

Billing methods

Text models

Text models use a token-based billing method, depending on the model.

Token-based billing

In a token-based billing model, charges are determined by the number of tokens processed, where each “token” represents an individual unit processed by the model.

Audio models

Audio models are charged based on the duration of processed audio. Charges are calculated per second and aggregated into a per-minute rate for clarity.

FAQs

Your usage tier, which determines your rate limits, increases monthly based on your proof-of-payment. Need a faster upgrade? Reach out anytime at support@friendli.ai — we’re happy to help!
You’ll receive an alert when approaching your monthly cap. Please contact support@friendli.ai to discuss options for increasing your monthly cap. We may help you (1) pay early to reset your monthly cap, or (2) upgrade your plan to increase your monthly cap and unlock more features.
For more questions, contact support@friendli.ai.
Last modified on June 18, 2026