Tier-Based API Rate Limits
Tiers are based on lifetime spending and update automatically. As your usage grows, your tier increases. Or you can move up instantly by purchasing additional credits.| Tiers | Qualifications | RPM (paid model) | RPM (free model) | Output Token Length |
|---|---|---|---|---|
| Tier 0 | Signed up | Adaptive Rate Limits* | Adaptive Rate Limits* | 8K |
| Tier 1 | Total historical spend of $10+ | 100 | 60 | 16K |
| Tier 2 | Total historical spend of $50+ | 1,000 | 1,000 | 16K |
| Tier 3 | Total historical spend of $500+ | 5,000 | 5,000 | 32K |
| Tier 4 | Total historical spend of $5,000+ | 10,000 | 10,000 | 64K |
| Tier 5 | Contact support@friendli.ai | Custom | Custom | Custom |
*Adaptive Rate Limits: Rate limits are applied dynamically based on overall platform conditions.
Billing Methods
Text Models
Text models use a token-based billing method, depending on the model.Token-Based Billing
In a token-based billing model, charges are determined by the number of tokens processed, where each “token” represents an individual unit processed by the model.| Model Code | Price per Token |
|---|---|
| LGAI-EXAONE/K-EXAONE-236B-A23B | Input $0.2 · Cached Input $0.1 · Output $0.8 / 1M tokens |
| MiniMaxAI/MiniMax-M2.5 | Input $0.3 · Cached Input $0.06 · Output $1.2 / 1M tokens |
| MiniMaxAI/MiniMax-M2.1 | Input $0.3 · Cached Input $0.15 · Output $1.2 / 1M tokens |
| zai-org/GLM-5 | Input $1 · Cached Input $0.5 · Output $3.2 / 1M tokens |
| zai-org/GLM-4.7 | Input $0.6 · Output $2.2 / 1M tokens |
| meta-llama/Llama-3.3-70B-Instruct | $0.6 / 1M tokens |
| meta-llama/Llama-3.1-8B-Instruct | $0.1 / 1M tokens |
| Qwen/Qwen3-235B-A22B-Instruct-2507 | Input $0.2 · Output $0.8 / 1M tokens |
| Qwen/Qwen3-30B-A3B | Input $0.15 · Output $0.6 / 1M tokens |
| deepseek-ai/DeepSeek-V3.2 | Input $0.5 · Cached Input $0.25 · Output $1.5 / 1M tokens |
| deepseek-ai/DeepSeek-V3.1 | Input $0.5 · Cached Input $0.25 · Output $1.5 / 1M tokens |
Audio Models
Audio models are charged based on the duration of processed audio. Charges are calculated per second and aggregated into a per-minute rate for clarity.| Model Code | Price per Audio Minute |
|---|---|
| openai/whisper-large-v3 | $0.0015 / audio minute |
FAQs
How do I increase my rate limits?
How do I increase my rate limits?
Your usage tier, which determines your rate limits, increases monthly based on your proof-of-payment. Need a faster upgrade? Reach out anytime at support@friendli.ai — we’re happy to help!
Do I need to upgrade my plan to use popular models?
Do I need to upgrade my plan to use popular models?
Popular models are available to all users, depending on the limits determined by their usage tiers.
What if I exceed my monthly cap?
What if I exceed my monthly cap?
You’ll receive an alert when approaching your monthly cap. Please contact support@friendli.ai to discuss options for increasing your monthly cap. We may help you (1) pay early to reset your monthly cap, or (2) upgrade your plan to increase your monthly cap and unlock more features.
For more questions, contact support@friendli.ai.