Refreshed daily
AI Router Traffic Statistics
A live picture of what actually flows through our gateway: token volume, model mix, provider market share, latency percentiles, and workload categories.
Tokens processed
105.87B
+71.5% vs previous period
API requests
20.8M
+66.9% vs previous period
Active models
15
Active providers
14
Daily token volume
Aggregate across all models and providers
Provider market share
Share of total token traffic by provider
- anthropic: 31.6%
- google: 26.4%
- openai: 22.7%
- deepseek: 7.9%
- mistral: 4.4%
- cerebras: 2.8%
- xai: 2.1%
- moonshot: 2.0%
Workload categories
How our customers actually use the gateway
- Programming: 65.6% · 14.84B
- General: 24.4% · 5.52B
- Reasoning: 6.2% · 1.41B
- Agents / Tool use: 3.8% · 865.69M
Top models
Ranked by token volume in the selected window
| # | Model | Provider | Requests | Tokens | Share |
|---|---|---|---|---|---|
| 1 | Claude Sonnet 4.6 | anthropic | 630K | 4.74B | 23.7% |
| 2 | Gemini 3 Pro | google | 374K | 3.08B | 15.4% |
| 3 | GPT-5.4 | openai | 497K | 2.24B | 11.2% |
| 4 | Gemini 3 Flash | google | 319K | 1.99B | 10.0% |
| 5 | GPT-5.5 | openai | 310K | 1.46B | 7.3% |
| 6 | Claude Opus 4.7 | anthropic | 245K | 1.45B | 7.3% |
| 7 | DeepSeek V3.2 | deepseek | 230K | 1.06B | 5.3% |
| 8 | DeepSeek R1.1 | deepseek | 115K | 612.02M | 3.1% |
| 9 | Mistral Large 3 | mistral | 102K | 553.37M | 2.8% |
| 10 | o4-mini | openai | 109K | 521.48M | 2.6% |
| 11 | Gemini 3 Flash-Lite | google | 167K | 512.80M | 2.6% |
| 12 | Claude Haiku 4.5 | anthropic | 128K | 507.36M | 2.5% |
| 13 | Kimi K2 | moonshot | 66K | 423.43M | 2.1% |
| 14 | Llama 4 Scout (Cerebras) | cerebras | 128K | 421.75M | 2.1% |
| 15 | GPT-5.4 Mini | openai | 152K | 391.27M | 2.0% |
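The Share column appears to be each model's token volume divided by the summed volume of the listed models. The sketch below recomputes it from the Tokens column under that assumption. The helper names `parse_count` and `token_shares` are illustrative and not part of any gateway API, and the gateway's actual calculation may differ.

```python
# Token totals copied from the "Top models" table.
TOP_MODEL_TOKENS = {
    "Claude Sonnet 4.6": "4.74B",
    "Gemini 3 Pro": "3.08B",
    "GPT-5.4": "2.24B",
    "Gemini 3 Flash": "1.99B",
    "GPT-5.5": "1.46B",
    "Claude Opus 4.7": "1.45B",
    "DeepSeek V3.2": "1.06B",
    "DeepSeek R1.1": "612.02M",
    "Mistral Large 3": "553.37M",
    "o4-mini": "521.48M",
    "Gemini 3 Flash-Lite": "512.80M",
    "Claude Haiku 4.5": "507.36M",
    "Kimi K2": "423.43M",
    "Llama 4 Scout (Cerebras)": "421.75M",
    "GPT-5.4 Mini": "391.27M",
}

# Multipliers for the abbreviated figures used on this page.
SUFFIXES = {"K": 1e3, "M": 1e6, "B": 1e9}


def parse_count(text: str) -> float:
    """Convert an abbreviated figure such as '612.02M' into a plain number."""
    text = text.strip()
    if text and text[-1] in SUFFIXES:
        return float(text[:-1]) * SUFFIXES[text[-1]]
    return float(text)


def token_shares(tokens_by_model: dict[str, str]) -> dict[str, float]:
    """Return each model's percentage of the summed token volume.

    Assumption: share = model tokens / sum of tokens over the listed models.
    """
    counts = {model: parse_count(t) for model, t in tokens_by_model.items()}
    total = sum(counts.values())
    return {model: 100.0 * count / total for model, count in counts.items()}


if __name__ == "__main__":
    for model, share in token_shares(TOP_MODEL_TOKENS).items():
        print(f"{model}: {share:.1f}%")
```

Because the displayed token figures are already rounded, recomputed shares can differ from the published column in the last digit.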
Provider response times
Time-to-first-token (ms) percentiles
| Provider | p50 | p90 | p99 | Tokens/sec |
|---|---|---|---|---|
| cerebras | 120 | 220 | 381 | 1730 |
| groq | 177 | 315 | 551 | 562 |
| deepinfra | 406 | 772 | 1486 | 157 |
| mistral | 410 | 762 | 1485 | 95 |
| fireworks | 414 | 784 | 1504 | 181 |
| google | 459 | 869 | 1641 | 139 |
| together | 467 | 871 | 1657 | 164 |
| openai | 501 | 925 | 1755 | 102 |
| cohere | 560 | 1016 | 1888 | 102 |
| xai | 583 | 1125 | 2127 | 90 |
| deepseek | 656 | 1235 | 2238 | 62 |
| anthropic | 799 | 1455 | 2706 | 80 |
| moonshot | 916 | 1583 | 2541 | 61 |
| perplexity | 933 | 1699 | 2958 | 65 |
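For readers unfamiliar with latency percentiles, here is a minimal sketch of how p50, p90, and p99 could be derived from raw time-to-first-token samples. It uses the nearest-rank method, which is an assumption: the page does not say which percentile definition or aggregation window the gateway uses. The function names are hypothetical.

```python
def nearest_rank_percentile(samples, pct):
    """Return the pct-th percentile (integer 1..100) using the nearest-rank method."""
    if not samples:
        raise ValueError("samples must not be empty")
    if not 1 <= pct <= 100:
        raise ValueError("pct must be an integer between 1 and 100")
    ordered = sorted(samples)
    # Integer ceiling of pct * n / 100, avoiding floating-point rounding.
    rank = (pct * len(ordered) + 99) // 100
    return ordered[rank - 1]


def summarize_ttft(samples_ms):
    """Summarize time-to-first-token samples (milliseconds) as p50, p90, p99."""
    return {f"p{p}": nearest_rank_percentile(samples_ms, p) for p in (50, 90, 99)}


if __name__ == "__main__":
    # Synthetic samples for illustration only, not gateway data.
    synthetic = list(range(1, 101))
    print(summarize_ttft(synthetic))
```

A p99 of 381 ms, for example, would mean that 99% of sampled requests received their first token within 381 ms. Other percentile definitions, such as linear interpolation, give slightly different values on small samples.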
All values are aggregated across the gateway. Individual request content is never exposed. Refreshed every 30 minutes.