/ratelimit command (alias /limits) shows the current state of your LLM provider’s rate limits, without making any extra call: ChatCLI reads the x-ratelimit-* headers the provider already returns on every response and keeps a per-provider snapshot.
Capture is passive and central: an observer on the auth path intercepts the HTTP responses of all providers that send the
x-ratelimit-* header family (OpenAI, OpenRouter, Anthropic-compatible endpoints, etc.). No environment variable is required.Usage
What is shown
For each provider that reported limits, ChatCLI shows up to two buckets:| Bucket | Fields |
|---|---|
requests | remaining / limit, % used, resets in N s |
tokens | remaining / limit, % used, resets in N s |
See also
- Provider Fallback — automatic switch when a provider fails/saturates
- Session Cost —
/cost - Command Reference