Each data point is one 5-hour Anthropic window — exposing how quota caps
vary throughout the day. The implied cap = tokens burned ÷ utilization.
A sudden jump = Anthropic changed your limit.
Generated: 2026-05-07 05:25 UTC
Current Implied Cap
5-Hour Window: ~180.45M (350.1K burned, 1.0% utilized)
7-Day (Anthropic): 1496.96M (688.60M burned, 46.0% utilized)
7-Day (Sonnet): 1786.51M (500.22M burned, 28.0% utilized)
What is the implied cap?
Anthropic publishes utilization percentages but not the underlying token limits.
By dividing tokens burned by the utilization fraction, we back-calculate
the effective quota cap for each rate-limit window.
A sudden jump in the chart below = Anthropic changed your limit.
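As a minimal sketch of that back-calculation (the function name `implied_cap` is illustrative, not something the dashboard exposes), the division looks like this in Python. The inputs are the rounded 7-day figures from the snapshot on this page, in millions of tokens, so the results match the displayed caps only to within rounding.

```python
def implied_cap(tokens_burned: float, utilization_pct: float) -> float:
    """Back-calculate a quota cap from tokens burned and a utilization percentage."""
    if utilization_pct <= 0:
        # With zero utilization the division is undefined, so no cap can be implied.
        raise ValueError("utilization must be positive to imply a cap")
    return tokens_burned / (utilization_pct / 100.0)


# 7-day figures from the snapshot, in millions of tokens.
anthropic_7d = implied_cap(688.60, 46.0)  # close to the displayed 1496.96M
sonnet_7d = implied_cap(500.22, 28.0)     # close to the displayed 1786.51M
print(f"{anthropic_7d:.2f}M, {sonnet_7d:.2f}M")
```

At very low utilization the estimate is sensitive to rounding in the percentage, which is one reason to treat a single low-utilization reading with caution.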
Reading this chart:
Multiple data points per calendar day = multiple 5h windows.
A drop between the 10:00 UTC window and the 15:00 UTC window means Anthropic
lowered the cap mid-day. Night windows (20:00+ UTC) often show higher caps
than business-hour windows (14:00–20:00 UTC).
5h series (orange) on the right axis; 7-day series on the left.
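One way to check the business-hours versus night pattern on your own data is to bucket each 5h window by its UTC start hour and compare median caps. This is a sketch under assumptions: the bucket boundaries come from the hour ranges quoted above, while the function names and the sample values are hypothetical and not taken from the chart.

```python
from datetime import datetime, timezone
from statistics import median


def bucket_for(window_start: datetime) -> str:
    """Classify a window by its UTC start hour, using the ranges quoted above."""
    hour = window_start.astimezone(timezone.utc).hour
    if 14 <= hour < 20:
        return "business"
    if hour >= 20:
        return "night"
    return "other"


def median_cap_by_bucket(windows):
    """Group (start, implied_cap) pairs by bucket and return each bucket's median cap."""
    grouped = {}
    for start, cap in windows:
        grouped.setdefault(bucket_for(start), []).append(cap)
    return {bucket: median(caps) for bucket, caps in grouped.items()}


# Hypothetical sample data (caps in millions of tokens), not read from the page.
sample = [
    (datetime(2026, 5, 5, 15, 0, tzinfo=timezone.utc), 150.0),
    (datetime(2026, 5, 5, 20, 0, tzinfo=timezone.utc), 190.0),
    (datetime(2026, 5, 6, 10, 0, tzinfo=timezone.utc), 170.0),
    (datetime(2026, 5, 6, 15, 0, tzinfo=timezone.utc), 160.0),
    (datetime(2026, 5, 6, 20, 0, tzinfo=timezone.utc), 200.0),
]
summary = median_cap_by_bucket(sample)
print(summary)
```

The median is used rather than the mean so that a single outlier window does not dominate a bucket.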
Implied Token Cap — 5h Windows (30 Days)
Each data point is one 5-hour Anthropic window. Reveals intraday variation: business hours vs. evening vs. night caps.
Token Burn per 5h Window (30 Days)
Tokens burned within each 5-hour Anthropic window. A tall bar = heavy burn; when Anthropic adjusts the cap, burn patterns shift visibly.
Utilization
5-Hour Window: 1.0% (350.1K burned / ~180.45M cap, as of 2026-05-07 05:24 UTC)
7-Day (Anthropic): 46.0% (688.60M burned / ~1496.96M cap, as of 2026-05-07 05:24 UTC)
7-Day (Sonnet): 28.0% (500.22M burned / ~1786.51M cap, as of 2026-05-07 05:24 UTC)
Utilization % — 5h Windows (30 Days)
How much of the quota is being consumed per 5-hour window.
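Raw Snapshot
| Window            | Utilized | Burned  | Implied Cap | As Of                |
|-------------------|----------|---------|-------------|----------------------|
| 5-Hour Window     | 1.0%     | 350.1K  | ~180.45M    | 2026-05-07 05:24 UTC |
| 7-Day (Anthropic) | 46.0%    | 688.60M | 1496.96M    | 2026-05-07 05:24 UTC |
| 7-Day (Sonnet)    | 28.0%    | 500.22M | 1786.51M    | 2026-05-07 05:24 UTC |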
How This Is Calculated
Anthropic doesn't publish rate limits directly. Every API response includes two values:
tokens consumed in the current window and what percentage of the limit that represents.
Dividing the tokens consumed by that percentage, expressed as a fraction, gives the implied cap,
a real-time estimate of the actual limit.
A step-change in the chart means the underlying limit changed: a sudden jump indicates an increase,
a drop indicates a reduction or a change in model mix.
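A step-change can also be flagged programmatically by comparing each window's implied cap with the previous one. The sketch below is one simple approach, not the dashboard's own method: the function name, the 15% threshold, and the sample series are all assumptions chosen for illustration.

```python
def find_step_changes(caps, threshold=0.15):
    """Return (index, previous, current, relative_change) for each consecutive
    pair of implied caps whose relative difference exceeds the threshold."""
    changes = []
    for i in range(1, len(caps)):
        prev, curr = caps[i - 1], caps[i]
        if prev <= 0:
            continue  # cannot compute a relative change from a non-positive cap
        rel = (curr - prev) / prev
        if abs(rel) > threshold:
            changes.append((i, prev, curr, rel))
    return changes


# Hypothetical implied caps for consecutive 5h windows, in millions of tokens.
series = [180.0, 181.0, 179.5, 240.0, 239.0, 150.0]
for index, prev, curr, rel in find_step_changes(series):
    direction = "increase" if rel > 0 else "reduction"
    print(f"window {index}: {prev:.1f}M -> {curr:.1f}M ({rel:+.0%}, {direction})")
```

A threshold that is too low will flag ordinary noise in the estimate, so it is worth tuning against a period where the limit is believed to have been stable.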
Three windows are tracked independently: a 5-hour rolling window, a 7-day window for Anthropic models,
and a 7-day window for Sonnet specifically — each resets on its own schedule.
vmfarms operates on an Anthropic 20× Max plan — the caps shown
here reflect our actual quota, which is substantially higher than the standard API tier.
Your own limits will differ based on your plan.