How do Codex usage limits work?

Codex meters subscription usage on a 5-hour rolling window, with a separate weekly cap. Since April 2026, ChatGPT Plus, Pro, and Business plans meter by tokens rather than message count, so heavier requests drain the window faster.

What happens when I hit the Codex limit?

You can buy extra credits, switch to a smaller model, or wait for the window to reset. The cheaper fix is sending fewer tokens per request in the first place.

How do I stop hitting Codex limits without upgrading?

Reduce how much context Codex sends per turn. Headroom compresses repetitive tool output and boilerplate locally before they reach the model, cutting token usage by about 50% so the same ChatGPT plan effectively lasts twice as long.

Codex Usage Limits: Hit the 5-Hour Limit?

How the 5-hour rolling window works

Codex meters subscription usage on a 5-hour rolling window. Once you start sending messages, a session begins and continues for five hours. Inside that window, every prompt, file read, tool call, and model response counts toward your allowance. When you hit the cap, you are blocked from sending more messages until the window rolls over.

There is also a separate weekly cap that prevents very heavy users from running constantly. Most people hit the 5-hour limit first.

Since April 2026, Plus, Pro, and Business plans meter Codex by tokens rather than message count, so heavier requests now drain the window faster. If you run out before it resets, OpenAI lets you buy additional credits or switch to a smaller model to keep going — both useful, but one costs money and the other costs capability.

What each plan tier covers

Plan	Best for	Codex capacity	You'll hit the limit when…
Free & Go	The occasional prompt, not sustained coding.	Limited access.	Almost immediately for real work.
Plus	Light to moderate coding — short sessions, a few focused hours a day.	Entry tier for real Codex work.	Heavy debugging or codebase exploration.
Pro (x5 / x20)	Full-time individual developers using Codex most of the workday.	Substantially higher; pick x5 or x20 to match your load.	Long multi-project days (x5); rarely (x20).
Business & Enterprise	Teams running long, context-heavy sessions across projects.	Highest allowances, pooled seats.	Rarely, for most team workflows.

The right tier depends less on your seniority and more on your workflow style. Someone who runs many short, narrow prompts uses far less than someone who does long debugging sessions with lots of file reads.

Why you hit limits sooner than expected

Most usage gets eaten by content you did not write yourself: tool output, file content Codex reads on its own, repeated conversation history, and large search results. A single failing build or a few large file reads can use as much of your budget as a long thoughtful prompt.

For a fuller breakdown of what burns usage fastest, read our Codex usage guide. If you want to understand why the bill (rather than the limit) feels high, our Why is Codex so expensive? page covers that angle.

Stretch your plan instead of upgrading

The fastest way to avoid the next limit message is to reduce how much context Codex sends per turn. Headroom intercepts your prompts locally, compresses repetitive logs and boilerplate, and forwards a leaner version to the model. Same workflow, ~50% fewer tokens, your plan effectively lasts twice as long. No account migration, no measurable quality loss.

For the full set of tools and tactics, see our Codex cost guide.

Codex usage limits and the 5-hour window

How the 5-hour rolling window works

What each plan tier covers

Why you hit limits sooner than expected

Stretch your plan instead of upgrading

Make your current plan last longer

Not ready to install yet?

Headroom is macOS-only — for now

You're on the list.