Key takeaways
- GitHub replaced Copilot's flat Premium Request Units with token-priced GitHub AI Credits on June 1, 2026. Base plan prices held (Pro $10, Pro+ $39, Business $19/user, Enterprise $39/user), but the spending ceiling is gone and the cheaper-model fallback was removed.
- The viral "$29 to $750" and "$50 to $3,000" screenshots are individual-developer projections, not GitHub-issued bills. Real receipts confirm that heavy agentic users burn credits fast; autocomplete-only users notice nothing.
- Autocompletes and Next Edit Suggestions stay unlimited. Copilot code review now also burns GitHub Actions minutes on top of AI Credits, a real double meter to watch.
- For founders and engineering leads: turn on a hard budget cap, run the usage export, enforce model-selection discipline, and price out flat-fee alternatives (Windsurf, Claude Code, Cursor) as leverage and as a hybrid stack.
What actually changed on June 1, 2026
GitHub's chief product officer Mario Rodriguez announced the shift on April 27, 2026. The official GitHub Blog post framed it as the only way to keep an agentic Copilot economically sustainable. The go-live changelog landed on June 1, 2026 and the mechanics are now official.
Premium Request Units are gone. The replacement is the GitHub AI Credit. One credit equals one cent ($0.01), and every interaction is priced on the model's published per-token rate across input, output, and cached tokens. Code completions and Next Edit Suggestions still cost nothing. The agentic features (agent mode, the Copilot cloud agent, Copilot CLI, Spaces, Spark, and third-party coding agents) consume credits.
Two changes hit hardest. First, the cheaper-model fallback is gone. Previously, exhausting your Premium Requests dropped Copilot to a lighter model so you could keep working. Now, if your included credits run out and your additional-usage budget is zero, credit-consuming features stop until the next cycle. Second, Copilot code review now consumes GitHub Actions minutes alongside AI Credits, a true double meter. For teams that already run a tight delivery process (we cover this in our notes on mobile app deployment strategy), that detail matters more than the headline.
The new individual plans, line by line
Between the April announcement and the June 1 launch, GitHub absorbed enough backlash to add two things: a "flex allotment" of bonus credits inside each plan, and a brand new Copilot Max tier. The breakdown as of go-live:
- Pro at $10/month: $10 base credits plus $5 flex, $15 total credits per month.
- Pro+ at $39/month: $39 base plus $31 flex, $70 total.
- Max at $100/month: $100 base plus $100 flex, $200 total.
- Business at $19/user/month: 1:1 base credits, no flex. June through August 2026 gets a promo top-up to $30 per user.
- Enterprise at $39/user/month: 1:1 base credits, no flex. Promo top-up to $70 per user through August.
GitHub committed to keeping base credits fixed 1:1 with price. The flex allotments are explicitly described as variable and "may change over time," which is the same caveat that has made cloud providers' free tiers a moving target for a decade. If you are scoping a team that ships product end-to-end, our guide to affordable app development services in the USA walks through how to model variable tooling costs into project budgets so they do not creep up on you mid-build.
How tokens get priced (the actual math)
The mechanics are simple. Every interaction passes input, output, and cached tokens through the chosen model. The token count is multiplied by the model's per-million-token rate. The resulting dollar amount is converted to AI Credits at 1 credit = $0.01. The cost of any interaction depends on exactly two things: the model and the token count.
Per-million-token rates from the official pricing docs:
- OpenAI: GPT-5 mini (included on paid plans, consumes zero credits): $0.25 input / $2.00 output. GPT-5.4: $2.50 / $15.00. GPT-5.5: $5.00 / $30.00.
- Anthropic: Claude Haiku 4.5: $1.00 / $5.00. Claude Sonnet 4.5 to 4.6: $3.00 / $15.00. Claude Opus 4.5 to 4.8: $5.00 / $25.00 plus cache-write costs. For the technical context on what the Opus 4.8 jump actually delivers in practice, see our Claude Opus 4.8 deep dive.
- Google: Gemini 2.5 Pro: $1.25 / $10.00. Gemini 3.5 Flash: $1.50 / $9.00.
GPT-4.1 and GPT-5 mini are the two models marked "included" on paid plans. They consume zero credits no matter how many turns you take. Auto model selection, where Copilot picks the cheapest model that can answer the prompt, gets an additional 10% discount on model costs.
That last line is the most important point in the entire post. Model choice, not credit pool size, is the lever. A team that defaults agents to GPT-5 mini and reserves Opus 4.8 for genuinely hard multi-file problems can run inside Pro+ all month. A team running every prompt on Opus 4.8 with full-repo context will burn through Pro+ in days. The same scope-discipline thinking shows up in our MVP vs full product strategy guide, and it applies here just as cleanly.
Where the viral "$29 to $750" numbers really come from
The headlines are wild. $29 plans projected to $750. $50 plans hitting $3,000. "10x to 50x" cost increases. TechCrunch surfaced these figures on May 30, 2026, sourced to individual developers on Reddit, X, and GitHub's own community discussion thread. None of them are GitHub-issued bills. They are worst-case extrapolations from heavy agentic users.
What is real, sourced to attributable receipts:
- A developer used 1,180 credits (16% of a Pro+ monthly allowance) in a single Claude 4.8 session, per The Register.
- Another reported burning 8% of Pro+ in two hours, projecting the 7,000-unit quota would last "less than two days" at that pace (community discussion 192948, user mtaheri8541).
- A 3-minute GPT-5.5 chat consumed 405 credits, roughly $4 (community thread).
- A single change request cost $6+ on Pro+.
The GitHub community discussion drew 904 downvotes against 22 upvotes across 435 comments in the first 72 hours, one of the most lopsided reactions in the forum's history. Counter-voices in the same threads argue the nightmare bills mostly hit "vibe coders" running bloated agentic loops on frontier models, and that disciplined tool use stays affordable. Both are right. The dollar figure depends entirely on how a developer uses the product. Treating the agent like a search box on Opus 4.8 will hurt. Treating it like a precision instrument will not.
Why GitHub did this
The official line, from Rodriguez's April blog, was honest enough: "a quick chat question and a multi-hour autonomous coding session can cost the user the same amount" under the flat model, and "it had become common for a handful of requests to incur costs exceeding the plan price." Translation: Copilot was a loss leader, and the loss was getting bigger every week.
Ed Zitron's "Where's Your Ed At" newsletter quoted an internal Microsoft document on April 20: "the week-over-week cost of running GitHub Copilot nearly doubling since January." That is secondhand and GitHub has not confirmed it, but the language in the official April post is consistent with the framing.
The wider pattern: Cursor moved to usage-based credits in June 2025, apologised and refunded customers in July 2025 after backlash, then settled into the new model. Windsurf shifted to quotas in March 2026. Anthropic moved enterprise customers to usage-based billing. Microsoft CEO Satya Nadella has said every per-user Microsoft business is moving to per-user-plus-usage. AI coding tools are graduating from a fixed perk to a managed, metered service. Treat token spend like cloud spend. The same FinOps discipline applies. Our notes on audit-ready AI agents cover why observability matters when an autonomous loop can rack up hundreds of dollars in minutes if you let it.
The backlash, in detail
Coverage was broad and pointed. The New Stack, The Register, TechCrunch, Tech Times. Recurring complaints across the threads:
- Loss of cost predictability. Flat-rate simplicity is gone. Even disciplined users now have to think before every agent invocation.
- No cheaper-model fallback. When you run out, you stop, you do not switch. For teams running tight sprints, the abrupt stop is the worst part.
- Double meter on code review. AI Credits plus Actions minutes on the same review run.
- Annual-plan bait-and-switch. Per The Register, model multipliers jumped on June 1 for annual subscribers: Claude Opus 4.7 reportedly from 7.5x to 27x; GPT-5.4 from 1x to 6x.
- Small teams and individual developers feel it most. They lack pooling and the enterprise budget tooling that softens the blow for larger orgs.
GitHub paused new sign-ups for Student, Pro, Pro+, and Max plans during the transition. A defensive move that did not help the optics.
The seven-day playbook for engineering leaders
The right reframing: AI coding tools are now a metered cloud service. Run them like one. In the first week:
- Set a hard cap today. Individuals: set the additional-usage budget to $0, or a deliberate small overage. Orgs: set a universal user-level budget above the per-seat value, set an enterprise budget as a failsafe, and crucially, enable "Stop usage when budget limit is reached" on every enterprise and cost-center limit. It is OFF by default. Without it, the limit is an alert, not a guardrail, and charges keep accruing.
- Measure your real baseline. Export the usage report and look at per-model token burn. Do it during the June through August promotional window for Business and Enterprise plans, but budget against the smaller post-September pool, not the inflated promo numbers.
- Enforce model-selection discipline. Default to included or lightweight models (GPT-5 mini, GPT-4.1). Use auto mode for the 10% discount. Reserve frontier models (GPT-5.5, Claude Opus, Gemini Pro) for genuinely hard, multi-file problems. The credit pool is rarely the constraint. Model choice is.
The thirty-day playbook
Once the baseline is clean, the rest of the work is FinOps for AI tooling. Same skill set as cloud cost management, applied to a new line item. The next four weeks:
- Identify power users and run a FinOps-style analysis. Who drives spend, on which models, for what work. Give power users individual budget overrides rather than raising everyone's limit. This is exactly the kind of cost-vs-value framework we use when comparing engineering hiring options in our guide to hiring developers in Malta.
- Practice prompt and context hygiene. Scope agents to specific files or modules instead of indexing whole repos. Cutting context from ~100k to ~20k tokens cuts input cost roughly 80%. Batch related questions in one session. Avoid blind iterative loops on frontier models. The same discipline our team applies when scoping mobile application development applies here.
- Watch the code-review double meter. Because Copilot code review now also consumes GitHub Actions minutes, set an org-level default runner and monitor the
copilot-pull-request-reviewerworkflow. For high-volume PR automation, compare against purpose-built CI tools. - Price out alternatives as leverage and as a hybrid option. For autocomplete-heavy teams, Copilot Pro at $10 is still the value leader. For agent-heavy individual work, flat-fee Claude Code Max ($100 or $200) or Windsurf and Cursor tiers may be cheaper and more predictable. For regulated or privacy-sensitive shops (which we see often across our healthcare, fintech, and legal industry work), Tabnine self-hosted or local models via Ollama warrant evaluation.
How the alternatives stack up
The competitive snapshot for mid-2026:
| Tool | Individual pricing | Team/Enterprise | Billing model |
|---|---|---|---|
| GitHub Copilot | Pro $10, Pro+ $39, Max $100 | Business $19/user, Enterprise $39/user (pooled) | Token-based AI Credits |
| Cursor | Hobby free, Pro $20, Pro+ $60, Ultra $200 | Teams $40/user, Enterprise custom | Dollar credit pool equals plan price; Auto mode unlimited |
| Windsurf | Free, Pro $20, Max $200 | Teams $40/user | Quota-based, flat tiers |
| Claude Code | Pro $20, Max 5x $100, Max 20x $200 | Team Premium $100/seat (5-seat min) | Subscription token budget or API pay-as-you-go |
| Tabnine | Dev ~$12 | Code Assistant $39/user, Agentic $59/user | Flat seat; self-hosted option |
| Self-hosted (Qwen, DeepSeek, Llama via Ollama) | Free model + hardware | Free model + infra and ops | Zero per-token; you run the GPUs |
Two notes: competitor pricing changes fast (verify each vendor's page before committing), and Sourcegraph Cody is now enterprise-only at roughly $59/user/month after Free and Pro tiers retired in July 2025.
For most teams, the smartest move is hybrid. Use Copilot Pro for autocomplete at $10/seat. Layer Claude Code or Cursor on top for agentic work where the flat fee is more predictable. Reserve self-hosted models for routine internal tasks where privacy or cost matter more than top-tier reasoning. Our AI development practice covers this scoping work for the projects we ship, and the same logic applies whether you are choosing tools for an internal IDE or a customer-facing AI feature, which we cover in our AI and machine learning in modern app development guide.
Benchmarks that should change your plan
- If a developer routinely exhausts Pro+ or Business credits before mid-month on disciplined use, move them to Copilot Max or a flat-fee competitor.
- If org token spend is not correlating with merged work or shipped features, tighten model policy and budgets before buying more credits.
- If your monthly metered overage approaches the cost of a flat-fee competitor seat, switch that cohort.
- Watch whether GitHub extends promo credits past August 2026. If they do, migration pressure is real and worth negotiating on.
How Brandrums helps engineering teams navigate this
The June 2026 shift turned AI coding tools from a perk into a line item that needs FinOps discipline. We help engineering teams stand up that discipline in two ways. First, we help you scope and pick the right tool mix (Copilot for autocomplete, flat-fee tools for agent work, self-hosted for privacy-sensitive paths) based on the actual work the team ships. Second, we plug into the build itself, whether that is mobile application development, web platform work, or AI feature integration, so the tooling and the product roadmap stay aligned. You can see how we approach delivery in our project portfolio.
Key takeaways
- Token-based AI Credits replaced flat Premium Requests on June 1, 2026. Code completions stay free. Agents and code review now meter.
- The "$29 to $750" headlines are individual projections. The real receipts (1,180 credits in one session, 8% of Pro+ in two hours) prove that heavy agentic use burns credits fast, while disciplined use stays affordable.
- Model choice is the lever. Default to lightweight or included models, reserve frontier models for hard problems, and use auto mode for the 10% discount.
- Set the hard budget cap this week. Most enterprise budget toggles are alerts by default, not guardrails. You have to explicitly enable "Stop usage when limit is reached."
- Price out flat-fee alternatives (Windsurf, Claude Code, Cursor) as leverage and as a hybrid option. Most teams will end up running a portfolio.
FAQ
Did GitHub Copilot get more expensive?
Base plan prices did not change. Pro is still $10, Pro+ is still $39, Business is still $19 per user, and Enterprise is still $39 per user. What changed is the spending ceiling. On the old plan, you hit a soft cap when you exhausted Premium Requests. On the new plan, you can keep using AI Credits past your included amount up to whatever overage budget you set. Whether your bill goes up depends entirely on how heavily you use agents and which models you choose.
What are GitHub AI Credits?
AI Credits are GitHub's new currency for metered Copilot usage. One credit equals one cent. Every agentic interaction is priced on the model's per-million-token rate across input, output, and cached tokens, then converted to credits. Code completions and Next Edit Suggestions still cost zero credits.
Is the cheaper-model fallback really gone?
Yes. Previously, exhausting your Premium Requests dropped Copilot to a cheaper fallback model so you could keep working at no extra charge. That behaviour is no longer in the product. If your additional-usage budget is set to $0 and you exhaust included credits, agentic features stop until the next cycle. Code completions still work.
How do I avoid surprise bills?
Set the additional-usage budget to $0 if you never want to pay over your subscription. For teams, set universal user-level budgets above the per-seat value, set an enterprise budget as a failsafe, and explicitly enable "Stop usage when budget limit is reached" on every limit. That toggle is OFF by default. Without it, limits are alerts and charges keep accruing.
Are there cheaper alternatives to GitHub Copilot in 2026?
For autocomplete-only use, Copilot Pro at $10 is still the value leader. For agent-heavy work, flat-fee tiers from Claude Code, Windsurf, and Cursor (each in the $20 to $200 per month range) are often cheaper and more predictable. For privacy-sensitive teams, Tabnine self-hosted or local models via Ollama are worth evaluating. Most teams end up running a hybrid stack rather than picking one tool.
What changes for big enterprise customers?
Business and Enterprise plans get a credit pool shared across the organisation, plus four overlapping budget controls (universal user-level budget, individual user overrides, cost-center budgets, and an enterprise budget). The enterprise budget does not cap total spend; it only caps metered overage after the pooled credits are exhausted. License fees are still owed regardless. Promotional extra credits are running June through August 2026: $30 per Business user, $70 per Enterprise user. They drop back to the standard 1:1 pool on September 1.
Ready to plan the right AI tooling stack for your team?
The June 2026 shift made one thing obvious. AI coding tools are now a managed cloud service, not a fixed perk. The teams that handle the transition cleanly are the ones running real FinOps discipline on their tooling spend, layering flat-fee alternatives where they make sense, and reserving frontier models for problems that genuinely need them. Tell us what your team is shipping and we will help you scope the right Copilot plus alternatives mix. Or review our pricing options if you are evaluating engineering support for an upcoming build.



