GitHub just switched Copilot to metered billing, and developers are watching months of credits vanish in a single day

sanitation@lemmy.today · 2 days ago

GitHub just switched Copilot to metered billing, and developers are watching months of credits vanish in a single day

m3t00🌎🇺🇦@lemmy.world · 4 hours ago

what? free trial over? 📯 ai spam free-for–all has sucked so far

gdtf@lemmy.ml · 3 hours ago

Nice. Maybe now my company stops trying to shove it down our throats.

SirEDCaLot@lemmy.today · 17 hours ago

Ever run an AI model locally? If you want the most capability you need a fast GPU with 32-48gb RAM. And that’s all for you, ONE user.

Copilot has millions of users, with tens or hundreds of thousands of them hitting the AI all at once. Each one needs $thousands worth of GPU and RAM dedicated to them for the length of their query processing.

Where do you think the money to buy all that hardware comes from? You see OpenAI buying a double digit percentage of the world’s RAM production, you think they got it on clearance sale?

No, there are investors. Investors who are pouring hundreds of billions into this AI stuff. And they don’t do this because it’s fun, they do it because they expect a BIG return.

So what’s going on is just like your neighborhood drug pusher, only the drug pusher is more honest. He says ‘first hit’s free, man’. AI company says ‘AI models are an easy and cost effective way to modernize your workflow!’; they don’t tell you that once you’ve integrated them and fired all the humans who know how to do the work, the price is gonna go way up.

Because the fact is, there IS a real cost of AI compute. GPU time, or at the large scale, datacenter space, power, cooling, etc.

In another few months to few years, the C-suites will stop huffing the koolaid and will start doing cost-benefit analysis on where AI is and isn’t cost-effective vs. humans. With any luck (for the AI people) by that time the AIs will be good enough that it’s a clear benefit. If not this bubble’s gonna pop.

Rooster326@programming.dev · 7 hours ago

Has anyone ever actually had a drug dealer giving out free hits?

I feel like that’s the biggest lie DARE ever told.

Constant Pain@lemmy.world · 4 hours ago

When I was a teen, I was offered drugs for free more than once, so it’s not all a lie. Maybe it is just uncommon in many places.

I’m Brazilian, BTW.

MIDItheKID@lemmy.world · 5 hours ago

DARE told so many lies. I was told that if I smoked weed, it would ruin my life and I would become a helpless drug addict. Then I smoked weed and that didn’t happen. Then they got me thinking “what else did they lie to me about?”.

Dasus@lemmy.world · 6 hours ago

Yeah no. That’s not good business, for small dealers. Only large ones looking to improve their margins and completely without morals.

Pretty much the only people who do that are actual medical doctors (who’ve gotten convinced into believing some lies by pharma reps who are the lowest of the low) and sample offerers at large grocery stores.

I mean I’ve had free hits from my dealers, and given out free hits. Of weed. While someone is buying weed. So it’s not like I’m pushing it on anyone or anyone is pushing anything on me.

Drug propaganda is mostly lies yeah.

Legalise, educate, tax and regulate.

TrackinDaKraken@lemmy.world · 4 hours ago

brucethemoose@lemmy.world · 3 hours ago

It’s more complex than that. The weights of big models are distributed, and then tokens are processed in parallel for multiple users. The setup varies, but it could be 8 GPUs serving many dozens of users at once, or bigger sets with even more parallelism.

I think the bigger problem is that Copilot is… shit.

It’s probably some ancient, inefficient architecture, not something super sparse and hardware efficient like (say) Deepseek V4, or Kimi 2.6, or Gemini Pro.

And literally every interesting dev team Microsoft has ever acquired (Phi, WizardLM, many more), and any interesting innovation they figured out, has just disappeared into a black hole.

They don’t have custom hardware, either, like Huawei NPUs or Cerebras WSEs, or Google TPUs. They’ve written some very interesting papers on that, and proceeded to do squat with them.

Also, it is AWFUL for its size. Tiny models that are basically free run circles around CoPilot.

What I’m getting at is that CoPilot is probably the most inefficient LLM out there. Like, it’s impressive how bad it is.

Zexks@lemmy.world · 3 hours ago

Umm copilot is just linkage to other models. My work VS instance defaults to claude but there are several others available. “Copilot” itself is not its own model

brucethemoose@lemmy.world · 3 hours ago

Really? I don’t use it for work, but I swore I was hitting some internal MS model for chat/code, as it was one of the worst experiences I’ve had with LLMs over 24B.

BlackPenguins@lemmy.world · 7 hours ago

That’s why I would rather run locally. I control my data. Better for the environment. And if I ask a programming question, sure ChatGPT will come back in seconds, but I’m fine with waiting 30-60 seconds for my own AI. How impatient are we.

Auli@lemmy.ca · 6 hours ago

And itbos slower and a whose model then they are using.

heartSagan5@lemmy.zip · 17 hours ago

So, what you’re saying…is the AI Bubble is going to pop once the pencil pushers do the math? But they’re asking their local LLM for that… so it isn’t happening?

SirEDCaLot@lemmy.today · 16 hours ago

Not pop. Correct.

A lot of the managers aggressively pushing AI have little or no understanding of it themselves. They just hear of a technology that can make a human more productive by doing most of the work for them. So absolutely that’s worth a ton of money. It’s why many companies are encouraging if not demanding employees to start using AI- because in their mind, one employee fully utilizing AI can do the work of two standard employees. Of course they believe this because they’ve never actually had to use the damn thing themselves and thus don’t realize it doesn’t do all the work for you. Or worse they think it does and your wonderful code base turns into spaghetti.

Side note- A few companies even had leaderboards for who was using the most AI tokens. This led to ‘tokenmaxxing’, trying to consume as many tokens as possible to prove you are adopting AI. Things like 'Write unit tests for our company code base, then refactor the code base. Spin up an instance of Claude and another of ChatGPT to each generate unit tests of the old code and run them against the new code, then run the tests against each other to check each other’s work, submit full debug output to another instance of gpt 5.5 that will check for hallucinations… Keep that query going for a few paragraphs and you’ll have an army of AI workers all checking each other’s work while producing zero productive output but costing a fortune to run.

T156@lemmy.world · 16 hours ago

Ever run an AI model locally? If you want the most capability you need a fast GPU with 32-48gb RAM. And that’s all for you, ONE user.

Even then, that’s quite small. Top of the line frontier models would be looking at hundreds of gigabytes of video memory, and just as much RAM.

A terabyte of VRAM/RAM needed for something like CoPilot is probably a fairly sensible estimate.

phx@lemmy.world · 16 hours ago

Depends on what you want to do, the model, and optimization or quantization.

A lot of LLM stuff that seemed pretty amazing a few years ago - chatbots and the like that respond to questions in plain language - can run in comparatively light hardware. Coding agents can take more, but could also be optimized against a particular language and spit out useful snippets.

Image stuff can be pretty complex especially at higher resolutions and detail, and creating seamless video segments gets expensive on hardware, fast.

SirEDCaLot@lemmy.today · 7 hours ago

Quite true. The thing is, there aren’t billions and billions of dollars in chatbots. The billions are for the creative stuff and the code.

And that is where the reckoning / correction will come from, the bill has to come due eventually. When top end generative AI starts to have a real cost associated with it, then it’s no longer a blanket ‘everyone start using this immediately’ mandate, it prompts some consideration of cost versus output quality.

sudoer777@lemmy.ml · 16 hours ago

Deepseek V4 Flash exists as a small open-weight model and is crazy cheap for what it is capable of. ClosedAI/Anthropic and other non-free models are sketchy AF with their pricing and basically everything else about them.

spaceracoon@lemmy.zip · 24 hours ago

The squeeze could not happen soon enough 😌

heartSagan5@lemmy.zip · 17 hours ago

For real. This whole “AI/LLM” nonsense could’ve been a nothing burger if they correctly charged it at the get go… Ah, well. Venture capital is suckers, I guess?

Auli@lemmy.ca · 6 hours ago

The problem is though they made it basically free. And that was a mistake cause people don’t like paying for stuff once it was free.

Sinthesis@lemmy.today · 16 hours ago

From what I’ve seen, the AI companies don’t even know what to charge. For example, claude.ai finally made an API for user costs and we get this fun fact:

Values for a given date may be revised for up to 30 days as late events arrive and reconciliation runs. For invoicing-grade totals, query dates at least 30 days in the past.

underscores@lemmy.zip · 19 hours ago

btw soon GitHub actions will no longer be free to self host

garbage_world@lemmy.world · 16 hours ago

For private repos, unless something changed since I last heard about that

kazerniel@lemmy.world · 1 day ago

*plays the world’s tiniest violin*

mazzilius_marsti@lemmy.world · 19 hours ago

what is a “usage” btw? They said 1 cent = 1 usage.

So 1 message sent to Copilot? Or usage = number of response Copilot gives back to the user?

SirEDCaLot@lemmy.today · 18 hours ago

I think that was before. Now it’s based on the compute cost of the request. Like if you ask a large model to perform a long and difficult task you’ll pay through the nose. If you ask a simple model to perform an easy task it’ll be dirt cheap. And that’s what’s got people pissed, is that what used to be ‘one query’ now may consume a lot of credits and it may not be clear how many.

qaz@lemmy.world · 2 days ago

Everyone could’ve seen it coming from mile away

nbsp@programming.dev · 2 days ago

anyone who didnt is a fucking idiot

zbyte64@awful.systems · 6 hours ago

I’ve got some bad news about our managers and leaders…

pHr34kY@lemmy.world · 2 days ago

It wasn’t profitable. This bait and switch was a long time coming.

minorkeys@sh.itjust.works · 16 hours ago

I’m sure they wanted to drag out the adoption longer than it was viable, too. Didn’t quite get everyone dependent.

mic_check_one_two@lemmy.dbzer0.com · 2 days ago

Yup, the “first one is free” deal is always a trap.

SCmSTR@lemmy.blahaj.zone · 2 days ago

Literal drug dealer strategy

Clutter@sh.itjust.works · 2 days ago

In just glad it happened relatively soon. Should have happened sooner :-)

replicat@lemmy.world · 2 days ago

I’ve been using it for well over a year now.

Hit my limit in 1 day and canceled.

Ohi@lemmy.world · 2 days ago

Yup, same. x57 for GPT 5.5 now?! The GitHub agent architecture was hot as hell and allowed me to work on projects without ever bootstrapping it locally, but there is no way I’m gonna get sucked into spending hundreds a month. It felt good seeing that $250 refund come through for my yearly plan. I’ll take those funds to Claude or Codex directly. Throttled usage is way better than being fucked over by a glorified reseller.

Passerby6497@lemmy.world · 23 hours ago

It felt good seeing that $250 refund come through for my yearly plan. I’ll take those funds to Claude or Codex directly

I look forward to seeing the follow up comment where you do the same with one of those when they inevitably do the same

BlackLaZoR@lemmy.world · 16 hours ago

They’ll throttle you more - it’s just different way of gouging price

Ohi@lemmy.world · 14 hours ago

Perhaps, but at least I’m not being charged for each attempt at doing work besides my time in the attempt. Copilot would bug out and that cost me usage credits. Borderline stealing from its users paying for limited credits per month.

darkangelazuarl@lemmy.world · 1 day ago

Lol, all AI is shifting to this model. Those other companies will bait and switch too just a matter of time.

KeenFlame@feddit.nu · 19 hours ago

The company with all money all phones all email all calendars and meetings and satellites and shit will probably not.

Ohi@lemmy.world · 14 hours ago

It’s too bad their models are also all garbage, or I’d be on that Google AI train. They’re desperately trying to stay relevant in a space they should be owning in the first place. You plant shit seeds, you get shit weeds. Even if the models were good, their track record shows it’s not worth the time investment adopting their stack because chances are it ends up in the infamous graveyard within 3-5 years.

MrTolkinghoen@lemmy.zip · 22 hours ago

Yeah I don’t doubt that anthropic is burning money so fast right now. It’s actually wild to see, GitHub’s pricing might actually be the first glimpse of real pricing for AI we’re seeing.

bigbangdangler@reddthat.com · 2 days ago

Sigh. None of this is surprising in the least. The cost of AI infrastructure and compute (coupled with the complexity of the chain) makes it prohibitively expensive.

It only has appeared cheap because of investor money flowing like Niagara on the off-chance that it could be made cheap enough to be profitable after getting everyone addicted to using it. I really don’t think it’s there, and it’s definitely not cheap enough to continue flying for free much longer.

BassTurd@lemmy.world · 2 days ago

There will be some major bag holders soon. Who’s is gonna be? The funnelers or the funnelees? How does MS bounce back having integrated copilot into everything?

MalReynolds@slrpnk.net · 2 days ago

Given the recent NASDAQ changes, looks like US pension funds will end up with the bag. What a scam.

Microslop can suck it.

BNE@lemmy.blahaj.zone · 2 days ago

So a massive wealth transfer - right after Covid, ANOTHER unprecedented wealth transfer.

Side question: when are you people in the imperial core going to do something about this with all that God given Liberty™ and Freedom® you continually bomb and destabilize the rest of us for?

MalReynolds@slrpnk.net · 2 days ago

Shakes Magic 8 Ball: Signs point to never.

BassTurd@lemmy.world · 2 days ago

We got ourselves a keyboard warrior here.

BNE@lemmy.blahaj.zone · 1 day ago

A stunningly generic defensive sledge from user BassTurd, I guess.

HaraldvonBlauzahn@feddit.org · 2 days ago

As in the 2008 saying: “All that money has not really disappeared. It is just in the pockets of different people”.

JackbyDev@programming.dev · 2 days ago

B- but the DOW 🥺

KeenFlame@feddit.nu · 19 hours ago

I don’t understand how everyone repeats this shit… as if we have never ever before seen a renewable asset… no, it’s not expensive to run them. It is expensive to give away usage. It is expensive to train. It is expensive to buy hardware. It is very very cheap to run. But hypercapitalism ruins everyone

Auli@lemmy.ca · 6 hours ago

So how do they get back the money spent on hardware and training? They also have to update their hardware occasionally. It is expensive to run.

ryantown@lemmy.world · 2 days ago

Yeah, what’s the surprise here? Turns out it’s expensive.

KeenFlame@feddit.nu · 19 hours ago

For the poor that will take the hit when the next bailout happens

paraphrand@lemmy.world · 2 days ago

Microsoft was deceptive here and never made it clear exactly what sort of deal you were getting with the flat rate. There was no indication of the actual magnitude.

W98BSoD@lemmy.dbzer0.com · 7 hours ago

Pop-pop!

Thorry@feddit.org · edit-2 2 days ago

Yeah especially because they now have some convoluted method involving different counts of processing, cache etc. But the developer has no easy way of seeing those statistics and thus has no feel for them. And developers already have little control over how much tokens a task takes. Which was fine with the flat rate, just use the service. But now that those things actually matter, the stats should be way easier to see?

So in typical Microsoft fashion not only did they raise prices they somehow made it even more shit. Like the AI already sucked, but does the service itself need to suck as well?

Not being able to control costs and very vague productivity improvement claims makes the ROI impossible to calculate. So even if the AI wasn’t shit, it would still be hard to figure out if it’s even helping at all.

dan@upvote.au · 2 days ago

different counts of processing, cache etc

Don’t all the providers do this, though? Anthropic/Claude has different pricing based on if you’re caching for five mins vs one hour (which are the only two options for cache TTL). https://platform.claude.com/docs/en/about-claude/pricing

paraphrand@lemmy.world · 2 days ago

Well, the point here is the deception. So, if you can find a similar link from the past from Microsoft…

Meron35@lemmy.world · 1 day ago

Copilot was uniquely awful at this, because up to until literally days before the switch to usage based billing there was no way for people to track token usage, despite repeated calls from the community.

Microsoft only added a billing “projection” feature on the admin page that was meant to download a spreadsheet (which straight up didn’t work for most people) less than a week before the new billing structure.

dan@upvote.au · 1 day ago

Ahh, that’s not great. I’ve only ever used it for free since I get the $10/month plan for free as an open-source maintainer (https://docs.github.com/en/copilot/how-tos/copilot-on-github/set-up-copilot/enable-copilot/set-up-for-teachers-and-os-maintainers) so I’ve never had to deal with the billing side.

The amount of value they used to provide with the $10 plan felt too good to be true.

Meron35@lemmy.world · 18 hours ago

It was too good to be true. Microsoft had to implement additional 5 hour and week usage limits to cope with demand.

The problem was, for the entire history of copilot, there was literally no way to even check these usage limits. All usage counted towards these limits, even those with a 0x multiplier that didn’t consume premium requests.

It was so bad that many people couldn’t even use the premium requests they paid for the month. The only around it was to switch to auto model routing, which would tend to route to lower quality models.

BassTurd@lemmy.world · 2 days ago

You’d have to be truly ignorant to not know that it was going to happen (not saying that you are). It’ll be interesting to see if there any legal recourse, but I’m guessing not.

All of these companies are hemmoraging money to train and provide LLMs. The only option is to charge more. This current increase, IMO, is an alpha test for future increases. Unless there is a major jump in the technology, moreso the logic to train LLMs more efficiently, the cost has to keep going up to stay solvent for most of these major players.

paraphrand@lemmy.world · 2 days ago

I’m not talking about ignorance here, just one of the facts about how Microsoft was being shitty.

Are you defending Microsoft’s choice to be shitty? Or just excited to share how you knew better?

BassTurd@lemmy.world · 2 days ago

Absolutely not defending Microsoft. I’m just saying that it was obvious even a couple of years ago that this current pricing model isn’t feasible. It was always part of the plan to jack up prices, because that’s how every subscription service operates. In the case of AI, the expenses are astronomical and the pricing has never been close to something resembling profit.

The_Decryptor@aussie.zone · 2 days ago

Microsoft announced these price increases back in April, this shouldn’t be a surprise to anybody.

toddestan@lemmy.world · 5 hours ago

The surprise is that while Microsoft did announce how they were changing how Copilot usage is billed, they didn’t really give users a good way to gauge how expensive their current usage would be under the new billing system. Turns out it’s a lot more expensive than most people were expecting.

ryantown@lemmy.world · 2 days ago

It was very much a red flag when they shut down signups.

porous_grey_matter@lemmy.ml · 2 days ago

It was easy to know how much the slop machine costs to run if you bothered to put even a tiny effort into finding out.

artyom@piefed.social · 2 days ago

The surprise is that they’re not continuing to throw VC money at it to bolster user numbers and mislead investors.

Trilogy3452@lemmy.world · 2 days ago

I think this is "stupid money’, again.

I hope this makes companies stop being “AI native”

teft@piefed.social · 2 days ago

Man, enshittification is happening so fast for ai. Imagine the next big thing. It’ll be enshittified prior to release.

kieron115@startrek.website · 21 hours ago

This is the same business model that tech “startups” use, just at a vastly accelerated time table. Upend some existing market (in this case, several markets) by burning through cash at an unsustainable rate. Once your customers have been hooked, and/or the alternatives eliminated as competitors, you crank up the prices to try and make the business cashflow positive.

porous_grey_matter@lemmy.ml · 2 days ago

This isn’t enshittification in the traditional sense, they haven’t captured the market enough for that. They’re just panicking because they’re burning cash way too fast.

paraphrand@lemmy.world · 2 days ago

That word is only used correctly about 5% of the times it is used.

Windex007@lemmy.world · 2 days ago

Don’t wanna be “that guy”, but if if 95% of the time people use it to generally mean “greedy companies making things shitty to try and wring an extra buck”, then that just is what that word means now.

It leaves a vacuum for the concept it was originally intended to describe, but that’s how she goes.

paraphrand@lemmy.world · edit-2 23 hours ago

I guess so. But the original definition was useful and encapsulated a specific thing. And then everyone discarding that definition is a bummer. But it’s not like there is anything to be done about it. And Doctoro has said he’s not too annoyed by it himself.

Windex007@lemmy.world · 1 day ago

I wholeheartedly agree. I love when I hear a new word, and a mushy concept just snaps into focus. Like, yea, I understood that concept in my “gut” but had not special word for it? So gratifying.

Feels like theft when it gets appropriated to something else.

Glad to hear Cory is cool w/ it. Putting a word (even by accident) to a poorly-expresable part of the human experience that let’s people communicate more effectively? That is a contribution to society that most of us only dream of. Fucker did it by accident. Thrilled for him.

SpaceCowboy@lemmy.ca · 21 hours ago

I don’t think saying AI is being enshittified is too far from the orignal essay

This is enshittification: surpluses are first directed to users; then, once they’re locked in, surpluses go to suppliers; then once they’re locked in, the surplus is handed to shareholders and the platform becomes a useless pile of shit. From mobile app stores to Steam, from Facebook to Twitter, this is the enshittification lifecycle.

The AI companies were definitely subsidizing usage of the tech in the hopes of getting people locked in. Don’t think people are all that locked in. Probably some people are, suppliers like NVIDIA seem pretty locked in. But I don’t think there’s all that much to direct to shareholders relative to what they put into it to try to get lock in.

So it’s kinda failed enshittification? Premature enshittification… preshittification?

Bluescluestoothpaste@sh.itjust.works · 6 hours ago

It’s just enshittification, like what are we talking about? They’re making the product shittier to raise margins, period.

KeenFlame@feddit.nu · 19 hours ago

You bastard… it’s perfect

Ginny [they/she]@lemmy.blahaj.zone · 2 days ago

It’s ironic, but I could care less. You’re literally beating a dead horse.

MycelialMass@lemmy.world · 2 days ago

Are suggesting the word itself has become enshittified?

boonhet@sopuli.xyz · 2 days ago

Interestingly enough, not the correct use for that word lol

MycelialMass@lemmy.world · 1 day ago

😏

Billegh@lemmy.world · 2 days ago

Quick, we need a word for enshittified enshittification!

Jakeroxs@sh.itjust.works · 1 day ago

“They’re burning cash too fast”

Their most recent earnings was 82.9B just last quarter, you don’t know what you’re talking about.

porous_grey_matter@lemmy.ml · 1 day ago

Ah yes. Of course, all of Microsoft just exists to subsidize GitHub AI tool usage.

SpaceCowboy@lemmy.ca · 21 hours ago

Well given the number of developers they canned so they could dump more money into the AI money pit, they kinda were heading that direction. But I guess they clued in that there might be a small problem with that business model.

Jakeroxs@sh.itjust.works · 24 hours ago

They’re putting copilot in everything, it’s not just github that has copilot lmfao

zurohki@aussie.zone · 2 days ago

Meta kind of did that with their VR stuff. They skipped the appealing to users part and build a bland, brand-safe, microtransaction-laden experience to sell to businesses assuming they could just use their size to force users to buy it.

Homosexual sapiens@lemmy.blahaj.zone · 6 hours ago

And in the processs they bought and shuttered the majority of VR game studios

JackbyDev@programming.dev · 2 days ago

I think that’s because Meta acquiring Oculus was the enshitfication.

badgermurphy@lemmy.world · 2 days ago

They forgot the first step where you make a great product that everyone’s clamoring to use.

spankysalmon@fedinsfw.app · 2 days ago

They did, if by “everyone” you mean rich detached executives that constantly make decisions they have not the intelligence to understand.

nightlily@leminal.space · 2 days ago

„Enshittified“ implies it was ever not shit.

Lost_My_Mind@lemmy.world · 2 days ago

As a mod for the Enshitification you hate to see Enshitification in any form.

As a mod of Fuck_AI, I’ll make an exception in this one case.

COMMENCE THE ENSHITIFICATION!!!

Which completely goes against my Enshitification mod mindset, but here we are.

So…nobody told me life was going to be this way…

clap-clap-clap-clap

dustyData@lemmy.world · 2 days ago

Enshittification strategies only work if the product actually produces revenue and it grows along with market share. Unfortunately, even if market use grows, revenue from AI is not there. Even YouTube crossed the break even boundary before the start of ad enshittification and after the Google buyout. LLMs financial projections are starting to show that it will never reach enough revenue to cover costs, much less cover ROI into infrastructure and capacity. The math isn’t mathing and investors are starting to get cold feet. So they either enshittify now or the capital soon will start vanishing.

nieceandtows@programming.dev · 1 day ago

Basically what happened with Claude Opus 4.8

Bizzle@lemmy.world · 2 days ago

It’ll be a macguffin that makes your life a little more convenient in some ways, but it’ll cost the equivalent of a 1996 Honda Civic to use monthly and you have to smash your hand with a hammer to turn it on

GitHub just switched Copilot to metered billing, and developers are watching months of credits vanish in a single day

GitHub just switched Copilot to metered billing, and developers are watching months of credits vanish in a single day

Just a moment...