DeepSeek makes the V4 Pro price discount permanent

Nifty3929 · 2026-05-24T15:56:53 1779638213

China may be subsidizing this for now in a way that US companies can't or won't - but if they keep building power infrastructure and the US doesn't, then it will no longer require subsidy from them. It will simply be absolutely cheaper (including profit margin) to serve tokens in China.

China is building for the future, while Western Democracies are afraid of the future, and of their own shadow.

hedora · 2026-05-24T16:35:31 1779640531

I'm not sure how much of it is subsidies. If the open weight models are anything to judge by, China is taking price performance seriously, and the US model vendors are looking for performance at any cost. Like any other Pareto optimization, we end up paying 10x more for the last few percent improvement on benchmark scores.

Of course, like literally every other time this has played out in computing history, the companies focused on price performance will end up with more economic resources, and get to turn the upgrade crank more often and for longer.

Also, of course, China's way ahead of the US on things like renewables, batteries, and electrification of their economy. All of that feeds into cheaper power to run the models, but I suspect it's a second order effect vs. "improve the software".

submain · 2026-05-24T21:17:31 1779657451

It seems to me China is chasing widespread adoption, while the US is chasing the AGI dream.

layoric · 2026-05-24T22:02:07 1779660127

They also banned crypto mining which previously was using the free to cheap electricity, so if AI data centres are using those now under utilised supply, very possible subsidies are very low.

aftbit · 2026-05-24T23:26:54 1779665214

And yet despite the ban, China's contributions to Bitcoin mining remain very large.

https://cryptonews.com/news/china-doubles-down-on-crypto-ban...

mxschumacher · 2026-05-24T19:16:50 1779650210

not just renewables, also massive nuclear capacity and huge modern coal plants. They can really crank up capacity if they want to. How long will it take to get a new nuclear power plant operational in the US?

radialstub · 2026-05-24T19:18:01 1779650281

> Of course, like literally every other time this has played out in computing history, the companies focused on price performance will end up with more economic resources, and get to turn the upgrade crank more often and for longer.

The iphone is the best selling computing device in history and is among the most expensive in its category.

palata · 2026-05-24T20:13:57 1779653637

Most smartphones being sold are Android, though.

radialstub · 2026-05-24T20:53:05 1779655985

True, however apple makes the overwhelming majority of the profit in the smartphone market.

dietr1ch · 2026-05-24T21:12:25 1779657145

For most people Apple's main selling point is about showing off the cute devices and battery life, but that's not going to play a role when users are free to choose the tool that will call the models.

swingboy · 2026-05-24T22:39:28 1779662368

You might be vastly overestimating what a majority of phone owners use their phone for.

rootusrootus · 2026-05-24T22:59:20 1779663560

I’ve never seen anyone show off an iPhone. What a weird take.

dietr1ch · 2026-05-24T23:26:36 1779665196

I was talking more about laptops, but haven't you seen people sms bubble colour-shaming?

onlyrealcuzzo · 2026-05-24T16:24:06 1779639846

> China may be subsidizing this for now in a way that US companies can't or won't

They're subsidizing this in many ways - Huawei chips, new DDR5 memory fabs, etc.

Ultimately, DeepSeek's architecture is significantly more cost effective than anything from Google, OpenAI, or Anthropic.

Presumably, they'll incorporate DeepSeek's MLA* architecture to get all the benefits for next year's releases (if not this year's upcoming releases) which will bring down their costs...

They need to actually make money, though, so that might still not give them enough room to make enough money.

Ultimately, hardware depreciation is like 80% of total spending. So power is not as big of a deal in cost. The bigger problem is if you can get the power at all, not how expensive it is.

If you want to bring down inference costs, using less hardware is far more effective than getting cheaper electricity.

Google is in a sweet spot, because they aren't paying 80% margins to nVidia for hardware. So they're probably paying half as much deprecation as everyone else is (or maybe 1/4th for inference - which is now the biggest percentage overall).

nl · 2026-05-24T23:34:54 1779665694

> Huawei chips, new DDR5

The US is subsidizing in exactly the same way through the US Chip Act (as well as state level tax subsidies):

> The act includes $39 billion in subsidies for chip manufacturing on U.S. soil along with 25% investment tax credits for costs of manufacturing equipment, and $13 billion for semiconductor research and workforce training

https://en.wikipedia.org/wiki/CHIPS_and_Science_Act

> Presumably, they'll incorporate DeepSeek's MLA* architecture to get all the benefits for next year's releases (if not this year's upcoming releases) which will bring down their costs.

You can be sure the frontier labs all have similar approaches, but they just don't talk about them. That's why eg Google Flash (the old versions!) were do cheap.

I mean Google published MTP a month or so ago and it has sped up Qwen models by 1.7 times.

If that is what they still publish you get an idea of what they aren't.

onionisafruit · 2026-05-24T16:37:20 1779640640

What’s the TLA architecture? I haven’t read about that.

borski · 2026-05-24T16:44:12 1779641052

MLA, not TLA: https://medium.com/data-science/deepseek-v3-explained-1-mult...

PearlRiver · 2026-05-24T22:06:09 1779660369

Look up the US deficit- they have been subsidizing everything since the 1980s.

toddmorey · 2026-05-24T16:20:35 1779639635

It feels like the US for years has operated under the assumption that homeostasis for the global economy would always be “designed in California, assembled in China.”

Like there was something in the American DNA that was lacking in China and innovation would always need to happen here.

But China it seems doesn’t need the US to produce great cars, devices, robotics, or AI. We absolutely need China to help us build all of the above.

fridder · 2026-05-24T16:58:33 1779641913

Might be more far to say: they needed the US until they caught up. The massive straight up IP theft helps a lot here. Though theft might be too strong since a lot of companies knew what they were getting in to

palata · 2026-05-24T20:21:46 1779654106

> The massive straight up IP theft helps a lot here

I think this is vastly underestimating what "catching up" means. All my life, people have been saying "China copies". Now they are objectively better at many things (including robotics), and... well it seems that we cannot "just copy".

I saw western companies trying to "copy" superior Chinese technology, talking to brilliant engineers explaining how much they were learning by actually trying to copy.

The lesson I got from that is that China did not "copy"; they learned. And it took time, and now they are better. Now the western world has to learn from them, I guess.

jubilanti · 2026-05-24T21:53:15 1779659595

Growing up moving around both conservative and liberal parts of the US, from middle school to college, I distinctly remember several US history classes where I was taught the exact same narrative about Samuel Slater. About how he was an American hero and the Father of the American Industrial Revolution because he memorized a bunch of industrial patent blueprints and brought them over to the US.

It got told as: the evil English made it illegal to even import blueprints for factory machinery, to keep the colonies in resource-extractive poverty, so they'd have to send raw materials overseas to get processed, then import the finished goods. (My other history teacher, the Anno / Dawn of Discovery video game series, also cemented this bit about resource extraction in my head at a young age.) But then thanks to heroic ingenuity and cunning, I was told, the US was able to outwit the colonizers and process its own raw materials, eventually gaining full economic, military, and political supremacy.

Sounds familiar.

aucisson_masque · 2026-05-24T21:36:40 1779658600

It's ip theft when the Chinese do it but when it's the American copying on Chinese it's called learning.

whatshisface · 2026-05-24T20:59:51 1779656391

Producing great products is a game at which every player wins, because sellers must find willing buyers. It only fails if one participant panics and jumps out of the window, or if a significant number of people are not participating (this is always the case when wealth inequality is involved).

Projectiboga · 2026-05-24T20:53:06 1779655986

China is out producing us at new scientists and engineers.

toddmorey · 2026-05-24T18:27:49 1779647269

Ok, not my favorite narrative, but assume asymmetric application of intellectual property rights was a big factor. Wouldn't the US exploiting asymmetric labor wages, rights, and conditions be the even bigger story? It still feels like a short-sighted own goal. The US abandoned its ability to manufacture. Maybe dark factories and robotics can bring it back, but manufacturing supply chains are just so much more advanced in Asia than in the US.

smallmancontrov · 2026-05-24T22:43:03 1779662583

> Wouldn't the US exploiting asymmetric labor wages, rights, and conditions be the even bigger story?

Yes, but "the US" is reductive. The exploitation wasn't done by the towns having their tentpole industries shipped overseas, it was done by the people shipping them overseas and pocketing the profit. US capital owners made a deal with the Chinese Communist Party that was good for both of them and bad for the US.

airstrike · 2026-05-25T00:06:33 1779667593

That's really well said.

The promise was always to get cheaper goods and services in the US, so long as the Chinese firms never competed. Guess what, they compete now.

swasheck · 2026-05-24T20:15:57 1779653757

IP theft may only be part of the story though. it’s a question of priorities. US optimizes for profit which can place limits reinvestment. China seems to optimize for ubiquity and dominance, and has the capital to throw at those goals. when you’re beholden to the shareholder/ceo/investor, you make concessions to stay within their will. when you’re beholden to the state, you do the same.

gmerc · 2026-05-24T21:20:04 1779657604

Talking about IP theft with a straight face in context of AI. lol. Not that kind of IP theft, that doesn’t count.

xbmcuser · 2026-05-24T22:11:58 1779660718

Lol it was not ip theft it was American and European companies building factories in China themselves teaching them how to manufacture use their cheap labour. Well they learned and as they were the dong the manufacturing got better at it. I believe the current aerospace industry which the US leads in is also result of IP theft from the British then out innovating them.

anigbrowl · 2026-05-24T18:10:59 1779646259

Wait until you hear about the history of US industrialization. This trope of 'they stole our ideas' needs to fade away, it's a coping mechanism based on the assumption of inherent superiority of American society rather than the natural wax and wane of civilizations due to varying structural factors.

lejalv · 2026-05-24T20:11:49 1779653509

This so much. You can also read up about when Germany sent industrial spies to Great Britain. And the first documented case of industrial spionage was against... China.

It plays this way: you're behind, you ignore IP rules. You're ahead: you create them to defend your newly-gained status.

Also please no moralizing here on IP when the entire OpenAI/Anthropic playbook has been "massive straight up IP theft". The irony.

dangus · 2026-05-24T18:14:50 1779646490

At some point we can’t keep blaming IP theft for obvious innovation and investments being made by China.

We also can’t blame subsidy. All countries subsidize their industries.

This video on the auto industry covers a different industry but has a lot of the same rhymes as far as China’s strategy:

https://youtube.com/watch?v=UhhZu0ZHdw4

The gist of it is that China does the following:

1. Treats low margin industries like mining and utilities as areas to focus investment and come up with incremental improvements, making those available to all companies. The West, by contrast, allows private companies to handle those industries, who logically don’t bother investing in them since their investors consider those basic industries to be low-value segments of the production chain. But now we see those advantages in China where investments have been made (e.g., the best battery chemistries and mining/refining, the cheapest power (when was the last time your local utility company focused on reducing pricing?)).

2. Because all companies in China have access to the same excellent infrastructure, they must compete furiously on quality/features/price of their products.

3. China allows foreign competition so long as they operate in China (see: Tesla) further insisting that their domestic products be globally competitive and that foreign products sold in their country benefit their local ecosystem.

FpUser · 2026-05-24T21:20:02 1779657602

>"IP theft"

Can we stop this crying baby already. Every country has stolen from the other. Did you really expect countries to settle on sewing closes and ship all profits to foreign companies for eternity? The IP is just an artificial concept that participants follow for so long as it benefits all parties.

api · 2026-05-24T18:22:07 1779646927

The US committed massive IP theft in the 19th century when we industrialized.

ceejayoz · 2026-05-24T18:41:52 1779648112

As did the big AI providers.

falcor84 · 2026-05-24T19:50:51 1779652251

I would appreciate some reading pointers about this.

nl · 2026-05-24T23:39:43 1779665983

> Samuel Slater ... known as the "Father of the American Industrial Revolution", a phrase coined by Andrew Jackson, and the "Father of the American Factory System". In the United Kingdom, he was called "Slater the Traitor" and "Sam the Slate" because he brought British textile technology to the United States, modifying it for American use.

> He learned of the American interest in developing similar machines, and he was also aware of British law against exporting the designs. He memorized as much as he could, and departed for New York City in 1789. Some people of Belper called him "Slater the Traitor", as they considered his move a betrayal of the town where many earned their living at Strutt's mills

https://en.wikipedia.org/wiki/Samuel_Slater#Early_life_and_e...

victorbjorklund · 2026-05-24T20:27:12 1779654432

dig into https://en.wikipedia.org/wiki/Håkan_Lans for example

falcor84 · 2026-05-24T22:22:47 1779661367

What? How is someone born in 1947 relevant to ip theft in the 19th century?

imjonse · 2026-05-24T20:05:23 1779653123

'usa ip theft 19th century' in your fav search engine

falcor84 · 2026-05-24T23:33:05 1779665585

Well, I did, and to save others the time, the most relevant resource I found appears to be the book "Smuggler Nation: How Illicit Trade Made America” (2013) by Peter Andreas

gedy · 2026-05-24T22:40:36 1779662436

Sure but I think what people are actually concerned with today is China copying a product and dumping cheaply back in the country it was taken from. That scale and speed is not what was happening in the 19th century.

I personally have little issue with countries doing that for domestic use (I hate using term "IP theft"), but to re-export so quickly you can't run a viable business in your own country is not fine.

HerbManic · 2026-05-24T23:49:18 1779666558

The one major area they are still behind in is CPU tech, but they are hungry and thus moving quick.

Looking at Loongsons processors for instance. About 15 years ago they coudl barely compete with a Pentium 2. Now they are about 4-5 years behind Intel/AMD. Further behind on some more specific work loads (SSL decoding for example) Not great but that is a decent jump. The jumps between generations are pretty decent.

LA446 was a decent enough processor core but had an awful memory controller that held it back as soon as it needed to reach outside of cache. As such it was SLOW.

But they learned the lesson and now the LA664 almost entirely fixed that issue. I think a big part of performance issues is that they are working domestic 5 to 7nm processes, so a good 5-7 years behind.

They are launching the LA864 later this year and are touting some decent performance gains. That is just marketing so far but something to keep an eye on.

Considering that these chips are using their own ISA, own designs, domestic manufacturing and they aren't terrible is a big thing.

I suspect in the next 5 years they have the chance of completely closing the gap. But it can also go the other way that they end up stalling as smaller nodes get much more difficult to attain.

TedDoesntTalk · 2026-05-25T00:48:35 1779670115

How much does corporate espionage help them?

encrypted_bird · 2026-05-24T21:05:54 1779656754

> Like there was something in the American DNA that was lacking in China

In most Americans' eyes, unfortunately, there was. It was just known by the name "American Exceptionalism". Yes, it's nonsense, but unfortunately it is nonsense that has historically been used by most empires throughout history, and believed just as fervently by said empires' populi since it's one of the central elements of imperialism as a whole.

TurdF3rguson · 2026-05-24T21:47:20 1779659240

The US models are still better though, let's not get carried away. Ours are better, theirs are cheaper. That's how it's always been.

dzonga · 2026-05-24T23:38:24 1779665904

people might not wanna admit it because it feels politically incorrect - but that belief is massively due the idea of "western (white) supremacy".

cz if you're smart & pragmatic - then you will know innovation can come from anywhere - but western elites choose to continually bury their heads in the sand.

roncesvalles · 2026-05-24T19:14:47 1779650087

>Like there was something in the American DNA that was lacking in China and innovation would always need to happen here.

There is (was): attracting the best minds around the world to a free and stable society. Trump voters threw it all away because they couldn't stand non-whites coming to America and doing better than old stock Americans.

DaSHacka · 2026-05-24T21:38:51 1779658731

> attracting the best minds around the world to a free and stable society.

China is comprised of ~91.5% ethnically Chinese citizens. [0]

> Tump voters threw it all away because they couldn't stand non-whites coming to America and doing better than old stock Americans.

The U.S. is more diverse than it's ever been [1], and under Trump we're still below the deportations of Obama's terms.

Sounds like open-borders immigration was never necessary in the first place, given that we're being beat by a country with a similar demographic skew that we had like 80 years ago. Coincidentally, when we arguably had our best economic opportunities for citizens. Who'da thunk.

Clearly, the only solution to our fading relevance is opening the border again and importing 500 million more ""doctors and engineers"" all the while China is investing in their *actual* doctors and engineers, and has extremely strict immigration policies [2].

[0] https://en.wikipedia.org/wiki/List_of_ethnic_groups_in_China

[1] https://en.wikipedia.org/wiki/Historical_racial_and_ethnic_d...

[2] https://en.wikipedia.org/wiki/China#Population_policies

roncesvalles · 2026-05-24T21:44:54 1779659094

You're conflating Mexican border hoppers with skilled immigrants.

I'm absolutely opposed to illegal immigration and have a more extreme position on how to deal with it than most Americans.

What I'm irked by are Trump's attacks on legal immigration and the general worsening of the environment. ICE's kidnappings, the 100k H-1B fee, and the recent Green Card thing have deeply eroded America's attractiveness to legal immigrants.

I think when MAGA came after H-1Bs, it became pretty clear that it's not about law and order, it's just a race thing.

And if you want to go gloves off, I'll just say it: the main problem in America is that its 3 major ethnic groups are infected by anti-intellectualism and slothfulness, whereas the Chinese and various other cultures are not. The direct benefit from skilled immigration is so that we can increase the ratio of people who actually value education and hard work vs the failing old stock Americans whose broccoli-headed kids dream of becoming YouTube influencers instead of astronauts.

avadodin · 2026-05-24T22:38:36 1779662316

That's such a gross misrepresentation of reality.

First of all, the only group of immigrants targeted by the admin are those critical of certain middle eastern regime.

Republican racists mainly care about the immigrants that do not take their middle-class jobs anyways.

Anti-Indian hate is restricted to a minority of software engineers and anti-Chinese hate is virtually non-existent.

I do believe it is idiotic to have your universities full of Chinese, your manufacturing in China and, at the same time, treat China as a geopolitical enemy.

delfinom · 2026-05-24T16:51:54 1779641514

Propaganda. We americans ate that shit up.

There's nothing special about anything we design in the US other than time and money commitment to create it. China did have some espionage of course going on, but the vast majority of shit isn't some secret. And with the US shitting on China with restrictions, we increasingly caused them to invest time and money into things they otherwise would have passively accepted as coming from the west. ASML sees the writing on the wall for themselves in particular.

jdcasale · 2026-05-24T18:28:17 1779647297

It's both.

The US has generally resorted to propaganda rather than addressing the self-inflicted structural conditions responsible for the erosion of our dominance. China also conducted a broad, sustained, large-scale campaign of IP theft across almost every industry.

Obviously there is no natural law preventing China from innovating (We have treated political liberalism as a prerequisite to innovation in a way that was always partly self-congratulatory), but it's also obviously true that the speed of the gap closure is due in significant part to theft.

That doesn't change the fact that they are now a legitimate competitor who has gotten a lot of things right (and among these, some things that we get very wrong) and probably actually leads in some areas.

infecto · 2026-05-24T18:57:42 1779649062

I like this take a lot and agree with it. The US for too long has been asleep at the wheel on many areas, power generation one of them. China with no doubt has conducted very deep and sustained espionage campaigns and even with LLMs there is enough evidence that most of the initial gains was training off of western models. Again no complaints here but I think it’s important to acknowledge both which can be true at the same time.

FpUser · 2026-05-24T21:25:00 1779657900

>"Again no complaints here but I think it’s important to acknowledge both which can be true at the same time."

and this acknowledgement will pay your bills

sandworm101 · 2026-05-24T19:01:29 1779649289

As john oliver said on conan many years ago: "an inflatable barbecue!".

China can certainly design an inflatable barbecue. China can certainly biuld an inlfatable barbecue. But will the chinese people ever want and buy an inflatable barbecue? ... never. That is why the US will remain the premier consumer economy.

nl · 2026-05-24T23:46:40 1779666400

The US is the richest consumer market in the world.

And yet BYD is likely to outsell Ford worldwide this year (despite being banned in the US)

https://en.wikipedia.org/wiki/List_of_automotive_manufacture...

gmerc · 2026-05-24T21:18:30 1779657510

Remember kids, in the west it’s “investment”, in China “subsidy”

bcrosby95 · 2026-05-24T16:23:02 1779639782

Put another way: if the average US citizen doesn't subsidize the costs of these trillion dollar companies, China is gonna come get you. Funny that you talk about being afraid of your own shadow.

I have some exposure to utility regulation and from what I can tell some of the AI companies are "good actors" and willing to shoulder some of the burden. But others are pretty adversarial and want a free lunch.

bryanlarsen · 2026-05-24T16:32:21 1779640341

Power is foundational to pretty much everything. Cheap power is going to give China a massive advantage in everything; AI is just incidental.

seviu · 2026-05-24T18:09:25 1779646165

Cheap power at what cost for our planet?

Not long ago we were crying death to bitcoin, it’s going to destroy the planet.

Come AI, with unlimited power demand. Everybody screaming we need more power.

We need infrastructure, clean energy, even nuclear. We are doing all in the wrong order.

FabCH · 2026-05-24T18:28:18 1779647298

China added 315GW of solar in 2025.

For context, EU added 65 and US 43.

In one year, China _added_ almost the total capacity EU has.

China is the one place where AI actually can use clean energy…

tedd4u · 2026-05-24T20:15:35 1779653735

Possibly France, too.

- 70% nuclear

- 26% renewables

- 4% gas/coal

usrnm · 2026-05-24T21:47:20 1779659240

France cannot really add more of it. Not fast and cheap enough, anyway

DennisP · 2026-05-24T21:38:20 1779658700

China also has 1,271 GW of coal capacity, and is planning 500GW more.

bryanlarsen · 2026-05-24T21:50:58 1779659458

And their coal capacity factor (ie the percentage of time they use their coal) is dropping at about the same rate.

lejalv · 2026-05-24T20:16:15 1779653775

...and China manufactured almost the totality of the EU and US solar capacity.

bix6 · 2026-05-24T18:25:53 1779647153

China is the leader in solar?

seviu · 2026-05-24T21:14:28 1779657268

For a while already

margalabargala · 2026-05-24T16:31:22 1779640282

> Put another way: if the average US citizen doesn't subsidize the costs of these trillion dollar companies, China is gonna come get you.

The future is blatantly going to be electric. Between cars, heat pumps, ranges, etc, the quantity of kilowatt hours consumed will rise dramatically per capita because they are replacing burned fossil fuels.

We don't need to subsidize the trillion dollar companies, we can settle for just not cancelling wind and solar projects, and generally updating the grid infrastructure.

A rising tide lifts all boats. If the subsidies go to common infrastructure, that's good for everyone. There's no need to complain about a road being paved because it will benefit FedEx in addition to everyone else.

jm_l · 2026-05-24T16:44:32 1779641072

All public infrastructure benefits the public but the role of our governance is to correctly prioritize. $100 billion spent on nuclear power plants is $100 billion being withheld from other critical social services.

manyatoms · 2026-05-24T18:06:06 1779645966

The US could very causally spend a couple $100B less on their military and not have a real reduction in capability.

margalabargala · 2026-05-24T16:50:43 1779641443

> $100 billion spent on nuclear power plants is $100 billion being withheld from other critical social services.

What? No it isn't.

There are many places the government could use to appropriate funds, not just social services. The military, for example. Other subsidies. Tax credits. Simply increasing the debt.

coliveira · 2026-05-24T18:11:34 1779646294

No, the money is not coming from a fixed box. When the US wants to do something (typically starting a new war), they never ask where the money is coming from. This tells you everything about how the decisions are made, if it is a priority for them, they will spend the money first and ask questions later. If green infrastructure was a real priority they would invest the money and later find ways to pay for it.

lobocinza · 2026-05-24T23:13:35 1779664415

But those wars are typically fought to maintain the US status, to preserve its ability to debase the national currency effectively siphoning wealth from the world economy. Self-preservations comes first. I'm just describing the system.

airstrike · 2026-05-25T00:08:28 1779667708

Which of the wars in Iraq, Afghanistan, and now Iran were fought to maintain US status?

skeledrew · 2026-05-24T17:38:10 1779644290

> not cancelling wind and solar projects

Tell it to the guy doing just that, as much as possible.

swasheck · 2026-05-24T20:21:58 1779654118

windmills cause cancer and kill bald eagles so we can’t do wind. /s

Aboutplants · 2026-05-24T16:03:19 1779638599

I believe you are right. These models are at worst a 6 month lag to the costly frontier models, but the ability to scale energy production is years ahead of where the US is. That advantage is often under appreciated

Their cost of energy is what matters vs the US as much as speed buildout.

mxschumacher · 2026-05-24T19:19:33 1779650373

I'm still not entirely clear on the problem <-> capability matching. E.g. it seems like Kimi K2.6 with good context would already be able to solve a huge chunk of problems. What share of prompts require frontier models?

energy123 · 2026-05-24T16:26:57 1779640017

It's not really a bottleneck. US capital is building data centers in South Asia, MENA and SEA. Many of these countries offer tax breaks because they want US data centers, and they have abundant equatorial land for solar.

You might say that US would prefer sovereignty but that's a separate argument vis-a-vis strategic competition with China in particular.

delfinom · 2026-05-24T16:53:13 1779641593

Wonder if they are finally exploring installing anti air defenses on these datacenters given they are massively expensive and devastating targets of extreme opportunities.

epolanski · 2026-05-24T23:09:19 1779664159

> China is building for the future, while Western Democracies are afraid of the future, and of their own shadow.

Yes, countries where compromise is not required, where social, capital and human costs are non-factors and where regulations are bendable at will by who's in power can be more effective at achieving some goals.

dominotw · 2026-05-24T23:02:06 1779663726

> China is building for the future, while Western Democracies are afraid of the future

who are the decision makers in china?

dartharva · 2026-05-24T20:27:11 1779654431

> while Western Democracies are afraid of the future, and of their own shadow.

Trillions of Dollars being invested against AI infra would indicate otherwise. US is in fact betting a lot of its economic future on AI.

bdangubic · 2026-05-24T23:28:29 1779665309

yup - good read: https://www.thebignewsletter.com/p/the-efficiency-moat-why-c...

lenerdenator · 2026-05-24T16:32:56 1779640376

> while Western Democracies are afraid of the future, and of their own shadow.

Well, yeah. This is a technology that has the potential to make large chunks of the population unemployed.

Chunks of the population that took on debts prior to late 2022 with the understanding that there would be a way to pay those debts back with their labor.

blowscum · 2026-05-24T16:59:12 1779641952

> Chunks of the population that took on debts prior to late 2022 with the understanding that there would be a way to pay those debts back with their labor.

I’m calling it now, the future is indentured servitude.

themafia · 2026-05-24T18:28:25 1779647305

> then it will no longer require subsidy from them

Is there actually a huge Chinese consumer market for these products? If not then I'm not sure how you ever actually achieve this endpoint. Chinese wages and American wages are not nearly the same thing yet.

> It will simply be absolutely cheaper (including profit margin) to serve tokens in China.

It will simply create more pollution and environmental destruction too.

> China is building for the future

That's the plan. Whether that's true requires an honest analysis.

> while Western Democracies are afraid of the future

Developed nations take fewer risks than undeveloped ones. Do you assume this pitched dichotomy will naturally sustain itself?

> and of their own shadow.

Yea, it's funny what having open and fair elections can do for a country.

lejalv · 2026-05-24T20:19:32 1779653972

You got me with fair. Gerrymandering, PACs, two-party system, electoral college.

Where do we start...

themafia · 2026-05-24T20:37:20 1779655040

We start logically. Do you presume your handful of cases exemplify the entire Democratic system? Do you assume that "China" is best understood as a single centralized entity?

You completely walked past the argument to pick at a meaningless nit.

lejalv · 2026-05-24T20:57:23 1779656243

Handing out lessons in democracy from the record-holder country in foreign intervention (https://en.wikipedia.org/wiki/United_States_involvement_in_r...) had equal civil rights only in the 1960s, pardoned the perpetrators of Jan 6, has its supreme court in entirely political hands, and has the awesomest repressive force in the world, together with the incarcerated population to go with.

Maybe I picked like 4 meaningless nits as in: US politicians respect so much democracy that they constantly reweight "one person, one vote" to suit the interest of the incumbent, they do not have their outrageously expensive campaigns financed (legally) by private interest groups, the popular vote is represented, and elections are uncontested (unless the wrong candidate wins, where the Supreme Court promptly fixes the issue), and it has room for more than two (quite similar I may say) viewpoints in representation.

Maybe.

But please don't call “Yea, it's funny what having open and fair elections can do for a country.” an argument.

themafia · 2026-05-24T22:35:51 1779662151

Please don't take one sentence out of a larger context and pretend it represents the argument.

Which, again, you've managed to completely ignore.

The argument, ironically in black and white, so you can sense it, "this isn't a black and white scenario and seeing it as China vs USA blinds you to the complex differences and global geopolitical forces involved."

I get that you don't personally like America, for whatever reason, but you've blinded yourself to sense in your rush to convey your rather negative and absolutely common sensibilities.

readthenotes1 · 2026-05-24T16:38:59 1779640739

"China is building for the future, "

Meanwhile, the USA is paying for its past excesses, with interest on its debt being the number two most expensive line item in the budget.

https://fiscaldata.treasury.gov/americas-finance-guide/feder...

quantum_state · 2026-05-24T23:33:20 1779665600

In the last 40 years, China has been building while the US has been wasting money and lives fighting wars. Can we learn to really put America first for once?

DennisP · 2026-05-24T21:46:44 1779659204

If you look at total debt instead of just national government debt, then China is even worse off than the US.

Article in Fortune: https://archive.is/53Vu0

delfinom · 2026-05-24T16:54:52 1779641692

Yea, I really don't see how much longer the US economy can hold on. The baby boomers are working overtime to rob multiple future generations of opportunity to feed their profits now.

The formerly "fiscal conservatives" that I know are working overtime explaining how the debt isn't a bad thing and we can just move numbers.

xienze · 2026-05-24T19:38:35 1779651515

> The formerly "fiscal conservatives" that I know are working overtime explaining how the debt isn't a bad thing and we can just move numbers.

Sounds like they're just catching up to what Democrats always used to say whenever a Democrat was in the White House and some Republican would complain about the national debt. "A government isn't a household, debt doesn't work the same way, you don't get it."

59nadir · 2026-05-24T21:07:32 1779656852

That's interesting, because I thought it was common knowledge that Republican presidents actually add more on average to national debt...?

zrtac · 2026-05-24T16:08:39 1779638919

That is the talking point of OpenAI and a16z's super PAC:

https://www.wired.com/story/super-pac-backed-by-openai-and-p...

"Build American AI, a nonprofit linked to a super PAC bankrolled by executives at OpenAI and Andreessen Horowitz, is funding a campaign to spread pro-AI messaging and stoke fears about China."

In reality Xi has warned of AI bubbles. If China was really pushing it they'd be equal or ahead because so many researchers are Chinese anyway. Instead, China is building real stuff instead of focusing on hot air like a16z ("crypto", "AI", you name it). Maybe China should sponsor that PAC to accelerate the demise of the West.

aurareturn · 2026-05-24T16:20:00 1779639600

They wouldn’t be ahead because they can’t buy Nvidia compute racks anymore and they don’t have EUV machines.

Blackwell is 10-20x more efficient than H200. Vera Rubin is expected to be several times more efficient than Blackwell.

The US has way more compute installed in Gigawatts because China can’t get enough chips. https://epoch.ai/blog/trends-in-ai-supercomputers

I do wonder how most Chinese employees at OpenAI and Anthropic feel about their employer constantly spreading anti China propaganda to decrease competition. Perhaps money solves almost all things so they go along with it.

coliveira · 2026-05-24T18:18:33 1779646713

This is the next phase of the OpenAI deception: give us as much money as we want or you'll be labeled anti-US and pro-China (guaranteed by the propaganda arm of openAI).

watwut · 2026-05-24T19:59:54 1779652794

American companies are selling tokens on a loss for years now. Where is that alternative universe in which America is not subsidizing this?

Selling under price to capture market was American playbook for last 20 or more years.

ufish235 · 2026-05-24T15:58:53 1779638333

What the fuck are you talking about - have you seen what data centres are doing in the West? Do you want more of that?

infecto · 2026-05-24T16:02:22 1779638542

I have not fully seen or appreciated most of the negativity. Obviously there are exceptions to that but in my eyes it has largely exposed how vulnerable the west is due to poor infrastructure constructs and a lack of building out generation and transmission.

arjie · 2026-05-24T16:25:41 1779639941

To be honest, I’m sort of annoyed that the datacenter around the corner from my home closed. It was a five minute walk on 3rd street and I know of it because we used to have so many cages there 15 years ago. Now I have to drive to Fremont.

Nifty3929 · 2026-05-24T15:59:53 1779638393

Yes, and yes!

bryanlarsen · 2026-05-24T16:06:16 1779638776

Yes, I want cheap clean power.

stavros · 2026-05-24T19:01:02 1779649262

What are data centers doing? I'd never heard of anybody having had a problem with them until about two months ago.

stuaxo · 2026-05-24T16:06:21 1779638781

Nope.

We have exported production to China in many things, we forget that we had dark satanic mills of our own.

revolvingthrow · 2026-05-24T15:46:02 1779637562

Amusing that just when the big three AI providers from US raise prices significantly, even for the mini models, you’ve got a Chinese model slashing their already-cheap offer by 75%. Not to mention you can run this model on your own hardware, although admittedly even the flash stretches the meaning of local for individual people.

skybrian · 2026-05-24T16:15:03 1779639303

My guess is that the popular US providers get a lot more traffic and are supply-limited. No point in lowering prices unless you can serve the traffic that will result.

elcritch · 2026-05-24T23:25:41 1779665141

Yesterday I did some testing on the cost to solve the same simple problem on openrouter with different models using cline. Simple problem but it had a few nuances to solve it properly and so required reasoning.

After reading comments like this I was expecting (hoping?) that DeepSeek or similar would be cheaper.

However I was surprised that DeepSeek v4 cost about 5.5x GPT-5.4 to solve the problem.

- Deepseek-v4-pro-medium cost $2.47 - GPT-5.4-medium cost $0.45 - GPT-5.5-low was $0.86

VulgarExigency · 2026-05-24T23:39:45 1779665985

That doesn't sound right. Were you using the actual Deepseek provider? The one time I spent 3 dollars on Deepseek in a day, I had 615k output tokens, 96M cache hit input tokens, and 5M cache miss output tokens.

HDBaseT · 2026-05-25T00:41:03 1779669663

Yeah, I struggle to use more than a few dollars a day using Deepseek V4 Pro (max reasoning).

* Some people suggest not using max reasoning due to overthinking and looping issues, this may consume more tokens than needed.

Aurornis · 2026-05-24T20:06:15 1779653175

Nothing weird about it. It’s all supply and demand.

The US providers are at capacity limits and are increasing pricing as demand increases.

The Chinese providers are relatively unknown and not even allowed for a lot of applications. They have to cut the price just to be attractive.

arbuge · 2026-05-24T22:47:32 1779662852

Can they actually make money at these prices?

gmerc · 2026-05-24T21:21:45 1779657705

IPO metrics juicing is a bitch

Lwerewolf · 2026-05-24T16:40:28 1779640828

Given that you can run quantized flash on 128g ram, and there's a heavy focus around it (DS4)... I'd say that it's pretty feasible for a decent amount of devs. Never thought I'd buy an MBP but here we are.

n.b. I can't use nonlocal models for a big chunk of my work, so there's that as well.

MattDamonSpace · 2026-05-24T18:49:36 1779648576

Capitalist competition at its finest

bwfan123 · 2026-05-24T15:18:33 1779635913

Kudos to the DeepSeek folks for making tokens not only affordable but also open source. This is a race to the bottom for token costs in a good way.

tomaskafka · 2026-05-24T22:20:47 1779661247

Open weights aren’t open source. Source is the learning data and algorithms, and that is closed.

azinman2 · 2026-05-24T23:47:48 1779666468

And this is purely a way to undercut American models. If/once they’re ahead, it’ll stop being the case. Already qwen is doing that.

HDBaseT · 2026-05-25T00:42:46 1779669766

I'm not entirely sold on this idea, open source models aren't really hurting Deepseek or Qwens bottom line.

99.99% of people cannot run these models on their own hardware, they are forced to rent it from someone. That someone is almost always the big China players themselves anyways.

alyxya · 2026-05-22T17:50:46 1779472246

Once they have their own coding agent which they seem to be working towards, I may start predominantly using their models. They seem to be doing all the "right" things, open sourcing models, publishing research, and keeping prices low for everyone.

ammar_x · 2026-05-22T18:25:23 1779474323

You can use V4 Pro with Claude Code [1].

I tried it and it's impressive.

[1]: https://api-docs.deepseek.com/quick_start/agent_integrations...

KronisLV · 2026-05-22T20:11:56 1779480716

I'm working on a custom launcher for hooking up Claude Code with various providers (groups env variables in profiles) cause DeepSeek doesn't have vision and sometimes I need browser use with screenshots or Opus reasoning, for other tasks it's fine: https://ccode.kronis.dev/

  # After installed (or when run portably with ./ccode)
  ccode init-config
  ccode edit-config
  
  # Run with default profile
  ccode
  # Run with named profile
  ccode --deepseek
  
  # Set default profile
  ccode set-default-profile deepseek

Also turns out that with a local proxy you can get Remote Control working and see the DeepSeek sessions in the desktop app, screenshots on the page. Other than that, I'm happy that it works pretty well and the discount is enough to make me consider going from Anthropic's Max subscription to Pro and using it only where DeepSeek is insufficient. With that proxy I eventually hope to be able to transparently switch models mid-task, if I need Opus for like 5 turns or something.

Overall though I'm not sure exactly how well Claude Code would stack up against OpenCode, since the latter overall feels a bit less hacky with 3rd party models and is even getting niche but nice features like a locally runnable web version: https://opencode.ai/docs/web/

BiraIgnacio · 2026-05-23T01:27:55 1779499675

I've been using V4 flash consistently with Claude. Pretty great fast and darn cheap. I use it about 3h/day and so far haven't crossed $1 USD/week.

FWIW, I this is what I have in my settings.json

  "env": {
    "ANTHROPIC_AUTH_TOKEN":"sk-nope_not_real",   
    "ANTHROPIC_BASE_URL": "https://api.deepseek.com/anthropic",
    "ANTHROPIC_MODEL": "deepseek-v4-flash",
    "ANTHROPIC_DEFAULT_OPUS_MODEL": "deepseek-v4-flash",
    "ANTHROPIC_DEFAULT_SONNET_MODEL": "deepseek-v4-flash",
    "ANTHROPIC_DEFAULT_HAIKU_MODEL": "deepseek-v4-flash",
    "CLAUDE_CODE_DISABLE_NONESSENTIAL_TRAFFIC": "1",
    "CLAUDE_CODE_EFFORT_LEVEL": "low",
    "CLAUDE_CODE_DISABLE_ADAPTIVE_THINKING": "1",
    "CLAUDE_CODE_DISABLE_THINKING": "0",
    "CLAUDE_CODE_ENABLE_AWAY_SUMMARY": "0",
    "CLAUDE_CODE_SUBAGENT_MODEL": "deepseek-v4-flash",
    "CLAUDE_CODE_MAX_OUTPUT_TOKENS": "8000",
    "CLAUDE_CODE_FILE_READ_MAX_OUTPUT_TOKENS": "4000",
    "BASH_MAX_OUTPUT_LENGTH": "20000",
    "CLAUDE_AUTOCOMPACT_PCT_OVERRIDE": "60",
    "CLAUDE_CODE_AUTO_COMPACT_WINDOW": "200000",
    "CLAUDE_CODE_DISABLE_GIT_INSTRUCTIONS": "1"
  }

oezi · 2026-05-23T07:16:26 1779520586

3h/day and how many parallel agents? 1/3/10?

I think out tokens would be a better metric.

hawtads · 2026-05-23T03:08:35 1779505715

Why not use higher thinking effort?

ed_mercer · 2026-05-23T03:37:31 1779507451

Hi, is it comparable to Opus?

chewz · 2026-05-23T09:33:01 1779528781

V4 Pro is between Sonnet and Opus. But it is cheap. Slow but very cheap. Very diligent.

I run a proxy that allows me switching back to Opus when necessary.

Deepseek isn't like Z.ai which is bit cheaper only on the surface. Or like Qwen 3.7 Max which is Opus-level but very expensive.

Deepseek is my favorite since V3 but V4 is definitely catch-up to newer Anthropic models

itsthecourier · 2026-05-23T09:34:14 1779528854

thank you so much for sharing ir

rjh29 · 2026-05-22T20:39:42 1779482382

How does the cost compare using the API vs the $20/month plans with other providers?

I did some back of the envelope calculations and it seems like you would pay $5/month using DeepSeek directly or $15-20 with OpenRouter or similar. But would be interested to hear real world usage.

0xbadcafebee · 2026-05-22T22:26:17 1779488777

It is still more expensive per-request than the common Anthropic and OpenAI subscriptions, but the math changes a lot based on your specific use case. https://codeberg.org/mutablecc/calculate-ai-cost/src/branch/...

But as usual, there are far cheaper subscriptions with higher limits than Anthropic and OpenAI, that also provide DeepSeek v4 Pro. So you should use those subscriptions first until you max them out, then look at a different subscription.

iammrpayments · 2026-05-23T07:08:36 1779520116

I don’t even use Claude that much and was hitting limits in the 20$ using sonnet, I’ve deposited 5$ with deepseek and haven’t hit the limit after spending 60million+ tokens. So no way it’s more expensive.

nchmy · 2026-05-24T06:29:12 1779604152

The link you shared is just a large table of data, which is hard to browse on a phone.

Could you please elaborate on the far cheaper subscriptions that we should be using?

stavros · 2026-05-22T22:52:21 1779490341

I've been using it pretty extensively over a month and I'm at maybe $7. It thinks for quite a while, but the results have been better than Sonnet for me.

maxdo · 2026-05-22T21:22:14 1779484934

I'm not curious what tasks you tested it for. Im working on coding agent writing code dynamically on request for customers. i'd say code itself very simple and aggressively cached, and patternalized, e.g. we adding lots of hints to the system.

the only real family models that work were claude and openai, surprisingly, for tasks that needs faster speed, gpt 5.4 is very impressive. Deep seek was very average , doing things somewhere in gemini flash 3.0 domain.

thisisit · 2026-05-22T19:08:14 1779476894

I am curious - Is there a way to switch between models depending on the task? Because I believe Deepseek V4 is not multimodal and it will be good to switch back to Claude if vision or other capabilities are required.

mewse-hn · 2026-05-22T20:34:16 1779482056

I was looking into something similar because I wanted to test a local model for doing basic coding and smart model (deepseek) for planning.

It's basically not possible with claude code, the api endpoint is a single environment variable and whatever models are on that endpoint are what's available.

HOWEVER, if you run a proxy like LiteLLM, you can configure it to send requests to different api endpoints on the back end and expose them as different "models" on the front end, then configure claude code to switch between those virtual models.

thisisit · 2026-05-22T21:16:04 1779484564

Found this: https://github.com/farion1231/cc-switch

It allows for switching models in Claude Code.

mewse-hn · 2026-05-22T21:21:05 1779484865

Right that says it has a proxy feature so it can probably do what I was describing with LiteLLM

mvanbaak · 2026-05-23T00:47:38 1779497258

Check out the project called superpowers. It can use different models for different agents. I use it witb opencode to have different models for reaearch, planning, execution, testing etc

longsword · 2026-05-22T23:08:58 1779491338

There is a tool called deepclaude, which runs a proxy in the background capable of doing this, by simply doing /model in Claude.

maxdo · 2026-05-22T21:25:36 1779485136

i've been trying that, in reality every time you try to save it, it's not worth it, the cost of mistake is so high , you can spent 2-3h on just wrong assumption, you lost your time and all the burned tokens.

firecall · 2026-05-23T02:04:24 1779501864

It seems you can use the Claude Code CLI harness without a Claude Pro subscription now, which I don't think you could a before?

I've been using Deepseek v4 with Cline in VS Code as a replacement for Github Copilot, and it's not been too bad.

jdasdf · 2026-05-24T21:55:35 1779659735

I'm my experience claude code is kind of shit.

Pi works very well with deepseek though

hbarka · 2026-05-22T21:45:54 1779486354

The npm install of Claude Code deprecated, since Feb 2026.

Scarbutt · 2026-05-22T18:35:59 1779474959

Surprised Anthropic hasn't done anything to restrict Claude Code from using other providers.

cortesoft · 2026-05-22T18:44:38 1779475478

At this point in the AI wars, it is probably better to have more users of Claude code rather than restrict which LLMs it can connect to. Claude code is probably (currently at least) stickier than the LLM model itself. Getting people into the Claude code ecosystem is worth it.

Later, they can always lock it down more or add Claude LLM only features to it.

wolttam · 2026-05-22T19:06:19 1779476779

The value of Claude Code the harness isn't that great. There's a lot of other good harnesses out there.

rane · 2026-05-22T19:47:04 1779479224

I thought so, and then I tried Opencode and Codex and started to appreciate Claude Code a lot more. They've actually done great work with the small details.

intuxikated · 2026-05-22T23:18:52 1779491932

I actually have't looked back since trying opencode The ability to properly see what the agent is doing in tool calls and subagents is really unmatched, CC strips all reasoning and return values, only displaying tool calls, and you're unable to expand a single subagent, it's expand everything and scroll endlessly or show everything collapsed with basically no info at all (read x files, ran x commands) Just seems like extremely basic features are missing

crooked-v · 2026-05-22T19:36:45 1779478605

And it gets dragged down by Anthropic actively injecting unhelpful things into prompts without telling users about them (https://github.com/anthropics/claude-code/issues/58262).

chandureddyvari · 2026-05-22T19:34:27 1779478467

What’s your favourite harness? Is there any benchmarks for harness like LLMs have for swe verified?

Mkengin · 2026-05-23T18:00:16 1779559216

There Seen to be more and more harness benchmarks out there, pretty interesting read:

https://neuralnoise.com/2026/harness-bench-wip/

wolttam · 2026-05-22T20:59:34 1779483574

You can check my profile for which one I like most :) I do think there have been efforts to benchmark different harnesses.

Personally I'm not going to choose one harness or another based on +/- a few percentage points in a benchmark. I'm going to use one the one that I find the most ergonomic, that isn't too bloated, etc. The models are the primary lever, not the harness.

koolba · 2026-05-22T19:14:26 1779477266

Good or better? Curious which would be in either bucket.

wolttam · 2026-05-22T19:19:24 1779477564

Probably a matter of taste. I prefer the harness I wrote, I don't want to go near Anthropic's bloated mess of a harness with a 10-meter pole.

odiroot · 2026-05-23T15:56:23 1779551783

IMHO the ergonomics of their tooling are not great. I'd rather use Codex or even OpenCode. Configuration alone is very arcane with lacking documentation. Sandboxing/permission system is quite confusing too.

HWR_14 · 2026-05-22T21:42:15 1779486135

It went the other way, you can't use other harnesses to connect to the cheaper versions of Claude. So clearly they think their current moat is Claude Code use, not the LLM itself.

wiradikusuma · 2026-05-22T19:13:07 1779477187

That's interesting. I thought Claude Code is not as good, therefore people want to use Claude model with other alternatives. This is the other way around.

Which begs the question, regardless of the model, which Claude Code alternative is better? (I keep saying "Claude Code alternative" because I don't know the term... LLM CLI?)

flexagoon · 2026-05-22T20:11:42 1779480702

AFAIK the two most popular open source harnesses right now are OpenCode and Pi. They take a pretty different approach, OpenCode includes a lot of features while Pi is very minimal by design and focused on extensibility, to the point where many people are just asking Pi to write a plugin for itself whenever they want it to have a new feature. I personally like Pi's philosophy more and I think its developer justified the choices really well in his blog post:

https://mariozechner.at/posts/2025-11-30-pi-coding-agent/#to... (the pi-coding-agent section)

rjh29 · 2026-05-22T20:44:41 1779482681

Author blocks referrals from HN, weirdly dramatic, especially considering they have 1086 karma here. I wonder what we did to them.

flexagoon · 2026-05-22T22:13:35 1779488015

Oh damn, I haven't noticed because my browser removes the referer header. But I think the image on the block page is a pretty good answer to why he did that.

SturgeonsLaw · 2026-05-23T01:38:20 1779500300

What's the image trying to convey? Genuine question, I just come here to read nerd stuff and I'm not aware of any controversy

flexagoon · 2026-05-23T02:26:36 1779503196

The image shows Garry Tan, the CEO of Y Combinator. He has lately been on a huge AI psychosis streak, bragging about things like "shipping 37000 lines of code every day" and "using Claude Code so much it burned out his USB-C power connectors". He's in a lobster suit because he's talking about OpenClaw, an AI agent assistant which those same AI psychosis types lean into too much by giving it full read-write access to all their life and then getting surprised when it accidentally deletes all of their emails.

Pi's developer is obviously not anti-AI, and he definitely doesn't hate OpenClaw, since it's based on Pi. But there's a growing number of people who take those things too far, and a lot of them are on HN. You can easily find them in the comments of any AI-related post here. I assume that's the type of people the image is portraying.

wrs · 2026-05-22T19:22:36 1779477756

The common term for a tool that wraps an LLM with a workflow is “harness”.

jijji · 2026-05-23T00:12:37 1779495157

I've seen good results with opencode connected to glm 5.1 on ollama cloud... for $20 a month you get similar performance that you get with opus 4.7

copperx · 2026-05-22T21:19:53 1779484793

I love oh-my-pi, but I'm not sure if it's "better". Maybe just as good.

g023 · 2026-05-22T21:38:06 1779485886

I use DeepSeek v4 flash with CoPilot and it works pretty good.

LaurensBER · 2026-05-22T19:27:14 1779478034

It works very well with OpenCode. My team keeps hitting the 5h limits on other subscriptions and it's pretty good to have Deepseek as a backup. I just put 50 bucks on there and it feels like it'll never run out.

It's not good enough to fully replace any of the frontier models yet but it's definitely great to have as a backup!

lambda · 2026-05-22T17:51:36 1779472296

Why do you need them to provide a coding agent? Just use their model with any off the shelf coding agent. I happen to prefer Pi, but use whatever works for you.

alyxya · 2026-05-22T18:15:43 1779473743

I probably have an unfounded assumption that whatever coding agent they make will work really well with their models, better than external harnesses. I don't have a good sense for how all the model + harness combinations compare, nor any good way to compare them myself, but generally believe model companies train their models to work best with their own harness.

wolttam · 2026-05-22T19:08:31 1779476911

I've noticed that models have gotten less finicky with this over time. Harnesses don't need to be complex to get good coding performance from models, they just need to implement some sane primitives for code exploration and editing.

wyre · 2026-05-22T22:08:28 1779487708

It is in the model's provider's interest for you to believe this because they get to lock you into their harness and inference. As models get better they will get better at using any harness, it comes down to how well the harness is actually engineered. I highly recommend you take an hour or two and check out Pi to either solidify or change your assumption. The harness is essentially just another developer tool and can be as opinionated, overly-engineered, minimal as anything else. I would think for DeepSeek, especially, they're efforts are much better spent researching how to make their LLM's better instead of working on engineering a harness that might get some marginal gain building it for their models.

Edit: here is a really good twitter thread about this exact topic: https://xcancel.com/kunchenguid/status/2057700714626105412

hootz · 2026-05-22T17:56:45 1779472605

Yeah, I'm using Pi with their models through an OpenCode Go subscription and it works pretty well. 10 bucks and V4-Flash is virtually infinite.

apitman · 2026-05-22T18:50:16 1779475816

What's the best way to use it with Pi, OpenRouter?

schaefer · 2026-05-22T20:23:18 1779481398

> What's the best way to use it with Pi, OpenRouter?

I can't claim it's "the best"...

But the Pi.dev and OpenRouter combo is what I'm doing at home, and I love it. Setup was easy, I can use /model to switch between any of the openrouter models and whatever I'm hosting locally via VLLM.

brianwawok · 2026-05-23T00:35:21 1779496521

Open router is a 5% tax? If you use it seriously may as well skip it

schaefer · 2026-05-24T03:44:30 1779594270

I don't have an LLM-positive culture at work. I'm on a bit of an island. Or under a rock.

Anyhow, I'm pulling myself up by my own bootstraps.

For me a 5% overhead is fine... if it gives me better visibility of this rapidly moving field.

lambda · 2026-05-22T20:20:32 1779481232

I only use local models myself personally. But yeah, OpenRouter would probably be a good option.

lofaszvanitt · 2026-05-22T22:56:50 1779490610

Qwen cli

satvikpendem · 2026-05-22T18:40:40 1779475240

RL with the harness inputs and outputs of users is one of the primary improvers of model performance, a self perpetuating flywheel.

smoe · 2026-05-22T20:27:41 1779481661

Earlier this week I started testing Chinese models on my codebase. I haven’t really looked at interactive coding yet, but more at issue triage, bug auto-fixing, log analytics, etc.

I used DeepSeek, Kimi, GLM, Qwen, and MiMO against GPT-5.5 high as reference, all running in Pi harness without anything installed.

So far, Kimi and MiMO look the most promising to me. I haven’t tested them rigorously enough to make a strong statement, but my first impression is that, in practice, all those models may be less behind on typical daily tasks than people think.

They are a bit “work hard, not smart". Getting to same-ish results more slowly and using more tokens, but at a fraction of the price

try-working · 2026-05-23T02:10:24 1779502224

I just did a little comparison using benchmarks for GPT 5.1 through 5.4 to map out the equivalent capability-level of some of the Chinese models.

Based on these benchmarks, here's a rough mapping:

- Qwen 3.7 ~= GPT 5.3

- Kimi K2.6 ~= GPT 5.15

- DS V4 ~= GPT 5.1

So yes, we have GPT 5 at home now. No need to pay the Legacy Labs anymore.

Here's the benchmark I used since I can't post images here: https://x.com/trydotworks/status/2058004995195490706?s=20

_under_scores_ · 2026-05-22T22:37:42 1779489462

I switched to predomentantly using mimo this week, mostly out of curiosity to see how dependant I was on frontier models. Honestly I cant really tell the difference. I would say I work on pretty average codebases with well know frameworks doing pretty typical things and initial impressions is that mimo, kimi and deepseek can probably handle what I need more or less the same as gpt5.5 or claude.

c0rruptbytes · 2026-05-22T20:58:37 1779483517

I personally really like DS4 Flash - it's the largest I can run locally with decent speeds and I feel like it's good enough to maintain a codebase with less effort

r0b05 · 2026-05-23T04:18:34 1779509914

What hardware and quant do you run it with?

maxdo · 2026-05-22T21:28:20 1779485300

maybe i need to give it second chance, surprisingly Kimi 2.6 consistently fail even to generate valid json plan, where gemma 4 was doing really good, but slow.

JSR_FDED · 2026-05-23T12:49:49 1779540589

Are you going through OpenRouter or direct? I’ve had nothing short of excellent results from Kimi.

azinman2 · 2026-05-24T23:48:23 1779666503

And not letting you opt out of being their training data.

jdboyd · 2026-05-23T02:02:07 1779501727

I would prefer a coding agent to be somewhat independent of the model provider. Providers are trading off on quality, features, and price so frequently, and I don't want to keep changing my agent every time.

I am looking forward to things slowing down and stabilizing. I'm not saying that should happen today, just I am looking forward to it.

gaolei8888 · 2026-05-23T02:42:45 1779504165

I think this will happen much sooner than we thought. Maybe it will happen in next 6 months

akritid · 2026-05-23T15:29:44 1779550184

You can take Codex today and ask it to rewrite itself to work with any API

hawtads · 2026-05-23T03:09:17 1779505757

There is OpenCode and Pi, they both work pretty well

tequila_shot · 2026-05-22T18:24:33 1779474273

You no longer need "their coding agent". You can hook up claude code to use Deepseek. Works perfectly.