‘Sputnik moment’: $1tn wiped off US stocks after Chinese firm unveils AI chatbot

Alas Poor Erinaceus@lemmy.ml · 9 months ago

‘Sputnik moment’: $1tn wiped off US stocks after Chinese firm unveils AI chatbot

toothbrush@lemmy.blahaj.zone · edit-2 9 months ago

One of those rare lucid moments by the stock market? Is this the market correction that everyone knew was coming, or is some famous techbro going to technobabble some more about AI overlords and they return to their fantasy values?

themoonisacheese@sh.itjust.works · 9 months ago

It’s quite lucid. The new thing uses a fraction of compute compared to the old thing for the same results, so Nvidia cards for example are going to be in way less demand. That being said Nvidia stock was way too high surfing on the AI hype for the last like 2 years, and despite it plunging it’s not even back to normal.

jacksilver@lemmy.world · 9 months ago

My understanding is it’s just an LLM (not multimodal) and the train time/cost looks the same for most of these.

DeepSeek ~$6million https://www.theregister.com/2025/01/26/deepseek_r1_ai_cot/?td=rt-3a
Llama 2 estimated ~$4-5 million https://www.visualcapitalist.com/training-costs-of-ai-models-over-time/

I feel like the world’s gone crazy, but OpenAI (and others) is pursing more complex model designs with multimodal. Those are going to be more expensive due to image/video/audio processing. Unless I’m missing something that would probably account for the cost difference in current vs previous iterations.

will_a113@lemmy.ml · 9 months ago

The thing is that R1 is being compared to gpt4 or in some cases gpt4o. That model cost OpenAI something like $80M to train, so saying it has roughly equivalent performance for an order of magnitude less cost is not for nothing. DeepSeek also says the model is much cheaper to run for inferencing as well, though I can’t find any figures on that.

jacksilver@lemmy.world · 9 months ago

My main point is that gpt4o and other models it’s being compared to are multimodal, R1 is only a LLM from what I can find.

Something trained on audio/pictures/videos/text is probably going to cost more than just text.

But maybe I’m missing something.

will_a113@lemmy.ml · 9 months ago

The original gpt4 is just an LLM though, not multimodal, and the training cost for that is still estimated to be over 10x R1’s if you believe the numbers. I think where R 1 is compared to 4o is in so-called reasoning, where you can see the chain of though or internal prompt paths that the model uses to (expensively) produce an output.

jacksilver@lemmy.world · edit-2 9 months ago

I’m not sure how good a source it is, but Wikipedia says it was multimodal and came out about two years ago - https://en.m.wikipedia.org/wiki/GPT-4. That being said.

The comparisons though are comparing the LLM benchmarks against gpt4o, so maybe a valid arguement for the LLM capabilites.

However, I think a lot of the more recent models are pursing architectures with the ability to act on their own like Claude’s computer use - https://docs.anthropic.com/en/docs/build-with-claude/computer-use, which DeepSeek R1 is not attempting.

Edit: and I think the real money will be in the more complex models focused on workflows automation.

WalnutLum@lemmy.ml · 9 months ago

Yea except DeepSeek released a combined Multimodal/generation model that has similar performance to contemporaries and a similar level of reduced training cost ~20 hours ago:

https://huggingface.co/deepseek-ai/Janus-Pro-7B

veroxii@aussie.zone · 9 months ago

Holy smoke balls. I wonder what else they have ready to release over the next few weeks. They might have a whole suite of things just waiting to strategically deploy

modulus@lemmy.ml · 9 months ago

One of the things you’re missing is the same techniques are applicable to multimodality. They’ve already released a multimodal model: https://seekingalpha.com/news/4398945-deepseek-releases-open-source-ai-multimodal-model-janus-pro-7b

CameronDev@programming.dev · 9 months ago

How is the “fraction of compute” being verified? Is the model available for independent analysis?

toothbrush@lemmy.blahaj.zone · 9 months ago

Its freely availible with a permissive license, but I dont think that that claim has been verified yet.

Zaktor@sopuli.xyz · 9 months ago

And the data is not available. Knowing the weights of a model doesn’t really tell us much about its training costs.

davel [he/him]@lemmy.ml · 9 months ago

If AI is cheaper, then we may use even more of it, and that would soak up at least some of the slack, though I have no idea how much.

scratsearcher 🔍🔮📊🎲@sopuli.xyz · 9 months ago

Most rational market: Sell off NVIDIA stock after Chinese company trains a model on NVIDIA cards.

Anyways NVIDIA still up 1900% since 2020 …

how fragile is this tower?

protist@mander.xyz · 9 months ago

Emergence of DeepSeek raises doubts about sustainability of western artificial intelligence boom

Is the “emergence of DeepSeek” really what raised doubts? Are we really sure there haven’t been lots of doubts raised previous to this? Doubts raised by intelligent people who know what they’re talking about?

floofloof@lemmy.ca · edit-2 9 months ago

Ah, but those “intelligent” people cannot be very intelligent if they are not billionaires. After all, the AI companies know exactly how to assess intelligence:

Microsoft and OpenAI have a very specific, internal definition of artificial general intelligence (AGI) based on the startup’s profits, according to a new report from The Information. … The two companies reportedly signed an agreement last year stating OpenAI has only achieved AGI when it develops AI systems that can generate at least $100 billion in profits. That’s far from the rigorous technical and philosophical definition of AGI many expect. (Source)

5in1k@lemm.ee · 9 months ago

The economy rests on a fucking chatbot. This future sucks.

Cowbee [he/they]@lemmy.ml · 9 months ago

On the brightside, the clear fragility and lack of direct connection to real productive forces shows the instability of the present system.

leftytighty@slrpnk.net · 9 months ago

And no matter how many protectionist measures that the US implements we’re seeing that they’re losing the global competition. I guess protectionism and oligarchy aren’t the best ways to accomplish the stated goals of a capitalist economy. How soon before China is leading in every industry?

Cowbee [he/they]@lemmy.ml · 9 months ago

This conclusion was foregone when China began to focus on developing the Productive Forces and the US took that for granted. Without a hard pivot, the US can’t even hope to catch up to the productive trajectory of China, and even if they do hard pivot, that doesn’t mean they even have a chance to in the first place.

In fact, protectionism has frequently backfired, and had other nations seeking inclusion into BRICS or more favorable relations with BRICS nations.

darvit@lemmy.darvit.nl · 9 months ago

Economy =/= stock market

Eatspancakes84@lemmy.world · 9 months ago

That’s the thing: if the cost of AI goes down , and AI is a valuable input to businesses that should be a good thing for the economy. To be sure, not for the tech sector that sells these models, but for all of the companies buying these services it should be great.

conartistpanda@lemmy.world · 9 months ago

Sure workers will reap a big chunk of that value right?

Valmond@lemmy.world · 9 months ago

Right?.jpg

doubtingtammy@lemmy.ml · 9 months ago

Only thanks to the PRC

skuzz@discuss.tchncs.de · 9 months ago

Almost like yet again the tech industry is run by lemming CEOs chasing the latest moss to eat.

Doomsider@lemmy.world · 9 months ago

Wow, China just fucked up the Techbros more than the Democratic or Republican party ever has or ever will. Well played.

TankovayaDiviziya@lemmy.world · 9 months ago

Well… if there is one thing I have to commend CCP is they are unafraid to crack down on billionaires after all.

kshade@lemmy.world · 9 months ago

It’s kinda funny. Their magical bullshitting machine scored higher on made up tests than our magical bullshitting machine, the economy is in shambles! It’s like someone losing a year’s wages in sports betting.

Naia@lemmy.blahaj.zone · 9 months ago

Just because people are misusing tech they know nothing about does not mean this isn’t an impressive feat.

If you know what you are doing, and enough to know when it gives you garbage, LLMs are really useful, but part of using them correctly is giving them grounding context outside of just blindly asking questions.

kshade@lemmy.world · 9 months ago

It is impressive, but the marketing around it has really, really gone off the deep end.

UnderpantsWeevil@lemmy.world · 9 months ago

Democrats and Republicans have been shoveling truckload after truckload of cash into a Potemkin Village of a technology stack for the last five years. A Chinese tech company just came in with a dirt cheap open-sourced alternative and I guarantee you the American firms will pile on to crib off the work.

Far from fucking them over, China just did the Americans’ homework for them. They just did it in a way that undercuts all the “Sam Altman is the Tech Messiah! He will bring about AI God!” holy roller nonsense that was propping up a handful of mega-firm inflated stock valuations.

Small and Mid-cap tech firms will flourish with these innovations. Microsoft will have to write the last $13B it sunk into OpenAI as a lose.

Valmond@lemmy.world · 9 months ago

Didn’t donald add like $500B for AI? Seems it’salmost enough to pay the -$600B nVidia lost…

MetalMachine@feddit.nl · 9 months ago

The best part is that it’s open source and available for download

Phoenicianpirate@lemm.ee · 9 months ago

So can I have a private version of it that doesn’t tell everyone about me and my questions?

SpaceRanger@lemmy.world · 9 months ago

Checkout ollama. https://ollama.com/library/deepseek-r1

Phoenicianpirate@lemm.ee · 9 months ago

Thank you very much. I did ask chatGPT was technical questions about some… subjects… but having something that is private AND can give me all the information I want/need is a godsend.

Goodbye, chatGPT! I barely used you, but that is a good thing.

λλλ@programming.dev · 9 months ago

Yep, lookup ollama

Mongostein@lemmy.ca · 9 months ago

Yeah, but you have to run a different model if you want accurate info about China.

Phoenicianpirate@lemm.ee · 9 months ago

Yeah but China isn’t my main concern right now. I got plenty of questions to ask and knowledge to seek and I would rather not be broadcasting that stuff to a bunch of busybody jackasses.

Mongostein@lemmy.ca · 9 months ago

I agree. I don’t know enough about all the different models, but surely there’s a model that’s not going to tell you “<whoever’s> government is so awesome” when asking about rainfall or some shit.

Alsephina@lemmy.ml · 9 months ago

Unfortunately it’s trained on the same US propaganda filled english data as any other LLM and spits those same talking points. The censors are easy to bypass too.

MetalMachine@feddit.nl · 9 months ago

tooclose104@lemmy.ca · 9 months ago

Can someone with the knowledge please answer this question?

TonyTonyChopper@mander.xyz · 9 months ago

Yes, you can run a downgraded version of it on your own pc.

tooclose104@lemmy.ca · 9 months ago

Apparently phone too! Like 3 cards down was another post linking to instructions on how to run it locally on a phone in a container app or termux. Really interesting. I may try it out in a vm on my server.

boomzilla@programming.dev · edit-2 9 months ago

I watched one video and read 2 pages of text. So take this with a mountain of salt. From that I gathered that deepseek R1 is the model you interact with when you use the app. The complexity of a model is expressed as the number of parameters (though I don’t know yet what those are) which dictate its hardware requirements. R1 contains 670 bn Parameter and requires very very beefy server hardware. A video said it would be 10th of GPUs. And it seems you want much of VRAM on you GPU(s) because that’s what AI crave. I’ve also read 1BN parameters require about 2GB of VRAM.

Got a 6 core intel, 1060 6 GB VRAM,16 GB RAM and Endeavour OS as a home server.

I just installed Ollama in about 1/2 an hour, using docker on above machine with no previous experience on neural nets or LLMs apart from chatting with ChatGPT. The installation contains the Open WebUI which seems better than the default you got at ChatGPT. I downloaded the qwen2.5:3bn model (see https://ollama.com/search) which contains 3 bn parameters. I was blown away by the result. It speaks multiple languages (including displaying e.g. hiragana), knows how much fingers a human has, can calculate, can write valid rust-code and explain it and it is much faster than what i get from free ChatGPT.

The WebUI offers a nice feedback form for every answer where you can give hints to the AI via text, 10 score rating thumbs up/down. I don’t know how it incooperates that feedback, though. The WebUI seems to support speech-to-text and vice versa. I’m eager to see if this docker setup even offers APIs.

I’ll probably won’t use the proprietary stuff anytime soon.

CeeBee_Eh@lemmy.world · 9 months ago

I asked it about Tiananmen Square, it told me it can’t answer that because it can only respond with “harmless” responses.

MetalMachine@feddit.nl · 9 months ago

Yes the online model has those filters. Some one tried it with one of the downloaded models and it answers just fine

Ascend910@lemmy.ml · 9 months ago

When running locally, it works just fine without filters

jaschen@lemm.ee · 9 months ago

I tried the smaller models and it’s not fine. It’s hard coded.

CeeBee_Eh@lemmy.world · 9 months ago

This was a local instance.

apprehensively_human@lemmy.ca · 9 months ago

Does the same thing on my local instance.

jaschen@lemm.ee · 9 months ago

You misspelled “lies”. Or were you trying to type “psyops tool”??

Valmond@lemmy.world · 9 months ago

Removed by mod

jaschen@lemm.ee · 9 months ago

Yes but your server can’t handle the biggest LLM.

labbbb2@thelemmy.club · 9 months ago

But Chinese…

vga@sopuli.xyz · edit-2 9 months ago

They’d need to do some pretty fucking advanced hackery to be able to do surveillance on you just via the model. Everything’s possible I guess, but … yeah perhaps not.

If they could do that, essentially nothing you do on your computer would be safe.

SocialMediaRefugee@lemmy.ml · 9 months ago

This just shows how speculative the whole AI obsession has been. Wildly unstable and subject to huge shifts since its value isn’t based on anything solid.

ByteJunk@lemmy.world · 9 months ago

It’s based on guessing what the actual worth of AI is going to be, so yeah, wildly speculative at this point because breakthroughs seem to be happening fairly quickly, and everyone is still figuring out what they can use it for.

There are many clear use cases that are solid, so AI is here to stay, that’s for certain. But how far can it go, and what will it require is what the market is gambling on.

If out of the blue comes a new model that delivers similar results on a fraction of the hardware, then it’s going to chop it down by a lot.

If someone finds another use case, for example a model with new capabilities, boom value goes up.

It’s a rollercoaster…

WoodScientist@sh.itjust.works · 9 months ago

There are many clear use cases that are solid, so AI is here to stay, that’s for certain. But how far can it go, and what will it require is what the market is gambling on.

I would disagree on that. There are a few niche uses, but OpenAI can’t even make a profit charging $200/month.

The uses seem pretty minimal as far as I’ve seen. Sure, AI has a lot of applications in terms of data processing, but the big generic LLMs propping up companies like OpenAI? Those seems to have no utility beyond slop generation.

Ultimately the market value of any work produced by a generic LLM is going to be zero.

NιƙƙιDιɱҽʂ@lemmy.world · 9 months ago

Language learning, code generatiom, brainstorming, summarizing. AI has a lot of uses. You’re just either not paying attention or are biased against it.

It’s not perfect, but it’s also a very new technology that’s constantly improving.

Toofpic@feddit.dk · 9 months ago

I decided to close the post now - there is place for any opinion, but I can see people writing things which are completely false however you look at them: you can dislike Sam Altman (I do), you can worry about China’s interest in entering the competition now and like that (I do), but the comments about LLM being useless while millions of people use it daily for multiple purposes sound just like lobbying.

UndercoverUlrikHD@programming.dev · 9 months ago

It’s difficult to take your comment serious when it’s clear that all you’re saying seems to based on ideological reasons rather than real ones.

Besides that, a lot of the value is derived from the market trying to figure out if/what company will develop AGI. Whatever company manages to achieve it will easily become the most valuable company in the world, so people fomo into any AI company that seems promising.

Jhex@lemmy.world · 9 months ago

Besides that, a lot of the value is derived from the market trying to figure out if/what company will develop AGI. Whatever company manages to achieve it will easily become the most valuable company in the world, so people fomo into any AI company that seems promising.

There is zero reason to think the current slop generating technoparrots will ever lead into AGI. That premise is entirely made up to fuel the current “AI” bubble

UndercoverUlrikHD@programming.dev · 9 months ago

The market don’t care what either of us think, investors will do what investors do, speculate.

Leg@sh.itjust.works · 9 months ago

They may well lead to the thing that leads to the thing that leads to the thing that leads to AGI though. Where there’s a will

Jhex@lemmy.world · 9 months ago

sure, but that can be said of literally anything. It would be interesting if LLM were at least new but they have been around forever, we just now have better hardware to run them

NιƙƙιDιɱҽʂ@lemmy.world · edit-2 9 months ago

That’s not even true. LLMs in their modern iteration are significantly enabled by transformers, something that was only proposed in 2017.

The conceptual foundations of LLMs stretch back to the 50s, but neither the physical hardware nor the software architecture were there until more recently.

Arehandoro@lemmy.ml · 9 months ago

Nvidia’s most advanced chips, H100s, have been banned from export to China since September 2022 by US sanctions. Nvidia then developed the less powerful H800 chips for the Chinese market, although they were also banned from export to China last October.

I love how in the US they talk about meritocracy, competition being good, blablabla… but they rig the game from the beginning. And even so, people find a way to be better. Fascinating.

shawn1122@lemm.ee · 9 months ago

You’re watching an empire in decline. It’s words stopped matching its actions decades ago.

Breve@pawb.social · edit-2 8 months ago

deleted by creator

wulrus@programming.dev · 9 months ago

Hello darkness my old friend

Pieisawesome@lemmy.world · 9 months ago

It’s knowledge isn’t updated.

It doesn’t know current events, so this isn’t a big gotcha moment

Klear@lemmy.world · 9 months ago

It’s still hilarious.

lud@lemm.ee · edit-2 4 months ago

deleted by creator

wulrus@programming.dev · 9 months ago

It continued like this though

Krauerking@lemy.lol · 9 months ago

Thanks for now. Bye.

What you trying to be on skynets good side or something?

تحريرها كلها ممكن@lemmy.ml · 9 months ago

We will have true AI once it is capable of answering “I don’t know” instead of making things up

PolandIsAStateOfMind@lemmy.ml · edit-2 9 months ago

Turns out, some people i know are apparently fake AI.

Naia@lemmy.blahaj.zone · 9 months ago

Which is actually something Deepseek is able to do.

Even if it can still generate garbage when used incorrectly like all of them, it’s still impressive that it will tell you it doesn’t “know” something, but can try to help if you give it more context. which is how this stuff should be used anyway.

SplashJackson@lemmy.ca · 9 months ago

Lol serves you right for pushing AI onto us without our consent

SocialMediaRefugee@lemmy.ml · 9 months ago

The determination to make us use it whether we want to or not really makes me resent it.

JOMusic@lemmy.ml · 9 months ago

and it’s open-source!

thespcicifcocean@lemmy.world · 9 months ago

how long do you think it’ll take before the west decides to block all access to the model?

JOMusic@lemmy.ml · edit-2 9 months ago

They actually can’t. Being open-source, it’s already proliferated. Apparently there are already over 500 derivatives of it on HuggingFace. The only thing that could be done is that each country in the West outlaws having a copy of it, like with other illegal materials. Even by that point, it will already be deep within business ecosystems across the globe.

Nup. OpenAI can be shut down, but it is almost impossible for R1 to go away at this point.

Eatspancakes84@lemmy.world · 9 months ago

It’s ridiculous to think that there would still be an alliance of “Western Countries”. The Greenland thing, the threats related to NATO, tariff threats, techbros weaponising the US government to escape regulation in Europe etc etc. China is the FAR more reliable partner for Europe and South America. Good luck blocking the Chinese software in the US, but I think you will find no friends with your new leader in place.

Valmond@lemmy.world · 9 months ago

Yeah there is a lot of bro-style crap going on right now, but China is a brutal dictatorship.

Choose wisely.

davel [he/him]@lemmy.ml · edit-2 9 months ago

Corkyskog@sh.itjust.works · 9 months ago

Is there a way for me to download and run it locally, or does that require a super computer?

mystic-macaroni@lemmy.ml · 9 months ago

Check out ollama.com You can download a whole bunch of models for free. The way I rum ollama is on linux from the cli, but if you can’t do it that way try jan.ai

IngeniousRocks (They/She) @lemmy.dbzer0.com · 9 months ago

If you have a GPU with ray tracing hardware and at least 12gVRAM you should be able to run it albeit slowly at home

Etterra@discuss.online · 9 months ago

Good. LLM AIs are overhyped, overused garbage. If China putting one out is what it takes to hack the legs out from under its proliferation, then I’ll take it.

davel [he/him]@lemmy.ml · 9 months ago

Cutting the cost by 97% will do the opposite of hampering proliferation.

ArchRecord@lemm.ee · edit-2 5 months ago

deleted by creator

lordnikon@lemmy.world · 9 months ago

No but it would be nice if it would turn back in the tool it was. When it was called machine learning like it was for the last decade before the bubble started.

WoodScientist@sh.itjust.works · 9 months ago

It’s not about hampering proliferation, it’s about breaking the hype bubble. Some of the western AI companies have been pitching to have hundreds of billions in federal dollars devoted to investing in new giant AI models and the gigawatts of power needed to run them. They’ve been pitching a Manhattan Project scale infrastructure build out to facilitate AI, all in the name of national security.

You can only justify that kind of federal intervention if it’s clear there’s no other way. And this story here shows that the existing AI models aren’t operating anywhere near where they could be in terms of efficiency. Before we pour hundreds of billions into giant data center and energy generation, it would behoove us to first extract all the gains we can from increased model efficiency. The big players like OpenAI haven’t even been pushing efficiency hard. They’ve just been vacuuming up ever greater amounts of money to solve the problem the big and stupid way - just build really huge data centers running big inefficient models.

UnderpantsWeevil@lemmy.world · 9 months ago

What DeepSeek has done is to eliminate the threat of “exclusive” AI tools - ones that only a handful of mega-corps can dictate terms of use for.

Now you can have a Wikipedia-style AI (or a Wookiepedia AI, for that matter) that’s divorced from the C-levels looking to monopolize sectors of the service economy.

davel [he/him]@lemmy.ml · 9 months ago

It’s been known for months that they were living on borrowed time: Google “We Have No Moat, And Neither Does OpenAI” Leaked Internal Google Document Claims Open Source AI Will Outcompete Google and OpenAI

dependencyinjection@discuss.tchncs.de · 9 months ago

Overhyped? Sure, absolutely.

Overused garbage? That’s incredibly hyperbolic. That’s like saying the calculator is garbage. The small company where I work as a software developer has already saved countless man hours by utilising LLMs as tools, which is all they are if you take away the hype; a tool to help skilled individuals work more efficiently. Not to replace skilled individuals entirely, as Sam Dead eyes Altman would have you believe.

WoodScientist@sh.itjust.works · 9 months ago

LLMs as tools,

Yes, in the same way that buying a CD from the store, ripping to your hard drive, and returning the CD is a tool.

PlantPowerPhysicist@discuss.tchncs.de · 9 months ago

Remember to cancel your Microsoft 365 subscription to kick them while they’re down

Joe Dyrt@lemmy.ml · 9 months ago

Joke’s on them: I never started a subscription!

Zink@programming.dev · 9 months ago

I don’t have one to cancel, but I might celebrate today by formatting the old windows SSD in my system and using it for some fast download cache space or something.

Phen@lemmy.eco.br · 9 months ago

“wiped”? There was money and it ceased to exist?

protist@mander.xyz · edit-2 9 months ago

The money went back into the hands of all the people and money managers who sold their stocks today.

Edit: I expected a bloodbath in the markets with the rhetoric in this article, but the NASDAQ only lost 3% and the DJIA was positive today…

Nvidia was significantly over-valued and was due for this. I think most people who are paying attention knew that

shootwhatsmyname@lemm.ee · 9 months ago

There’s been a lot of disproportionate hype around deepseek lately

Hexadecimalkink@lemmy.ml · 9 months ago

Trump counterbalance keeping it in check but my gut is saying once tariffs come in February there’s going to be a market correction. Pure speculation on my part.

Jimmycakes@lemmy.world · 9 months ago

You don’t have to say speculation when talking about the future of stocks. It’s implied unless you are a time traveler in which case you should lead with that.

Hexadecimalkink@lemmy.ml · 9 months ago

I am a time traveller and I was trying to throw you off my trail but I seem to have failed.

someacnt@sh.itjust.works · 9 months ago

To be fair, NQ futures momentarily dropped 5% before recovering some. A few days from now on would be interesting.

breadguyyyyyyyyy@sh.itjust.works · 9 months ago

“off US stocks”

mosscap@slrpnk.net · 9 months ago

It’s pixie dust

jsomae@lemmy.ml · 9 months ago

The funny thing is, this was unveiled a while ago and I guess investors only just noticed it.

NoSpotOfGround@lemmy.world · edit-2 9 months ago

Text below, for those trying to avoid Twitter:

Most people probably don’t realize how bad news China’s Deepseek is for OpenAI.

They’ve come up with a model that matches and even exceeds OpenAI’s latest model o1 on various benchmarks, and they’re charging just 3% of the price.

It’s essentially as if someone had released a mobile on par with the iPhone but was selling it for $30 instead of $1000. It’s this dramatic.

What’s more, they’re releasing it open-source so you even have the option - which OpenAI doesn’t offer - of not using their API at all and running the model for “free” yourself.

If you’re an OpenAI customer today you’re obviously going to start asking yourself some questions, like “wait, why exactly should I be paying 30X more?”. This is pretty transformational stuff, it fundamentally challenges the economics of the market.

It also potentially enables plenty of AI applications that were just completely unaffordable before. Say for instance that you want to build a service that helps people summarize books (random example). In AI parlance the average book is roughly 120,000 tokens (since a “token” is about 3/4 of a word and the average book is roughly 90,000 words). At OpenAI’s prices, processing a single book would cost almost $2 since they change $15 per 1 million token. Deepseek’s API however would cost only $0.07, which means your service can process about 30 books for $2 vs just 1 book with OpenAI: suddenly your book summarizing service is economically viable.

Or say you want to build a service that analyzes codebases for security vulnerabilities. A typical enterprise codebase might be 1 million lines of code, or roughly 4 million tokens. That would cost $60 with OpenAI versus just $2.20 with DeepSeek. At OpenAI’s prices, doing daily security scans would cost $21,900 per year per codebase; with DeepSeek it’s $803.

So basically it looks like the game has changed. All thanks to a Chinese company that just demonstrated how U.S. tech restrictions can backfire spectacularly - by forcing them to build more efficient solutions that they’re now sharing with the world at 3% of OpenAI’s prices. As the saying goes, sometimes pressure creates diamonds.

Last edited 4:23 PM · Jan 21, 2025 · 932.3K Views

Tiger@sh.itjust.works · 9 months ago

Thank you for bringing the text over, I won’t click on X.

shawn1122@lemm.ee · edit-2 9 months ago

Deepthink R1(the reasoning model) was only released on January 20. Still took a while though.