
Open source AI DeepSeek is about to crash the AI bubble

Posted by stout
Porte du Lafitte
Member since Sep 2006
175646 posts
Posted on 1/26/25 at 10:32 am
How China’s new AI model DeepSeek is threatening U.S. dominance


quote:

A little-known AI lab out of China has ignited panic throughout Silicon Valley after releasing AI models that can outperform America’s best despite being built more cheaply and with less-powerful chips.

DeepSeek, as the lab is called, unveiled a free, open-source large-language model in late December that it says took only two months and less than $6 million to build, using reduced-capability chips from Nvidia called H800s.

The new developments have raised alarms on whether America’s global lead in artificial intelligence is shrinking and called into question big tech’s massive spend on building AI models and data centers.

In a set of third-party benchmark tests, DeepSeek’s model outperformed Meta’s Llama 3.1, OpenAI’s GPT-4o and Anthropic’s Claude Sonnet 3.5 in accuracy ranging from complex problem-solving to math and coding.

DeepSeek on Monday released r1, a reasoning model that also outperformed OpenAI’s latest o1 in many of those third-party tests.

“To see the DeepSeek new model, it’s super impressive in terms of both how they have really effectively done an open-source model that does this inference-time compute, and is super-compute efficient,” Microsoft CEO Satya Nadella said at the World Economic Forum in Davos, Switzerland, on Wednesday. “We should take the developments out of China very, very seriously.”

DeepSeek also had to navigate the strict semiconductor restrictions that the U.S. government has imposed on China, cutting the country off from access to the most powerful chips, like Nvidia’s H100s. The latest advancements suggest DeepSeek either found a way to work around the rules, or that the export controls were not the chokehold Washington intended.

“They can take a really good, big model and use a process called distillation,” said Benchmark General Partner Chetan Puttagunta. “Basically you use a very large model to help your small model get smart at the thing you want it to get smart at. That’s actually very cost-efficient.”

Little is known about the lab and its founder, Liang WenFeng. DeepSeek was born of a Chinese hedge fund called High-Flyer Quant that manages about $8 billion in assets, according to media reports.

But DeepSeek isn’t the only Chinese company making inroads.

Leading AI researcher Kai-Fu Lee has said his startup 01.ai was trained using only $3 million. TikTok parent company ByteDance on Wednesday released an update to its model that claims to outperform OpenAI’s o1 in a key benchmark test.

“Necessity is the mother of invention,” said Perplexity CEO Aravind Srinivas. “Because they had to figure out work-arounds, they actually ended up building something a lot more efficient.”



Nvidia Stock May Fall As DeepSeek’s ‘Amazing’ AI Model Disrupts OpenAI


quote:

America’s policy of restricting Chinese access to Nvidia’s most advanced AI chips has unintentionally helped a Chinese AI developer leapfrog U.S. rivals who have full access to the company’s latest chips.

This proves a basic reason why startups are often more successful than large companies: Scarcity spawns innovation.

A case in point is the Chinese AI Model DeepSeek R1 — a complex problem-solving model competing with OpenAI’s o1 — which “zoomed to the global top 10 in performance”— yet was built far more rapidly, with fewer, less powerful AI chips, at a much lower cost, according to the Wall Street Journal.

The success of R1 should benefit enterprises. That’s because companies see no reason to pay more for an effective AI model when a cheaper one is available — and is likely to improve more rapidly.



quote:

DeepSeek’s success could spawn new rivals to U.S.-based large language model developers. If these startups build powerful AI models with fewer chips and get improvements to market faster, Nvidia revenue could grow more slowly as LLM developers replicate DeepSeek’s strategy of using fewer, less advanced AI chips.






This post was edited on 1/26/25 at 10:34 am
Posted by thatguy777
br
Member since Feb 2007
2493 posts
Posted on 1/26/25 at 10:40 am to
There are several reports that the 6 mil figure is completely bogus. Supposedly they can't report the hundreds of Nvidia H100 chips they purchased before the US implemented certain regs. I am not that familiar with the situation and the details surrounding it, but it does seem far-fetched that the 6 mil figure is legit.
This post was edited on 1/26/25 at 10:41 am
Posted by FnTigers
Member since Sep 2021
1912 posts
Posted on 1/26/25 at 11:02 am to
It still leaves a lot to be desired. Guess we'll see.
Posted by jefforize
Member since Feb 2008
45008 posts
Posted on 1/26/25 at 11:09 am to
Fairly concerning for anyone with tech exposure.

Posted by Naked Bootleg
Premium Plus® Member
Member since Jul 2021
2708 posts
Posted on 1/26/25 at 11:17 am to
Not sure I'm buying it.

"it" being China developing something that bested Western tech to such an allegedly large degree.
Posted by lynxcat
Member since Jan 2008
24721 posts
Posted on 1/26/25 at 11:32 am to
No it isn’t. This technology gets commoditized almost immediately. The pace of change is unlike anything the world has ever experienced. The big US firms can go design their own approaches now that they know the approach DeepSeek uses is so efficient.

I think folks expect too much Linear A to B to C in a space in its infancy. It’s great to know there are new approaches that challenge the leaders to rethink their entire approach.
This post was edited on 1/26/25 at 11:33 am
Posted by jefforize
Member since Feb 2008
45008 posts
Posted on 1/26/25 at 11:40 am to
Do you think people will be rushing to buy NVDA tomorrow?
Posted by lynxcat
Member since Jan 2008
24721 posts
Posted on 1/26/25 at 11:48 am to
This is just noise in an eventual AI reality. Doesn’t change the thesis on Nvidia.
Posted by MrLSU
Yellowstone, Val d'isere
Member since Jan 2004
28258 posts
Posted on 1/26/25 at 5:15 pm to
I’ve been using Deepseek today and it’s pretty impressive. It’s hands down better than Anthropic, ChatGPT and Grok
Posted by jefforize
Member since Feb 2008
45008 posts
Posted on 1/26/25 at 5:25 pm to
Yeah man. This is a disruptor
Posted by I Love Bama
Alabama
Member since Nov 2007
38303 posts
Posted on 1/26/25 at 5:42 pm to
Agreed. Futures taking a beating this evening and I plan on getting out of my AI plays.

Posted by kaaj24
Dallas
Member since Jan 2010
790 posts
Posted on 1/26/25 at 6:05 pm to
Yes, a rising tide lifts all boats. Given the need of data processing for a world with AI tools expanding, I am still bullish on Nvidia.

I haven’t invested in any other AI plays directly.
Posted by The Egg
Houston, TX
Member since Dec 2004
81840 posts
Posted on 1/26/25 at 6:25 pm to
Posted by BottomlandBrew
Member since Aug 2010
28394 posts
Posted on 1/26/25 at 6:35 pm to
quote:

AI lab out of China


quote:

Open source


Does not compute.
Posted by lsuconnman
Baton rouge
Member since Feb 2007
3595 posts
Posted on 1/26/25 at 7:05 pm to
(no message)
This post was edited on 3/15/25 at 12:35 pm
Posted by Turnblad85
Member since Sep 2022
3154 posts
Posted on 1/26/25 at 7:22 pm to
FYI, those nerds over at reddit seem to think it's a junk knockoff.


Might be a good day to buy some AI stocks on sale.
Posted by stout
Porte du Lafitte
Member since Sep 2006
175646 posts
Posted on 1/26/25 at 7:25 pm to
quote:

Might be a good day to buy some AI stocks on sale.


Would be nice to pick up some discounted Nvidia. Any dip for them is going to be temporary
Posted by Boomer Rick
Member since Apr 2021
247 posts
Posted on 1/26/25 at 7:31 pm to
Do you have a link? I don’t like reddit but I could see it being useful for this discussion.
Posted by thatguy777
br
Member since Feb 2007
2493 posts
Posted on 1/26/25 at 7:36 pm to
I downloaded the app over the weekend and asked it for the best restaurants in br, and it responded with number 1 being Restaurant IPO, which closed down a while back.
Posted by bayoubengals88
LA
Member since Sep 2007
21187 posts
Posted on 1/26/25 at 7:36 pm to
quote:

Might be a good day to buy some AI stocks on sale.


This is my thinking. Or some short range QQQ Calls. Something 7-21 days out to give the market time to bounce back from a nothingburger.