Chinese AI Startup DeepSeek Stuns Tech Industry, Causes Stock Panic

Free

*censored*
VVO Supporter 🍦🎈👾❤
Joined
Sep 22, 2018
Messages
42,282
Location
Moonbase Caligula
SL Rez
2008
Joined SLU
2009
SLU Posts
55565
I accidentally bumped into the NOT ALLOWED filters of DeepSeek's chatbot.

I asked it to tell me a short story about Free Xue. (It's a thing I do when performing AI prompt evaluation...) When I made a request of ChatGPT for this, it generated a 300 word tale about a digital artist in a future metropolis called NeoCity who refuses to compromise her work, and challenges the status quo. It was trite Mary Sue slop, but cute enough.

DeepSeek, however, started outputting several paragraphs on the current legal situation of a real individual, I believe this person. DeepSeek's chatbot appears to have a retroactive banning feature, as the "story" I saw written out was - before I could copy and paste it - wiped from the chatbot and replaced with "Sorry, that's beyond my current scope. Let’s talk about something else."

If I suddenly vanish from the internet, consider this a potential reason...
 

Noodles

The sequel will probably be better.
Joined
Sep 20, 2018
Messages
5,989
Location
Illinois
SL Rez
2006
Joined SLU
04-28-2010
SLU Posts
6947
Perplexity understands spelling and counting like a proper first grader.

The word "Raspberry" contains 3 instances of the letter "R". Breaking it down:
  • First letter: R
  • Seventh letter: R
  • Eighth letter: R
The full spelling is R-A-S-P-B-E-R-R-Y, with the "R" appearing at the beginning and twice consecutively near the end.
 
  • 1Interesting
Reactions: Govi

Bartholomew Gallacher

Well-known member
Joined
Sep 26, 2018
Messages
6,873
SL Rez
2002
  • 1Facepalm
Reactions: Govi

Caete

Scientist Lady of Science
Joined
Sep 20, 2018
Messages
3,766
Location
20 Minutes into the future
SL Rez
2006
I accidentally bumped into the NOT ALLOWED filters of DeepSeek's chatbot.

I asked it to tell me a short story about Free Xue. (It's a thing I do when performing AI prompt evaluation...) When I made a request of ChatGPT for this, it generated a 300 word tale about a digital artist in a future metropolis called NeoCity who refuses to compromise her work, and challenges the status quo. It was trite Mary Sue slop, but cute enough.

DeepSeek, however, started outputting several paragraphs on the current legal situation of a real individual, I believe this person. DeepSeek's chatbot appears to have a retroactive banning feature, as the "story" I saw written out was - before I could copy and paste it - wiped from the chatbot and replaced with "Sorry, that's beyond my current scope. Let’s talk about something else."

If I suddenly vanish from the internet, consider this a potential reason...
Yeah, the "sorry that's beyond my current scope" is what it shows anytime it self censors. Seems like they are having issues with the whole thing due to the influx of users just like every other start up has since forever... Maybe they can use their scrape for cash to buy more slots on a SAE data farm.
 

Kamilah Hauptmann

Shitpost Sommelier
Joined
Sep 20, 2018
Messages
15,061
Location
Cat Country (Can't Stop Here)
SL Rez
2005
Joined SLU
Reluctantly
Maybe ask DeepSeek for a summary. :shiftyeyes:
When asking Copilot what part of a particular request was forbidden it wouldn't even tell me that. I would have to start experimenting by changing one part at a time until I found the problem area.
 

Bartholomew Gallacher

Well-known member
Joined
Sep 26, 2018
Messages
6,873
SL Rez
2002
Tom's Hardware is running a story, that the $6 mio. training cost for Deepseek are misleading, because the company behind it invested $1.6bn into server buildouts and has 50.000 Hopper GPUs at its disposal.

So while the its right, getting there took a lot of more work and cost. This does not change the fact though that Deepseek did this all with domestic hires only, and they've improved the method on a major level with Multi-Head Latent Attention and other innovations.

So no good news for Nvidia, because it still could translate into wide spread usage will cause less GPU demand.

 

Free

*censored*
VVO Supporter 🍦🎈👾❤
Joined
Sep 22, 2018
Messages
42,282
Location
Moonbase Caligula
SL Rez
2008
Joined SLU
2009
SLU Posts
55565
Last month, DeepSeek turned the AI world on its head with the release of a new, competitive simulated reasoning model that was free to download and use under an MIT license. Now, the company is preparing to make the underlying code behind that model more accessible, promising to release five open source repos starting next week.

In a social media post late Thursday, DeepSeek said the daily releases it is planning for its "Open Source Week" would provide visibility into "these humble building blocks in our online service [that] have been documented, deployed and battle-tested in production. As part of the open-source community, we believe that every line shared becomes collective momentum that accelerates the journey."
While DeepSeek has been very non-specific about just what kind of code it will be sharing, an accompanying GitHub page for "DeepSeek Open Infra" promises the coming releases will cover "code that moved our tiny moonshot forward" and share "our small-but-sincere progress with full transparency." The page also refers back to a 2024 paper detailing DeepSeek's training architecture and software stack.