https://venturebeat.com/ai/why-everyone-in-ai-is-freaking-out-about-deepseek/
As of a few days ago, only the nerdiest of nerds (I say this as one) had ever heard of DeepSeek, a Chinese AI subsidiary of the equally evocatively named High-Flyer Capital Management, a quantitative analysis (or quant) firm that initially launched in 2015.
hmm…
Advanced large language model DeepSeek R1 is taking users by storm, wowing reviewers and earning praise from AI-phobes. The Hangzhou-based tech startup’s new model beats OpenAI’s o1 on math and reasoning benchmarks, and blows Meta’s* Llama 3.1 and OpenAI’s GPT-40 out of the water in coding and complex problem-solving. The model is free to run locally, with access to its API priced at a fraction of competitors’ rates. The setup reportedly cost $5.6 million to train (vs $78 million for GPT-40), and uses performance-capped chips due to US restrictions, which also saw the use ban the delivery of more powerful processers to China. Instead, DeepSeek R1 harnesses its power from superior compute efficiency.
hmm…
hmm…
hmm…
WtR