DeepSeek: all the news about the startup that’s shaking up AI stocks

DeepSeek: all the news about the startup that’s shaking up AI stocks

DeepSeek, a startup founded in 2023 by Liang Wenfeng in Hangzhou, China, has quickly revolutionized the AI industry with its groundbreaking R1 model that triggered a $1 trillion tech stock selloff in January 2025. The company’s exceptional cost-efficiency, building models for under $6 million with API costs of just $2.19 per million tokens, is shaking up established players like OpenAI and testing the impact of Western technology export controls to China.

Key Takeaways

  • DeepSeek’s R1 model launched in January 2025 instantly reached #1 on the Apple App Store with 2.6 million downloads in one day, surpassing ChatGPT and causing major market disruption.
  • The company’s innovative cost structure delivers AI services at roughly 1/50th the cost of competitors, with development expenses just a fraction of what established players invest.
  • DeepSeek’s technical breakthroughs include a Mixture-of-Experts architecture with 671 billion total parameters but only 37 billion active during operation, needing just 1/10th of the computing power of Meta’s Llama 3.1.
  • The company provides a complete ecosystem of AI solutions including DeepSeek-V3, R1, Coder, Math, VL, and Janus Pro, spanning general-purpose and specialized applications.
  • DeepSeek’s success despite US chip export limitations to China raises concerns about the effectiveness of current export control strategies and could reshape global AI competition.

DeepSeek’s Breakthrough Sends Shockwaves Through AI Industry

From Startup to Market Disruptor

Founded in 2023 by Liang Wenfeng in Hangzhou, China, DeepSeek has quickly transformed from newcomer to industry giant. After releasing its first AI model in November 2023, the company launched its game-changing R1 model on January 20, 2025. The DeepSeek AI assistant app immediately shot to #1 on the Apple App Store, overtaking ChatGPT with an impressive 2.6 million downloads in just one day.

Financial Earthquake

DeepSeek’s rapid rise triggered an unprecedented $1 trillion tech stock selloff on January 27, 2025. Nvidia was hit hardest, plummeting 18% and shedding $589 billion in market value. Other tech leaders weren’t spared – Microsoft, Meta, Oracle, and Broadcom all saw significant drops. Many analysts now call this a “Sputnik moment” in the AI development race, marking a clear shift in the competitive landscape.

Revolutionary Cost-Efficiency Disrupting AI Economics

DeepSeek’s approach to AI development has created a financial shakeup in the industry. Their R1 model was built for under $6 million, a fraction of ChatGPT’s 4o model, which reportedly cost $100 million. This dramatic cost difference extends to their service pricing as well.

The startup’s API costs represent a game-changing shift – just $2.19 per million tokens compared to OpenAI’s $60. For businesses running multiple AI operations, this can transform budget allocations overnight. Even more impressive, DeepSeek’s inference costs are approximately 1/50th of what Anthropic charges for their Claude 3.5 Sonnet.

Open-Source Advantages for Businesses and Developers

DeepSeek’s open-source strategy offers several cost-saving benefits:

  • Free usage with full model weights
  • Flexible deployment on local hardware or cloud infrastructure
  • Freedom from ongoing subscription costs
  • Ability to customize models without premium fees

This pricing model directly challenges the industry giants like OpenAI and Google, who’ve built business models around proprietary access to their technologies. For smaller developers and businesses previously priced out of advanced AI capabilities, DeepSeek’s approach opens doors to tools that were financially out of reach.

By cutting both development and operational costs, DeepSeek isn’t just offering cheaper alternatives – they’re potentially changing how AI gets distributed throughout the market. This could accelerate adoption across sectors where cost barriers previously limited AI implementation.

Technical Innovations Behind DeepSeek’s Performance

DeepSeek’s AI models pack impressive tech under the hood. The DeepSeek-V3 features a massive 671 billion total parameters while keeping only 37 billion active during operation. This efficient design lets it deliver powerful results without excessive computing demands.

Advanced Architecture Choices

DeepSeek achieves its performance through smart design decisions:

  • The Mixture-of-Experts (MoE) architecture activates only the most relevant parameters for each task, saving significant computing resources
  • Multi-Head Latent Attention (MLA) technology speeds up processing by focusing computational effort where it matters most
  • The R1 model incorporates specialized reasoning capabilities for complex problem-solving
  • The system reportedly uses just 1/10th of the computing power required by Meta’s Llama 3.1

These innovations let DeepSeek deliver high-quality AI performance even with limited resources, making advanced AI more accessible and cost-effective for practical applications.

Comprehensive Product Ecosystem

DeepSeek’s product lineup covers the full spectrum of AI applications, challenging established players across multiple domains. The company’s suite ranges from general-purpose models to specialized tools for specific tasks.

Diverse AI Solutions

DeepSeek’s flagship offerings include several powerful models:

  • DeepSeek-V3: A versatile chat model that goes head-to-head with OpenAI’s GPT models, handling everyday conversations and content creation
  • DeepSeek-R1: Focuses on advanced reasoning, making it ideal for complex problem-solving scenarios that require logical thinking
  • DeepSeek Coder: Targets development tasks, directly competing with GitHub Copilot in the code generation and assistance space
  • DeepSeek Math: Specializes in tackling mathematical problems with precision, filling a niche for academic and scientific applications
  • DeepSeek-VL: Interprets and analyzes images with text, similar to GPT-4V’s capabilities but with DeepSeek’s unique approach
  • Janus Pro: Creates images from text prompts, positioning itself as an alternative to popular options like Midjourney and DALL-E

The strength of DeepSeek’s approach lies in how these solutions complement each other while maintaining competitive quality in their respective categories. I’ve found their focused development strategy particularly effective, as each product addresses specific user needs rather than trying to create one-size-fits-all solutions.

Geopolitical Implications and Export Control Questions

DeepSeek’s rise highlights significant gaps in current export restriction policies. The company’s ability to develop advanced AI models while operating under US chip export limitations to China has caught the attention of policymakers and security experts alike.

Challenge to Export Controls

The success of DeepSeek directly questions the effectiveness of Western technology transfer barriers. Despite restrictions meant to slow China’s AI advancement, the startup has managed to create competitive models that rival those from OpenAI and Anthropic. This achievement suggests several important considerations:

  • Current export controls may have substantial loopholes that allow for continued AI development
  • China’s domestic computing infrastructure might be more advanced than previously assessed
  • Technical talent within China has found creative workarounds to hardware limitations
  • Supply chain diversification strategies have reduced dependency on US-controlled technologies

This situation has intensified what many analysts describe as a global AI competition between major powers. I expect this development to trigger significant policy reassessments across Western governments, particularly in the United States. The Biden administration’s semiconductor controls were specifically designed to limit China’s most advanced AI capabilities—but DeepSeek’s progress suggests these measures may need substantial revision.

Industry analysts note that US regulatory agencies are likely monitoring these developments closely, with potential responses ranging from tightened export rules to increased scrutiny of international research collaborations involving AI technologies.

What Happens Next for AI Markets and Innovation

Reshaping the Competitive Landscape

DeepSeek’s entrance has triggered several shifts that’ll reshape AI markets. The cost advantage they’ve introduced is forcing a reevaluation of pricing throughout the industry. I expect established players like OpenAI and Anthropic to face mounting pressure to slash their prices while improving performance – a challenging balancing act.

This disruption extends beyond pricing to fundamental business models. Companies built on high-margin AI services must now rethink their approach as lower-cost alternatives gain traction. For investors, this means a thorough reassessment of AI company valuations across the board, with many stocks likely to experience volatility.

The sustainability of DeepSeek’s cost advantage remains an open question. Can they maintain their edge as competitors respond? This uncertainty will drive several important trends:

  • Open-source AI development will likely accelerate globally, with more organizations sharing research and models to stay competitive
  • Market consolidation may increase as smaller players struggle to adapt, leading to strategic acquisitions
  • Profit margins will compress across the sector as price competition intensifies
  • Innovation cycles will speed up with companies racing to offer unique capabilities beyond raw performance

The next 12-18 months will be particularly telling as the market absorbs these changes. Companies that can quickly adapt their cost structures while delivering genuine innovation will emerge stronger from this transition period.

 

Table of Contents

Related Blogs

Johns Hopkins University Press Ventures into AI Collaboration with Unique Licensing Strategy

In a groundbreaking move to align academic publishing with the digital age, the Johns Hopkins

Perplexity AI Now Integrated into n8n: Smarter Automations with One Node

The integration of Perplexity AI into n8n represents a significant leap forward in workflow automation,

Introducing Perplexity Labs: The New Frontier in AI Research & Innovation

Perplexity AI has launched Perplexity Labs, a comprehensive AI-powered research and productivity platform that transforms