Deep Seek Language Model

New Open Source DeepSeek V3 Language Model Making Waves

In the world of large language models (LLMs) there tend to be relatively few upsets ever since OpenAI barged onto the scene with its transformer-based GPT models a few years ago, yet now it seems that ...

The Information

DeepSeek To Release Next Flagship AI Model With Strong Coding Ability

Chinese AI startup DeepSeek is expected to launch its next-generation AI model that features strong coding capabilities in the coming weeks, according to two people with direct knowledge of the plan.

Semiconductor Engineering

DeepSeek: Improving Language Model Reasoning Capabilities Using Pure Reinforcement Learning

“We introduce our first-generation reasoning models, DeepSeek-R1-Zero and DeepSeek-R1. DeepSeek-R1-Zero, a model trained via large-scale reinforcement learning (RL) without supervised fine-tuning (SFT ...

WinBuzzer

DeepSeek Reveals R1 Model Architecture Secrets Ahead of V4 Model Launch

DeepSeek has expanded its R1 whitepaper by 60 pages to disclose training secrets, clearing the path for a rumored V4 coding ...

TechCrunch

DeepSeek: Everything you need to know about the AI chatbot app

DeepSeek is backed by High-Flyer Capital Management, a Chinese quantitative hedge fund that uses AI to inform its trading decisions. AI enthusiast Liang Wenfeng co-founded High-Flyer in 2015. Wenfeng, ...

DIGITIMES

DeepSeek V4 update: Conditional memory reshapes large-model efficiency

DeepSeek founder Liang Wenfeng has published a new paper with a research team from Peking University, outlining key technical ...

Some results have been hidden because they may be inaccessible to you

Show inaccessible results