ai · April 24, 2026

DeepSeek-V4: Towards Highly Efficient Million-Token Context Intelligence

Huggingface.co

Technical Report
Introduction
We present a preview version of the DeepSeek-V4 series, including two strong Mixture-of-Experts (MoE) language models: DeepSeek-V4-Pro with 1.6T parameters (49B activated) …