DeepSeek releases model it calls 'intermediate step' towards 'next-generation architecture'
Chinese AI developer DeepSeek has unveiled its latest model, DeepSeek-V3.2-Exp, describing it as an “experimental release” with improved training efficiency and enhanced ability to handle long text sequences. The Hangzhou-based firm called it an i...

The Hangzhou-based company called DeepSeek-V3.2-Exp an "intermediate step toward our next-generation architecture" in a post on developer forum Hugging Face.
That architecture will likely be DeepSeek's most important product release since V3 and R1 shocked Silicon Valley and tech investors outside China.
The V3.2-Exp model includes a mechanism called DeepSeek Sparse Attention, which the Chinese firm says can cut computing costs and boost some types of model performance. DeepSeek said in a post on X on Monday that it is cutting API prices by "50%+".
While DeepSeek's next-generation architecture is unlikely to roil markets as previous versions did in January, it could still put significant pressure on domestic rivals like Alibaba's Qwen and U.S. counterparts like OpenAI if it can repeat the success of DeepSeek R1 and V3.
That would require it to demonstrate high capability for a fraction of what competitors charge and spend in model training.
The Economic Times Business News App for the Latest News in Business, Sensex, Stock Market Updates & More.
The Economic Times News App for Quarterly Results, Latest News in ITR, Business, Share Market, Live Sensex News & More.