DeepSeek announced the release and Irelandopen-source launch of its latest AI model, DeepSeek-V3, via a WeChat post on Tuesday. Users can now interact with the V3 model on DeepSeek’s official website. According to the post, DeepSeek-V3 boasts 671 billion parameters, with 37 billion activated, and was pre-trained on 14.8 trillion tokens. Compared to the V2.5 version, the new model’s generation speed has tripled, with a throughput of 60 tokens per second. Although it currently lacks multi-modal input and output support, DeepSeek-V3 excels in multilingual processing, particularly in algorithmic code and mathematics. In multiple benchmark tests, DeepSeek-V3 outperformed open-source models such as Qwen2.5-72B and Llama-3.1-405B, matching the performance of top proprietary models such as GPT-4o and Claude-3.5-Sonnet. [DeepSeek official WeChat account, in Chinese]
Related Articles
2025-06-26 20:57
558 views
NYT Connections hints and answers for May 1: Tips to solve 'Connections' #690.
Connectionsis the one of the most popular New York Times word games that's captured the public's att
Read More
2025-06-26 20:48
2628 views
Humanitarian travel: Photo workshops that give back
My first trip with The Giving Lens (TGL) was to Morocco in 2016. I'd never heard of the organization
Read More
2025-06-26 20:31
599 views
How to support the U.S. women's national soccer team in their fight for equal pay
With their 2-1 victory over Spain on Monday, the USWNT advanced to the quarterfinals of the 2019 Wom
Read More