DeepSeek launches new AI model with 671 billion parameters, rivaling GPT-4o

Credit: 123rf

DeepSeek announced the release and open-sourcing of its latest AI model, DeepSeek-V3, in a WeChat post on Tuesday. Users can now interact with the V3 model on DeepSeek’s official website. According to the post, DeepSeek-V3 has 671 billion parameters, of which 37 billion are activated, and was pre-trained on 14.8 trillion tokens. Compared with V2.5, the new model generates text three times faster, reaching a throughput of 60 tokens per second. Although it currently lacks multi-modal input and output support, DeepSeek-V3 performs well in multilingual processing and is particularly strong in algorithmic code and mathematics. In multiple benchmark tests, DeepSeek-V3 outperformed open-source models such as Qwen2.5-72B and Llama-3.1-405B and matched the performance of top proprietary models such as GPT-4o and Claude-3.5-Sonnet. [DeepSeek official WeChat account, in Chinese]
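
To put the quoted figures in proportion, here is a rough back-of-the-envelope sketch. It uses only the numbers from the post; the function names are hypothetical, and the V2.5 speed is an inference from the word "tripled," not a figure the post states.

```python
# Figures quoted in the WeChat post.
TOTAL_PARAMS_B = 671      # total parameters, in billions
ACTIVE_PARAMS_B = 37      # activated parameters, in billions
V3_TOKENS_PER_SEC = 60    # reported V3 throughput
SPEEDUP_OVER_V2_5 = 3     # "tripled" relative to V2.5


def active_fraction(active_b: float, total_b: float) -> float:
    """Share of the model's parameters that are activated."""
    return active_b / total_b


def implied_previous_speed(current: float, speedup: float) -> float:
    """Throughput implied for the older model (an inference, not a quoted figure)."""
    return current / speedup


if __name__ == "__main__":
    share = active_fraction(ACTIVE_PARAMS_B, TOTAL_PARAMS_B)
    print(f"Activated share of parameters: {share:.1%}")
    old = implied_previous_speed(V3_TOKENS_PER_SEC, SPEEDUP_OVER_V2_5)
    print(f"Implied V2.5 throughput: {old:.0f} tokens per second")
```

In other words, only about one parameter in eighteen is activated, and the tripling claim implies that V2.5 ran at roughly 20 tokens per second.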

Written by Buzzapp Master
