Tech 21 Century

  • About
  • MY BOOKS
  • Home
  • Electronics
    • Apple Products
    • Gadgets
    • Mobile Phones
  • General Tech
    • Tech News
    • Smart Home Tech
    • Technology Certifications
    • VoIP
    • Web Hosting
  • Software
    • Windows
    • Mobile Apps
    • Software Product Reviews
    • Top Software Products
    • Video Conversion Software
  • Product Reviews
    • Computers and Peripherals
    • Home Routers and Modems
    • Mesh WiFi
    • Smart Home
    • VPN
  • Security
  • Computers
  • Gaming
  • Internet
  • Video Streaming
You are here: Home / Tech News / DeepSeek V3 Announced on the Hugging Face AI Platform
I may earn a small commission if you buy through the links in this website without any extra cost to you. My Recommendations however are not biased in any way.

DeepSeek V3 Announced on the Hugging Face AI Platform

The Chinese company DeepSeek has released its latest large language model (LLM) called DeepSeek-V3-0324. This 641 gigabyte model was released on the Hugging Face AI platform with minimal pre-announcement, in line with the company’s practice of restrained product announcements.

deepseeek v3

The model is unique in that its license allows for free commercial use. Early benchmarks show that DeepSeek-V3-0324 is capable of running on commercially available hardware, such as Apple’s Mac Studio with the M3 Ultra processor.

AI scientist Awni Hannun from Apple reported that this model is capable to achieve a processing speed of over 20 tokens per second using this Apple hardware setup.

This ability to run a large language model on local ready-to-use hardware is the exact opposite of the conventional way of using other AI models that require massive data center infrastructures to support high-performance AI models.

According to DeepSeek, initial tests have shown significant improvement compared to previous versions.

The model has been rigorously tested by internal stakeholders and has excellent performance, potentially surpassing all other competitive models and even Anthropic’s Claude Sonnet 3.5 in non-logical tasks.

However, unlike subscription-based models like Sonnet and ChatGPT, DeepSeek-V3-0324 is free to download and use.

Technically, the model is a Mixture of Experts (MoE) architecture. It selectively uses around 37 billion of its 671 billion parameters per task, enhancing efficiency by reducing computational requirements while maintaining performance.

The model also uses Multi-Head Latent Attention (MLA) and Multi-Token Prediction (MTP) technologies, which contribute to improved context retention and faster output speed.

Access to the model can be obtained through Hugging Face, the OpenRouter API and chat interface, and the DeepSeek chat platform, if desired. The Hyperbolic Labs inference provider also offers access to the model.

Key Features and Performance

  • The model was pre-trained on 14.8T tokens using 2.788M GPU hours
  • DeepSeek-V3 outperforms Llama 3.1 405B and GPT-4o on key benchmarks
  • It demonstrates exceptional capabilities in coding and mathematical tasks
  • The model is designed for a wide range of natural language processing tasks

DeepSeek AI also offers API access and an online demo for those looking to test the model’s capabilities. The company originally released DeepSeek R1 on Jan. 20, 2025, which is an LLM that runs on open source license which is free to use, and has shaken the AI technology market with its affordability in terms of costs.

Spread the love

Related Posts

  • Ericsson Mobility Report Predicts Mobile Traffic to Double by 2030
  • AI Transformation Drives Urgent Need for ICT Workforce Reskilling, Finds Industry Report
  • The 5G Energy Efficiency Challenge – Is 5G Mobile Technology Energy Efficient?
  • T-Mobile and Verizon Set to Acquire US Cellular’s Network Infrastructure
  • Leading Hard Drive Company Plans to Introduce a Massive 60TB Solid State Drive by 2024

Filed Under: Tech News

About Harris Andrea

Harris Andrea is an IT professional with more than 2 decades of experience in the technology field. He has worked in a diverse range of companies including software and systems integrators, computer networking firms etc. Currently he is employed in a large Internet Service Provider. He holds several professional certifications including Cisco CCNA, CCNP and EC-Council's CEH and ECSA security certifications. Harris is also the author of 2 technology books which are available at Amazon here.

Leave a Reply

Your email address will not be published. Required fields are marked *

This site uses Akismet to reduce spam. Learn how your comment data is processed.

Search this Site

Copyright © 2026 | Contact | Privacy Policy & Disclaimers | Amazon Affiliate Disclaimer | Terms of Service