DeepSeek-V2: This NEW Opensource MoE Model Beats GPT-4, Claude-3 & Llama-3 in multiple benchmarks!

Sdílet
Vložit
  • čas přidán 6. 05. 2024
  • In this video, We'll be talking about Deepseek-V2 which is a new Opensource Mixture-of-experts model released by the team of DeepSeek. This model is beating GPT-4, ChatGPT, Claude-3, Gemini, Mixtral 8x22b & Llama 3 in multiple benchmarks. It can be used to create own ChatGPT clone and Github Copilot alternative. The DeepSeek team was also previously known for their State-of-the art Coding Model DeepSeek Coder.
    [Key Takeaways]
    💻 DeepSeek V2 Launch: The latest version of DeepSeek, a powerful language model, has been launched, offering a new alternative to traditional LLMs.
    🤖 Mixture of Experts Model: DeepSeek V2 uses a Mixture of Experts model, which allows it to process large amounts of data efficiently and accurately, making it a strong contender in the AI model space.
    📈 160 Experts for Specific Tasks: This model boasts 160 experts trained for specific tasks, enabling it to handle a wide range of applications and use cases.
    💸 Cost-Effective: With a pricing model of just $0.28 per 1 million tokens, DeepSeek V2 offers a cost-effective solution for businesses and developers, making it an attractive GPT-4 Alternative.
    📊 Impressive Benchmarks: DeepSeek V2 has demonstrated impressive performance in various benchmarks, including the MT Bench, MMLU, and Arithmetic benchmarks, showcasing its capabilities as a reliable ChatGPT Alternative.
    📂 Opensource and Accessible: As an Opensource LLM, DeepSeek V2 is freely available on HuggingFace, making it easily accessible to developers and researchers.
    ⏱️ Fast and Efficient: DeepSeek V2 has been shown to be fast and efficient in processing requests, making it suitable for real-time applications and chat interfaces.
    🚀 Potential for Wider Adoption: With its impressive performance and cost-effective pricing, DeepSeek V2 has the potential to be widely adopted in various industries, including customer service, language translation, and content generation.
  • Věda a technologie

Komentáře • 21

  • @HLSiTech
    @HLSiTech Před měsícem +1

    I have been waiting for this model for a long-time now! This is great! I'm going to give it a shot now. 😀

  • @ojikutu
    @ojikutu Před měsícem +2

    This is awesome news. especially for the inference pricing. Open source is really outpacing proprietery SOTA models.

  • @Epirium
    @Epirium Před měsícem +1

    This is awesome I will be using this in all my ai tools, thanks for sharing

  • @JoraMacKornev
    @JoraMacKornev Před měsícem +1

    Great model. I will try to use it.

  • @DeaViNG
    @DeaViNG Před měsícem +1

    would love to see a video about some nice tts ai

  • @vextorfx6243
    @vextorfx6243 Před měsícem

    Is this model good at other things than code? Is it better than Claude AI 3 sonnet for tasks such as solving homework and exercises?

    • @AICodeKing
      @AICodeKing  Před měsícem +1

      That depends on your type of homework.

  • @NLPprompter
    @NLPprompter Před měsícem

    watched this video in 2x speed it is really like the matrix downloading information into my brain! (less than 10 minutes video are GEMs!)

  • @cig_in_mouth3786
    @cig_in_mouth3786 Před měsícem

    Issue with deepseek use different prompts and most prompt trained for llama3/2 so vscode plugin doesnt work with it

  • @SonGoku-pc7jl
    @SonGoku-pc7jl Před měsícem

    uau! good price! but not in spanish :(

  • @Epirium
    @Epirium Před měsícem +1

    please make video on lobe chat

  • @wasimdorboz
    @wasimdorboz Před měsícem +3

    i dont think deepseek can beat claude 3

    • @AICodeKing
      @AICodeKing  Před měsícem +13

      In the benchmarks it is very near it. Let's see how it does in the LLM Arena.

    • @TimotheedeAlmeida
      @TimotheedeAlmeida Před měsícem

      yes maybe but for api user 1M generated token by claude3 is up to 75$ deepseek 0.3$ that let you think carrefully to your real needs

    • @TimotheedeAlmeida
      @TimotheedeAlmeida Před měsícem +1

      and maybe it is not tested and verified but as for now i begin to use it in french and it is doing well

    • @TheRealUsername
      @TheRealUsername Před měsícem +3

      Perhaps it is optimized for the benchmarks, we should wait for its score in the Arena.

    • @tomaszzielinski4521
      @tomaszzielinski4521 Před měsícem +1

      At 1% of Claude 3 price it is worth a try.