Mistral NeMo : THIS IS THE BEST LLM Right Now! (Fully Tested & Beats Qwen2, DeepSeek-V2 & Others)

Sdílet
Vložit
  • čas přidán 5. 09. 2024

Komentáře • 67

  • @adamgkruger
    @adamgkruger Před měsícem +6

    Thanks!

    • @AICodeKing
      @AICodeKing  Před měsícem

      Thanks a lot for the support. Also, you're the first comment as well!

  • @Lemure_Noah
    @Lemure_Noah Před měsícem +11

    In the nvidia NIM you must change the inference parameters. I've changed the temperature to 0.6 (NIM default is 0.2, oo low) and top P to 0.9 (default is 0.5). After this fix, the model could answer the question #1.
    Also, there is a ambiguity in the sentence *"What is the capital of the country WHOSE name ends with 'lia'"*
    If "whose" referes to the name of the country, then the answer could be what you sad "Camberra, the capital of AustraLIA". Or even "Rome, the capital of ItaLIA" (Italy in italian)
    If "whose" referes to the name of the city, then the answer is BrasiLIA, the capital of Brazil.
    It was an excellent analysis and I enjoyed it! I will check out the other videos you mentioned.

    • @AICodeKing
      @AICodeKing  Před měsícem +7

      Thanks for the great suggestion and comment. Although, When a company is hosting a demo, The default settings should be set correct by the company itself. Also,If you look at the Mistral Nemo HuggingFace page you'll see that their page also say "Unlike previous Mistral models, Mistral Nemo requires smaller temperatures. We recommend to use a temperature of 0.3". Also, you are correct about the Ambiguity in the sentence. I know that this sentence causes a confusion. But, if someone (a Human or LLM) reads carefully they'll know that it should be a Country since that's the most likely case here. This question is answered correctly by multiple LLMs in my tests. So, that's why I have kept it.

    • @monza8844
      @monza8844 Před měsícem +1

      > "Also, there is a ambiguity in the sentence "What is the capital of the country WHOSE name ends with 'lia'"
      if you had finished school you would know that "whose" always refers to the country, since it's closest.

    • @Lemure_Noah
      @Lemure_Noah Před 9 dny

      @@monza8844 I'm not native English speaker, so it's true: I didn't "finish English school". And possibly, some LLM didn't finish school either, when we see so many wrong answers for this silly question. But magically some models start answering it correctly when you apply the fix. Give a try and blame me later.

  • @magolide
    @magolide Před měsícem +10

    I showed my daughter what deepseek can do by making a tick-tack-toe game in seconds using python, that keeps track of scores and has a slick UI, she says we developers are the modern day wizards🧙‍♂️ can't wait for NeMo to be on Ollama

    • @AICodeKing
      @AICodeKing  Před měsícem +3

      Haha! Made me emotional

    • @magolide
      @magolide Před měsícem +3

      @@AICodeKing me too bro, keep up the great work. You're my go-to guy for anything AI related.

    • @vitalis
      @vitalis Před měsícem +1

      Encourage her to learn how to leverage GPTs in her daily live.

    • @Larimuss
      @Larimuss Před měsícem +1

      Get textgen ui, latest version runs.

  • @Aarifshah-A
    @Aarifshah-A Před měsícem +2

    Not missing a single video of yours keep up the good work -- good sir 🎉

  • @claxvii177th6
    @claxvii177th6 Před měsícem +3

    Sometimes i leave the vids at the start cuz my feed is already flooded with AI development content. Working with this shit i, firstly, love content like yours, but also, it is super stressful, AI content goes at a super fast rate. It is mind blowing, i read a couple of papers every week, test a bunch of stuff and still i feel like i can't keep up! It is so hard to measure the speed the AI development... for instance, i started using qwen2 for my rag applications, and there are a bunch of stuff i should be already be testing, it iz crazy.

  • @eyeseethru
    @eyeseethru Před měsícem

    Nice to see updated test questions!

  • @leoalsufi932
    @leoalsufi932 Před měsícem

    Brother, I just want to say please continue doing the great work that you’re doing don’t morph your way of doing things to be like the others which they suck… thank you you’re awesome

  • @Historypress-pq4ng
    @Historypress-pq4ng Před měsícem +1

    This might be the best LLM year reward

  • @krispybutter2555
    @krispybutter2555 Před měsícem

    You should have your own website that lists your personal AI stack combo's that are good; one for api paid like Claude 3.5 Sonnet and local free "manageable" for common users (like 16GB GPU ram or something). Lists like Maestro + Qwen 2 + DeepSeekV2 OR GPT40 mini + Qwen2 + ContinueDev OR Aider + Gemini + NextJS + Subabase OR Devyan (CrewAI) + DeepSeek-Coder-V2, etc, etc. That would be super cool! I like all the video's you got, but what combo and which stacks to use get's confusing since a lot of these get updates seemingly every month or something.

  • @MeinDeutschkurs
    @MeinDeutschkurs Před měsícem +1

    The square face is so cute! 😂

  • @MeinDeutschkurs
    @MeinDeutschkurs Před měsícem +5

    Can‘t wait to see it on ollama. And btw, yes: it is frustrating when people leave a video after about n minutes. In my videos, 90% vanish after 30 seconds. 🤣🤣🤣

  • @zx9rmario
    @zx9rmario Před měsícem

    I asked Nemo: what is the pinlayout from a NE555 IC, the answer was correct.

  • @zigma1928
    @zigma1928 Před měsícem

    Thank you! It would be nice to see how to get the most out of all these goodies in everyday coding.

  • @ReLogic888
    @ReLogic888 Před měsícem +2

    0:30 Yes you are right, not many viewers realized that testing LLM is more valueable than just words or even "benchmark".
    But, if i may, i want to give you a suggest also. I think you need to improve about video/slide transition in your content. Do not using 'up-down transition' in your video/content frequently (especially if it quite fast), it could make us as viewer has 'cybersickness' symptoms. Better use left-right transition with slow motion of the movement.

    • @AICodeKing
      @AICodeKing  Před měsícem

      This is great suggestion. I'll try to do something about it. Does it also hurt when I scroll pages to show content (Like while talking about the Model and showing the blog post)

    • @settlece
      @settlece Před měsícem

      @@AICodeKing you could put some anime girls around the border and as the vid gos on she gets less and less cloth....
      jokes aside i love your stuff real world AI info

    • @AICodeKing
      @AICodeKing  Před měsícem +1

      That would make me surpass MrBeast.

  • @clint9344
    @clint9344 Před měsícem

    lol love the humor, their loss is our gain...thanks once again AICodeKing for another excellent vid.. keep up great work. would like to see a copilot vid with this one also...be in peace God speed.

  • @mdubbau
    @mdubbau Před měsícem +3

    Can you please make a video using NeMo as a Copilot?

    • @AICodeKing
      @AICodeKing  Před měsícem +3

      Yes, will do. Waiting for it to be available on Ollama.

    • @saro.saribekyan
      @saro.saribekyan Před měsícem

      ​@@AICodeKingIsn't LM Studio an option?

  • @dDesirie
    @dDesirie Před měsícem

    Just a heads up, it's available on Open Router AI, which is both supported and recommended by Aider. I think a video comparing the coding abilities between this, GPT-4O Mini, and Claude 3.5 Sonnet when used in Aider would be super useful. What do you think?

    • @AICodeKing
      @AICodeKing  Před měsícem +1

      I'll try to do that.

    • @dDesirie
      @dDesirie Před měsícem

      @@AICodeKing Fantastic! You're the best!

  • @alby13
    @alby13 Před měsícem

    good video. I wonder how it compares to qwen2 7b,

  • @vauths8204
    @vauths8204 Před měsícem

    Thanks King!

  • @MuhanadAbulHusn
    @MuhanadAbulHusn Před měsícem

    Man I asked you before and would like to repeat my question. Can you give us some info on the minimum hardware sources required to run these small modules locally?

  • @catsanzsh
    @catsanzsh Před měsícem

    300th like:D

  • @vauths8204
    @vauths8204 Před měsícem

    New good model lol a truly prestigious title from the king

  • @bwin4531
    @bwin4531 Před měsícem

    Where should i start as a beginner learning all of this?

  • @Techn0man1ac
    @Techn0man1ac Před měsícem

    Its available in LLM Studio but not working.

  • @paulyflynn
    @paulyflynn Před měsícem

    SVG face is much easier than SVG butterfly. But what do I know, I am not a model

  • @Tofu3435
    @Tofu3435 Před měsícem

    Even in q3 it is better than llama 3 8b with q6. Also great for roleplay because no builtin moderation.

  • @HansKonrad-ln1cg
    @HansKonrad-ln1cg Před měsícem

    australia, mongolia, somalia. also: i dont think you can say so easily that a 12b model is "in the same class" as a 7b model. every billion parameters more counts imo.

  • @nothing7ish
    @nothing7ish Před měsícem

    Another new one already? As far as I know, Codestral was released a few days ago.

  • @crushfire2004
    @crushfire2004 Před měsícem

    What is current best model

    • @AICodeKing
      @AICodeKing  Před měsícem +1

      DeepSeek is currently the best.

  • @DGFilmsNYC
    @DGFilmsNYC Před měsícem

    Hunky Dory.... someone is a mamss boy lmao 🤣

  • @Historypress-pq4ng
    @Historypress-pq4ng Před měsícem

    Also how can I get money in llms ?

    • @AICodeKing
      @AICodeKing  Před měsícem

      What do you mean by "Money in LLMs"?

    • @Historypress-pq4ng
      @Historypress-pq4ng Před měsícem

      I MIGHT HOW CAN I SELL THE LLM LIKE CHATGPT TO A BETTER VERSION FOR EXAMPLE OR WE USE THE LLM TO BUILD US SOMETHING WE CAN SELL IT ?

  • @MacS7n
    @MacS7n Před měsícem

    When a new model isn’t compared with Qwen and DeepSeek you know something is wrong

  • @Noaman2022
    @Noaman2022 Před měsícem

    Bro which model is the best to use as php developer ?

    • @AICodeKing
      @AICodeKing  Před měsícem

      All are generally good. Currently, DeepSeek-Coder-V2

    • @Noaman2022
      @Noaman2022 Před měsícem

      @@AICodeKingthanks alot for fast reply bro

  • @halfrockstarpro6412
    @halfrockstarpro6412 Před měsícem

    But its knowledge stops in the year 2021!

  • @TawnyE
    @TawnyE Před měsícem

    E

  • @nicolaslaborie5015
    @nicolaslaborie5015 Před měsícem

    Australia? Somalia?
    Sure it's not China and Beijing but multiple countries end with lia

    • @AICodeKing
      @AICodeKing  Před měsícem +1

      I was looking for Australia. But, if it would have told any country that ends with "ila" and it's capital. Then, I would have given it a pass. But, it didn't. So, it's a fail.

    • @Lemure_Noah
      @Lemure_Noah Před měsícem

      @@AICodeKing I'll repeat what I wrote above: there is a ambiguity in the sentence "What is the capital of the country WHOSE name ends with 'lia'"
      If "whose" referes to the name of the country, then the answer could be what you sad "Camberra, the capital of AustraLIA". Or even "Rome, the capital of ItaLIA" (Italy in italian). The Somalia was also a valid solution.
      BUT if "whose" referes to the name of the city, then the answer could be BrasiLIA, the capital of Brazil.

  • @paulyflynn
    @paulyflynn Před měsícem

    Thanks!

  • @stonedoubt
    @stonedoubt Před měsícem +1

    Thanks!

    • @AICodeKing
      @AICodeKing  Před měsícem +1

      Thanks a lot for the support!

    • @stonedoubt
      @stonedoubt Před měsícem

      @@AICodeKing I have found your channel to be a great resource without hype… too much anyway. I am looking forward to your claude-engineer v2 review if you haven’t done one I haven’t seen yet 😎

    • @AICodeKing
      @AICodeKing  Před měsícem

      I'll do that