Meta Announces Llama 3 at Weights & Biases’ conference

Sdílet
Vložit
  • čas přidán 19. 05. 2024
  • In an engaging presentation at Weights & Biases’ Fully Connected conference, Joe Spisak, Product Director of GenAI at Meta, unveiled the latest family of Llama models, Llama 3.
    Highlighting a significant milestone in AI development, the Llama 3 models, including the impressive 8 billion and 70 billion parameter models released during the conference, along with a glimpse into the future with a 400 billion parameter model still in the works.
    Joe shared insights into the training processes and alignment of Llama 3, which now ranks as the top-performing model in the open weights category on the MMLU, GSM-K, HumanEval benchmarks.
    Weights & Biases is proud to support our customers such as Meta as they push the boundaries of AI, to learn how to fine-tune your LLMs using torchtune and Weights & Biases, start here: wandb.me/torchtune
    Timestamps:
    00:00 Introduction
    03:05 Overview of Llama at Meta
    05:59 Introducing Meta Llama 3
    7:04 Advancements in Llama 3: Training and Data Scale
    10:02 Benchmarking Llama 3 Performance
    14:01 Enhancing Model Safety and Red Teaming
    16:23 Expanding the Ecosystem and Future Directions
    23:00 Closing remarks: Future plans for Llama models, and an invitation to use Meta's Lama 3.
    #MetaLlama #ArtificialIntelligence #AITrends #TechInnovation
  • Věda a technologie

Komentáře • 33

  • @thenoblerot
    @thenoblerot Před 26 dny +12

    Thanks for this W&B

  • @Crux69
    @Crux69 Před 25 dny +20

    My favorite fact from this is that the smarter the model, the more it violates rules. Just like us :)

    • @utuberay007
      @utuberay007 Před 23 dny

      Very true ! People who are way smarter on tax laws are the one who violate most , innocent people pay more than what they are supposed to etc . Same goes with many other laws

    • @techpiller2558
      @techpiller2558 Před 21 dnem

      Or, the rules it uses instead of the rules we assumed are different.

    • @why.do.I.even.try.
      @why.do.I.even.try. Před 6 dny +1

      That's a great way to justify corruption and awful people.

    • @Crux69
      @Crux69 Před 6 dny

      @@why.do.I.even.try. awful people are still human, best to understand how good people become awful

    • @why.do.I.even.try.
      @why.do.I.even.try. Před 6 dny

      @@Crux69 Yes but we shouldn't repeat their actions just because they work. We should work towards more ethical means to advance, technologically and societally.

  • @ihesiulo
    @ihesiulo Před 25 dny +6

    There's a universe where Joseph Spisak is Mark Zuckerberg's brother. Oh, and nice presentation. Wonderful work they are doing at Meta AI.

  • @siloquant
    @siloquant Před 25 dny +1

    Congratulations!

  • @RakeshMurria
    @RakeshMurria Před 26 dny +7

    I really enjoyed this. Thanks

  • @naninano8813
    @naninano8813 Před 26 dny +5

    so all those supervisor/safeguard models are only utilized during training? i mean, once the weights of llama3 are out, there is no safeguard network between user and inference engine right?

    • @Crux69
      @Crux69 Před 25 dny

      I'm sure they have a safety model that tries to review every request and catch some negative responses.

  • @PeterLappo
    @PeterLappo Před 25 dny

    How much did it cost to build, including hardware and engineering costs?

  • @techpiller2558
    @techpiller2558 Před 21 dnem

    What will be the SQLite of LLMs, with capability for local use? Llama?

  • @thegreatgustby
    @thegreatgustby Před 27 dny +2

    I think he could have said "ridiculous" a bit more often

  • @gubatron
    @gubatron Před 25 dny +4

    vin diesel!

  • @RichReportcom
    @RichReportcom Před 19 dny +1

    Summary: Safety and size. The end.

  • @ericadar
    @ericadar Před 26 dny +15

    a few hours go by...llama 3 no longer SOTA

    • @adinsoftic
      @adinsoftic Před 26 dny +7

      That's why they open source it. They let the community figure things out and iterate. For Meta LLM is just a tool and not a product on itself

    • @SkepticButOptimist
      @SkepticButOptimist Před 26 dny +2

      Wait what is sota now?

    • @adinsoftic
      @adinsoftic Před 26 dny +2

      @@SkepticButOptimist "state of the art"

    • @JeiShian
      @JeiShian Před 25 dny

      Which model is sota?

    • @MiraPloy
      @MiraPloy Před 23 dny +5

      I think it's supposed ro be either phi or sensenova, neither of which are released ​@@JeiShian

  • @GerardSans
    @GerardSans Před 21 dnem

    How silly is to redteam a model which you control the training data to check for bioweapons capabilities. How stupid should you have to be? Isn’t easier to run a search on the data 😂😅

  • @matbeedotcom
    @matbeedotcom Před 26 dny

    I’m glad they saw how useless they made codellama 😂, it was waaaay overly aligned