Understand ChatGPT Better Than 99% of People

Sdílet
Vložit
  • čas přidán 22. 08. 2024

Komentáře • 42

  • @borismeinardus
    @borismeinardus  Před 6 měsíci +1

    🚀 There is so much more to explore in ML than just LLMs like ChatGPT. Feel free to grab my FREE cheat sheet of different ML domains and open challenges:
    borismeinardus.substack.com/p/a-list-of-different-ml-domains

    • @observer698
      @observer698 Před 6 měsíci

      Trying to register using your link above but it is asking for an organization email? When I use my personal email address there is no new registration option??

  • @srisaisubramanyamdavanam9912
    @srisaisubramanyamdavanam9912 Před 6 měsíci +7

    Prompt engineering is all about supervised fine tuning the model.
    Great video bro

    • @borismeinardus
      @borismeinardus  Před 6 měsíci +4

      Thank you!!
      Well, with prompt engineering you don't really train a model, but it is related to the SFT stage of a model because you partially improve model response quality by engineering the prompt in a way that aligns with the formatting used during SFT :)
      I hope this makes sense haha

    • @srisaisubramanyamdavanam9912
      @srisaisubramanyamdavanam9912 Před 6 měsíci

      Haa makes sense. Thanks for correction

    • @rohanpandey9957
      @rohanpandey9957 Před 6 měsíci

      ​@@borismeinardusthis indeed

  • @abdelkaioumbouaicha
    @abdelkaioumbouaicha Před 5 měsíci +1

    📝 Summary of Key Points:
    📌 The video explains the working mechanism of chat GPT, focusing on how it understands text through an attention mechanism that identifies relationships between words to predict the next word accurately.
    🧐 Chat GPT is trained through methods like mask language modeling and next token prediction to understand language, grammar, and world knowledge. It undergoes pre-training on vast amounts of data to generate text that makes sense.
    🚀 Supervised fine-tuning and reinforcement learning from human feedback are crucial steps in training chat GPT to provide responses aligned with desired outcomes and preferences, enhancing its ability to generate suitable text.
    💡 Additional Insights and Observations:
    💬 The attention mechanism in chat GPT helps it comprehend context and relationships between words, improving its predictive capabilities.
    📊 GPT models are trained on massive datasets to learn language, grammar, and world knowledge, enabling them to generate coherent text.
    🌐 Techniques like retrieval augmentation generation (RAG) allow for customizing GPT models without extensive retraining, making them versatile for various tasks.
    📣 Concluding Remarks:
    The video delves into the intricate workings of chat GPT, highlighting its training methods, mechanisms for understanding text, and techniques for customization. Understanding these processes provides insights into how AI language models like GPT function and adapt to different contexts effectively.
    Generated using TalkBud

  • @redeux
    @redeux Před 6 měsíci +1

    The GTC sessions are all pretty badly timed for me but going to wake up extra early for a few. Had considered a full day workshop but they are really full night events for me lol. Thanks for sharing that!

  • @weizhang446
    @weizhang446 Před 2 měsíci

    Simple (and probably stupid) question: does a domain-specific problem, say understanding ocean data or medical images (here I'm using the word "understanding" in a very loosened way), need a specific pre-training on the domain data? Or can one just use an LLM pre-trained on the whole internet corpus and then fine-tune it?

    • @borismeinardus
      @borismeinardus  Před 2 měsíci

      Not a stupid question at all!
      In this case you would finetune it ☺️

  • @rohanpandey9957
    @rohanpandey9957 Před 6 měsíci

    hope you can make a video on implementing a simple paper i just saw your project video but got no idea how to sctually do ut ima be honest i need some hand holding at start 😅

  • @dariusfoodeei8181
    @dariusfoodeei8181 Před 6 měsíci +2

    Hey from EPFL in Switzerland, great video! Is it me or is the resolution downgraded though?

    • @borismeinardus
      @borismeinardus  Před 6 měsíci +1

      Thank you! But yeah.. I did an oopsie and uploaded a 1080p version :(
      I hope it still works!

    • @borismeinardus
      @borismeinardus  Před 6 měsíci

      I'm experiment with new editing and so on. In the next video I will fix it! Thank you for the feedback though! :)

    • @dariusfoodeei8181
      @dariusfoodeei8181 Před 6 měsíci

      @@borismeinardus Editing is super clean though! Those animations are sick keep it up

  • @suryanshsinha2838
    @suryanshsinha2838 Před 5 měsíci

    Please do a video on the most efficient way to get comfortable with PyTorch

  • @leokonto6860
    @leokonto6860 Před 6 měsíci +6

    Your temperature knob is turned to over 9000

  • @skyndream211
    @skyndream211 Před 4 měsíci

    Thank you, Boris, for being a creator 😍

  • @abhinav__pm
    @abhinav__pm Před 5 měsíci

    Bro, I want to fine-tune a model for a translation task. However, I encountered a ‘CUDA out of memory’ error. Now, I plan to purchase a GPU from AWS ec2 instance. How is the payment processed in AWS? They asked for card details when I signed up. Do they automatically process the payment?

  • @TechDevin
    @TechDevin Před 5 měsíci

    Plzz make a video on machine learning course in German in public universities plzzzzz bro

  • @shadowskullG
    @shadowskullG Před 6 měsíci +2

    even if I win I will have to buy a new case mine is micro atx but definitely worth it

    • @borismeinardus
      @borismeinardus  Před 6 měsíci +4

      haha yeah, need to be aware of the electricity bill 💸

  • @athenaconstantinou5353
    @athenaconstantinou5353 Před 6 měsíci +1

    Great work!

  • @anweshachowdhury6125
    @anweshachowdhury6125 Před 6 měsíci +1

    Would you do a live coding stream in the future?

  • @NuggzBii
    @NuggzBii Před 5 měsíci

    You should do a video about Word2Vec

  • @skullgamer977
    @skullgamer977 Před 6 měsíci +1

    Hey bro can you tell how can I make a software easily in my 4gb ram pc and suggest some software ideas

    • @borismeinardus
      @borismeinardus  Před 6 měsíci +1

      Hey!
      Not 100% sure what you mean with software but if you refer to models that you want to train you can look at some of these different options
      - using smaller models
      - quantizing the models, i.e. changing the datatype from to smaller ones like float16 or float8
      - using parameter efficient fine tuning techniques like LoRA
      - using smaller batchsize
      - using lower resolution images (if using images/ videos)
      I hope this somewhat helps :)

    • @skullgamer977
      @skullgamer977 Před 6 měsíci

      @@borismeinardus thanks bro

  • @rohanpandey9957
    @rohanpandey9957 Před 6 měsíci

    nice video man

  • @vettv3462
    @vettv3462 Před 6 měsíci

    All the music etc was a bit distracting.
    I didnt hear anything between the start and the end. Lol

  • @shobhitagnihotri416
    @shobhitagnihotri416 Před 6 měsíci +2

    Your view on sora

    • @borismeinardus
      @borismeinardus  Před 6 měsíci +4

      Sora is a crazy good model - not perfect of course, but really really good. It's so much fun to think of how people will use it in the future and what effect it (amongst all of AI) will have on the world. It's the speed of progress that makes this work exciting - just think of the Will Smith eating spaghetti generated video from like 11 months ago. Let's see where we will be in one more year!
      Some might argue, it is not the most revolutionary when it comes to the technical details, but I think that is not why Sora is so amazing and relevant.
      It shows where the world is heading, and at what insane speed we are moving. We, who are interested in that technology are aware of all this, but the "normal" population still isn't but most definitely will need to adapt.
      That's why ChatGPT was also so important. It brought an immense amount of attention to the world of AI.

  • @athul_sidiq
    @athul_sidiq Před 6 měsíci

    Damn 🔥

  • @ThePositiev3x
    @ThePositiev3x Před 5 měsíci

    I'm afraid your understanding of the temperature is wrong.