What is Retrieval-Augmented Generation (RAG)?

Sdílet
Vložit
  • čas přidán 7. 05. 2024
  • Try RAG with watsonx → ibm.biz/BdMsRT
    Learn more about RAG→ ibm.biz/BdMsRt
    Large language models usually give great answers, but because they're limited to the training data used to create the model. Over time they can become incomplete--or worse, generate answers that are just plain wrong. One way of improving the LLM results is called "retrieval-augmented generation" or RAG. In this video, IBM Senior Research Scientist Marina Danilevsky explains the LLM/RAG framework and how this combination delivers two big advantages, namely: the model gets the most up-to-date and trustworthy facts, and you can see where the model got its info, lending more credibility to what it generates.
    Get started for free on IBM Cloud → ibm.biz/sign-up-now
    Subscribe to see more videos like this in the future → ibm.biz/subscribe-now

Komentáře • 355

  • @xzskywalkersun515
    @xzskywalkersun515 Před 5 měsíci +439

    This lecturer should be given credit for such an amazing explanation.

    • @cosmicscattering5499
      @cosmicscattering5499 Před 3 měsíci +3

      I was thinking the same, she explained this so clearly.

    • @tariqmking
      @tariqmking Před měsícem +1

      Yes this was excellently explained, kudos to her.

    • @brianmi40
      @brianmi40 Před měsícem +5

      Or at least credit for being able to write backwards!

  • @vt1454
    @vt1454 Před 6 měsíci +293

    IBM should start a learning platform. Their videos are so good.

    • @XEQUTE
      @XEQUTE Před 5 měsíci +6

      i think they already do

    • @srinivasreddyt9555
      @srinivasreddyt9555 Před měsícem

      Yes, they have it already. CZcams.

    • @siddheshpgaikwad
      @siddheshpgaikwad Před 19 dny +1

      Its mirrored video, she wrote naturally and video was mirrored later

    • @Hossam_Ahmed_
      @Hossam_Ahmed_ Před 18 dny

      They have skill build but not videos at least most of the content

    • @CaptPicard81
      @CaptPicard81 Před 16 dny

      They do, I recently attended a week long AI workshop based on an IBM curriculum

  • @ghtgillen
    @ghtgillen Před 7 měsíci +40

    Your ability to write backwards on the glass is amazing! ;-)

    • @jsonbourne8122
      @jsonbourne8122 Před 5 měsíci +18

      They flip the video

    • @Paul-rs4gd
      @Paul-rs4gd Před 3 měsíci +6

      @@jsonbourne8122 So obvious, but I did not think of it. My idea was way more complicated!

  • @vikramn2190
    @vikramn2190 Před 7 měsíci +30

    I believe the video is slightly inaccurate. As one of the commenters mentioned, the LLM is frozen and the act of interfacing with external sources and vector datastores is not carried out by the LLM.
    The following is the actual flow:
    Step 1: User makes a prompt
    Step 2: Prompt is converted to a vector embedding
    Step 3: Nearby documents in vector space are selected
    Step 4: Prompt is sent along with selected documents as context
    Step 5: LLM responds with given context
    Please correct me if I'm wrong.

    • @DJ-lo8qj
      @DJ-lo8qj Před 19 dny

      I’m not sure. Looking at OpenAI documentation on RAG, they have a similar flow as demonstrated in this video. I think the retrieval of external data is considered to be part of the LLM (at least per OpenAI)

    • @PlaytimeEntertainment
      @PlaytimeEntertainment Před 17 dny

      I do not think retrieval is part of LLM. LLM is the best model at the end of convergence after training. It can't be modified rather after LLM response you can always use that info for next flow of retrieval

  • @jordonkash
    @jordonkash Před 2 měsíci +27

    4:15 Marina combines the colors of the word prompt to emphasis her point. Nice touch

  • @ntoscano01
    @ntoscano01 Před 3 měsíci +21

    Very well explained!!! Thank you for your explanation of this. I’m so tired of 45 minute CZcams videos with a college educated professional trying to explain ML topics. If you can’t explain a topic in your own language in 10 minutes or less than you have failed to either understand it yourself or communicate effectively.

  • @ericadar
    @ericadar Před 4 měsíci +47

    Marina is a talented teacher. This was brief, clear and enjoyable.

  • @geopopos
    @geopopos Před měsícem +39

    I love seeing a large company like IBM invest in educating the public with free content! You all rock!

  • @maruthuk
    @maruthuk Před 7 měsíci +18

    Loved the simple example to describe how RAG can be used to augment the responses of LLM models.

  • @xdevs23
    @xdevs23 Před měsícem +5

    The entire video I've been wondering how they made the transparent whiteboard

  • @javi_park
    @javi_park Před 3 měsíci +28

    hold up - the fact that the board is flipped is the most underrated modern education marvel nobody's talking about

    • @RiaKeenan
      @RiaKeenan Před 3 měsíci

      I know, right?!

    • @euseikodak
      @euseikodak Před 3 měsíci +4

      Probably they filmed it in front of a glass board and flipped the video on edition later on

    • @politicallyincorrect1705
      @politicallyincorrect1705 Před 2 měsíci

      Filmed in front of a non-reflective mirror.

    • @TheTomtz
      @TheTomtz Před měsícem

      Just simply write on a glass board ,record it from the other side and laterally flip the image! Simple aa that.. and pls dont distract people from the contents being lectured by thinkin about the process behind the rec🤣

    • @thewallstreetjournal5675
      @thewallstreetjournal5675 Před měsícem

      Is the board fliped or has she been flipped?

  • @TheAllnun21
    @TheAllnun21 Před 5 měsíci +16

    Wow, this is the best beginner's introduction I've seen on RAG!

  • @aam50
    @aam50 Před 5 měsíci +16

    That's a really great explanation of RAG in terms most people will understand. I was also sufficiently fascinated by how the writing on glass was done to go hunt down the answer from other comments!

  • @m.kaschi2741
    @m.kaschi2741 Před 5 měsíci +5

    Wow, I opened youtube coming from the ibm blog just to leave a comment. Clearly explained, very good example, and well presented as well!! :) Thank you

  • @ReflectionOcean
    @ReflectionOcean Před 5 měsíci +21

    1. Understanding the challenges with LLMs - 0:36
    2. Introducing Retrieval-Augmented Generation (RAG) to solve LLM issues - 0:18
    3. Using RAG to provide accurate, up-to-date information - 1:26
    4. Demonstrating how RAG uses a content store to improve responses - 3:02
    5. Explaining the three-part prompt in the RAG framework - 4:13
    6. Addressing how RAG keeps LLMs current without retraining - 4:38
    7. Highlighting the use of primary sources to prevent data hallucination - 5:02
    8. Discussing the importance of improving both the retriever and the generative model - 6:01

  • @Lucildor
    @Lucildor Před 3 měsíci

    Please keep all these videos coming! They are so easy to understand and straightforward. Muchas gracias!

  • @GregSolon
    @GregSolon Před 2 měsíci

    One of the easiest to understand RAG explanations I've seen - thanks.

  • @redwinsh258
    @redwinsh258 Před 6 měsíci +21

    The interesting part is not retrieval from the internet, but retrieval from long term memory, and with a stated objective that builds on such long term memory, and continually gives it "maintenance" so it's efficient and effective to answer. LLMs are awesome because even though there are many challenges ahead, they sort of give us a hint of what's possible, without them it would be hard to have the motivation to follow the road

  • @kingvanessa946
    @kingvanessa946 Před 3 měsíci +1

    For me, this is the most easy-to-understand video to explain RAG!

  • @jyhherng
    @jyhherng Před 6 měsíci +5

    this let's me understand why the embeddings used to generate the vectorstore is a different set from the embeddings of the LLM... Thanks, Marina!

  • @user-cd6hp5kc1n
    @user-cd6hp5kc1n Před 7 měsíci +16

    The ability to write backwards, much less cursive writing backwards, is very impressive!

    • @IBMTechnology
      @IBMTechnology  Před 7 měsíci +7

      See ibm.biz/write-backwards

    • @jsonbourne8122
      @jsonbourne8122 Před 5 měsíci

      Left hand too!

    • @NishanSaliya
      @NishanSaliya Před 5 měsíci

      @@IBMTechnology Thanks .... I was reading comments to check for an answer for that question!

  • @projectfocrin
    @projectfocrin Před 5 měsíci +5

    Great explanation. Even the pros in the field I have never seen explain like this.

  • @444Yielding
    @444Yielding Před 18 dny +3

    This video is highly underviewed for as informative as it is!

  • @hamidapremani6151
    @hamidapremani6151 Před 2 měsíci

    The explanation was spot on!
    IBM is the go to platform to learn about new technology with their high quality content explained and illustrated with so much simplicity.

  • @paulaenchina
    @paulaenchina Před 3 měsíci +1

    This is the best explanation I have seen so far for RAG! Amazing content!

  • @TheMsksk
    @TheMsksk Před 8 měsíci

    Great video as always. Thanks for sharing.

  • @Shailendrashail
    @Shailendrashail Před 8 měsíci +1

    Good Explanation of RAG. Thanks for sharing.

  • @past_life_project
    @past_life_project Před 3 měsíci

    I have watched many IBM videos and this is the undoubtedly the best ! I will be searching for your videos now Marina!

  • @rujmah
    @rujmah Před 2 měsíci

    Brilliant explanation and illustration. Thanks for your hard work putting this presentation together.

  • @vnaykmar7
    @vnaykmar7 Před 5 měsíci +2

    Such an amazing explanation. Thank you ma'am!

  • @kunalsoni7681
    @kunalsoni7681 Před 6 měsíci

    Thanks for letting us know about this feature of LLM :)

  • @afshinkarimi2382
    @afshinkarimi2382 Před 8 měsíci

    Great video. Thanks for sharing

  • @rvssrkrishna2
    @rvssrkrishna2 Před 2 měsíci

    Very precise and exact information on RAG in a nutshell. Thank you for saving my time.

  • @gaemrpaterso-ri2jd
    @gaemrpaterso-ri2jd Před 8 měsíci

    Great video, you guys should do one on promising tech industries

  • @francischacko3627
    @francischacko3627 Před 15 dny

    perfect explanation understood every bit , no lags kept it very interesting ,amazing job

  • @mstarlingc
    @mstarlingc Před 5 měsíci +1

    Pretty simple explanation, thank you

  • @Anubis2828
    @Anubis2828 Před měsícem

    Great, simple, quick explanation

  • @Aryankingz
    @Aryankingz Před 6 měsíci +3

    That's what Knowledge graphs are for, to keep LLMs grounded with a reliable source and up-to-date.

  • @rafa1rafa
    @rafa1rafa Před 5 měsíci +2

    Great explanation! The video was very didactic, congratulations!

  • @evaiintelligence
    @evaiintelligence Před 19 dny

    Marina has done a great job explaining LLM and RAGs in simple terms.

  • @sawyerburnett8319
    @sawyerburnett8319 Před 3 měsíci +1

    Wow, having a lightbulb moment finally after hearing this mentioned so often. Makes more sense now!

  • @bdouglas
    @bdouglas Před měsícem

    That was excellent, simple, and elegant! Thank you!

  • @HimalayJoriwal
    @HimalayJoriwal Před 2 měsíci

    Best explanation so far from all the content on internet.

  • @janhorak8799
    @janhorak8799 Před 2 měsíci +9

    Did all the speakers have to learn how to write in a mirrored way or is this effect reached by some digital trick?

    • @VlogBySKSK
      @VlogBySKSK Před 23 dny

      There is a digital mirroring technique which is used to show the content this way...

    • @mao-tse-tung
      @mao-tse-tung Před 15 dny +1

      She was right handed before the mirror effect

  • @rockochamp
    @rockochamp Před 5 měsíci +1

    very well executed presentation.
    i had to think twice about how you can write in reverse but then i RAGed my system 2 :)

  • @kallamamran
    @kallamamran Před 4 měsíci +2

    We also need the models to cross check their own answers with the sources of information before printing out the answer to the user. There is no self control today. Models just say things. "I don't know" is actually a perfectly fine answer sometimes!

  • @toenytv7946
    @toenytv7946 Před 2 měsíci +1

    Great down the rabbit hole video. Very deep and understandable. IBM academy worthy in my opinion.

  • @johnmccullough7084
    @johnmccullough7084 Před 6 měsíci +1

    Appreciate the succinct explanation. 👍

  • @laurentpastorelli1354
    @laurentpastorelli1354 Před 4 měsíci +1

    Super good and clear, well done!

  • @user-im6ub3sf6m
    @user-im6ub3sf6m Před 3 měsíci

    Great explanation with an example. Thank you

  • @khalidelgazzar
    @khalidelgazzar Před 4 měsíci +2

    Great explanation. Thank you!😊

  • @user-hk5dk9rb6p
    @user-hk5dk9rb6p Před 3 měsíci +1

    Fantastic video and explanation. Thank you!

  • @gbluemink
    @gbluemink Před 3 měsíci +1

    So the question I have here is when I have an answer from my LLM but not the Rag data, what is the response to the user? "I don't know" or the LLM response that may be out of date or without a reliable source? Looks like a question for an LLM :)

  • @user-bo1kv5zy3w
    @user-bo1kv5zy3w Před 6 měsíci

    Awesome explanation. Love you.

  • @preciousrose2715
    @preciousrose2715 Před 21 dnem

    This was such an amazing explanation!

  • @lauther_27
    @lauther_27 Před 5 měsíci +1

    Amazing video, thanks IBM ❤

  • @PaulGrew-wl7mh
    @PaulGrew-wl7mh Před měsícem

    An amazing explanation that made RAG understandable in about 4:23 minutes!

  • @oieieio741
    @oieieio741 Před 5 měsíci

    Very Helpful! Great explanation. thx IBM

  • @zuzukouzina-original
    @zuzukouzina-original Před 3 měsíci

    Very clear explanation, much respect 🫡

  • @ashwinkumar675
    @ashwinkumar675 Před 14 dny

    This is so well explained! Thank you 👍🏻✅

  • @mohamadhijazi3895
    @mohamadhijazi3895 Před 27 dny

    The video is short and consice yet the delivery is very elegant. She might be the best instructor that have teached me. Any idea how the video was created?

  • @star2k279
    @star2k279 Před 4 měsíci +1

    Thank you for such a great explanation.

  • @Kekko400D
    @Kekko400D Před 3 měsíci

    Fantastic explanation, proud to be an IBMer

  • @eddisonlewis8099
    @eddisonlewis8099 Před 3 měsíci

    AWESOME EXPLANATION OF THE CONCEPT RAG

  • @AC-xd7sw
    @AC-xd7sw Před 4 měsíci

    Insightful, please more video like this

  • @lewi594
    @lewi594 Před 4 měsíci +1

    This is brilliant and concise, helped make sense of a complex subject..
    Can this be implemented in a small environment with limited computing? Such that the retriever only has access to a closed data source

  • @Junglytics
    @Junglytics Před 3 měsíci

    Great video, excellent explanation!

  • @neutron417
    @neutron417 Před 4 měsíci +2

    From which corpus/database are the documents retrieved from? Are they up-to date? and how does it know the best documents to select from a given set?

  • @AdarshKumar-kx2cn
    @AdarshKumar-kx2cn Před 2 měsíci

    Beautifully explained....thanks

  • @katsunoi
    @katsunoi Před 5 měsíci

    nice video - great explanation!

  • @terencelewis4985
    @terencelewis4985 Před 3 měsíci

    Excellent explanation!

  • @sprintwithcarlos
    @sprintwithcarlos Před 5 měsíci

    Great explanation!

  • @JasonVonHolmes
    @JasonVonHolmes Před měsícem

    This was explained fantastically.

  • @ericmcnally5128
    @ericmcnally5128 Před 2 měsíci

    This is a fantastic lesson video.

  • @thomasbrowne6649
    @thomasbrowne6649 Před měsícem

    This is excellent and I hope IBM does well in this space. We need a reliable, non-hype vendor.

  • @MraM23
    @MraM23 Před 3 měsíci

    Great lessons! Nice of you to step out 🙃 and make such engaging and educative content This is a very useful in helping us in critical thinking. Thank you for sharing this video. 👍
    Current ai models may impose neurotypical norms and expectations based on current data trained on . 🤔
    Curious to see more on how IBM approach the challenges and limitations of Ai

  • @sumedhaj9017
    @sumedhaj9017 Před 2 měsíci

    Amazing explanation! Thank you:)

  • @alexiojunior7867
    @alexiojunior7867 Před 25 dny

    wow this was an amazing Explanation ,very easy to understand

  • @DaveEtchells
    @DaveEtchells Před 5 měsíci +1

    *Excellent* explanation, you gave me all the key concepts in one shot.
    I gather that the retrieval could be in various forms, for instance a vector database in addition to direct text from internet searches?

    • @scottt9382
      @scottt9382 Před 4 měsíci

      a RAG target can be any associated data store (PDF repo, Sharepoint, Google Drive,..) that can be accessed via a query - the LLM has a semantic understanding of the prompt and the queries are the output of the LLM

  • @AntenorTeixeira
    @AntenorTeixeira Před 4 měsíci

    That's the best video about RAG that I've watched

  • @user-xf4vm2gf6g
    @user-xf4vm2gf6g Před 3 měsíci

    Excellent ! thank you for sharing this knowledge !

  • @mohammadsubhan1318
    @mohammadsubhan1318 Před 3 měsíci

    Nicely explained 👍

  • @aniket_mishr
    @aniket_mishr Před 29 dny

    The explanation was very good 💯.

  • @MarshallMelnychuk
    @MarshallMelnychuk Před 7 měsíci +9

    Thank you Marina, very helpful and informative video. One question I have is; how do you make these videos like this? Being able to on a screen facing the camera, this is great. What's your secret?

    • @PeterCooperUK
      @PeterCooperUK Před 7 měsíci +10

      Sometimes these are done on transparent "whiteboards" and the video is then flipped horizontally.

    • @ChristopherSmithGPlus
      @ChristopherSmithGPlus Před 7 měsíci

      czcams.com/video/eVOPDQ5KYso/video.htmlsi=LADnROL0SF33Hg54

    • @IBMTechnology
      @IBMTechnology  Před 7 měsíci +15

      See ibm.biz/write-backwards

    • @lowkeyproducktvt2101
      @lowkeyproducktvt2101 Před 6 měsíci +1

      @@IBMTechnology okay now i get it !!!

    • @rayuduaddagarla3857
      @rayuduaddagarla3857 Před 6 měsíci +3

      IBM should hire left hand writers so it will right handed after flip 😊

  • @shinemuphy
    @shinemuphy Před 5 měsíci

    Excellent explanation. thx

  • @deltawhiplash1614
    @deltawhiplash1614 Před 4 dny

    This is a really good video thank you for sharing this knowledge

  • @421sap
    @421sap Před 5 měsíci

    Thank you, Marina Danilevsky ....

  • @BooleanDisorder
    @BooleanDisorder Před 5 měsíci +1

    Thank you for these videos. Makes it much easier to nagivate this new AI-ra of machine learning.

  • @toddlilly4780
    @toddlilly4780 Před 8 měsíci +4

    Love this! How is she writing on the glass? Is she writing backward? Or how does that work?

    • @IBMTechnology
      @IBMTechnology  Před 8 měsíci +7

      See ibm.biz/write-backwards

    • @johncornell2498
      @johncornell2498 Před 7 měsíci +2

      Lol, so simple when explained@@IBMTechnology, thanks I can actually pay attention to what she's saying now 😆

  • @paulc465
    @paulc465 Před 5 měsíci

    thank you. very informative!

  • @rnat9246
    @rnat9246 Před 8 měsíci +1

    How do you built the connector from LLM to external sources? Usually a foundation model is frozen.

    • @GeandersonLenz
      @GeandersonLenz Před 7 měsíci

      Langchain

    • @hammadusmani7950
      @hammadusmani7950 Před 6 měsíci +1

      In it's simplest form, you can just copy paste the external source into the prompt. Python is a way to automate that.

  • @carvalhoribeiro
    @carvalhoribeiro Před měsícem

    Amazing work. Thanks for sharing this.

  • @johanhumblet6090
    @johanhumblet6090 Před 4 měsíci

    very well explained!

  • @prasannakulkarni5664
    @prasannakulkarni5664 Před 27 dny

    the color coding on your whiteboard is really apt here !

  • @rahulberry4806
    @rahulberry4806 Před 11 dny

    thanks for the great explanation

  • @shashankshekharsingh9336

    very good and clear explanation

  • @berkeokur99
    @berkeokur99 Před 5 měsíci

    Love the neon markers, also the content of course

  • @sk-6032
    @sk-6032 Před 6 hodinami

    Very well explained 🙏🏼👍

  • @user-uk9mt4ue6w
    @user-uk9mt4ue6w Před 4 měsíci +1

    Все толково, четко и понятно. Респект автору.

  • @mrhassell
    @mrhassell Před 20 dny

    RAG combines the generative power of LLMs with the precision of specialized data search mechanisms, resulting in nuanced and contextually relevant responses.