Video není dostupné.
Omlouváme se.

Multi-Modal RAG: Chat with Text and Images in Documents

Sdílet
Vložit
  • čas přidán 18. 08. 2024

Komentáře • 24

  • @engineerprompt
    @engineerprompt  Před měsícem

    If you want to learn RAG Beyond Basics, checkout this course: prompt-s-site.thinkific.com/courses/rag

  • @stressrelaxationmusicchann4638

    Hey this is amazing and i kindly request you to upload some videos how can we work with pdf document extraction for text ,tables, images graphs etc.. in the documents for rag application

  • @aa-xn5hc
    @aa-xn5hc Před měsícem +2

    These rag videos are super interesting

  • @wtcbretburstjk3726
    @wtcbretburstjk3726 Před měsícem +2

    thank you, keep it coming chief great work !

  • @IdPreferNot1
    @IdPreferNot1 Před měsícem

    Such great code explanation and layout... so many Gist-able functions...thanks!!

  • @zoranProCode
    @zoranProCode Před měsícem

    Why it’s exactly 10x better?! Maybe it’s just better?

  • @roip429
    @roip429 Před měsícem

    Excellent tutorial!
    Can you share the .ipynb please

  • @alpcan3777
    @alpcan3777 Před 15 dny

    Thanks for great video. Is it possible to take both input image and text from user and query this? For example, user will upload its car image and ask about similar cars with lowest price based on the uploaded image. Then the system retrieve related car image and text from database.

  • @AEismann-d6c
    @AEismann-d6c Před měsícem

    I wonder how much time before we will be able to run this locally, and then what would be a good model. So far from my testing nothing could compare to GPT-4... Thanks for the video

    • @free_thinker4958
      @free_thinker4958 Před měsícem

      CLaude 3.5 sonnet is far more performant than any model now

    • @engineerprompt
      @engineerprompt  Před měsícem +1

      local vision models have still a long way to go. But hopefully we will have something "good enough" soon.

  • @VidishArvind
    @VidishArvind Před měsícem

    Can u make the same thing using free api models cause gpt api ain't free. Also a guide to host it on a cloud would also be great. End to end app deployed on cloud

  • @Know_Ur_World
    @Know_Ur_World Před 21 dnem

    Can u use pdf containing images instead of this text data and image data

  • @amanharis1845
    @amanharis1845 Před měsícem

    Hi, I had a small doubt. Doesn't the Langchain's document loaders extract image from the document?

    • @engineerprompt
      @engineerprompt  Před měsícem +1

      No, by default, its does not. You can use something like unstructedio that can extract images and tables. Will create a video on it soon.

    • @amanharis1845
      @amanharis1845 Před měsícem +1

      @@engineerprompt I have actually built a RAG chatbot using Langchain for my organisation. The pdf that we load usually contains lots of tables and few images. So far it is giving good responses from those PDFs. But ya if there is a method to extract these non text datas more efficiently, I'll definitely want to integrate with my chatbot.

    • @aadarshunniwilson8517
      @aadarshunniwilson8517 Před 26 dny

      ​@@engineerprompt any updates on this.

  • @TheAstralftw
    @TheAstralftw Před měsícem +1

    This is nice demo but really useless in real world scenarios because you can maybe extract those images from wiki, but you can not from specific PDF file.. but it is still nice demo, but not very useful in real world projects where you need to build specific app .. still good thing for someone who wants to learn

  • @kishorethota9959
    @kishorethota9959 Před 19 dny

    Can we get the code?