How to build a retrieval-augmented generation (RAG) AI system in Java (Spring Boot + LangChain4j)

Sdílet
Vložit
  • čas přidán 21. 08. 2024
  • Learn how to build an AI powered application that knows your business context and is able to interact with your Java code. Recorded at Vaadin Create 2023.
    Speaker: Marcus Hellberg, VP of DevRel, Vaadin

Komentáře • 7

  • @mustiosrs
    @mustiosrs Před 5 měsíci

    Yo I have a question: You've specified some arbitrary embedding model to convert the text chunks into vectors to be stored into the database. Are these vectors also sent to GPT-4 as context? Doesn't GPT-4 only accept their proprietary embedding models such as ada-002 and their new v3 models?

    • @vladislavknyshov3482
      @vladislavknyshov3482 Před 5 měsíci +2

      Hey, if I got it right, embeddings are only used to find document chunks similar to the question. Then those chunks provided to the model as text context

    • @mustiosrs
      @mustiosrs Před 5 měsíci +2

      @@vladislavknyshov3482 Thanks, I misinterpreted the API of OpenAI, apparentlt the embedding model is completely detached from the LLM AI

  • @TobilobaOwolabi-wy8cq
    @TobilobaOwolabi-wy8cq Před 4 měsíci

    So what if I dont have a GPT-4 subscription and i generate an API key nonetheless, will my api key still work in this case where our tokenizer and streamingChatLanguageModel all use GPT-4?

  • @MiguelAngel-ug3bh
    @MiguelAngel-ug3bh Před 6 měsíci

    "and it all of a sudden knows everything about Vaadin, and can put me out of a job. So I don`t know why I created that" 🤣

  • @HadraouiZakaria
    @HadraouiZakaria Před měsícem

    Resource code plz