How to build a retrieval-augmented generation (RAG) AI system in Java (Spring Boot + LangChain4j)
- Added 21 Aug 2024
- Learn how to build an AI-powered application that knows your business context and is able to interact with your Java code. Recorded at Vaadin Create 2023.
Speaker: Marcus Hellberg, VP of DevRel, Vaadin
Yo, I have a question: you've specified some arbitrary embedding model to convert the text chunks into vectors to be stored in the database. Are these vectors also sent to GPT-4 as context? Doesn't GPT-4 only accept OpenAI's proprietary embedding models, such as ada-002 and their new v3 models?
Hey, if I got it right, embeddings are only used to find document chunks similar to the question. Those chunks are then provided to the model as plain-text context.
@vladislavknyshov3482 Thanks, I misinterpreted the OpenAI API. Apparently the embedding model is completely detached from the LLM.
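To illustrate the point made in this thread, here is a minimal sketch in plain Java. It is not the code from the talk and does not use LangChain4j. The class `RagSketch`, its methods, and the sample chunks are all hypothetical, and the word-count `embed` function is only a toy stand-in for a real embedding model. What it tries to show is that vectors are used only to rank chunks against the question, while the prompt sent to the chat model contains just text.

```java
import java.util.ArrayList;
import java.util.Comparator;
import java.util.HashMap;
import java.util.List;
import java.util.Map;

// Hypothetical illustration only; a real system would call an embedding model
// and a vector store instead of the toy functions below.
class RagSketch {

    // Toy stand-in for an embedding model: a sparse vector of word counts.
    static Map<String, Integer> embed(String text) {
        Map<String, Integer> vector = new HashMap<>();
        for (String token : text.toLowerCase().split("[^a-z0-9]+")) {
            if (!token.isEmpty()) {
                vector.merge(token, 1, Integer::sum);
            }
        }
        return vector;
    }

    // Euclidean length of a sparse vector.
    static double norm(Map<String, Integer> v) {
        double sum = 0;
        for (int x : v.values()) {
            sum += (double) x * x;
        }
        return Math.sqrt(sum);
    }

    // Cosine similarity between two sparse vectors (0 if either is empty).
    static double cosine(Map<String, Integer> a, Map<String, Integer> b) {
        double dot = 0;
        for (Map.Entry<String, Integer> e : a.entrySet()) {
            dot += e.getValue() * b.getOrDefault(e.getKey(), 0);
        }
        double na = norm(a);
        double nb = norm(b);
        return (na == 0 || nb == 0) ? 0 : dot / (na * nb);
    }

    // Retrieval step: vectors are used here, and only here, to rank the chunks.
    static List<String> retrieve(String question, List<String> chunks, int k) {
        Map<String, Integer> q = embed(question);
        List<String> ranked = new ArrayList<>(chunks);
        ranked.sort(Comparator.comparingDouble((String c) -> -cosine(q, embed(c))));
        return new ArrayList<>(ranked.subList(0, Math.min(k, ranked.size())));
    }

    // Generation step: the chat model would receive this text, never the vectors.
    static String buildPrompt(String question, List<String> context) {
        return "Answer using only this context:\n"
                + String.join("\n", context)
                + "\n\nQuestion: " + question;
    }

    public static void main(String[] args) {
        List<String> chunks = List.of(
                "Vaadin Flow lets you build web UIs in Java.",
                "Our refund policy allows returns within 30 days.",
                "The office is closed on public holidays.");
        String question = "What refund policy applies to returns?";
        System.out.println(buildPrompt(question, retrieve(question, chunks, 1)));
    }
}
```

Because the chat model only ever sees the assembled prompt string, the embedding model used for retrieval can come from a different vendor than the chat model.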
So what if I don't have a GPT-4 subscription and I generate an API key nonetheless? Will my API key still work in this case, where our tokenizer and streamingChatLanguageModel both use GPT-4?
"and it all of a sudden knows everything about Vaadin, and can put me out of a job. So I don't know why I created that" 🤣
Source code, please?