Llama 3 RAG Demo with DSPy Optimization, Ollama, and Weaviate!

Sdílet
Vložit
  • čas přidán 5. 09. 2024

Komentáře • 26

  • @dianaliu7543
    @dianaliu7543 Před 4 měsíci +4

    This is so different from RAG using GPT. Lots to learn

  • @LaHoraMaker
    @LaHoraMaker Před 4 měsíci +4

    Dear Connor, that was the fastest release ever!

  • @cipritom
    @cipritom Před 4 měsíci +6

    Is it me, or at least the last part is a digital avatar?

  • @VincentFulco
    @VincentFulco Před 4 měsíci +1

    Would love to see an interface to groq please!

  • @catchychazz
    @catchychazz Před 4 měsíci +1

    Great video, Connor. Have you tested out if SAMMO is better than DSPy for production?

    • @larsbell1569
      @larsbell1569 Před 4 měsíci

      I thought SAMMO was primarily prompt templating? Does it have some new features we are not aware of?

    • @catchychazz
      @catchychazz Před 4 měsíci

      @@larsbell1569 No, you're right. I was working on the assumption that in production, you'd be using the most capable language models.
      At the risk of running token-heavy pre-compiled DSPy prompts, having a simpler prompt that automatically augments few-shot prompting only on specific user inputs/triggered events might be a more cost-efficient solution.

  • @tobkin
    @tobkin Před 4 měsíci

    Two questions:
    - Why use gpt-4 instead of gpt-4-turbo for the teleprompter?
    - What are you using to make your pointer act like that?

  • @Tarun_Mamidi
    @Tarun_Mamidi Před 4 měsíci +1

    Is llama really OSS if we don’t know how or what it is trained on?

  • @MrjbushM
    @MrjbushM Před 4 měsíci

    Thank you.

  • @420_gunna
    @420_gunna Před 4 měsíci +4

    this mf cookin

  • @charismaowojoameh7681
    @charismaowojoameh7681 Před 3 měsíci

    I tried the implementation but i keep getting the error "model not found"

  • @PeterWilliams97
    @PeterWilliams97 Před 4 měsíci +1

    I ran your notebook and got the following error.
    print(RAG()("What is binary quantization?").answer)
    AttributeError Traceback (most recent call last)
    Cell In[7], line 1
    ----> 1 print(RAG()("What is binary quantization?").answer)
    File ~/code/vector_search/weaviate/recipes/.wenv/lib/python3.11/site-packages/dspy/primitives/program.py:26, in Module.__call__(self, *args, **kwargs)
    25 def __call__(self, *args, **kwargs):
    ---> 26 return self.forward(*args, **kwargs)
    Cell In[6], line 16
    15 def forward(self, question):
    ---> 16 context = self.retrieve(question).passages
    17 pred = self.generate_answer(context=context, question=question).answer
    18 return dspy.Prediction(context=context, answer=pred, question=question)
    File ~/code/vector_search/weaviate/recipes/.wenv/lib/python3.11/site-packages/dspy/retrieve/retrieve.py:30, in Retrieve.__call__(self, *args, **kwargs)
    29 def __call__(self, *args, **kwargs):
    ---> 30 return self.forward(*args, **kwargs)
    File ~/code/vector_search/weaviate/recipes/.wenv/lib/python3.11/site-packages/dspy/retrieve/retrieve.py:39, in Retrieve.forward(self, query_or_queries, k)
    36 # print(queries)
    37 # TODO: Consider removing any quote-like markers that surround the query too.
    38 k = k if k is not None else self.k
    ---> 39 passages = dsp.retrieveEnsemble(queries, k=k)
    40 return Prediction(passages=passages)
    ...
    79 .do()
    81 results = results["data"]["Get"][self._weaviate_collection_name]
    82 parsed_results = [result[self._weaviate_collection_text_key] for result in results]
    AttributeError: 'WeaviateClient' object has no attribute 'query'

    • @connor-shorten
      @connor-shorten  Před 4 měsíci

      Hey Peter! Apologies we have upgraded the WeaviateRM to use the Weaviate v4 client, can you please try upgrading dspy with `!pip install dspy-ai --upgrade` ?

    • @connor-shorten
      @connor-shorten  Před 4 měsíci

      Can you please share any error messages as an Issue on Weaviate recipes? It might be easier to help debug there instead of CZcams comments.

    • @LyuboslavPetrov
      @LyuboslavPetrov Před 4 měsíci

      @connorshorten6311 Please do update the video with accurate setup instructions. I have been fighting to get this running (DSPY + Weaviate + OLLAMA) for the past 2-3 hours to no avail. Tried multiple weaviate-client/server combinations, ran trough docker and standalone, configured, updated/downgraded dspy-ai. Went through so many help pages, cannot count now. I am tired, but still would like to play with this set of technologies. Thanks

  • @RichardHamnett
    @RichardHamnett Před 3 měsíci +1

    The recipe is gone

    • @connor-shorten
      @connor-shorten  Před 3 měsíci

      Hey Richard! Sorry we refactored recipes! The links are now fixed!

  • @human_agi
    @human_agi Před 4 měsíci +1

    how to install ollama

  • @koljanos
    @koljanos Před 4 měsíci

    Hey, what version of Weaviate-client you are using????

    • @connor-shorten
      @connor-shorten  Před 4 měsíci

      Hey! I am using v4 and the latest version of dspy-ai, can you please share any error messages as an Issue on Weaviate recipes? It might be easier to help debug there instead of CZcams comments.

    • @koljanos
      @koljanos Před 4 měsíci

      Allright, will check the issues!

  • @SolSay
    @SolSay Před měsícem

    this guy looks like AI)

  • @leeme179
    @leeme179 Před 4 měsíci +3

    maybe just me but the blur/smooth filter to the face cam makes me suspect that face cam is AI generated 🤣🤣🤣🤣🤣🤣

    • @dixon1e
      @dixon1e Před 4 měsíci

      Uncanny valley feel.