RAG Implementation using Mistral 7B, Haystack, Weaviate, and FastAPI

AI Anytime

zhlédnutí 26 265

Přidat do
- Můj playlist
- Přehrát později
Sdílet

Sdílet

Vložit

Velikost videa:

Zobrazit ovladače přehrávání

Automatické přehrávání

Přehrát

čas přidán 22. 08. 2024

Komentáře • 74

@jorgerios4091 Před 10 měsíci ⁺³
Oh Man, you rock! it seems like you read your followers' minds. Thank you so much for this great video.
@AIAnytime Před 10 měsíci ⁺²
Yeah Non Invasive models in brain has been implanted 😂.. thanks btw
@talhaabdulqayyum193 Před 10 měsíci ⁺⁵
PLease upload on the Fine tuning of the Llama 2 models (7B, 13B et), especially regarding how to create a data set for a chatbot to model fine tuning and model inference
@shivamroy1775 Před 10 měsíci ⁺³
Absolutely great quality content. Please continue the great work, These videos are extremely useful.
@AIAnytime Před 10 měsíci ⁺¹
Thanks Shivam... sure more on the cards.
@MohamElY Před 3 měsíci
Haystack the same time @@AIAnytime
@jorgerios4091 Před 10 měsíci ⁺⁴
Hello.
I've been thinking about ways to improve response speed in this example, and I came across fine-tuning the model using autotrain. I was wondering if fine-tuning the model with GPU support from platforms like Kaggle or Colab could be a viable option for achieving faster responses.
I think it would be really interesting if you could explore this topic in one of your videos and show us how to fine-tune a model using GPU support (this same example). It would be great to see the process in action and learn more about the benefits and considerations involved in fine-tuning models for faster response times.
Looking forward to your insights on this topic! Keep up the great work on your channel.
Jorge.
@user-rr8qz5us1m Před 8 měsíci
This should be addressed
@sneharoy3566 Před 10 měsíci ⁺¹
This is just next level
@giridharreddy7011 Před 10 měsíci ⁺⁴
Sir, can you please show how to run it in Mac M1 system as well
@ajayanilkumar1645 Před 10 měsíci ⁺¹
Yes please sir, it would be helpful for me also
@afj4ojfa340 Před 10 měsíci
Thank you for doing a video on this!
@AIAnytime Před 10 měsíci
Glad it was helpful!
@rajkachhadiya6192 Před 10 měsíci
really very informative video. Thank you dear for your efforts and informative video. I appreciate your effort and content dear. Keep it up. Once again thank you ❤❤❤
@AIAnytime Před 10 měsíci
It's my pleasure
@Techonsapevole Před 10 měsíci
Thanks, great tutorial
@AIAnytime Před 10 měsíci
Glad it was helpful!
@ancient_living Před 9 měsíci ⁺¹
Hi,
The embedding dimension specified is 768 and you are using a 384 model. It throws error, can you help?
@Sulayman.786 Před 10 měsíci ⁺²
AI now?! Let's go!
@AIAnytime Před 10 měsíci
Let's go
@manishpatil09 Před 8 měsíci ⁺¹
At 1:40 you said you are going to use "Vector Database" not "Vector Store". Can somebody please tell me the difference between them.
@nogueirad Před 5 měsíci
Hi! Duda from Weaviate here. They are the same. Langchain calls the. Vector Store, but they are the same :)
@TrueWarship Před 6 měsíci
hi i really love you video weldone but am finding it hard to install haystack on my system it saying haystack.nodes is not found.
please any help
@sampriti6026 Před 7 měsíci
Which sentence transformer are we using exactly? And why not any top model from mteb ? What are the drawbacks? Would be helpful if you can clarify.
Great video tho❤
@ashishkgp Před 5 měsíci ⁺¹
THIS GITHUB REPO IS NOT UPDATED AND IS GIVING MULTIPLE ISSUES!
@casualdiscord Před 7 měsíci
It seems like there's a mismatch that I can't figure out. The model has dimensions 384 and the datastore 768. So it won't update any dimensions:
RuntimeError: Embedding dimensions of the model (384) don't match the embedding dimensions of the document store (768). Initiate WeaviateDocumentStore again with arg embedding_dim=384.
What am I missing?
@lukasvanuden6049 Před 5 měsíci
Can you do it without Docker?
Since Docker needs a commercial licence for bigger companies..
@rpmknowledge3067 Před 10 měsíci ⁺¹
thankyou sir sir i want use sqldatabase inplace of pdf and etc
so please make video for mysql
@rohanghogare5766 Před 2 měsíci
This git repository has not been updated. Can you upload models, template files/folders
@davidfa7363 Před 8 měsíci ⁺¹
Hi. I am building a chatbot with my pdf document. I created an endpoint in Fastapi. If i send 1 request the answer is really fast, but if i try to send 100 requests in the same time the response time is so much bigger. How can i run those requests paralell using gpu?
@thehkmalhotra9714 Před 8 měsíci
+1
@khanmahmuna Před 6 měsíci
thanks for the informative video,can you upload a same tutriol video for different types of document summarization implementing RAG and open source LLM not OpenAI asap...it would be greatful need help in that and not found any related video for summarization .
@atulya.anurag Před 10 měsíci ⁺²
help !!
Tried everything i know to resolve this problem (I have cloned your git repo)
Error: ImportError: cannot import name 'Document' from 'haystack' (E:\ATULYA\Haystack\myenv\Lib\site-packages\haystack\__init__.py)
@pratheeshp.m.4078 Před 10 měsíci
im getting same error on mac python 3.11
@pratheeshp.m.4078 Před 10 měsíci ⁺¹
is there any particular version of python or haystack we have to use?
@tareeshmuluguru395 Před 10 měsíci
did you get the solution for this error??
@andrejusb1 Před 10 měsíci ⁺¹
I had same problem, fixed it by updating requirements.txt. Remove "farm-haystack" and keep only "farm-haystack[weaviate]". You need to pip uninstall farm-haystack. It seems there is some conflict, when both libs are installed and some classes are not visible
@RedCloudServices Před 10 měsíci ⁺¹
thanks again this is my exact use case here in DC! Just testing different opensource stacks there is a great langchain interview w Weviate discussing the technique of using a summary vector in each prompt to improve LLM responses. How would you improve the response time?
@AIAnytime Před 10 měsíci
Yeah that's a very good idea. Those can be used in memory as well. Give it a try.
Let me know what you build. Thanks
@rachadnajjar Před 7 měsíci
Hi , thank you for the tutorial , when i'm install the requirements.txt , I got this error message : ERROR: Could not find a version that satisfies the requirement torch (from versions: none)
ERROR: No matching distribution found for torch . Can you please advise and help? Thank you
@TJ-hs1qm Před 10 měsíci
awesome thank you!
@AIAnytime Před 10 měsíci
You're welcome!
@VaibhavPatil-rx7pc Před 8 měsíci
excellent !
@AIAnytime Před 8 měsíci
Many thanks!
@AminovJP Před 10 měsíci
Thank you very much, you are fantastic, you could be a teacher your natural way of talking is very captivating !!!!, what do you think about PGVector ?
@AIAnytime Před 10 měsíci
Thank you! 😃 yes You can try Supabase. It has a PG extension i guess.
@SonGoku-pc7jl Před 9 měsíci ⁺¹
i try finishing this pending video, and i have a question. today i see your video of jina v2. in this video you says about model of incrustation in huggingface that name is multilingüe. Is possible to fusion? you can make a video for best implementations cases that jina 2 or ideas for other cases of language spanish or catalan jeje :P thanks for your content! is increible!
@AIAnytime Před 9 měsíci
Great suggestion. Will create one soon...
@MohamElY Před 3 měsíci
❤❤Viagra
@gogodo9934 Před 10 měsíci
Bro when’s the “deployment on aws” video coming out ?. It’s been more than a month since the 2 nd part released
@tareeshmuluguru395 Před 10 měsíci
i am facing an issue with embedding dimension. this is the error: Embedding dimensions of the model (384) don't match the embedding dimensions of the document store (768). Initiate WeaviateDocumentStore again with arg embedding_dim=384
@SanjayBalikar Před 9 měsíci
I had same issue try delete the docker image and again rerun everything starting from creating docker image.
@tarik1895 Před 29 dny
Can please explain where you download the two models you have the mistral and the llama
@AIAnytime Před 29 dny ⁺¹
Huggingface repo.
@jw200 Před 6 měsíci
can you explain why this all is good thing?
people talk about it as awesome stuff. but why?
@Sundarampandey Před 8 měsíci
bhai thoda mike ki quality sai krr lo yaar
aawaz chubh rai h kasam se
Baaki great knowledge
keep up the good work
@wtfJonKnowNothing Před 10 měsíci
I just love how you pronounce massine 😂
@AIAnytime Před 10 měsíci
Lol 😂
@networks6111 Před 5 měsíci
Hey bro! very nice video, what are the system specs for running this project? Is 8gb RAM enough, to produce results if we use Quantised Bloke mistral model
@usmanyousaaf Před 10 měsíci
Hi sir
Am making a pdf questions answers chatbot but the issues ia my dataset is to dirty and aslo very small
So what's u r suggestion without changing the dataset how can i achieve the average accuracy
I know its a silly question but ...
@SaiRohith08 Před 10 měsíci
Hello! can i add multiple pdfs and can it read images in pdfs?
@TJ-hs1qm Před 10 měsíci ⁺¹
Dependency management in Python is a pain. Which exact package version to install? Nobody knows 😂
@AIAnytime Před 10 měsíci ⁺²
Use poetry for managing the versions.
@Udayanverma Před 7 měsíci
same! ImportError: cannot import name 'send_event' from 'haystack.telemetry' (/usr/local/lib/python3.8/site-packages/haystack/telemetry/__init__.py)
@TJ-hs1qm Před 10 měsíci
check your mic or audio settings. totally over-driving the sound 🤗
@AIAnytime Před 10 měsíci
Thanks for the tip!
@vivekmishra69 Před 9 měsíci
Hey did you try fine-tuning mistral on custom domain data? Can you recommended GPU for fine tuning 7B models such as Mistral, Llama etc..?
@AIAnytime Před 9 měsíci ⁺¹
Take a rental GPU on Runpod or A100 or v100 Google Colab with high RAM.
@souvickdas5564 Před 10 měsíci
Please provide the link for downloading the quantized Mistral model
@AIAnytime Před 10 měsíci ⁺³
Here you go: huggingface.co/TheBloke/Mistral-7B-Instruct-v0.1-GGUF
@oryxchannel Před 10 měsíci
_fire_
@user-iu4id3eh1x Před 10 měsíci
Weaviate is better or Pinecone?
@AIAnytime Před 10 měsíci
Pinecone premium is better. Weaviate is also good as open sourced.

Další v pořadí

Automatické přehrávání

RAG But Better: Rerankers with Cohere AI