Search Your PDF App using Langchain, ChromaDB, and Open Source LLM: No OpenAI API (Runs on CPU)

  • Added 22 Aug 2024
  • This tutorial video shows how to build a search tool for your PDF documents with Langchain, ChromaDB, and an open-source LLM, all running on your CPU.
    Langchain is a library for building generative AI applications on top of language models.
    ChromaDB acts as the vector store: it stores the vector representations of the PDF documents and retrieves them efficiently, so the search tool can find the passages most relevant to a query.
    Finally, an open-source LLM (large language model) adds question answering: it takes the user's query together with the relevant passages from the PDF documents and produces a context-aware answer.
    The tutorial does not rely on the OpenAI API. The whole system runs on your CPU without external services, which gives you more control and privacy.
    Join us as we walk through building your own PDF search tool with Langchain, ChromaDB, and an open-source LLM, tailored to your requirements.
    AI Anytime's GitHub: github.com/AIA...
    LaMiNi LM Model: huggingface.co...
    ChromaDB: www.trychroma....
    Langchain: python.langcha...
    LLM Playlist: • Large Language Models
    Join WhatsApp: chat.whatsapp.....
    #langchain #python #ai
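
The retrieval step described above (split the PDF text into chunks, turn each chunk into a vector, return the chunks closest to a query) can be sketched without any external library. This is a toy illustration under stated assumptions, not the code from the video: `embed` is a hypothetical word-count stand-in for a real embedding model, and the in-memory list stands in for ChromaDB.

```python
import math
import re
from collections import Counter


def embed(text: str) -> Counter:
    """Toy embedding: lowercase word counts (a stand-in for a real embedding model)."""
    return Counter(re.findall(r"[a-z0-9]+", text.lower()))


def cosine(a: Counter, b: Counter) -> float:
    """Cosine similarity between two sparse word-count vectors."""
    dot = sum(a[t] * b[t] for t in a.keys() & b.keys())
    norm = math.sqrt(sum(v * v for v in a.values())) * math.sqrt(sum(v * v for v in b.values()))
    return dot / norm if norm else 0.0


def chunk(text: str, size: int = 40, overlap: int = 10) -> list[str]:
    """Split text into overlapping chunks of `size` words."""
    words = text.split()
    if not words:
        return []
    step = size - overlap
    return [" ".join(words[i:i + size]) for i in range(0, max(len(words) - overlap, 1), step)]


def top_k(query: str, chunks: list[str], k: int = 2) -> list[tuple[float, str]]:
    """Return the k chunks most similar to the query, best first."""
    q = embed(query)
    scored = [(cosine(q, embed(c)), c) for c in chunks]
    return sorted(scored, key=lambda pair: pair[0], reverse=True)[:k]
```

In a real pipeline the top chunks would be passed to the LLM as context for the answer. A vector database does the same ranking, but over dense vectors and with an index that avoids comparing the query against every chunk.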

Comments • 201

  • @jorgerios4091
    @jorgerios4091 1 year ago +14

    Awesome, this is what real people need: a free alternative to OpenAI for custom-knowledge Q&A. I've tried privateGPT, but it is too slow, taking 2 minutes to provide answers while consuming 16 GB of RAM and working only in the command prompt. This model looks much better. I'll be looking forward to the chatbot variant. Thank you!

    • @AIAnytime
      @AIAnytime 1 year ago +1

      Thanks for your kind words! I agree with you. The chatbot video will be released by tomorrow.

  • @sandilemfazi
    @sandilemfazi 2 months ago +1

    You are a blessing my guy. Amazing, thanks for taking your time and teaching this. Truly Appreciate your efforts.

  • @rahulsingh-jk9qs
    @rahulsingh-jk9qs 2 hours ago

    Hi, thanks for your videos. I have watched them and subscribed because your way of explaining is very easy to understand.

  • @master8696
    @master8696 8 months ago +1

    You are doing a superb job! These videos not only give knowledge but also motivate us to learn GenAI and start writing a few pieces of code for the use cases. Keep doing such use-case-driven videos; trust me, the community will appreciate and embrace folks like you. Keep rocking!

  • @mcodetsh
    @mcodetsh 1 month ago +2

    Many of the settings and imports have been deprecated, and you will get many errors. I recommend not using this code but just learning the workflow and the thinking process. Thank you still for this video.

  • @ditchtech
    @ditchtech 1 year ago +4

    Instructive and thorough, appreciate your efforts!

    • @AIAnytime
      @AIAnytime 1 year ago

      Thank you Sir, please consider subscribing to the channel.

  • @RahulGupta-ub1op
    @RahulGupta-ub1op 4 months ago +3

    What if we ask a question that is not from the PDF?

  • @shruti2806
    @shruti2806 1 month ago

    13:30 why vectorstore?
    17:18 safe tensors
    20:12 project ideas
    40:00 device_map
    45:12 streamlit cache resource (decorator)
    49:00 chaintype
    1:01:47 token size (how it affects answers)

  • @ilaydelrey3122
    @ilaydelrey3122 8 months ago +2

    Thank you for putting this tutorial together. It would be great if you could also include the versions of the packages you use in your requirements.txt, because the packages change so fast and many things don't work anymore.

    • @253_r.asidharth8
      @253_r.asidharth8 8 months ago +1

      Did you manage to run the project?

    • @AIAnytime
      @AIAnytime 8 months ago +4

      Let me update the GitHub repo with the versions. Thanks.

    • @ilaydelrey3122
      @ilaydelrey3122 8 months ago

      @253_r.asidharth8 No, not yet, because of the newer package versions.

    • @adityapatel_00
      @adityapatel_00 8 months ago

      @AIAnytime May I know when you will update the requirements.txt file with the versions you used?

  • @vivekraj9333
    @vivekraj9333 11 months ago

    I'm glad that I found your channel brother 🙌❤

    • @AIAnytime
      @AIAnytime 11 months ago

      Welcome aboard! Thanks. Please keep supporting.

  • @mainakmukhrjee6328
    @mainakmukhrjee6328 10 months ago +4

    Hello sir, can you please help me with this error: ModuleNotFoundError: No module named 'langchain'? I have installed langchain and checked it with pip show langchain.

  • 6 months ago

    Very well explained video, and nice content. Congratulations!

  • @mohamedkeddache4202
    @mohamedkeddache4202 5 months ago +2

    It happens to me every time 😭
    I follow the video and do everything right, but I still have problems.
    I installed Python 3.10, created a new environment, installed the requirements, and then downloaded the model.
    I think the errors come from incompatible versions?
    Need help, please.

    • @saynomore3704
      @saynomore3704 1 month ago

      Yes, same here. I'm unable to create the parquet files properly like he did, and it's also asking me to use HuggingFaceEmbeddings instead of SentenceTransformerEmbeddings. Someone please help out.

  • @SreehariR-u8v
    @SreehariR-u8v 7 hours ago

    Please help. I tried your method but I am getting the following error:
    Uncaught app exception
    Traceback (most recent call last):
    File "C:\Users\Ahead06\Documents\Chatbot\CHATBOT3\lib\site-packages\streamlit\runtime\scriptrunner\exec_code.py", line 85, in exec_func_with_error_handling
    result = func()
    ;;
    NotImplementedError: Cannot copy out of meta tensor; no data! Please use torch.nn.Module.to_empty() instead of torch.nn.Module.to() when moving module from meta to a different device.

  • @dchuguashvili
    @dchuguashvili 11 months ago +2

    Is it possible to deploy a chatbot that has been fine-tuned using a custom knowledge base and the Llama2 framework on a live production website? My plan is to fine-tune the chatbot with data derived from 100 pages of PDF documents. The aim is for the chatbot to interact with online users and generate responses based on this material. If the chatbot is anticipated to engage with approximately 2,000 users per month and accommodate at least 20 users simultaneously, could you offer a rough estimate of the projected costs?

    • @junaidiqbal4104
      @junaidiqbal4104 10 months ago

      Hi, did you get any answer on that? I hope you will reply.

  • @keeperofthelight9681
    @keeperofthelight9681 5 months ago +1

    Doesn't work. I'm getting crazy errors with embeddings, both with Hugging Face and with sentence-transformers.

  • @associatedbiblestudentsofs5308
    @associatedbiblestudentsofs5308 10 months ago +1

    'Chromadb' is not compatible with Python 3.11. I'm trying to find a workaround, but this is a very well-developed course. Thank you.

  • @taison0072
    @taison0072 5 months ago +1

    Hello bro, since my PC is low-end, is it possible to run the LLM models on Azure, access them from there, build the same application, and also deploy it online?

  • @raghu0770
    @raghu0770 1 year ago +1

    I think if we ask general questions like "Who is Narendra Modi?", it will answer from outside the PDF files.

  • @khalidal-reemi3361
    @khalidal-reemi3361 9 months ago

    Thanks a lot.
    I learned a lot. I will try doing this tutorial.

  • @sneharoy3566
    @sneharoy3566 1 year ago

    Superb video. So easy to follow...

  • @ukcp265
    @ukcp265 1 month ago

    How do we handle and maintain ChromaDB for multiple user requests, and how do we know which directory belongs to which user request in a 2-tier architecture?

  • @abhisycvirat
    @abhisycvirat 4 months ago

    The kid smoking in the background distracted me 😂

  • @rrxn
    @rrxn 21 days ago

    ERROR: Ignored the following versions that require a different python version: 0.55.2 Requires-Python

  • @otimistarj
    @otimistarj 9 days ago

    ImportError: Using `low_cpu_mem_usage=True` or a `device_map` requires Accelerate: `pip install accelerate`

  • @rahulsingh-jk9qs
    @rahulsingh-jk9qs 2 hours ago

    I was getting errors when installing the packages, so I removed the version pins and installed again. It works with langchain_community, but this is still not working:
    Traceback (most recent call last):
    File "C:\xampp\htdocs\asktoai\ingest.py", line 6, in <module>
    from constants import CHROMA_SETTINGS
    File "C:\xampp\htdocs\asktoai\constants.py", line 2, in <module>
    import chromadb.config

  • @mlloving
    @mlloving 9 months ago +1

    Awesome demo. Would you please let me know where to download the repo for this demo? I did not find it on your GitHub. Thanks.

    • @AIAnytime
      @AIAnytime 9 months ago

      It's on my GitHub. Please check the repositories.

  • @vrynstudios
    @vrynstudios 10 months ago

    What will it cost to host this on a server? Suppose 1000 users use it daily: how much would I need to pay for such a PDF search feature? *Please reply*. I am a noob at server-side AI hosting.

  • @FootyFunniesSS
    @FootyFunniesSS 1 year ago

    Great work man.. this really helps

    • @AIAnytime
      @AIAnytime 1 year ago

      Glad it helped

    • @FootyFunniesSS
      @FootyFunniesSS 1 year ago

      @AIAnytime I'm getting this error:
      ValueError: weight is on the meta device, we need a `value` to put in on cpu.

  • @madhupatel6707
    @madhupatel6707 1 year ago +1

    Great explanation! I did the same as you did in the video but got some errors. Is there any way to reach you? I really need your help.

    • @AIAnytime
      @AIAnytime 1 year ago +1

      How can I help you? My contact details are on the channel's About page or on the CZcams banner.

  • @abhishekpasalkar6680
    @abhishekpasalkar6680 9 months ago +1

    It shows this error even after updating Chroma and migrating it:
    "ValueError: You are using a deprecated configuration of Chroma."

  • @shivamthaman7081
    @shivamthaman7081 5 months ago

    Please consider investing in a microphone that will enhance the audio quality of the videos.

  • @anuyogesh8979
    @anuyogesh8979 6 months ago

    I am getting this error: "AttributeError: 'Collection' object has no attribute '__pydantic_extra__'". Why?

  • @ashishanand4233
    @ashishanand4233 3 months ago

    How do I solve this: AttributeError: 'Client' object has no attribute 'chroma_api_impl'?

  • @_yurisales
    @_yurisales 4 months ago

    How can I make my Streamlit + ChromaDB application faster? I'm loading 30 PDF files locally, and when I run the application it takes about 40 minutes to load the documents before the Streamlit interface appears. Is there a way to reduce this time? Is there a way to use multithreading or parallelism with ChromaDB?
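
One possible cause of a long wait before the interface appears is that the documents are re-embedded on every start. A minimal sketch of skipping files that were already ingested, assuming a hypothetical JSON manifest of file hashes kept next to the vector store (the function names and the manifest are illustrative, not part of the video's code):

```python
import hashlib
import json
from pathlib import Path


def file_digest(path: Path) -> str:
    """SHA-256 of a file, read in 1 MiB blocks so large PDFs are not loaded at once."""
    h = hashlib.sha256()
    with path.open("rb") as f:
        for block in iter(lambda: f.read(1 << 20), b""):
            h.update(block)
    return h.hexdigest()


def load_manifest(manifest_path: str) -> dict:
    """Read the path-to-digest manifest, or return an empty one if it does not exist."""
    manifest_file = Path(manifest_path)
    return json.loads(manifest_file.read_text()) if manifest_file.exists() else {}


def files_needing_ingest(pdf_dir: str, manifest_path: str) -> list[Path]:
    """PDFs that are new or changed since the manifest was last written."""
    seen = load_manifest(manifest_path)
    return [p for p in sorted(Path(pdf_dir).rglob("*.pdf")) if seen.get(str(p)) != file_digest(p)]


def mark_ingested(paths: list[Path], manifest_path: str) -> None:
    """Record the current digest of each path after its embeddings were stored."""
    seen = load_manifest(manifest_path)
    seen.update({str(p): file_digest(p) for p in paths})
    Path(manifest_path).write_text(json.dumps(seen, indent=2))
```

With this, only the files returned by `files_needing_ingest` would be embedded, and the app itself would open the persisted vector store instead of rebuilding it.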

  • @fabsync
    @fabsync 7 months ago

    Super awesome tutorial! If you want to search PDFs in folders and subfolders, what would the code for that be?
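
For the folders-and-subfolders question above, one way to collect the files is a recursive glob from the standard library. This is a sketch only; `find_pdfs` is a hypothetical helper, and each returned path would still have to be passed to whichever PDF loader the ingest script uses.

```python
from pathlib import Path


def find_pdfs(root: str) -> list[Path]:
    """Return every PDF under root, including subfolders, matching the extension case-insensitively."""
    return sorted(
        p for p in Path(root).rglob("*")
        if p.is_file() and p.suffix.lower() == ".pdf"
    )
```

Matching on the lowercased suffix also picks up files named `.PDF`, which a plain `*.pdf` pattern can miss on case-sensitive file systems.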

  • @raghu0770
    @raghu0770 1 year ago

    Many people build apps like this, but there is no solution for restricting the model to answer only from the PDF files.
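
One common mitigation for the concern above is to refuse to answer when retrieval finds nothing sufficiently similar to the question. The sketch below assumes similarity scores where higher means closer (some vector stores return distances, where lower is closer); `answer_or_refuse`, `generate`, and the `min_score` value are hypothetical names and numbers for illustration, and the threshold would need tuning on real data.

```python
from typing import Callable, Sequence

REFUSAL = "I could not find this in the provided PDFs."


def answer_or_refuse(
    scored_chunks: Sequence[tuple[float, str]],
    generate: Callable[[list[str]], str],
    min_score: float = 0.35,
) -> str:
    """Call the LLM only with chunks above the threshold; refuse if none qualify."""
    relevant = [text for score, text in scored_chunks if score >= min_score]
    if not relevant:
        return REFUSAL
    return generate(relevant)
```

This does not stop the model from using its own knowledge once it is called, so it is usually combined with a prompt that tells the model to answer only from the supplied context.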

  • @avijit_barua
    @avijit_barua 1 year ago

    Really great work

  • @sarojapulipaka2972
    @sarojapulipaka2972 1 month ago

    Can we also use large files as input (1000 pages)?

  • @qvenmisakais
    @qvenmisakais 4 months ago

    Hi, this is a perfect tutorial, great work! I have a question. Even though I ask questions in Spanish, it answers me in English. Where can I define the language?

  • @overrideguilarte
    @overrideguilarte 1 year ago +1

    Is there any model in Spanish similar to this one that can be integrated?

    • @AIAnytime
      @AIAnytime 1 year ago

      Yes you can try something like 'GPT-2 SMALL SPANISH '... Explore on Huggingface models.
      Please subscribe to the channel if this helps. Thanks

    • @overrideguilarte
      @overrideguilarte 1 year ago

      @AIAnytime thanks

  • @mr-rhymes-sings-poems
    @mr-rhymes-sings-poems 10 months ago

    Sir, I made an offload folder because it asked me to, but a single query creates about 2.5 GB of files in the offload folder. How can I solve this? Please help.

  • @dr.aravindacvnmamit3770
    @dr.aravindacvnmamit3770 7 months ago +1

    ValueError: You are using a deprecated configuration of Chroma.

    • @samarth-joseph
      @samarth-joseph 6 months ago +2

      Downgrade chromadb version
      pip uninstall chromadb
      pip install chromadb==0.3.29

  • @sandedom339
    @sandedom339 1 year ago

    Very nice! Can you load multiple PDF files for Q&A?

    • @AIAnytime
      @AIAnytime 1 year ago +1

      Yes, of course! Make sure your machine has enough compute power for inference. You can ingest multiple files to create embeddings on any machine.

  • @jilanikashif
    @jilanikashif 1 year ago

    Hi, this is a great tutorial and it really helped me. Shifting from machine learning to generative AI is really amazing. It would be great if you created an app for the same using Flask and Docker.

    • @AIAnytime
      @AIAnytime 1 year ago

      I have videos where I containerised an app. Please watch those 2 videos as well.

    • @jilanikashif
      @jilanikashif 1 year ago

      @AIAnytime Could you please share the link? Thanks for the quick reply.

    • @BharatVarsh47
      @BharatVarsh47 10 months ago

      @AIAnytime Can you share the link, please?

  • @deepudeepak1390
    @deepudeepak1390 11 months ago +1

    Can I use Falcon 40B in place of the LLM you are using?

    • @AIAnytime
      @AIAnytime 11 months ago

      Of course you can. Make sure you have enough compute power.

  • @nandanhegde3444
    @nandanhegde3444 1 year ago

    great video

  • @Hope1GamingCSGODota2more
    @Hope1GamingCSGODota2more 7 months ago +1

    What about Supabase as the vector store?

  • @user-qi4jw1lf9i
    @user-qi4jw1lf9i 8 months ago

    Please tell me the system configuration required for this.
    I have set it up, but it shows a Load_weight problem.

  • @anannyachamat6366
    @anannyachamat6366 5 months ago

    Hi Sir, can you please tell me whether the Text Summarization project using LaMiniT5 248M and this Search Your PDF project using LaMiniT5 738M can be integrated into one single project? Please answer, sir.

  • @mohlabo3917
    @mohlabo3917 4 months ago

    Hi, I cannot install chromadb; the install fails with an error.

  • @roberty.agyekumaddo6071
    @roberty.agyekumaddo6071 9 months ago

    Awesome tutorial. Is there a way of embedding this into a webpage? Also, is there a way to customize the appearance of the Streamlit GUI?

  • @PavanKumar-yk5mq
    @PavanKumar-yk5mq 7 months ago

    How do I deploy this RAG model on AWS? I mean, what services can we use to deploy it other than EC2?

  • @MikelBaghdasarian
    @MikelBaghdasarian 1 year ago

    Really interesting. I was wondering how to mix it with the oobabooga repo and add some options, like loading multiple PDFs, CSV, XLM, and other types of documents (PowerPoint, .txt, and others), with a database behind it for user access... that would be awesome!

    • @AIAnytime
      @AIAnytime 1 year ago +2

      Cool idea! Maybe I can see if I can create a video soon.

    • @MikelBaghdasarian
      @MikelBaghdasarian 1 year ago

      @AIAnytime Great! If you need a concept of use, let me know. I already tried mixing both git repos and they are working fine. The best part would be the ability to add a LoRA trained in oobabooga and use it under Langchain!

  • @STACKCODE
    @STACKCODE 5 months ago +1

    Traceback (most recent call last):
    File "D:\coding\Webkul_interview\ollama\vectorbd.py", line 4, in <module>
    from constants import CHROMA_SETTINGS
    ImportError: cannot import name 'CHROMA_SETTINGS' from 'constants' (d:\coding\Webkul_interview\ollama\env\Lib\site-packages\constants.py)

    • @STACKCODE
      @STACKCODE 5 months ago

      I resolved the error.

    • @leehenriques6661
      @leehenriques6661 5 months ago

      @STACKCODE Please tell me how you did it as well.
      I am facing the same error.
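
The traceback in this thread shows `constants` being imported from `site-packages`, which suggests that a pip-installed package named `constants` was picked up instead of the project's own `constants.py` (for example, because no `constants.py` sits next to the script being run). A small diagnostic, offered as a sketch, prints where Python would resolve a module name from; `module_origin` is a hypothetical helper.

```python
import importlib.util
from typing import Optional


def module_origin(name: str) -> Optional[str]:
    """Return the file a top-level module name would be imported from, or None if not found."""
    spec = importlib.util.find_spec(name)
    return None if spec is None else spec.origin


# Run this from the same directory and environment as the failing script.
print(module_origin("constants"))
```

If the printed path points into `site-packages`, placing the project's `constants.py` in the script's directory (or uninstalling the unrelated package) should make the local file win, since the script's directory comes first on the import path.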

  • @geojames4236
    @geojames4236 10 months ago

    Awesome video... can you please suggest a model for querying Portuguese PDF documents?

  • @ROKKor-hs8tg
    @ROKKor-hs8tg 7 months ago

    So this works without any kind of subscription. Does this code have a Google Colab page for testing? Once the libraries are downloaded, will the code work?

  • @vivekpatel2736
    @vivekpatel2736 2 months ago

    @AIAnytime Can we also get images from the PDF in the answer?

  • @saumyajaiswal6585
    @saumyajaiswal6585 7 months ago

    Thank you for the awesome video. With the source citation does it also give images from the pdf in the answer?

  • @user-py8qx6th8p
    @user-py8qx6th8p 9 months ago

    I have a few questions. What other models besides LaMini can I use? I am trying to use Llama 2 or BLOOM. Also, what API should I use if I don't want to download the LLM?

    • @AIAnytime
      @AIAnytime 9 months ago +1

      Look at my latest video.... Using Zephyr and Mistral LLMs.

  • @JahangeerRathore
    @JahangeerRathore 10 months ago

    Awesome, but I want to use the LaMini model online rather than offline, because my RAM is 8 GB and it crashed after running. How can I achieve that? Thanks in advance.

  • @yanayana-cm5qg
    @yanayana-cm5qg 2 months ago

    What Python version are you using?

  • @I3lor
    @I3lor 1 year ago +1

    Great video. If I replace the checkpoint with any other model (e.g. google/mt5), will the project still work as intended?

    • @AIAnytime
      @AIAnytime 1 year ago +1

      Yes, absolutely! It should work if you have a decent machine that can load the model in memory.

    • @I3lor
      @I3lor 1 year ago

      @AIAnytime Thank you, you have been very helpful.

  • @user-zj7cp8dg9f
    @user-zj7cp8dg9f 4 months ago

    How can we do this with a knowledge graph?

  • 6 months ago

    Is it possible to swap "all-MiniLM-L6-v2" for Watson or any other paid LLM?

  • @DeviGoneMad
    @DeviGoneMad 5 months ago

    Can you mention the version of Python you are using here?

  • @AMITSINGH-hu4es
    @AMITSINGH-hu4es 9 months ago

    PDF file of resume is not output.

  • @truckfinanceaustralia1335

    great vid!

  • @leehenriques6661
    @leehenriques6661 5 months ago

    AttributeError: chroma_api_impl
    Can you help me fix this, please?

  • @John-jx4ho
    @John-jx4ho 1 year ago

    Awesome!

  • @yashsrivastava4878
    @yashsrivastava4878 11 months ago +1

    Could you please do a video on the same topic using LMQL, Langchain, and Chainlit together, taking multiple files of different formats?

    • @AIAnytime
      @AIAnytime 11 months ago +1

      Sure Yash. Soon. Thanks for the idea.

    • @yashsrivastava4878
      @yashsrivastava4878 11 months ago

      @AIAnytime Thank you, sir 😊

    • @yashsrivastava4878
      @yashsrivastava4878 11 months ago

      @AIAnytime Sir, please make a video on this as soon as you can 🙏

    • @AIAnytime
      @AIAnytime 11 months ago

      By Sunday. Currently in a family emergency. Apologies for the delay!

    • @yashsrivastava4878
      @yashsrivastava4878 11 months ago

      @AIAnytime OK, sir 🙂

  • @talhaabdulqayyum193
    @talhaabdulqayyum193 11 months ago

    I am getting this error:
    NotImplementedError: Cannot copy out of meta tensor; no data!
    Any workarounds?

  • @mythzing7
    @mythzing7 1 year ago

    I get this error when I run the code. I searched online but couldn't find a solution. Could you please help?
    NotImplementedError: Cannot copy out of meta tensor; no data!

  • @tapanpati9452
    @tapanpati9452 10 months ago

    How can I connect with you? The WhatsApp link is not working.

  • @Rider-jn6zh
    @Rider-jn6zh 5 months ago

    Hello brother,
    Can you please upload videos on how to evaluate an LLM and which evaluation metrics can be used for a specific use case?
    I am getting this question in every interview and am not able to answer it.

  • @user-yd3zk4hb1o
    @user-yd3zk4hb1o 1 year ago

    Since chromadb was updated, the code throws some errors related to chromadb. Can you please update the code and push it to your repo?

    • @AIAnytime
      @AIAnytime 1 year ago +1

      Just a request: can you open a PR on the GitHub repo? I will simply merge that PR. Let me know. It's just a few lines of code. They have migrated from DuckDB to SQLite.

  • @epictetus__
    @epictetus__ 9 months ago

    Bookmark: 21:00

  • @pagadishyam7049
    @pagadishyam7049 11 months ago +1

    Hi, your videos are really very impressive. I am trying to recreate this, but I get the error below when executing ingest.py:
    duckdb.InvalidInputException: Invalid Input Error: Required module 'pandas.core.arrays.arrow.dtype' failed to import, due to the following Python exception:
    ModuleNotFoundError: No module named 'pandas.core.arrays.arrow.dtype'

    • @AIAnytime
      @AIAnytime 11 months ago

      Can you check which chromadb version you are using? Chroma has recently migrated from DuckDB to SQLite. In that case, you need to make changes in constants.py. Do you mind looking at the GitHub issues of this repo on my GitHub?

    • @pagadishyam7049
      @pagadishyam7049 11 months ago

      @AIAnytime Can I use an old version of chromadb? Will it work?

    • @pagadishyam7049
      @pagadishyam7049 11 months ago

      @AIAnytime I downgraded chromadb and pandas to chromadb==0.3.26 and pandas==2.0.3, and it worked for me. Hope this helps others.

  • @foodfashionmasti8297
    @foodfashionmasti8297 11 months ago

    Chroma DB error: "You are using a deprecated configuration of Chroma", something about migrating.

  • @ehteshamnehal7024
    @ehteshamnehal7024 1 year ago

    Hi. Sometimes while running the model I get the following error: Cannot copy out of meta tensor; no data! Any idea how to solve this? Also, I'm using a FAISS DB instead of Chroma. Thanks.

    • @AIAnytime
      @AIAnytime 1 year ago

      Can you check whether you are offloading some weights to the CPU? Are you using cuda or cpu as the device map? Or auto?

    • @ehteshamnehal7024
      @ehteshamnehal7024 1 year ago

      @AIAnytime I'm not offloading any weights to the CPU. Also, I'm using auto.

  • @rajesh1906
    @rajesh1906 2 months ago

    Can we run this with 8 GB of RAM?

  • @pratikchatterjee5992
    @pratikchatterjee5992 1 year ago

    This is great! Thanks a lot! I love the way you explain every bit and piece.
    I am facing an error.
    The app works for the first question, but whenever I ask a second question I get the error below:
    NotImplementedError: Cannot copy out of meta tensor; no data!
    Any idea?

    • @AIAnytime
      @AIAnytime 1 year ago

      Thanks for your comments. Probably you don't have much compute power. What are your laptop specs? And is device_map set to auto, CPU, or CUDA?

  • @SpoortiSpoorti-p8e
    @SpoortiSpoorti-p8e 27 days ago

    Where is the code?

  • @manofsteel6173
    @manofsteel6173 3 months ago

    Where is the code link?

  • @mohitkapoor4374
    @mohitkapoor4374 1 year ago

    Very nice tutorial. It helped me solve an issue I was working on. Could you please explain how we can reduce the latency of the chatbot's answers? Also, what if the PDF has more than 100 pages?

    • @AIAnytime
      @AIAnytime 1 year ago +2

      Thanks for the comment. Infrastructure is the key, and of course some tweaking of the preprocessing and algorithms. Get more compute power and you will see the improvement.

    • @mohitkapoor4374
      @mohitkapoor4374 1 year ago

      @AIAnytime Thank you so much. Is there any way I can connect with you, or any tutorial I should follow to scale things up after following your tutorial?

  • @MrTabishMehdi
    @MrTabishMehdi 11 months ago

    Can you please update the versions of all the libraries? I am getting an error in chromadb because of the version. Kindly do the needful.

    • @RameshPatil28592
      @RameshPatil28592 4 months ago

      Hi, did you update the libraries and resolve the chromadb error?

  • @adityapatel_00
    @adityapatel_00 8 months ago

    Hello Brother,
    I appreciate your work. But can you please update the requirements.txt with the version numbers? The versions have changed and we are facing problems running it.
    Can you hurry?
    Thank you.

    • @ShadyPencil
      @ShadyPencil 7 months ago

      try this...
      pydantic==1.10.13
      chromadb==0.3.26
      langchain==0.0.267
      streamlit==1.25.0
      transformers==4.31.0
      torch==2.0.1
      einops==0.6.1
      bitsandbytes==0.41.1
      accelerate==0.21.0
      pdfminer.six==20221105
      beautifulsoup4==4.12.2
      sentence-transformers
      duckdb==0.7.1
      sentencepiece==0.1.99
      six==1.16.0
      requests==2.31.0
      uvicorn==0.18.3
      torchvision==0.15.2
      streamlit-chat
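
To see which installed packages differ from a pinned list like the one above, the pins can be compared against the current environment with the standard library. This is a sketch under stated assumptions: `parse_pins` and `mismatches` are hypothetical helpers, only `name==version` lines are handled, and unpinned entries such as `sentence-transformers` are skipped.

```python
from importlib import metadata
from typing import Optional


def parse_pins(text: str) -> dict[str, str]:
    """Extract {package: version} from lines of the form name==version."""
    pins = {}
    for line in text.splitlines():
        line = line.strip()
        if "==" in line and not line.startswith("#"):
            name, version = line.split("==", 1)
            pins[name.strip()] = version.strip()
    return pins


def mismatches(pins: dict[str, str]) -> dict[str, tuple[str, Optional[str]]]:
    """Map each package whose installed version differs to (wanted, installed or None)."""
    report = {}
    for name, wanted in pins.items():
        try:
            installed = metadata.version(name)
        except metadata.PackageNotFoundError:
            installed = None  # package is not installed in this environment
        if installed != wanted:
            report[name] = (wanted, installed)
    return report
```

Calling `mismatches(parse_pins(open("requirements.txt").read()))` inside the project's virtual environment would list every package that is missing or on a different version than the pin.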

  • @stephennfernandes
    @stephennfernandes 1 year ago

    Hey, great work man, this really helps. Could you please explain briefly, or if possible make a video about, how vector DBs work internally? What are they exactly: word embeddings like StarSpace or fastText, or sentence-transformer embeddings with similarity search on top? And how do technologies like Langchain and llama_index work internally?

    • @AIAnytime
      @AIAnytime 1 year ago +1

      Hi Stephen, thanks for your comment. Maybe I can try doing that, but I feel there are many such videos available on CZcams. Still, I can explain it in simpler terms. My focus is to help my subscribers build projects in generative AI... But stay tuned 🔜

    • @stephennfernandes
      @stephennfernandes 1 year ago

      @AIAnytime Thanks a ton

  • @adithyas6428
    @adithyas6428 5 months ago

    I am getting the error NotImplementedError: Cannot copy out of meta tensor; no data!
    Did anyone else face this error, and is there any solution?

    • @DeviGoneMad
      @DeviGoneMad 5 months ago

      Yep, facing the same! Did you fix it?

    • @adithyas6428
      @adithyas6428 2 months ago

      @DeviGoneMad No,
      I could not find a solution.

  • @JonathanLyon
    @JonathanLyon 3 months ago

    Are you available for hire?

  • @dingowhiz481
    @dingowhiz481 1 year ago

    Excellent video! Exactly what I need for a POC. Installing Chroma was a challenge, though: running 'pip install chroma-migrate' and then the `chroma-migrate` command crashed my Linux machine. Do you know of an LTS version of Chroma?

    • @AIAnytime
      @AIAnytime 1 year ago

      Thank you for your comment! Can you let me know your Python version? Can you try Python 3.10 and then do a pip install chromadb?

    • @dingowhiz481
      @dingowhiz481 1 year ago

      @AIAnytime I'm running Python 3.10.6 on Ubuntu 22.04.2

  • @user-qi4jw1lf9i
    @user-qi4jw1lf9i 8 months ago

    It gives the error ModuleNotFoundError: No module named 'pandas.core.arrays.arrow.dtype'. Please correct this code in the ingest.py file.
    Please help.

  • @mort-ai
    @mort-ai 1 year ago +1

    Does this work with any language?

    • @AIAnytime
      @AIAnytime 1 year ago

      Thanks for your comment! No, it doesn't work for every language.

  • @deepak1291
    @deepak1291 7 months ago

    Has anyone run it on a Windows laptop with 8 GB of RAM (CPU only)?

  • @nandanhegde3444
    @nandanhegde3444 1 year ago

    How many GB is the LaMini file?

  • @bhautikin
    @bhautikin 1 year ago

    Does this mean I only need to add data to the vector DB for a new PDF, with no need to train again?

    • @AIAnytime
      @AIAnytime 1 year ago

      You have to create embeddings for the new files.

    • @bhautikin
      @bhautikin 1 year ago

      @AIAnytime got it. Thanks

  • @akshay_raut
    @akshay_raut 1 year ago

    Great tutorial, waiting for the chatbot... The WhatsApp link is not working. Can you please share the group link again? Thank you!

    • @AIAnytime
      @AIAnytime 1 year ago

      Thanks Akshay! Please find it here: chat.whatsapp.com/EDnAeyBL18GB9xxcnyTW3Y The chatbot video will be posted by tomorrow.

    • @sscoder170
      @sscoder170 2 months ago

      @AIAnytime Can you please share the link to that chatbot video here?

  • @083-cse-sameerkhan3
    @083-cse-sameerkhan3 10 months ago +1

    Sir, will it work with 8 GB of RAM?

    • @AIAnytime
      @AIAnytime 10 months ago +1

      Difficult, but it will work for a few questions.

  • @deepjyotibaishya7576
    @deepjyotibaishya7576 7 months ago

    The link to this repo, please.

  • @user-qi4jw1lf9i
    @user-qi4jw1lf9i 9 months ago

    Please make it handle 10 million pages,
    for a lawyer use case.