Greg Kamradt (Data Indy)
Greg Kamradt (Data Indy)
  • 178
  • 2 379 048
I didn't know RAG could be this easy
Gradient AI: tinyurl.com/gradient-ai
Get the code: github.com/gkamradt/RAGWithGradient
Get updates from me: mail.gregkamradt.com/
Greg’s Info:
- Twitter: GregKamradt
- Newsletter: mail.gregkamradt.com/
- Website: gregkamradt.com/
- LinkedIn: www.linkedin.com/in/gregkamradt/
- Work with me: tiny.one/TEi2HhN
- Contact Me: Twitter DM, LinkedIn Message, or contact@dataindependent.com
zhlédnutí: 4 154

Video

I interview the man behind AI Virtual Try-On
zhlédnutí 1,5KPřed 3 měsíci
Kopia: www.brands.trykopia.com/ Calvin: CalvinnChenn Greg’s Info: - Twitter: GregKamradt - Newsletter: mail.gregkamradt.com/ - Website: gregkamradt.com/ - LinkedIn: www.linkedin.com/in/gregkamradt/ - Work with me: tiny.one/TEi2HhN - Contact Me: Twitter DM, LinkedIn Message, or contact@dataindependent.com
World’s Fastest Talking AI: Deepgram + Groq
zhlédnutí 40KPřed 4 měsíci
- Deepgram: tinyurl.com/deepgram-aura to get $200 free credit - Code Tutorial Overview: github.com/gkamradt/QuickAgent OVERVIEW: I’m Greg Kamradt, and I’m on a mission to figure out how businesses will create more value using AI. In this overview, we look at the 3 pieces needed to create a super fast AI voice bot. Sponsors that help support the channel: - Deepgram (Transcription Services): tiny...
The Secret Behind The "Chat With Business Data" Industry
zhlédnutí 4,5KPřed 5 měsíci
aiwithwork.com/ Greg’s Info: - Twitter: GregKamradt - Newsletter: mail.gregkamradt.com/ - Website: gregkamradt.com/ - LinkedIn: www.linkedin.com/in/gregkamradt/ - Work with me: tiny.one/TEi2HhN - Contact Me: Twitter DM, LinkedIn Message, or contact@dataindependent.com
The 5 Levels Of Text Splitting For Retrieval
zhlédnutí 59KPřed 6 měsíci
Get Code: fullstackretrieval.com/ Get updates from me: mail.gregkamradt.com/ * www.chunkviz.com/ Greg’s Info: - Twitter: GregKamradt - Newsletter: mail.gregkamradt.com/ - Website: gregkamradt.com/ - LinkedIn: www.linkedin.com/in/gregkamradt/ - Work with me: tiny.one/TEi2HhN - Contact Me: Twitter DM, LinkedIn Message, or contact@dataindependent.com Outline: 0:00 - Intro 3:42 - Theory...
I asked 10 businesses how they ACTUALLY use AI
zhlédnutí 7KPřed 6 měsíci
Tell me about your AI impact: o423w74xx6a.typeform.com/to/dRs8TYgO Get updates from me: mail.gregkamradt.com/ Interviews: * teereximus * nickscamara_ * dionisloire * Reidoutloud_ * markerdmann Greg’s Info: - Twitter: GregKamradt - Newsletter: mail.gregkamradt.com/ - Website: gregkamradt.com/ - LinkedIn: www.linkedin.com/in/...
I pressure tested GPT-4's 128K context retrieval
zhlédnutí 22KPřed 8 měsíci
Get updates from me: mail.gregkamradt.com/ FullStackRetrieval.com Tweet write up: GregKamradt/status/1722386725635580292 Code: github.com/gkamradt/LLMTest_NeedleInAHaystack Check out how GPT-4 does at retrieval with 128K tokens worth of context. Lost In The Middle: www-cs.stanford.edu/~nfliu/papers/lost-in-the-middle.arxiv2023.pdf Greg’s Info: - Twitter: GregKamradt - Ne...
FullStackRetrieval.com - All Things LLM Retrieval (Trailer)
zhlédnutí 3,5KPřed 8 měsíci
Sign up (Free) to get access: fullstackretrieval.com/
I react to OpenAI DevDay
zhlédnutí 2,2KPřed 8 měsíci
Get updates from me: mail.gregkamradt.com/ Greg’s Info: - Twitter: GregKamradt - Newsletter: mail.gregkamradt.com/ - Website: gregkamradt.com/ - LinkedIn: www.linkedin.com/in/gregkamradt/ - Work with me: tiny.one/TEi2HhN - Contact Me: Twitter DM, LinkedIn Message, or contact@dataindependent.com
How I Fine-Tuned An AI Clone - Can You Tell The Difference?
zhlédnutí 4,6KPřed 8 měsíci
Gradient.AI: tinyurl.com/gradient-ai Greg’s Email Updates: mail.gregkamradt.com/ Code: github.com/gkamradt/FineTuningClone Overview: In this video, I experiment with using AI to clone myself - from matching my speaking style to generating a convincing video. It's a wild journey across different AI tools as I try to fool people into thinking my digital clone is real. I test out open-sourced mode...
Group By Meaning...Not Keywords
zhlédnutí 2,1KPřed 8 měsíci
Stay up to date with Greg: mail.gregkamradt.com/ Semantic Deduplicator: github.com/gkamradt/SemanticDeduplicator SingleStore: tiny.one/QUtq9Wa Dive into the innovative world of semantic deduplication. Navigate the challenges of consolidating product feedback and explore the mechanics behind a new Python package I built. From refining grocery lists to streamlining CZcams comments, witness the tr...
I figured out what GPT-4 Vision could do
zhlédnutí 11KPřed 9 měsíci
Email Subscribers get the list: gregkamradt.ck.page/b6630af43e Outline 0:00 - Intro 1:16 - Describe 2:06 - Interpret 3:30 - Recommend 5:23 - Convert 7:23 - Extract 8:46 - Evaluate 10:45 - Assist 13:28 - Greg's Reflections Greg’s Info: - Twitter: tiny.one/VtxG3kC - Newsletter: tiny.one/9kkx2D0 - Website: tiny.one/QWsPKqX - LinkedIn: tiny.one/rzlQ1bB - Work with me: tiny.one/UVetE5r - Contact Me:...
The AI Task Force You Need At Work
zhlédnutí 1,9KPřed 10 měsíci
Your database solution, Singlestore: tiny.one/QUtq9Wa Book time w/ Greg: tiny.one/8eJm3r4 Want to enable your employees with AI tools but not sure where to start? Join Greg as he shares key learnings from top companies who have created AI committees. Learn why you need leadership buy-in, how to manage expectations around AI's capabilities, and explore ideas for workforce enablement. Discover re...
11 Ways Zapier Employees Use AI (Mike Knoop Interview)
zhlédnutí 4,1KPřed 10 měsíci
Get more AI interviews, analysis, hot takes, and tutorials: tiny.one/k43As2p Overview Join Greg and Mike as they chat about Zapier's journey into AI. Learn how Zapier enabled all employees to explore AI tools during a company "code red", leading to huge adoption increases. Hear Mike's advice for companies just starting with AI, like revisiting previously unsolvable problems and extracting insig...
4 Reasons Why AI Won’t Work
zhlédnutí 2,8KPřed 10 měsíci
All things data, check out SingleStore: tiny.one/DGOpgdT Outline Is AI all hype or is it really as profound as fire? In this video, we go on a journey to find the top reasons why AI might fail and not live up to expectations. Get ready as we review 4 solid arguments on why AI could be a total dud: * AI Hallucinations - Models make up fake info confidently - how can we trust them? * Too Complex ...
I Cloned My Favorite Podcast Host (with AI Voice Cloning)
zhlédnutí 2,8KPřed 11 měsíci
I Cloned My Favorite Podcast Host (with AI Voice Cloning)
4 Non-Technical Ways I Use ChatGPT @ Work
zhlédnutí 1,8KPřed rokem
4 Non-Technical Ways I Use ChatGPT @ Work
There's More To Retrieval Than Vector Stores
zhlédnutí 6KPřed rokem
There's More To Retrieval Than Vector Stores
Extract Topics From Video/Audio With LLMs (Topic Modeling w/ LangChain)
zhlédnutí 15KPřed rokem
Extract Topics From Video/Audio With LLMs (Topic Modeling w/ LangChain)
Anderson 'CoopBot' Content Moderation & Game Generator (Early Signals #4)
zhlédnutí 1,3KPřed rokem
Anderson 'CoopBot' Content Moderation & Game Generator (Early Signals #4)
Siqi Chen - AI Thoughts, Wild Predictions and Musings
zhlédnutí 1,7KPřed rokem
Siqi Chen - AI Thoughts, Wild Predictions and Musings
Function Calling via ChatGPT API - First Look With LangChain
zhlédnutí 45KPřed rokem
Function Calling via ChatGPT API - First Look With LangChain
ChatGPT made my interview questions for me (Streamlit + LangChain)
zhlédnutí 7KPřed rokem
ChatGPT made my interview questions for me (Streamlit LangChain)
Jared Zoneraich - Future Of Prompt Engineering, Management, and Collaboration
zhlédnutí 1,9KPřed rokem
Jared Zoneraich - Future Of Prompt Engineering, Management, and Collaboration
Generate Content With AI Researchers (Early Signals #3)
zhlédnutí 3,2KPřed rokem
Generate Content With AI Researchers (Early Signals #3)
Build Your Own AI Twitter Bot Using LLMs
zhlédnutí 12KPřed rokem
Build Your Own AI Twitter Bot Using LLMs
ChatGPT Home Automation, Personalized Ads + 3 Ideas (AI Early Signals #2)
zhlédnutí 2,6KPřed rokem
ChatGPT Home Automation, Personalized Ads 3 Ideas (AI Early Signals #2)
Control Tone & Writing Style Of Your LLM Output
zhlédnutí 13KPřed rokem
Control Tone & Writing Style Of Your LLM Output
Influencer's AI Bot Makes In $72K In A Week (AI Early Signals #1)
zhlédnutí 7KPřed rokem
Influencer's AI Bot Makes In $72K In A Week (AI Early Signals #1)
Matt Welsh - Co-Founder & CEO Of Fixie (B2B AI Agent Platform)
zhlédnutí 6KPřed rokem
Matt Welsh - Co-Founder & CEO Of Fixie (B2B AI Agent Platform)

Komentáře

  • @tvlover2447
    @tvlover2447 Před dnem

    Outstanding well done!

  • @neerajshrivastava5600

    Excellent content, however if you speak slowly it will make it easy to absorb the information

  • @ch_cking
    @ch_cking Před 3 dny

    thank you ryan gosling

  • @jeffersonhighsmith7757

    "Filler words" this is hugely important, IMO. Because it's literally how human beings speak. They use these delaying tactics constantly.

  • @shakthiprashanth1278

    Simple & Clear explanation.

  • @DieserWeg14
    @DieserWeg14 Před 6 dny

    Loved your channel, could you do one with LangServe, please, thanks.

  • @ganderamu5138
    @ganderamu5138 Před 6 dny

    .unique() is not working for everything,while datacleaning some extra information it was showing df['TypeofContact'].unique()

  • @ajaykumarreddy8841
    @ajaykumarreddy8841 Před 13 dny

    Hi Greg. Great video! Thanks for sharing. But I have some issues when running the code: Firstly, the Speech-to-text performance is not very good. I literally have to shout into my mic for it to be able to hear. I thought it was my microphone issue and tested it normally on a simple voice recorder and it worked as expected. Secondly, the text-to-speech voice output keeps breaking in between. Not sure if that is expected because of ffplay but definitely wasn't as smooth as what you showed in the video. Thirdly, the voice input is not getting recognized immediately once I get the response. It seems there is a small but noticeable delay to when I get the response back as voice to when I can again start speaking even though it says "Listening" in the console. I have to wait for like 5-10 seconds before I start speaking for the program to recognize my voice or else it isn't doing so. Is anyone else facing the same issue?

  • @jatinaqua007
    @jatinaqua007 Před 15 dny

    Great video! so what if my question is out of context of the pdf document? Will the open ai answers it from its generic knowledge? Or it will simply say that it doesn't know the answer? Either way can we configure it to respond the way we want?

  • @haribattula5187
    @haribattula5187 Před 16 dny

    I guess semantic search is what already vector databases are supporting.. and i don't find any advantages by doing sentence split, then calculating cosine distance and putting them in same bucket.. am i missing something here?

  • @alexeponon3250
    @alexeponon3250 Před 16 dny

    Single and multi hop explained concerning the semantic splitting. Nice !!

  • @cybern9ne
    @cybern9ne Před 18 dny

    Iterrows is NOT faster than itertuples.

  • @nihilitymandate6073
    @nihilitymandate6073 Před 19 dny

    Not a comment on the context, but I think that the style of the thumbnail is very smart. It reminds me of the Wired 5 Levels of Difficulty style. I think if the aesthetic is softer, it can be even more popular.

  • @user-dh6wx3fe6y
    @user-dh6wx3fe6y Před 20 dny

    group rank is really intuitive and simple, but I don't really understand how normalizing ranks works? and what's the idea of applying it?

  • @derricks6777
    @derricks6777 Před 21 dnem

    The voices are all terrible, imho. They would work well in a cyberpunk game - but that's about it. :D

  • @lets-talk-ai
    @lets-talk-ai Před 21 dnem

    Amazing video as always, thanks Greg. Did anyone had some issues with ffplay?

  • @sbotros
    @sbotros Před 22 dny

    thank you

  • @bhagirathsinhparmar2902

    I can't appreciate this video or this playlist more. This work is a masterpiece. Thank you!!

  • @JoanApita
    @JoanApita Před 24 dny

    man it took me 3 weeks to find you. thank you please keep on coming.

  • @raulgarcia6191
    @raulgarcia6191 Před 25 dny

    I've a better way for imsgesyin PDF, I converted PDF to markdown and turn all images to markdown image reference, then put all the images in a separate folder, that way my embeddings show markdown text with markdown images which can later be turned into html images in the chatbot

  • @rodrigoniveyro9763
    @rodrigoniveyro9763 Před 25 dny

    Thanks! you helped me to get to the right path to my solution !

  • @lindamatthews7260
    @lindamatthews7260 Před 26 dny

    Thank you. I could not get my code to run. Thank you….

  • @ramachinta3140
    @ramachinta3140 Před 27 dny

    Very helpful Video, Thank you!

  • @7dainis777
    @7dainis777 Před 27 dny

    I can see video was posted a year ago. There is one better approach, which I'm not sure was available year ago. You can use RAG + Data Embedings. Each document chunk you can convert to vectors and store in vector store/database. Then also prompt can be converted into vector and matched against vector store/database, which will give you closest match. Then just use GPT model on best match found earlier.

  • @kapilkevlani145
    @kapilkevlani145 Před 28 dny

    can somebody helps me with interruption handling in this provided video as I have created a voicebot which is running on UI but unfortunately it cannot handle voice interruptions.

  • @Robert-wj9lb
    @Robert-wj9lb Před 28 dny

    LOL I think this entire video is made with an A.I. clone of you :P

  • @badrinarayanans355
    @badrinarayanans355 Před 28 dny

    Really Informative 😊

  • @robertovillegas2220
    @robertovillegas2220 Před 29 dny

    I have a use case with a haven't seen anywhere: I create a private GPT that has documents as contexts. This documents has criteria for a specific subject I give it system instructions so its function is to evaluate a user document that it's attached as a part of the prompt, to see if the document complies with the criteria in the context documents, and give detail response on the result of evaluation and the justification that includes, the content of the user document and the criteria in the context documents. I want to do that in LangChain but I don't know hot to add a user document as a part of the prompt for the RAG.. It would be great if you can you explain how to approach this implementation. Thanks you for the content!! Keep the good work.

  • @JOSEGARCIA-ch2jp
    @JOSEGARCIA-ch2jp Před měsícem

    pricing, pricing, pricing always pricing.

  • @badrinarayanans355
    @badrinarayanans355 Před měsícem

    Great Insights

  • @sarthaksharma9861
    @sarthaksharma9861 Před měsícem

    Hey! Does the 4o version use something similar?

  • @mlguy8376
    @mlguy8376 Před měsícem

    Rip to semantic search for large datasets 😂! But interesting approach.

  • @alextonev4145
    @alextonev4145 Před měsícem

    Love how straight to the point the explanation was

  • @umeshkumarasamy6608
    @umeshkumarasamy6608 Před měsícem

    8:52 Thou shan't troll the beginners for you were one too. - Pro Code, Rule 4.

  • @dgpreston5593
    @dgpreston5593 Před měsícem

    Nicely done

  • @amrohendawi6007
    @amrohendawi6007 Před měsícem

    This is an amazing professional content! it hits the point directly

  • @stonedizzleful
    @stonedizzleful Před měsícem

    This is an insanely detailed from first principles tutorial. Thank you for taking the time to put this together.

  • @aarshmehtani5468
    @aarshmehtani5468 Před měsícem

    While running the code elements=partition_pdf(filename=filename,strategy="hi_res",infer_table_structure=True,model_name="yolox") in my Jupyter notebook, I encountered errors such as TesseractNotFoundError. If anyone has faced this issue or knows how to solve it, please guide me as soon as possible. Great work, sir. I can confidently say that such a combination of content and explanation is unparalleled in the CZcams world. @DataIndependent

    • @aarshmehtani5468
      @aarshmehtani5468 Před měsícem

      Now this problem is solved but new problem has come. Basically this code is not working properly due some versions or subclasses. So please give the alternative method.

  • @amaanqureshi1286
    @amaanqureshi1286 Před měsícem

    My question is pinecone only stores vectors and not text files, how do I get the texts in my program

  • @shankstuv
    @shankstuv Před měsícem

    Love this! I'm working with transcripts where semantically generated chunks can be quite large. These chunks need to be further divided to fit the limits of the embedding model. Given this, isn't semantic chunking unnecessary if we ultimately have to recursively break down the larger chunks into smaller ones?

  • @umeshkumarasamy6608
    @umeshkumarasamy6608 Před měsícem

    You Good sir are not a Guy who talks the talk. I learnt more about Langchain from you in this half hour than anybody else I have listened to on the last 3months. The veritable quarter to be exact. You are a Guy who walks the walk 🫡

  • @user-mg2qk2cl3f
    @user-mg2qk2cl3f Před měsícem

    Liked this semantic splitting! Cool stuff you´ve done there!! Also agentic chunking. Pretty cool!!!

  • @xmagcx1
    @xmagcx1 Před měsícem

    great video, thanks you

  • @SudhakarVyas
    @SudhakarVyas Před měsícem

    Hey Greg, thanks for this video! Since, there is a limit to access open ai api key without paying, how can the above implementation be carried out with other open source LLMs ?

  • @fabsync
    @fabsync Před měsícem

    Fantastic tutorial! It would be great to see another tutorial using "transformers" instead of openai with chroma or any local database... and how will you save the extracted information.. does Kor tokenize that information, etc?

  • @xinxunzeng9639
    @xinxunzeng9639 Před měsícem

    Great video! it helped me clarify the past and present of all the chunks. I have a question, in agent chunking, there can be an issue of having too much content on a single topic. In extreme cases, an entire book might be about one topic. How should we further break it down in such situations?

  • @YanasChanell
    @YanasChanell Před měsícem

    That was really helpful, thank you for information you're sharing!

  • @Dr_Tripper
    @Dr_Tripper Před měsícem

    was working then: RuntimeError: Task <Task pending name='Task-10' coro=<AsyncLiveClient.send() running at C:\Users\Sarah\.conda\envs\deepgram\lib\site-packages\deepgram\clients\live\v1\async_client.py:251> cb=[_chain_future.<locals>._call_set_state() at C:\Users\Sarah\.conda\envs\deepgram\lib\asyncio\futures.py:392]> got Future <Future pending> attached to a different loop