LangChain "Advanced Retrieval" Webinar

Sdílet
Vložit
  • čas přidán 29. 08. 2024

Komentáře • 37

  • @akhils1428
    @akhils1428 Před rokem +1

    Awesome work from Mathew and the Unstructured team. Really looking forward to the new features.

  • @RedCloudServices
    @RedCloudServices Před 11 měsíci +2

    I love where Langchain sits in the LLM ecosystem as a framework. I’m already numb at the stunning number of products that sit on top of Langchain which don’t really seem to distinguish from each other. Microsoft’s latest “flow” product (among others from the big platforms) are a direct threat to these smaller startups. I was on a zoom w AWS recently and they everything required to build an app using any LLM with Langchain (still in beta)

  • @kevon217
    @kevon217 Před rokem

    Great discussions. Really appreciate you all sharing your knowledge, experience, and opinions.

  • @pavellegkodymov4295
    @pavellegkodymov4295 Před rokem

    you guys are doing a really great and useful job, thanks for your precious time!

  • @ChrisSMurphy1
    @ChrisSMurphy1 Před rokem +15

    Harrison seems like a one man show..what a hustler

    • @the0netheycallgod413
      @the0netheycallgod413 Před rokem

      Savage

    • @asatorftw
      @asatorftw Před rokem +1

      He's a total beast indeed. Always responding super fast to any questions and tweets + doing all of this stuff. Hats off!

    • @SDGwynn
      @SDGwynn Před rokem

      You mean Anton right??

  • @LeonidGaneline
    @LeonidGaneline Před rokem +2

    PRESENTERS:
    - Harrison Chase - LangChain
    - Matthew Robinson - Unstructured
    - Anton Troynikov - Chroma

  • @johnallen9992
    @johnallen9992 Před rokem +1

    interesting.. intelligent self feeding retrieval/ splits.. well.. self guiding based on 'direction' on query. 'to know how to know' is superior to 'to know what to know'

  • @tristanmorris2074
    @tristanmorris2074 Před rokem +1

    What software do you guys use for your webinars? The interface is very clean

  • @AlTheRize
    @AlTheRize Před rokem

    What a great session !

  • @dr.mikeybee
    @dr.mikeybee Před 7 měsíci

    When people learn something new, we take into consideration the source from whom we learned this new thing. We give more credence to some than to others. In other words, we have taste. When my astrologically inclined brother-in-law tells me something about science, I'm less likely to give his story total credence than if I heard it from Lex Fridman. When we do RAG, we don't rate our sources. The best we can do is ask the model to rate output. How can we safely start to rate sources and save the ratings?

  • @micbab-vg2mu
    @micbab-vg2mu Před 11 měsíci

    Great talk - thank you

  • @nukulkhadse5253
    @nukulkhadse5253 Před rokem

    One of the issues that I faced is that the self query retriever doesn't understand semantic meaning of the metadata variable we pass and it tries to match the similar word for it. For e.g., if we pass a date parameter in metadata as "26 August, 2023", it will not understand if your query has "August 26, 2023" in the input text and doesn't return any documents and just empty list. This seems like an issue to me as the user can ask questions with date in any format.

  • @decaturdev7127
    @decaturdev7127 Před rokem +1

    Why is your new video marked as for youtube kids???? Can't even save it.

  • @frazuppi4897
    @frazuppi4897 Před 11 měsíci

    can we have the slides? Thanks for the amazing talk

  • @dr.mikeybee
    @dr.mikeybee Před 7 měsíci

    Why are you concentrating on foundation models when local models have gotten so good?

  • @zacboyles1396
    @zacboyles1396 Před 11 měsíci

    31:30 💯

  • @johnallen9992
    @johnallen9992 Před rokem

    More importantly in a Pirsig/ Jordan Peterson - where Pirsig is beyond Peterson.. is can you get a 'ought from an 'is' ?' just because you can, does it mean you should ?
    Query to have option to set boundaries for self-guided feedback retrievals - not just for cost but ethically. Empiricist view of Science, useful maps based on limited hypotheses giving 'objective' firm results but yet with no 'direction' .. doesnt tell you where to go.
    oh yes.. relevancy across many aspects.. especially time base.. if streamed

  • @peter00
    @peter00 Před rokem +5

    The data confidentiality point has been solved by the cloud providers, you can get a private instance on Azure and no data will go to Microsoft or Open AI. Same goes for AWS and GCP. I don't get why this keeps being brought up a a benefit of local models. Chances are you'd run these local models on eg AWS Bedrock, same place as managed models (eg Claude)

    • @sskarimirelandsskarimirela8750
      @sskarimirelandsskarimirela8750 Před rokem +2

      the point is the freedom ..... Private Server ..... a lot o reason to not be slave for GAFA

    • @fkxfkx
      @fkxfkx Před rokem +4

      It in no way has been "solved", merely addressed and not totally convincingly.

    • @thomlinford
      @thomlinford Před rokem

      I respectfully disagree, but only a little bit 😅- I'm seeing two main use cases where this remains an issue... Communications and utilities network data legally must be on prem (not talking OT infrastructure). The other is the defence industry, both government and private.
      Both areas are actually some of the most active industries in gen ai

    • @thomlinford
      @thomlinford Před rokem

      But otherwise, agree that most businesses need to calm down and investigate before refusing .

    • @pavellegkodymov4295
      @pavellegkodymov4295 Před rokem +1

      I think it's more about APIs, you are sending your corporate data to e.g. Open AI and although it won't be used according to contract, it still will be stored on OpenAI for 30 days for "analysis purposes". The way things are now where you have a team of CIA agents deeply integrated into big tech (and OpenAI is almost owned my Microsoft now), it's not hard to imagine, that this analysis could be misused to gain more control over operations of a company, an industry and influence it's decision making process.

  • @csmac3144a
    @csmac3144a Před 9 měsíci +2

    Some of us are very sensitive to audio quality -- particularly with voice. Even if you don't see much difference, investing in a high-quality mic and using a relatively acoustically dead room will improve the quality of these webinars dramatically (again, many people are completely insensitive to these matters, but probably ⅓ of people are not). I loved the content, but the amplitude spikes in Matthew's garbage headset were so bad I almost had to stop listening.

  • @Cdaprod
    @Cdaprod Před rokem

    I write retrieval systems for agents can I work at chroma? I really need a job

    • @Cdaprod
      @Cdaprod Před rokem

      I’m primarily working with Metadata Querying for my own purposes but I also have been working on semantic generation of metadata.

    • @Cdaprod
      @Cdaprod Před rokem

      31:45 To say that something like a long term agent can’t narrow down it’s instructions permanently while having that underlying method of retrieval that you get from basically the “identity” that your trying to provide, it’s about finding that level of adaptation over time without completely changing the consistency of results.

    • @Cdaprod
      @Cdaprod Před rokem

      Is everyone else using multiple self hosted vectordbs based on use case?

  • @prasenjitgiri919
    @prasenjitgiri919 Před 11 měsíci

    The thing that would have made these talks more helpful if it was shown how it was being done rather than talking over it. This is what lacks in most of the langchain talks or events. And, the level of abstraction is dumbfound after a certain level.

  • @paulwilliams9904
    @paulwilliams9904 Před rokem

    Matthew, with the deepest respect. You say 'ah/um' A LOT. It may help to slow down and pick words your carefully. Unforturnately, it was too distracting to watch this video.

  • @Cdaprod
    @Cdaprod Před rokem +1

    35:15 @harrison found a fly 👀