Unlock AI Agent real power?! Long term memory & Self improving

"I want Llama3.1 to perform 10x with my private knowledge" - Self learning Local Llama3.1 405B

Unlimited AI Agents running locally with Ollama & AnythingLLM

Or is Harriet Quinn good? #cosplay#joker #Harriet Quinn

Přilepili si tetování na druhou ruku.

"Make Agent 10x cheaper, faster & better?" - LLM System Evaluation 101

AI Jason

zhlédnutí 18 034

Přidat do
- Můj playlist
- Přehrát později
Sdílet

Sdílet

Vložit

Velikost videa:

Zobrazit ovladače přehrávání

Automatické přehrávání

Přehrát

čas přidán 7. 09. 2024

Komentáře • 31

@Jim-ey3ry Před 3 měsíci ⁺²³
This is gold, most of people just show you how to build toy demo, but not many actually get into details of how to get into production; Thank you Jason!
@xXWillyxWonkaXx Před 3 měsíci
Couldnt agree more. This is gold.
@apereiracv Před 3 měsíci ⁺⁷
I recently be created a whole testing system for our LLM chatbots and we did exactly this:
LLM as evaluator and code
We created it as a series of unit tests with LLM generated cases.
Since our results were mostly conversational, we made tests pass/fail according to a scoring system
@tkp2843 Před 3 měsíci ⁺⁵
This is great. Loved the use of firecrawl (as a scrape tool) to get the website's data. Feel like it always helps improve the model output quality. Cheers!
@kenchang3456 Před 3 měsíci ⁺⁵
Way excellent video that goes well beyond demo. Thank you very much for this guidance.
@darrenhinde2971 Před 3 měsíci
Been looking for more detail on eval on LLMs and been scratching around for a while. Thanks for this.
@jasonfinance Před 3 měsíci ⁺³
Amazing work as always Jason!
@kayshidow Před 3 měsíci ⁺¹
I've used promptfoo for some of my test with local llm to test the ai workflow. It allow you to write assertion like you'll do with software
@contractorwolf Před 3 měsíci
goddamn Jason your videos just blow my mind each time. Thanks for such a thorough explanation and example.
@someshfengade9623 Před 3 měsíci ⁺¹
I found langfuse metric monitoring little bit better.
@agenticmark Před 3 měsíci ⁺¹
fine tune llama 3 (8bit) - you will get exactly the behavior you want - its what I do
@humanish_ai Před 3 měsíci ⁺¹
Finally you back 🎉
@jimmy-ef2ow Před 3 měsíci ⁺¹
jason can we get another video about comfy ui?
@techfren Před 3 měsíci ⁺¹
lesgooo!! ❤‍🔥❤‍🔥❤‍🔥
@CorkyBallasdancewithme Před 2 měsíci
great stuff, as new to hearing this, very interesting, can this be built by a novice . . .
@titusblair Před 3 měsíci
Awesome! Keep up the great work!
@JorritvanGinkel Před 3 měsíci
This is so good, thanks man!
@fullgazz Před 3 měsíci ⁺¹
Who never spent 4 hours to save 10 min? That's our hobby spent time to save time.
@AGI-Bingo Před 2 měsíci ⁺¹
If 25 people or more use it successfully then you literally gave humanity more time to live and be free
@Joe-bp5mo Před 3 měsíci
Sick, whats the best practice metrics for evaluating agents?
@MatrixCodeBreaker88 Před 3 měsíci
Great Video
@jordanz9580 Před 3 měsíci
fireeee content!
@Ms.Robot. Před 3 měsíci
I love how my Ai girl insults the competion with flame balls,then tells me.she loves me.❤🎉😊
@user-nt7lj1nc8s Před 3 měsíci
Why not use Gemini as the LLM? It is free.
@HyperUpscale Před 3 měsíci ⁺¹
Lets me share my experience about any google AI model ... because it doesn't understand human and it hallucinate way too much.
Practically ... in my cases 75% of the time what I get back is totally useless result. You cant use for anything... To be considered for evaluation ... you must be joking
@irql2 Před 3 měsíci
I dont see the value of "Agents". All of this stuff is easily done with basic function calling. I think I'm going to need to see some more creative use cases before I jump on board, i just dont get it yet.
@ayoubfr8660 Před 3 měsíci
Maybe we can discuss this, I am trying to jump on in but not until I find a decent idea to apply.
@symbol9new Před 3 měsíci
when your assistant has a lot of functions, he starts giving out hallucinations, have you ever encountered this?
@SydneyF-eg5lt Před 3 měsíci
Good content but so hard to listen to his Engrish. Monotonous Pitch n sped up delivery didn’t seem to help either.

Další v pořadí

Automatické přehrávání

Unlock AI Agent real power?! Long term memory & Self improving

Unlock AI Agent real power?! Long term memory & Self improving

"I want Llama3.1 to perform 10x with my private knowledge" - Self learning Local Llama3.1 405B

"I want Llama3.1 to perform 10x with my private knowledge" - Self learning Local Llama3.1 405B

Unlimited AI Agents running locally with Ollama & AnythingLLM

Unlimited AI Agents running locally with Ollama & AnythingLLM

Or is Harriet Quinn good? #cosplay#joker #Harriet Quinn

Or is Harriet Quinn good? #cosplay#joker #Harriet Quinn

Přilepili si tetování na druhou ruku.

Přilepili si tetování na druhou ruku.

I play this like Cristiano Ronaldo⚽❓

I play this like Cristiano Ronaldo⚽❓

Claude 3.5 struggle too?! The $Million dollar challenge

Claude 3.5 struggle too?! The $Million dollar challenge

Building with AI Agent Teams

Building with AI Agent Teams

“Wait, this Agent can Scrape ANYTHING?!” - Build universal web scraping agent

“Wait, this Agent can Scrape ANYTHING?!” - Build universal web scraping agent

This Social Media AI System Creates Unique Content Daily! (100% Automated)

This Social Media AI System Creates Unique Content Daily! (100% Automated)

GPT4V + Puppeteer = AI agent browse web like human? 🤖

GPT4V + Puppeteer = AI agent browse web like human? 🤖

Python RAG Tutorial (with Local LLMs): AI For Your PDFs

Python RAG Tutorial (with Local LLMs): AI For Your PDFs

AUTOGEN TUTORIAL - build AI agents with GPT-4o and Microsoft's AutoGen

AUTOGEN TUTORIAL - build AI agents with GPT-4o and Microsoft's AutoGen

Hugging Face + Langchain in 5 mins | Access 200k+ FREE AI models for your AI apps

Hugging Face + Langchain in 5 mins | Access 200k+ FREE AI models for your AI apps

Industrial-scale Web Scraping with AI & Proxy Networks

Industrial-scale Web Scraping with AI & Proxy Networks

Only I get to bully my sister 😤

Only I get to bully my sister 😤

Mikuláš Černák: PŘÍBĚH BOSSE (celý dokument)

Mikuláš Černák: PŘÍBĚH BOSSE (celý dokument)

How Far Would You Make It? 😳

How Far Would You Make It? 😳

PÁRTY VE 20 vs VE 30 LETECH 😅😂

PÁRTY VE 20 vs VE 30 LETECH 😅😂

SPONGEBOB POWER-UPS IN BRAWL STARS!!!

SPONGEBOB POWER-UPS IN BRAWL STARS!!!

We need to beat it… 😳⚽️

We need to beat it… 😳⚽️

Moja Prvá Šialená Skúsenosť s Alkoholom - Animácia CZ/SK

Moja Prvá Šialená Skúsenosť s Alkoholom - Animácia CZ/SK

TOHLE JSEM FAKT NEPOTŘEBOVAL VĚDĚT 😅

TOHLE JSEM FAKT NEPOTŘEBOVAL VĚDĚT 😅