[Own work] On Measuring Faithfulness or Self-consistency of Natural Language Explanations

AI Coffee Break with Letitia

3,128 views

Added 7 Sep 2024
Excited to share my ACL 2024 presentation on my almost-last PhD paper about LLM self-explanations! 🎓📚
Are you joining ACL 2024 in Bangkok? Ping me, let's chat!
AI Coffee Break Merch! 🛍️ aicoffeebreak....
📜 “On measuring faithfulness of natural language explanations” L Parcalabescu, A Frank arxiv.org/abs/...
(follow-up paper for vision and language models):
📜 “Do Vision & Language Decoders use Images and Text equally? How Self-consistent are their Explanations?” L Parcalabescu, A Frank arxiv.org/abs/...
Thanks to our Patrons who support us in Tier 2, 3, 4: 🙏
Dres. Trost GbR, Siltax, Vignesh Valliappan, Michael, Sunny Dhiana, Andy Ma
▀▀▀▀▀▀▀▀▀▀▀▀▀▀▀▀▀▀▀▀▀▀▀▀▀▀
🔥 Optionally, buy us a coffee to help with our Coffee Bean production! ☕
Patreon: / aicoffeebreak
Ko-fi: ko-fi.com/aico...
Join this channel to get access to perks:
/ @aicoffeebreak
▀▀▀▀▀▀▀▀▀▀▀▀▀▀▀▀▀▀▀▀▀▀▀▀▀▀
🔗 Links:
AICoffeeBreakQuiz: / aicoffeebreak
Twitter: / aicoffeebreak
Reddit: / aicoffeebreak
YouTube: / aicoffeebreak
#AICoffeeBreak #MsCoffeeBean #MachineLearning #AI #research #ACL2024NLP #PhDLife
Video editing: Nils Trost
Music 🎵 : Bella Bella Beat - Nana Kwabena

Comments • 20

@theosalmon 1 month ago ⁺⁶
Thank you Dr. Letitia.
@alexkubiesa9073 1 month ago ⁺³
This sounds very useful! LLM users tend to assume that just because it writes like a human, it can introspect and reason about its thought processes, which of course is not a given. But it's great to see progress on measuring this ability (or at least self-consistency) so that newer models can be more ergonomic.
@DerPylz 1 month ago ⁺⁵
Thanks for sharing your work! Always great to see what you're up to!
@AICoffeeBreak 1 month ago ⁺¹
Much appreciated!
@MikeAirforce111 1 month ago ⁺⁴
Congrats Doctor!! :-) Looking forward to your future work!
@Thomas-gk42 1 month ago ⁺⁶
Congratulations on your doctorate🖖
@beatrixcarroll8144 1 month ago ⁺⁶
Congrats Dr. Letitia!!!! Wow, YOU ROCK!!!!!!! :-D :-) P.S. We missed you!!
@fingerstyledojo 1 month ago ⁺⁵
Yay, new video!
Thanks for letting me pass yesterday lol
@AICoffeeBreak 1 month ago ⁺¹
Wow, you have a channel! It's amazing, just checked it out! 🤩
@serta5727 1 month ago ⁺⁴
Cool 😎 your explanation was very understandable
@nitinss3257 1 month ago ⁺⁵
1 minute ago for non members ... good to see ya
@MaxShawabkeh 1 month ago ⁺³
Congrats on the PhD! This is really valuable work! I'm currently trying to squeeze as much reasoning capability as I can out of small LLMs (7-15B) for my company's product, and I'd love a longer video or recorded talk going into the details of your findings, any patterns you've found that improve or reduce self-consistency, or any insights on which existing models or training corpora result in better self-consistency and reasoning capabilities. If you have any pointers, I'd appreciate it!
@AICoffeeBreak 1 month ago ⁺²
As far as we can see from this paper's experiments, RLHF helps improve self-consistency, but we do not yet have any hints about what else has this effect. Size might, but within what we *could* test on our infrastructure we did not measure an effect. It might be there; we just couldn't test far enough.
@MaxShawabkeh 1 month ago
@AICoffeeBreak Thanks!
@naromsky 1 month ago ⁺⁴
🎉
@Ben_D. 1 month ago ⁺⁴
No ASMR? 😟
@AICoffeeBreak 1 month ago ⁺²
It was an entire blooper. Next time for sure. 😅
@anluifb 1 month ago ⁺¹
So you came up with a method, didn't have time to explain the method to us, and didn't show us that it works. Great.
If you still have time before Bangkok I would suggest rerecording and focusing on the implementation and interpretation of results rather than the context and wordy descriptions.
@AICoffeeBreak 1 month ago ⁺¹
Thanks for your feedback. The method is in the video, just not the tiny details.
1. Use SHAP to interpret both the prediction and the explanation. (Mentioned in the video.)
2. Measure their alignment (mentioned) after:
- normalisation: to bring the values into the same range (mentioned; I did not mention that SHAP's properties make the values very different between output tokens with different probabilities)
- aggregation: to collect the many values from the many outputs (mentioned; I did not mention that we use the mean for this)
For the results, I've summarised in words what we see, along with the main takeaways. For the lengthy tables, please check the paper and its appendix. I don't know what you mean by the video not showing that it works; I also showed an individual example before the takeaways. The problem that there is no ground truth of course exists for us as well as for previous work. But for the first time in the literature, we now *compare* existing tests to each other, and our method to them.
This is why the context is important: it makes this clear. Our paper's main contribution is to evaluate and clarify the state of the field; as a follow-up contribution, we propose a new method that addresses the shortcomings of existing tests.
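
For readers who want to see the shape of the steps listed in the reply above (interpret, normalise, aggregate with the mean, measure alignment), here is a minimal Python sketch. It is not the paper's implementation. The function names (`normalise`, `aggregate`, `alignment`) are hypothetical, the inputs are plain lists standing in for per-output-token SHAP values over the same input tokens, and both the normalisation (dividing by the sum of absolute values) and the alignment measure (cosine similarity) are assumptions chosen for illustration. Only the use of the mean for aggregation comes from the reply.

```python
import math


def normalise(values):
    """Scale one output token's contributions so their absolute values sum to 1.

    Assumption: this particular normalisation is illustrative, not taken
    from the paper.
    """
    total = sum(abs(v) for v in values)
    if total == 0:
        return [0.0 for _ in values]
    return [v / total for v in values]


def aggregate(per_token_values):
    """Normalise each output token's contributions, then take the mean
    over output tokens (the reply states that the mean is used)."""
    if not per_token_values:
        raise ValueError("need at least one output token")
    normalised = [normalise(row) for row in per_token_values]
    count = len(normalised)
    return [sum(column) / count for column in zip(*normalised)]


def cosine_similarity(a, b):
    """Cosine similarity of two equal-length vectors (0.0 if either is all zeros)."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    if norm_a == 0 or norm_b == 0:
        return 0.0
    return dot / (norm_a * norm_b)


def alignment(prediction_values, explanation_values):
    """Compare aggregated input contributions for the prediction and for
    the explanation. Assumption: cosine similarity as the alignment score."""
    return cosine_similarity(
        aggregate(prediction_values), aggregate(explanation_values)
    )
```

Each argument is a list with one row per generated output token, and each row holds one contribution value per input token. Under these assumptions, a score near 1 means the prediction and the explanation relied on the same input tokens in the same direction, and a score near -1 means they relied on them in opposite directions.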
