New Discovery: Retrieval Heads for Long Context

  • Added on 21 July 2024
  • A new study by MIT and Peking University discovers a new element in transformers: retrieval heads!
    Retrieval heads affect RAG performance, Chain-of-Thought (CoT) reasoning, and long-context retrieval quality.
    These special attention heads carry out the information retrieval over long contexts and show exceptional characteristics. Applications include: better RAG systems, more reliable long-context retrieval, improved CoT reasoning, and reduced factual hallucination in RAG pipelines and LLMs, all driven by these newly discovered retrieval heads in the transformer architecture. A minimal detection sketch follows the use-case list below.
    5 potential real-world use cases of more powerful retrieval heads:
    -----------------------------------------------------------------------------------------------------
    1. Enhanced Document Summarization Tools for Legal and Healthcare Fields:
    By leveraging the insights into retrieval heads, developers can create advanced document summarization tools tailored for sectors like legal and healthcare, where precision in extracting relevant information from lengthy documents is crucial. Such tools would ensure that vital details are accurately captured and summarized, enabling faster and more reliable review processes for legal cases or patient medical histories, thus reducing the workload and improving the decision-making accuracy for professionals in these fields.
    2. Real-Time Information Retrieval Systems for Financial Markets:
    Financial analysts and traders could benefit from real-time information retrieval systems that utilize retrieval heads to quickly parse and extract critical data from vast amounts of market news, reports, and regulatory filings. This application would allow for the rapid assimilation of pertinent information, enhancing decision-making in fast-paced environments and potentially leading to more informed and timely investment strategies.
    3. Advanced Assistive Technologies for Educational Purposes:
    Educational platforms can integrate these insights to develop more nuanced assistive technologies that help students engage with learning materials more effectively. For instance, a system could use retrieval heads to dynamically extract key concepts and summaries from extensive educational content, providing students with tailored reviews or preparatory materials that focus on areas where they need more understanding, thereby personalizing and enhancing the learning experience.
    4. Optimized Content Moderation in Social Media:
    Social media platforms can implement models with enhanced retrieval heads to improve content moderation by accurately identifying and extracting problematic elements from large volumes of posts and comments. This application could lead to more effective and nuanced filtering processes, reducing the spread of misinformation and inappropriate content, while maintaining a balance with freedom of expression.
    5. Intelligent Search Engines for Scientific Research:
    Utilizing retrieval heads in search engines specifically designed for scientific research could revolutionize how researchers find relevant studies and data. These search engines would be capable of understanding the context of queries and retrieving the most pertinent papers or data from vast digital libraries, significantly accelerating the research process and encouraging deeper insights across disciplines like physics, chemistry, and biology. This would not only save time but also foster interdisciplinary collaborations by seamlessly linking relevant findings and methodologies across diverse fields.
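    How are these heads found? The paper's detection method is a needle-in-a-haystack probe: during decoding, a head gets credit whenever the context token it attends to most strongly is exactly the needle token currently being copied into the answer, and its retrieval score is the fraction of such copy steps on which this happens. Below is a minimal PyTorch sketch of that scoring loop; the tensor layout and the function name retrieval_scores are illustrative assumptions, not the authors' released code.

        import torch

        def retrieval_scores(attentions, input_ids, generated_ids, needle_positions):
            # attentions: one tensor per generated token, shaped
            # [n_layers, n_heads, context_len] -- each head's attention from the
            # current decoding position back over the prompt (assumed layout).
            n_layers, n_heads, _ = attentions[0].shape
            hits = torch.zeros(n_layers, n_heads)
            copy_steps = 0
            for step, attn in enumerate(attentions):
                tok = generated_ids[step]
                # Only steps that copy a needle token out of the context count.
                copied_from = [p for p in needle_positions if input_ids[p] == tok]
                if not copied_from:
                    continue
                copy_steps += 1
                top = attn.argmax(dim=-1)  # [n_layers, n_heads]: strongest-attended position
                for p in copied_from:
                    hits += (top == p).float()  # head pointed exactly at the copied token
            return hits / max(copy_steps, 1)  # per-head retrieval score in [0, 1]

    Heads whose score stays high across many probes are the retrieval heads; the paper finds that only a small fraction of all heads (on the order of 5%) behave this way, and that the set is largely consistent with the pre-trained base model (see the 21:10 chapter below).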
    00:00 Intro (Green grasshoppers)
    03:16 What do attention heads focus on?
    05:58 Long-context factuality by retrieval heads
    07:20 Needle in a Haystack Benchmark
    10:01 How many retrieval heads in an LLM?
    15:30 What is a retrieval head?
    21:10 Retrieval heatmap consistent with pre-trained base model
    23:10 Retrieval heads and Chain-of-Thought Reasoning
    25:17 Retrieval heads explain why LLMs hallucinate
    28:10 How to generate more retrieval heads in LLMs?
    All rights with the authors (arXiv pre-print):
    Retrieval Head Mechanistically Explains Long-Context Factuality
    by Wenhao Wu et al.
    #airesearch #insights #reasoning
  • Science & Technology

Comments • 12

  • @_paixi
    2 months ago +2

    Their idea to prune the non-retrieval heads from the KV cache would be a huge breakthrough if it works.
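    A toy sketch of that pruning idea, assuming per-head retrieval scores from a detection pass; the threshold, window size, and tensor shapes here are illustrative, not the paper's method:

        import torch

        def prune_kv_cache(keys, values, scores, threshold=0.1, recent=128):
            # keys/values: [n_heads, seq_len, head_dim] KV cache for one layer.
            # scores: [n_heads] retrieval scores from a detection pass.
            pruned = []
            for h in range(keys.shape[0]):
                if scores[h] >= threshold:
                    # Retrieval head: keep the entire cache so long-range
                    # copy-paste lookups still work.
                    pruned.append((keys[h], values[h]))
                else:
                    # Non-retrieval head: keep only a recent local window.
                    pruned.append((keys[h, -recent:], values[h, -recent:]))
            return pruned

    The memory savings would be large because so few heads score highly; whether output quality survives the truncation is, as the comment says, the open question.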

  • @juice2
    2 months ago

    So many interesting findings in this paper. Thank you for the video!

  • @mshonle
    2 months ago +1

    The posited connection between retrieval performance and reasoning may also explain why models trained on code show improved reasoning even on non-coding tasks.

  • @blaisedestais6585
    2 months ago

    Hey! Thanks for the explanation! Do you have an idea of how attention works for few-shot learning? Is it just the same, or are there other things to keep in mind? Because few-shot examples are so important in prompt engineering! Thanks!!!

  • @thedoctor5478
    2 months ago +1

    such a good one. ty

  • @christiand6312
    2 months ago

    Love this… very good

  • @MichaelScharf
    2 months ago +2

    Could you add the links to the paper(s)?

    • @christiand6312
      2 months ago

      Pause the video, take a screenshot, gpt vision it

  • @yannickpezeu3419
    2 months ago

    ❤❤❤

  • @wilfredomartel7781
    2 months ago +1