CS480/680 Lecture 19: Attention and Transformer Networks

Sensor Fusion for Autonomous Vehicles: Strategies, Methods, and Tradeoffs | Synopsys

Jeff Dean (Google): Exciting Trends in Machine Learning

Power Up: Creating the Perfect Charging Point in Your Home!

Earth - MŮJ SOUSED DOSTAL 10 LET, KVŮLI DÍTĚTI SE MUSÍŠ OMEZOVAT, YT NECHCI, MUSEL JSEM PŘESTAT S…

OPAKUJ PO MNĚ 🫵 😂

Multimodality and Data Fusion Techniques in Deep Learning

ISTA Conference

zhlédnutí 4 222

Přidat do
- Můj playlist
- Přehrát později
Sdílet

Sdílet

Vložit

Velikost videa:

Zobrazit ovladače přehrávání

Automatické přehrávání

Přehrát

čas přidán 17. 10. 2023
Petar Velev, Senior Software Engineer at Bosch Engineering Center Sofia
In this lecture, I will introduce the concept of multimodal deep learning and highlight the critical role of data fusion techniques. I’ll begin by explaining the principle of multimodality and how it aligns with the inherently multimodal nature of human cognition.
Through real-world examples, such as networks that merge audio and video, audio and accelerometer, or audio and text, I’ll illustrate how multimodal learning is implemented in practice.
A key part of the discussion will be devoted to data fusion techniques - early, late, and hybrid fusion. I’ll present their applications and discuss their respective advantages and potential limitations.
To conclude, I’ll provide a brief overview of the future of multimodal deep learning, touching on potential developments and challenges. The aim of this lecture is to offer a succinct yet comprehensive understanding of multimodal deep learning, demonstrating its transformative potential in the field of AI.
Věda a technologie

Komentáře • 2

@nataliatenoriomaia1635 Před 3 měsíci
great talk!
@manalkim200 Před 3 měsíci
interesting

Další v pořadí

Automatické přehrávání

CS480/680 Lecture 19: Attention and Transformer Networks

CS480/680 Lecture 19: Attention and Transformer Networks

Sensor Fusion for Autonomous Vehicles: Strategies, Methods, and Tradeoffs | Synopsys

Sensor Fusion for Autonomous Vehicles: Strategies, Methods, and Tradeoffs | Synopsys

Jeff Dean (Google): Exciting Trends in Machine Learning

Jeff Dean (Google): Exciting Trends in Machine Learning

Power Up: Creating the Perfect Charging Point in Your Home!

Power Up: Creating the Perfect Charging Point in Your Home!

Earth - MŮJ SOUSED DOSTAL 10 LET, KVŮLI DÍTĚTI SE MUSÍŠ OMEZOVAT, YT NECHCI, MUSEL JSEM PŘESTAT S…

Earth - MŮJ SOUSED DOSTAL 10 LET, KVŮLI DÍTĚTI SE MUSÍŠ OMEZOVAT, YT NECHCI, MUSEL JSEM PŘESTAT S…

OPAKUJ PO MNĚ 🫵 😂

OPAKUJ PO MNĚ 🫵 😂

PRVNÍ HÁDKA MEZI MILANEM A KAMILEM | Příběhy o lásce a vztazích ve škole #kikido #shorts

PRVNÍ HÁDKA MEZI MILANEM A KAMILEM | Příběhy o lásce a vztazích ve škole #kikido #shorts

How do Multimodal AI models work? Simple explanation

How do Multimodal AI models work? Simple explanation

MIA: Carl de Boer, Deep learning regulatory models; David Kelley, Expression prediction from DN

MIA: Carl de Boer, Deep learning regulatory models; David Kelley, Expression prediction from DN

MIT 6.S191: Deep Generative Modeling

MIT 6.S191: Deep Generative Modeling

Unlocking the Power of Real-Time Analytics with InfluxDB

Unlocking the Power of Real-Time Analytics with InfluxDB

Has Generative AI Already Peaked? - Computerphile

Has Generative AI Already Peaked? - Computerphile

Why Computer Vision Is a Hard Problem for AI

Why Computer Vision Is a Hard Problem for AI

Embeddings - EXPLAINED!

Embeddings - EXPLAINED!

How to build Multimodal Retrieval-Augmented Generation (RAG) with Gemini

How to build Multimodal Retrieval-Augmented Generation (RAG) with Gemini

40$ or 50$ or Typecase iPad keyboard #ipadkeyboard #ipadcase #typecase #ipad #ipadpro

40$ or 50$ or Typecase iPad keyboard #ipadkeyboard #ipadcase #typecase #ipad #ipadpro

Why 3D Printing Struggles with Curved Surfaces #3dprinting

Why 3D Printing Struggles with Curved Surfaces #3dprinting

Tag her 🤭💞 #miniphone #smartphone #iphone #samsung #fyp

Tag her 🤭💞 #miniphone #smartphone #iphone #samsung #fyp

If Google Recreated The Apple Vision Pro part 2

If Google Recreated The Apple Vision Pro part 2

100+ Linux Things you Need to Know

100+ Linux Things you Need to Know

Deep Cleaning and Fixing The DIRTIEST IPad 🤢🤮 #shorts #apple #ipad

Deep Cleaning and Fixing The DIRTIEST IPad 🤢🤮 #shorts #apple #ipad

Want more performance out of your PC? #pc #pcbuild #pcgaming #gaming pc #RGB

Want more performance out of your PC? #pc #pcbuild #pcgaming #gaming pc #RGB

The Weird, Terrible Smartphones They Only Have in North Korea

The Weird, Terrible Smartphones They Only Have in North Korea