Transformer Encoder vs LSTM Comparison for Simple Sequence (Protein) Classification Problem
- added 23 Jun 2024
- The purpose of this video is to present results comparing a single Transformer Encoder layer against a single LSTM layer on a very simple problem. Several texts on Natural Language Processing describe the power of the LSTM as well as the advanced sequence-processing capabilities of self-attention and the Transformer. This video offers very simple empirical results in support of those notions.
Previous Video:
• A Very Simple Transfor...
Code:
github.com/BrandenKeck/pytorc...
Interesting Post:
ai.stackexchange.com/question...
Music Credits:
Breakfast in Paris by Alex-Productions | onsound.eu/
Music promoted by www.free-stock-music.com
Creative Commons / Attribution 3.0 Unported License (CC BY 3.0)
creativecommons.org/licenses/...
Small Town Girl by | e s c p | www.escp.space
escp-music.bandcamp.com
Zamm!
I never knew that transformers were that much more time efficient at large embedding sizes
Hey @LeoDaLionEdits - I'm very interested in ideas like these. I've unfortunately lost my link to the paper, but there was an interesting arXiv article on why XGBoost still dominates Kaggle competitions compared to Deep Neural Networks. Depending on the problem, I think the RNN / LSTM may often be competitive in the same way: the simpler, tried-and-true model winning out. From a performance perspective, this book notes the parallel-processing advantage of transformers in sections 10.1 (intro) and 10.1.4 (parallelizing self-attention): web.stanford.edu/~jurafsky/slp3/ed3book.pdf
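To make the parallelism point concrete, here is a minimal NumPy sketch (not the code from the video's repo) contrasting the two computations. Self-attention produces outputs for every position with a few matrix multiplies, which hardware can parallelize across the sequence, while an RNN-style recurrence must step through time because each hidden state depends on the previous one. All names and weight shapes here are illustrative assumptions.

```python
import numpy as np

rng = np.random.default_rng(0)
T, d = 5, 4                      # toy sequence length and embedding size
X = rng.standard_normal((T, d))  # toy input sequence

def self_attention(X):
    # All T x T attention scores come from a single matmul, so the whole
    # sequence is processed at once (parallelizable over positions).
    scores = X @ X.T / np.sqrt(X.shape[1])
    weights = np.exp(scores - scores.max(axis=1, keepdims=True))
    weights /= weights.sum(axis=1, keepdims=True)   # row-wise softmax
    return weights @ X                              # (T, d) outputs in one shot

def simple_rnn(X, Wh, Wx):
    # h_t depends on h_{t-1}, so this loop is inherently sequential --
    # positions cannot be computed in parallel.
    h = np.zeros(X.shape[1])
    outs = []
    for x_t in X:
        h = np.tanh(Wh @ h + Wx @ x_t)
        outs.append(h)
    return np.stack(outs)

Wh = rng.standard_normal((d, d)) * 0.1
Wx = rng.standard_normal((d, d)) * 0.1
attn_out = self_attention(X)
rnn_out = simple_rnn(X, Wh, Wx)
print(attn_out.shape, rnn_out.shape)  # both (5, 4)
```

This is why the gap grows with embedding size: the attention matmuls scale well on parallel hardware, while the recurrence pays its sequential cost at every time step regardless.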