Masked Autoencoders (MAE) Paper Explained
- Published 17 Jul 2024
- Paper link: arxiv.org/abs/2111.06377
In this video, I explain how masked autoencoders work by borrowing ideas from the BERT paper and pretraining a vision transformer without requiring any additional labels.
Table of Contents:
00:00 Intro
00:19 BERT idea
02:09 Language and vision difference
05:29 Proposed Architecture
11:30 After pretraining
14:03 Masking ratio
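The core trick the video walks through is random patch masking: a high fraction (around 75%) of image patches is dropped, the encoder sees only the visible ones, and a lightweight decoder reconstructs the missing pixels. Below is a minimal NumPy sketch of that masking step; the function name and shapes are my own illustration, not the paper's official code.

```python
import numpy as np

def random_masking(patches, mask_ratio=0.75, seed=0):
    """Keep a random subset of patches; the MAE encoder sees only these.

    patches: (N, D) array of flattened image patches.
    Returns the visible patches, their indices, and a boolean mask
    where True marks a patch the decoder must reconstruct.
    """
    rng = np.random.default_rng(seed)
    n = patches.shape[0]
    n_keep = int(n * (1 - mask_ratio))
    perm = rng.permutation(n)
    keep_idx = np.sort(perm[:n_keep])   # indices of visible patches
    mask = np.ones(n, dtype=bool)       # True = masked
    mask[keep_idx] = False
    return patches[keep_idx], keep_idx, mask

# 196 patches (a 14x14 grid from a 224x224 image with 16x16 patches), 768-dim each
patches = np.random.randn(196, 768)
visible, keep_idx, mask = random_masking(patches)
print(visible.shape)  # (49, 768) -> the encoder processes only 25% of patches
```

Because the encoder runs on just a quarter of the tokens, pretraining is much cheaper than encoding the full sequence, which is one of the paper's key efficiency arguments.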
Hello man... my thanks to you. Your explanation of MAE is clear, which is absent from almost all other explanations on CZcams.
Glad you enjoyed it!
presentation skills = lit !
Thanks 😃
awesome
explained in detail
Glad you liked it😃
Thanks
Great
Kaiming He is just the best
Indeed
🫡 well done
Does it work on a small dataset? Let's say 1000 images?
I don't think so. Transformers are data-hungry and need a lot of data to generalize. The smallest pretraining dataset I've seen was in ViTPose, where they pretrained with this technique on 150k images, and doubling the data improved results by only 1.3%.