Video není dostupné.

Omlouváme se.

FixMatch: Simplifying Semi-Supervised Learning with Consistency and Confidence

Yannic Kilcher

zhlédnutí 18 444

Přidat do
- Můj playlist
- Přehrát později
Sdílet

Sdílet

Vložit

Velikost videa:

Zobrazit ovladače přehrávání

Automatické přehrávání

Přehrát

čas přidán 18. 08. 2024
FixMatch is a simple, yet surprisingly effective approach to semi-supervised learning. It combines two previous methods in a clever way and achieves state-of-the-art in regimes with few and very few labeled examples.
Paper: arxiv.org/abs/...
Code: github.com/goo...
Abstract:
Semi-supervised learning (SSL) provides an effective means of leveraging unlabeled data to improve a model's performance. In this paper, we demonstrate the power of a simple combination of two common SSL methods: consistency regularization and pseudo-labeling. Our algorithm, FixMatch, first generates pseudo-labels using the model's predictions on weakly-augmented unlabeled images. For a given image, the pseudo-label is only retained if the model produces a high-confidence prediction. The model is then trained to predict the pseudo-label when fed a strongly-augmented version of the same image. Despite its simplicity, we show that FixMatch achieves state-of-the-art performance across a variety of standard semi-supervised learning benchmarks, including 94.93% accuracy on CIFAR-10 with 250 labels and 88.61% accuracy with 40 -- just 4 labels per class. Since FixMatch bears many similarities to existing SSL methods that achieve worse performance, we carry out an extensive ablation study to tease apart the experimental factors that are most important to FixMatch's success. We make our code available at this https URL.
Authors: Kihyuk Sohn, David Berthelot, Chun-Liang Li, Zizhao Zhang, Nicholas Carlini, Ekin D. Cubuk, Alex Kurakin, Han Zhang, Colin Raffel
Links:
CZcams: / yannickilcher
Twitter: / ykilcher
BitChute: www.bitchute.c...
Minds: www.minds.com/...

Komentáře • 21

@manuelpariente2288 Před 4 lety ⁺²²
Thanks again :-)
Loved the critic at the end.
Also, nice from them that they report these results, lots of papers would silence it to make it seem like the method brought all the gains !
@shrinathdeshpande5004 Před 4 lety ⁺⁸
definitely one of the best ways to explain a paper!! Kudos to you
@herp_derpingson Před 4 lety ⁺¹⁹
78% accuracy from 1 image per class. This blew my mind.
What a time to be alive.
@TeoZarkopafilis Před 4 lety ⁺⁶
HOLD ON TO YOUR PAPERS
@meudta293 Před 4 lety ⁺¹
my brain matter is all over the floor right now hhh
@matthewtang1489 Před 4 lety ⁺¹
@@TeoZarkopafilis Woah! A fellow scholar here!
@sora4222 Před rokem
I loved the critique at the end. Thanks.
@hungdungnguyen8258 Před 3 měsíci
well explained. Thank you
@hihiendru Před 4 lety ⁺¹
just like UDA, emphasis on way you augment. and poor UDA got rejected. ps LOVE your breakdowns, please keep them coming.
@jurischaber6935 Před rokem
Thanks again...Great teacher for us students. 🙂
@AmitKumar-ts8br Před 3 lety
Really nice explanation and concise...
@vishalahuja2502 Před 3 lety ⁺¹
Yannic, nice coverage of the paper. I have one question: at 15:05, you explain that the pseudo-label is used only if the confidence is above a certain threshold (which is also a hyperparameter). Where is the confidence coming from? It is well known that the confidence score coming out of softmax is not reliable. Can you please explain?
@tengotooborn Před 3 lety
Something which I find weird: isn’t a constant pseudolabel always correct? It seems that there are only positive examples in the scheme which uses the unlabeled data, and so there is nothing in the loss which forces the model to not always output the same pseudolabel for everything.
Yes, one can argue that this would fail the supervised loss, but then the question becomes “how is the supervised loss weighted w.r.t. the unsupervised loss”. In any case, it seems that one would also desire to have negative examples in the unsupervised case
@NooBiNAcTioN1334 Před 2 lety
Fantastic!
@reginaldanderson7218 Před 4 lety ⁺¹
Nice edit
@ramonbullock6630 Před 4 lety ⁺¹
I love this content :D
@christianleininger2954 Před 4 lety
Really Good Job please keep going
@abhishekmaiti8332 Před 4 lety ⁺¹
In what order do they train the model, feed the labelled image first and then the unlabelled ones? Also, can two unlabelled images of the same class have a different pseudo label?
@YannicKilcher Před 4 lety ⁺⁴
I think they do everything at the same time. I guess the labelled images can also go the unlabelled way, yes. But not the other way around, obviously :)
@Manu-lc4ob Před 4 lety ⁺¹
What is the software that you are using to annotate papers Yannic ? I am using Margin notes but it does not seem as smooth
@Dr.Z.Moravcik-inventor-of-AGI Před 3 lety
Google again, wow! 😂

Další v pořadí

Automatické přehrávání

Gradient Surgery for Multi-Task Learning

Gradient Surgery for Multi-Task Learning

Supervised Contrastive Learning

Supervised Contrastive Learning

Why Does Diffusion Work Better than Auto-Regression?

Why Does Diffusion Work Better than Auto-Regression?

Kdo je silnější??? 🔥🔥🔥

Kdo je silnější??? 🔥🔥🔥

Woman attacks Uber driver with pepper spray

Woman attacks Uber driver with pepper spray

This is the Biggest SAW in the World 😱🪚 #camping #survival #bushcraft #outdoors #lifehack

This is the Biggest SAW in the World 😱🪚 #camping #survival #bushcraft #outdoors #lifehack

How Countries eat spaghetti

How Countries eat spaghetti

Variational Autoencoders

Variational Autoencoders

What are AI Agents?

What are AI Agents?

The Sun is NOT the Center of the Solar System

The Sun is NOT the Center of the Solar System

The Most Misunderstood Concepts in Science

The Most Misunderstood Concepts in Science

Yann LeCun: Self-Supervised Learning Explained | Lex Fridman Podcast Clips

Yann LeCun: Self-Supervised Learning Explained | Lex Fridman Podcast Clips

DETR: End-to-End Object Detection with Transformers (Paper Explained)

DETR: End-to-End Object Detection with Transformers (Paper Explained)

Backpropagation and the brain

Backpropagation and the brain

BYOL: Bootstrap Your Own Latent: A New Approach to Self-Supervised Learning (Paper Explained)

BYOL: Bootstrap Your Own Latent: A New Approach to Self-Supervised Learning (Paper Explained)

FIZI DRINK VS. COCA COLA? Co je zdravější? #shorts #marcel #cocacola #fizistyle

FIZI DRINK VS. COCA COLA? Co je zdravější? #shorts #marcel #cocacola #fizistyle

How Countries eat spaghetti

How Countries eat spaghetti

Getting her riled up for no reason 🤣

Getting her riled up for no reason 🤣

Woman attacks Uber driver with pepper spray

Woman attacks Uber driver with pepper spray

艾莎生气，王子粗暴化解尴尬#艾莎

艾莎生气，王子粗暴化解尴尬#艾莎

Co divného umíš ty? 😝

Co divného umíš ty? 😝

Replacing a valve on a full water tank! 🫣💦 - 🎥 the_ladyplumber

Replacing a valve on a full water tank! 🫣💦 - 🎥 the_ladyplumber

Proč první Deadpool nemĕl ústa? #deadpool #wolverine #shorts

Proč první Deadpool nemĕl ústa? #deadpool #wolverine #shorts