Talks # 4: Sebastien Fischman - Pytorch-TabNet: Beating XGBoost on Tabular Data Using Deep Learning

  • Published 4 Jun 2020
  • Talks # 4:
    Speaker: Sebastien Fischman ( / sebastienfischman )
    Title: Pytorch-tabnet: Beating XGBoost on tabular data with deep learning?
    Abstract: #DeepLearning has set new benchmarks for Computer Vision, NLP, Speech, and Reinforcement Learning in the past few years.
    However, tabular data competitions are still dominated by gradient boosted tree (GBT) libraries like XGBoost, LightGBM and CatBoost.
    TabNet is a promising new deep learning architecture based on sequential attention, proposed by Arik & Pfister, that aims to fill the gap between GBTs and neural networks.
    Pytorch-tabnet is an open-source library that provides a scikit-like interface for training a TabNetClassifier or TabNetRegressor. Its ease of use allows any developer to quickly try a #TabNet architecture on any dataset (see the usage sketch below), hopefully setting new benchmarks.
    Bio: Worked as a Data Scientist in France and Australia on very different topics:
    - user segmentation based on shopping habits for Woolworths @Quantium
    - real-time bidding advertising @Tradelab
    - stock market predictions based on sentiment analysis of social media @SESAMm
    - auto ML platform with explainable AI @DreamQuark
    - now working on early-stage cancer detection in new OCT-3D images @DamaeMedical
    To give a talk in Talks, fill out this form here: bit.ly/AbhishekTalks
    ----
    Follow me on:
    Twitter: / abhi1thakur
    LinkedIn: / abhi1thakur
    Kaggle: kaggle.com/abhishek
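
    A minimal, hedged sketch of that scikit-like interface (assumes pip install pytorch-tabnet; the synthetic data and hyperparameters are illustrative, not from the talk):

    import numpy as np
    from pytorch_tabnet.tab_model import TabNetClassifier

    # Toy data: 1000 rows, 20 numeric features, binary target.
    rng = np.random.default_rng(0)
    X = rng.normal(size=(1000, 20)).astype(np.float32)
    y = (X[:, 0] + X[:, 1] > 0).astype(np.int64)
    X_train, y_train = X[:800], y[:800]
    X_valid, y_valid = X[800:], y[800:]

    clf = TabNetClassifier()  # sensible defaults, scikit-learn style
    clf.fit(
        X_train, y_train,
        eval_set=[(X_valid, y_valid)],
        max_epochs=50,
        patience=10,  # stop early if the eval metric stops improving
    )
    preds = clf.predict(X_valid)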

Comments • 33

  • @abhishekkrthakur  4 years ago +19

    Slides: www.slideshare.net/SebastienFischman/tab-netpresentation
    GitHub: github.com/dreamquark-ai/tabnet
    Thank you Sebastien for the great Talk!

  • @jacquepang  4 months ago +1

    2:55 TabNet paper introduction
    4:20 Main ideas from TabNet
    7:21 Architecture
    8:55 Feature transformer block
    10:51 Attentive transformer block
    14:25 Individual explainability intro
    15:10 Self-supervised learning (pre-training)
    17:10 PyTorch implementation intro (19:18 fastai wrapper available)
    20:59 Demo from a notebook
    29:34 Kaggle competition notebooks using pytorch-tabnet
    29:55 Code base architecture
    32:18 Tricky implementation tips!
    34:36 Future work
    40:52 Q&A session
    41:09 Explainability
    42:30 Computing resources
    43:50 TabNet parameters explained
    47:55 Feature selection (from sparse masks)

  • @ritamshome  4 years ago +2

    A genuinely in-depth session, and Sebastien answered most of the queries. Great work!

  • @risabb  4 years ago +2

    This is the best Talk session! I learnt a lot, and it was a great explanation. Thanks Abhishek and Sebastien!

  • @memories2692  3 years ago

    Thanks so much, guys! It's a perfect architecture (and lecturer). I implemented it easily in a couple of days, and it works great!

  • @FrankHerfert  4 years ago +1

    This is great! Thank you both.

  • @solomonadeyemi53  a year ago

    Hi from South Africa... I have been using TabNet for 2 years now in RStudio... it works very well... I will give pytorch-tabnet a trial.

  • @abhishekkrthakur  4 years ago +3

    To give a talk in Talks, fill out this form here: bit.ly/AbhishekTalks

    • @davidvictor7124  4 years ago +1

      Can you please post the link to the code in the description?

    • @sebastienfischman8671  4 years ago +1

      @@davidvictor7124 All the code is available here: github.com/dreamquark-ai/tabnet
      I'll also add all the links and the presentation to that same page, so it is the place to go for any information!

  • @nirjharyou  4 years ago +1

    Thank you so much, Abhishek, for this. I am also extremely happy to see my kernel and my name in your video, even if only for a flash :)

  • @matteomele3303  a year ago

    Thank you, excellent work from both of you!

  • @AIPlayerrrr  4 years ago +11

    After watching this video, I jumped right into implementing it on some Kaggle competitions and in my research. LGB still works better than TabNet in most of my experiments. Pytorch-tabnet is really user-friendly, though, if you are new to deep learning for tabular data.

    • @user-yl5em5kg2m  3 years ago +1

      Hi, Tony. Do you know by how much LGB performs better than TabNet, and on what kinds of tasks it wins? Did you tune TabNet's parameters?

  • @sayedathar2507  2 years ago

    Amazing talk, thanks for sharing; your channel is the best :)

  • @ParsiadAzimzadeh  3 years ago

    Great talk.
    You mentioned being uncertain about the origin of the sqrt(0.5) factor. I believe the authors use it because, given two IID random variables X and Y,
    Var(sqrt(0.5) X + sqrt(0.5) Y) = 0.5 Var(X) + 0.5 Var(Y) = Var(X).
    In the context of the GLU summation, it is a heuristic to ensure that the variance does not increase.
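
    A quick numeric check of this argument (a sketch; the unit-variance normals are illustrative):

    import numpy as np

    rng = np.random.default_rng(0)
    x = rng.normal(size=1_000_000)  # IID samples of X
    y = rng.normal(size=1_000_000)  # IID samples of Y
    z = np.sqrt(0.5) * x + np.sqrt(0.5) * y
    print(x.var(), z.var())  # both ~1.0, so the variance does not grow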

  • @aditya_01  a year ago

    You are doing really great; thanks a lot for such awesome content.

  • @deepaksadulla8974  4 years ago

    Really good explanations...

  • @JaskaranSingh-hp3zy  4 years ago +1

    Great Session

  • @vslaykovsky  a year ago +1

    9:17 should be "element-wise multiplication", I guess
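
    For context, the multiplication in question is the GLU gate inside the feature transformer block. A minimal PyTorch sketch following the paper's description (a hypothetical GLUBlock, not the library's exact code):

    import torch
    import torch.nn as nn

    class GLUBlock(nn.Module):
        """FC output is split in two halves; one half gates the other
        via element-wise multiplication, as in the TabNet paper."""
        def __init__(self, input_dim: int, output_dim: int):
            super().__init__()
            self.fc = nn.Linear(input_dim, 2 * output_dim)
            self.bn = nn.BatchNorm1d(2 * output_dim)

        def forward(self, x: torch.Tensor) -> torch.Tensor:
            h = self.bn(self.fc(x))
            out, gate = h.chunk(2, dim=-1)
            return out * torch.sigmoid(gate)  # element-wise multiplication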

  • @tempdeltavalue  a year ago +2

    It's strange that the authors call these "transformers" because (if I understand correctly) no attention in the usual sense, with Q/K/V matrices, is used here.

    • @jacquepang  4 months ago

      I have the same confusion. Do you have a clue?
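
      For reference, the paper's "attentive transformer" is a learned feature-selection mask (FC → BN → scale by the prior → sparsemax) rather than Q/K/V self-attention, which is why the naming is confusing. A rough sketch under that reading (layer shapes illustrative, not the library's exact code):

      import torch
      import torch.nn as nn

      class AttentiveTransformer(nn.Module):
          """Produces a per-feature mask; no Q/K/V matrices anywhere."""
          def __init__(self, attn_dim: int, n_features: int):
              super().__init__()
              self.fc = nn.Linear(attn_dim, n_features)
              self.bn = nn.BatchNorm1d(n_features)

          def forward(self, a: torch.Tensor, prior: torch.Tensor) -> torch.Tensor:
              # prior down-weights features already used in earlier decision steps
              logits = self.bn(self.fc(a)) * prior
              # the paper uses sparsemax for a sparse mask; softmax keeps
              # this sketch dependency-free
              return torch.softmax(logits, dim=-1)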

  • @oculustech1904  4 years ago

    Great, thanks Abhishek and Sebastien!!! You mentioned a copy of the book; how can I get that? Please share the link.

  • @shrikantnarayankar4778

    Hi Abhishek, I was trying to buy your book but the link said it will be available on 15 July. How can I buy it today? ...You held a session with Krish...

  • @razzor_hero  4 years ago

    Hey, do you know how to monitor and fit TabNet based on a metric other than accuracy, say roc_auc_score? I tried looking for this in the GitHub repo but couldn't find it :/

    • @sebastienfischman8671  4 years ago +1

      Default monitoring for binary classification is already roc_auc_score; for multi-class it's accuracy; for regression it's MSE. An easy way of changing the early-stopping metric still needs to be added!
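
      Recent pytorch-tabnet releases did add an eval_metric argument to fit(); a sketch assuming such a version (X_train etc. are placeholder numpy arrays):

      from pytorch_tabnet.tab_model import TabNetClassifier

      clf = TabNetClassifier()
      clf.fit(
          X_train, y_train,
          eval_set=[(X_valid, y_valid)],
          eval_metric=['auc'],  # the last metric listed drives early stopping
          patience=10,
      )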

    • @manelallani4746  3 years ago

      @@sebastienfischman8671 Is it possible now to use a customized loss function?
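
      Recent releases also accept a loss_fn callable in fit(); a sketch assuming such a version, with a class-weighted cross-entropy as the hypothetical custom loss:

      import torch
      from pytorch_tabnet.tab_model import TabNetClassifier

      # Any callable with the usual PyTorch (y_pred, y_true) signature works.
      weighted_ce = torch.nn.CrossEntropyLoss(weight=torch.tensor([1.0, 5.0]))

      clf = TabNetClassifier()
      clf.fit(X_train, y_train, eval_set=[(X_valid, y_valid)], loss_fn=weighted_ce)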

  • @consistentthoughts826  3 years ago

    I applied this to the Santander classification Kaggle dataset and got 81% accuracy without any preprocessing.

  • @mahery_ranaivoson  4 years ago

    Where can I get the notebooks?