Gradient Boosting : Data Science's Silver Bullet

Gradient Boost Machine Learning|How Gradient boost work in Machine Learning

Handling Imbalanced Dataset in Machine Learning: Easy Explanation for Data Science Interviews

Playing hide and seek with my dog 🐶

Who has won ?? 😀 #shortvideo #lizzyisaeva

irl stream in Czech Republic 🇨🇿

Gradient Boosting and XGBoost in Machine Learning: Easy Explanation for Data Science Interviews

Emma Ding

zhlédnutí 32 015

Přidat do
- Můj playlist
- Přehrát později
Sdílet

Sdílet

Vložit

Velikost videa:

Zobrazit ovladače přehrávání

Automatické přehrávání

Přehrát

čas přidán 21. 07. 2024
Questions about Gradient Boosting frequently appear in data science interviews. In this video, I cover what the Gradient Boosting method and XGBoost are, teach you how I would describe the architecture of gradient boosting, and go over some common pros and cons associated with gradient-boosted trees.
🟢Get all my free data science interview resources
www.emmading.com/resources
🟡 Product Case Interview Cheatsheet www.emmading.com/product-case...
🟠 Statistics Interview Cheatsheet www.emmading.com/statistics-i...
🟣 Behavioral Interview Cheatsheet www.emmading.com/behavioral-i...
🔵 Data Science Resume Checklist www.emmading.com/data-science...
✅ We work with Experienced Data Scientists to help them land their next dream jobs. Apply now: www.emmading.com/coaching
// Comment
Got any questions? Something to add?
Write a comment below to chat.
// Let's connect on LinkedIn:
/ emmading001
====================
Contents of this video:
====================
00:00 Introduction
01:01 Gradient Boosting
02:11 Gradient-boosted Trees
02:54 Algorithm
05:53 Hyperparameters
07:55 Pros and Cons
09:00 XGBoost

Komentáře • 27

@jennyhuang7603 Před rokem ⁺³
For 5:10, why the MSE delta r_i is Y-F(X) instead of 2*(Y-F(X))? or is the coefficent doesn't matter?
@anand3064 Před 6 měsíci ⁺³
Beautifully written notes
@aaronsayeb6566 Před 25 dny
there is a mistake in the representation of algorithm. the equation for ri, L(Y, F(X)), and grad ri = Y-F(X) can't hold true at the same time. I think ri= Y-F(X) and grade ri should be something else (right?)
@annialevko5771 Před 9 měsíci ⁺³
Hi! I have a question, how does the parallel tree building work? Because based in the gradient boosting it needs to calculate the error from the previous model in order to create the new one, so I dont really understand in which way is this parallelized
@shashizanje Před 4 měsíci ⁺¹
Its parallelized in such a way that , during formation of tree , it can work parallel....means it can work on multiple independent features parellaly to reduce the computation time....suppose if it has to find root node, it has to check information gain of every single independent feature and then decide which feature would be best for root node...so in this case instead of calculating information gain one by one, it can parallely calculate IG of multiple features....
@jet3111 Před rokem ⁺²
Thank you for the very informative video. It came up at my interview yesterday. I also got a question on time series forecasting and preventing data leakage. I think it would great to have a video about it.
@wallords Před 8 měsíci
How do you add L1 regularization to a tree???
@elvykamunyokomanunebo1441 Před rokem
Hi Emma,
I'm struggling to understand how to build a model on residuals:
1) Do I predict the residuals and then get the mse of the residuals?
What would be the point/use of that?
2) Do I somehow re-run the model considering some factor that
focuses on accounting for more of the variability e.g. adding more
features(important features) which reduce mse/residual?
Then re-running the model adding a new feature to account for
remaining residual until there is no more reduction in mse/residual?
@poshsims4016 Před rokem
Ask Chat GPT every question you just typed. Preferably GPT-4
@Heinz3792 Před 4 měsíci ⁺¹
It's important to understand what the residual is. The residual is a vector giving a magnitude of the prediction error AND the direction, i.e. the gradient. Thus, regarding your questions:
1) we predict the residual with a weak model, h, in order to know in what direction to move the prediction of the overall model F_i(X) so that it is reduced. We assume h makes a decent prediction, and thus we treat it like the gradient.
2) we then calculate alpha, the regulation parameter, in order to know HOW FAR to move in the direction of the gradient which h provides. I.e., how much weight to give model h. Minimizing the loss function gives us this value, and keeps us from over or undershooting the step size.
@kandiahchandrakumaran8521 Před 2 měsíci
Excellent video Many thanks.
Could you kindly make a video for time to event with survival SVM, RSF, or XGBLC?
@user-hq4ge6no3p Před 2 měsíci ⁺¹
An excellent video
@nihalnetha96 Před měsícem
is there a way to get the notion notes?
@zhenwang5872 Před rokem
I usually watch Emma's video when I doing revision.
@emmafan713 Před rokem ⁺⁴
I am confused about the notation, so h_i is a function to predict r_i and r_i is the gradient of the loss function w.r.t the last prediction F(X). so h_i should be similar to r_i why h_i is similar to gradient of r_i
@Heinz3792 Před 4 měsíci ⁺¹
I believe there is an error in this video. r_i is the gradient of the loss function w.r.t. the CURRENT F(X), i.e. F_i(X). The NEXT weak model h_i+1 is then trained to be able to predict r_i, the PREVIOUS residual. Alternatively all this could be written with i-1 instead of i, and i instead of i+1.
TLDR: Emma should have called the first step "compute residual r_i-1", not r_i. And in the gradient formula, she should have written r_i-1.
@Leo-xd9et Před rokem
Really like the way you use Notion!
@emma_ding Před rokem
Thanks for the feedback, Leo! I tried out a bunch of different presentation methods before this one, so I'm glad to hear you're finding this platform useful! 😊
@emma_ding Před rokem ⁺³
Many of you have asked me to share my presentation notes, and now… I have them for you! Download all the PDFs of my Notion pages at www.emmading.com/get-all-my-free-resources. Enjoy!
@SanuSatyam Před rokem
Thanks a lot. Can you please make a video on Time Series Analysis? Thanks in Advance!
@objectobjectobject4707 Před 3 měsíci
Okay subscribed !
@riswandaayu5930 Před 9 měsíci
Hallo Miss, thankyou for the knowledge, Miss can I request your file in this presentation ?
@PhucHoang-ng4vh Před 8 měsíci ⁺⁸
just read out loud, no explanation at all
@ermiaazarkhalili5586 Před rokem ⁺¹
Any chance to have slides?
@NguyenSon-ew9wn Před rokem ⁺¹
Agree. Hope to have that note
@emma_ding Před rokem ⁺¹
Yes! Download all the PDFs of my Notion pages at emmading.com/resources by navigating to the individual posts. Enjoy!
@faisalsal1 Před 5 měsíci ⁺²
She just read the text with zero knowledge about the content. U no good.

Další v pořadí

Automatické přehrávání

Gradient Boosting : Data Science's Silver Bullet

Gradient Boosting : Data Science's Silver Bullet

Gradient Boost Machine Learning|How Gradient boost work in Machine Learning

Gradient Boost Machine Learning|How Gradient boost work in Machine Learning

Handling Imbalanced Dataset in Machine Learning: Easy Explanation for Data Science Interviews

Handling Imbalanced Dataset in Machine Learning: Easy Explanation for Data Science Interviews

Playing hide and seek with my dog 🐶

Playing hide and seek with my dog 🐶

Who has won ?? 😀 #shortvideo #lizzyisaeva

Who has won ?? 😀 #shortvideo #lizzyisaeva

irl stream in Czech Republic 🇨🇿

irl stream in Czech Republic 🇨🇿

SÉGRA ŠÍLENĚ VYPRANKOVALA IKONA ?! 🤣 #shorts

SÉGRA ŠÍLENĚ VYPRANKOVALA IKONA ?! 🤣 #shorts

Ensemble (Boosting, Bagging, and Stacking) in Machine Learning: Easy Explanation for Data Scientists

Ensemble (Boosting, Bagging, and Stacking) in Machine Learning: Easy Explanation for Data Scientists

Data Science Project - RFM model

Data Science Project - RFM model

Feature Selection in Machine Learning: Easy Explanation for Data Science Interviews

Feature Selection in Machine Learning: Easy Explanation for Data Science Interviews

681: XGBoost: The Ultimate Classifier - with Matt Harrison

681: XGBoost: The Ultimate Classifier — with Matt Harrison

ML Was Hard Until I Learned These 5 Secrets!

ML Was Hard Until I Learned These 5 Secrets!

193 - What is XGBoost and is it really better than Random Forest and Deep Learning?

193 - What is XGBoost and is it really better than Random Forest and Deep Learning?

XGBoost Made Easy | Extreme Gradient Boosting | AWS SageMaker

XGBoost Made Easy | Extreme Gradient Boosting | AWS SageMaker

L1 and L2 Regularization in Machine Learning: Easy Explanation for Data Science Interviews

L1 and L2 Regularization in Machine Learning: Easy Explanation for Data Science Interviews

All Learning Algorithms Explained in 14 Minutes

All Learning Algorithms Explained in 14 Minutes

PŘEŽIL JSEM NOC V NEJLEVNĚJŠÍM HOTELU! (5KČ)

PŘEŽIL JSEM NOC V NEJLEVNĚJŠÍM HOTELU! (5KČ)

Lady Plays Hide and Seek with Her Dog

Lady Plays Hide and Seek with Her Dog

Koupil jsem Nejrychlejší Autíčko na Ovládání za 30 000 Kč!

Koupil jsem Nejrychlejší Autíčko na Ovládání za 30 000 Kč!

ZKOUŠIME MYSTERY OBJEDNÁVKU Z McDONALDU 😅

ZKOUŠIME MYSTERY OBJEDNÁVKU Z McDONALDU 😅

Káže vodu, pije tvoje nervy #komedie #sranda #emperkingvision #shorts

Káže vodu, pije tvoje nervy #komedie #sranda #emperkingvision #shorts

Mixér Challenge Poslepu!

Mixér Challenge Poslepu!

Slow motion boy #shorts by Tsuriki Show

Slow motion boy #shorts by Tsuriki Show

How Many Balloons Does It Take To Fly?

How Many Balloons Does It Take To Fly?