Top 5 Statistics Concepts in Data Science Interviews: P-value, Confidence Interval, Power, Errors
Vložit
- čas přidán 28. 06. 2024
- Top 5 Statistics Concepts in Data Science Interviews
In this video, we will talk about the top 5 statistics concepts in Data Science interviews. I will show you how to explain those concept to both technical and non-technical audiences.
Typos
10:09 "hull" hypothesis should be "null" hypothesis
🟢Get all my free data science interview resources
www.emmading.com/resources
🟡 Product Case Interview Cheatsheet www.emmading.com/product-case...
🟠 Statistics Interview Cheatsheet www.emmading.com/statistics-i...
🟣 Behavioral Interview Cheatsheet www.emmading.com/behavioral-i...
🔵 Data Science Resume Checklist www.emmading.com/data-science...
✅ We work with Experienced Data Scientists to help them land their next dream jobs. Apply now: www.emmading.com/coaching
// Comment
Got any questions? Something to add?
Write a comment below to chat.
// Let's connect on LinkedIn:
/ emmading001
====================
Contents of this video:
====================
0:00 Intro
1:27 Structure your answer for technical audience
2:08 Structure your answer for non-technical audience
3:04 Power, Type I error, Type II error (for technical audience)
5:15 Power, Type I error, Type II error (for non-technical audience)
6:17 Confidence interval (for technical audience)
8:33 Confidence interval (for non-technical audience)
9:20 P value (for technical audience)
11:29 P value (for non-technical audience)
自己复习才发现,Emma真是将这些内容完全吃透,整理成自己的体系。不管是product sense还是stat,全部是干货并且非常organized。多余的废话一句没有(对比我自己的录音回答发现了一堆废话hhh)。非常感谢行业内有这样的领路人。继续期待product sense实例分析/stat & probablity 考点/take home & presentation思路总结和其他DS相关内容!Emma 新年快乐!新的一年身体健康,工作顺利,万事如意!
Emma thank you so much for all of your quality content!! You're doing so much for the community
Thought I already known those stats concepts but still learned a lot from your video. The tips for technical and non-technical audience are very helpful! Thanks Emma. Love your content!
The way you structure your response is concise, and it makes it easy to understand these concepts. Thank you Emma!
Yay! Exactly what I was looking for! Thanks Emma
So glad I came across this goldmine of a channel, honestly such great relevant topics with the most useful explanations - I trust you 100% to help with my interviews haha
Amazing videos Emma ! I am preparing for data science interviews and feel so lucky and grateful that I found your channel ! I am making it a point to follow your advice to the words ! Thank you so much for what you are sharing with us!
This is amazing Emma! Thank you so much for such great content. I'm prepping for DS intern interview and your videos literally save me
作为一个在面试的人,来回来去看了好多次emma的视频了,常看常新。谢谢Emma
So well explained! Thank you Emma!
Thanks Emma. Very clear description and helpful to see the categorization accordingly for technical and non-technical audience.
This is super clear, and now I have a good sense or expectation from the interviewer! Thanks Emma!
Great video Emma !! Technical vs non technical explanations were very impressive !!
This is really great. I've been thinking about how to explain p value to non-technical person and find a great example for a while. This is definitely very clear! Hope you can continue to make some videos for stats concept like Simpson Paradox etc
I came across your video and it turns out to be super helpful! Thank you! subscribed.
Love the video! Thank you so much for the tips!
Super useful. One of the best DS videos I have ever seen !
The content you publish is so helpful for us to learn data science and prepare for interviews.
Keep up the great work, and all the best :-)
Well explained. Thank you!
thank you. it's really helpful!
Extremely helpful. Thank you.
Thanks Emma! Awesome video also for practicing data scientists, it’s a great video to brush up on our stats knowledge 😆
Thank you Data Professor!
Great content!
We are going to moon on Data Science 🚀🚀🚀🚀 🌜🌜🌜 ! Thanks Emma
Very intuitive video. Please also consider making a video explaining the metrics for regression, classification and clustering machine learning models from both technical and business perspective.
Very clear explanation, thanks
Great video, helps a lot
Thank you so much!
thank you Emma
Wooo, smart and elegant lady! Thanks for your video, helped me a lot!
Landed here preparing for my upcoming interview and this is very useful as a revision material as well.
NO one word of bullshit. Appreciate it, Emma.
Hi Emma, I have watched a lot of videos you made and they are super clear and helpful for preparing my DS interviews. Thank you so much!
Hey, I'm so happy to hear that my videos have been helpful. Best of luck with your interviews!
I am very grateful for your useful videos! Great content! You are so smart and beautiful! 😇 Also preparing for DS interview, these videos help a lot!!!
给你一个大大的赞!
This is really helpful. Now I know where my mistakes were!
Like it!!!!!
Significance (p-value 80%) is the probability of correctly [rejecting the null hypothesis while it is false.].
(probability of not testing positive pregnancy for male)
for 3 or more outcome, [testing negative] >< [not testing positive].
Significance is thus the probability of Type I error, whereas 1−power is the probability of Type II error.
really helpful! Thank you very much for do this! Emma, can you introduce * how to do a project* for the people who want to transfer to data science from other unrelated fields? Appreciate ahead of time!
For learning purpose, Kaggle is a really place to start. For "real-life" projects, you have to look for opportunities of side projects or in your current position.
Thank you for the video, can you please share another example for p-value in the layman's term?
Wow, super cool summary! Really practical! Thanks Emma. Would you mind sharing slides or text then?
Sorry there are no slides. It's part of the video editing.
Hi Emma, thanks for the great explanation, one question though -- how is power used to determine the sample size? I thought the sample size determined the power, i.e. the larger the sample size the higher the statistical power.
hi emma, i am kind of confused to the p value. At 10:33 you mentioned small p, more convinced of difference. But at 11:22, you said p value represents there is a diff given null hypo is true, meaning higher p, more convinced of difference. But given the height example, i believe small p larger difference, so at 11:22, why would you say p means there is a diff given null hypo is true?
good explanation! better to put non-technical part first
Great Video. It would be great if you can also provide the info on how to deal with these concepts in practical scenario. I mean to say, how to increase power of test. How to decrease FP / FN / countereffects. That will give a complete end to end picture while dealing with them when someone encountered in such problems while implementing these things in practice. Loved all other videos which I have seen till today in your channel.
Very informative and helpful ❤
So happy to be of assistance, Mrinal! 😊
@@emma_ding just ended up with my data scientist internship interview and it was very very good. Thankyou for such amazing content. It was very helpful for last minute brushup of key skills and i am hoping for positive results from my interviewer 🤞✨
That's fantastic to hear, Mrinal! Feel free to keep me posted with how your results go. Fingers crossed, and sending you good luck! 💛
@@emma_dingThankyou so much for the good wishes and all your hard work in videos was worth it because we benefited from them a lot.
Also, I would like to share that I have accepted the Data Scientist Internship with Loblaws Companies in Toronto, Canada, for the coming Winter of 2023.
I am so excited and obliged to start my new journey in Data Science. It was difficult but with consistent hard work and good resources such as your channel, I am now going to follow my dream career.
Thankyou once again for all good work and keep posting such insights and helpful resources on DS, as it will still help me during my professional career.
Mrinal! This is fantastic news! Thank you for sharing this huge win with me, and congratulations on your new role. I can't wait to hear what else is in store for you in the future. Sending you all the best! 🥳
Great Vid! Follow up question: how do you get a feel for how technical your audience actually is?
Look at their public profile like LinkedIn :)
In one of my technical interviews, the interviewer asked me how do you explain the concept to your grandma?
95% confidence interval shows 95% from the center of a normal distribution population is represented.
ie: 5% outliers are not represented by the equation
1:41 "It should not be obscure like what you see in Wikipedia" 😅😁😁
A comment on the confidence interval, I think your interpretation (and a lot of data analyst) is from Frequentist's point of views. For Bayesian, there is no fixed true value.
What if N increase, does it affect P-value?
13 mins saves me at least 3 hours
Hi Emma, Can you please share a link to the slides.
Sorry, there's no slides, it's all part of the video editing. But I'll definitely consider providing it in the future if it helps!
You think your thumbnails are so cute!!! Well they are
Emma, 可以不可以出一个视频总结一下常用的distribution,有的时候面试的时候被问到sales data是什么样的distribution,我每次都答normal。。。
poisson distribution
This is really basic... how do jobs require multiple years of experience when these interview questions are just basic thing you learn in an intro stats class... ???
#datascience
could you explain the "AT LEAST as extreme as the data is actually observed" in the definition of the p value?
Hey so an example would be when you are doing a test - if the means of two populations are the same, your null hypothesis is that those two are the same. Now you have observed data that shows that the difference is 1. “AT LEAST as extreme as the data is actually observed" means the difference is 1 or larger. 1 is the observed data and AT LEAST as extreme means that is the minimum difference. I hope this helps!
@@emma_ding Thank you for this clear explanation!
What a beautiful lady with high-quality content!
Why did you delete most of the previous movies?
You can find all my videos under the VIDEOS tab on my channel page. I changed the thumbnails of some videos a few weeks ago. :)
10:09 some typo on the slides. Should be "null" not "hull" hypothesis :D
Thanks for catching the typos!
Non-technical audience!
Hi Emma I suggest you name your channel so every time you introduce you can say welcome to !@$!@#$!@#~!# instead of my channel and it's unique to impress people.
the higher CL -> wider c.I? Is that a typo? I thought the opposite
Hey Ruiruo! It's not a typo, the higher CL, the wider the CI, because increasing the confidence will increase the margin of error resulting in a wider interval.
Such a poor pronunciation