Top 5 Statistics Concepts in Data Science Interviews: P-value, Confidence Interval, Power, Errors

  • Added on 28 June 2024
  • Top 5 Statistics Concepts in Data Science Interviews
    In this video, we will talk about the top 5 statistics concepts in Data Science interviews. I will show you how to explain those concepts to both technical and non-technical audiences.
    Typos
    10:09 "hull" hypothesis should be "null" hypothesis
    🟢Get all my free data science interview resources
    www.emmading.com/resources
    🟡 Product Case Interview Cheatsheet www.emmading.com/product-case...
    🟠 Statistics Interview Cheatsheet www.emmading.com/statistics-i...
    🟣 Behavioral Interview Cheatsheet www.emmading.com/behavioral-i...
    🔵 Data Science Resume Checklist www.emmading.com/data-science...
    ✅ We work with Experienced Data Scientists to help them land their next dream jobs. Apply now: www.emmading.com/coaching
    // Comment
    Got any questions? Something to add?
    Write a comment below to chat.
    // Let's connect on LinkedIn:
    / emmading001
    ====================
    Contents of this video:
    ====================
    0:00 Intro
    1:27 Structure your answer for technical audience
    2:08 Structure your answer for non-technical audience
    3:04 Power, Type I error, Type II error (for technical audience)
    5:15 Power, Type I error, Type II error (for non-technical audience)
    6:17 Confidence interval (for technical audience)
    8:33 Confidence interval (for non-technical audience)
    9:20 P value (for technical audience)
    11:29 P value (for non-technical audience)

Comments • 84

  • @songxiyou2347
    @songxiyou2347 3 years ago +95

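    Only when reviewing on my own did I realize that Emma has truly digested all of this material and organized it into her own system. Whether it's product sense or stats, it's all substance and very organized. Not a single wasted word (comparing against recordings of my own answers, I found a pile of filler, haha). Very grateful that the industry has a guide like this. Still looking forward to product sense case analyses, stat & probability exam points, summaries of take-home & presentation approaches, and other DS-related content! Happy New Year, Emma! Wishing you good health, smooth work, and all the best in the new year!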

  • @goodjuju2132
    @goodjuju2132 3 years ago +1

    Emma thank you so much for all of your quality content!! You're doing so much for the community

  • @zihenglin5294
    @zihenglin5294 3 years ago +5

    Thought I already knew those stats concepts but still learned a lot from your video. The tips for technical and non-technical audiences are very helpful! Thanks Emma. Love your content!

  • @insigh01
    @insigh01 3 years ago

    The way you structure your response is concise, and it makes it easy to understand these concepts. Thank you Emma!

  • @weiyang2116
    @weiyang2116 3 years ago

    Yay! Exactly what I was looking for! Thanks Emma

  • @josephjoestar995
    @josephjoestar995 2 years ago

    So glad I came across this goldmine of a channel, honestly such great relevant topics with the most useful explanations - I trust you 100% to help with my interviews haha

  • @ishitasadhukhan1
    @ishitasadhukhan1 2 years ago

    Amazing videos Emma ! I am preparing for data science interviews and feel so lucky and grateful that I found your channel ! I am making it a point to follow your advice to the words ! Thank you so much for what you are sharing with us!

  • @thudang2597
    @thudang2597 2 years ago +1

    This is amazing Emma! Thank you so much for such great content. I'm prepping for DS intern interview and your videos literally save me

  • @katekatebangbang2435
    @katekatebangbang2435 3 years ago

    As someone currently interviewing, I've watched Emma's videos over and over, and I learn something new every time. Thank you, Emma

  • @yinqiu6780
    @yinqiu6780 3 years ago +1

    So well explained! Thank you Emma!

  • @281019641
    @281019641 3 years ago

    Thanks Emma. Very clear description and helpful to see the categorization accordingly for technical and non-technical audience.

  • @sitongchen6688
    @sitongchen6688 3 years ago +1

    This is super clear, and now I have a good sense or expectation from the interviewer! Thanks Emma!

  • @Nancy-wr7zb
    @Nancy-wr7zb 3 years ago

    Great video Emma !! Technical vs non technical explanations were very impressive !!

  • @jingyou3481
    @jingyou3481 3 years ago +1

    This is really great. I've been thinking about how to explain p value to non-technical person and find a great example for a while. This is definitely very clear! Hope you can continue to make some videos for stats concept like Simpson Paradox etc

  • @yuanliu2496
    @yuanliu2496 3 years ago +1

    I came across your video and it turns out to be super helpful! Thank you! subscribed.

  • @fengzhoupan771
    @fengzhoupan771 1 year ago

    Love the video! Thank you so much for the tips!

  • @user-kq3qv5mv4y
    @user-kq3qv5mv4y 1 year ago +1

    Super useful. One of the best DS videos I have ever seen !

  • @mihirbosemj
    @mihirbosemj 3 years ago

    The content you publish is so helpful for us to learn data science and prepare for interviews.
    Keep up the great work, and all the best :-)

  • @jeoffleonora4612
    @jeoffleonora4612 3 years ago

    Well explained. Thank you!

  • @taozhang7696
    @taozhang7696 3 years ago

    thank you. it's really helpful!

  • @Sethsm1
    @Sethsm1 2 years ago

    Extremely helpful. Thank you.

  • @DataProfessor
    @DataProfessor 3 years ago

    Thanks Emma! Awesome video also for practicing data scientists, it’s a great video to brush up on our stats knowledge 😆

  • @hameddadgour
    @hameddadgour 1 year ago

    Great content!

  • @mussdroid
    @mussdroid 3 years ago +4

    We are going to moon on Data Science 🚀🚀🚀🚀 🌜🌜🌜 ! Thanks Emma

  • @alifiaz7792
    @alifiaz7792 3 years ago

    Very intuitive video. Please also consider making a video explaining the metrics for regression, classification and clustering machine learning models from both technical and business perspective.

  • @nisithaukkarapattanakul8860

    Very clear explanation, thanks

  • @guimaraesalysson
    @guimaraesalysson 2 years ago

    Great video, helps a lot

  • @shauniktaneja4733
    @shauniktaneja4733 3 years ago

    Thank you so much!

  • @wongkitlongmarcus9310
    @wongkitlongmarcus9310 3 months ago

    thank you Emma

  • @jayzune1752
    @jayzune1752 1 year ago

    Wooo, smart and elegant lady! Thanks for your video, helped me a lot!

  • @spotting_experiment
    @spotting_experiment 2 years ago

    Landed here preparing for my upcoming interview and this is very useful as a revision material as well.

  • @jaden2582
    @jaden2582 2 years ago

    NO one word of bullshit. Appreciate it, Emma.

  • @hehuang3536
    @hehuang3536 2 years ago

    Hi Emma, I have watched a lot of videos you made and they are super clear and helpful for preparing my DS interviews. Thank you so much!

    • @emma_ding
      @emma_ding 2 years ago

      Hey, I'm so happy to hear that my videos have been helpful. Best of luck with your interviews!

  • @yenliknurasheva6322
    @yenliknurasheva6322 1 year ago

    I am very grateful for your useful videos! Great content! You are so smart and beautiful! 😇 Also preparing for DS interview, these videos help a lot!!!

  • @chengqian5737
    @chengqian5737 2 years ago

    A big thumbs-up for you!

  • @qingchuanlyu4605
    @qingchuanlyu4605 3 years ago +1

    This is really helpful. Now I know where my mistakes were!

  • @michellewww8036
    @michellewww8036 2 years ago

    Like it!!!!!

  • @yogiHalim
    @yogiHalim 1 year ago

    Significance (p-value 80%) is the probability of correctly [rejecting the null hypothesis while it is false.].
    (probability of not testing positive pregnancy for male)
    for 3 or more outcome, [testing negative] >< [not testing positive].
    Significance is thus the probability of Type I error, whereas 1−power is the probability of Type II error.

  • @aliciama1745
    @aliciama1745 3 years ago

    really helpful! Thank you very much for doing this! Emma, can you introduce *how to do a project* for people who want to transfer to data science from other unrelated fields? Appreciate it ahead of time!

    • @emma_ding
      @emma_ding 3 years ago

      For learning purposes, Kaggle is a really good place to start. For "real-life" projects, you have to look for opportunities in side projects or in your current position.

  • @niveditakumari701
    @niveditakumari701 2 years ago

    Thank you for the video, can you please share another example for p-value in the layman's term?

  • @jiayiwu4101
    @jiayiwu4101 3 years ago

    Wow, super cool summary! Really practical! Thanks Emma. Would you mind sharing slides or text then?

    • @emma_ding
      @emma_ding 3 years ago

      Sorry there are no slides. It's part of the video editing.

  • @liumx31
    @liumx31 2 years ago

    Hi Emma, thanks for the great explanation, one question though -- how is power used to determine the sample size? I thought the sample size determined the power, i.e. the larger the sample size the higher the statistical power.
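
Both directions in the question above hold: for a fixed effect size and significance level, a larger sample gives higher power, and the same relationship can be inverted to find the sample size that reaches a target power. The following is a minimal sketch of that inversion, assuming a two-sided, two-sample z-test with equal group sizes and a known common standard deviation. The function names and the inputs (effect 0.5, standard deviation 1.0, alpha 0.05, power 0.80) are illustrative assumptions and do not come from the video.

```python
from math import ceil, sqrt
from statistics import NormalDist

STD_NORMAL = NormalDist()  # standard normal distribution

def sample_size_per_group(effect: float, std_dev: float,
                          alpha: float = 0.05, power: float = 0.80) -> int:
    """Smallest n per group that reaches the target power (normal approximation)."""
    z_alpha = STD_NORMAL.inv_cdf(1 - alpha / 2)  # two-sided critical value
    z_power = STD_NORMAL.inv_cdf(power)          # quantile for the target power
    return ceil(2 * ((z_alpha + z_power) * std_dev / effect) ** 2)

def achieved_power(n: int, effect: float, std_dev: float,
                   alpha: float = 0.05) -> float:
    """Approximate power at n per group, ignoring the negligible opposite tail."""
    z_alpha = STD_NORMAL.inv_cdf(1 - alpha / 2)
    standard_error = std_dev * sqrt(2 / n)       # SE of the difference in means
    return STD_NORMAL.cdf(effect / standard_error - z_alpha)

# Hypothetical inputs: detect a difference of 0.5 when the standard deviation is 1.0.
n_required = sample_size_per_group(effect=0.5, std_dev=1.0)
print(n_required, round(achieved_power(n_required, 0.5, 1.0), 3))

# Power grows with sample size when everything else is held fixed.
for n in (20, 50, 100, 200):
    print(n, round(achieved_power(n, 0.5, 1.0), 3))
```

In practice the target power (commonly 0.80) is chosen first and the sample size is solved for, which is the sense in which power "determines" the sample size.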

  • @anathemaconscience5666
    @anathemaconscience5666 2 years ago +1

    hi emma, i am kind of confused to the p value. At 10:33 you mentioned small p, more convinced of difference. But at 11:22, you said p value represents there is a diff given null hypo is true, meaning higher p, more convinced of difference. But given the height example, i believe small p larger difference, so at 11:22, why would you say p means there is a diff given null hypo is true?

  • @kuifeiliu3203
    @kuifeiliu3203 3 years ago

    good explanation! better to put non-technical part first

  • @amitkhandelwal2999
    @amitkhandelwal2999 3 years ago

    Great Video. It would be great if you can also provide the info on how to deal with these concepts in practical scenario. I mean to say, how to increase power of test. How to decrease FP / FN / countereffects. That will give a complete end to end picture while dealing with them when someone encountered in such problems while implementing these things in practice. Loved all other videos which I have seen till today in your channel.

  • @waliatv
    @waliatv 1 year ago

    Very informative and helpful ❤

    • @emma_ding
      @emma_ding 1 year ago +1

      So happy to be of assistance, Mrinal! 😊

    • @waliatv
      @waliatv 1 year ago

      @@emma_ding just finished my data scientist internship interview and it went very, very well. Thank you for such amazing content. It was very helpful for a last-minute brush-up of key skills, and I am hoping for positive results from my interviewer 🤞✨

    • @emma_ding
      @emma_ding 1 year ago +1

      That's fantastic to hear, Mrinal! Feel free to keep me posted with how your results go. Fingers crossed, and sending you good luck! 💛

    • @waliatv
      @waliatv 1 year ago

      @@emma_ding Thank you so much for the good wishes, and all your hard work on the videos was worth it because we benefited from them a lot.
      Also, I would like to share that I have accepted the Data Scientist Internship with Loblaws Companies in Toronto, Canada, for the coming Winter of 2023.
      I am so excited and obliged to start my new journey in Data Science. It was difficult but with consistent hard work and good resources such as your channel, I am now going to follow my dream career.
      Thank you once again for all the good work, and keep posting such insights and helpful resources on DS, as they will still help me during my professional career.

    • @emma_ding
      @emma_ding 1 year ago +1

      Mrinal! This is fantastic news! Thank you for sharing this huge win with me, and congratulations on your new role. I can't wait to hear what else is in store for you in the future. Sending you all the best! 🥳

  • @Mackymon
    @Mackymon 3 years ago +1

    Great Vid! Follow up question: how do you get a feel for how technical your audience actually is?

    • @emma_ding
      @emma_ding 3 years ago

      Look at their public profile like LinkedIn :)

    • @hehuang3536
      @hehuang3536 2 years ago +1

      In one of my technical interviews, the interviewer asked me how do you explain the concept to your grandma?

  • @yogiHalim
    @yogiHalim 1 year ago

    95% confidence interval shows 95% from the center of a normal distribution population is represented.
    ie: 5% outliers are not represented by the equation

  • @muse3324
    @muse3324 2 months ago

    1:41 "It should not be obscure like what you see in Wikipedia" 😅😁😁

  • @LouisChiaki
    @LouisChiaki 3 years ago

    A comment on the confidence interval, I think your interpretation (and a lot of data analyst) is from Frequentist's point of views. For Bayesian, there is no fixed true value.

  • @plttji2615
    @plttji2615 2 years ago

    What if N increase, does it affect P-value?

  • @InoHimeYa
    @InoHimeYa 2 years ago

    13 mins saves me at least 3 hours

  • @bhageerathbogi4951
    @bhageerathbogi4951 3 years ago +1

    Hi Emma, Can you please share a link to the slides.

    • @emma_ding
      @emma_ding 3 years ago +1

      Sorry, there's no slides, it's all part of the video editing. But I'll definitely consider providing it in the future if it helps!

  • @nikhilmuthukrishnan7222

    You think your thumbnails are so cute!!! Well they are

  • @jennywu799
    @jennywu799 3 years ago +1

    Emma, could you make a video summarizing the commonly used distributions? Sometimes in interviews I get asked what kind of distribution sales data follows, and every time I answer normal...

  • @robertwilsoniii2048
    @robertwilsoniii2048 10 months ago

    This is really basic... how do jobs require multiple years of experience when these interview questions are just basic thing you learn in an intro stats class... ???

  • @mussdroid
    @mussdroid 3 years ago

    #datascience

  • @jaden2582
    @jaden2582 2 years ago +1

    could you explain the "AT LEAST as extreme as the data is actually observed" in the definition of the p value?

    • @emma_ding
      @emma_ding 2 years ago +2

      Hey, so here is an example: suppose you are testing whether the means of two populations are the same, so your null hypothesis is that the two means are equal. Now you have observed data showing that the difference is 1. "AT LEAST as extreme as the data is actually observed" means the difference is 1 or larger. 1 is the observed value, and "at least as extreme" means that 1 is the minimum difference counted. I hope this helps!

    • @jaden2582
      @jaden2582 2 years ago

      @@emma_ding Thank you for this clear explanation!
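
The "difference of 1 or larger" reading in the reply above can be turned into a number. This is a minimal sketch, assuming the difference in means is approximately normal under the null hypothesis; the standard error of 0.5 is a made-up value for illustration, and the helper name `p_value` is hypothetical. The one-sided version counts only differences of 1 or more, while the two-sided version also counts differences of -1 or less as equally extreme.

```python
from statistics import NormalDist

def p_value(observed_diff: float, standard_error: float,
            two_sided: bool = True) -> float:
    """P(a difference at least as extreme as the observed one | no true difference)."""
    z = abs(observed_diff) / standard_error   # observed difference in SE units
    upper_tail = 1 - NormalDist().cdf(z)      # P(Z >= z) under the null
    return 2 * upper_tail if two_sided else upper_tail

# Observed difference of 1, as in the reply above; the standard error is assumed.
p_one = p_value(1.0, 0.5, two_sided=False)
p_two = p_value(1.0, 0.5)
print(f"one-sided: {p_one:.4f}, two-sided: {p_two:.4f}")
```

A larger observed difference (or a smaller standard error) pushes `z` further into the tail, so the p-value shrinks.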

  • @djjiang3718
    @djjiang3718 3 years ago +3

    What a beautiful lady with high-quality content!

  • @mahdimerced
    @mahdimerced 3 years ago

    Why did you delete most of the previous movies?

    • @emma_ding
      @emma_ding 3 years ago

      You can find all my videos under the VIDEOS tab on my channel page. I changed the thumbnails of some videos a few weeks ago. :)

  • @LouisChiaki
    @LouisChiaki 3 years ago

    10:09 some typo on the slides. Should be "null" not "hull" hypothesis :D

    • @emma_ding
      @emma_ding 3 years ago

      Thanks for catching the typos!

  • @brothermalcolm
    @brothermalcolm 3 years ago

    Non-technical audience!

  • @Galax224
    @Galax224 3 years ago

    Hi Emma I suggest you name your channel so every time you introduce you can say welcome to !@$!@#$!@#~!# instead of my channel and it's unique to impress people.

  • @poopah4497
    @poopah4497 2 years ago

    the higher CL -> wider c.I? Is that a typo? I thought the opposite

    • @emma_ding
      @emma_ding 2 years ago

      Hey Ruiruo! It's not a typo, the higher CL, the wider the CI, because increasing the confidence will increase the margin of error resulting in a wider interval.
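
The point in the reply above, that a higher confidence level widens the interval, can be checked numerically. This is a minimal sketch using the normal approximation for a sample mean; the sample standard deviation (10) and sample size (100) are made-up inputs, and `ci_half_width` is a hypothetical helper name.

```python
from statistics import NormalDist

def ci_half_width(std_dev: float, n: int, confidence: float) -> float:
    """Margin of error for a sample mean under the normal approximation."""
    z = NormalDist().inv_cdf(0.5 + confidence / 2)  # two-sided critical value
    return z * std_dev / n ** 0.5

# Hypothetical inputs: sample standard deviation 10, sample size 100.
widths = {level: 2 * ci_half_width(10.0, 100, level)
          for level in (0.90, 0.95, 0.99)}
for level, width in widths.items():
    print(f"{level:.0%} confidence level -> interval width {width:.2f}")
```

Only the critical value `z` changes with the confidence level, so with the same data the 99% interval is always wider than the 95% interval, which is wider than the 90% one.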

  • @davidwarner1248
    @davidwarner1248 2 months ago +1

    Such a poor pronunciation