5 Concepts in Statistics You Should Know | Data Science Interview

  • Added July 13, 2024
  • 🚀 Land your dream data job using datainterview.com/.
    ====== ✅ Details ======
    Dan, formerly a data scientist at Google and PayPal, covers 5 fundamental statistics topics that candidates should review when preparing for data science interviews. These topics come up in business-case, statistics, and statistical-coding rounds. For more prep content, check out datainterview.com/
    👍 Make sure to subscribe, like and share!
    ====== ⏱️ Timestamps ======
    0:00 Intro
    00:51 Central Tendency
    05:05 Dispersion
    06:17 Correlation
    10:42 Normal Distribution
    12:53 Hypothesis Testing
    20:00 Other Concepts to Know
    20:41 Conclusion
    ====== 📚 Other Useful Contents ======
    1. Principles and Frameworks of Product Metrics | YouTube Case Study
    Link: / principles-and-framewo...
    2. How to Crack the Data Scientist Case Interview
    Link: / crack-the-data-scienti...
    3. How to Crack the Amazon Data Scientist Interview
    Link: / crack-the-amazon-data-...
    ====== Connect ======
    📗 LinkedIn - / danleedata
    📘 Medium - / datainterview
  • Science & Technology

Comments • 28

  • @WebsterLincoln
    @WebsterLincoln 2 years ago +9

    I would describe that as a positively skewed normal distribution, not an exponential distribution. Also, it's the 68-95-99.7 rule

  • @abdallahelmoctar7635
    @abdallahelmoctar7635 1 year ago +3

    Such a simple and straightforward refresher. I'm grateful for your work.

  • @mahmutozmen1261
    @mahmutozmen1261 2 years ago +2

    Thanks for such great content and for your effort. Would you mind explaining further why you think that mode = median? Since this looks like a positively skewed graph, I thought the mode is around 3, the median 4 or 5, and the mean between 6 and 10.
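The ordering this comment describes (mode below median below mean for a positively skewed distribution) can be checked on a small sample. The sketch below uses made-up numbers, not the data behind the video's chart, and only illustrates the typical pattern for a long right tail.

```python
import statistics

# Hypothetical right-skewed sample (e.g. hours per day); NOT the video's data.
hours = [1, 2, 2, 3, 3, 3, 4, 4, 5, 6, 8, 12]

mode = statistics.mode(hours)      # most frequent value
median = statistics.median(hours)  # middle of the sorted values
mean = statistics.mean(hours)      # pulled upward by the long right tail

print(f"mode={mode}, median={median}, mean={mean:.2f}")
# For this sample the three measures are ordered mode < median < mean.
```

A few large values move the mean the most, the median less, and the mode not at all, which is why the three measures separate under skew.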

  • @AllieZhao
    @AllieZhao 2 years ago

    These are crucial concepts. Thanks

  • @SaramaKamal
    @SaramaKamal 1 year ago

    Could you mention the tools used to design and present your slides? Thanks!!!

  • @shir0tei
    @shir0tei 2 years ago +2

    Thanks for the video! The correlation formula is wrong though: the covariance is the numerator divided by n.

  • @jacksun7999
    @jacksun7999 2 months ago

    6:43 should the numerator be cov(X,Y)? Seems there is a 1/(N-1) term missing.
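On the normalization question raised in the comments above: the slide's exact formula is not reproduced here, so this is only a sketch of the standard definition, r = cov(X, Y) / (s_X · s_Y). It shows that dividing by n or by n − 1 gives the same r, provided the same divisor is used in both the covariance and the standard deviations, because the factor cancels. The function name, the `ddof` parameter, and the data are illustrative assumptions.

```python
import math

def pearson_r(xs, ys, ddof=1):
    """Pearson correlation as cov(X, Y) / (s_X * s_Y).

    ddof=1 divides by n - 1 (sample), ddof=0 divides by n (population).
    """
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    cov = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys)) / (n - ddof)
    sd_x = math.sqrt(sum((x - mean_x) ** 2 for x in xs) / (n - ddof))
    sd_y = math.sqrt(sum((y - mean_y) ** 2 for y in ys) / (n - ddof))
    return cov / (sd_x * sd_y)

# Hypothetical data, chosen only so the arithmetic is easy to follow.
xs = [1, 2, 3, 4, 5]
ys = [2, 1, 4, 3, 5]

r_sample = pearson_r(xs, ys, ddof=1)      # covariance and SDs divided by n - 1
r_population = pearson_r(xs, ys, ddof=0)  # covariance and SDs divided by n
print(r_sample, r_population)             # the two agree up to floating-point error
```

So a formula that writes only the raw sums in the numerator and denominator is equivalent, as long as the denominator uses the matching sums of squares rather than already-normalized standard deviations.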

  • @dreamingaparisdream3178

    Also where is the link for Meta Statistical Interview questions video please?

  • @basmaelkhamlichi8223
    @basmaelkhamlichi8223 2 years ago +8

    Hypothesis testing and P value nicely explained, thank you!

  • @benxneo
    @benxneo 2 years ago +2

    Could you give me ideas for data science projects that deliver value to businesses?

  • @HarryPotter-st2cn
    @HarryPotter-st2cn 2 years ago

    Great content. Are non-normal distributions listed separately to put emphasis on them? I believe they would be included within the overall concept of distributions.

  • @anirbansarkar6306
    @anirbansarkar6306 10 months ago

    Can you help me understand on what basis you assumed the population standard deviation to be 20?

  • @jcokonkwo
    @jcokonkwo 2 years ago +4

    I definitely appreciate the explanation followed by the applied DS examples right after. Thank you!

  • @RedShipsofSpainAgain
    @RedShipsofSpainAgain 1 year ago +5

    11:16. I think you have a typo: The Normal distribution should be 68-95-99.7%, not 65-95-99.7%
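The 68-95-99.7 figures can be checked numerically from the standard normal CDF. This is a minimal sketch using Python's standard library; the helper name `coverage` is an illustrative choice.

```python
from statistics import NormalDist

standard_normal = NormalDist(mu=0, sigma=1)

def coverage(k):
    """Probability that a normal variable falls within k SDs of its mean."""
    return standard_normal.cdf(k) - standard_normal.cdf(-k)

for k in (1, 2, 3):
    print(f"within {k} SD: {coverage(k):.1%}")
# within 1 SD: 68.3%
# within 2 SD: 95.4%
# within 3 SD: 99.7%
```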

  • @stanislavdidenko8436
    @stanislavdidenko8436 1 year ago

    2pm - poisson distribution

  • @bandai2
    @bandai2 2 years ago

    could you also use Spearman Correlation if you have outliers in your data?
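As a rough illustration of why a rank-based measure is often suggested for this question: Spearman correlation is Pearson correlation computed on ranks, so a single extreme value cannot dominate it the way it can dominate Pearson. The sketch below uses hypothetical data and hand-written helpers (`pearson`, `ranks`, `spearman`); it is not taken from the video.

```python
import math

def pearson(xs, ys):
    """Pearson correlation from raw sums (the 1/n factors cancel)."""
    n = len(xs)
    mean_x, mean_y = sum(xs) / n, sum(ys) / n
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    sxx = sum((x - mean_x) ** 2 for x in xs)
    syy = sum((y - mean_y) ** 2 for y in ys)
    return sxy / math.sqrt(sxx * syy)

def ranks(values):
    """1-based ranks; tied values share the average of their positions."""
    order = sorted(range(len(values)), key=lambda i: values[i])
    result = [0.0] * len(values)
    i = 0
    while i < len(order):
        j = i
        while j + 1 < len(order) and values[order[j + 1]] == values[order[i]]:
            j += 1
        average_rank = (i + j) / 2 + 1
        for k in range(i, j + 1):
            result[order[k]] = average_rank
        i = j + 1
    return result

def spearman(xs, ys):
    """Spearman correlation: Pearson correlation of the ranks."""
    return pearson(ranks(xs), ranks(ys))

# Hypothetical data: a monotone relationship with one extreme y value.
xs = [1, 2, 3, 4, 5, 6]
ys = [2, 4, 5, 7, 8, 100]

print(f"Pearson:  {pearson(xs, ys):.3f}")   # pulled well below 1 by the outlier
print(f"Spearman: {spearman(xs, ys):.3f}")  # ranks are identical, so this is 1
```

Spearman measures monotonic rather than linear association, so it answers a slightly different question than Pearson; whether that trade-off is acceptable depends on the analysis.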

  • @pal999
    @pal999 2 years ago

    If you're using a real-world example, you shouldn't "ASSUME" the SD to be something. Can you find out how it's determined in the real world?
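One common answer to this question, sketched under stated assumptions: when the population SD is not known, it is estimated from the sample and the test statistic is compared against a t distribution instead of the standard normal. The sample, the null mean of 100, and the function names below are hypothetical; only the assumed SD of 20 comes from the comments above.

```python
import math
from statistics import NormalDist, mean, stdev

def z_statistic(sample, mu0, sigma):
    """One-sample z statistic when the population SD is treated as known."""
    return (mean(sample) - mu0) / (sigma / math.sqrt(len(sample)))

def t_statistic(sample, mu0):
    """One-sample t statistic: the SD is estimated from the sample itself."""
    return (mean(sample) - mu0) / (stdev(sample) / math.sqrt(len(sample)))

# Hypothetical sample and null mean; NOT the numbers used in the video.
sample = [112, 95, 130, 104, 121, 88, 117, 109, 126, 98]
mu0 = 100

z = z_statistic(sample, mu0, sigma=20)        # SD of 20 assumed, as in the comment
p_z = 2 * (1 - NormalDist().cdf(abs(z)))      # two-sided p-value under the normal

t = t_statistic(sample, mu0)                  # uses the sample SD (n - 1 divisor)
# A p-value for t needs the t distribution with n - 1 degrees of freedom,
# which the standard library does not provide.

print(f"z = {z:.3f}, two-sided p = {p_z:.3f}")
print(f"sample SD = {stdev(sample):.2f}, t = {t:.3f}")
```

In practice the "known" SD usually comes from historical data on the same metric; with a small sample and no such history, the t-based version is the more defensible choice.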

  • @Foba_Bett
    @Foba_Bett 1 year ago +1

    I am binge-watching your channel ! 😎
    In the correlation section - why not just straight up remove the outliers? 🤔

    • @gaboqv
      @gaboqv 1 year ago

      That's what he is describing with a fancy name: you use quartiles to confirm which of the points are outliers.

  • @dreamingaparisdream3178
    @dreamingaparisdream3178 2 years ago +4

    For the normal distribution, is it 66-95-99.7 rule or 68-95-99.7?

    • @TheNIK21HIL
      @TheNIK21HIL 2 years ago +1

      It is 68% within 1 SD. It must be a typo on Dan's end. The graph, though, does represent it correctly.

    • @ASHISHDHIMAN1610
      @ASHISHDHIMAN1610 2 years ago

      @TheNIK21HIL yeah, typo

  • @BrianSalamone
    @BrianSalamone 4 months ago

    1:08 8 hours a day in Facebook????? What is the X at the bottom?

  • @michaell9804
    @michaell9804 2 years ago +3

    You failed to mention Bayes' theorem and the binomial distribution, which are used here just as heavily as the normal distribution, particularly when quantifying the probability distribution of the accuracy of unsupervised learning models. This video is not comprehensive at all.

    • @Omegageekk
      @Omegageekk 2 years ago +13

      If you thought a video titled “5 concepts in statistics you should know” would be a comprehensive breakdown of literally every stats concept you need for data science, then I have a bridge to sell you.