Regression: Crash Course Statistics #32

Sdílet
Vložit
  • čas přidán 2. 10. 2018
  • Today we're going to introduce one of the most flexible statistical tools - the General Linear Model (or GLM). GLMs allow us to create many different models to help describe the world - you see them a lot in science, economics, and politics. Today we're going to build a hypothetical model to look at the relationship between likes and comments on a trending CZcams video using the Regression Model. We'll be introducing other popular models over the next few episodes.
    Crash Course is on Patreon! You can support us directly by signing up at / crashcourse
    Thanks to the following Patrons for their generous monthly contributions that help keep Crash Course free for everyone forever:
    Mark Brouwer, Kenneth F Penttinen, Trevin Beattie, Satya Ridhima Parvathaneni, Erika & Alexa Saur, Glenn Elliott, Justin Zingsheim, Jessica Wode, Eric Prestemon, Kathrin Benoit, Tom Trval, Jason Saslow, Nathan Taylor, Brian Thomas Gossett, Khaled El Shalakany, Indika Siriwardena, SR Foxley, Sam Ferguson, Yasenia Cruz, Eric Koslow, Caleb Weeks, D.A. Noe, Shawn Arnold, Malcolm Callis, Advait Shinde, William McGraw, Andrei Krishkevich, Rachel Bright, Mayumi Maeda, Kathy & Tim Philip, Jirat, Ian Dundore
    --
    Want to find Crash Course elsewhere on the internet?
    Facebook - / youtubecrashcourse
    Twitter - / thecrashcourse
    Tumblr - / thecrashcourse
    Support Crash Course on Patreon: / crashcourse
    CC Kids: / crashcoursekids

Komentáře • 187

  • @justynaizabela2495
    @justynaizabela2495 Před 5 lety +564

    This series is amazing! I have majored in Statistics and still this series explains everything much better than college classes.

    • @panosshady6168
      @panosshady6168 Před 4 lety +66

      You majored in statistics? Wow, and here I thought that I hated myself.

  • @Waltham1892
    @Waltham1892 Před 5 lety +766

    BRAIN HURT!!!

    • @MasterofPlay7
      @MasterofPlay7 Před 5 lety +13

      nah actually I find it very fun, Z test T test F test and anova all has to do with std and variance which has to do with the mean

    • @nikkid4890
      @nikkid4890 Před 4 lety +6

      @@MasterofPlay7 I also LOVE stats. And this from a person that sold my math text books for toffee at school! Once you get it, it's so much fun!

    • @MasterofPlay7
      @MasterofPlay7 Před 4 lety +8

      ​@@nikkid4890 yes we human are too dumb, most statistical analysis have to do with a straight line (most models are based on y=ax+b) cuz we can only perceive the relationship through a straight line

  • @sarlut
    @sarlut Před 4 lety +57

    I swear this series is the reason I am actually doing well in statistics! Wish I had this in my BSC (MSc Student)

  • @benbernanke4037
    @benbernanke4037 Před 4 lety +15

    These graphical presentations are so good, especially at 8:30 the didferent sums of square types

  • @NN_2000
    @NN_2000 Před 5 lety +170

    Only crash course can make statistics interesting. Thank you for making quality educational videos for free! :D

  • @raposo_debora
    @raposo_debora Před 4 lety +59

    This course is sooo good. I'm using the Covid-19 Quaratine to educate myself in Statistics and this Crash Course was THE finding. Thanks a lot!

  • @alfredgustafsson4708
    @alfredgustafsson4708 Před 4 lety +30

    This is great, especially the explaination of degrees of freedom. I never really understood it through five years of Economics so thank you.

  • @williamkee6578
    @williamkee6578 Před 4 lety +3

    This video is absolutely helpful! One single video and I understand the contents from 2 hours class.

  • @dondacurator
    @dondacurator Před 5 lety +4

    This right here is the most entertaining and intriguing statistical video Ive ever watched.. it actually made stats fun, thanks for incorporating art and creativity to this piece ,,instead of old and boring numbers presented in a monotonic go to sleep now voice

  • @theidiotboy100
    @theidiotboy100 Před rokem +8

    I think you guys are the reason people study or stay in school. net positive for humanity. thanks for helping people.

  • @klaras5703
    @klaras5703 Před 5 lety +148

    the best thing about the video is how the pumpkin and the transformer slowly eat all the candy worms that were on the table during the video

  • @mayankjacky
    @mayankjacky Před 7 měsíci +1

    It was a very comprehensive, concise and crisp presentation on a complex topic. Kudos to the entire team for an excellent effort.

  • @stankalfon2170
    @stankalfon2170 Před 5 lety +5

    Thank you this helped me so much! Will you do a video on multiple regression and econometrics in general? Keep up the good work you guys rock!

  • @kanitoneko
    @kanitoneko Před 5 lety +264

    How can she keep speaking without inhaling!?

  • @nikkid4890
    @nikkid4890 Před 4 lety +3

    Wow! You are brilliant. I'm post-grad and needed to refresh. Brilliant

  • @Maria-hd4hk
    @Maria-hd4hk Před 5 lety +7

    First minute and a half and i've actually learnt so much

  • @Ureallydontknow
    @Ureallydontknow Před 5 lety +2

    this video is top production quality and expert instruction. thank you so much.

  • @ravindukarunarathne507
    @ravindukarunarathne507 Před 4 lety +1

    So nice, can keep watching for hours.. Well done

  • @TaroQuispe
    @TaroQuispe Před 4 lety +2

    You guys rock the house, super clear, super helpful!

  • @RaulPelcastreRealEstate
    @RaulPelcastreRealEstate Před 4 lety +42

    She speaks a little too fast for me but clearly explained. I like it.

  • @Malik-jt8hi
    @Malik-jt8hi Před 5 lety +1

    Thank god for crash course lol, godsend channel to start to learn a topic when I gotta teach my brother about a topic I’ve never learned myself

  • @florentinfrank3671
    @florentinfrank3671 Před 4 lety +1

    Well done! thanks so much for all the efforts! now i understand better!

  • @greensteve9307
    @greensteve9307 Před 5 lety +2

    So much clearer than my uni stats lecture!

  • @navroopgill7594
    @navroopgill7594 Před 4 lety

    Amazing . Thanks for the info shot!

  • @OlleLindestad
    @OlleLindestad Před 5 lety +80

    NOTE: This video uses the abbreviation "GLM" incorrectly (or at least very misleadingly) throughout.
    The general linear model is NOT usually what is meant by "GLM". Instead, GLM stands for generaLIZED linear model, which is a special kind of linear model that (among other things) allows for a response variable that is not normally distributed. (Yes, this is extremely confusing. Don't even get me started on the word "linear", which doesn't even mean "straight lines" in this context.)
    Bottom line: substitute simply "linear model" whenever Adrienne says "GLM" in this video, and you'll be fine.

  • @dianarinker8429
    @dianarinker8429 Před rokem

    what a great explanation!! thank you so much!

  • @MasterofPlay7
    @MasterofPlay7 Před 5 lety

    Wow you opened my brains and I found it statistic is so fun!

  • @dilnoza2168
    @dilnoza2168 Před 5 lety

    I Love this channel:) this is very helpful channel

  • @neilcidial-masrysandagesid7796

    Insightful. Will read watch.

  • @kanoutema
    @kanoutema Před 4 lety

    Bless you guys !

  • @ramanshaw8922
    @ramanshaw8922 Před 5 lety

    good work ..it is really helping.

  • @breakinchico
    @breakinchico Před 4 lety

    This is Incredible

  • @corinneblair8795
    @corinneblair8795 Před 2 měsíci

    So good! So helpful! Thank You!!

  • @TilleTheo
    @TilleTheo Před 4 lety +11

    It was a bit too fast, but very helpfull still! Will watch it a few more times.

  • @HinamiMel
    @HinamiMel Před rokem

    every night brings a dream but the day, relentlessly, keeps me awakeee

  • @jigarpatel2792
    @jigarpatel2792 Před 5 lety

    Great explanation

  • @saivishnutulugu5014
    @saivishnutulugu5014 Před 5 lety +3

    Can you go over nonlinear data models(exponential, power, etc) and also Simpson's paradox in the future?

  • @seltsamerjunge3642
    @seltsamerjunge3642 Před 5 lety +5

    Interesting. I just factchecked the theory about the comment-to-likes ratio, and it met pretty well: At the time I've written this, there were 41 comments and 391 likes, which is just the value "4000/100" shown in the diagram... As it turned out, this time it's above the regression line, but with an increase in the y-value by less than 35%

  • @ZzDC2
    @ZzDC2 Před 5 lety +1

    new video yay!

  • @EconomicalUnicorn
    @EconomicalUnicorn Před 4 lety +6

    Did anyone notice how as the video goes on, there are less and less lollies (sweets) near the pumpkin lmao

  • @mw79863s
    @mw79863s Před 5 lety +30

    Some unnecessarily confusing parts:
    It would have been helpful to explain that our zero-coefficient line IS the line y='y hat'.
    The point referred to at 7:05 is not highlighted or pointed out (and as it sits far above its distance for SSR it isn't instantly recognizable as connected).
    Positioning of the equations at 8:50 gives strong and erroneous implication that each refers specifically to the diagram above.
    The equation given for F-statistic at 8:58 is then instantly revised as not being correct.
    The correct f-statistic equation is only on screen at 10:07 for a fraction of the time needed to read it - let alone fully digest it.

    • @BastiPROTON
      @BastiPROTON Před 4 lety +1

      Exactly! I was so confused the whole time, a lot of it makes little sense if you see this stuff for the first time.

    • @jaceychang5785
      @jaceychang5785 Před 4 lety

      I think the F-statistic formulas are wrong at both 8:58 and 10:07 ! At 10:07 the denominator and numerator should be reversed!

  • @jimivie
    @jimivie Před 4 lety

    100/100 - great video

  • @danieldelacruz7642
    @danieldelacruz7642 Před 5 lety +175

    Who needs a regression calculation when you have "add trendline" in Excel?

    • @Tntpker
      @Tntpker Před 5 lety +30

      who still uses excel in 2018 lol. keep up and learn python noob

    • @avinoamr
      @avinoamr Před 5 lety +3

      The folks that want to work at Microsoft or any of it's competitors

    • @alanle18
      @alanle18 Před 5 lety +97

      Tntpker excel will forever be used, it has a great balance between learning curve and power. You must feel good about yourself putting strangers down over the internet.

    • @OlleLindestad
      @OlleLindestad Před 5 lety +20

      I mean, that's what "add trendline" does. It does a regression on your data. :D

    • @nytmare3448
      @nytmare3448 Před 5 lety

      But a B-spline looks sooo much nicer!!!

  • @SaraAB98
    @SaraAB98 Před 3 lety

    Thank you very much ❤

  • @adamacosta5019
    @adamacosta5019 Před 4 lety

    So lost I want to cry, but seeing @AstroKatie was a nice pick me up

  • @Sagitarria
    @Sagitarria Před 5 lety +2

    For the trick or treat example, would it be appropriate to try a logarithmic transformation?

  • @kcthewanderer
    @kcthewanderer Před 5 lety

    Ooo this is starting to get good...

  • @PhysqueLab
    @PhysqueLab Před 5 lety +3

    May i ask when the logistic regression video will be uploaded?

  • @emilydack2074
    @emilydack2074 Před 5 lety

    Bless this video

  • @brodyreingold5992
    @brodyreingold5992 Před rokem

    great video

  • @manzurekhoda7013
    @manzurekhoda7013 Před 4 lety +1

    You should have include nonlinear methods of regression in this video. Anyway, great video.

  • @caioraimundo1609
    @caioraimundo1609 Před 4 lety

    AMAZING

  • @carlosfloresventuri6452
    @carlosfloresventuri6452 Před 5 lety +1

    Thank you so much, this helped me a lot! :D

  • @expansivegymnast1020
    @expansivegymnast1020 Před 11 měsíci

    This series is pointless... until you actually need this stuff for class and then you're thankful to God that it exists. Thanks for everything y'all do!

  • @viktorianas
    @viktorianas Před 4 lety

    One of the best episodes of the series. Mustaches - 9:22

  • @alexbe3136
    @alexbe3136 Před 4 lety

    Easter egg alert: The candies dissapear while the video goes on :)

  • @berfeito
    @berfeito Před 5 lety +2

    Can anyone recommend an exercise book or a site with practice questions for statistics? I feel like I need to practice it on my own. Cheers.

  • @mayankjacky
    @mayankjacky Před 7 měsíci

    Thanks

  • @treelight1707
    @treelight1707 Před 5 lety +5

    I finally figured out the issue with this series, why it is so hard to follow. The animations are too much, too fast for statistics. I can barely follow through with the examples, or cannot follow at all. Example: the calculations; you can't remove each line before the next. I would want to see what numbers went where, and it is not that long of a calculation that you need to have space. Other than that, I think everything else is fine. Crash course Economics was awesome btw.

  • @bharathreddy4806
    @bharathreddy4806 Před 5 lety

    please add content of full machine learning algorithms

  • @sapandeepsandhu4410
    @sapandeepsandhu4410 Před rokem

    love it

  • @circleofideas9549
    @circleofideas9549 Před 4 lety

    Mam you are so sweet. thankx you teaching us.

  • @samsonthelionhearted6873
    @samsonthelionhearted6873 Před 4 lety +109

    You’re going so fast you lost me a bit.

  • @medslarge
    @medslarge Před 5 lety +9

    Didn’t really understand the degrees freedom part 🤔

  • @jeremiahharemza1235
    @jeremiahharemza1235 Před 4 lety +5

    "I know Kung Fu" - Neo
    "Show me." - Morpheus

  • @libbylebyane3681
    @libbylebyane3681 Před rokem

    Linear Regression is the building block in Artificial Intelligence predictions

  • @siddharththomas5740
    @siddharththomas5740 Před 5 lety

    How many more episodes will there be?

  • @tutukkunoor
    @tutukkunoor Před 5 lety +1

    At 9:33, she says 'The sums of squares for regression (SSR) has one degree of freedom as one degree is consumed in calculating slope of the model line'. How is that o.O

  • @iamdrscript
    @iamdrscript Před 5 lety

    best explaination ever!!!

  • @demonika2060
    @demonika2060 Před rokem +2

    lmao i didn't understand anything

  • @cbottube
    @cbottube Před 5 lety +1

    *watches Optimus very closely throughout the video*

  • @tatsianatati8375
    @tatsianatati8375 Před 5 lety +9

    Oh my , too fast for me🤯🤯🤯

  • @aldorosas1136
    @aldorosas1136 Před 5 lety +2

    Very interesting. One comment though. "The regression line is the one straight line that minimizes the sum of the squared distances of each point to the line" (3:50) can be slightly misleading. It seems to suggest the actual distance from each point to the line, which (except for a horizontal line) would not be vertical. It should say, "...minimizes the sum of the squared vertical distances from each point to the line."

  • @sunsusan2739
    @sunsusan2739 Před 4 lety +1

    I'm confused by the equation at 2:24. Should "increase in likes per comment" be in blue, standing for m instead of x?

  • @mpilosov
    @mpilosov Před 5 lety +2

    At 9:43, do you mean “the mean”
    In the null model, we are just using the mean of the data (one independent piece of info) to predict the outcome. You say “slope,” but aren’t we not using slope, i.e. setting it to zero?

    • @mikail5682
      @mikail5682 Před 5 lety

      It's at 9:34
      "The sums of squares for regression (SSR) has one degree of freedom, because we are using one piece of independent information to estimate our coefficient, the slope"
      Correct me if I'm wrong, but the sentence has to be "...we are using one piece of independent information to estimate SSR, the mean".
      If this is incorrect, please explain why.

  • @davidcampos1463
    @davidcampos1463 Před 5 lety

    You mean it's all of us human being random number generators against CZcamss mechanical algorithms. "Of course you realize, this means war!"

  • @Nshiime
    @Nshiime Před 5 lety

    Hi at 3:51 is it sum of(observed value minus predicted value)^2 or is it sum of(observed value minus average of values observed)^2

  • @likiee
    @likiee Před 4 lety

    Math makes me cry

  • @iefe65
    @iefe65 Před 5 lety

    I don't get why in 9:35 she says that we only need 1 degree of freedom to calculate the slop. I understand the 98 DF for SSE but I don't get why SSR has only 1 DF

  • @kensaville513
    @kensaville513 Před 8 měsíci

    Useful video thanls. it looks like the alpha value used was 0.5. Should this be 0.05?

  • @milesbrown1889
    @milesbrown1889 Před 4 lety

    Did anyone else notice the bell curve in the background and where she sits positions her as being among the average? how funny is that! I’m not showing off my observational skills at all it’s just an observation.

  • @NikitaSamourai
    @NikitaSamourai Před 5 lety +3

    i don't understand why the sums of squares for regression has one degree of freedom

    • @NikitaSamourai
      @NikitaSamourai Před 5 lety

      I DONT WANT TO OPEN TABACHNIK AND FIDELL

    • @iefe65
      @iefe65 Před 5 lety

      same, I don't get it

    • @cycla
      @cycla Před 5 lety

      Because only 1 independent variable is used to generate the regression

  • @danyypao1824
    @danyypao1824 Před 5 lety +4

    Im so lost

  • @kowalityjesus
    @kowalityjesus Před 5 lety +15

    I really appreciate this explanation, but I think you started moving too quick when discussing degrees of freedom. I can't get what you're talking about after listening to it even several times. Specifically the lines she says at 19:13 are completely non-understandable to me. Thanks, though

  • @jaceychang5785
    @jaceychang5785 Před 4 lety

    The F-statistic formulas are wrong at both 8:58 and 10:07 ! Though the calculation is correct.

  • @ArjunTheCreator8
    @ArjunTheCreator8 Před rokem

    Guess I'm the only one who noticed the gummy worms slowly disappearing...

  • @missaster1902
    @missaster1902 Před 4 lety

    You can’t be like that glasses 👓 guy 🥺

  • @rajdubey4389
    @rajdubey4389 Před 5 lety

    SIR PLZ MAKE VIDEOS ON MATHS IF U WANNA CROSS 10M COZ THERES MILLION OF SAME DEMAND

  • @jennyskene4777
    @jennyskene4777 Před 5 lety

    This lady is a great presenter. I wish she was my teacher!

  • @technofeeliak
    @technofeeliak Před 5 lety +8

    Statistics are the ultimate rationalization of life's experiences through math. Unfortunately, the government and other organizations can take this oversimplification to back up their fallacies.

    • @KitsuneSoftware
      @KitsuneSoftware Před 5 lety +1

      Only when their audience doesn't understand the stats. It's like small print in contracts (who reads those?) or those disclaimers in adverts in tiny print or really fast voices. Lessons like these help us to not be fooled.

  • @actualprogramming
    @actualprogramming Před 5 lety

    Next what? Correlation?

  • @emopeterparker7
    @emopeterparker7 Před 4 lety

    hi apitong 👋

  • @st0lf
    @st0lf Před 5 lety +1

    This video is really helpful and the explanations are easy to understand. I think however some quiet music might be able to break the tension a bit, especially as the sound effects tend to get repetitive. Some quiet Lo-Fi instrumentals in the background could really help the videos seem even more polished.
    Keep up the good work. ^^

  • @sungkim1397
    @sungkim1397 Před 5 lety +1

    I am lost +_+

  • @higher_haze
    @higher_haze Před 5 lety +1

    I hope Crash Course helps me in college.

  • @Abhalerao96
    @Abhalerao96 Před 5 lety

    OPTIMUS!!! You're distracting me!

  • @lagh
    @lagh Před 4 lety +1

    Just use SPSS 😂

  • @JEOGRAPHYSongs
    @JEOGRAPHYSongs Před 5 lety +2

    There is certainly something to be said for flexibility.

  • @alikhan81
    @alikhan81 Před 4 lety +2

    Just to F-up the F-Test, I'm gonna leave a comment without liking the video

  • @unleashingpotential-psycho9433

    I remember statistics class in school was very challenging T_T

    • @gardenhead92
      @gardenhead92 Před 5 lety +3

      Said like a true psychologist

    • @DPMixing
      @DPMixing Před 5 lety +1

      Well it seems easy when you get to watch entertaining, visually-stimulating videos and not have to be assessed for your application of the concepts with homework and exams...😂😂😂

    • @gnometheory3831
      @gnometheory3831 Před 5 lety

      @@DPMixing Yup, I am in AP stats with a 97% and watched this for fun to see how dumbed down it is. The answer: very.