Data Analysis 2: Data Visualisation - Computerphile

  • Added 24 Aug 2024
  • Seeing is believing - Dr Mike Pound helps us understand how to turn our datapoints into Powerpoints. This is part 2 of the Data Analysis Learning Playlist: • Data Analysis with Dr ...
    This Learning Playlist was designed by Dr Mercedes Torres-Torres & Dr Michael Pound of the University of Nottingham Computer Science Department. Find out more about Computer Science at Nottingham here: bit.ly/2IqwtNg
    This series was made possible by sponsorship from Google.
    The ‘Chicken’ dataset was adapted from this dataset:
    stat.ethz.ch/R...
    / computerphile
    / computer_phile
    This video was filmed and edited by Sean Riley.
    Computer Science at the University of Nottingham: bit.ly/nottsco...
    Computerphile is a sister project to Brady Haran's Numberphile. More at www.bradyharan.com

Comments • 111

  • @Computerphile
    @Computerphile 5 years ago +16

    Check out the full Data Analysis Learning Playlist: czcams.com/play/PLzH6n4zXuckpfMu_4Ff8E7Z1behQks5ba.html

    • @chadross3176
      @chadross3176 5 years ago +5

      Did you, or can you, make the datasets and R code used in this series available somewhere?

    • @donovan30081995
      @donovan30081995 4 years ago

      @@chadross3176 Yes, this would be really helpful!!

    • @donovan30081995
      @donovan30081995 4 years ago

      Please make the datasets used in this series available! It would really help us!

  • @mokopa
    @mokopa 5 years ago +96

    The chickens/eggs/diet example was wisely chosen, perfectly presented and definitely relevant. Well done, you certainly deserve your thumbs-up!

  •  5 years ago +193

    I came up with the optimal diet for chicken... unfortunately it only works for spherical chicken in a perfect vacuum.

    • @Ethelgiggle
      @Ethelgiggle 4 years ago +4

      Do you know the aerodynamics of the chicken too?

    • @stevanmiladinovic4007
      @stevanmiladinovic4007 4 years ago +10

      @@Ethelgiggle Of course not. That's why it only works in a vacuum.

    • @dylanhjulian
      @dylanhjulian 2 years ago

      Can you fix me the data? I'm researching for a utopian colony for harvest in orbit.

    • @tex1297
      @tex1297 1 year ago

      😂😂😂

  • @Ethelgiggle
    @Ethelgiggle 4 years ago +3

    I'm really regretting not studying computer science in Nottingham. All of those people are awesome, especially Dr. Mike. To have a class with him would be nice for sure.

  • @dimitriouchemistry2215
    @dimitriouchemistry2215 5 years ago +61

    It's funny how you were talking about types of data, and then the commercial before your video said that the "tacos are 15% tastier"

    • @Abby_Liu
      @Abby_Liu 4 years ago +2

      15% more people said that they were tasty? or the average rating they gave rose by 15%??

    • @Abby_Liu
      @Abby_Liu 4 years ago +1

      or the cows for the beef got 15% fatter? idk

  • @black_platypus
    @black_platypus 5 years ago +42

    15:56 Your favorite sensationalist news outlet: *Diet A kills older chickens!*

  • @Nayus
    @Nayus 5 years ago +14

    Data visualisation's impact on the conclusion is often overlooked or underrated. I remember in experimental physics class how, when we had to make the final visualisations of our measurements, especially histograms, the difference in width of the intervals made a huge difference in the overall look of the graph, without even touching any of the measurements. It's interesting how, by changing something completely "legal" and without obscuring any result, you could still imply a different behavior.

    • @myothersoul1953
      @myothersoul1953 5 years ago

      That's a bug, not a feature. If the width of your histogram bars or the distance between them impacts your conclusion, then it's a problem (or you're doing marketing, not science).

    • @Nayus
      @Nayus 5 years ago

      @@myothersoul1953 definitely. It was kinda marketing. Because to see that the experiment was done "properly", they expected a bell curve on our measurements' histogram. So when we did it, and it did not really look like a bell curve (even though in truth it was) a tweak on those things made a much more "pleasant" looking curve. It didn't change much of the conclusion, but it was just way more clear to the eye that it behaved as declared.

    • @jtbozify
      @jtbozify 5 years ago

      If you like data visualizations, I just started this channel where I'll visualize data on a variety of different topics like sports, gaming, politics, etc! Come check it out if you like this stuff...

  • @tomaspinguim
    @tomaspinguim 5 years ago +5

    The boxplot does not plot the maximum and minimum of the data. It actually draws the whiskers out to the 3rd quartile (q3) + 1.5 * IQR and the 1st quartile (q1) - 1.5 * IQR (more precisely, to the most extreme data points within those limits). IQR means interquartile range, the distance between q1 and q3.
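
The whisker rule described in this comment can be sketched in a few lines of Python. This is only an illustration under stated assumptions: the multiplier 1.5 is the usual Tukey convention, the `ages` list is made up rather than taken from the video, and quartile definitions differ slightly between tools, so a plotting library may place the box edges a little differently.

```python
from statistics import quantiles

def tukey_whiskers(values, k=1.5):
    """Return (lower whisker, upper whisker, outliers) using the k * IQR rule."""
    # Quartiles via the "inclusive" method; other tools may use another definition.
    q1, _median, q3 = quantiles(values, n=4, method="inclusive")
    iqr = q3 - q1
    lower_fence = q1 - k * iqr
    upper_fence = q3 + k * iqr
    # Whiskers stop at the most extreme data points that lie inside the fences.
    inside = [v for v in values if lower_fence <= v <= upper_fence]
    outliers = [v for v in values if v < lower_fence or v > upper_fence]
    return min(inside), max(inside), outliers

# Hypothetical chicken ages in weeks (not the video's data).
ages = [10, 12, 13, 14, 15, 16, 18, 40]
low, high, outliers = tukey_whiskers(ages)
print(low, high, outliers)
```

In this made-up sample the value 40 falls outside the upper fence, so it would be drawn as a separate point and the upper whisker would stop at 18 rather than at the maximum.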

  • @Izzy-ve3xz
    @Izzy-ve3xz 5 years ago +5

    I didn’t think I could like Dr. Mike any more than I already do, but then he said one of his favorite TV shows was Frasier and I was proven wrong.

  • @SomebootyElse
    @SomebootyElse 5 years ago +29

    Those down votes came from the pie chart enthusiasts.

  • @ExplicableCashew
    @ExplicableCashew 5 years ago +9

    I see Mike Pound, I click. What a good day

  • @zerokelvin3626
    @zerokelvin3626 5 years ago +28

    What makes a good graph? I can really recommend the book "The Visual Display of Quantitative Information" by Edward Tufte. In fact every scientist, politician and journalist should be mandated to read it in my opinion

    • @jtbozify
      @jtbozify 5 years ago +3

      I've started doing data visualization myself on my channel. I just started so the topics are a bit boring, but I'm about to move into sports, gaming, politics, pop culture, etc... come check it out if you want!

  • @isabellabihy8631
    @isabellabihy8631 5 years ago +2

    Thank you for making the appropriate visualization clear. Excellent! People get duped easily by impressive visuals: if it looks good, the scale becomes unimportant. Along comes a convincing presenter, and the truth gets lost. The statistical functions in MS Excel (and other spreadsheet applications) are fantastic, but if you apply them inappropriately, your visualization may look good and be colorful without really meaning anything.

  • @ThePinkArmy
    @ThePinkArmy 4 years ago +2

    Came here for the data talk, got me some shared Frasier love! Great video

  • @dreznik
    @dreznik 5 years ago +1

    the hard part of this work is not the analysis, but the preparation. 9:1 of the work. the R/tidyverse is a good tool

  • @ramixnudles7958
    @ramixnudles7958 5 years ago +5

    In the last analysis, would it be appropriate to use the min/max of the data for B and C to limit the ages of the chickens for A? I.e., take only those chickens on diet A which were at least as old as the youngest chickens of B or C?
    This might wind up reducing the data points available for A, but would remove the younger age bias.
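
The filtering idea in this comment could look roughly like the sketch below. Everything here is an assumption for illustration: the records are invented, the tuple layout `(diet, age_in_weeks, eggs_per_week)` is hypothetical, and the sketch clips to both the minimum and the maximum age seen on diets B and C, which goes slightly beyond the "at least as old as" rule in the comment.

```python
from statistics import mean

# Hypothetical records: (diet, age_in_weeks, eggs_per_week). Not the video's data.
chickens = [
    ("A", 20, 6), ("A", 30, 6), ("A", 60, 4), ("A", 80, 3),
    ("B", 55, 4), ("B", 70, 4), ("B", 90, 3),
    ("C", 50, 5), ("C", 65, 4), ("C", 85, 3),
]

def restrict_to_shared_ages(records, reference_diets=("B", "C")):
    """Keep only records whose age lies within the age range of the reference diets."""
    ref_ages = [age for diet, age, _eggs in records if diet in reference_diets]
    lo, hi = min(ref_ages), max(ref_ages)
    return [r for r in records if lo <= r[1] <= hi]

def mean_eggs_by_diet(records):
    """Average egg count per diet."""
    by_diet = {}
    for diet, _age, eggs in records:
        by_diet.setdefault(diet, []).append(eggs)
    return {diet: mean(eggs) for diet, eggs in by_diet.items()}

before = mean_eggs_by_diet(chickens)
after = mean_eggs_by_diet(restrict_to_shared_ages(chickens))
print(before)
print(after)
```

With these invented numbers, diet A looks best before filtering only because it includes the two youngest birds. As the comment notes, the price of removing that bias is fewer data points for diet A.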

  • @Alex-jv6bs
    @Alex-jv6bs 5 years ago +2

    This whole series is awesome. Well explained. I wish you did it before I wrote my thesis :D

  • @alik250
    @alik250 3 years ago

    This guy loves when things make sense

  • @theshuman100
    @theshuman100 5 years ago +13

    farmers hate him. see how he gets younger chickens with this amazing diet

  • @TheEgesko
    @TheEgesko 5 years ago +2

    Mike Pound is back!!

  • @TheAsymmetrical
    @TheAsymmetrical 5 years ago +8

    Currently writing out letters so people will hire me as a "data manager" listening to D A D D Y P O U N D gives me strength doing so. Ty ty
    (this is a great resource genuinely ty)

  • @haraldurkarlsson1147
    @haraldurkarlsson1147 1 year ago

    Wait a second. That first graph, a time series, does make sense to geoscientists and climate change. I do agree there are too many curves but a legend should explain that.

  • @morganwilliams835
    @morganwilliams835 5 years ago +9

    Love it! Can you please link to your R-code?

  • @JohnDoe-ev9fo
    @JohnDoe-ev9fo 2 years ago

    A +1 as always, but especially for Frasier this time. :D

  • @Yaxqb
    @Yaxqb 3 years ago

    The chicken thing was extremely insightful and makes me question my own diet

  • @rickl8280
    @rickl8280 5 years ago +5

    Would it be too much to ask for the sample data sheet to be uploaded?

  • @polares8187
    @polares8187 5 years ago +12

    What's with the dutch angle?

    • @Macieks300
      @Macieks300 5 years ago +2

      to distinguish theory from practice parts

  • @niallmckeon8758
    @niallmckeon8758 4 years ago +2

    For a novice to practice, would be great if you could make the Chicken.csv file available for download???

  • @hisheighnessthesupremebeing

    @15:00.. going from 3 to 4 eggs is not a 20% increase but a 33% increase (3 x 1.20 = 3.6 and 3 x 1.33.. = 4)
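
The arithmetic in this comment is easy to check with a small helper. The function name `percent_increase` is made up for this sketch; the formula is the usual relative change, (new - old) / old.

```python
def percent_increase(old, new):
    """Relative change from old to new, expressed as a percentage of old."""
    return (new - old) / old * 100

# Going from 3 eggs to 4 eggs:
print(round(percent_increase(3, 4), 1))
```

A 20% figure would only be right if the extra egg were measured against a baseline of 5 eggs, which is not the comparison being made here.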

  • @simonh1994
    @simonh1994 5 years ago +2

    Could you explain why it makes sense to create plots about age with the chicken dataset? It seems unintuitive to me to measure the age of the same chicken over time and then create a histogram / boxplot.

  • @thatchessguy7072
    @thatchessguy7072 2 years ago

    @9:15 that’s not an assumption, that’s the intermediate value theorem.

  • @MrTechguy365
    @MrTechguy365 5 years ago

    Better than my lectures! And I am studying Computer science...
    Might consider switching to Nottingham :D

    • @jtbozify
      @jtbozify 5 years ago +1

      If you're into data, I just started my channel where I'll do different data visualization mini-projects. The topics have started off basic, but I'll get into gaming graphs and sports stats, politics, etc. very soon. Come check it out!

  • @DevMeSteve
    @DevMeSteve 2 years ago

    good jab.

  • @javiergonzalezarmas8250

    Excellent video!

  • @notoriouskiller1
    @notoriouskiller1 2 years ago

    Mike’s sweater is fresh. Where do I get one

  • @ev.c6
    @ev.c6 3 years ago

    Data analysts graduated in online courses: it is what it is.

  • @jesstatz8695
    @jesstatz8695 3 years ago

    What’s the price point between the diets and how much money per egg is being realised? Some more questions for charting

  • @DavidHar
    @DavidHar 3 years ago

    Pie charts are great if you want to see how much a data point takes up from the total

  • @MikeDolanFliss
    @MikeDolanFliss 5 years ago

    Love that little glimpse of R and ggplot. :)

  • @youngzproduction7498
    @youngzproduction7498 4 years ago

    Very nice analysis on the age, diet, eggs part. Thanks for the nice vid, though.

  • @williamchamberlain2263

    OK - I've finally subscribed

  • @karengomez3143
    @karengomez3143 1 year ago

    Is there a way to prove causation using stats? Or is it only through lots of probes that you can prove causation?

  • @khattami240193
    @khattami240193 5 years ago

    I like that you use ggplot2

  • @kalamatej
    @kalamatej 1 year ago

    IMHO you do not necessarily need more chickens. You could look at the effect of age within each diet and later even check for interactions between observations, e.g. age and diet. Although there you might be missing some young chickens on diet B.
    ...feels odd to write things like that 😂
    Hopefully I see that in later episodes. It has been a great series so far 🤗👍

  • @donovan30081995
    @donovan30081995 4 years ago +2

    Is there any way I can download the dataset to practice?

  • @markmorillo2954
    @markmorillo2954 3 years ago

    Amazing !!!!!

  • @daltonyon
    @daltonyon 5 years ago +1

    Awesome class! Thank u very much o /

  • @pseudo880
    @pseudo880 2 years ago

    To add a mathematician's view of the bar chart, we would say the labels == categorical data. In the previous video, I think I would map the N of NOIR, i.e. nominal data, to categorical data. Would we say nominal data --> implies categorical data? Is the reverse true?

    • @pseudo880
      @pseudo880 2 years ago

      Sorry, more maths chatter on the histogram. The histogram 'labels' are usually called bins, and the y axis shows the frequency of chickens that fall into each range of continuous data. I guess in a computer science sense the range would probably (but not always) be a float rather than an integer. All this said, Dr. Mike Pound is wonderful, but just as he has bugbears with plotting subsets of data and wrong chart types, to a mathematician, saying the frequency of chickens at 250 months is highest is not strictly correct, because the bin it refers to is a range.

  • @richfi9576
    @richfi9576 5 years ago +1

    I had to look up the typical lifespan of a chicken, which is something I've never had pause to consider before.

  • @JBleher
    @JBleher 5 years ago

    Please have a look again at the "bar chart for frequencies". You shouldn't use it to deduce the density or distribution. It's not a histogram unless you have a class width of one. It's the standardized frequency that you plot in a histogram. Nice video though.
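
The "standardized frequency" this comment refers to can be sketched as count / (n * class width), so that bar areas, not bar heights, add up to one. The data and bin edges below are invented for illustration, and the helper name `histogram_density` is hypothetical.

```python
def histogram_density(values, edges):
    """Density per bin: count / (n * bin width). Bins are [left, right), last bin closed."""
    n = len(values)
    densities = []
    for i, (left, right) in enumerate(zip(edges, edges[1:])):
        is_last = i == len(edges) - 2
        count = sum(
            1 for v in values
            if left <= v < right or (is_last and v == right)
        )
        densities.append(count / (n * (right - left)))
    return densities

# Hypothetical ages with deliberately unequal class widths (2, 2 and 6).
ages = [1, 2, 2, 3, 4, 6, 8, 9]
edges = [0, 2, 4, 10]
dens = histogram_density(ages, edges)

# The total area of the bars is 1, regardless of how the bins are chosen.
area = sum(d * (right - left) for d, left, right in zip(dens, edges, edges[1:]))
print(dens, area)
```

Here the widest bin holds the most chickens (4 of 8) but has a lower density than the middle bin, which is the distinction between a plain frequency bar chart and a histogram.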

  • @josecruz2574
    @josecruz2574 4 years ago

    Dr. Mike given this data and assuming that perhaps this is the only data we had - if one were to make an operational change to their chicken business, would it not be logical to alter the diet of the older chickens in order to test the diet hypothesis? (because we can not make the older chickens any younger)

  • @wijzijnwij
    @wijzijnwij 5 years ago +1

    Are the R scripts and csv files that are used available for download anywhere?

  • @parakhmody1413
    @parakhmody1413 2 years ago

    4:01
    The value of the USD plummeted, not of the JPY...

  • @ADEpoch
    @ADEpoch 5 years ago

    This is an amazing set of talks on data. But, perhaps the most amazing part is the old dot-matrix paper you found!!!! Do they still make that????

  • @GoatzAreEpic
    @GoatzAreEpic 5 years ago +3

    Mike saying T H I C C in the first 10 seconds = insta like

  • @AndersTherkelsen
    @AndersTherkelsen 5 years ago +3

    I see Mike is an "=" man, the heathen. GREAT video, nonetheless!

  • @tHeplAyiER
    @tHeplAyiER 4 years ago

    Very interesting! Thanks :)

  • @mohamedhabas7391
    @mohamedhabas7391 1 year ago

    Bravo 🙌 🎉 👏 :)

  • @andthefunkybunch1466
    @andthefunkybunch1466 2 years ago

    This is a killer jumper, where can I buy it

  • @simolahlou4793
    @simolahlou4793 4 years ago

    In the last two graphs comparing diets B and C, the chickens on diet B appeared to be the youngest, unlike the last box plot, which shows that chickens on diet C are the youngest.
    Did anyone else notice the same thing, or is it just me?

  • @ReCaptchaHeinz
    @ReCaptchaHeinz 5 years ago

    Here in Spain there are thousands of cases of 1:50 example :_(

  • @MrKZee
    @MrKZee 1 year ago

    Can you please share csv files?

  • @soraaoixxthebluesky
    @soraaoixxthebluesky 5 years ago

    6:55 reminds me of Japanese candlestick charts for reading prices.

    • @mokopa
      @mokopa 5 years ago

      That's why it's sometimes also called a "candlestick graph"

  • @shouldb.studying4670
    @shouldb.studying4670 5 years ago +1

    This video hit all the points ... R^2=1 🤣

  • @osys7832
    @osys7832 4 years ago

    omg where are the macs :(

  • @joefeely5291
    @joefeely5291 5 years ago +1

    Off topic - interesting chicken ages - the average modern chicken would be extremely lucky to get to 100 weeks old (my inner vegan is activated).

  • @cetilly
    @cetilly 5 years ago

    Frasier!!! 👍🏻👍🏻👍🏻

  • @TheGoluharikesh
    @TheGoluharikesh 3 years ago

    Why does the guy holding the camera have so much trouble holding it straight?

  • @niallmurphy2163
    @niallmurphy2163 2 years ago

    9:35
    Laughs in WSL.

  • @stryyker9
    @stryyker9 4 years ago

    Lindows?

  • @kkloikok
    @kkloikok 4 years ago

  • @kaan-tube
    @kaan-tube 4 years ago

    Yves brought me here.

  • @blackbox4214
    @blackbox4214 5 years ago

    Can the chicken be named Bob loool

  • @hahahatall09
    @hahahatall09 5 years ago

    T A B L E A U

  • @lindhe
    @lindhe 5 years ago

    So Mike needed an excuse to learn R, ey? :D

  • @oldcowbb
    @oldcowbb 5 years ago

    I always thought data analysis was super high-level stuff, but it seems like it's just statistics?

  • @MD-dk6ms
    @MD-dk6ms 5 years ago +1

    Did you say OS „Ex“?? 😳😜

  • @sB3rg
    @sB3rg 5 years ago

    Great videos. Wish they were more concise. Hard to carve out time to watch them in series.

  • @richardisom4783
    @richardisom4783 5 years ago

    I feel like people pushing bitcoin do this all the tim...e XD

  • @sourjyod
    @sourjyod 5 years ago

    Says bar chart, draws column chart. Love the series though!

    • @aksela6912
      @aksela6912 5 years ago +3

      Column chart is just the Excel name for it, no? In most cases they'd be referred to as bar charts or bar plots.

    • @jtbozify
      @jtbozify 5 years ago

      If you like data stuff like this, I've started doing small data visualizations on my channel and I'll soon visualize stuff like sports, gaming, politics, etc! Come check it out if you want to!

  • @Sajmon9114
    @Sajmon9114 5 years ago

    Use data.table, Mike :(

  • @connordavis6487
    @connordavis6487 5 years ago

    First

  • @pajeetsingh
    @pajeetsingh 3 years ago

    You have lots of acne on your face. That's data visualization.