Python Data Science Tutorial: Analyzing the 2019 Stack Overflow Developer Survey

Sdílet
Vložit
  • čas přidán 22. 07. 2024
  • In this video, we will be learning how to use analyze survey data in Python.
    This video is sponsored by Brilliant. Go to brilliant.org/cms to sign up for free. Be one of the first 200 people to sign up with this link and get 20% off your premium subscription.
    In this Python Programming video, we will be learning how to download and analyze real-world data from the 2019 Stack Overflow Developer Survey. This is terrific practice for anyone getting into the data science field. We will learn different ways to analyze this data and also some best practices. Let's get started...
    The code from this video can be found at:
    bit.ly/SO-Survey-2019
    CSV Tutorial - • Python Tutorial: CSV M...
    Jupyter Notebooks Tutorial - • Jupyter Notebook Tutor...
    ✅ Support My Channel Through Patreon:
    / coreyms
    ✅ Become a Channel Member:
    / @coreyms
    ✅ One-Time Contribution Through PayPal:
    goo.gl/649HFY
    ✅ Cryptocurrency Donations:
    Bitcoin Wallet - 3MPH8oY2EAgbLVy7RBMinwcBntggi7qeG3
    Ethereum Wallet - 0x151649418616068fB46C3598083817101d3bCD33
    Litecoin Wallet - MPvEBY5fxGkmPQgocfJbxP6EmTo5UUXMot
    ✅ Corey's Public Amazon Wishlist
    a.co/inIyro1
    ✅ Equipment I Use and Books I Recommend:
    www.amazon.com/shop/coreyschafer
    ▶️ You Can Find Me On:
    My Website - coreyms.com/
    My Second Channel - / coreymschafer
    Facebook - / coreymschafer
    Twitter - / coreymschafer
    Instagram - / coreymschafer
    #Python

Komentáře • 145

  • @coreyms
    @coreyms  Před 4 lety +55

    I hope everyone finds this useful. Going out and practicing with real-world data like this is one of the best (if not the best) way to practice your skills. If any of you do your own analysis that's interesting then be sure to share it with us in the comment section below!
    PS I forgot to mention that as usual, the code for this video can be found in the description section.

    • @shpazhist
      @shpazhist Před 4 lety +4

      Corey, can you please do a video where you explain how to get to that level of intermediate-advanced developer where one can play with code really-really easily? In essence, say if one wants to learn about new library or framework or a new challenge - - can you please explain your process how do you go about actually learning that field rather than doing videos on standalone topics as IMO that would really help a lot of people to sorta move to the next level a lot. Thx

    • @bodekolton5961
      @bodekolton5961 Před 3 lety

      I know it's quite off topic but do anybody know a good website to stream new tv shows online?

    • @troylorenzo6430
      @troylorenzo6430 Před 3 lety

      @Bode Kolton Lately I have been using Flixzone. Just google for it :)

    • @maddoxronin9643
      @maddoxronin9643 Před 3 lety

      @Troy Lorenzo yea, have been watching on Flixzone for since april myself =)

    • @bodekolton5961
      @bodekolton5961 Před 3 lety

      @Troy Lorenzo thank you, I went there and it seems like a nice service :D I really appreciate it!

  • @mrityunjayahiremath3951
    @mrityunjayahiremath3951 Před 4 lety +70

    Python + Data Science + Corey = ❤️

  • @Ghasakable
    @Ghasakable Před 4 lety +19

    Man, you are awesome, you are the reason that I open CZcams, please if you come to Japan one day, I will do my best to show you good places here in Tokyo.

  • @mancalos
    @mancalos Před 3 lety

    Finally a project not using either the Titanic or Iris datasets. Corey, you are a gem. Thanks for all the effort you put into this and every other tutorial.

  • @AvantjerTube
    @AvantjerTube Před 4 lety +5

    Thanks Corey!
    I’ve been following your excellent tutorials for quite awhile.
    Just wanted to say that I’m excited and looking forward to your Pandas course(s).

  • @anaesthesiadolorosa7100
    @anaesthesiadolorosa7100 Před 4 lety +1

    It's awesome that you've started making data science videos. Please keep them coming! ;)

  • @mylesmontclair
    @mylesmontclair Před 4 lety +1

    Great to have you back producing great content for us. Thanks, Corey!

  • @Sauhros
    @Sauhros Před 4 lety

    Just what I needed to start off my winter break. I can't wait for your pandas series! Keep up the great work Corey.

  • @sanukurien2752
    @sanukurien2752 Před 4 lety +1

    @Corey, I love watching your videos when I have my morning coffee. It's elegant and super informative. Thank you!

  • @dadoll1660
    @dadoll1660 Před 4 lety +1

    Wow, this is Amazing! Please make more videos for this DS Tutorial series. Thank you Corey...

  • @GuilhermePalazzoRodrigues

    That was incredible, really easy to follow and replicate, thank you very much.

  • @jayjay6804
    @jayjay6804 Před 4 lety +1

    As hobbiest my first inclination would be to extract the data using pivot tables in libreoffice. Thank you for demonstrating the use of python for this purpose.

  • @Vasko10100
    @Vasko10100 Před 4 lety +3

    Finally, i understand analyzing Data in Python! I went from Coursera via EDX, UDEMY etc. But this.... this is a pure gold sir! Please, do some course or learning path for Data Science with Machine Learning in Python in depth( let it be if needed a paid learning path). You are one of the best! If you somehow manage to come in Macedonia, tell me, i want to bow before you!

    • @coreyms
      @coreyms  Před 4 lety +3

      Thanks! Im still working on a Pandas series. 4 videos are complete but I'm working to finish the rest soon. Glad to have you as a subscriber and member. You all make it possible for me to release these for free. Thanks again!
      media.giphy.com/media/WvUyMX1gB1gnC/giphy.gif

  • @muralitechtimes6487
    @muralitechtimes6487 Před rokem

    You are true MASTER of coding. We are now a days not finding your other videos. Do it for us which re very help us to grow.

  • @philinnc
    @philinnc Před 4 lety

    Looking forward to seeing a future video where pandas is used to analyze this same data. Really appreciated your stepping through the thought process, something I haven't seen in many other data science tutorials. Approaches to data cleaning is another interesting topic that would be nice to see covered in a future video. Recently there were these FEC reports... :-)

  • @HunniiChii
    @HunniiChii Před 4 lety +1

    Hope to see more of these videos from you in the future!

  • @user-gx9hk8gt3k
    @user-gx9hk8gt3k Před 4 lety +1

    Thank you for Python Data Science Tutorial! Keep it up, please!

  • @kizzavincent
    @kizzavincent Před 4 lety +1

    Very nice to have you back. Many thanks.

  • @afternoontiger9319
    @afternoontiger9319 Před 4 lety +14

    I'm definitely interested in a future tutorial of using pandas with this data! Otherwise a fantastic job

  • @BoukariYameogo
    @BoukariYameogo Před 4 lety +2

    This is what I was waiting for! Thank you!

  • @_boris
    @_boris Před 4 lety +1

    Your videos are so brilliant as I want to keep them to myself only... 💎
    ...however, way too good not to share them with others. Shared with my classmates - hopefully, they find it useful :)

  • @yosefco3
    @yosefco3 Před 4 lety +2

    Hi great Corey ! Thank you very much on your channel , your flask tutorial is the best i ever seen ! Please could you do a flask-api tutorial too ?

  • @DistortedV12
    @DistortedV12 Před 4 lety +1

    Wow Corey you do it all; thanks for the helpful tutorials!

  • @Lfppfs
    @Lfppfs Před 3 lety

    Awesome tutorial, thanks Corey!

  • @Strider1066
    @Strider1066 Před 3 lety

    I liked the incremental approach of coding by examples .

  • @KevinOMalleyisonlysmallreally

    You produce some of the best tutorials on python. Itd really help if you did one on advancing jupyter notebook use, and seeing up sublime to work with pandas.

  • @rangavembar
    @rangavembar Před 4 lety +1

    Brilliant as always!! Looking forward to the Pandas series!!

  • @michaelworkspace
    @michaelworkspace Před 4 lety +25

    In case anybody ran into an error like "UnicodeDecodeError: 'charmap' codec can't decode byte 0x81" while reading the csv file like I had, the fix is to add a encoding='utf-8' parameter to the with open() function. e.g. with open('survey_results_public.csv', encoding='utf-8') as f:

    • @Mayur7Garg
      @Mayur7Garg Před 4 lety +4

      Thx man! You are a savior.

    • @shivstn98
      @shivstn98 Před 4 lety +1

      It can be also fixed by using raw string I.e. use r before first quote
      Eg. r"hello world"

    • @jemand4036
      @jemand4036 Před 4 lety +1

      thank u so much, i had no idea what was going on!

    • @irfandy4hmad
      @irfandy4hmad Před 4 lety +1

      Thanks bro. U saved our time

    • @michaelworkspace
      @michaelworkspace Před 4 lety

      Irfandy Ahmad No problem bro. Have a good one!

  • @yomajo
    @yomajo Před 4 lety +1

    It seems like Counter() could be a topic of its own. I need to dig more into that. Awesome code-along experience!

  • @akhilrapalli4118
    @akhilrapalli4118 Před 4 lety +1

    wow what a video from an expert on python like u verymuch corey

  • @amnah.-.4708
    @amnah.-.4708 Před 4 lety

    Hi Corey, thank you for the content you put out, it is beyond exceptional. Also, it would be great if you could make an in-depth course on SpaCy, and its application in Data Science.

  • @ajb4964
    @ajb4964 Před 4 lety +7

    Wow, please do more like this.

  • @arnopisspot5115
    @arnopisspot5115 Před 4 lety

    Looking forward the panda series boss. Thank you for all your hard work

  • @ramil17998
    @ramil17998 Před 4 lety +1

    Thanks a lot for the video! I love it cause its as real as it can get!

  • @MAli-os2yj
    @MAli-os2yj Před 4 lety +3

    Hi Corey! Love your videos man! I have finished your Flask series and currently doing the Django series. I have a few suggestions for future videos around the Blog Website that involve some ML and Data Science:
    1. Introduce 'like and comment' functionality to the application which allows a user to comment and like posts by other users.
    2. Then, integrate a recommendation system engine to the Blog application, the system will recommend content to the individual user based on his own posts and the posts that he has liked.
    3. Also, a user can look into the profile of other users and see a word cloud of all posts that the other has submitted, this way they can get a rough idea of what the other uses posts.
    Keep Doing the good work man!

  • @nikhillingam4630
    @nikhillingam4630 Před 4 lety +1

    absolutely brilliant work well done

  • @SagarSapkota-eo4ei
    @SagarSapkota-eo4ei Před 4 lety +1

    Hi Corey great and amazing content ..thanks a lot buddy

  • @borgestheborg
    @borgestheborg Před 4 lety +2

    Oh baby! Never clicked on a video so fast in my life! :D

  • @bishantadhikari3440
    @bishantadhikari3440 Před 4 lety

    Your videos are great and is probably the most easily understandable in all of youtube.
    Also, can you make videos about implementing AI in python cause there are not any good ones that meet your standard

  • @jackleone4150
    @jackleone4150 Před 4 lety +2

    If anyone else is facing the UnicodeDecodeError try adding this second argument to the open function =>
    with open('data/survey_results_public.csv', encoding="utf8")

    • @sylvanmasha
      @sylvanmasha Před 4 lety

      wow, thank you so so much. i was facing the same problem

  • @gtosXD
    @gtosXD Před 4 lety +1

    Thank you!!!! Really great video!

  • @gaunterodimm3569
    @gaunterodimm3569 Před 4 lety +1

    Great video! This video helped me soooooooo much!

  • @rosyndapee8129
    @rosyndapee8129 Před 4 lety

    Nice! What about a future video with some visualization of geospatial data? I.e. using some GPS tracks converted into CSV file with all points lat and long. Would you suggest to use postrgres or postgis to then make queries to the geodata?

  • @alexanderten5497
    @alexanderten5497 Před 4 lety +1

    Notification from Corey and Iam here to like the video

  • @rangabharath4253
    @rangabharath4253 Před 4 lety +1

    Awesome as always

  • @syedabdulsalam4659
    @syedabdulsalam4659 Před 4 lety +2

    Finally you are back :p

  • @someonesomebody716
    @someonesomebody716 Před 4 lety +1

    Data Science + Corey Sir = YAY!!!

  • @Jo-id9zm
    @Jo-id9zm Před 4 lety +6

    You're doing God's work, Corey!

  • @Medhusalem
    @Medhusalem Před 4 lety

    Hey Corey, I have seen the previous videos on DataFrames with Pandas where you used Jupyter as well. You mentioned in this video that it is nice that you can run code brackets or just certain lines of code.
    My question would be then, what is the difference between Jupyter and Spyder? Since Spyder can also execute certain code lines, without loading everything in again and has a super awesome functionality where you have a variable manager in which you can view every variable you declared.

  • @hserdarkuyuk
    @hserdarkuyuk Před 4 lety +1

    Please do similar live coding, these are great to improve our perspective

  • @javierrivasseau8956
    @javierrivasseau8956 Před rokem

    Hi Corey, how are you? i have been watching your series and videos, and think they are the very best out there, i really like to learn with you, I would like to ask you some things if you could solve them, when you make the matplotlib series, you separate the convertedcomp for all the devs and for the python dev and javascript dev, can you upload a video to solve it? I tried anyway and could not. ?

  • @sandeepgaur2407
    @sandeepgaur2407 Před 4 lety

    Awesome tutorial like always. Please make video on asyncio in python.. please please.

  • @theDrewDag
    @theDrewDag Před 4 lety

    Please make this the first of a Data Science playlist. PLEASE.

  • @aydinjalilov2328
    @aydinjalilov2328 Před 4 lety

    it blew my mind.

  • @736939
    @736939 Před 4 lety

    Thank you Corey! Continue please: ScikitLearn, TensorFlow

  • @BrendanMetcalfe
    @BrendanMetcalfe Před 4 lety +1

    nice vid! thanks

  • @opst1704
    @opst1704 Před 4 lety

    If one day I decide to learn Python, then do I waste my time by learning C language now?
    I am learning C, primarily because I want to understand how memory works and DSA is easier to understand with C. Will this knowledge be helpful if I decide to learn Python? Seems like C and Python have a lot of differences, in Python there are so many things that I don't need to worry about.
    One more question, is it difficult to find out what is happening behind scenes in Python, for example: implementation of input() function?
    Great videos, the main reason why I am considering to learn Python.

  • @nitishchhabra_me
    @nitishchhabra_me Před 4 lety

    Hey @corey please make videos for Airflow DAG's also. They are basically python codes so it would fit nicely within your python series

  • @daltonmarhufu9549
    @daltonmarhufu9549 Před 4 lety

    Great vid! Really helpful, could you possibly do a Pandas version of this?

  • @skandrskon7091
    @skandrskon7091 Před 4 lety

    for line in csv_reader :
    counts[line['hobbyist'] ] += 1
    really great idea for increment on counts dict
    thanks

  • @darrenlefcoe
    @darrenlefcoe Před 4 lety

    On inspection of the data. The csv.DictReader(f) throws 594 errors in the for loop >> line in csvReader:
    Not sure what causes the error, but this exists.
    Needs to be handled with
    try:
    except UnicodeDecodeError:
    who else sees this ?

  • @DistortedV12
    @DistortedV12 Před 4 lety +2

    From this, looks like you can't go wrong with javascript, python, SQL and a C type language. Wonder how long these will dominate.

  • @sprk2009
    @sprk2009 Před 4 lety

    Your videos are awesome. I am new to Python and preparing for Python certification. Can you please let me know which one of the following programs are good to proceed? Thanks in advance.
    1. PCAP: Programming Essentials in Python from Netacad
    2. Microsoft 98-381 Introduction to Programming Using Python

  • @yuliu1105
    @yuliu1105 Před 4 lety

    Please update pandas version of the 2nd half of the problem, my forloops don't work, bummer. Thank you

  • @rolandgerard6064
    @rolandgerard6064 Před 4 lety +1

    man, you are good 👍

  • @aryanvikash1967
    @aryanvikash1967 Před 4 lety

    Corey please create a tutorial on python coroutine.. async await its so confusing .. please

  • @anandinamdar4054
    @anandinamdar4054 Před 4 lety +1

    Legend 🙌🙌

  • @rahulsharma-st5yg
    @rahulsharma-st5yg Před 4 lety +1

    @corey: when to expect some data science real life projects?

  • @md.akib5124
    @md.akib5124 Před 4 lety +1

    heeiiii corey. awesome to see another amazing tut from you

  • @user-bs3jr5ix5r
    @user-bs3jr5ix5r Před 4 lety

    Hello, Corey! Please append your matplotlib tutorial with a lesson about Legend Picking

  • @kingslypaul2999
    @kingslypaul2999 Před 4 lety

    Can u make a video on pyttxs3 ...or something better .....thanks for all your God skilled contents that makes ppl understand anything u teach

  • @ashissahoo4158
    @ashissahoo4158 Před 3 lety

    Is the pandas version of the last question has been uploaded?

  • @christiaan3315
    @christiaan3315 Před 3 lety

    Sorry. I am getting error messages... 'for line in csv.reader: row = next(self.reader) etc

  • @pingojr4048
    @pingojr4048 Před 4 lety

    Can you do something for home bill management?

  • @darrenlefcoe
    @darrenlefcoe Před 4 lety

    for line in csvReader: failed on line 16.
    UnicodeDecodeError: 'charmap' codec can't decode byte 0x81 in position 5739: character maps to
    problem with the data-set or other reason ?

    • @sylvanmasha
      @sylvanmasha Před 4 lety +1

      f anyone else is facing the UnicodeDecodeError try adding this second argument to the open function =>
      with open('data/survey_results_public.csv', encoding="utf8")
      this is a comment i found by Jack Leone. you're welcome

  • @Medhusalem
    @Medhusalem Před 4 lety +1

    It didn't work somehow and I had to do a google search on my error: 'charmap' codec can't decode byte 0x81 in position 5739.
    So I had to change the with open part and specify the encoding part, after that everything worked fine.
    *with open('data/survey_results_public.csv', encoding='utf-8') as f:*

  • @shivamkumraa
    @shivamkumraa Před 4 lety +8

    Who else liked before watching this. Just 1000% sure this guy can't make any shit.

    • @Jmzz542
      @Jmzz542 Před 4 lety +1

      what makes you say that?

  • @amankothiyal7074
    @amankothiyal7074 Před 3 lety

    Hey can you analyse it using numpy library

  • @morganfree100
    @morganfree100 Před 4 lety +1

    More please sir..(:-

  • @barungh
    @barungh Před 4 lety +1

    Badass !👍👌

  • @nowyouknow2249
    @nowyouknow2249 Před 4 lety +1

    Nailed it

  • @chinmayabarik557
    @chinmayabarik557 Před 4 lety

    Please do a pandas tutorial with this survey results sir..

  • @marinbeslo7841
    @marinbeslo7841 Před 4 lety

    Is the video with pandas-way out? :)

  • @arkhamian7586
    @arkhamian7586 Před 4 lety +1

    Don't stop doing this.

  • @mitos95176
    @mitos95176 Před 4 lety +1

    Now I want to perform this task using pandas.

  • @iangitonga2811
    @iangitonga2811 Před 4 lety +2

    Great content.

  • @samishabli5191
    @samishabli5191 Před 4 lety

    i got this error: Yes
    Traceback (most recent call last):
    and it just shows me the first yes and no
    and in the end it says: return codecs.charmap_decode(input,self.errors,decoding_table)[0]
    UnicodeDecodeError: 'charmap' codec can't decode byte 0x81 in position 5739: character maps to

    • @atishayshukla1117
      @atishayshukla1117 Před 4 lety +2

      Its an encoding issue with the file, open a file by specifying proper encoding something like: with open('data/survey_results_public.csv', encoding='utf8')

    • @ThePasupathi
      @ThePasupathi Před 4 lety

      @@atishayshukla1117 thank you..it works

  • @ebrahimm.roshdy4487
    @ebrahimm.roshdy4487 Před 4 lety

    Please do it using pandas since it's way easier and so common for data scientists

  • @johnnya590
    @johnnya590 Před 4 lety +1

    I don't know if you realize this, but at the start of EVERY video you say "Hey there how's it going everybody?" :)

  • @strange_man
    @strange_man Před 4 lety +1

    The video that I need :D

  • @graybits4111
    @graybits4111 Před 4 lety

    thanks for this great video
    the total % for each dev type should be 100%, which is not the case

    • @coreyms
      @coreyms  Před 4 lety

      Not necessarily. They can choose multiple languages. So for example, for front-end developers, most said they know JavaScript, but they also knew other languages as well. So each language is a percentage of how many developers said they know a particular language. Hope that clears it up

    • @graybits4111
      @graybits4111 Před 4 lety

      @@coreyms it is clear now, thanks

  • @Ajithkumarsekar96
    @Ajithkumarsekar96 Před 4 lety

    Almost everyone has used Javascript and html/css? From front-end developer to system-administrator

  • @natesh1
    @natesh1 Před 4 lety

    Please do a pandas tutorial series with this survey results too.

    • @bluekeybo
      @bluekeybo Před 4 lety

      You didn't watch the video till the end did you

  • @gnuter3000
    @gnuter3000 Před 4 lety +1

    greatgreatgreat

  • @boolloop
    @boolloop Před 4 lety

    I love your videos, but I wondered why you didn't use pandas for this?

    • @coreyms
      @coreyms  Před 4 lety +3

      I mentioned the reason in the video. I wanted to show how to do this without Pandas first, and once I release my Pandas series then I will redo this video using Pandas as well. That way people who are unfamiliar with Pandas can do this as well

    • @boolloop
      @boolloop Před 4 lety

      @@coreyms neat, looking forward to it!

  • @darrenlefcoe
    @darrenlefcoe Před 4 lety

    30 minutes:
    requires the following to pass bad data...
    try:
    # put the code here
    except AttributeError:
    pass

  • @adildeshmukh4140
    @adildeshmukh4140 Před 4 lety +1

    yes, please pandas

  • @hemanth6951
    @hemanth6951 Před 4 lety +1

    i saw you in another CZcams channels

    • @coreyms
      @coreyms  Před 4 lety +1

      Which one was that?

    • @hemanth6951
      @hemanth6951 Před 4 lety +1

      @@coreyms some thing pyconf channel with some one interviewing with you last year

    • @coreyms
      @coreyms  Před 4 lety +1

      @@hemanth6951 Oh yeah! I think that was Dan Bader. He's a great guy.

    • @hemanth6951
      @hemanth6951 Před 4 lety +2

      @@coreyms yes yes yes finally I saw you and this second time I'm chatting with you I'm so happy and luky 😊😀😀

  • @pavanmandikal
    @pavanmandikal Před 4 lety +1

    Please do make a video on numpy also

  • @shaguftakhan302
    @shaguftakhan302 Před 3 lety

    I clicked on this video just to see what data scientists do.