Twitter Data Pipeline using Airflow for Beginners | Data Engineering Project

Sdílet
Vložit
  • čas přidán 29. 05. 2024
  • Join Course Python for Data Engineering - bit.ly/4140KI9
    This is End-To-End Data Engineering Project using Airflow and Python. In this project, we will extract data using Twitter API, use python to transform data, deploy the code on Airflow/EC2 and save the final result on Amazon S3
    Code and commands used in the project - github.com/darshilparmar/twit...
    Other resources:
    Twitter API - / twitter-api
    Python (Installation & Basics) - • Python Tutorial - Pyth...
    Setup AWS Account - • AWS Account Setup - St...
    Airflow Doc - airflow.apache.org/docs/
    Join Data With Darshil Discord Server: / discord
    Timestamps
    0:00 Project introduction
    0:56 Section 1 (Prerequisite)
    2:21 Section 2 (Understand basic concept)
    7:55 Section 3 (Project execution)
    👦🏻 My Linkedin - / darshil-parmar
    📷 Instagram - / darshilparmarr
    🎯Twitter - / parmardarshil07
    🌟 Please leave a LIKE ❤️ and SUBSCRIBE for more AMAZING content! 🌟
    3 Books You Should Read
    📈Principles: Life and Work: amzn.to/3HQJDyP
    👀Deep Work: amzn.to/3IParkk
    💼Rework: amzn.to/3HW981O
    Tech I use every day
    💻MacBook Pro M1: amzn.to/3CiFVwC
    📺LG 22 Inch Monitor: amzn.to/3zk0Dts
    🎥Sony ZV1: amzn.to/3hRpSMJ
    🎙Maono AU-A04: amzn.to/3Bnu53n
    ⽴Tripod Stand: amzn.to/3tA7hu7
    🔅Osaka Ring Light and Stand: amzn.to/3MtLAEG
    🎧Sony WH-1000XM4 Headphone: amzn.to/3sM4sXS
    🖱Zebronics Zeb-War Keyboard and Mouse: amzn.to/3zeF1yq
    💺CELLBELL C104 Office Chair: amzn.to/3IRpiL2
    👉Data Engineering Complete Roadmap: • Data Engineer Complete...
    👉Data Engineering Project Series: • Data Engineering Proje...
    👉Become Full-Time Freelancer: • Best Freelancer Series...
    👉Data With Darshil Podcast: • Podcast Series - Data ...
    ✨ Tags ✨
    data engineering project, complete data engineering project for beginners, learn data engineering, data engineering roadmap
    ✨ Hashtags ✨
    #airflow #dataengineering #project

Komentáře • 490

  • @DarshilParmar
    @DarshilParmar  Před rokem +91

    EDIT:
    Elon Musk ruined the API part, you can use some other APIs
    Here's one of the subscriber built the same project using other APIs - chenmeiqiao.notion.site/How-I-Reach-Out-to-CZcamsrs-I-Like-As-A-Data-Engineer-c37bdddefde54c3789229ffa5a789432
    or you can use a static dataset from Kaggle and then use Airflow to process it
    www.kaggle.com/datasets/mmmarchetti/tweets-dataset
    FAQ:
    1. Twitter removed free access: It still has free access but with limits of requests you can make - developer.twitter.com/en/docs/twitter-api/getting-started/about-twitter-api
    2. You need to request for V2 access: You will get an error but just by reading the doc you should be able to access Elevated Access
    Worked very hard for this project🤞
    Don’t forget to hit the like button and if you want to support my work you can join channel membership, it's only 59rs per month so that I can keep these content for free

    • @mohammadtanvirhossain5576
      @mohammadtanvirhossain5576 Před rokem +1

      already did !!! appriciate your efforts brother !!

    • @akshunjain968
      @akshunjain968 Před rokem

      Hey! Are you using Elevated access level because Essential doesn't work with the code you are using

    • @nishantagarwal8016
      @nishantagarwal8016 Před rokem

      Amazing Darshil!! I was able to replicate the same in GCP

    • @DarshilParmar
      @DarshilParmar  Před rokem

      @@nishantagarwal8016 Awesome!
      Create linkedin post and tag me

    • @DarshilParmar
      @DarshilParmar  Před rokem

      @@Soulfulreader786 There is a free version also

  • @sumantaghosh9299
    @sumantaghosh9299 Před rokem +154

    this is what is called content and actual do something for community ..rather than just doing faltu things...thank u so much Darshil ❤️

  • @venkatah9847
    @venkatah9847 Před rokem +14

    My sincere thanks to your efforts for contributing these kind of real time scenario based use case examples. 🙏

  • @faramarzzrn
    @faramarzzrn Před rokem +1

    one of the best tutorials I've ever seen on CZcams, a real-world example that was really interesting

  • @ANKITKUMAR-nv8ur
    @ANKITKUMAR-nv8ur Před rokem +26

    In data engineering community you are the GEM. Thanks from the bottom of my heart Darshil. Keep growing keep inspiring.

  • @owennowakowski7558
    @owennowakowski7558 Před rokem +7

    This was a very helpful video for me. I have spent the last year learning web development, and I am expanding my skills to include ETL. This video gave me a great overview of the process.

  • @AshisRaj
    @AshisRaj Před 4 měsíci +2

    Mr. Darshil you have saved a lot of time for us by this video. This video has more content than others talkative videos of other youtubers. Thanku so much

  • @ThankGod143
    @ThankGod143 Před rokem

    I came here seeing your LinkedIn post with comments,Being a fresher I was struggling for a project. This is Absolutely what I was looking for. Just Awsome. Thank you so much for your time and efforts.

  • @raimanisha5197
    @raimanisha5197 Před rokem +1

    Hi Darshil, Thank you for this .It was a great learning experience and it was fun too ! 😛.I am eagerly waiting for more such videos on airflow

  • @eternal2187
    @eternal2187 Před rokem +1

    You always continue to be great asset in data engineering content, thanks again!

  • @chandrakantsingh9568
    @chandrakantsingh9568 Před 8 měsíci

    Thanks for creating this project, with the help of this i schedule my data extraction task ,currently i am using solcast API to fetch weather data .

  • @TehWaver
    @TehWaver Před 4 měsíci

    Hey Darshil, thank you so much for this. Of course I got errors but as you said in the beginning, those are part of the journey. Definitely going to follow other tutorials from you as well regarding Airflow and AWS!

  • @keyursolanki
    @keyursolanki Před rokem +1

    Love the way you have explained things. Loads of success to you😇

  • @manuprakash1989
    @manuprakash1989 Před rokem

    Great! Got broader understanding of api, airflow and aws. Thank you!

  • @mirirshadali33
    @mirirshadali33 Před rokem +4

    As usual very nice content and appreciate your effort. If you could make a video with OOPs approach and include some test cases that will be really great.

  • @kanduladivya5804
    @kanduladivya5804 Před rokem +1

    This video is worth more than paid content on many websites.Thanks for such content.

  • @harshalrana1336
    @harshalrana1336 Před rokem +1

    This is really Amazing Darshil. I would also like to see Architecture level videos and how all tools all integrated into the cluster.

  • @abdulrazakshaik1128
    @abdulrazakshaik1128 Před rokem +4

    Really Appreciate your efforts to the Data Community you are doing a great job. Thank you ❤️

  • @kanui3618
    @kanui3618 Před rokem

    this is awsome !
    can't wait for the next data engineering projects, darshil🔥
    greetings from indonesia

  • @karunakaranr2473
    @karunakaranr2473 Před rokem

    Thank you for your time and effort to make these videos. Really helpful.

  • @gopi10aug
    @gopi10aug Před rokem

    I really appreciate the effort and simplicity..! I was looking for actual content like this ... Thank you Darshil 😇

  • @mrcrblr850
    @mrcrblr850 Před 8 měsíci

    Amazing Video Darshil, i really appreciate your video, you are a great professor!! Thank you very much keep doing this great content!!

  • @kundan8618
    @kundan8618 Před 15 hodinami

    This video is very helpful. Keep up the good work Darshil.

  • @nomadicengineer22
    @nomadicengineer22 Před rokem +5

    Please keep on bringing content like this it will be really helpful for freshers who want to understand, gain knowledge and shape their career in Data engineering🙏.

  • @Cosmos1987
    @Cosmos1987 Před rokem

    Now with the arrival of ChatGPT, even though one can generate CODE without analyzing all these, it is always better to understand the logic, nuance and intricacies of CODING. This vid helps a lot in deconstructing this step-by-step.👌

  • @aritra1414
    @aritra1414 Před rokem +8

    Please take out time and create deeper conceptual projects like this. If possible please provide real time scenario based projects. We will be really interested to get further depth. This one is awesome!!

  • @namvu607
    @namvu607 Před 10 měsíci

    Super practical, valuable tutorial. Thank you for sharing, sir.

  • @shaikusman536
    @shaikusman536 Před rokem

    Awsome brother...Loved the way to teach...Hoping for detailed projects.....thanks a ton brother.....

  • @anshujaiswal5622
    @anshujaiswal5622 Před měsícem

    Really like your approach towards Learning Data Engineering concepts by doing project stuff. A good short smart project to learn basic working of a Data-pipeline using Python, AWS services and Beginner usage of Airflow. Thanks Darshil :)

  • @lylabata6813
    @lylabata6813 Před rokem

    I have been following u for a long time now I don't even look at others videos......u r just great ....keep up the great work

  • @Lucaslopes-yf7ey
    @Lucaslopes-yf7ey Před rokem

    Very good Darshil, keep the good work, cheers from Brasil.

  • @kirankv1956
    @kirankv1956 Před rokem

    Really love the content darshil.
    Very precise explanation.

  • @tassamoo
    @tassamoo Před rokem +2

    Thanks sir, I learn a lot following this project guide. I understand more about how to use tweepy, AWS services and airflow by following this tutorial. I hope you always have a nice day.

    • @DarshilParmar
      @DarshilParmar  Před rokem

      You are welcome, check out more projects available on channel

  • @aaravkumarsingh4018
    @aaravkumarsingh4018 Před rokem +3

    Thanku darshil for making this project, please make this type of project on regular basis so that it will help us to enhance our skill

  • @hritikapal683
    @hritikapal683 Před rokem +2

    Hey thanks man I've been eagerly waiting for this. Would now explore and learn!

  • @gabrielfds3461
    @gabrielfds3461 Před rokem

    Darshil, amazing! I do not have words to say thank yoU!

  • @avshekraj
    @avshekraj Před 10 měsíci

    Amazing...got to know something intresting. Thanks for the detail explanation.

  • @iamdare
    @iamdare Před rokem

    Good job mate. I thoroughly enjoy the video. Thank you.

  • @jitendraparihar56
    @jitendraparihar56 Před rokem

    Eagerly waiting for this project. Thank you darshil for such amazing projects...

  • @gomes8335
    @gomes8335 Před rokem +2

    Excellent job Darshil.

  • @PrincessNaveena
    @PrincessNaveena Před rokem +3

    Wowww., Wonderful explanation.., never before ever after...
    True master for Big Data ,🙂Darshil

  • @partabparmar5537
    @partabparmar5537 Před rokem

    THANK YOU SO MUCH FOR SUCHA QUALITY CONTENT ......GOD BLESS YOU

  • @aaosoto
    @aaosoto Před rokem

    Keep up the good work! Your project is the best. Greetings from Chile 😁

  • @shahriarentertainer4749

    This is so impessive. Keep up your good work. I wish for a true success in your life

  • @bilalhassankhan221
    @bilalhassankhan221 Před 8 měsíci

    This is what exactly whay I'm looking for! Thanks brother.

  • @abhijeetpradhan143
    @abhijeetpradhan143 Před rokem

    Best content ever..👍
    So nice of you..
    Really want more such Projects..

  • @gimmestonks5333
    @gimmestonks5333 Před rokem

    Thanks for teaching basics of air flow and Dags..

  • @AmitSharma-mv5xe
    @AmitSharma-mv5xe Před rokem

    precise and clear content. Amazing

  • @sujeetnair1748
    @sujeetnair1748 Před rokem

    i m elder than u.. but u r an inspiration bro... i m new to data engineering..

  • @pearlr.2411
    @pearlr.2411 Před rokem

    Darshillllllll this is Gold. I literally love you rn 😭

  • @amirazad1762
    @amirazad1762 Před 4 měsíci

    Great and useful video. Thanks Darshil 🙏

  • @yhsreddy
    @yhsreddy Před 8 měsíci

    Awesome demonstration, truly appreciated l, waiting to see lot more soon

  • @dele_data
    @dele_data Před rokem

    I find it worthy to join a youtube channel membership for the first time. I hope with smart and harwork from myself and your help, I am able to land a Data Engineering job soon.

    • @rohitpandey9920
      @rohitpandey9920 Před 11 měsíci

      Bro I am unable to get access to Twitter API v2. Could you please help me out with that part?

  • @africarising360
    @africarising360 Před rokem

    Your video was really helpful. Thank you very much

  • @ashishveer4591
    @ashishveer4591 Před 11 měsíci

    Perfect Explanation .. keep it up .. thank you 🙌

  • @tejasborkar5917
    @tejasborkar5917 Před rokem

    Great darshil this is really helpful video
    Thank you.. keep uploading content like this please

  • @jashanbansal2613
    @jashanbansal2613 Před rokem

    Great Video Darshil. Keep up the Good Work :)

  • @abegpatel5300
    @abegpatel5300 Před rokem +1

    Thank you👍 for the awesome content We want more data engineering real world end to end project with real time streaming to deal with real world problem and situation.

  • @turquoisetravels
    @turquoisetravels Před rokem +1

    Great videos, Darshil! Also a side note, I often watch CZcams videos at 1.5. Yours feel faster even at a normal pace. :)

    • @DarshilParmar
      @DarshilParmar  Před rokem +1

      My natural talking pace is faster, many people complained about it but I can’t help it

  • @realsrvbhtngr
    @realsrvbhtngr Před rokem

    great content
    Thanks Darshil bhai, very cool

  • @AjaySharma-jv6qn
    @AjaySharma-jv6qn Před rokem

    thanks for sharing info - A step by step guide

  • @reypaulobae4895
    @reypaulobae4895 Před rokem

    Thank you so much! I encountered some issues along the way, but was able to resolve them!

    • @rohitpandey9920
      @rohitpandey9920 Před 11 měsíci

      Bro I am unable to get access to Twitter API v2. Could you please help me out with that part?

    • @reypaulobae4895
      @reypaulobae4895 Před 11 měsíci

      @@rohitpandey9920 Yeah, just delete the default and create a new one. Worked after

  • @siddhantkhanna9053
    @siddhantkhanna9053 Před rokem

    clear and concise content!! Great!

  • @Duniya_Daari
    @Duniya_Daari Před rokem

    I watch u everyday ✨ and working on my skills to became a Data engineer 🙂

  • @apoorvpandkar
    @apoorvpandkar Před rokem

    Real content is finally here ❤ Loved the tutorial.

    • @rohitpandey9920
      @rohitpandey9920 Před 11 měsíci

      Bro I am unable to get access to Twitter API v2. Could you please help me out with that part?

  • @KK-rh6cd
    @KK-rh6cd Před rokem

    Very informative and practical video. Thank you for making a video 😊. Waiting for advanced level video on airflow.

  • @ini_affiah
    @ini_affiah Před 11 měsíci

    Thank you for the video. It was helpful.

  • @mohsinrajpoot3780
    @mohsinrajpoot3780 Před 8 měsíci

    great work you helping lot of people.

  • @mehmetkaya4330
    @mehmetkaya4330 Před rokem

    Thank you for another great tutorial!!

  • @santoshchavan1929
    @santoshchavan1929 Před rokem

    Awesome sir...... That's what I actually want...

  • @marchanselthomas
    @marchanselthomas Před rokem

    enjoyed the process! took me more than 12 hours to do this lol with all the trouble shooting and error fixing!

    • @avanishyadav3705
      @avanishyadav3705 Před rokem

      can u suggest me how error on " from airflow import dag" error was solved on vscode ??????

  • @_strollingaroundtheworld_

    Appreciate the content. Cleared the basics by watching it.

  • @akashsoni7925
    @akashsoni7925 Před 5 měsíci

    Those who are stuck in the api part can look for some other apis and perform ETL on it too and make your own project.
    Darshil helped us to understand the concepts therefore its not mandatory to use only Twitter api.
    Thanks a lot Darshil :)

    • @snehalsylasmalladi9320
      @snehalsylasmalladi9320 Před 5 měsíci +2

      What other APIs are a good place to start other than the Twitter/X API used in the video?

    • @akashsoni7925
      @akashsoni7925 Před 5 měsíci

      @@snehalsylasmalladi9320 you can search for rapid api or search for open source apis

  • @cheedellagupta1459
    @cheedellagupta1459 Před 11 měsíci

    Very clean and understandable 🎉

  • @jeeteshkumartiwari7389
    @jeeteshkumartiwari7389 Před 11 měsíci

    Great tutorial, appreciate it, keep up the good work brother. :)

  • @mangeshgawali7683
    @mangeshgawali7683 Před rokem

    Great explanation. Thanks, Darshil! 😊

  • @rajivjani8594
    @rajivjani8594 Před rokem

    Awesome! Thanks for sharing it!

  • @avnishgupta1903
    @avnishgupta1903 Před 11 dny

    Awesome Work !!!

  • @vidyanishok7735
    @vidyanishok7735 Před rokem

    Thanks and keep doing more... Please also make python part 2 ..

  • @khushahmed97
    @khushahmed97 Před rokem +2

    Great explanation! Will try to replicate in GCP

  • @lokeswarnaidu2398
    @lokeswarnaidu2398 Před rokem +3

    Please do a complete project based on streaming data and transform the data using spark and orchestrate it using airflow and then store it cloud or any database.

  • @mayurb6088
    @mayurb6088 Před měsícem

    Great work!

  • @shubhampatil-xx5qq
    @shubhampatil-xx5qq Před rokem

    Thank you soo much darshil sir❤

  • @anandahs6078
    @anandahs6078 Před 7 měsíci

    Wish i found you earlier. i am learning a lot from you 🙂, recommended your channel to all my colleagues that are in data field

  • @channuangadi7504
    @channuangadi7504 Před rokem

    Very crystal clear 🔮 explanation 🎉

    • @rohitpandey9920
      @rohitpandey9920 Před 11 měsíci

      Bro I am unable to get access to Twitter API v2. Could you please help me out with that part?

  • @payday_1150
    @payday_1150 Před rokem +2

    Please keep making more videos like this!

  • @karangupta_DE
    @karangupta_DE Před rokem

    this is perfect, thank you Darshil

  • @kunalberry5776
    @kunalberry5776 Před rokem

    thanks, very helpful video

  • @moeketsekuenane324
    @moeketsekuenane324 Před 7 měsíci

    Darshil is just amazing 😊 . Good content!
    Ah Whatever is happening to the twitter API doesn’t really matter 😂

  • @shubairabbas5480
    @shubairabbas5480 Před rokem +7

    Hi darshil you are doing a great job,have to request you to create some projects around Databricks and synapse in Azure. We all are waiting with desperation. Likewise AWS,Azure is also hitting popularity and companies ask more for Azure Data Engineers. I know you are very busy creating other playlists. But it's a humble request to create only 2-3 Data pipeline projects in Azure. Those who wants #Azure#Data Engineering. Please hit like so it came at top and Darshil got to know.
    Super excited for that series🌟⭐

    • @prabhatgupta6415
      @prabhatgupta6415 Před rokem +1

      Yes I see azure in demand.

    • @shubairabbas5480
      @shubairabbas5480 Před rokem +2

      @@prabhatgupta6415 Yes prabhat in early level of carrer from 1-5 Yrs of experience, Azure is famous.

  • @user-yd6bq3kr3g
    @user-yd6bq3kr3g Před rokem

    Mentor of many data engineer ❤

  • @rolinejohnaguilar5272

    cool man, just what I needed

  • @user-zh9lh4hw1f
    @user-zh9lh4hw1f Před 7 měsíci

    excellent, thanks for the content

  • @maheshbhatm9998
    @maheshbhatm9998 Před 11 měsíci

    great video. Thank you

  • @ogissgi7441
    @ogissgi7441 Před rokem +2

    Thank you! thank you! thank you!

  • @manisha4130
    @manisha4130 Před rokem

    Very informative👍

  • @marionagi2914
    @marionagi2914 Před rokem

    amazing as usual

  • @sohailhosseini2266
    @sohailhosseini2266 Před 10 měsíci

    Thanks for the video!

  • @karthikprabhu416
    @karthikprabhu416 Před rokem

    Thank you for everything it helps me so much

  • @may_win
    @may_win Před 8 měsíci

    good video, very detailed