ETL | AWS Glue | AWS S3 | Data Cleansing | Transforming data with AWS Glue in ETL workflows

Sdílet
Vložit
  • čas přidán 18. 01. 2024
  • ===================================================================
    1. SUBSCRIBE FOR MORE LEARNING :
    / @cloudquicklabs
    ===================================================================
    2. CLOUD QUICK LABS - CHANNEL MEMBERSHIP FOR MORE BENEFITS :
    / @cloudquicklabs
    ===================================================================
    3. BUY ME A COFFEE AS A TOKEN OF APPRECIATION :
    www.buymeacoffee.com/cloudqui...
    ===================================================================
    In this lab you will learn about AWS Glue, which is a serverless data integration service that makes it easier to discover, prepare, move, and integrate data from multiple sources for analytics, machine learning (ML), and application development. You can use a crawler to populate the AWS Glue Data Catalog with tables. This is the primary method used by most AWS Glue users. A crawler can crawl multiple data stores in a single run. Upon completion, the crawler creates or updates one or more tables in your Data Catalog. Extract, transform, and load (ETL) jobs that you define in AWS Glue use these Data Catalog tables as sources and targets. The ETL job reads from and writes to the data stores that are specified in the source and target Data Catalog tables.
    example data : github.com/RekhuGopal/PythonH...
    #awsglue #etl #datatransformation #s3 #cloudquicklabs #crawler #etljob
    #dataengineering #data #datacleansing #datamodificada
  • Věda a technologie

Komentáře • 16

  • @user-mg8ol5gp1j
    @user-mg8ol5gp1j Před 5 měsíci +1

    Excellent video, please make more videos like that on ETL, admire your hard work, very smooth explanation…… Thanks

    • @cloudquicklabs
      @cloudquicklabs  Před 5 měsíci

      Thank you for watching my videos.
      Indeed , I shall make videos on data engineering.
      Glad that my videos helped you.

  • @sujithadas9844
    @sujithadas9844 Před 5 měsíci +1

    Loved your series on AWS Glue.. nicely explained.. 😊 Please upload more scenarios with other transformations such as join and filter !
    Subscribed already.. thank you !!

    • @cloudquicklabs
      @cloudquicklabs  Před 5 měsíci

      Thank you for watching my videos.
      Glad that it helped you.
      Indeed I shall make more videos around this topic ahead.

  • @rashmishah5682
    @rashmishah5682 Před 5 měsíci +1

    Thanks a lot....please make more videos with different transformation and also the use of cloudwatch and lambda with it....

    • @cloudquicklabs
      @cloudquicklabs  Před 5 měsíci

      Thank you for watching my videos.
      Glad that it helper you.
      I shall make more videos around this topic and Thank you for giving inputs.

  • @WolfmaninKannada
    @WolfmaninKannada Před 4 měsíci +1

    Excellent brother❤

    • @cloudquicklabs
      @cloudquicklabs  Před 4 měsíci

      Thank you for watching my videos.
      Glad that it helped you.

  • @yadi4diamond
    @yadi4diamond Před měsícem +1

    excellent

    • @cloudquicklabs
      @cloudquicklabs  Před měsícem

      Thank you for watching my videos.
      Glad that it helped you.

  • @bellamkondapushpashree4923
    @bellamkondapushpashree4923 Před 3 měsíci +1

    It is showing failed at the end after running the job ...can you make a video on how to debug this issue

    • @cloudquicklabs
      @cloudquicklabs  Před 3 měsíci

      Thank you for watching my videos.
      You can find ETL pipeline logs if you go pipeline runs or you can also find the same at Cloudwatch log groups.

  • @faisalmali3809
    @faisalmali3809 Před 3 měsíci +1

    How can I run multiple glue jobs.

    • @cloudquicklabs
      @cloudquicklabs  Před 3 měsíci

      Thank you for watching my videos.
      Do you mean creating multiple job or multi threading within a job ?

  • @NandhuVlogs.
    @NandhuVlogs. Před 4 měsíci +2

    Bro You create Glue job by using python script...... In real time..... Visual ETL is Never preferred......

    • @cloudquicklabs
      @cloudquicklabs  Před 4 měsíci

      Thank you for watching my videos.
      Indeed , I would creating videos on creating job via python script. But this video is to focus on ETL topic.