AWS re:Invent 2021 - Building a data lake on Amazon S3

Sdílet
Vložit
  • čas přidán 6. 08. 2024
  • Flexibility is key when building and scaling a data lake, and by choosing the right storage architecture, you will have the agility to quickly experiment and migrate to AWS. This session explores best practices for building a data lake on Amazon S3, which allows you to leverage industry-leading AWS, open-source, and third-party analytics and ML tools and gain insights from your data. This session also explores how to optimize your storage on Amazon S3 for data lakes, including information on storage classes, S3 access points, and running HPC workloads with Amazon FSx for Lustre.
    Learn more about re:Invent 2021 at bit.ly/3IvOLtK
    Subscribe:
    More AWS videos bit.ly/2O3zS75
    More AWS events videos bit.ly/316g9t4
    ABOUT AWS
    Amazon Web Services (AWS) hosts events, both online and in-person, bringing the cloud computing community together to connect, collaborate, and learn from AWS experts.
    AWS is the world’s most comprehensive and broadly adopted cloud platform, offering over 200 fully featured services from data centers globally. Millions of customers-including the fastest-growing startups, largest enterprises, and leading government agencies-are using AWS to lower costs, become more agile, and innovate faster.
    #AWS #AmazonWebServices #CloudComputing
  • Věda a technologie

Komentáře • 16

  • @MrDottyrock
    @MrDottyrock Před rokem +2

    This is very eye opening. I used AWS for several years and never thought S3 could serve such purpose. this is fantastic!

    • @awssupport
      @awssupport Před rokem

      We're glad to see you enjoyed it, Jamiu! 👀 ^SA

  • @djohnjimmy
    @djohnjimmy Před 2 lety

    This is a very helpful intro into days lake design with S3. Thank you

  • @umairqamar2672
    @umairqamar2672 Před rokem +1

    This was super duper amazingly wonderful to watch !

  • @samuel_william
    @samuel_william Před rokem

    This video clearly explains about the storage-s3. Very good video to learn about s3

  • @mbaapohelviszonepoh1284
    @mbaapohelviszonepoh1284 Před rokem +1

    This is very helpful. Thanks very much

  • @alexfaith5562
    @alexfaith5562 Před 2 lety

    What an awesome video!

  • @severtone263
    @severtone263 Před 2 lety +1

    Thank you for this, this was very helpful

    • @masterek1998
      @masterek1998 Před rokem

      Ijiibjj😅jiiiihbiijiiiiijiibijiiiiiiii

  • @rifkiamil
    @rifkiamil Před rokem

    We had 200gb of data in MS OLAP in 2008 coming out of terabyte ERP system. Not sure where getting his numbers from. 9:46

  • @yogenderpal
    @yogenderpal Před 2 lety

    Hadoop replicates data in three different nodes, so we need to lose 2 nodes before we start to worry about data loss. He said we need to lose 3 data nodes.:)

  • @user-sw9kd9pv4n
    @user-sw9kd9pv4n Před 2 lety +3

    24:24

  • @lordlee6473
    @lordlee6473 Před 2 lety +3

    Not much on data lake, more of a talk about S3 features for what it’s intended for, data storage. Actual data analysis and reporting is done with other AWS services.

    • @rbr951
      @rbr951 Před 2 lety +1

      True that. To that extent a little disappointing. Datalake != s3