Data Engineer Mock Interview | ADF | Medallion Architecture | BRONZE, SILVER & GOLD Layer | ADLS GEN2

  • Added on 25 Mar 2024
  • To enhance your career as a Cloud Data Engineer, check trendytech.in/?src=youtube&su... for curated courses developed by me.
    I have trained 20,000+ professionals in the field of Data Engineering in the last 5 years.
    Want to master SQL? Learn SQL the right way through the most sought-after course - SQL Champions Program!
    "An 8-week program designed to help you crack the interviews of top product-based companies by developing a thought process and an approach to solve an unseen problem."
    Here is how you can register for the program -
    Registration Link (Course Access from India) : rzp.io/l/SQLINR
    Registration Link (Course Access from outside India) : rzp.io/l/SQLUSD
    30 INTERVIEWS IN 30 DAYS - BIG DATA INTERVIEW SERIES
    This mock interview series is launched as a community initiative under Data Engineers Club, aimed at aiding the community's growth and development.
    Our highly experienced guest interviewer, Ankur Bhattacharya (/ ankur-bhattacharya-100...), shares invaluable insights and practical advice from his extensive experience, catering to aspiring data engineers and seasoned professionals alike.
    Our talented guest interviewee, Sasiram Kolisetty (/ kolisetty-sasiram-0285...), answers the interview questions in a well-articulated manner. He talks about the complete Medallion Architecture followed in the projects he worked on in the Insurance and Green Energy domains. He describes how the raw data was first added to the Bronze layer tables, then cleaned and transformed before being added to the Silver layer tables. Finally, based on the business requirements, the data was aggregated and the results were added to the Gold layer.
    Links to the free SQL and Python series developed by me are given below -
    SQL Playlist - SQL tutorial for every...
    Python Playlist - Complete Python By Sum...
    Don't miss out - Subscribe to the channel for more such informative interviews and unlock the secrets to success in this thriving field!
    Social Media Links :
    LinkedIn - / bigdatabysumit
    Twitter - / bigdatasumit
    Instagram - / bigdatabysumit
    Student Testimonials - trendytech.in/#testimonials
    Discussed Questions with Timestamp
    3:51 Brief discussion about the project.
    9:06 What business problem are you solving in your project?
    9:56 What work have you done to secure your data?
    11:20 Do you use Unity Catalog for role-based policy?
    11:33 How do you resolve bottlenecks in terms of latency in your pipeline?
    15:14 If you have 2 large tables, what kind of join strategy will you follow?
    16:12 Difference between bucketing and Z-ordering.
    16:48 How do you perform data ingestion?
    18:20 How would you create a pipeline for real-time data updates, aggregation, and live dashboard creation?
    20:13 What triggers are available in ADF?
    21:18 How does lazy evaluation help in Spark optimization?
    22:45 What approach would you take for cluster configuration?
    24:53 SQL coding question
    Tags
    #mockinterview #bigdata #career #dataengineering #data #datascience #dataanalysis #productbasedcompanies #interviewquestions #apachespark #google #interview #faang #companies #amazon #walmart #flipkart #microsoft #azure #databricks #jobs

Comments • 67

  • @satyanarayanabadam9764 3 months ago +8

    This is the best interview till now. He worked on security, CDC concepts, etc. Thanks, Sumit sir, for conducting these interviews. Really helpful for anyone preparing for interviews.

  • @joerokcz 3 months ago +11

    Ram is one of the best candidates of the series.
    All the best Ram.

    • @sumitmittal07 3 months ago +3

      Yes, definitely, he has great exposure.

  • @SiddharthVerma-lm3sj 1 day ago

    These videos have been very helpful. I have gone over many and this would be beneficial for a long time. Great work, initiative and thanks for the efforts.

  • @NaveenKumar-ln9vg 3 months ago +2

    I am preparing for an interview next week. As you said, a failed interview is better than not attending one, and these mock interviews are giving me the confidence to face it. Great initiative, learning and help. Thanks!

  • @skillhorizon 14 days ago

    You are doing great work. Lot of respect

  • @sandhyasandy9072 3 months ago +1

    This is one of the best interviews and I got good insights, Sumit sir. Learnt the new concept of thin executor and fat executor, and his optimisation techniques are also unique, like join reordering, Adq, etc.

  • @sonurohini6764 3 months ago +1

    Definitely. It is a great help sir.

  • @ramprasadh5322 3 months ago

    This series is great and gives me the confidence to sit in the interview.

  • @niridha23 21 days ago

    Thanks for conducting these mock interviews, Sumit sir. It is really helpful 😊

  • @asaisvlogs7258 2 months ago

    It is just a wonderful task... It's very helpful ❤❤❤❤❤

  • @FF-bp9bj 3 months ago

    Sumit sir, you are a legend in BIGDATA...
    You are the best trainer I've seen so far... Thanks for making big data enthusiasts' lives easier....

  • @user-jg2tn1wb3d 3 months ago

    Thank you sumit sir and team, your hard work really means a lot

  • @Sudeep-ow4pe 21 days ago

    The interview series is really helpful, Thank you

  • @nsinha27 2 months ago

    Sumit - these mock interviews are worth their weight in gold.

  • @sandipansaha6847 3 months ago

    Very helpful video.... gr8 help ...

  • @AnandPatil-eu1tl 28 days ago

    Thank you sir, these videos are very helpful

  • @BooksWala 3 months ago +4

    best interview till now

  • @rajdeepreddycharla 3 months ago

    Been watching the interviews every day. Definitely helpful. IMO, feedback at the end based on the interviewee's performance would be great.

    • @sumitmittal07 3 months ago

      we will add it in upcoming ones

  • @pranavadhav597 3 months ago

    What an amazing interview. This series is so helpful. Thanks sumit sir

    • @sumitmittal07 3 months ago

      Keep watching for more such engaging interviews

  • @ravikanth6178 3 months ago

    The interviews are very helpful…
    Thank you Sumit and Team 🎉🎉

    • @sumitmittal07 3 months ago

      Glad to hear that you found the mock interviews insightful. More such sessions are scheduled for release in the upcoming days.

  • @pradumanyadav493 2 months ago

    This is a very helpful series

  • @prasadrajupericharla5545 2 months ago

    Great one 🙌

  • @ameygoesgaming8793 3 months ago

    The series is amazing, I am learning a lot. Please keep making it

  • @anikethdeshpande8336 3 months ago

    Thanks for sharing. Very helpful

    • @sumitmittal07 3 months ago +1

      Glad it was helpful! Keep watching for more such interesting content

  • @swapnildande4706 3 months ago

    Really helpful for all data engineers

    • @sumitmittal07 3 months ago

      Happy to know that you are finding the interview series helpful

  • @bhagirathpandey3560 3 months ago

    Gr8 effort 🌟

  • @salilagarwal7915 3 months ago

    Best interview, every interviewer should give feedback at the end. it can be more helpful.

    • @sumitmittal07 3 months ago

      noted the suggestion, will incorporate it from the upcoming interviews

  • @ManaviVideos 3 months ago

    Thanks for the interview series!!

  • @user-dj4ht7rg2f 3 months ago

    Good one!!

  • @DataJourneyHuub 3 months ago

    Thank you as always 🙏🏻

    • @sumitmittal07 3 months ago +1

      Always happy to share good content for all the data enthusiasts

  • @chinmayamahapatra6797 3 months ago

    It was really effective for us, very, very helpful

    • @sumitmittal07 3 months ago

      Happy to know that you found the interview helpful

  • @VishalKumar-ie3gr 3 months ago

    Very informative.

  • @bhushanthoke858 3 months ago

    Realistic interview ❤

  • @abhirambandi6091 2 months ago

    Too technical man 🔥

  • @gouthambheema1779 3 months ago

    Very helpful ❤

  • @boreddymanohar8339 3 months ago

    It was awesome

    • @sumitmittal07 3 months ago

      Glad that you are finding the mock interviews informative.

  • @RohitSharma-ny1oq 3 months ago

    Best

  • @Visionpro33 1 day ago

    What is the common salary range for Big data developer (4yrs) ?

  • @PurbaragPChoudhury 3 months ago

    Most of the interviews seem to focus on junior and mid-level DE. I'd also appreciate some videos for the Senior DE role.

    • @sumitmittal07 3 months ago

      let me know if you would like to attend?

  • @vijaykumarvc3037 3 months ago

    Sumit Sir, Please share any mock interview which will cover Azure cloud alone if possible

  • @chandranshuyadav3515 3 months ago

    Very nice interview, sir. Can we have an interview for someone who has more than 5 years of experience and is looking to join at an Architect, Team Lead or Manager level in the Data Engineering world?

    • @sumitmittal07 3 months ago

      Definitely, there would be many more such videos released as part of this interview series. Will try to consider all possible scenarios

  • @ThanmayiTan 3 months ago

    I would like to know the answer to the question, "Why do we get a duplicate of the join column in Spark, whereas in SQL we do not get that duplicate?"
    From my limited knowledge, in both Spark and SQL joins we get the duplicate columns when a join is performed on two tables.
    I would really appreciate it if you could let me know the correct answer.

    • @sumitmittal07 3 months ago

      The reason behind this lies in the fundamental differences between Spark and traditional SQL databases. Spark performs parallel processing to handle partitioned data distributed across different nodes in the cluster. It is inevitable that there would be duplicates while performing joins on such datasets in this context. In contrast, traditional SQL databases usually operate on a single machine or a tightly coupled cluster and ensure uniqueness by preventing duplicates from being generated in the final results.

    • @ThanmayiTan 3 months ago

      @sumitmittal07 Thank you so much! That really makes sense now.
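
For readers who want to check the join-column question from this thread themselves, here is a minimal sketch. It assumes SQLite (through Python's built-in sqlite3 module) as a stand-in SQL engine, since the thread does not name one, and the table and column names are hypothetical. It suggests that whether the join key appears once or twice depends on how the join condition is written: an `ON` condition keeps both key columns under `SELECT *`, while `USING` keeps one. PySpark behaves similarly as far as I know: `df1.join(df2, df1.id == df2.id)` keeps both `id` columns, while `df1.join(df2, "id")` keeps a single one.

```python
import sqlite3

# Hypothetical two-table setup; names are illustrative, not from the video.
conn = sqlite3.connect(":memory:")
conn.executescript("""
    CREATE TABLE orders (id INTEGER, amount INTEGER);
    CREATE TABLE customers (id INTEGER, name TEXT);
    INSERT INTO orders VALUES (1, 100), (2, 250);
    INSERT INTO customers VALUES (1, 'a'), (2, 'b');
""")

def column_names(query):
    """Return the output column names produced by a query."""
    cursor = conn.execute(query)
    return [d[0] for d in cursor.description]

# Join written with an explicit ON condition: both key columns are kept.
on_cols = column_names(
    "SELECT * FROM orders JOIN customers ON orders.id = customers.id"
)

# Join written with USING: the shared key column appears only once.
using_cols = column_names(
    "SELECT * FROM orders JOIN customers USING (id)"
)

print(on_cols)
print(using_cols)
```

Comparing the two printed lists shows how many times `id` appears for each way of writing the join.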

  • @ameygoesgaming8793 3 months ago

    I would like to attend an interview. Could you let me know what the process is?

    • @sumitmittal07 3 months ago

      Please fill this form - forms.gle/UMpNCZvAHgoLvvuJ6