Data Engineering Mock Interview | Spark Optimization Interview Questions | Best Coding Practices

  • Added on 20 March 2024
  • To enhance your career as a Cloud Data Engineer, check trendytech.in/?src=youtube&su... for curated courses developed by me.
    I have trained over 20,000 professionals in the field of Data Engineering in the last 5 years.
    Want to Master SQL? Learn SQL the right way through the most sought-after course - SQL Champions Program!
    "An 8-week Program designed to help you crack the interviews of top product-based companies by developing a thought process and an approach to solve an unseen Problem."
    Here is how you can register for the Program -
    Registration Link (Course Access from India) : rzp.io/l/SQLINR
    Registration Link (Course Access from outside India) : rzp.io/l/SQLUSD
    30 INTERVIEWS IN 30 DAYS - BIG DATA INTERVIEW SERIES
    This mock interview series is launched as a community initiative under Data Engineers Club, aimed at aiding the community's growth and development.
    Expert guest interviewer, Sachin R, / sachin-r27 imparts invaluable insights and practical advice derived from extensive experience.
    Suman Basu, / basusuman23 skilled guest interviewee, showcases an exceptional approach in answering interview questions.
    Links to the free SQL & Python series developed by me are given below -
    SQL Playlist - • SQL tutorial for every...
    Python Playlist - • Complete Python By Sum...
    Don't miss out - Subscribe to the channel for more such informative interviews and unlock the secrets to success in this thriving field!
    Social Media Links :
    LinkedIn - / bigdatabysumit
    Twitter - / bigdatasumit
    Instagram - / bigdatabysumit
    Student Testimonials - trendytech.in/#testimonials
    Discussed Questions : Timestamp
    1:37 Introduction
    2:50 Brief about your project responsibilities
    5:26 Discuss SQL code documentation best practices for ensuring query efficiency.
    9:56 What are transformations and actions in PySpark DataFrames?
    10:35 What are the best practices you have followed specific to PySpark?
    12:39 What is the difference between cache and persist?
    13:33 Explain the concept of partitioning.
    14:58 When allocating multiple worker nodes/executors, how to increase or decrease the number of partitions?
    16:38 Which is more effective in avoiding data skewness: repartitioning or coalesce? What is data skewness?
    18:07 Coding questions
    36:20 Dealing with data quality issues
    38:30 After fetching data from CSV files, how would you define the schema?
    41:00 Preferred file format for data loading.
    Tags
    #mockinterview #bigdata #career #dataengineering #data #datascience #dataanalysis #productbasedcompanies #interviewquestions #apachespark #google #interview #faang #companies #amazon #walmart #flipkart #microsoft #azure #databricks #jobs

Comments • 19

  • @Vlogs..573 4 months ago +3

    Sachin is really knowledgeable, and he is helping to answer the questions as well with Suman.

    • @sumitmittal07 4 months ago +1

      yes both have been great. Kudos to Sachin & Suman.

  • @isenhiem a month ago +1

    This is such an amazing initiative... While watching the video I felt as if I was being interviewed... I can't stress enough how helpful this will be for so many people. It gave me a very good idea of the level of my preparation. Thanks a lot, and I hope you will create more videos like this.

  • @sharankarthick3364 2 months ago

    Informative!

  • @prannay19 4 months ago +2

    Great initiative. Thank you Sumit Sir 🙏. Looking forward to more such videos. Keep up the good work 👍

  • @user-ji9ke8yb2d 4 months ago +2

    Thank you so much, Sumit sir. Really a great initiative.

  • @DataJourneyHuub 4 months ago

    Thank you Sumit Sir

  • @AliKhanLuckky 4 months ago +3

    36:03 1. He is asking only for the highest salary.
    2. Department-wise highest salary.
    Use SQL code as follows:
    1. select max(salary) from emp;
    2. select dept, max(salary) from emp group by dept;
    It is as simple as that. He did not ask you to write a window function; if he asks for one, then do it 😊

    • @sriharidhanakshirur9245 4 months ago +1

      In case 1, we should use a window function because we need to print id and name as well.

    • @AliKhanLuckky 4 months ago

      @sriharidhanakshirur9245 In this case you can use a subquery as well. If anyone explicitly asks whether there is another way, or asks you to do it using window functions, showing it at that point will impress the interviewer 😊
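
The approaches mentioned in this thread (plain aggregate, GROUP BY, subquery, window function) can be compared side by side. The following is a minimal sketch, not the solution given in the video: it assumes a hypothetical `emp` table with `id`, `name`, `dept` and `salary` columns, uses invented sample rows, and runs the SQL through Python's built-in `sqlite3` module (the window-function query needs SQLite 3.25 or newer).

```python
import sqlite3

# Hypothetical table and sample rows, invented purely for illustration.
conn = sqlite3.connect(":memory:")
conn.execute("CREATE TABLE emp (id INTEGER, name TEXT, dept TEXT, salary INTEGER)")
conn.executemany(
    "INSERT INTO emp VALUES (?, ?, ?, ?)",
    [
        (1, "Asha", "HR", 50000),
        (2, "Ben", "HR", 65000),
        (3, "Chen", "IT", 90000),
        (4, "Dev", "IT", 72000),
    ],
)

# 1. Highest salary overall: a plain aggregate returns only the number.
overall_max = conn.execute("SELECT MAX(salary) FROM emp").fetchone()[0]

# 2. Highest salary per department.
dept_max = conn.execute(
    "SELECT dept, MAX(salary) FROM emp GROUP BY dept ORDER BY dept"
).fetchall()

# 3. Top earner including id and name, using a subquery.
top_by_subquery = conn.execute(
    "SELECT id, name, salary FROM emp "
    "WHERE salary = (SELECT MAX(salary) FROM emp)"
).fetchall()

# 4. Top earner per department including id and name, using a window function.
top_by_window = conn.execute(
    """
    SELECT id, name, dept, salary
    FROM (
        SELECT id, name, dept, salary,
               RANK() OVER (PARTITION BY dept ORDER BY salary DESC) AS rnk
        FROM emp
    ) AS ranked
    WHERE rnk = 1
    ORDER BY dept
    """
).fetchall()

print(overall_max, dept_max, top_by_subquery, top_by_window)
```

Queries 1 and 2 are enough when only the salary figure is wanted; queries 3 and 4 are the variants to reach for when the employee's id and name must appear in the result. RANK() keeps ties, so two employees sharing the top salary in a department would both be returned.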

  • @crunchyworks6374 4 months ago +3

    Sir, as I have seen over the last 3 days, the cloud tech you use is always Azure only. Please cover AWS too, it's very helpful.

    • @sumitmittal07 4 months ago +1

      definitely, you will see a lot of variety

  • @RohitSharma-ny1oq 4 months ago +1

    Please increase the complexity of the interview a little, because actual interviews are more complex 😊

    • @sumitmittal07 4 months ago

      candidates mostly get stuck in basic fundamentals. These are actual people who conduct interviews in companies.

  • @IsmailKhan-jy9ew 4 months ago

    Thank you, Sumit sir, for this initiative.

  • @user-oy9cc8dv8i a month ago

    If possible, also mention the experience level these interviews are targeting (for example fresher, 1 year, or 3 years of experience).