4 Recently asked Pyspark Coding Questions | Apache Spark Interview

Sdílet
Vložit
  • čas přidán 26. 06. 2024
  • visit my website trendytech.in to know more about my big data program.
    In this session I have talked about 4 interview questions which were recently asked in pyspark coding interview.
    I am sure this session is going to help all the big data enthusiasts.
    #bigdata #dataengineering #pyspark

Komentáře • 31

  • @venugopal-nc3nz
    @venugopal-nc3nz Před 4 měsíci +5

    It will be great if you put questions in comment . Others can try without looking at solution first

  • @adityatomar9820
    @adityatomar9820 Před 4 měsíci +1

    One of the great explanation so far on youtube. I wish i could afford your course :(

  • @gudiatoka
    @gudiatoka Před 4 měsíci

    Sir...Share need more .. please continue this playlist

  • @veerugandhad3437
    @veerugandhad3437 Před 4 měsíci

    Very useful informative video which gives more confidence to the bigdata aspirants. Thanks Sumit.

  • @praptijoshi9102
    @praptijoshi9102 Před 2 měsíci

    You are doing a great job posting these❤

  • @singhjirajeev
    @singhjirajeev Před 3 měsíci

    00:03 Recently asked Pyspark Coding Questions
    02:37 Writing and executing Pyspark pseudo code
    05:21 Creating a Spark dataframe from input and performing group by aggregation
    08:04 Using aggregation functions and collect list in Pyspark.
    11:15 Spark SQL solution for creating DataFrame and running queries.
    14:18 Understanding the data frame reader API for reading JSON and the usage of explode function
    17:11 Creating a Spark dataframe and performing operations on it.
    19:44 Converting string to date and performing group by in Pyspark DataFrame
    22:32 Finding the average stock value using PySpark
    25:38 Practice more on data frames for interviews
    28:15 Practice more to gain confidence in writing correct syntax for Pyspark coding

  • @sravankumar1767
    @sravankumar1767 Před 3 měsíci

    Superb

  • @sopankardile2603
    @sopankardile2603 Před 4 měsíci +1

    One of the best interview series Thank you sumit sir .

  • @rohit-ll3rj
    @rohit-ll3rj Před 3 měsíci

    We can apply distinct() too I guess for avoiding duplicate values in df.

  • @prasoonvijay5775
    @prasoonvijay5775 Před 4 měsíci

    Hi Sumit,
    Could you please create Video explaining pipelines on AWS Databricks End-End along with Orchestration of those.

  • @shashankgupta2776
    @shashankgupta2776 Před měsícem

    Thank you Sir greatly explained, would be good if you can post data/schemas also in the decription box for us to query and do hands on. Thanks.! :)

  • @2412_Sujoy_Das
    @2412_Sujoy_Das Před 4 měsíci

    Much needed sir.....!!!

    • @sumitmittal07
      @sumitmittal07  Před 4 měsíci +1

      Sujoy, I am sure you will enjoy watching this.

  • @satishutnal
    @satishutnal Před 4 měsíci

    Best explanation sir thanks

  • @TheUMESH34
    @TheUMESH34 Před 4 měsíci

    This is great!

  • @electricalsir
    @electricalsir Před 4 měsíci +1

    What about remaining 10 questions on pyspark you told we are covering it in next video but still you not uploaded on CZcams and when you will upload it on CZcams we are waiting for remaining 10 questions on pyspark
    Thank you ❤

  • @electricalsir
    @electricalsir Před 4 měsíci

    thanks sumit make videos like this .

  • @Nikhil-qi4oz
    @Nikhil-qi4oz Před 4 měsíci

    Amazing sir

    • @sumitmittal07
      @sumitmittal07  Před 4 měsíci +2

      Nikhil, I am sure you will find it useful.

  • @mdasif2411
    @mdasif2411 Před 4 měsíci

    Hi Sir, can we not write in Spark sql in interview? As there is no difference in performance.

  • @anjibabumakkena
    @anjibabumakkena Před 4 měsíci

    Nice explanation sir, kindly post scenario based questions

  • @sonurohini6764
    @sonurohini6764 Před 25 dny

    Sir create coding interview playlist

  • @sharankarchella2688
    @sharankarchella2688 Před 4 měsíci

    Nice video

  • @user-dl3ck6ym4r
    @user-dl3ck6ym4r Před 3 měsíci

    in question number 2 = do we not need to remove duplicate as last can you please clear me on it ?

  • @user-jl5cb3cs1j
    @user-jl5cb3cs1j Před 4 měsíci

    Hello sir, how can I run pyspark code online, are you also using any online utilty to run pyspark code as shown in this video , could you please share the source, it would be very helpful.