Cleaning Messy Data | Power Query Case Study

Sdílet
Vložit
  • čas přidán 5. 08. 2024
  • This video is dedicated to a user's question on Data Cleansing, which I thought was nearly impossible but Power Query makes it super easy to clean messy data sets. There are ton of learnings you can draw from this data cleansing case study. Enjoy!
    ===== ONLINE COURSES =====
    ✔️ Mastering DAX in Power BI -
    goodly.co.in/learn-dax-powerbi/
    ✔️ Power Query Course-
    goodly.co.in/learn-power-query/
    ✔️ Master Excel Step by Step-
    goodly.co.in/learn-excel/
    ✔️ Business Intelligence Dashboards-
    goodly.co.in/learn-excel-dash...
    ===== LINKS 🔗 =====
    Blog 📰 - www.goodly.co.in/blog/
    Corporate Training 👨‍🏫 - www.goodly.co.in/training/
    Need my help with a Project 💻- www.goodly.co.in/consulting/
    Download File ⬇️- www.goodly.co.in/wp-content/u...
    ===== CONTACT 🌐 =====
    Twitter - / chandeep2786
    LinkedIn - / chandeepchhabra
    Email - goodly.wordpress@gmail.com
    ===== CHAPTERS =====
    0:00 Intro
    1:08 The Data Cleaning Problem
    4:20 Power Query Clean Up
    12:48 My Online Courses
    ===== WHO AM I? =====
    A lot of people think that my name is Goodly, it's NOT ;)
    My name is Chandeep. Goodly is my full-time venture where I share what I learn about Excel and Power BI.
    Please browse around, you'd find a ton of interesting videos that I have created :) Cheers!
    - - - - -
    Music By: "After The Fall"
    Track Name: "Tears Of Gaia"
    Published by: Chill Out Records
    - Source: goo.gl/fh3rEJ​
    Official After The Fall CZcams Channel Below
    czcams.com/channels/GQE.html...
    License: Creative Commons Attribution-ShareAlike 4.0 International (CC BY-SA 4.0)
    Full license here: creativecommons.org/licenses
  • Věda a technologie

Komentáře • 108

  • @JoseMariaGomezMartinez
    @JoseMariaGomezMartinez Před 2 lety +9

    Hi Chandeep! Thanks again gentleman! Superb solution

  • @alphamaniac9411
    @alphamaniac9411 Před 2 lety +1

    Great example. It forces someone to logically think about what to do, and allow PQ to do the rest. It's a powerful tool. Thanks for your insight and great explanation!

  • @krishnakishorepeddisetti4387

    Chandeep... You are awesome man.... Your content is great... I work in power bi day in and day out... But after seeing your videos... I feel... There is still a lot to learn

  • @CraigDG
    @CraigDG Před rokem

    Thankyou Goodly, your info helped a little. I have a very messy daily data dump to clean up every day coming from external software within the company. The cleanup of the data becomes crucial in enabling an accounting reconciliation of the data, and until the data is clean, the manual reconciliation process is long and arduous and takes me away from my other work. I have found no one yet on You Tube, who has the exact problem I have and provides an exact solution, but bit by bit I am putting pieces together from different people like you that will hopefully give me the right solution i need.

  • @pondyanand
    @pondyanand Před 2 lety

    Excellent solution ! Really helpful.

  • @Belakavadi
    @Belakavadi Před 2 lety

    Awesome mate. Love the approach.

  • @cherianiype
    @cherianiype Před 2 lety

    Super terrific Chandeep! Incredible stuff man! Thank you for this!

  • @somanathking4694
    @somanathking4694 Před 2 lety +1

    This is literally superb sir..I was like mesmerised no words. Thankyou sir .

  • @martyc5674
    @martyc5674 Před 2 lety

    Brilliant Chandeep, great practical content.

  • @kennethstephani692
    @kennethstephani692 Před 2 lety

    Great video!

  • @viveksharma4193
    @viveksharma4193 Před 2 lety +1

    Truly awesome

  • @mathewinmuscat
    @mathewinmuscat Před 2 lety

    Super!!!!! Thanks a ton

  • @decentmendreams
    @decentmendreams Před 2 lety +2

    Thanks. I would have used if column A = null then give me column B else null to get the months . But happy to learn this technique of isolating cells by data type . Also, I love your technique of getting the date/month/year . I have projects that I would like to visit after watching this vid.

  • @dikushnukenjeh9072
    @dikushnukenjeh9072 Před rokem

    This is not data cleaning; this is data rearranging. Cool applications.

  • @asian1salim
    @asian1salim Před 12 dny

    Hi Chandeep @ you are always Power Query Data cleaning Super Hero

  • @querobinenator
    @querobinenator Před rokem

    I too came across the exact hurdle and used similar techniques in PQ. Great work Chandeep

  • @GosCee
    @GosCee Před 2 lety

    Thanks for that, Chandeep. Impressive! This will undoubtedly help me in my quest to master power query.

  • @wayneedmondson1065
    @wayneedmondson1065 Před 2 lety

    Hi Chandeep. Awesome solution! Well explained and very instructive. Thanks for demonstrating. Thumbs up!!

  • @BiggBrro
    @BiggBrro Před rokem

    Magnificent!

  • @luisjavier1284
    @luisjavier1284 Před rokem

    Hi, Chandeep that was awesome!

  • @boissierepascal5755
    @boissierepascal5755 Před rokem

    Clever, brilliant !

  • @judyrodbryanvicente8638

    More power to your channel.. Good stuff mate

  • @subbu_ca
    @subbu_ca Před 2 lety

    That was awesome.

  • @saniyanulkar9093
    @saniyanulkar9093 Před 2 lety

    Really amazing solution 👍

  • @MdShahidulIslamshafimbd

    Great solution 👌

  • @leosaghathan2895
    @leosaghathan2895 Před rokem

    Yes definitely helpful👌👍🏼

  • @RobertJohnstonrobjomabri
    @RobertJohnstonrobjomabri Před 11 měsíci

    Thank you very much Chandeep, I have to solve this problem every year to get family rosters into a Google calendar - I usually spent time getting the data manually in a row then do the transformations 🤦‍♂️.
    I will try this for next year

  • @ravimalik1264
    @ravimalik1264 Před 2 lety +1

    Thank you for the nice video . It was really helpful.
    Request please make some videos related to 1mmt,3mmt,6mmt,YTD and calendar year and comparing them with previous year or prior period. Also, some tips and tricks using time intelligence dax

  • @user-tp3jq8qg8l
    @user-tp3jq8qg8l Před 5 měsíci

    Really excellent sir.

  • @ArmanAper7
    @ArmanAper7 Před rokem +1

    Hi Chandeep!, thanks for your videos. Do you have any advice for my case where I have an excel file downloaded from an accounting system and this file is nothing but a banch of grouped rows and number of levels (subgroups) vary all in one column. The report which is the goal is supposed to be dynamic. Is there a way to transform grouped excel file into flat file? Thanks in advance

  • @pabeader1941
    @pabeader1941 Před 2 lety +1

    I like how you used that Value.Type function to isolate the date. I had a similar problem to solve and used the fact that US dates have a / in them. I like yours better as it would be more generic.

  • @jaychaudhary1284
    @jaychaudhary1284 Před měsícem

    great case study

  • @lucianoriquet8552
    @lucianoriquet8552 Před rokem

    Amazing!

  • @Laxmanmane007
    @Laxmanmane007 Před 2 lety

    superb Video😊

  • @cblondhe
    @cblondhe Před rokem

    Chandeep , this was another amazing video, thanks for sharing.

  • @vashisht1
    @vashisht1 Před 2 lety

    👌 awesome.. I like these challenge video...also the logic to the problem was great 👍

  • @abdulrehman56
    @abdulrehman56 Před 2 lety

    Awesome Awesome Awesome man.. You are rock star ... Dear please make a video that how you learn and journey of Power Query. I have asked you in one of previous video....

  • @tanveerabbas3271
    @tanveerabbas3271 Před 10 měsíci

    GREAT BOSS

  • @neerajnirantar
    @neerajnirantar Před rokem

    Great teaching technique.

  • @KhalilAhmad74036
    @KhalilAhmad74036 Před rokem

    Great, information, Sir. Thanks,

  • @ahmedbenchaoued9765
    @ahmedbenchaoued9765 Před 2 lety

    Perfect bro keep going , you are good 🤙

  • @yousrymaarouf2931
    @yousrymaarouf2931 Před 2 lety

    You are great

  • @maheshmulik2399
    @maheshmulik2399 Před rokem +1

    Thanks for the video, helped me a lot. Just one question wouldn't it be better to add one more column called present/absent and shift the "H" value there and replace "H" with 0 in the attendance column, so the whole column becomes numeric and we can perform numeric operation like sum on it ??

  • @fahadea1
    @fahadea1 Před rokem

    Awesome bruh 😮😮

  • @excel-k-sir
    @excel-k-sir Před 2 lety +1

    Hey Chandeep, It was really a well presented solution. Right now I have been struggling to clean the Mutual funds Consolidated Account Statement that comes in PDF but with no success using power query. Just wanted to know can that also be cleaned and get exported to excel. If you want to have a look at the data please confirm will share the same.

  • @adityasharma0101
    @adityasharma0101 Před 2 lety +2

    I am doing the similar power query data exploitation but in different ways. I found yours to be more dynamic.
    Like, when you needed to add a custom column containing only the months, you applied the logic that, if, column 2 date type is month, then bring column 2 else null.
    I used to apply the following logic,
    If column one is null, then bring column 2 else null.
    Also, I would have done the specific date extraction differently because in your logic, I was relying on column number logic (-1). I would have kept the row containing “team” in column 1 and Would have separated out the date and unpivoted. Thereby using the file date instead of my own logic.
    The date combining logic was very cool though(bringing day from one column and month and year from another)

  • @frankschadler9407
    @frankschadler9407 Před 2 lety

    Smart and clean solution, if the dates are always complete. Excel can be used to be 'creative'with data in one or another way. ;-)

  • @RandomlyWisdom
    @RandomlyWisdom Před 2 lety

    I paused the video at the start and tried on my own. The solution was not easy. Thanks for new data cleaning tricks.

    • @GoodlyChandeep
      @GoodlyChandeep  Před 2 lety +1

      It never works in the first go. I understand, I've been in your position several times :)

    • @RandomlyWisdom
      @RandomlyWisdom Před 2 lety

      @@GoodlyChandeep I think we skipped weekend dates. Is there any way to include complete calendar dates?

  • @mohitupadhayay1439
    @mohitupadhayay1439 Před rokem

    We talk about Dax so much but forget that Power Query is so much effective at doing lot of things too.
    Much better than python or other languages to cleanse data and transformation.

  • @pratyushnigam8956
    @pratyushnigam8956 Před rokem

    Fantastic sir... it's just amazing would you please suggest any dataset examples to hone my data cleaning skills on power query.........if you could please help me sir.....

  • @purepenmicheal140
    @purepenmicheal140 Před rokem

    Thanks chandeep for this video please I have two issues firstly, what if the data type of the date it date/time? With this your formula for extracting date brings error. How can I adjusted it? I tried changing the data type of column 2 to date but it destroyed other datas in that column to errors.
    Secondly can you help on my power bi date changing from 2022 to 1899 automatically.

  • @Nethra7
    @Nethra7 Před rokem

    Thank you sir, i have a doubt in this video that in this example after unpivote, the null values are missing i mean that the null should get zero am not getting can u please help me...

  • @bharathramc.n7796
    @bharathramc.n7796 Před 2 lety

    Thanks for explaining the custom column usage of converting the number to the required date.
    Please do explain how to use PQ when the raw data is 40,000+ rows and is cleaned and converted to the required format,
    when we go to the query for necessary correction PQ takes time or msg is displayed something has gone wrong.

  • @gravestoner2488
    @gravestoner2488 Před rokem

    Man... when my boss says "make a report, heres my data, and every time I change it, i want it to update" and I say "sure thing boss, can you organize it in this format?" And he says "no, I prefer pretty colors and 4 tables with 3 sub tables each all on the same sheet, make it work"
    I can now say "sure thing boss man"

  • @WernervanWyk2
    @WernervanWyk2 Před 2 lety

    Lekker Chandeep! Your biggest fan in Africa🌍
    Do you have a solution wherby one can take change entered data BACK TO the the original complex date template shown above?
    Keep up the good work 👏

    • @GoodlyChandeep
      @GoodlyChandeep  Před 2 lety

      Going back to the problem via power query would be quite a task. However Power BI visuals might help reformat the data back into its original form.

  • @kastenivkimbo5347
    @kastenivkimbo5347 Před 10 měsíci

    How to get this data sheet dataset to practice

  • @eslamfahmy87
    @eslamfahmy87 Před 10 měsíci

    Actually, amazing 👏 but I am facing an issue with knowing which function it needs to use on most of PQ data ...
    I know ( that's why you should take my course !... but I trying to do my best, but 😢 the result is above 50%..so could you provide with me the tips & tricks to be follow

  • @amahcynthia7405
    @amahcynthia7405 Před 2 lety

    Hi Goodly, thank you so much for this. I have been able to replicate it. However, I wanted to confirm, so at the end of the steps I tried to convert the date type to UK date structure using locale, however this didn't work. Then I tried slitting the individual dates, rearranging and merging them back, but when I converted the data type to date it showed error. Is there a reason why.

    • @GoodlyChandeep
      @GoodlyChandeep  Před 2 lety

      Can't say unless I see the screenshot or your query :|

  • @abeerattia4523
    @abeerattia4523 Před 2 lety

    Hi Chandeep , when i upload the fill to power query and tried to extract the date i got this Erro
    (Expression.Error: We cannot apply field access to the type Function.
    Details:
    Value=[Function]
    Key=Column3
    If Value.Type [Column2] = Date.Type then [Column] else null
    Pls. advise

  • @nettenette2298
    @nettenette2298 Před rokem

    Do you have any videos on organizing messy payroll data using power query?

    • @GoodlyChandeep
      @GoodlyChandeep  Před rokem

      Send me a sample and expected output. If it seems a common problem, I'll make a video on it :)
      goodly.wordpress@gmail.com

  • @user-uo4yf9eo9q
    @user-uo4yf9eo9q Před 13 dny

    H helpful

  • @txreal2
    @txreal2 Před rokem

    I have a messy data question.
    How do you remove the line feeds or carriage returns in column headers in Power Query? Please help.

    • @GoodlyChandeep
      @GoodlyChandeep  Před rokem +1

      I'll have to do a video on this. Thanks for the suggestion!

  • @sanjeevkhakre2990
    @sanjeevkhakre2990 Před 2 lety

    Sir when I load the data in power query. Then the date converting into any data type which creating prblm to apply the function. Please help

  • @apoorva528
    @apoorva528 Před 7 měsíci

    Hi Chandeep, Could you provide us the excel sheet of the dirty data?

  • @amahcynthia7405
    @amahcynthia7405 Před 2 lety

    Hi Goodly, this is awesome. Please I will like to ask is step three (removing other columns) necessary, cos I can only see 32 columns in mine. Are you assuming that the other columns are invisible and you don't want it to disrupt your clean up. In summary what is the purpose of step 3. I will appreciate your response. Also how did you remove "other columns". Did you manually type the formular or .........

  • @lohitgowda5889
    @lohitgowda5889 Před 10 měsíci

    where do I find the file for practice ?

    • @GoodlyChandeep
      @GoodlyChandeep  Před 10 měsíci

      www.goodly.co.in/wp-content/uploads/2022/02/Data.zip

  • @003kashif
    @003kashif Před 10 měsíci

    Can you provide us the file to practice alongside the video?

  • @kartickchakraborty9135

    Hi Sir, how are you? I recently finished your power query tutorials. Now, I'm learning Dax from basic. But, Recently I came across a term "Granularity" while learning Dax from ExcelIsFun. I am unable to understand why did he enforce on this term repeatedly? Please kindly make a video on this topic.

  • @sauravsinha6939
    @sauravsinha6939 Před rokem

    Is there any course on M language that I can study

  • @stephenbui490
    @stephenbui490 Před měsícem

    1 Step I've put my leg on ;P

  • @deepaksahu-nf1vb
    @deepaksahu-nf1vb Před 2 lety

    We want to join your course but it little high there is no one who teach like you with experience you possess but only cost is taking us back

    • @GoodlyChandeep
      @GoodlyChandeep  Před 2 lety

      Hi Deepak.
      I understand. Please wait for Black Friday offer and enjoy CZcams videos until then :)

  • @preethiagarwal5355
    @preethiagarwal5355 Před 2 lety

    I dint understand the problem itself , bro. 2 nd column became first of the month 🙄

  • @powerbinareal
    @powerbinareal Před rokem +1

    Insano!!! #powerbinareal

  • @deshn21
    @deshn21 Před 11 měsíci +1

    Instead of going through the steps you already made. Go through the steps in real time. This is extremely lazy work.

    • @goldylock
      @goldylock Před 5 měsíci +1

      Ive no idea about all his code there, no live explanation

  • @robertowerneck6902
    @robertowerneck6902 Před 2 lety

    Great video!