How would a Data Scientist analyze Customer Churn?

Sdílet
Vložit
  • čas přidán 6. 08. 2024

Komentáře • 44

  • @jursmin
    @jursmin Před 3 lety +16

    Great video. However, I think that the precision/recall explanation at the end should be the other way around.

    • @CodeEmporium
      @CodeEmporium  Před 3 lety +3

      Good eye. I'll clear this up in a comment. Thanks for catching this

    • @Akshaylive
      @Akshaylive Před 3 lety

      Came here to say this!

    • @falak88
      @falak88 Před 3 lety

      Aye

    • @bahmanaboulhasanzadeh8512
      @bahmanaboulhasanzadeh8512 Před 2 lety

      It would be great if you add a comment on the video itself or correct that section. Many may not read the comment section and thus not realize it.

    • @wuneyeproduction
      @wuneyeproduction Před 3 měsíci

      @@bahmanaboulhasanzadeh8512 Exactly!! I almost did not read the comment section.

  • @mohamedgaal5340
    @mohamedgaal5340 Před 2 lety

    Thank you for yet another awesome video! I like how you simplify data science concepts.

  • @faizaanazam
    @faizaanazam Před rokem

    Love it!

  • @deepanshudashora5887
    @deepanshudashora5887 Před 3 lety +1

    Another masterpiece by you 🙏

    • @CodeEmporium
      @CodeEmporium  Před 3 lety

      Thanks for the compliments! Means a lot!

    • @user-zn6fs9cu5b
      @user-zn6fs9cu5b Před 2 lety

      @@CodeEmporium you please tell me what is work order

  • @cocoarecords
    @cocoarecords Před 2 lety

    amazing

  • @19AKS58
    @19AKS58 Před 6 dny

    Great video

  • @snjjain
    @snjjain Před 3 lety

    how should we treat the test dataset? Does that also have to be randomly sampled? How would you present recall and precision? at the user level or daily observation level? Thanks

  • @munshir.c6161
    @munshir.c6161 Před 2 lety

    Thank you,
    Could u please help me how can make a mathematical equation for churn model?

  • @snjjain
    @snjjain Před 3 lety +6

    I think you got the precision and recall swapped, precision = Tp/ tp + fp , Recall = Tp/ Tp + Fn , The false negatives + true positives in recall stands for the users who actually churned and the true positives + false positives in precisions are the predictions by the model!

    • @CodeEmporium
      @CodeEmporium  Před 3 lety +2

      Yep. I saw this in another comment too. Good eye! I'll write a clarifying comment on this soon

    • @migi7787
      @migi7787 Před rokem

      @@CodeEmporium I loved this presentation - you're so good at touching the main points (e.g. data leakage) and all in 13 mins!! Exactly what I was looking for, well done!

  • @chessplayer0106
    @chessplayer0106 Před 2 lety +1

    Wait why can we only train on data up to 3 months in the past? If you have the same features for customers from last year and also see their usage history last year, couldn't you use that historical data to predict churn as well?

  • @irfanalfathramadhan4797
    @irfanalfathramadhan4797 Před 8 měsíci

    Can you provide examples of practices in tools related to churn, for example in Orange Data Maining?
    I'm still learning to churn, thank you

  • @deadlysk8
    @deadlysk8 Před 3 lety

    awesome

  • @rohitsharma0389
    @rohitsharma0389 Před 3 lety

    There are two points that you mentioned in the model training slides
    1. Create snapshots randomly for active users at a given time
    2. Only train on data up to 3 months in the past
    Does that mean that when I want to create a random snapshot, let's say of 18th Mar 2021, I only include customers who have purchased at least once in the last 3 months as anyone whose last purchase was 4 months prior to 18th Mar is already a churned and hence inactive violating the first point.
    Also of those users who are considered active, can we look back at data prior to 3 months or not. I am guessing not because that would violate rule 2 but then you are using a feature #work orders in last 6 months. So I am a little confused here.

  • @dragomir44
    @dragomir44 Před 3 lety +1

    Awesome video! The step at 9:05 went a bit too fast in my opinion. Could you show us, perhaps with made up data, how you verify it using SQL? What goes into the process of defining a "Correct" and an "incorrect"?

    • @rohitsharma0389
      @rohitsharma0389 Před 3 lety

      Performing uni-variate analysis of predictor variable's relationship with target variable should answer the "hunches"/hypothesis. For instance, plotting a bar chart of Gender vs Churn will tell us whether males are likely or females are more likely to churn. In this case, we can visualize #WorkOrdersIn6Months against Churn

  • @EtienneTremblay
    @EtienneTremblay Před 3 lety

    Great video! I don’t think you could use regression though because of right censoring. You don’t know if some of those customers will churn and when. Survival analysis models deal with this kind of censoring.

    • @CodeEmporium
      @CodeEmporium  Před 3 lety +1

      Ah. Good call. I think you're right with this

  • @ipvikas
    @ipvikas Před 3 lety +3

    Precision: Of the users who churned, how many did the model predict would churn
    Precision: छोड़ कर गए लोगों में से कितनों के बारे में model ने बताया था कि ये छोड़ के जा सकते हैं |
    Recall: Of the users the model said would churn, how many users actually churned?
    Recall: Model ने जिनको बताया था कि ये जा सकते हैं , उनमे से कितने वाकई में छोड़ कर चले गए |

  • @YeeLeeHaw
    @YeeLeeHaw Před 3 lety

    Start porting your videos to LBRY's Odysee as well. You can set that to automatically upload it there whenever it uploads to CZcams and can be activated when you create your Odysee account. You deserve way more exposure, maybe that can help a little since there is far less competition there.

    • @CodeEmporium
      @CodeEmporium  Před 3 lety

      Thanks for the suggestion Daniel. I def might consider this

  • @RidingWithGerdas
    @RidingWithGerdas Před 3 lety +1

    Want to ask about perhpas too bias features. Isn't something like "days from last orders" or "total order count" would be too bias to predict churn. I mean it's common sense that if the customer hasn't ordered for a while or in a long time ordered once or twice, he is SUPER likely to churn when the churn period arrives. Wouldn't such features somehow hide or overshadow other features which perhaps could indicate us which features make negative churn predicitons ?

    • @CodeEmporium
      @CodeEmporium  Před 3 lety

      True. Like if the feature is 89 days since last purchase, chances are they will churn. And now that i think about it, it probably won't be useful making that prediction since acting on it is impossible. We could just exclude that as a feature and say "What is the probability the customer churns 3 months from making this prediction". Good eye

    • @RidingWithGerdas
      @RidingWithGerdas Před 3 lety

      @@CodeEmporium Then this would be related to Survival Analysis?

  • @user-zn6fs9cu5b
    @user-zn6fs9cu5b Před 2 lety

    what is work order ???

  • @kabeerjaffri4015
    @kabeerjaffri4015 Před 3 lety +1

    Great video. Love from pakistan

  • @varunnayyar3138
    @varunnayyar3138 Před 2 lety +1

    Precision recall definition seems to be goofed up.

    • @CodeEmporium
      @CodeEmporium  Před 2 lety +1

      Yea. My b. Pinned a previous comment that said the same. Thanks!

  • @anjaliudoshi5729
    @anjaliudoshi5729 Před 2 lety

    Is this a Hancock 'good job'??

  • @SMHasan9
    @SMHasan9 Před rokem

    I don't agree with the 2nd feature.
    still great vdo 👍