Forecasting with the FB Prophet Model

Sdílet
Vložit
  • čas přidán 3. 07. 2024
  • In this video I show how you can use facebook's prophet model to easily do time series forecasting in python. This model is very powerful because it uses an additive model where non-linear trends are fit with yearly, weekly, and daily seasonality, plus holiday effects.
    Kaggle Notebook: www.kaggle.com/robikscube/tim...
    Forecasting with Machine Learning Part 1: • Time Series Forecastin...
    Forecasting with Machine Learning Part 2: • Time Series Forecastin...
    Timeline:
    00:00 Intro
    00:50 Data and Imports
    03:43 Features and EDA
    05:54 Test Split
    06:49 Train and Predict
    10:01 Evaluate Forecast
    15:01 Adding Holidays
    19:19 Make Future Dataframe
    Follow me on twitch for live coding streams: / medallionstallion_
    My other videos:
    Speed Up Your Pandas Code: • Make Your Pandas Code ...
    Speed up Pandas Code: • Make Your Pandas Code ...
    Intro to Pandas video: • A Gentle Introduction ...
    Exploratory Data Analysis Video: • Exploratory Data Analy...
    Working with Audio data in Python: • Audio Data Processing ...
    Efficient Pandas Dataframes: • Speed Up Your Pandas D...
    * CZcams: / @robmulla
    * Twitch: / medallionstallion_
    * Twitter: / rob_mulla
    * Kaggle: www.kaggle.com/robikscube
    #fbprophet #python #machinelearning
  • Věda a technologie

Komentáře • 114

  • @devnull711
    @devnull711 Před rokem

    Your videos are pure gold, thank you.

  • @ChristenKult
    @ChristenKult Před rokem +2

    Thanks a lot, I love your forecasting content, especially with that dataset!
    Would love to see some more models in the future. Wish you the best!

    • @robmulla
      @robmulla  Před rokem +1

      Thanks! I plan to do more at some point. Glad you've been enjoying the ones I have so far.

  • @mschuer100
    @mschuer100 Před rokem +3

    Hey Rob, another amazing video, and great overview of Prophet. There are a couple of time series projects i want to tackle and I am going to take a crack and them using Prophet. Just want to say that I think this video is going to help me out a lot. Thanks for the effort you put into making these. Please keep them coming, they are more useful than you know.

    • @robmulla
      @robmulla  Před rokem +1

      Thanks! As many people mentioned there may be better alternatives to prophet but hopefully this tutorial gives you a good idea of where to start.

  • @marcellinusokoh1456
    @marcellinusokoh1456 Před 9 měsíci +2

    Great work... I am actually working on a comparative analysis of ARIMA, prophet, and xgboost in time series forecasting for my project

  • @anthonyfrancisco7954
    @anthonyfrancisco7954 Před rokem +1

    I'm learning a ton, thanks for the great content!

    • @robmulla
      @robmulla  Před rokem

      Glad you are learning from them! Thanks for watching.

  • @TylerMacClane
    @TylerMacClane Před rokem +1

    Surprisingly close. I watch your videos and everything is clear.

    • @robmulla
      @robmulla  Před rokem

      Glad you found it clear and helpful.

  • @davoodastaraky7608
    @davoodastaraky7608 Před rokem +8

    Amazing video Rob. I hope you continue making these videos and sharing your knowledge. You are also a great teacher.

    • @robmulla
      @robmulla  Před rokem

      Glad you learned from the video. I plan to keep making videos when I have time. Hopefully more time soon.

    • @spider279
      @spider279 Před 11 měsíci

      @@robmulla WHAT DOES model.make_future_dataframe(periods=3, freq='H') mean ? does it mean i am doing forecasting for the next 3 hours

  • @sandipbez3433
    @sandipbez3433 Před rokem +1

    Thank you for sharing the knowledge, this helps me a lot

    • @robmulla
      @robmulla  Před rokem

      Glad it was helpful. Thanks for watching.

  • @Lucia-el6ex
    @Lucia-el6ex Před rokem +2

    Me encantan tus vídeos. Eres un excelente profesional en tu campo. Gracias por compartir tus conocimientos con la comunidad de CZcams :)

    • @robmulla
      @robmulla  Před rokem +1

      ¡Muchas gracias! No hablo español pero estoy usando un programa de traducción. Me alegro de que puedas entender mis videos a pesar de que está en inglés.

  • @hasanovmaqsud
    @hasanovmaqsud Před rokem +2

    I see Rob has new video about time series forecasting. I put like and comment automatically. That's how it works! 😉

  • @Technomic-o
    @Technomic-o Před rokem +1

    Great stuff Rob ! thanks

  • @ronbzalen
    @ronbzalen Před rokem +1

    thanks! great content as usual :)

  • @juanpabloaguirre5077
    @juanpabloaguirre5077 Před 10 měsíci

    thanks teacher , greettings from Chile, gracias profesor

  • @charlesbergen8532
    @charlesbergen8532 Před 4 měsíci

    Excellent video!

  • @dfdf4874
    @dfdf4874 Před rokem

    I wish I would have come across this channel months ago, huge fan, amazing work.

  • @AM-ev2ew
    @AM-ev2ew Před rokem +3

    Awesome work!
    You made me interested in ML.
    I'm PhD in Physics, and I always preferred to avoid ML during University days, but you, and your work made it so interesting for me!
    Thanks for your work.

    • @johnbobbypringle
      @johnbobbypringle Před 10 měsíci

      The best thing about machine learning is this... Once you've built your models, you can just train them on different data. I have a single notebook which loads in my data and then the rest of the notebook contains all the models I have learned along with their cross validation and evaluation metrics. I can just change the data and run the entire notebook and then check on it occasionally to see the results. It can take an hour or two to build each model but once you have them all built, you're good to go and you can just add to the notebook as you learn more. You may even want to split your models into separate notebooks but i prefer to run them all in one as i then only need to alter the data in that one.

  • @abdessamedbouchena8823

    Thank you very much this is great work and very useful i hope just make new video about LSTM model

  • @user-arpitadey
    @user-arpitadey Před 4 měsíci

    Thanks a lot from Bangladesh

  • @SkyRiderJavelin
    @SkyRiderJavelin Před 3 měsíci

    great tutorial

  • @muhammadjamalahmed2273
    @muhammadjamalahmed2273 Před rokem +1

    Simply love..

  • @beda9beda
    @beda9beda Před rokem +1

    Like the explanations

  • @kavinyudhitia
    @kavinyudhitia Před rokem +1

    Nice!!! Thanks!!

    • @robmulla
      @robmulla  Před rokem

      Thanks for watching Kavin!

  • @codepour
    @codepour Před rokem +1

    Video is awesome as always! Quick, easy to follow and interesting. Thank you! btw, where did you get that t-shirt?

    • @robmulla
      @robmulla  Před rokem +1

      Glad you liked the video. The Tshirt is actually from Marques Brownlee's merch! I saw him wearing it in one of his videos and had to get it.

  • @feap03
    @feap03 Před 11 měsíci

    Hey Rob, Awesome Video.
    I think you need to take note that if you try to run your kagle notebook with a different data set, it's impossible to evaluate the error metrics because the y_pred array is the same size as the data set. I had to write a few more lines to extract just the last x values I needed.
    Thanks again

    • @johnbobbypringle
      @johnbobbypringle Před 10 měsíci

      did you split your data right? Your X_train and y_train should be the same length. and your X_test and y_test should be the same length. And if that's not the problem, then ensure you are evaluating the metrics using y_test and y_pred and not y_train and y_pred?

  • @Pedrommelos
    @Pedrommelos Před rokem +1

    THE KING IS BACK!

  • @PatricioStegmann
    @PatricioStegmann Před rokem

    Good job! Nive vid!👏

  • @marcounipd1836
    @marcounipd1836 Před rokem +3

    Extra like for the “Model Train” meme ❤

  • @eduardomanotas7403
    @eduardomanotas7403 Před 11 měsíci

    Hey rob nice video!!
    Could you please have another video about using panel datasets and implement AB testing as well.

  • @giordano_vitale_uni
    @giordano_vitale_uni Před 6 měsíci +1

    Great video, thanks for letting us learn this model!
    I have a question. At minute 10:13, the forecast interval includes negative values for the dependent variable: how is it possible? and how can we prevent it?

  • @litttlemooncream5049
    @litttlemooncream5049 Před rokem

    thx!

  • @69nukeee
    @69nukeee Před 7 měsíci

    Hello Rob, great video, thanks for sharing! This was my introduction to the Prophet model, and loved your explanation, crystal clear sa always!
    I have a quick question though: if I understand correctly, Prophet should work as a regression model given this very problem, and hence we could calculate R2 as well as metric along with RSE, MAE and MAPE. Is it recommendable to do it, or it makes no sense for this particular model? What's your take?
    Thanks a lot!

  • @cemberkay8554
    @cemberkay8554 Před 5 měsíci

    Thanks for the explanation. When using the xgboost model, can we include holidays in the model using the method you showed in this video?

  • @CaribouDataScience
    @CaribouDataScience Před 9 měsíci

    What is the default number of periods predicted?

  • @co.n.g.studios5710
    @co.n.g.studios5710 Před rokem +1

    very nice new model an great presentation! How long did it take you to programm the nb from top to bottom (roundabout)?

    • @robmulla
      @robmulla  Před rokem +1

      Thanks! It didn't take too long, maybe an hour or so, but I had already created a similar one a few years back.

  • @ahmetbeskardes8258
    @ahmetbeskardes8258 Před rokem +1

    Hello, in which python version can we run the prophet library? I'm developing on Anaconda-Spyder but I get different errors each time.

  • @ademakgul6768
    @ademakgul6768 Před rokem +2

    Hi Rob, thnks fro great video. Why did you add exrta features to data like hour, day of week? You did not use them as a regressor. They are already embeded in date. I didn't understand why you did add like that.

  • @satyamARAS
    @satyamARAS Před rokem +1

    I am preparing for Quant researcher (new hire) this help a lot . Can we get series of vedios for preparation for quantitative research interviews it will help .

    • @robmulla
      @robmulla  Před rokem

      Glad you are finding these helpful. I'm not a quant myself so I might not have the best insight into interview questions but if you have some resources you can point me to I might be able to make a video out of them! Thanks for watching.

  • @rafaeel731
    @rafaeel731 Před rokem +2

    Have you published the LSTM video? It is surprising that the initial XGBoost model with held-out test dataset performed the best given the simplicity and minimum effort.

    • @kishoretvk
      @kishoretvk Před 9 měsíci

      Well I'm curious to see LSTM with multi variant and then we can actually do understand more about how other factors impact, example temperature of the day and climate change anakysys

  • @vaibhav.shinde
    @vaibhav.shinde Před rokem +1

    Hey Rob,
    Which algorithm you think is good for forecasting power Hourly data?

    • @robmulla
      @robmulla  Před rokem +1

      XGBoost performs best on this dataset (check out my other video here: czcams.com/video/vV12dGe_Fho/video.html) But Prophet would probably perform better on something like sales or exchange rates.

  • @mr.unknowngaming7156
    @mr.unknowngaming7156 Před rokem +1

    Hey rob, in general, which TS model would you recommend as best for forecasting(not considering the MAPE values) for univariate dataset?
    Something that is consistently reliable.

    • @robmulla
      @robmulla  Před rokem

      Thanks for the question. I think it really depends more on the data and the information around it than the model. No model is perfect, but picking the correct one depends a lot on what you are trying to achieve- explainability, accuracy, etc. Sorry for the non-answer but if it was that simple everyone would just use one model.

    • @mr.unknowngaming7156
      @mr.unknowngaming7156 Před rokem

      @@robmulla Sure, but what if one model outperforms and older better performing model with new data being added?
      I’m new into TS forecasting and trying to build an approach towards how to make the process efficient.
      Would you suggest that one always runs all known models to assess which one is most accurate in the latest test or settle with one model which is consistently more accurate?

  • @kumarprabhu
    @kumarprabhu Před 8 měsíci +1

    Need your help. I do not have time components in my datasets, i only have date and the consumption data...how to choose the right model for this.. should i choose Xgboost or prophet or Arima...

  • @lennon4044
    @lennon4044 Před rokem +1

    Is it possible to give a neural prophet lecture?

  • @user-ej1ip3iq1o
    @user-ej1ip3iq1o Před 26 dny

    First of all, thank you for this nice video!
    I have a question why you created the features_and_target dataframe. I know that the features are helpful when we train models like XGBoost, but here we only used the ds and y column. I'd just like to understand what was the reason why you created it and if there are any ways to use this dataframe with Prophet?

  • @thecaptain2000
    @thecaptain2000 Před 10 měsíci

    Thank you for the informative video. however there are two aspects which are puzzling ( am new to pandas and prophet, so please be indulgent) first, once you trained your model you perform: pjme_test_fcst = model.predict(pjme_test_prophet). And it is puzzling why you are passing pjme_test_prophet. Test data are not used to perform any error measurement against predicted data, my speculation is the only column taken into consideration is the 'ds' column, used to specify what predictions we want and not the 'Y' column. am I correct?
    Moreover looking at yhat_lower and yhat_upper for 2015-01-01 01:00:00, meaning one hour after our training data finish, the predicted values are 23737.947439 32670.335550, a range of about 10K, which is about 50% of yhat_lower and about 30% of yhat_upper which would make for a pretty bad model if it needs such a big interval of confidence; if I understood correctly, did I?

  • @user-wr4fm8io6y
    @user-wr4fm8io6y Před měsícem

    is it necesary to make circular the time variables? as a loop for the 24 hours, or the 7 days of the week, the 12 months and the 365 days of the year

  • @phanquochung3924
    @phanquochung3924 Před rokem

    Wonderful. Can you do one tutorial one Neural Prophet as well, lots of thanks

  • @BrianMoyer-kq2gl
    @BrianMoyer-kq2gl Před rokem +1

    Hey Rob! I am getting an error when fitting the model. It appears that Prophet wants the 'ds' column to be in the datetime format, but it's in the datetime64[ns] format .... I can't figure out how to correct this - any ideas?

    • @robmulla
      @robmulla  Před rokem

      I found this stackoverflow post about it. It looks like it could be related to your datetime formatting: "Solution: Change the date format in training data to "%Y-%m-%d""
      stackoverflow.com/questions/64427395/facebook-prophet-future-dataframe

  • @RoyalKnight10
    @RoyalKnight10 Před rokem +1

    Hi Rob, I wanna ask you about: How safe is use Prophet Forecast with a business data? I mean, some business have a strong data policy and I'm not sure if using Prophet could break it. Is prophet in some way collecting some part of data?
    With the Facebook's background about data management I think it's a better idea to ask before.

    • @robmulla
      @robmulla  Před rokem +1

      Oh. Interesting question. I think you can still keep everything local. It’s not like Facebook will see your data they just developed the code. But also check with your companies guidelines. I wouldn’t want you to get in trouble. Good luck!

  • @zwelethuzitha1109
    @zwelethuzitha1109 Před 7 měsíci

    I am facing challenges running the notebook on google colab especially on the Time series features section ,please help anyone

  • @user-eo6su2ou9s
    @user-eo6su2ou9s Před rokem

    Hey, so i am getting yhat as negative since in my data the trend is that number of user decreased in last 4 months, what should i do?

  • @Simon-ez7sm
    @Simon-ez7sm Před rokem +1

    Hello robert!
    What about if you want to predict for multiple cities at the same time? (Having more than 1 entrie per time (for each city for example))

    • @robmulla
      @robmulla  Před rokem +2

      That gets tricky. You would need to train a model for each city, the model only can be trained on a single feature.

    • @Simon-ez7sm
      @Simon-ez7sm Před rokem

      @@robmulla thanks!!

  • @jackxu8476
    @jackxu8476 Před rokem

    Hi Rob, thanks for sharing this. One question - what if i have many products to forecast sales, do i need to train/run model for each product? Or i could do this in one shot meaning ONE model to cover all products?

    • @rafaeel731
      @rafaeel731 Před rokem +1

      I guess you have longitudinal data and in this case you have a collection of time series, not one. In that case you can build a model per product, or use a global model. See M4 / M5 competitions. It is greatly harder to forecast from such data. If I misunderstood, ignore me.

    • @OskarBienko
      @OskarBienko Před 11 měsíci

      You're right, it looks like panel data.

  • @nickr-ls8ny
    @nickr-ls8ny Před 4 měsíci

    Just started with Prophet to model dispensing figures based on previous data but having problems, any help would be appreciated...Thanks Nick

  • @ARich43
    @ARich43 Před rokem

    Great Video! When I try to filter my Date Range for when you're doing "January 2015 Predicted vs Actual" I get an error like
    Failed to convert value(s) to axis units: array([....]) -- Any idea what I could be doing wrong here?

  • @johnbobbypringle
    @johnbobbypringle Před 10 měsíci

    when I try to install fbprophet on my local machine, I get an error saying "ModuleNotFoundError: No module named Cyphon"

  • @digitalnomad2196
    @digitalnomad2196 Před 5 měsíci

    Hi Rob, great video I am wondering if you stopped making machine learning videos as this video was 1 year old

  • @joaquinalejandrogamarramon7099

    can you do with neural prohet?

    • @robmulla
      @robmulla  Před rokem

      I recorded a video but it wasn't good enough to release. Maybe at some point I'll re-try.

  • @GohouDanon-ct4gy
    @GohouDanon-ct4gy Před rokem +1

    Hi Rob, I got a Ph.D in economics and just started Data Science courses. Could please help me to make choice as I am looking for the 5 BEST Machine Learning and Deep Learnin algorithms that I can master to be a Forecasting specialist.

    • @robmulla
      @robmulla  Před rokem +2

      Hey Gohou. Thanks for watching. I’d need to think that through. Certainly a component based model like prophet and tree based model like xgboost would be on the list. Deep learning models are still at the point where I haven’t seen them outperform the others. But things in data science change daily and I’ve heard some big companies use transformer architectures in forecasting.

  • @athithyabalasubramani1045

    when I ran the kaggle notebook Its showing dt is not defined which package is defined as dt? I searched on google it shows pandas series but exactly what is declared as dt?

    • @robmulla
      @robmulla  Před rokem +1

      Great question. dt allows you to apply datetime methods to a datetime type column or series. More info here: pandas.pydata.org/pandas-docs/stable/reference/api/pandas.Series.dt.html Hope that helps!

    • @athithyabalasubramani1045
      @athithyabalasubramani1045 Před rokem +1

      @@robmulla sorry Rob I am a begginer I couldn't understand this I tired to import pandas.series.dt but it shows there is no module named pandas.series I am using Google Collab please help me tackle this problem

    • @robmulla
      @robmulla  Před rokem

      @@athithyabalasubramani1045 Not sure exactly what you are trying to do by importing it directly. These are just methods that you can apply to an existing series. You don't need ot import it directly.

  • @inigoaguilera3178
    @inigoaguilera3178 Před měsícem

    Awesome video !! However, somehow I am getting the following error:
    fig, ax = plt.subplots(figsize=(10, 5))
    ax.scatter(datos_test.index, datos_test['Hs(m)'], color='r')
    fig = model.plot(datos_test_forecast, ax=ax)
    ax.set_xbound(lower='01-01-2019',
    upper='02-01-2019')
    ax.set_ylim(0, 60000)
    plot = plt.suptitle('January 2019 Forecast vs Actuals')
    ConversionError: Failed to convert value(s) to axis units: '01-01-2019'

  • @santiariza15
    @santiariza15 Před rokem +1

    Isn't prophet deprecated? I thought they switched it up to Neural Prophet which seems way better.

    • @robmulla
      @robmulla  Před rokem +1

      I’ve tried neural prophet but still think there is benefit to learning the OG version. Deep learning isn’t always better and can be less explainable. Read some articles where each can be better depending on the dataset.

    • @robmulla
      @robmulla  Před rokem +2

      towardsdatascience.com/prophet-vs-neuralprophet-fc717ab7a9d8

  • @ignorethechannelforcomment4174

    Is it?😅

  • @vivasjimmy
    @vivasjimmy Před 5 měsíci +1

    i dont know but when having this data is very predictable , tuning these models can be done, but the more real world application majority of these models fail... we need to keep feeding new data, new features and yet the model misses.
    puting exampes everywhere like this, I think they are just bread crumbs to feed our imagination or desire or to get people excited but very quick you hit the wall and realize that this forcasting anything is a taugh task...

  • @skduwaragesh8449
    @skduwaragesh8449 Před rokem

    Actually i am getting error in installing fb prophet as it says building wheels for it was unsuccessful. Please solve this i have a very important project for my carrier.

    • @arnabmukherjee3129
      @arnabmukherjee3129 Před rokem

      Im using python version 3.9.13, im also unable to install fbprophet in my local machine, and in kaggle notebook too, fbprophet is not working.

    • @atsa8821
      @atsa8821 Před 3 měsíci

      @@arnabmukherjee3129 I'm also getting lots of error message installing pystan and fbprophet in jupyter notebook. There are 21261 lines of output! I don't know where to start and how to read any of the warnings and errors.

  • @jac8086
    @jac8086 Před rokem +1

    Ts forecasting never been accurate...if yes, everybody has been a billionaire by trading time series data like equity market, ficc etc...not much practical use

    • @TheBarinco
      @TheBarinco Před rokem +2

      Wrong mindset. "All models are wrong, some are useful".

    • @robmulla
      @robmulla  Před rokem +2

      I was also going to respond with the "all models are wrong" quote, but Gerald beat me to it. Especially with forecasting you need to first ask yourself what the end goal is. I also think in most cases a human in the loop is still required, so an explainable model can be very helpful. But I totally agree that trying to predict something like stocks at large time horizons is a fool's errand.

    • @jac8086
      @jac8086 Před rokem

      @@robmulla so what value are we getting from ts models? Telling us something we already knew If no predictability? If it can't do a better job, why to hire expensive data scientist to do a job which a high school graduate can perform using Excel

    • @generationgap416
      @generationgap416 Před rokem

      Prophet is just 1 tool from the toolbox. It is not the whole toolbox. Knowing how to talk doesn't magically make you an orator. You need other skills

  • @aarondelarosa3146
    @aarondelarosa3146 Před 11 měsíci

    Excellent. But graphs are extremely awful! Can you improve them?

  • @comment8767
    @comment8767 Před 2 měsíci

    Music sucks.