DS 786 – Week 2
- Watch Videos
- Introduction to Forecasting (5:59)
- Forecast: a statement about the future
- To be of value, the forecast is about a future event that we don’t have complete certainty for?
- Given what we know now, how much do we expect to sell next month?
- Forecast Time Horizons
- Short-range: immediate future to three months
- Medium-range: several months to a year
- Long-range: more than one year
- Common Aspects of Forecasts
- Assumes causal system: path => future
- Do observations of past behavior have relevance in predicting future behavior?
- Forecasts are never perfect (we just need them to be good enough)
- Forecasts are more accurate for groups vs individuals
- (ie sales of the category greek yogurt not just single-serving non-fat strawberry)
- Forecast accuracy decreases with time
- Predicting next week’s sales has less uncertainty than predicting sales 50 weeks out.
- Steps in the forecasting process
- Step 1 Determine the purpose of the forecast
- Step 2 Establish time horizon
- Step 3 Select forecasting technique
- Step 4 Gather and analyze data in detail
- State any assumptions
- Validate data: may need to cleanse of filter for past events
- Step 5 Calculate forecast
- Step 6 Analyze/monitor the forecast –
- Measure accuracy, are results acceptable?
- No: Return to step 3
- Yes: Publish forecast
- Types of forecasts
- Judgmental
- Use subjective inputs
- Consumer surveys, opinions of managers and staff: ie Delphi method
- Time Series
- Use historical data and assuming the future will be similar to the past
- Associative forecasts
- Use explanatory variables other than time to predict the future.
- Time series foreacasting
- Plot your data and look for patterns
- Trend: Long-term movement in data
- Linear: steady increase or decrease over time
- Nonlinear: demand may be exponential, may both increase and decrease over product life cycle
- Seasonality: short-term regular variations in data
- What season does walgreens sell the most allergy meds
- Not just limited to fall/winter/spring/summer variations
- Which days are busiest for fancy restaurants?
- Daily cycle at starbucks: when do they sell the most coffee drinks?
- Irregular variations: caused by unexpected events
- Infrequent spikes or troughs like toilet paper in 2020
- Random variations: caused by change
- Learn how to do a Naive Forecast (4:33)
- Naive forecasts
- Stable time-series data
- eg. sold 50 tires last week, so expect to sell 50 this week
- Seasonal variations
- sold 50k heart shaped candy boxes last February so expect to sell 50k this February.
- Data with trends
- sold 50 lattes yesterday but 45 the day before so expect to sell 55 today!
- Naive forecasting: pros and cons
- Pros
- You only need one data point
- Can start soon
- It’s easy
- Easy to explain simply
- Cons
- Naive forecasts are useful as a benchmark or lower bound for accuracy
- Working with the Moving Average (4:54)
- Simple moving average
- The average of N most recent observations
- The larger the N, the smoother the forecast, but the greater the lag (ability to respond to “real” changes)
- How to Work with Weighted Moving Averages and Exponential Smoothing (6:38)
- Weighted moving average
- Premise: the most recent observations might have the best predictive value. Yet for simple moving averages, older data points have same importance as most recent
- Solution: Assign weights to give more importance to more recent observations. (Weights must be between 0 and 1 and all weights must sum to 1 or you get bias)
- Advantages: avoids over-smoothing and lag time is decreased
- Weights are often found through trial and error, no one set of weights work best for all situations.
- Exponential smoothing
- Exponential smoothing is a type of weighted moving average that considers all time periods that we have data for, but gives them exponentially less weight the further back in time they fall
- Measuring Forecast Accuracy for Stable Time Series Data (5:23)
- How to select a forecast
- for data without trends or seasonality, we could use
- a simple naive forecast
- a moving average
- a weighted moving average
- an exponential smoother
- We will get different answers with each technique
- How do we pick the one most likely to be close to what will happen
- We calculate past forecast accuracy
- Measuring forecast accuracy
- Many different measures exist, such as MSE (mean squared error) and MAPE but in this class we will only calculate mean absolute deviation (MAD)
- First define an error as the difference between the actual value and predicted value for that time period
- The take the absolute value of this error
- Sum these absolute values and take the average