tablespoon

Time-series Benchmark methods that are Simple and Probabilistic

Documentation and quick links

Introduction

Many methods exist for probabilistic forecasting. If you are looking for an impressive probabilistic forecasting package, see the list of recommendations at the bottom of this README. This package is exceptionally ordinary. It is expected that this package may be used as a complement to what is already out there.

Why Run Simple Methods

We have found, through experience, many good uses for the methods in this package. Too often we see forecast methods go into production without a naive method to accompany them. This is a missed opportunity.

  1. Naive May Be Good Enough: Some applications do not need anything more impressive than a simple forecasting method.
  2. Get A Denominator for Relative Metrics: Though naive methods can usually be beaten, it is good to know the relative improvement over the benchmark. This allows a forecasting team to market their alternative forecast when the 'skill score' is impressive (see the sketch after this list).
  3. Easy to Productionize and Set Expectations: Get a sense for how good is good enough. In many applications a forecasting team is asked to forecast, but stakeholders provide no line in the sand for when the forecasting work can stop. One reasonable approach is to run the benchmarks found in this package and beat the best-performing benchmark by a statistically significant margin.
  4. Resilience in Production - Why Not Have Many Models?: Sometimes, despite our best efforts, the production model does something unexpected. In that case it is nice to have a simple backup that is cheap to generate and good enough to fall back on. In this way a production forecast pipeline gains strength from a diversity of models.
  5. Easy Uncertainty Quantification: More and more we see that applications are not about forecast accuracy, but about forecast uncertainty. Capturing the full distribution helps firms set "service levels", i.e. percentiles of the distribution they are prepared to serve (the Quick Example below shows how to summarize the simulated draws into percentiles). Even with the world's most accurate unbiased forecast, the median point forecast is an underforecast half the time. For this reason it is best to provide a distribution of simulated future values and let the firm decide for itself which risks it is or is not willing to take.
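To make point 2 concrete, the short sketch below computes a skill score as 1 - MAE_model / MAE_naive. The arrays y_true, y_hat_model, and y_hat_naive are hypothetical placeholders (they are not part of the tablespoon API); a positive skill score means the candidate model beats the naive benchmark.

import numpy as np

# Hypothetical actuals and two competing point forecasts (illustrative numbers only).
y_true = np.array([5.1, 5.3, 5.0, 5.4, 5.2])
y_hat_model = np.array([5.2, 5.2, 5.1, 5.3, 5.2])  # candidate production model
y_hat_naive = np.array([5.0, 5.1, 5.3, 5.0, 5.4])  # naive benchmark

mae_model = np.mean(np.abs(y_true - y_hat_model))
mae_naive = np.mean(np.abs(y_true - y_hat_naive))

# Skill score > 0: the candidate improves on the naive benchmark; <= 0: it does not.
skill_score = 1 - mae_model / mae_naive
print(f"skill score: {skill_score:.2f}")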

Quick Example

We show a quick example below. For more examples see EXAMPLES.md

import pandas as pd
import tablespoon as tbsp
from tablespoon.data import APPL
# Uncomment if this is your first time installing cmdstanpy
# from cmdstanpy import install_cmdstan
# install_cmdstan()

n = tbsp.Naive()
df_n = (n.predict(APPL, horizon=7*4, frequency="D", lag=1, uncertainty_samples=8000).assign(model="naive"))
print(df_n.head(25))
           ds  rep    y_sim  model
0  2022-01-02    0  5.20006  naive
1  2022-01-02    1  5.16789  naive
2  2022-01-02    2  5.17641  naive
3  2022-01-02    3  5.19340  naive
4  2022-01-02    4  5.20075  naive
5  2022-01-02    5  5.17681  naive
6  2022-01-02    6  5.20302  naive
7  2022-01-02    7  5.18896  naive
8  2022-01-02    8  5.19622  naive
9  2022-01-02    9  5.17469  naive
10 2022-01-02   10  5.18686  naive
11 2022-01-02   11  5.16293  naive
12 2022-01-02   12  5.17006  naive
13 2022-01-02   13  5.18777  naive
14 2022-01-02   14  5.18617  naive
15 2022-01-02   15  5.18752  naive
16 2022-01-02   16  5.18106  naive
17 2022-01-02   17  5.17399  naive
18 2022-01-02   18  5.17950  naive
19 2022-01-02   19  5.17120  naive
20 2022-01-02   20  5.18699  naive
21 2022-01-02   21  5.16781  naive
22 2022-01-02   22  5.19250  naive
23 2022-01-02   23  5.17976  naive
24 2022-01-02   24  5.15683  naive
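Because predict returns one simulated draw per row, the "service levels" mentioned in point 5 above fall out of a plain pandas summary. A minimal sketch, assuming df_n is the data frame produced above; the 5th, 50th, and 95th percentiles are illustrative choices, not a tablespoon default.

# Summarize the simulated draws into percentiles ("service levels") per forecast date.
# Assumes df_n has the columns shown above: ds, rep, y_sim, model.
service_levels = (
    df_n.groupby("ds")["y_sim"]
    .quantile([0.05, 0.50, 0.95])  # median plus a 90% interval
    .unstack()
    .rename(columns={0.05: "p05", 0.50: "p50", 0.95: "p95"})
)
print(service_levels.head())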

Goals of this package

  1. Simple: Not just in the forecasts themselves, but also from the user's perspective.
  2. Documented: It should be very clear exactly how forecasts are generated. We document the parameterization of the models to make this as obvious and uninteresting as possible. See Forecast Method Documentation.
  3. Stable: We want this package to feel rock solid. For this to happen we keep the feature set relatively small. We believe that after the initial development of this package we should spend our time maintaining the code as opposed to thinking of new features.
  4. Distributional: Quantification of uncertainty is the name of the game. Because this package uses Stan in the backend, users get access to state-of-the-art numerical sampling.
  5. Accessible: Because of how important we feel simple forecasting methods are, we want as many front-end bindings as possible to expose these methods to the largest audience possible. We eventually hope to have bindings in Shell, Julia, R, and Python. (This will come with time.)

Non-Goals

  1. 🔥 Circuit-Melting Focus on Speed: Not to say this is a slow package. In fact, all models are compiled, so it is very fast! We just don't put in any extra effort to make it faster than the compiled C++ Stan model.
  2. 🤖 New/Complex Forecast Models: Again, this is out of scope. If you are looking for recommendations, please see the bottom of the page.

Installation

Python

pip3 install tablespoon

Citation

If you would like to cite tablespoon, please cite it as follows:

Alex Hallam. tablespoon: Time-series Benchmark methods that are Simple and Probabilistic. https://github.com/alexhallam/tablespoon, 2021. Version 0.1.6.

@misc{tablespoon,
  author={Alex Hallam},
  title={{tablespoon}: {Time-series Benchmark methods that are Simple and Probabilistic}},
  howpublished={https://github.com/alexhallam/tablespoon},
  note={Version 0.1.8},
  year={2021}
}

References

  1. Hyndman, R.J., & Athanasopoulos, G. (2021) Forecasting: principles and practice, 3rd edition, OTexts: Melbourne, Australia. OTexts.com/fpp3. Accessed on 2021-09-26.
  2. Stan Development Team. 2021. Stan Modeling Language Users Guide and Reference Manual, 2.27.0. https://mc-stan.org

Recommended probabilistic forecasting packages

There are many packages that can complement tablespoon:

forecast: The king of forecasting packages. Rob Hyndman is a professor of forecasting and has served as editor of the journal "International Journal of Forecasting". If you are new to forecasting please read his free ebook fpp3.

prophet: A very capable and reliable forecasting package. I have never seen a bad forecast come out of prophet.

gluonts: If you are itching to use neural nets for forecasting, this is a good one to pick.

Learn more about forecasting

  1. Read fpp3
  2. Join the International Institute of Forecasting and read their publications.

Beta

This package is currently being tested. It is very much unfinished at this point. Feel free to use what is currently available.

Built with poetry and pushed to PyPI

poetry publish -u <username> -p <password> --build

