A package for optimization solvers

elex-live-model

The Washington Post uses this live election model to generate estimates of the number of outstanding votes on an election night based on the current results of the race. It is agnostic to the quantity that it is estimating. For general elections, we also generate estimates of the partisan split of outstanding votes, and for primaries, we split estimates by candidate.

Generally, the model works by comparing the current results to a historical baseline and regressing on the difference using demographic features. We use quantile regression as the underlying model and conformal prediction to produce our uncertainty estimates.
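To make the uncertainty step more concrete, here is a minimal sketch of split conformal prediction in plain Python. It is not the model's actual implementation: it uses absolute residuals from a held-out calibration set (a common textbook variant) rather than quantile regression, and all data and function names are hypothetical.

```python
import math

def conformal_quantile(residuals, alpha):
    """Return the ceil((n + 1) * (1 - alpha))-th smallest residual."""
    n = len(residuals)
    rank = math.ceil((n + 1) * (1 - alpha))
    if rank > n:
        # Too few calibration points to reach the requested coverage.
        return math.inf
    return sorted(residuals)[rank - 1]

def conformal_interval(prediction, residuals, alpha):
    """Symmetric interval around a point prediction with target coverage 1 - alpha."""
    q = conformal_quantile(residuals, alpha)
    return prediction - q, prediction + q

# Hypothetical calibration set: actual values and the model's predictions for them.
calibration_actual = [105, 98, 110, 93, 120, 101, 99]
calibration_predicted = [100, 100, 104, 96, 110, 100, 102]
residuals = [abs(a - p) for a, p in zip(calibration_actual, calibration_predicted)]

# Interval for a new point prediction of 100 at 75% target coverage.
lower, upper = conformal_interval(100, residuals, alpha=0.25)
print(lower, upper)
```

The interval width comes entirely from how wrong the predictions were on the calibration set, which is why the approach needs no distributional assumption about the errors.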

The first iteration of this model is written in R in this repo.

Installation

We recommend that you set up a virtualenv and activate it (e.g. mkvirtualenv elex-model via http://virtualenvwrapper.readthedocs.io/en/latest/).
Run pip install elex-model

Usage

We can run the model with a CLI or with Python.

We can use the model to generate current estimates or for a historical evaluation. Historical evaluation means running the model on the currently reporting subunits with data from a previous election, and then calculating the error that this set of reporting subunits would have given us. This allows us to test how representative the currently reporting subunits are.

CLI

The CLI is for local development and testing purposes only. We cannot run a live election through the CLI because it pulls vote counts from data files located either in S3 or locally. It does not retrieve current data from the Dynamo database of election results.

The CLI takes an election ID, estimands, office ID, and a geographic unit type. If you're running the model with local data files, they should be located at elex-live-model/data/{election_id}/{office_id}/data_{geographic_unit_type}.csv. Otherwise, the model will attempt to find the data files in S3.
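As a small illustration of the local file layout described above, the sketch below builds the expected path from its three parts. The helper name and the default root argument are hypothetical and not part of the package.

```python
from pathlib import Path

def local_data_path(election_id, office_id, geographic_unit_type,
                    root="elex-live-model/data"):
    """Build data/{election_id}/{office_id}/data_{geographic_unit_type}.csv under root.

    Hypothetical helper: it only mirrors the layout described in this README.
    """
    return Path(root) / election_id / office_id / f"data_{geographic_unit_type}.csv"

path = local_data_path("2017-11-07_VA_G", "G", "county")
print(path.as_posix())
```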

Pass in a command like this:

elexmodel 2017-11-07_VA_G --estimands=dem --office_id=G --geographic_unit_type=county

You can also pass in multiple estimands:

elexmodel 2017-11-07_VA_G --estimands=dem --estimands=turnout --office_id=G --geographic_unit_type=county --percent_reporting 40

If you want to run a test with some nonreporting subunits, you can use the --percent_reporting CLI parameter:

elexmodel 2017-11-07_VA_G --estimands=dem --office_id=G --geographic_unit_type=county --percent_reporting 40

Historical election

If you want to run a historical election, you can use the --historical flag. For this to succeed, the election must have historical data already prepared.

elexmodel 2021-11-02_VA_G --estimands=dem --office_id=G --geographic_unit_type=county --percent_reporting 60 --historical

Parameters

Parameters for the CLI tool:

Name	Type	Acceptable values
election_id	string	`YYYY-MM-DD_{geography}_{election_type}`, where geography is the state or `USA` and election type is `G` for general or `P` for primary
estimands	list	party name (e.g. `dem`, `gop`) or `turnout` in a general; `{candidate_last_name}_{polID}` in a primary
office_id	string	Presidential (`P`), Senate (`S`), House (`H`), Governor (`G`), state Senate (`Z`), state House (`Y`)
geographic_unit_type	string	`county`, `precinct`, `county-district`, or `precinct-district`
percent_reporting	numeric	0-100
historical	flag
features	list	features to include in the model
fixed_effects	list	`postal_code`, `county_classification` or `county_fips`, but really any prepared categorical variable
aggregates	list	list of geographies for which to calculate predictions beyond the original `postal_code`, `county_fips`, `district`, `county_classification`
pi_method	string	method for constructing prediction intervals (`nonparametric` or `gaussian`)
beta	numeric	variance inflation for `gaussian` model
robust	flag	flag for larger set of prediction intervals in the nonparametric case
save_output	list	`results`, `data`, `config`
unexpected_units	int	number of unexpected units to simulate; only used for testing and does not work with historical run
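The election_id format in the table can be checked before invoking the CLI. The following is a minimal sketch, assuming the geography part consists of letters only; the function and pattern names are hypothetical and not part of the package.

```python
import re
from datetime import date

# YYYY-MM-DD_{geography}_{election_type}, with election type G (general) or P (primary).
# Assumption: geography is a run of letters such as a state postal code or USA.
ELECTION_ID_PATTERN = re.compile(r"^(\d{4})-(\d{2})-(\d{2})_([A-Za-z]+)_([GP])$")

def parse_election_id(election_id):
    """Split an election ID into its date, geography, and election type."""
    match = ELECTION_ID_PATTERN.match(election_id)
    if match is None:
        raise ValueError(f"Unrecognized election_id: {election_id!r}")
    year, month, day, geography, election_type = match.groups()
    return {
        # date() raises ValueError for impossible dates such as month 13.
        "date": date(int(year), int(month), int(day)),
        "geography": geography,
        "election_type": election_type,
    }

parsed = parse_election_id("2017-11-07_VA_G")
print(parsed)
```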

Note: When running the model with multiple fixed effects, make sure they are not linearly dependent. For example, county_fips and county_classification are linearly dependent when run together. That's because every county is in one county class, so all the fixed effect columns of the counties in the county class sum up to the fixed effect column of that county class.
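The dependence described in the note can be verified on toy data. The sketch below uses hypothetical precinct rows and plain Python lists rather than the model's own data structures. It builds 0/1 indicator columns for each county and each county class, then confirms that the county columns within a class add up to that class's column.

```python
# Hypothetical toy data: eight precincts, four counties, each county in one class.
rows = [
    ("51001", "urban"), ("51001", "urban"),
    ("51003", "urban"), ("51003", "urban"),
    ("51005", "rural"), ("51005", "rural"),
    ("51007", "rural"), ("51007", "rural"),
]

def indicator_columns(values):
    """Map each distinct level to its 0/1 indicator column."""
    return {level: [1 if v == level else 0 for v in values]
            for level in sorted(set(values))}

fips_cols = indicator_columns([fips for fips, _ in rows])
class_cols = indicator_columns([cls for _, cls in rows])

# Record which counties belong to each class.
counties_in_class = {}
for fips, cls in rows:
    counties_in_class.setdefault(cls, set()).add(fips)

def summed_county_columns(cls):
    """Element-wise sum of the indicator columns of every county in the class."""
    cols = [fips_cols[f] for f in sorted(counties_in_class[cls])]
    return [sum(vals) for vals in zip(*cols)]

# True when every class column is exactly the sum of its counties' columns,
# i.e. the two sets of fixed effects are linearly dependent.
dependent = all(summed_county_columns(cls) == class_cols[cls] for cls in class_cols)
print(dependent)
```

Because each class column can be reproduced from the county columns, a regression that includes both sets cannot identify their coefficients separately.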

Python

The ModelClient class and its get_estimates function generate estimates. You can install elex-model as a Python package and use this code snippet in other projects.

from elexmodel.client import ModelClient

# The arguments below are placeholders; define each one before calling.
model_client = ModelClient()
model_response = model_client.get_estimates(
  current_results,
  election_id,
  office,
  estimand,
  prediction_intervals,
  percent_reporting_threshold,
  geographic_unit_type,
)

Historical election

The HistoricalModelClient class and its get_historical_evaluation function run a historical evaluation. You can install elex-model as a Python package and use this code snippet in other projects.

from elexmodel.client import HistoricalModelClient

# The arguments below are placeholders; define each one before calling.
historical_model_client = HistoricalModelClient()
model_evaluation = historical_model_client.get_historical_evaluation(
  current_data,
  election_id,
  office,
  estimand,
  prediction_intervals,
  percent_reporting_threshold,
  geographic_unit_type,
)

Development

We welcome contributions to this repo. Please open a GitHub issue for any problems or comments you have.

Installation

Clone the repository and install the requirements:

  pip install -r requirements.txt
  pip install -r requirements-dev.txt

Create a .env file in the top directory and add the variables below. Assuming your S3 bucket and path root are both named elex-models, set them as follows:

  APP_ENV=local
  DATA_ENV=dev
  MODEL_S3_BUCKET=elex-models
  MODEL_S3_PATH_ROOT=elex-models
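To sanity-check a .env file before running the model, a few lines of Python can parse it into a dictionary. This is a minimal sketch of reading KEY=VALUE lines. How the package itself loads these variables is not stated here, and the function name is hypothetical.

```python
def parse_env_text(text):
    """Parse KEY=VALUE lines, skipping blank lines and # comments."""
    settings = {}
    for raw_line in text.splitlines():
        line = raw_line.strip()
        if not line or line.startswith("#"):
            continue
        # Split on the first "=" only, so values may themselves contain "=".
        key, _, value = line.partition("=")
        settings[key.strip()] = value.strip()
    return settings

example = """
APP_ENV=local
DATA_ENV=dev
MODEL_S3_BUCKET=elex-models
MODEL_S3_PATH_ROOT=elex-models
"""

settings = parse_env_text(example)
# Names of the four expected variables that are absent from the parsed file.
missing = {"APP_ENV", "DATA_ENV", "MODEL_S3_BUCKET", "MODEL_S3_PATH_ROOT"} - settings.keys()
print(settings, missing)
```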

Testing

pip install -r requirements-dev.txt
tox

We also have a requirements-test.txt file which is used for running unit tests only. It is installed automatically as part of installing requirements-dev.txt.

Precommit

To run precommit hooks for linting, run:

pre-commit run --all-files

Release

To release a new version:

Decide what the next version will be per semantic versioning: X.X.X
Make a new branch from develop called release/X.X.X
Update the version in setup.py
Update the changelog with all the changes that will be included in the release
Commit your updates and open a PR against main
Once the PR is merged, tag main (or develop for a beta release) with the version's release number (git tag X.X.X) and push that tag to GitHub (git push --tags)
Merge main into develop

After this is done, we need to get the new release into JFrog:

Make sure requirements-dev.txt is installed
Run python setup.py install bdist_wheel
Check that the correct version was built in the dist/ folder, which should now exist at the base of the repo folder. If you've previously run these commands locally for an earlier version, you may need to delete the older files in dist/ so that only the new version is uploaded in the next step. You can just delete the entire dist/ folder and run the above command again.
Run twine upload dist/* --repository-url REPOSITORY_URL -u USERNAME -p PASSWORD

Release history

Version	Release date
2.0.3	Nov 7, 2023
2.0.2	Nov 2, 2023
2.0.1	Oct 23, 2023
2.0.0	Oct 13, 2023
1.0.11	Aug 14, 2023
1.0.10	Jun 7, 2023
1.0.9	Mar 6, 2023
1.0.8	Jan 13, 2023
1.0.7	Nov 4, 2022
1.0.6	Oct 6, 2022
1.0.5	Sep 21, 2022
1.0.4	Sep 14, 2022 (this version)

Download files

Source Distribution

elex-model-1.0.4.tar.gz (29.0 kB), uploaded Sep 14, 2022

Built Distribution

elex_model-1.0.4-py2.py3-none-any.whl (36.6 kB), uploaded Sep 14, 2022, Python 2 and Python 3

Hashes for elex-model-1.0.4.tar.gz

Algorithm	Hash digest
SHA256	`a29b747b39c2304059122d55d21eb6d7051b85ca41be5745c84b7c0dbfe0e376`
MD5	`1f2de884d159e021fc3cd2d739c9515f`
BLAKE2b-256	`656581ab711c899cb24017add8253afb17639e29a0c18fc3bab7d8f704aaf03a`

Hashes for elex_model-1.0.4-py2.py3-none-any.whl

Algorithm	Hash digest
SHA256	`c6657dd990bb996144f280c9004262516da40e084587e62611ce89c79ba56f49`
MD5	`20da350e75e581378402bbf9eb3a4e2b`
BLAKE2b-256	`2f2cd0fccf47aedd75ab500a8508c7d4c39b807b5dc16aaae4b1c40f34b3b90a`