A framework that helps you train models, compare them, and track parameters and metrics along the way.

These details have not been verified by PyPI

Project links

Homepage

GitHub Statistics

View statistics for this project via Libraries.io, or by using our public dataset on Google BigQuery

Project description

🌿 Trava (originally short for TrainValidation)

A framework that helps you train models, compare them, and track parameters and metrics along the way. Works with tabular data only.

pip install trava

Why

When experimenting with data and models, notebooks quickly become messy and unreliable. When solving a problem, we usually focus on a particular set of metrics and want to compare models against each other. This library tries to provide a unified interface for this and other tasks.

Another important aspect is experiment tracking. Trava helps you track all model parameters as well as metrics. You can subclass TravaTracker to support your own tracking system. Trava currently ships with a ready-to-use MLFlowTracker.

How

You specify which metrics to calculate and how the results should be presented. Then you run Trava with a model of your choice and its parameters. The fit and predict process is customizable as well. See the examples/ directory for details. For now, only sklearn-style models (with fit, predict, and predict_proba methods) are supported.
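To make the "sklearn-style" requirement concrete, here is a minimal sketch of a class that exposes the three methods named above. MajorityClassifier is a hypothetical toy model written only for illustration; it is not part of Trava or sklearn, and whether Trava needs anything beyond these three methods is not confirmed here.

```python
from collections import Counter


class MajorityClassifier:
    """Hypothetical toy model: always predicts the most frequent training label."""

    def fit(self, X, y):
        counts = Counter(y)
        # Sorted class labels, mirroring sklearn's classes_ convention.
        self.classes_ = sorted(counts)
        # Class frequencies double as constant probability estimates.
        self.priors_ = [counts[c] / len(y) for c in self.classes_]
        self.majority_ = counts.most_common(1)[0][0]
        return self  # sklearn convention: fit returns the estimator itself

    def predict(self, X):
        # One label per input row.
        return [self.majority_ for _ in X]

    def predict_proba(self, X):
        # One row per sample, one column per class in classes_ order.
        return [list(self.priors_) for _ in X]


model = MajorityClassifier().fit([[0], [1], [2], [3]], [1, 0, 1, 1])
predictions = model.predict([[4], [5]])
probabilities = model.predict_proba([[4]])
```

The class relies on duck typing only, which is why no base class is needed to illustrate the interface.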

Example

Note: See examples/Basics.ipynb for the intro tour.

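import pandas as pd
import xgboost as xgb
from sklearn.metrics import precision_score, recall_score
# Trava names (sk, MetricsDictHandler, DataSplitConfig, ...) come from the trava package;
# see examples/Basics.ipynb for the exact import paths.

# which metrics to calculate; sk(...) wraps an sklearn metric, and custom metrics are supported as well
scorers = [sk(recall_score), sk(precision_score)]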

# how to show the metrics. In this case dictionary with metrics values will be returned
dict_handler = MetricsDictHandler(scorers=scorers)

# prepare data
df = pd.read_csv('...')

split_config = DataSplitConfig(split_logic=BasicSplitLogic(shuffle=True),
                               target_col_name='target',
                               test_size=0.3)
# just splits data into Train/Test
split_result = Splitter.split(df=df, config=split_config)

# initialize Trava
trava = TravaSV(results_handlers=[dict_handler])

# get your results
trava.fit_predict(raw_split_data=split_result, 
                  model_id='xgb',  # uniquely identifies your model
                  model_type=xgb.XGBClassifier,  # what model to run
                  model_init_params={'max_depth': 3})  # parameters to init model with

# then go on playing with other models 
...
# call this to get all previous results at once
trava.results
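For readers unfamiliar with the two metrics used in the example, the sketch below computes precision and recall by hand for binary labels. The helper name precision_recall and the toy labels are assumptions made for illustration; the function is not part of Trava. For binary labels with positive label 1, sklearn's precision_score and recall_score should give the same values.

```python
def precision_recall(y_true, y_pred, positive=1):
    """Return (precision, recall) for the given positive label."""
    pairs = list(zip(y_true, y_pred))
    # True positives: predicted positive and actually positive.
    tp = sum(1 for t, p in pairs if t == positive and p == positive)
    # False positives: predicted positive but actually negative.
    fp = sum(1 for t, p in pairs if t != positive and p == positive)
    # False negatives: predicted negative but actually positive.
    fn = sum(1 for t, p in pairs if t == positive and p != positive)
    precision = tp / (tp + fp) if tp + fp else 0.0
    recall = tp / (tp + fn) if tp + fn else 0.0
    return precision, recall


# Toy labels, made up for this sketch.
y_true = [1, 0, 1, 1, 0, 1]
y_pred = [1, 1, 0, 1, 0, 1]
precision, recall = precision_recall(y_true, y_pred)
```

With these toy labels there are 3 true positives, 1 false positive, and 1 false negative, so both metrics equal 0.75.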

Prerequisites

pandas
numpy
Python 3.7 (the true minimum version is not yet confirmed)

The library was written with Python 3.7; the minimum required Python version has not been determined yet.

It is also convenient to use the library with sklearn (e.g. to take metrics from there), and a couple of extensions are based on sklearn classes.

Project details

Release history

Version	Release date
0.2.12	Apr 25, 2021
0.2.11	Apr 12, 2021
0.2.10	Feb 19, 2021
0.2.9	Feb 18, 2021
0.2.8	Feb 17, 2021
0.2.7	Feb 17, 2021
0.2.6	Feb 17, 2021
0.2.5	Dec 29, 2020
0.2.4	Dec 24, 2020
0.2.3	Aug 27, 2020
0.2.2	Jun 14, 2020
0.2.1	Jun 7, 2020
0.2.0	May 24, 2020
0.1.4	May 14, 2020
0.1.3 (this version)	May 14, 2020
0.1.2	May 5, 2020
0.1.1	Apr 28, 2020

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

trava-0.1.3.tar.gz (305.8 kB)

Uploaded May 14, 2020 Source

Built Distribution

trava-0.1.3-py3-none-any.whl (43.7 kB)

Uploaded May 14, 2020 Python 3

Hashes for trava-0.1.3.tar.gz
Algorithm	Hash digest
SHA256	`7ae5dad8d0da39f3f7b0d32c25ebb891ea04e841b352c7ebc9c83108e1da2216`
MD5	`ddfc6fab0202df361d3be8f92cef2c76`
BLAKE2b-256	`661664607219532196b8586ab89bd7f8426f1af7a8293f9b003c914cc6b2bf3f`

Hashes for trava-0.1.3-py3-none-any.whl
Algorithm	Hash digest
SHA256	`b12587329417d4f71e4efdf4d432dcd4280b16701f4929c8f0ae9d4db3fef24f`
MD5	`3520f4e92b018a03c2474f13dc713cf6`
BLAKE2b-256	`c79bad4457483fe467c3b11d431b158a8be4c66bb1cd981667a6170ea04fdb42`