Benchmark interpretability methods.

BIM - Benchmark Interpretability Method

This repository contains the dataset, models, and metrics for benchmarking interpretability methods (BIM) described in the paper:

* Title: "BIM: Towards Quantitative Evaluation of Interpretability Methods with Ground Truth"
* Authors: Sherry (Mengjiao) Yang, Been Kim

Upon using this library, please cite:

@Article{BIM2019,
  title = {{BIM: Towards Quantitative Evaluation of Interpretability Methods with Ground Truth}},
  author = {Yang, Mengjiao and Kim, Been},
  year = {2019}
}

BIM datasets and models will be fully released by the end of June 2019.

Dataset

The core BIM datasets, obj and scene, are constructed by pasting object pixels from MSCOCO onto scene images from MiniPlaces. The obj set carries object labels and the scene set carries scene labels. In each set, val_loc.txt contains the bounding box (x_min, y_min, x_max, y_max) of each object, and val_mask contains the objects' binary masks.
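As a rough illustration of how the two annotation forms relate, the sketch below turns one bounding box into a binary mask. It assumes the four values are integer pixel coordinates, that the max edges are exclusive, and that images are indexed as (row, column). None of this is stated above, and `box_to_mask` is a hypothetical helper rather than part of the bim package.

```python
import numpy as np

def box_to_mask(box, height, width):
    """Return a (height, width) binary mask that is 1 inside the box.

    box is (x_min, y_min, x_max, y_max). Treating the max edges as
    exclusive is an assumption, not something the dataset documents.
    """
    x_min, y_min, x_max, y_max = box
    mask = np.zeros((height, width), dtype=np.uint8)
    mask[y_min:y_max, x_min:x_max] = 1  # rows are y, columns are x
    return mask

# Toy example: a 3-pixel-wide, 2-pixel-tall box in a 4x6 image.
mask = box_to_mask((2, 1, 5, 3), height=4, width=6)
```

The real val_mask files follow the object's outline rather than its bounding rectangle, so a mask built this way only approximates them.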

To compute the BIM metrics, we provide additional image sets described in the table below.

Download	Training	Validation	Usage	Description
obj	90,000	10,000	Model contrast	Objects and scenes with object labels
scene	90,000	10,000	Model contrast, Input dependence	Objects and scenes with scene labels
scene_only	90,000	10,000	Input dependence	Images in scene with objects removed
dog_bedroom	-	200	Relative model contrast	Dog in bedroom labeled as bedroom
bamboo_forest	-	100	Input independence	Scene-only bamboo forest
bamboo_forest_patch	-	100	Input independence	Bamboo forest with functionally insignificant dog patch

Models

As shown in the figure above, the obj model is trained on object labels and the scene model is trained on scene labels. We also provide the model trained on scene-only images and a set of models where the object occurs in a different number of classes. All models are in TensorFlow's SavedModel format.

Download | obj | scene | scene_only | scene1 | scene2 | scene3 | scene4 | scene5 | scene6 | scene7 | scene8 | scene9 | scene10
-- | -- | -- | -- | -- | -- | -- | -- | -- | -- | -- | -- | -- | --

Metrics

BIM metrics compare how interpretability methods perform across models (model contrast), across inputs to the same model (input dependence), and across functionally equivalent inputs to the same model (input independence).

Model contrast scores

Given images that contain both objects and scenes, model contrast measures the difference in attributions between the model trained on object labels and the model trained on scene labels.
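One way to read this definition is sketched below. It assumes each attribution is a 2-D saliency map and that the difference is taken over the object's binary mask. The function names are hypothetical, and the package's metrics.py may compute the score differently.

```python
import numpy as np

def region_mean(attribution, mask):
    """Average attribution over the pixels where mask is nonzero."""
    return float(attribution[mask.astype(bool)].mean())

def model_contrast(attr_obj_model, attr_scene_model, mask):
    """Object-region attribution under the obj model minus the scene model."""
    return region_mean(attr_obj_model, mask) - region_mean(attr_scene_model, mask)

# Toy 2x2 example: the object occupies the left column.
mask = np.array([[1, 0], [1, 0]])
attr_obj = np.array([[0.8, 0.1], [0.6, 0.1]])    # obj model attends to the object
attr_scene = np.array([[0.2, 0.5], [0.2, 0.5]])  # scene model attends elsewhere
score = model_contrast(attr_obj, attr_scene, mask)
```

Under this reading, a larger positive score means the method assigns more importance to the object for the model that was actually trained to recognize it.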

Input dependence rate

Given a model trained on scene labels, input dependence measures the fraction of inputs for which the object is attributed as less important than the same region when the object is absent.
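A minimal sketch of such a rate, assuming paired saliency maps for the same scene with and without the object, plus the object's binary mask. Comparing mean attribution inside the mask is an illustrative choice, and `input_dependence_rate` is a hypothetical name.

```python
import numpy as np

def input_dependence_rate(attrs_with_obj, attrs_scene_only, masks):
    """Fraction of images where the object region receives less attribution
    with the object present than the same region with the object absent."""
    hits = 0
    for with_obj, scene_only, mask in zip(attrs_with_obj, attrs_scene_only, masks):
        region = mask.astype(bool)
        if with_obj[region].mean() < scene_only[region].mean():
            hits += 1
    return hits / len(masks)

# Toy example with two images; the object occupies the left column.
mask = np.array([[1, 0], [1, 0]])
with_obj = [np.array([[0.1, 0.9], [0.1, 0.9]]),   # object region down-weighted
            np.array([[0.6, 0.2], [0.6, 0.2]])]   # object region up-weighted
scene_only = [np.array([[0.4, 0.4], [0.4, 0.4]]),
              np.array([[0.3, 0.3], [0.3, 0.3]])]
rate = input_dependence_rate(with_obj, scene_only, [mask, mask])
```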

Input independence rate

Given a model trained on scene-only images, input independence measures the fraction of inputs for which adding a functionally insignificant patch (e.g., a dog) does not significantly change the explanation.
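The sketch below shows one possible reading. It assumes paired saliency maps for each image with and without the patch, and it treats "significantly" as a change in mean attribution inside the patch region above a caller-supplied `threshold`. Both the threshold parameter and the function name are hypothetical; the text above does not say how significance is decided.

```python
import numpy as np

def input_independence_rate(attrs_clean, attrs_patched, masks, threshold):
    """Fraction of images where the mean attribution inside the patch region
    changes by less than `threshold` once the patch is added."""
    stable = 0
    for clean, patched, mask in zip(attrs_clean, attrs_patched, masks):
        region = mask.astype(bool)
        change = abs(patched[region].mean() - clean[region].mean())
        if change < threshold:
            stable += 1
    return stable / len(masks)

# Toy example with two images; the patch occupies the left column.
mask = np.array([[1, 0], [1, 0]])
clean = [np.full((2, 2), 0.2), np.full((2, 2), 0.2)]
patched = [np.array([[0.21, 0.2], [0.21, 0.2]]),  # nearly unchanged
           np.array([[0.7, 0.2], [0.7, 0.2]])]    # patch draws attribution
rate = input_independence_rate(clean, patched, [mask, mask], threshold=0.1)
```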

Examples

Run pip install bim to install the Python dependencies. You can either run download.sh to download the entire dataset and all models listed above, or follow the download link for a particular dataset or model and extract the tar.gz into the corresponding data or models directory.

To compute the model contrast scores (MCS) over 10 randomly sampled images, run:

python3 metrics.py --metrics=MCS --num_imgs=10

Computing saliency maps for a large number of input images can take a while, so we also provide precomputed attributions. To compute BIM metrics using the precomputed attributions, run:

python3 metrics.py --metrics=MCS --scratch=0
