PyTorch dataset wrapper for the
Project description
adult-dataset
A PyTorch dataset wrapper for the Adult (Census Income) dataset. Adult is a popular dataset in machine learning fairness research.
This package provides the adult.Adult
class:
atorch.utils.data.Datasets
loading and, optionally, downloading the
Adult dataset.
It can be used like the MNIST
dataset in
torchvision.
Beyond adult.Adult
, this package also provides adult.AdultRaw
,
which works just as adult.Adult
, but
does not standardize the features in the dataset and does not apply one-hot encoding.
Installation
pip install adult-dataset
Basic Usage
from adult import Adult
# load (if necessary, download) the Adult training dataset
train_set = Adult(root="datasets", download=True)
# load the test set
test_set = Adult(root="datasets", train=False, download=True)
inputs, target = train_set[0] # retrieve the first sample of the training set
# iterate over the training set
for inputs, target in iter(train_set):
... # Do something with a single sample
# use a PyTorch data loader
from torch.utils.data import DataLoader
loader = DataLoader(test_set, batch_size=32, shuffle=True)
for epoch in range(100):
for inputs, targets in iter(loader):
... # Do something with a batch of samples
Advanced Usage
Turn off status messages while downloading the dataset:
Adult(root=..., output_fn=None)
Use the logging
module for logging status messages while downloading the
dataset instead of placing the status messages on sys.stdout
.
import logging
Adult(root=..., output_fn=logging.info)
Project details
Release history Release notifications | RSS feed
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
Built Distribution
Hashes for adult_dataset-3.0.0-py3-none-any.whl
Algorithm | Hash digest | |
---|---|---|
SHA256 | 8d711c0ce7cd1b07c7c02f6c7e14b8603bed0ec173ca4e3b7c20cca1fb4a0ba2 |
|
MD5 | 7840bb17e68f17ecb9078bbd781ce254 |
|
BLAKE2b-256 | 358242bdf580e63e70c68c7c6f6c027cb6a49ceca128521750041fda9a11ed41 |