A (key-value) data-object-layer to get (pandas) tables from a variety of sources with ease

Project description

tabled

A (key-value) data-object-layer to get (pandas) tables from a variety of sources with ease

To install: pip install tabled

DfFiles

This notebook demonstrates how to use DfFiles to store and retrieve pandas DataFrames using various file formats.

Setup

First, let's import required packages and define our test data:

import os
import shutil
import tempfile

import pandas as pd
from tabled import DfFiles

# Test data dictionary
misc_small_dicts = {
    "fantasy_tavern_menu": {
        "item": ["Dragon Ale", "Elf Bread", "Goblin Stew"],
        "price": [7.5, 3.0, 5.5],
        "is_alcoholic": [True, False, False],
        "servings_left": [12, 25, 8],
    },
    "alien_abduction_log": {
        "abductee_name": ["Bob", "Alice", "Zork"],
        "location": ["Kansas City", "Roswell", "Jupiter"],
        "duration_minutes": [15, 120, 30],
        "was_returned": [True, False, True],
    }
}

Creating Test Directory

We'll create a temporary directory for our files:

def create_test_directory():
    # Create a directory for the test files
    rootdir = os.path.join(tempfile.gettempdir(), 'tabled_df_files_test')
    if os.path.exists(rootdir):
        shutil.rmtree(rootdir)
    os.makedirs(rootdir)
    print(f"Created directory at: {rootdir}")
    return rootdir

rootdir = create_test_directory()
print(f"Created directory at: {rootdir}")

Created directory at: /var/folders/mc/c070wfh51kxd9lft8dl74q1r0000gn/T/tabled_df_files_test
Created directory at: /var/folders/mc/c070wfh51kxd9lft8dl74q1r0000gn/T/tabled_df_files_test

Initialize DfFiles

Create a new DfFiles instance pointing to our directory:

df_files = DfFiles(rootdir)

Let's verify it starts empty:

list(df_files)

[]

Creating and Saving DataFrames

Let's create DataFrames from our test data:

fantasy_tavern_menu_df = pd.DataFrame(misc_small_dicts['fantasy_tavern_menu'])
alien_abduction_log_df = pd.DataFrame(misc_small_dicts['alien_abduction_log'])

print("Fantasy Tavern Menu:")
display(fantasy_tavern_menu_df)
print("\nAlien Abduction Log:")
display(alien_abduction_log_df)

Fantasy Tavern Menu:

	item	price	is_alcoholic	servings_left
0	Dragon Ale	7.5	True	12
1	Elf Bread	3.0	False	25
2	Goblin Stew	5.5	False	8

Alien Abduction Log:

	abductee_name	location	duration_minutes	was_returned
0	Bob	Kansas City	15	True
1	Alice	Roswell	120	False
2	Zork	Jupiter	30	True

Now let's save these DataFrames using different formats:

df_files['fantasy_tavern_menu.csv'] = fantasy_tavern_menu_df
df_files['alien_abduction_log.json'] = alien_abduction_log_df

Reading Data Back

Let's verify we can read the data back correctly:

saved_df = df_files['fantasy_tavern_menu.csv']
saved_df

	item	price	is_alcoholic	servings_left
0	Dragon Ale	7.5	True	12
1	Elf Bread	3.0	False	25
2	Goblin Stew	5.5	False	8

MutableMapping Interface

DfFiles implements the MutableMapping interface, making it behave like a dictionary.

Let's see how many files we have:

len(df_files)

List all available files:

list(df_files)

['fantasy_tavern_menu.csv', 'alien_abduction_log.json']

Check if a file exists:

'fantasy_tavern_menu.csv' in df_files

True

Supported File Extensions

Let's see what file formats DfFiles supports out of the box.

(Note that some of these will require installing extra packages, which you'll realize if you get an ImportError)

print("Encoder supported extensions:")
list_of_encoder_supported_extensions = list(df_files.extension_encoder_mapping)
print(*list_of_encoder_supported_extensions, sep=', ')

Encoder supported extensions:
csv, txt, tsv, json, html, p, pickle, pkl, npy, parquet, zip, feather, h5, hdf5, stata, dta, sql, sqlite, gbq, xls, xlsx, xml, orc

print("Decoder supported extensions:")
list_of_decoder_supported_extensions = list(df_files.extension_decoder_mapping)
print(*list_of_decoder_supported_extensions, sep=', ')

Decoder supported extensions:
csv, txt, tsv, parquet, json, html, p, pickle, pkl, xml, sql, sqlite, feather, stata, dta, sas, h5, hdf5, xls, xlsx, orc, sav

Testing Different Extensions

Let's try saving and loading our test DataFrame in different formats:

extensions_supported_by_encoder_and_decoder = (
    set(list_of_encoder_supported_extensions) & set(list_of_decoder_supported_extensions)
)
sorted(extensions_supported_by_encoder_and_decoder)

['csv',
 'dta',
 'feather',
 'h5',
 'hdf5',
 'html',
 'json',
 'orc',
 'p',
 'parquet',
 'pickle',
 'pkl',
 'sql',
 'sqlite',
 'stata',
 'tsv',
 'txt',
 'xls',
 'xlsx',
 'xml']

def test_extension(ext):
    filename = f'test_file.{ext}'
    try:
        df_files[filename] = fantasy_tavern_menu_df
        df_loaded = df_files[filename]
        # test the decoded df is the same as the one that was saved (round-trip test)
        # Note that we drop the index, since the index is not saved in the file by default for all codecs
        pd.testing.assert_frame_equal(
            fantasy_tavern_menu_df.reset_index(drop=True),
            df_loaded.reset_index(drop=True),
        )
        return True
    except Exception as e:
        return False


test_extensions = [
    'csv',
    'feather',
    'json',
    'orc',
    'parquet',
    'pkl',
    'tsv',  
    # 'dta',  # TODO: fix
    # 'h5',  # TODO: fix
    # 'html',  # TODO: fix
    # 'sql',  # TODO: fix
    # 'xml',  # TODO: fix
]

for ext in test_extensions:
    print("Testing extension:", ext)
    success = test_extension(ext)
    if success:
        print(f"\tExtension {ext}: ✓")
    else:
        print('\033[91m' + f"\tFix extension {ext}: ✗" + '\033[0m')
        
    # marker = '✓' if success else '\033[91m✗\033[0m'
    # print(f"\tExtension {ext}: {marker}")

Testing extension: csv
	Extension csv: ✓
Testing extension: feather
	Extension feather: ✓
Testing extension: json
	Extension json: ✓
Testing extension: orc
	Extension orc: ✓
Testing extension: parquet
	Extension parquet: ✓
Testing extension: pkl
	Extension pkl: ✓
Testing extension: tsv
	Extension tsv: ✓
Testing extension: dta
[91m	Fix extension dta: ✗[0m
Testing extension: h5
[91m	Fix extension h5: ✗[0m
Testing extension: html
[91m	Fix extension html: ✗[0m
Testing extension: sql
[91m	Fix extension sql: ✗[0m
Testing extension: xml
[91m	Fix extension xml: ✗[0m

Project details

Release history Release notifications | RSS feed

This version

0.1.26

Oct 6, 2025

0.1.25

Jul 1, 2025

0.1.24

Mar 31, 2025

0.1.23

Mar 18, 2025

0.1.22

Mar 18, 2025

0.1.21

Mar 17, 2025

0.1.20

Feb 21, 2025

0.1.19

Feb 21, 2025

0.1.18

Feb 19, 2025

0.1.17

Dec 17, 2024

0.1.16

Dec 17, 2024

0.1.15

Sep 21, 2024

0.1.14

Sep 20, 2024

0.1.13

Sep 19, 2024

0.1.12

Aug 9, 2024

0.1.11

Jun 27, 2024

0.1.10

Apr 15, 2024

0.1.9

Feb 12, 2024

0.1.8

Feb 9, 2024

0.1.7

Jan 26, 2024

0.1.6

Jan 24, 2024

0.1.5

Jan 24, 2024

0.1.4

Jan 19, 2024

0.1.3

Jan 18, 2024

0.1.2

Jan 17, 2024

0.1.1

Dec 22, 2023

0.1.0

May 7, 2023

0.0.5

Aug 31, 2021

0.0.4

Apr 5, 2021

0.0.3

Apr 5, 2021

0.0.2

Apr 5, 2021

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

tabled-0.1.26.tar.gz (47.6 kB view details)

Uploaded Oct 6, 2025 Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

The dropdown lists show the available interpreters, ABIs, and platforms. Enable javascript to be able to filter the list of wheel files.

tabled-0.1.26-py3-none-any.whl (50.8 kB view details)

Uploaded Oct 6, 2025 Python 3

File details

Details for the file tabled-0.1.26.tar.gz.

File metadata

Download URL: tabled-0.1.26.tar.gz
Upload date: Oct 6, 2025
Size: 47.6 kB
Tags: Source
Uploaded using Trusted Publishing? No
Uploaded via: twine/6.2.0 CPython/3.10.18

File hashes

Hashes for tabled-0.1.26.tar.gz
Algorithm	Hash digest
SHA256	`7076df384b6b7a4006a12604eb715ffe3b95a6e8d60dbc198a3188ab9c9f8e89`
MD5	`9e7057c1e14cdb958a5d2bd470c01edb`
BLAKE2b-256	`fe4bcfbab1a7727243d2914ae8bfaa88a5ee109a049a84b994442f0d15efdb6b`

See more details on using hashes here.

File details

Details for the file tabled-0.1.26-py3-none-any.whl.

File metadata

Download URL: tabled-0.1.26-py3-none-any.whl
Upload date: Oct 6, 2025
Size: 50.8 kB
Tags: Python 3
Uploaded using Trusted Publishing? No
Uploaded via: twine/6.2.0 CPython/3.10.18

File hashes

Hashes for tabled-0.1.26-py3-none-any.whl
Algorithm	Hash digest
SHA256	`db87184e8c841150b38cd248ae21e3e5930dc867da3d68e346bcdd6daeb79ac8`
MD5	`c88489b8d4a1b632728ef95cd2b55e7d`
BLAKE2b-256	`6267c4f2a603896b1085d8af6635a664db0ab19681384ba3c2d720d4a0f4d6ab`

See more details on using hashes here.

tabled 0.1.26

Navigation

Verified details

Maintainers

Unverified details

Project links

Meta

Project description

tabled

DfFiles

Setup

Creating Test Directory

Initialize DfFiles

Creating and Saving DataFrames

Reading Data Back

MutableMapping Interface

Supported File Extensions

Testing Different Extensions

Project details

Verified details

Maintainers

Unverified details

Project links

Meta

Release history Release notifications | RSS feed

Download files

Source Distribution

Built Distribution

File details

File metadata

File hashes

File details

File metadata

File hashes