65 projects
sparkless
PySpark-like DataFrame API in Python—no JVM. Uses robin-sparkless (Rust/Polars) as the execution engine.
capper
Semantic, typed wrappers for Faker with automatic Polyfactory integration
polypolars
Generate Polars DataFrames using polyfactory for testing and development
polypandas
Generate pandas DataFrames using polyfactory for testing and development
semblance
Schema-driven REST API simulation with FastAPI, Pydantic, and Polyfactory
rapsqlite
True async SQLite — no fake async, no GIL stalls.
robin-pkg
A bare bones Python package
rapcsv
Streaming async CSV — no fake async, no GIL stalls.
rapfiles
True async filesystem I/O — no fake async, no GIL stalls.
formatparse
Parse strings using a specification based on the Python format() syntax (Rust implementation)
arrayops
Rust-backed acceleration for Python array.array
arraystream
Iterator-style transformations for numeric arrays, similar to itertools but for array.array
lavendertown
A Streamlit-first Python package for detecting and visualizing data quality issues
stringent
Parse strings into Pydantic models using pattern matching
sparkless-testing
Easy testing with sparkless or PySpark on demand
mock-spark
Lightning-fast PySpark testing without JVM - 10x faster with 100% API compatibility
cubchoo
Convert between Pandas and Polars schemas with ease
charmander
Convert between Polars schemas and PySpark schemas
rattata
Convert between Polars schemas and Python data structures (dataclasses, TypedDicts, namedtuples)
bulbasaur
Convert between PySpark schemas and SQLAlchemy/SQLModel classes
squirtle
Convert between Polars schemas and SQLAlchemy/SQLModel classes with ease
lugia
Universal schema converter for Python data types with optional dependencies
articuno
Convert Polars or Pandas DataFrames to lists of Pydantic models with schema inference
moltres
DataFrame API with SQL pushdown execution and real SQL CRUD - the missing layer for SQL in Python
raptors
Rust-powered, NumPy-compatible array library scaffold.
polarpandas
A pandas-compatible API layer built on top of Polars for high-performance data manipulation
pandalchemy
A package that integrates pandas and sqlalchemy with change tracking and optimized SQL operations.
pytest-green-light
Pytest plugin that gives SQLAlchemy async engines the green light - automatically fixes MissingGreenlet errors
transmutation
Comprehensive database migration and schema alteration tool using SQLAlchemy and Alembic
fullmetalalchemy
Easy-to-use helpers for SQL table changes with SQLAlchemy.
synnamon
Pure Python package for getting synonyms for words.
toarray
Small Python package for converting iterables to arrays.
csvalchemy
Read, validate, and write CSV files using Pydantic models with dydactic.
dydactic
Validate iterables using Pydantic
dictanykey
A dict that can use unhashable keys
heatfall
Easy to use functions for plotting heat maps of geographic data on static maps
landfall
Easy to use functions to plot geospatial data on maps using staticmaps.
trashpandas
Persistent Pandas DataFrame storage and retrieval using a SQL database, HDF5, CSV files, or pickle files.
tinytable
Pure Python lite version of Pandas DataFrame.
geodude
Calculate GeoHash functions using PyGeodesy
typedict
A dict that only uses types as keys. Converts keys used to their type.
subclassdict
A TypeDict that allows subclasses of type keys to be used as keys
polyspark
Generate PySpark DataFrames using polyfactory for testing and development
mutablekeysdict
Pure Python mutable keys dictionary class.
range-key-dict-2
A modern dict implementation supporting range-based keys with O(log M) lookups
hedron
A python package project for doing analysis on coordinates and clustering them.
poldantic
Convert Pydantic models to Polars schemas
abraxos
Upgrade your Pandas ETL process.
hasattrs
Check if objects have same attrs as collections.abc types.
tinytim
Pure Python data table functions.
spark-ddl-parser
Zero-dependency PySpark DDL schema parser
luxin
Interactive HTML tables with drill-down capabilities for exploring aggregated data
feathertail
A tiny, fast, Rust-backed transformation core for Python table data
csvdir
Read all csv files in a directory with one iterator.
freshspark
Create truly fresh local Spark sessions with isolated temp dirs and reliable teardown.
pydantable
Python package that uses pydantic to validate data in a data table.
fullmetalcopy
Functions for faster PostgreSQL bulk inserts by copying to sql table.
pgcopyinsert
Functions for faster PostgreSQL bulk inserts by copying to temp table then inserting from temp table.
aiocsv-utils
Utility functions for reading and writing CSV files asynchronously.
fullmetal-utils
Utilities for editing SQL databases
fullmetalpandas
An intuitive combination of Pandas and sqlalchemy to manipulate sql databases with Pandas.
elric
Edit sql tables using a simple table class with a sqlalchemy and alembic back end.
fullmetaltable
Edit sql tables using a simple table class with a sqlalchemy and alembic back end.
chaingang
Python class decorator that adds selection chaining