Optimus is the missing framework for cleaning and pre-processing data in a distributed fashion with pyspark.
Project description
Optimus is the missing framework to profile, clean, process and do ML in a distributed fashion using Apache Spark (PySpark).
Requirements:
Apache Spark>=2.3.0
Python>=3.6
Installation:
In your terminal just type:
$ pip install optimuspyspark
Contributors:
Core Team: Argenis León and Favio Vázquez.
License:
Apache 2.0 © Iron
Project details
Release history Release notifications | RSS feed
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
optimuspyspark-2.0.0.tar.gz
(29.5 kB
view hashes)
Built Distribution
Close
Hashes for optimuspyspark-2.0.0-py3-none-any.whl
Algorithm | Hash digest | |
---|---|---|
SHA256 | 0004d36b070c5cc65991f597196ba82256e3e6ac3b09a081d282f8a8783630e2 |
|
MD5 | cd856ca67323c21d6d40e08c54e710e7 |
|
BLAKE2b-256 | 5948bec8f3ad1b509b9fa172fad941099ed14502060a90c11db659c6755ca47b |