
Python library for simplifying data science

Project description

Atlantis

Atlantis is a Python library that simplifies programming for data science.

Installation

You can install Atlantis with pip:

pip install atlantis

Modules

  • collections helps with working with collections.
  • colour simplifies using colours.
  • ds (datascience) provides tools for:
    • data wrangling,
    • validation,
    • tuning,
    • sampling,
    • evaluation,
    • clustering, and
    • parallel processing of machine learning models.
  • functions manages higher-order functions.
  • hash simplifies and standardizes hashing.
  • text makes working with texts and strings easy.
  • time provides:
    • methods for working with dates and times, and
    • progress bars.

collections

This module of the atlantis package helps with working with collections.

flatten

from atlantis.collections import flatten
flatten([1, 2, [3, 4, [5, 6], 7], 8])

returns: [1, 2, 3, 4, 5, 6, 7, 8]
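The behaviour shown above can be approximated in plain Python. The following is only a minimal sketch of recursive flattening, assuming that only nested lists are unpacked; `flatten_sketch` is a hypothetical name and this is not the atlantis implementation.

```python
def flatten_sketch(items):
    """Recursively flatten nested lists into a single flat list.

    Hypothetical stand-in for atlantis.collections.flatten; only `list`
    instances are unpacked, every other value is kept as-is.
    """
    flat = []
    for item in items:
        if isinstance(item, list):
            # Descend into the nested list and splice its elements in place.
            flat.extend(flatten_sketch(item))
        else:
            flat.append(item)
    return flat


print(flatten_sketch([1, 2, [3, 4, [5, 6], 7], 8]))
# prints [1, 2, 3, 4, 5, 6, 7, 8]
```

Strings are left whole in this sketch because they are not lists, which avoids splitting text into characters.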

List

This class inherits from Python's list class and adds a few extra methods.

from atlantis.collections import List
l = List(1, 2, 3, 4, 2, [1, 2], [1, 2])

Flattening:

l.flatten()
>>> List: [1, 2, 3, 4, 2, 1, 2, 1, 2]

Finding duplicates:

l.get_duplicates()
>>> List: [2, List: [1, 2]]

Note: the list elements of a List automatically get converted to Lists, recursively.
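To illustrate the ideas in this section, here is a minimal sketch of a list subclass that converts nested lists recursively and offers flattening and duplicate detection. `NestedList` is a hypothetical name chosen for this example, and the code is an assumption about how such a class could work, not the source of atlantis.collections.List.

```python
class NestedList(list):
    """Hypothetical stand-in for atlantis.collections.List."""

    def __init__(self, *items):
        # Plain nested lists become NestedList instances, recursively.
        super().__init__(
            NestedList(*item)
            if isinstance(item, list) and not isinstance(item, NestedList)
            else item
            for item in items
        )

    def flatten(self):
        """Return a new NestedList with every nesting level removed."""
        flat = NestedList()
        for item in self:
            if isinstance(item, list):
                flat.extend(NestedList(*item).flatten())
            else:
                flat.append(item)
        return flat

    def get_duplicates(self):
        """Return each element that occurs more than once, in first-repeat order."""
        seen, duplicates = [], NestedList()
        for item in self:
            # Equality checks (not hashing) are used so that nested lists,
            # which are unhashable, can be compared too.
            if item not in seen:
                seen.append(item)
            elif item not in duplicates:
                duplicates.append(item)
        return duplicates


l = NestedList(1, 2, 3, 4, 2, [1, 2], [1, 2])
print(l.flatten())         # prints [1, 2, 3, 4, 2, 1, 2, 1, 2]
print(l.get_duplicates())  # prints [2, [1, 2]]
```

The sketch compares elements with `==` rather than a set, which costs quadratic time but works for unhashable elements such as nested lists.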

ds (Data Science)

This module provides data science tools for:

  • data wrangling,
  • validation,
  • tuning,
  • sampling,
  • evaluation,
  • clustering, and
  • parallel processing of machine learning models.

KMeans Clustering

I have used the KMeans classes from both sklearn and pyspark and was frustrated by two problems: (a) even though the two classes do exactly the same thing, their interfaces are vastly different, and (b) some of the simplest operations are very hard to do with either class. I solved these problems by creating my own KMeans class, a wrapper around both of those classes that uses the appropriate one automatically, without complicating things for the data scientist.
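The wrapper idea can be sketched as a small facade that picks a backend from the type of the input data. Everything named here (`pick_backend`, `KMeansFacade`, the module-name heuristic) is a hypothetical illustration and an assumption, not the actual atlantis KMeans class; only the underlying sklearn and pyspark calls are real.

```python
def pick_backend(data):
    """Guess the backend from the module that defines the data's type.

    Assumption for this sketch: Spark DataFrames come from a module whose
    name starts with "pyspark"; everything else is handed to sklearn.
    """
    module = type(data).__module__
    return "pyspark" if module.startswith("pyspark") else "sklearn"


class KMeansFacade:
    """Hypothetical wrapper with one interface over two KMeans classes."""

    def __init__(self, n_clusters):
        self.n_clusters = n_clusters
        self.backend = None
        self.model = None

    def fit(self, data):
        self.backend = pick_backend(data)
        if self.backend == "pyspark":
            # Imported lazily so that pyspark is only required for Spark data.
            # pyspark's KMeans reads a vector column named "features" by default.
            from pyspark.ml.clustering import KMeans
            self.model = KMeans(k=self.n_clusters).fit(data)
        else:
            # Imported lazily so that sklearn is only required for in-memory data.
            from sklearn.cluster import KMeans
            self.model = KMeans(n_clusters=self.n_clusters).fit(data)
        return self


print(pick_backend([[0.0, 1.0], [2.0, 3.0]]))
# prints sklearn
```

Calling `KMeansFacade(n_clusters=3).fit(data)` would then hide the difference between the `k` and `n_clusters` parameters of the two libraries behind a single name.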

Clustering Optimization



