Skip to main content

Distributed computing for everyone in modern Python.

Project description

achilles

Extremely accessible distributed/parallel computing (BYOB - Build Your Own Beowulf) in modern Python based on:

  • Twisted;
  • cloudpickle;
  • multiprocessing; and,
  • lessons learned from studying Hadoop, Apache Spark, Apache Arrow, Parallel Python, Ray, Jug, Pathos, Disco, SCOOP, Cluster-Workers and the questions asked regarding these projects on StackOverflow.

The goal of the achilles framework is to drastically reduce the barriers to entry for developers to use all of the computational power available on their networks and provide engingeers with an accessible, highly flexible framework for building distributed/parallel computing into their applications applications.

As achilles continues to be actively developed, the project will continue to be designed in alignment with our four demands for the project:

  • achilles must be simple to use;
  • achilles must 'just work' at all scales, from running all of the components on an individual's laptop to running on powerful clusters in distributed systems;
  • achilles must be scalable and accommodate 'big data' workloads (i.e. files too large to open in memory on a single machine);
  • achilles should be fast as hell and regularly benchmarked against alternative solutions; and,
  • achilles must be SECURE. Currently, achilles is not suitable for deployment outside of localhost. We are currently working on an SSH implementation.

Consists of:

  • achilles_server.py
    • Runs a Twisted TCP server listening at the endpoint described in the .env configuration file. Establishes and maintains persistent connections with multiple achilles_nodes (potentially running across numerous machines), distributes computations among them at the instruction of the achilles_controller, and sends the result(s) to the achilles_controller in the response_mode specified in achilles_compute().
  • achilles_node.py
    • Connect to the achilles_server and receive ID assignment. Wait for a job to be started, receive a function over the wire, receive an argument over the wire and then keep feeding the server results/receiving new arguments until all arguments are exhausted.
  • achilles_controller.py
    • Connect to the achilles_server and instruct it which function to perform against which arguments with which response_mode. Verify the job and wait for the results to be returned - achilles_server will do the rest.
    • Available commands in achilles_controller command interface:
      • achilles_compute, cluster_status, kill_cluster, help

Configuration files:

  • achilles_config.yaml
    • Use pyYaml to load the configuration file for a job into achilles_controller.py. See below for additional information on how to structure this file.
  • achilles_function.py
    • achilles_function() is cloudpickled and distributed to all connected clients as a part of the self.startJob() handshake initiated after achilles_controller.py verifies the commencement of a job.
  • .env
    • Use python-dotenv to load the local file into the system's environmental variables for configuration and authentication.
    • Basic security precaution. To-do: explore encryption.

Installation

pip install achilles

Usage

Deploy achilles server:

  1. from achilles.lineReceiver.achilles_server import genConfig, runAchillesServer
  2. genConfig() -> enter preferred settings or use default
  3. runAchillesServer() -> run an achilles_server using the specified HOST and PORT in the generated .env file

achilles is in the early stages of active development and your suggestions/contributions are kindly welcomed. achilles is written and maintained by Alejandro Peña. Email me at adpena<3gmail.com.

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

achilles-0.0.15.tar.gz (15.0 kB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

achilles-0.0.15-py3-none-any.whl (25.6 kB view details)

Uploaded Python 3

File details

Details for the file achilles-0.0.15.tar.gz.

File metadata

  • Download URL: achilles-0.0.15.tar.gz
  • Upload date:
  • Size: 15.0 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/1.13.0 pkginfo/1.5.0.1 requests/2.22.0 setuptools/41.0.1 requests-toolbelt/0.9.1 tqdm/4.32.2 CPython/3.7.3

File hashes

Hashes for achilles-0.0.15.tar.gz
Algorithm Hash digest
SHA256 ca16e99b522ed0cb5152faa9f8b3f0a443e08765004ae5678243d24c38316a22
MD5 32e00603cd6787abf41e8a4a9aaa6f7f
BLAKE2b-256 09412ce4ff81c6b23fd40e10b4143e52bb9eda54f1b5b61066177c97c2998388

See more details on using hashes here.

File details

Details for the file achilles-0.0.15-py3-none-any.whl.

File metadata

  • Download URL: achilles-0.0.15-py3-none-any.whl
  • Upload date:
  • Size: 25.6 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/1.13.0 pkginfo/1.5.0.1 requests/2.22.0 setuptools/41.0.1 requests-toolbelt/0.9.1 tqdm/4.32.2 CPython/3.7.3

File hashes

Hashes for achilles-0.0.15-py3-none-any.whl
Algorithm Hash digest
SHA256 a7fa67042a16fdfccb6cd5a701616ef550d94f5d2dbfaa9c10ce3d7679efec5f
MD5 051edde5bea6e8bc840ece309d2ca54a
BLAKE2b-256 ae46e1166ad042c70b023ef3e71d9c40826cecf238435ef07761bd892b68412b

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page