flatland-rl

Multi Agent Reinforcement Learning on Trains

These details have been verified by PyPI

Maintainers

3dhelp aicrowd fnberta manuschn spMohanty

These details have not been verified by PyPI

Project links

Homepage

GitHub Statistics

View statistics for this project via Libraries.io, or by using our public dataset on Google BigQuery

Development Status
- 2 - Pre-Alpha
Intended Audience
- Developers
Natural Language
- English
Programming Language
- Python :: 3.6
- Python :: 3.7

Project description

Flatland is a toolkit for developing and comparing multi-agent reinforcement learning algorithms on grids. The base environment is a two-dimensional grid in which many agents can be placed. Each agent must solve one or more tasks in the grid world. In general, agents can navigate freely from cell to cell, but transition maps can restrict this navigation. Each cell can hold its own transition map. By default, a cell's transition map allows all transitions to its eight neighbor cells (up and left, up, up and right, right, down and right, down, down and left, left), so agents can move freely from cell to cell.
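The idea of a per-cell transition map can be sketched in a few lines of plain Python. This is only an illustration under stated assumptions: the names `ALL_MOVES`, `Grid`, `restrict` and `reachable` are hypothetical and are not part of the Flatland API, and the real implementation may encode transitions differently.

```python
# Hypothetical illustration; none of these names come from the Flatland API.
# The eight neighbor moves as (row_delta, col_delta), in the order listed above.
ALL_MOVES = {
    "up_left": (-1, -1), "up": (-1, 0), "up_right": (-1, 1),
    "right": (0, 1), "down_right": (1, 1), "down": (1, 0),
    "down_left": (1, -1), "left": (0, -1),
}


class Grid:
    def __init__(self, height, width):
        self.height = height
        self.width = width
        # Cells without an entry use the default map, which allows all moves.
        self.transition_maps = {}

    def restrict(self, cell, allowed_moves):
        """Give one cell its own transition map."""
        self.transition_maps[cell] = set(allowed_moves)

    def reachable(self, cell):
        """Return the neighbor cells an agent may move to from `cell`."""
        allowed = self.transition_maps.get(cell, set(ALL_MOVES))
        row, col = cell
        result = []
        for name, (d_row, d_col) in ALL_MOVES.items():
            if name not in allowed:
                continue
            new_row, new_col = row + d_row, col + d_col
            # Moves that would leave the grid are dropped.
            if 0 <= new_row < self.height and 0 <= new_col < self.width:
                result.append((new_row, new_col))
        return result


grid = Grid(3, 3)
print(len(grid.reachable((1, 1))))   # default map: all eight neighbors
grid.restrict((1, 1), ["up", "down"])
print(grid.reachable((1, 1)))        # only the cells above and below remain
```

Storing only the restricted cells keeps the default case cheap: a grid with no custom maps needs no per-cell data at all.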

The implementation is general-purpose: it can be used to build any kind of two-dimensional grid-based environment, and it suits many learning tasks in which a two-dimensional grid can serve as the base of the environment.

Flatland delivers a Python implementation that can be easily extended, and it provides different baselines for different environments. Each environment poses an interesting task to solve. For example, the multi-agent navigation task for railway train dispatching can be extended or adapted to the airplane landing problem, and it can further serve as the basic implementation for many other tasks in transportation and logistics.

Mapping a railway infrastructure onto a grid world is an excellent example of how the movement of an agent must be restricted. Because trains normally cannot run backwards and must follow the rails, the transition from one cell to the next also depends on the train's orientation, that is, its direction of travel. Trains can change their traveling path only at switches. There are two variants of switches. The first is the splitting switch, where trains can change rails and, in consequence, their traveling path. The second is the fusion switch, where two rails come together and trains can change their sequence. Thus, the navigation behavior of a train is very restricted, and railway planning, in which many agents share the same infrastructure, is a very complex problem.
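A minimal sketch of orientation-dependent transitions follows. It assumes four headings and a per-cell table that maps the heading a train arrives with to the headings it may leave with. The names `STRAIGHT_NS`, `SWITCH` and `next_positions` are hypothetical and do not reflect how Flatland itself stores rail transitions.

```python
# Hypothetical illustration; this is not Flatland's internal rail encoding.
NORTH, EAST, SOUTH, WEST = range(4)
DELTA = {NORTH: (-1, 0), EAST: (0, 1), SOUTH: (1, 0), WEST: (0, -1)}

# A straight north-south rail: a train keeps its heading and cannot reverse.
STRAIGHT_NS = {NORTH: {NORTH}, SOUTH: {SOUTH}}

# A switch whose branch leaves towards the east.
# Heading north it acts as a splitting switch (continue north or branch east).
# Heading south or west it acts as a fusion switch (both rails merge southwards).
SWITCH = {NORTH: {NORTH, EAST}, SOUTH: {SOUTH}, WEST: {SOUTH}}


def next_positions(cell_transitions, position, heading):
    """Return the (position, heading) pairs a train can reach in one step."""
    row, col = position
    moves = []
    for new_heading in sorted(cell_transitions.get(heading, set())):
        d_row, d_col = DELTA[new_heading]
        moves.append(((row + d_row, col + d_col), new_heading))
    return moves


print(next_positions(SWITCH, (5, 5), NORTH))      # two options: the train may choose
print(next_positions(SWITCH, (5, 5), WEST))       # one option: rails merge
print(next_positions(STRAIGHT_NS, (5, 5), EAST))  # no option: no rail in that direction
```

Keying the table on the incoming heading is what makes the same cell behave as a splitting switch in one direction and as a fusion switch in the other.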

Furthermore, each train has a departure location from which it cannot depart earlier than the committed departure time, and it must arrive at its destination no later than the committed arrival time. This makes the whole planning problem even more complex. In such an environment cooperation is essential: agents must learn to cooperate so that all trains (agents) arrive on time.
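The timing constraints described above can be written down as a simple check. The sketch below assumes integer time steps and uses hypothetical names (`Commitment`, `is_on_time`, `all_on_time`); it only illustrates the constraint and is not taken from the Flatland code.

```python
from typing import Dict, NamedTuple, Tuple


class Commitment(NamedTuple):
    """Hypothetical record of one train's committed times (in time steps)."""
    earliest_departure: int
    latest_arrival: int


def is_on_time(commitment: Commitment, departed_at: int, arrived_at: int) -> bool:
    """A train is on time if it leaves no earlier and arrives no later than committed."""
    return (departed_at >= commitment.earliest_departure
            and arrived_at <= commitment.latest_arrival)


def all_on_time(commitments: Dict[int, Commitment],
                realized: Dict[int, Tuple[int, int]]) -> bool:
    """Check every agent; `realized` maps agent handle to (departed_at, arrived_at)."""
    return all(is_on_time(commitments[agent], *realized[agent])
               for agent in commitments)


commitments = {0: Commitment(0, 20), 1: Commitment(5, 30)}
print(all_on_time(commitments, {0: (0, 18), 1: (6, 30)}))  # both trains meet their times
print(all_on_time(commitments, {0: (0, 18), 1: (6, 31)}))  # train 1 arrives one step late
```

Because the result is a conjunction over all agents, a single late train makes the whole schedule fail, which is why cooperation matters.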

Getting Started

Online Docs

The documentation for the latest code on the master branch is found at: flatland-rl-docs

The documentation includes a few tutorials at: Getting Started

Run Example Notebooks with One Click

Under getting_started, there are two scripts:

getting_started/run_notebooks.bat
getting_started/run_notebooks.sh

They require git and Python>=3.6 installed with venv (under Linux, python3-venv has to be installed). They create a virtual environment, install Flatland and all dependencies into it, and start the Jupyter notebooks in your browser.

Generate Docs

The docs have a lot more details about how to interact with this codebase.

git clone git@gitlab.aicrowd.com:flatland/flatland.git
cd flatland
pip install -r requirements_dev.txt

On Linux and macOS
```
make docs
```

On Windows

python setup.py develop (or)
python setup.py install
python make_docs.py

Installation

Stable Release

To install flatland, run this command in your terminal:

pip install flatland-rl

This is the preferred method to install flatland, as it will always install the most recent stable release.

If you don’t have pip installed, this Python installation guide can guide you through the process.

From Sources

The sources for flatland can be downloaded from the GitLab repo.

You can clone the public repository:

$ git clone git@gitlab.aicrowd.com:flatland/flatland.git

Once you have a copy of the source, you can install it with:

$ python setup.py install

Basic Usage

Basic usage of the RailEnv environment used by the Flatland Challenge:

import time

import numpy as np

from flatland.envs.generators import complex_rail_generator
from flatland.envs.rail_env import RailEnv
from flatland.utils.rendertools import RenderTool

# Build a 7x7 rail environment with two agents.
env = RailEnv(
    width=7,
    height=7,
    rail_generator=complex_rail_generator(
        nr_start_goal=10,
        nr_extra=1,
        min_dist=8,
        max_dist=99999,
        seed=0),
    number_of_agents=2)

env_renderer = RenderTool(env, gl="PILSVG")

for step in range(100):
    # Pick a random action in [0, 5) for each of the two agents.
    obs, all_rewards, done, _ = env.step({
        0: np.random.randint(0, 5),
        1: np.random.randint(0, 5),
    })
    print("Rewards: {}, [done={}]".format(all_rewards, done))
    env_renderer.renderEnv(show=True, frames=False, show_observations=False)
    time.sleep(0.3)
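The loop above spells out one dictionary entry per agent. For a different number of agents, the action dictionary can be built with a small helper. The sketch below is an assumption-laden illustration: `random_actions` is a hypothetical helper, not a Flatland function, and the five-action range is inferred only from the `randint(0, 5)` calls above. It uses the standard library `random` module so it runs without Flatland installed.

```python
import random


def random_actions(num_agents, num_actions=5, rng=None):
    """Return {agent_handle: action} with one random action per agent.

    Hypothetical helper; `num_actions=5` mirrors the randint(0, 5) calls above.
    """
    rng = rng or random.Random()
    return {handle: rng.randrange(num_actions) for handle in range(num_agents)}


# A seeded generator makes the sampled actions reproducible between runs.
actions = random_actions(num_agents=4, rng=random.Random(0))
print(sorted(actions))   # agent handles 0 to 3
print(all(0 <= a < 5 for a in actions.values()))
```

The resulting dictionary has the same shape as the one passed to `env.step` in the example above.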

Authors

Sharada Mohanty <mohanty@aicrowd.com>
Giacomo Spigler <giacomo.spigler@gmail.com>
Mattias Ljungström
Jeremy Watson
Erik Nygren <erik.nygren@sbb.ch>
Adrian Egli <adrian.egli@sbb.ch>
Christian Eichenberger <christian.markus.eichenberger@sbb.ch>
Guillaume Mollard <guillaume.mollard2@gmail.com>

Acknowledgements

Vaibhav Agrawal <theinfamouswayne@gmail.com>
Anurag Ghosh


Release history

4.0.3

Apr 23, 2024

4.0.2

Apr 23, 2024

4.0.1

Oct 30, 2023

4.0.0

Oct 27, 2023

3.0.15

Jan 31, 2022

3.0.14

Jan 30, 2022

3.0.13

Jan 24, 2022

3.0.12

Jan 17, 2022

3.0.11

Jan 12, 2022

3.0.10

Jan 12, 2022

3.0.9

Jan 3, 2022

3.0.8

Dec 7, 2021

3.0.7

Dec 1, 2021

3.0.6

Nov 19, 2021

3.0.5

Nov 15, 2021

3.0.4

Nov 8, 2021

3.0.3

Nov 1, 2021

3.0.2

Oct 25, 2021

3.0.1

Sep 25, 2021

3.0.0

Sep 17, 2021

3.0.0rc1 pre-release

Sep 4, 2021

2.2.2

Sep 16, 2020

2.2.1

Jun 12, 2020

2.2.0

Jun 7, 2020

2.1.10

Nov 6, 2019

2.1.9

Nov 6, 2019

2.1.8

Oct 24, 2019

2.1.7

Oct 18, 2019

2.1.6

Oct 14, 2019

2.1.5

Oct 13, 2019

2.1.4

Oct 10, 2019

2.1.3

Oct 10, 2019

2.1.2

Oct 9, 2019

2.1.1

Oct 9, 2019

2.1.0

Oct 9, 2019

2.0.0

Sep 6, 2019

0.3.10

Jul 30, 2019

0.3.9

Jul 30, 2019

0.3.8

Jul 30, 2019

0.3.6

Jul 30, 2019

0.3.5

Jul 29, 2019

0.3.4

Jul 29, 2019

0.3.3

Jul 29, 2019

0.3.2

Jul 26, 2019

0.3.1

Jul 25, 2019

0.3.0

Jul 24, 2019

0.2.0

Jul 8, 2019

This version

0.1.2

Jul 3, 2019

0.1.1

Apr 3, 2019

0.1.0

Apr 3, 2019

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

flatland-rl-0.1.2.tar.gz (2.1 MB)

Uploaded Jul 3, 2019 Source

Hashes for flatland-rl-0.1.2.tar.gz
Algorithm	Hash digest
SHA256	`f7b62e4b3936147ca12af864b54e441c85cc26f4076aebef07534820fe362c85`
MD5	`5dae09c28a7d04558cde86ad0bba88d6`
BLAKE2b-256	`c3c805eb63985624a654cdd0ec734b8e349974fbe53515f8c21738bdb8df3262`