Skip to main content

TEA - Translation Engine Architect

Project description

# TEA - Translation Engine Architect

A command line tool to create translation engine.

## Install First install [pipx](https://github.com/pipxproject/pipx) then:

` pipx install pangeamt-tea `

## Usage

### Step 1: Create a new project

` tea new --customer customer --srcLang es --tgtLang en --flavor automotion --version 0.0.1 `

This command will create the project directory structure:

` ├── customer_es_en_automotion_0.0.1 │ ├── config.yml │ └── data `

Then enter in the directory

` cd customer_es_en_automotion_0.0.1 `

### Step 2: Configuration

#### Tokenizer

A tokenizer can be applied to source and target

` tea tokenizer --src mecab --tgt moses `

To list all available tokenizer:

` tea tokenizer --list `

#### Truecaser

` tea truecaser --src --tgt `

#### BPE tea bpe -s -t

data[‘processors’],

data[‘tokenizer’], data[‘truecaser’], data[‘bpe’], data[‘trainer’])

### Step 3: Copy some multilingual ressources (.tmx, bilingual files, .af ) into the ‘data’ directory

Project details


Release history Release notifications | RSS feed

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

pangeamt-tea-0.0.103.tar.gz (7.7 kB view hashes)

Uploaded Source

Built Distribution

pangeamt_tea-0.0.103-py3-none-any.whl (15.5 kB view hashes)

Uploaded Python 3

Supported by

AWS AWS Cloud computing and Security Sponsor Datadog Datadog Monitoring Fastly Fastly CDN Google Google Download Analytics Microsoft Microsoft PSF Sponsor Pingdom Pingdom Monitoring Sentry Sentry Error logging StatusPage StatusPage Status page