Skip to main content

Natural-Language-Toolkit for bahasa Malaysia, powered by Deep Learning Tensorflow.

Project description

.. figure:: https://raw.githubusercontent.com/DevconX/Malaya/master/session/towns-of-malaya.jpg

|Downloads| |Downloads GPU| |Latest Version| |Python Version| |MIT| |Build Status| |Documentation Status|

.. |Latest Version| image:: https://badge.fury.io/py/malaya.svg
:target: https://pypi.python.org/pypi/malaya
.. |MIT| image:: https://img.shields.io/badge/License-MIT-yellow.svg
:target: https://github.com/huseinzol05/Malaya/blob/master/LICENSE
.. |Python Version| image:: https://img.shields.io/pypi/pyversions/malaya.svg
:target: https://pypi.python.org/pypi/malaya
.. |Build Status| image:: https://travis-ci.org/huseinzol05/Malaya.svg?branch=master
:target: https://travis-ci.org/huseinzol05/Malaya
.. |Documentation Status| image:: https://readthedocs.org/projects/malaya/badge/?version=latest
:target: https://malaya.readthedocs.io/

Natural-Language-Toolkit for bahasa Malaysia, powered by Deep Learning
Tensorflow.

Documentation
--------------

Proper documentation is available at https://malaya.readthedocs.io/

Installing from the PyPI
----------------------------------

CPU version
::

$ pip install malaya

GPU version
::

$ pip install malaya-gpu

Only **Python 3.6.x and above** and **Tensorflow 1.X** are supported.

Features
--------

- **Emotion Analysis**

From BERT, Fast-Text, Dynamic-Memory Network, Sparse Tensorflow, Attention Neural Network to build deep emotion analysis models.
- **Entities Recognition**

Latest state-of-art CRF deep learning models to do Naming Entity Recognition.

- **Language Detection**

using Multinomial, SGD, XGB, Fast-text N-grams deep learning to distinguish Malay, English, and Indonesian.
- **Normalizer**

using local Malaysia NLP researches to normalize any
bahasa texts.
- **Num2Word**

Convert from numbers to cardinal or ordinal representation.
- **Part-of-Speech Recognition**

Latest state-of-art CRF deep learning models to do Naming Entity Recognition.
- **Dependency Parsing**

Latest state-of-art CRF deep learning models to do analyzes the grammatical structure of a sentence, establishing relationships between words.
- **ELMO (biLM)**

Provide pretrained bahasa wikipedia and bahasa news ELMO, with easy interface and visualization.
- **Sentiment Analysis**

From BERT, Fast-Text, Dynamic-Memory Network, Sparse Tensorflow, Attention Neural Network to build deep sentiment analysis models.
- **Spell Correction**

Using local Malaysia NLP researches to auto-correct any bahasa words.
- Stemmer
- **Subjectivity Analysis**

From BERT, Fast-Text, Dynamic-Memory Network, Sparse Tensorflow, Attention Neural Network to build deep subjectivity analysis models.
- **Summarization**

Using skip-thought with attention state-of-art to give precise unsupervised summarization.
- **Topic Modelling**

Provide LDA2Vec, LDA, NMF and LSA interface for easy topic modelling with topics visualization.
- **Toxicity Analysis**

From BERT, Fast-Text, Dynamic-Memory Network, Attention Neural Network to build deep toxicity analysis models.
- **Word2Vec**

Provide pretrained bahasa wikipedia and bahasa news Word2Vec, with easy interface and visualization.
- **Fast-text**

Provide pretrained bahasa wikipedia Fast-text, with easy interface and visualization.

License
--------

.. |License| image:: https://app.fossa.io/api/projects/git%2Bgithub.com%2Fhuseinzol05%2FMalaya.svg?type=large
:target: https://app.fossa.io/projects/git%2Bgithub.com%2Fhuseinzol05%2FMalaya?ref=badge_large

|License|


Project details


Release history Release notifications | RSS feed

This version

2.3.3

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

malaya-2.3.3.tar.gz (1.4 MB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

malaya-2.3.3-py3-none-any.whl (1.5 MB view details)

Uploaded Python 3

File details

Details for the file malaya-2.3.3.tar.gz.

File metadata

  • Download URL: malaya-2.3.3.tar.gz
  • Upload date:
  • Size: 1.4 MB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/1.12.1 pkginfo/1.4.2 requests/2.20.1 setuptools/39.2.0 requests-toolbelt/0.8.0 tqdm/4.28.1 CPython/3.6.5

File hashes

Hashes for malaya-2.3.3.tar.gz
Algorithm Hash digest
SHA256 0298beb74275d60e07194ad2bfc7252e70b48da569cd50a593c71ed965bbb247
MD5 4b0063c4084a53738451ea6dfebc2505
BLAKE2b-256 8fe4a466b0dc6345dc43e1c8b57654c1ea96c13c55a17a28207ecf5f17e4e570

See more details on using hashes here.

File details

Details for the file malaya-2.3.3-py3-none-any.whl.

File metadata

  • Download URL: malaya-2.3.3-py3-none-any.whl
  • Upload date:
  • Size: 1.5 MB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/1.12.1 pkginfo/1.4.2 requests/2.20.1 setuptools/39.2.0 requests-toolbelt/0.8.0 tqdm/4.28.1 CPython/3.6.5

File hashes

Hashes for malaya-2.3.3-py3-none-any.whl
Algorithm Hash digest
SHA256 151aeb08af80b7b9e08b6b23f161ea9894ebc078a0d5fd57450e5e240f49e6ed
MD5 603b80d8d6beed39b0d245071ba18d13
BLAKE2b-256 b3b5bf26b8d6cbdb4eb424a93b468385db3e845d3de1d1b2667283141d04887d

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page