Natural-Language-Toolkit for bahasa Malaysia, powered by Deep Learning Tensorflow.
Project description
.. figure:: https://raw.githubusercontent.com/DevconX/Malaya/master/session/towns-of-malaya.jpg
|Downloads| |Downloads GPU| |Latest Version| |Python Version| |MIT| |Build Status| |Documentation Status|
.. |Latest Version| image:: https://badge.fury.io/py/malaya.svg
:target: https://pypi.python.org/pypi/malaya
.. |MIT| image:: https://img.shields.io/badge/License-MIT-yellow.svg
:target: https://github.com/huseinzol05/Malaya/blob/master/LICENSE
.. |Python Version| image:: https://img.shields.io/pypi/pyversions/malaya.svg
:target: https://pypi.python.org/pypi/malaya
.. |Build Status| image:: https://travis-ci.org/huseinzol05/Malaya.svg?branch=master
:target: https://travis-ci.org/huseinzol05/Malaya
.. |Documentation Status| image:: https://readthedocs.org/projects/malaya/badge/?version=latest
:target: https://malaya.readthedocs.io/
Natural-Language-Toolkit for bahasa Malaysia, powered by Deep Learning
Tensorflow.
Documentation
--------------
Proper documentation is available at https://malaya.readthedocs.io/
Installing from the PyPI
----------------------------------
CPU version
::
$ pip install malaya
GPU version
::
$ pip install malaya-gpu
Only **Python 3.6.x and above** and **Tensorflow 1.X** are supported.
Features
--------
- **Emotion Analysis**
From BERT, Fast-Text, Dynamic-Memory Network, Sparse Tensorflow, Attention Neural Network to build deep emotion analysis models.
- **Entities Recognition**
Latest state-of-art CRF deep learning models to do Naming Entity Recognition.
- **Language Detection**
using Multinomial, SGD, XGB, Fast-text N-grams deep learning to distinguish Malay, English, and Indonesian.
- **Normalizer**
using local Malaysia NLP researches to normalize any
bahasa texts.
- **Num2Word**
Convert from numbers to cardinal or ordinal representation.
- **Part-of-Speech Recognition**
Latest state-of-art CRF deep learning models to do Naming Entity Recognition.
- **Dependency Parsing**
Latest state-of-art CRF deep learning models to do analyzes the grammatical structure of a sentence, establishing relationships between words.
- **ELMO (biLM)**
Provide pretrained bahasa wikipedia and bahasa news ELMO, with easy interface and visualization.
- **Sentiment Analysis**
From BERT, Fast-Text, Dynamic-Memory Network, Sparse Tensorflow, Attention Neural Network to build deep sentiment analysis models.
- **Spell Correction**
Using local Malaysia NLP researches to auto-correct any bahasa words.
- Stemmer
- **Subjectivity Analysis**
From BERT, Fast-Text, Dynamic-Memory Network, Sparse Tensorflow, Attention Neural Network to build deep subjectivity analysis models.
- **Summarization**
Using skip-thought with attention state-of-art to give precise unsupervised summarization.
- **Topic Modelling**
Provide LDA2Vec, LDA, NMF and LSA interface for easy topic modelling with topics visualization.
- **Toxicity Analysis**
From BERT, Fast-Text, Dynamic-Memory Network, Attention Neural Network to build deep toxicity analysis models.
- **Word2Vec**
Provide pretrained bahasa wikipedia and bahasa news Word2Vec, with easy interface and visualization.
- **Fast-text**
Provide pretrained bahasa wikipedia Fast-text, with easy interface and visualization.
License
--------
.. |License| image:: https://app.fossa.io/api/projects/git%2Bgithub.com%2Fhuseinzol05%2FMalaya.svg?type=large
:target: https://app.fossa.io/projects/git%2Bgithub.com%2Fhuseinzol05%2FMalaya?ref=badge_large
|License|
|Downloads| |Downloads GPU| |Latest Version| |Python Version| |MIT| |Build Status| |Documentation Status|
.. |Latest Version| image:: https://badge.fury.io/py/malaya.svg
:target: https://pypi.python.org/pypi/malaya
.. |MIT| image:: https://img.shields.io/badge/License-MIT-yellow.svg
:target: https://github.com/huseinzol05/Malaya/blob/master/LICENSE
.. |Python Version| image:: https://img.shields.io/pypi/pyversions/malaya.svg
:target: https://pypi.python.org/pypi/malaya
.. |Build Status| image:: https://travis-ci.org/huseinzol05/Malaya.svg?branch=master
:target: https://travis-ci.org/huseinzol05/Malaya
.. |Documentation Status| image:: https://readthedocs.org/projects/malaya/badge/?version=latest
:target: https://malaya.readthedocs.io/
Natural-Language-Toolkit for bahasa Malaysia, powered by Deep Learning
Tensorflow.
Documentation
--------------
Proper documentation is available at https://malaya.readthedocs.io/
Installing from the PyPI
----------------------------------
CPU version
::
$ pip install malaya
GPU version
::
$ pip install malaya-gpu
Only **Python 3.6.x and above** and **Tensorflow 1.X** are supported.
Features
--------
- **Emotion Analysis**
From BERT, Fast-Text, Dynamic-Memory Network, Sparse Tensorflow, Attention Neural Network to build deep emotion analysis models.
- **Entities Recognition**
Latest state-of-art CRF deep learning models to do Naming Entity Recognition.
- **Language Detection**
using Multinomial, SGD, XGB, Fast-text N-grams deep learning to distinguish Malay, English, and Indonesian.
- **Normalizer**
using local Malaysia NLP researches to normalize any
bahasa texts.
- **Num2Word**
Convert from numbers to cardinal or ordinal representation.
- **Part-of-Speech Recognition**
Latest state-of-art CRF deep learning models to do Naming Entity Recognition.
- **Dependency Parsing**
Latest state-of-art CRF deep learning models to do analyzes the grammatical structure of a sentence, establishing relationships between words.
- **ELMO (biLM)**
Provide pretrained bahasa wikipedia and bahasa news ELMO, with easy interface and visualization.
- **Sentiment Analysis**
From BERT, Fast-Text, Dynamic-Memory Network, Sparse Tensorflow, Attention Neural Network to build deep sentiment analysis models.
- **Spell Correction**
Using local Malaysia NLP researches to auto-correct any bahasa words.
- Stemmer
- **Subjectivity Analysis**
From BERT, Fast-Text, Dynamic-Memory Network, Sparse Tensorflow, Attention Neural Network to build deep subjectivity analysis models.
- **Summarization**
Using skip-thought with attention state-of-art to give precise unsupervised summarization.
- **Topic Modelling**
Provide LDA2Vec, LDA, NMF and LSA interface for easy topic modelling with topics visualization.
- **Toxicity Analysis**
From BERT, Fast-Text, Dynamic-Memory Network, Attention Neural Network to build deep toxicity analysis models.
- **Word2Vec**
Provide pretrained bahasa wikipedia and bahasa news Word2Vec, with easy interface and visualization.
- **Fast-text**
Provide pretrained bahasa wikipedia Fast-text, with easy interface and visualization.
License
--------
.. |License| image:: https://app.fossa.io/api/projects/git%2Bgithub.com%2Fhuseinzol05%2FMalaya.svg?type=large
:target: https://app.fossa.io/projects/git%2Bgithub.com%2Fhuseinzol05%2FMalaya?ref=badge_large
|License|
Project details
Release history Release notifications | RSS feed
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
malaya-2.3.3.tar.gz
(1.4 MB
view details)
Built Distribution
Filter files by name, interpreter, ABI, and platform.
If you're not sure about the file name format, learn more about wheel file names.
Copy a direct link to the current filters
File details
Details for the file malaya-2.3.3.tar.gz.
File metadata
- Download URL: malaya-2.3.3.tar.gz
- Upload date:
- Size: 1.4 MB
- Tags: Source
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/1.12.1 pkginfo/1.4.2 requests/2.20.1 setuptools/39.2.0 requests-toolbelt/0.8.0 tqdm/4.28.1 CPython/3.6.5
File hashes
| Algorithm | Hash digest | |
|---|---|---|
| SHA256 |
0298beb74275d60e07194ad2bfc7252e70b48da569cd50a593c71ed965bbb247
|
|
| MD5 |
4b0063c4084a53738451ea6dfebc2505
|
|
| BLAKE2b-256 |
8fe4a466b0dc6345dc43e1c8b57654c1ea96c13c55a17a28207ecf5f17e4e570
|
File details
Details for the file malaya-2.3.3-py3-none-any.whl.
File metadata
- Download URL: malaya-2.3.3-py3-none-any.whl
- Upload date:
- Size: 1.5 MB
- Tags: Python 3
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/1.12.1 pkginfo/1.4.2 requests/2.20.1 setuptools/39.2.0 requests-toolbelt/0.8.0 tqdm/4.28.1 CPython/3.6.5
File hashes
| Algorithm | Hash digest | |
|---|---|---|
| SHA256 |
151aeb08af80b7b9e08b6b23f161ea9894ebc078a0d5fd57450e5e240f49e6ed
|
|
| MD5 |
603b80d8d6beed39b0d245071ba18d13
|
|
| BLAKE2b-256 |
b3b5bf26b8d6cbdb4eb424a93b468385db3e845d3de1d1b2667283141d04887d
|