Skip to main content

Named Entity Recognition for DAnish based on Transformers

Project description

NERDA

Build status codecov PyPI PyPI - Downloads License

'NERDA' is not only a mesmerizing muppet-like character. 'NERDA' is also a python package, that offers a complete framework for fine-tuning of pretrained transformer-models for Named Entity Recognition (=NER) tasks.

Installation guide

pip install NERDA

NER tasks

Named Entity Recognition (NER) tasks are all about identifying and extracting names of named entitites from natural language texts.

Read more about NER on Wikipedia.

Performance

The table below summarizes the performance (=F1-scores) of the model configurations, that NERDA ships with.

| Level | MBERT | DABERT | ELECTRA | |---------------------------------------------------| | B-PER | 0.92 | 0.93 | 0.92 | | I-PER | 0.97 | 0.99 | 0.97 | | B-ORG | 0.68 | 0.79 | 0.65 | | I-ORG | 0.67 | 0.79 | 0.72 | | B-LOC | 0.86 | 0.85 | 0.79 | | I-LOC | 0.33 | 0.32 | 0.44 | | B-MISC | 0.73 | 0.74 | 0.61 | | I-MISC | 0.70 | 0.86 | 0.65 | | AVG_MICRO | 0.81 | 0.85 | 0.79 | | AVG_MACRO | 0.73 | 0.78 | 0.72 |

'NERDA'?

'NERDA' originally stands for 'Named Entity Recognition for DAnish'. However, this is somewhat misleading, since the functionality is no longer limited to Danish. On the contrary it generalizes to all other languages, i.e. NERDA supports fine-tuning of transformer-based models for NER tasks for any arbitrary language.

Read more

The documentation for NERDA including code references and examples can be accessed here.

Contact

We hope, that you will find NERDA useful.

Please direct any questions and feedbacks to us!

If you want to contribute (which we encourage you to), open a PR.

If you encounter a bug or want to suggest an enhancement, please open an issue.

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

NERDA-0.0.15.tar.gz (10.3 kB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

NERDA-0.0.15-py3-none-any.whl (11.6 kB view details)

Uploaded Python 3

File details

Details for the file NERDA-0.0.15.tar.gz.

File metadata

  • Download URL: NERDA-0.0.15.tar.gz
  • Upload date:
  • Size: 10.3 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/3.2.0 pkginfo/1.6.1 requests/2.25.1 setuptools/47.1.0 requests-toolbelt/0.9.1 tqdm/4.54.1 CPython/3.7.9

File hashes

Hashes for NERDA-0.0.15.tar.gz
Algorithm Hash digest
SHA256 eb58d867ca0c69203ec362a68977e8581fc0acaa2101f201e628ffe674138e4c
MD5 e0c90147aad2fcde2d87490122fc852d
BLAKE2b-256 f74139fc9d926ad159a41a67807fd8578b20174be127a2d8689532dbce64476b

See more details on using hashes here.

File details

Details for the file NERDA-0.0.15-py3-none-any.whl.

File metadata

  • Download URL: NERDA-0.0.15-py3-none-any.whl
  • Upload date:
  • Size: 11.6 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/3.2.0 pkginfo/1.6.1 requests/2.25.1 setuptools/47.1.0 requests-toolbelt/0.9.1 tqdm/4.54.1 CPython/3.7.9

File hashes

Hashes for NERDA-0.0.15-py3-none-any.whl
Algorithm Hash digest
SHA256 2cfa19007f36d39528fe8dea677cb54f769504eff53de41764995326c9e84251
MD5 9c98794fd0512b64c3e4d7a4b0d1adb5
BLAKE2b-256 9b76bbfbf1ea8f019aedaf43e2a75c3b3441989f18740892b23dff870d57614b

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page