Skip to main content

Named Entity Recognition for DAnish based on Transformers

Project description

NERDA

Build status codecov PyPI PyPI - Downloads License

NERDA is not only a mesmerizing muppet-like character. NERDA is also a python package, that offers a complete framework for fine-tuning pretrained huggingface transformers for Named Entity Recognition (=NER) tasks.

Installation guide

pip install NERDA

NER tasks

Named Entity Recognition (NER) tasks are all about identifying and extracting names of named entitites from natural language texts.

Read more about NER on Wikipedia.

Performance

The table below summarizes the performance (=F1-scores) of the model configurations, that NERDA ships with.

Level MBERT DABERT ELECTRA XLMROBERTA DISTILMBERT
B-PER 0.92 0.93 0.92 0.94 0.89
I-PER 0.97 0.99 0.97 0.99 0.96
B-ORG 0.68 0.79 0.65 0.78 0.66
I-ORG 0.67 0.79 0.72 0.77 0.61
B-LOC 0.86 0.85 0.79 0.87 0.80
I-LOC 0.33 0.32 0.44 0.24 0.29
B-MISC 0.73 0.74 0.61 0.77 0.70
I-MISC 0.70 0.86 0.65 0.91 0.61
AVG_MICRO 0.81 0.85 0.79 0.86 0.78
AVG_MACRO 0.73 0.78 0.72 0.78 0.69

AVG_ stands for micro- and macro AVeraGed F1-scores.

'NERDA'?

'NERDA' originally stands for 'Named Entity Recognition for DAnish'. However, this is somewhat misleading, since the functionality is no longer limited to Danish. On the contrary it generalizes to all other languages, i.e. NERDA supports fine-tuning of transformer-based models for NER tasks for any arbitrary language.

Read more

The documentation for NERDA including code references and examples can be accessed here.

Contact

We hope, that you will find NERDA useful.

Please direct any questions and feedbacks to us!

If you want to contribute (which we encourage you to), open a PR.

If you encounter a bug or want to suggest an enhancement, please open an issue.

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

NERDA-0.0.19.tar.gz (11.5 kB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

NERDA-0.0.19-py3-none-any.whl (13.3 kB view details)

Uploaded Python 3

File details

Details for the file NERDA-0.0.19.tar.gz.

File metadata

  • Download URL: NERDA-0.0.19.tar.gz
  • Upload date:
  • Size: 11.5 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/3.3.0 pkginfo/1.6.1 requests/2.25.1 setuptools/47.1.0 requests-toolbelt/0.9.1 tqdm/4.54.1 CPython/3.7.9

File hashes

Hashes for NERDA-0.0.19.tar.gz
Algorithm Hash digest
SHA256 affd4a84de3eb094a56b1a6ee4cfbeca4eae80e8ceb484af06b7823f49a1b5d0
MD5 dfc78d27e8099e63ac7e259085515631
BLAKE2b-256 cf32402371a80da5761b8507c651cf3a6e600c1a972a1e172bd298272bbbd1af

See more details on using hashes here.

File details

Details for the file NERDA-0.0.19-py3-none-any.whl.

File metadata

  • Download URL: NERDA-0.0.19-py3-none-any.whl
  • Upload date:
  • Size: 13.3 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/3.3.0 pkginfo/1.6.1 requests/2.25.1 setuptools/47.1.0 requests-toolbelt/0.9.1 tqdm/4.54.1 CPython/3.7.9

File hashes

Hashes for NERDA-0.0.19-py3-none-any.whl
Algorithm Hash digest
SHA256 ee374b78ff4547112e014efe49789f81a7add19c94bacd8dc8afe44e749b2de0
MD5 ab7de930de21a0c694d7d13b29aee4e6
BLAKE2b-256 538de72cca5866f5b569d7d5bb7618784917e83c98529d6891d3f15a09e0fdad

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page