icdcodex

ICD embedding for machine learning, created for MedHacks2020 ❤️.

What is MedHacks?

MedHacks, hosted by Johns Hopkins University, aims to unite talented and diverse minds from all backgrounds in a collaborative environment to solve the world’s medical obstacles and issues.

The Problem

ICD coding is laborious but difficult to automate with machine learning because the output space is intractably large. (ICD-10-CM has over 70,000 codes.) icdcodex creates a vector embedding of these codes, making it simpler for machine learning practitioners to adapt algorithms for ICD coding efficiently.

Our Solution

We rely on the word2vec model to generate this embedding. In this setup, each ICD code represents a “word,” and a path sampled from the ICD hierarchy by breadth-first or depth-first search represents a “sentence.”
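
As a rough illustration of the "codes as words, paths as sentences" idea, the sketch below walks a small hand-written hierarchy depth-first and breadth-first and collects the visited codes as sentences. Everything here is an assumption made for illustration: `TOY_HIERARCHY`, `dfs_sentence` and `bfs_sentence` are hypothetical names, the hierarchy is a toy fragment, and none of it is taken from the icdcodex API.

```python
from collections import deque

# Hypothetical toy hierarchy (parent -> children). This is an assumption for
# illustration only, not the curated graph shipped with icdcodex.
TOY_HIERARCHY = {
    "root": ["A00-B99", "C00-D49"],
    "A00-B99": ["A00", "A01"],
    "C00-D49": ["C00"],
    "A00": ["A00.0", "A00.1"],
    "A01": [],
    "C00": [],
    "A00.0": [],
    "A00.1": [],
}


def dfs_sentence(tree, start):
    """Return the nodes reachable from `start` in depth-first (pre-order) order."""
    order, stack = [], [start]
    while stack:
        node = stack.pop()
        order.append(node)
        # Push children reversed so the first child is visited first.
        stack.extend(reversed(tree.get(node, [])))
    return order


def bfs_sentence(tree, start):
    """Return the nodes reachable from `start` in breadth-first order."""
    order, queue = [], deque([start])
    while queue:
        node = queue.popleft()
        order.append(node)
        queue.extend(tree.get(node, []))
    return order


# Each traversal yields one "sentence" whose "words" are ICD codes.
sentences = [
    dfs_sentence(TOY_HIERARCHY, "root"),
    bfs_sentence(TOY_HIERARCHY, "root"),
]
```

A list of token lists like `sentences` is the usual input shape for word2vec implementations, so it could then be handed to one for training. How icdcodex itself samples paths and trains the model may differ from this sketch.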

The Team

  • Jeremy Adams Fisher

  • Alhusain Abdalla

  • Natasha Nehra

  • Tejas Patel

  • Hamrish Saravanakumar

Features

  • Curated NetworkX graphs representing the ICD-9 and ICD-10 hierarchies

  • A simple API to generate continuous embeddings for these hierarchies

Credits

This package was created with Cookiecutter and the audreyr/cookiecutter-pypackage project template.

History

0.1.0 (2020-09-04)

  • First release on PyPI.

