Skip to main content

Tesserocr bindings

Project description

Segment region, line, recognize with tesserocr

https://travis-ci.org/OCR-D/ocrd_tesserocr.svg?branch=master Docker Automated build

Installation

Required ubuntu packages:

  • Tesseract headers (libtesseract-dev)

  • Some tesseract language models (tesseract-ocr-{eng,deu,frk,...} or script models (tesseract-ocr-script-{latn,frak,...})

  • Leptonica headers (libleptonica-dev)

pip install -r requirements
pip install .

If tesserocr fails to compile with an error::

$PREFIX/include/tesseract/unicharset.h:241:10: error: ‘string’ does not name a type; did you mean ‘stdin’?
       static string CleanupString(const char* utf8_str) {
              ^~~~~~
              stdin

This is due to some inconsistencies in the installed tesseract C headers (fix expected for next Ubuntu upgrade, already fixed for Debian). Replace string with std::string in $PREFIX/include/tesseract/unicharset.h:265:5: and $PREFIX/include/tesseract/unichar.h:164:10: ff.

If tesserocr fails with an error about LSTM/CUBE, you have a mismatch between tesseract header/data/pkg-config versions. apt policy libtesseract-dev lists the apt-installable versions, keep it consistent. Make sure there are no spurious pkg-config artifacts, e.g. in /usr/local/lib/pkgconfig/tesseract.pc. The same goes for language models.

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

ocrd_tesserocr-0.1.0.tar.gz (9.2 kB view details)

Uploaded Source

Built Distributions

If you're not sure about the file name format, learn more about wheel file names.

ocrd_tesserocr-0.1.0-py3-none-any.whl (15.4 kB view details)

Uploaded Python 3

ocrd_tesserocr-0.1.0-py2-none-any.whl (17.1 kB view details)

Uploaded Python 2

File details

Details for the file ocrd_tesserocr-0.1.0.tar.gz.

File metadata

  • Download URL: ocrd_tesserocr-0.1.0.tar.gz
  • Upload date:
  • Size: 9.2 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/1.11.0 pkginfo/1.4.2 requests/2.18.4 setuptools/5.5.1 requests-toolbelt/0.8.0 tqdm/4.23.3 CPython/2.7.9

File hashes

Hashes for ocrd_tesserocr-0.1.0.tar.gz
Algorithm Hash digest
SHA256 3fecd0a93d9a711552fbd2cf15af1f150f04f503f7b3f09d9c025267601bb42d
MD5 d45fa7a24f23d22313e4314df42cf984
BLAKE2b-256 eba766775daafba5937821fd643b6d1069570b262af3a48d701712d2a94350a2

See more details on using hashes here.

File details

Details for the file ocrd_tesserocr-0.1.0-py3-none-any.whl.

File metadata

  • Download URL: ocrd_tesserocr-0.1.0-py3-none-any.whl
  • Upload date:
  • Size: 15.4 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/1.11.0 pkginfo/1.4.2 requests/2.18.4 setuptools/5.5.1 requests-toolbelt/0.8.0 tqdm/4.23.3 CPython/2.7.9

File hashes

Hashes for ocrd_tesserocr-0.1.0-py3-none-any.whl
Algorithm Hash digest
SHA256 b2a7fd61a97bb222f2ac5a6f85b3d2ce43da843509993eef189f09b48f44027f
MD5 ad528712e13eecf578b236a7ab8457cd
BLAKE2b-256 4d48282d1d793137f1ec30118a9a0bd48534a6a8053bc74a830b6c4eb389653f

See more details on using hashes here.

File details

Details for the file ocrd_tesserocr-0.1.0-py2-none-any.whl.

File metadata

  • Download URL: ocrd_tesserocr-0.1.0-py2-none-any.whl
  • Upload date:
  • Size: 17.1 kB
  • Tags: Python 2
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/1.11.0 pkginfo/1.4.2 requests/2.18.4 setuptools/5.5.1 requests-toolbelt/0.8.0 tqdm/4.23.3 CPython/2.7.9

File hashes

Hashes for ocrd_tesserocr-0.1.0-py2-none-any.whl
Algorithm Hash digest
SHA256 64ec4e7a43ddaf199af7da8966996e260454dae4d30f79cb112149cddf5b8fd2
MD5 e12ea0e2f580c6e152d334c470029dc2
BLAKE2b-256 0763e617002f9c2013f8a9ce10baeab48acffc0dff3d21ab160ee67428e08ebd

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page