OCR for Japanese manga

These details have not been verified by PyPI

Project links

Homepage

GitHub Statistics

View statistics for this project via Libraries.io, or by using our public dataset on Google BigQuery

Project description

Manga OCR

Optical character recognition for Japanese text, with the main focus being Japanese manga. It uses a custom end-to-end model built with Transformers' Vision Encoder Decoder framework.

Manga OCR can be used as a general-purpose OCR for printed Japanese, but its main goal is high-quality text recognition that is robust to scenarios specific to manga:

both vertical and horizontal text
text with furigana
text overlaid on images
wide variety of fonts and font styles
low quality images

Unlike many OCR models, Manga OCR supports recognizing multi-line text in a single forward pass, so that text bubbles found in manga can be processed at once, without splitting them into lines.
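
As an illustration of the per-bubble workflow described above, here is a minimal sketch. It assumes the `MangaOcr` class from the project's README (not shown on this page), which is constructed once and then called with an image path. The helper `recognize_bubbles` and its `ocr` parameter are hypothetical names introduced only for this example.

```python
from pathlib import Path
from typing import Callable, Dict, Iterable, Optional


def recognize_bubbles(
    image_paths: Iterable[str],
    ocr: Optional[Callable[[str], str]] = None,
) -> Dict[str, str]:
    """Run OCR once per cropped text-bubble image; map file name -> text."""
    if ocr is None:
        # Assumption: the manga-ocr package is installed and exposes MangaOcr.
        # Imported lazily because constructing it loads the model, which is slow.
        from manga_ocr import MangaOcr

        ocr = MangaOcr()
    # One call per bubble: multi-line text is not split into lines beforehand.
    return {Path(p).name: ocr(str(p)) for p in image_paths}
```

Passing a callable as `ocr` lets the same model instance be reused across many pages, and makes the helper easy to exercise without loading the model.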

Project details

Release history

Version	Release date	Notes
0.1.11	Aug 27, 2023
0.1.10	May 7, 2023
0.1.9	May 7, 2023
0.1.8	Nov 5, 2022
0.1.7	Mar 9, 2022
0.1.6	Mar 9, 2022	yanked (reason: bug)
0.1.5	Jan 23, 2022
0.1.4	Jan 21, 2022
0.1.3	Jan 20, 2022
0.1.2	Jan 20, 2022	this version
0.1.1	Jan 17, 2022
0.1.0	Jan 17, 2022

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

manga-ocr-0.1.2.tar.gz (64.6 kB)

Uploaded Jan 20, 2022 Source

Built Distribution

manga_ocr-0.1.2-py3-none-any.whl (62.0 kB)

Uploaded Jan 20, 2022 Python 3

Hashes for manga-ocr-0.1.2.tar.gz

Algorithm	Hash digest
SHA256	`80aebcdac4394f15ddec974c11df93ad5595db79f6ef9c6591e8810ba786c271`
MD5	`c6b8d4206caaa84fd4440aca0f812031`
BLAKE2b-256	`799d313658e77d870d804367ada60bd2353e7893151de49ebe8d5fa003757dce`

Hashes for manga_ocr-0.1.2-py3-none-any.whl

Algorithm	Hash digest
SHA256	`5432b65de6dbd30560b64cc465dabf771232e85ff3cea446e32f5897360d4038`
MD5	`6fca001b537218c78798aeec75ec158e`
BLAKE2b-256	`4d92deab428f052bccd7d3f3b7cfb24e32dc256ed18574cd1cee26c894c96eae`
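
The digests listed above can be used to check a downloaded file before installing it. The following is a minimal sketch using only Python's standard `hashlib` module. The function names `sha256_hex` and `matches_published_digest` are hypothetical, and the local file path in the usage comment is an assumption about where the download was saved.

```python
import hashlib


def sha256_hex(path: str, chunk_size: int = 1 << 20) -> str:
    """Return the SHA256 hex digest of a file, reading it in chunks."""
    digest = hashlib.sha256()
    with open(path, "rb") as f:
        # iter() with a sentinel stops when read() returns empty bytes (EOF).
        for chunk in iter(lambda: f.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def matches_published_digest(path: str, expected_hex: str) -> bool:
    """Compare a file's SHA256 against a published hex digest."""
    return sha256_hex(path) == expected_hex.strip().lower()


# Usage (assumes the sdist was saved in the current directory):
# matches_published_digest(
#     "manga-ocr-0.1.2.tar.gz",
#     "80aebcdac4394f15ddec974c11df93ad5595db79f6ef9c6591e8810ba786c271",
# )
```

pip can perform an equivalent check itself when a requirements file pins hashes, so this sketch is mainly useful for manual downloads.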