Page segmentation and segmentation evaluation
Project description
ocrd_segment
This repository aims to provide a number of OCR-D-compliant processors for layout analysis and evaluation.
Installation
In your virtual environment, run:
pip install .
Usage
- exporting page images (including results from preprocessing like cropping/masking, deskewing, dewarping or binarization) along with region polygon coordinates and metadata, also MS-COCO:
- exporting region images (including results from preprocessing like cropping/masking, deskewing, dewarping or binarization) along with region polygon coordinates and metadata:
- exporting line images (including results from preprocessing like cropping/masking, deskewing, dewarping or binarization) along with line polygon coordinates and metadata:
- importing layout segmentations from other formats (mask images, MS-COCO JSON annotation):
- repairing layout segmentations (input file groups N >= 1, based on heuristics implemented using Shapely):
- ocrd-segment-repair :construction: (much to be done)
- comparing different layout segmentations (input file groups N = 2, compute the distance between two segmentations, e.g. automatic vs. manual):
- ocrd-segment-evaluate :construction: (very early stage)
- pattern-based segmentation (input file groups N=1, based on a PAGE template, e.g. from Aletheia, and some XSLT or Python to apply it to the input file group)
ocrd-segment-via-template:construction: (unpublished)
- data-driven segmentation (input file groups N=1, based on a statistical model, e.g. Neural Network)
ocrd-segment-via-model:construction: (unpublished)
For detailed description on input/output and parameters, see ocrd-tool.json
Testing
None yet.
Project details
Release history Release notifications | RSS feed
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
ocrd_segment-0.1.9.tar.gz
(28.7 kB
view details)
Built Distribution
Filter files by name, interpreter, ABI, and platform.
If you're not sure about the file name format, learn more about wheel file names.
Copy a direct link to the current filters
File details
Details for the file ocrd_segment-0.1.9.tar.gz.
File metadata
- Download URL: ocrd_segment-0.1.9.tar.gz
- Upload date:
- Size: 28.7 kB
- Tags: Source
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/3.3.0 pkginfo/1.5.0.1 requests/2.24.0 setuptools/50.3.2 requests-toolbelt/0.9.1 tqdm/4.50.2 CPython/3.6.9
File hashes
| Algorithm | Hash digest | |
|---|---|---|
| SHA256 |
bcd1cce4fee25f872078c8cc3352abe9a96be7387fea74797edef829eab0943c
|
|
| MD5 |
9f04f948193ef55dd248a5d3cf2ac5fb
|
|
| BLAKE2b-256 |
d9c44a4c506b21426b2a3be3a3658ce71bf1bf1823d4ddb537b3124b7d35d8d5
|
File details
Details for the file ocrd_segment-0.1.9-py3-none-any.whl.
File metadata
- Download URL: ocrd_segment-0.1.9-py3-none-any.whl
- Upload date:
- Size: 41.9 kB
- Tags: Python 3
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/3.3.0 pkginfo/1.5.0.1 requests/2.24.0 setuptools/50.3.2 requests-toolbelt/0.9.1 tqdm/4.50.2 CPython/3.6.9
File hashes
| Algorithm | Hash digest | |
|---|---|---|
| SHA256 |
7fb375c10220ea7285542ceffd79cb3d4f44fc369a016bb739ff5f477108da36
|
|
| MD5 |
226e4c3e100eb68014252be26786e02e
|
|
| BLAKE2b-256 |
06e16048e45cc27dd93135dbbdb6f0f38be9c664486e11c86b43f3a96c065b77
|