Amazon Textract Helper tools for pretty printing

Textract-PrettyPrinter

Provides functions to format the output received from Textract into more easily consumable formats, including CSV and Markdown.

Install

> python -m pip install amazon-textract-prettyprinter

Make sure your environment is set up with AWS credentials, through configuration files, environment variables, or an attached role (https://docs.aws.amazon.com/cli/latest/userguide/cli-chap-configure.html).
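Before calling Textract, it can help to confirm that credentials are discoverable. The following is a minimal sketch using only the standard library. It checks the standard AWS environment variables and the default shared credentials file, and it does not detect attached roles or validate the credentials. The helper name `find_credential_sources` is hypothetical and not part of this package.

```python
import os
from pathlib import Path


def find_credential_sources(env=None, home=None):
    """Return a list of places where AWS credentials appear to be configured.

    Hypothetical helper for illustration: it only checks for presence,
    does not validate anything, and cannot see attached IAM roles.
    """
    env = os.environ if env is None else env
    home = Path.home() if home is None else Path(home)
    sources = []

    # Static keys supplied through environment variables.
    if env.get("AWS_ACCESS_KEY_ID") and env.get("AWS_SECRET_ACCESS_KEY"):
        sources.append("environment variables")

    # A named profile selected through the environment.
    if env.get("AWS_PROFILE"):
        sources.append("profile: " + env["AWS_PROFILE"])

    # Shared credentials file; AWS_SHARED_CREDENTIALS_FILE overrides the default path.
    cred_file = Path(env.get("AWS_SHARED_CREDENTIALS_FILE", home / ".aws" / "credentials"))
    if cred_file.is_file():
        sources.append("credentials file: " + str(cred_file))

    return sources


if __name__ == "__main__":
    print(find_credential_sources() or "no credentials found in env or config files")
```

An empty result does not necessarily mean the call will fail, since an attached role may still provide credentials at request time.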

Samples

Get FORMS and TABLES as CSV

from textractcaller.t_call import call_textract, Textract_Features
from textractprettyprinter.t_pretty_print import Pretty_Print_Table_Format, Textract_Pretty_Print, get_string

# input_document is supplied by the caller, e.g. a local file path or an s3:// URI
textract_json = call_textract(input_document=input_document, features=[Textract_Features.FORMS, Textract_Features.TABLES])
print(get_string(textract_json=textract_json,
                 table_format=Pretty_Print_Table_Format.csv,
                 output_type=[Textract_Pretty_Print.TABLES, Textract_Pretty_Print.FORMS]))
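Once the CSV text is in hand, the standard library can turn it back into rows for further processing. This is a sketch under the assumption that the returned string is ordinary comma-separated text with quoted fields where needed; the `sample` string is made up for illustration and is not real Textract output, and `csv_string_to_rows` is a hypothetical helper.

```python
import csv
import io


def csv_string_to_rows(csv_text):
    """Parse a CSV-formatted string into a list of rows (lists of strings)."""
    reader = csv.reader(io.StringIO(csv_text))
    # Skip blank lines, which may separate multiple tables or sections.
    return [row for row in reader if any(cell.strip() for cell in row)]


# Made-up sample standing in for the string returned by get_string.
sample = 'Name,Amount\n"Doe, Jane",12.50\n\nTotal,12.50\n'
rows = csv_string_to_rows(sample)
print(rows)
```

Using `csv.reader` rather than `str.split(",")` keeps quoted fields that contain commas intact.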

Get string for TABLES using the get_string method

from textractcaller.t_call import call_textract, Textract_Features
from textractprettyprinter.t_pretty_print import Textract_Pretty_Print, get_string

textract_json = call_textract(input_document=input_document, features=[Textract_Features.TABLES])
get_string(textract_json=textract_json, output_type=Textract_Pretty_Print.TABLES)
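For intuition about what the table output is derived from, the sketch below rebuilds table rows directly from the blocks in a Textract response (TABLE blocks point to CELL blocks, which point to WORD blocks, via CHILD relationships). It is not this library's implementation, only an illustration of the idea. The function name `table_rows_from_blocks` and the `sample` data are hypothetical, and the sketch ignores merged cells and selection elements.

```python
def table_rows_from_blocks(textract_json):
    """Return each table as a list of rows, built from Textract CELL blocks."""
    blocks = {b["Id"]: b for b in textract_json.get("Blocks", [])}

    def child_ids(block):
        # Relationships may be absent on blocks that have no children.
        return [i for rel in (block.get("Relationships") or [])
                if rel["Type"] == "CHILD" for i in rel["Ids"]]

    tables = []
    for block in blocks.values():
        if block["BlockType"] != "TABLE":
            continue
        cells = {}
        for cell_id in child_ids(block):
            cell = blocks[cell_id]
            if cell["BlockType"] != "CELL":
                continue
            words = [blocks[w]["Text"] for w in child_ids(cell)
                     if blocks[w]["BlockType"] == "WORD"]
            # RowIndex and ColumnIndex are 1-based in Textract responses.
            cells[(cell["RowIndex"], cell["ColumnIndex"])] = " ".join(words)
        n_rows = max((r for r, _ in cells), default=0)
        n_cols = max((c for _, c in cells), default=0)
        tables.append([[cells.get((r, c), "") for c in range(1, n_cols + 1)]
                       for r in range(1, n_rows + 1)])
    return tables


# Made-up, heavily trimmed response with one 2x2 table.
sample = {"Blocks": [
    {"Id": "t1", "BlockType": "TABLE",
     "Relationships": [{"Type": "CHILD", "Ids": ["c1", "c2", "c3", "c4"]}]},
    {"Id": "c1", "BlockType": "CELL", "RowIndex": 1, "ColumnIndex": 1,
     "Relationships": [{"Type": "CHILD", "Ids": ["w1"]}]},
    {"Id": "c2", "BlockType": "CELL", "RowIndex": 1, "ColumnIndex": 2,
     "Relationships": [{"Type": "CHILD", "Ids": ["w2"]}]},
    {"Id": "c3", "BlockType": "CELL", "RowIndex": 2, "ColumnIndex": 1,
     "Relationships": [{"Type": "CHILD", "Ids": ["w3", "w4"]}]},
    {"Id": "c4", "BlockType": "CELL", "RowIndex": 2, "ColumnIndex": 2},
    {"Id": "w1", "BlockType": "WORD", "Text": "Item"},
    {"Id": "w2", "BlockType": "WORD", "Text": "Qty"},
    {"Id": "w3", "BlockType": "WORD", "Text": "Blue"},
    {"Id": "w4", "BlockType": "WORD", "Text": "pen"},
]}
print(table_rows_from_blocks(sample))
```

In practice `get_string` handles this traversal for you; the sketch only shows why the TABLES feature must be requested for table output to be available.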

Print out tables in LaTeX format

from textractcaller.t_call import call_textract, Textract_Features
from textractprettyprinter.t_pretty_print import Pretty_Print_Table_Format, get_tables_string

textract_json = call_textract(input_document=input_document, features=[Textract_Features.FORMS, Textract_Features.TABLES])
print(get_tables_string(textract_json=textract_json, table_format=Pretty_Print_Table_Format.latex))

Download files

Source Distribution

amazon-textract-prettyprinter-0.1.0.tar.gz (11.7 kB)

Uploaded Source

Built Distribution

amazon_textract_prettyprinter-0.1.0-py2.py3-none-any.whl (13.3 kB)

Uploaded Python 2, Python 3

File details

Details for the file amazon-textract-prettyprinter-0.1.0.tar.gz.

File hashes

Hashes for amazon-textract-prettyprinter-0.1.0.tar.gz
Algorithm Hash digest
SHA256 048b45c91a1782652b54576328b54cd609f8a26770a606a63b26085b6caaab25
MD5 36995357c939fce139262c74b95d8bb4
BLAKE2b-256 d60387843dd5f8d941b05e1d40581b942672d49805a59057d60a44ddf583f4b4

File details

Details for the file amazon_textract_prettyprinter-0.1.0-py2.py3-none-any.whl.

File hashes

Hashes for amazon_textract_prettyprinter-0.1.0-py2.py3-none-any.whl
Algorithm Hash digest
SHA256 6c6ee24366b1821bf4a14ab930f93ec7b5ed861536bfa7b627fe06f246a1924d
MD5 79f6fd0be6abb7d9124570cd33b3b08b
BLAKE2b-256 f714dabce3b930709fcc3408a6df44b8ebed20d97c22001cbcdb9ad5b4242378
