rapidocr-onnxruntime

RapidOCR

These details have not been verified by PyPI

Project links

Homepage

GitHub Statistics

View statistics for this project via Libraries.io, or by using our public dataset on Google BigQuery

Project description

Python版RapidOCR

Python版RapidOCR

简介和说明

各个版本的ONNX模型下载地址：百度网盘 | Google Drive
所有常用的参数配置都在config.yaml下，一目了然，更加便捷
目前config.yaml中配置为权衡速度和准确度的最优组合。
每个独立的模块下均有独立的config.yaml配置文件，可以单独使用
det部分：
- det中mobile和server版，推理代码一致，直接更改配置文件中模型路径即可
- det中v2和v3两个版本，推理代码一致，直接更改配置文件中模型路径即可
```
Det:
    module_name: ch_ppocr_v2_det
    class_name: TextDetector
    model_path: resources/models/ch_PP-OCRv3_det_infer.onnx
```
rec部分：
- rec中mobile和server版本，推理代码一致，直接更改配置文件中模型路径即可
- rec中v2和v3两个版本，共用同一个推理代码。
  - 两版本差别仅在输入shape和模型。经过测试，采用v3 rec模型+[3, 48, 320]效果最好。
  - 目前配置文件config.yaml中（如下所示）已经改为该最优组合。
```
module_name: ch_ppocr_v2_rec
class_name: TextRecognizer
model_path: resources/models/ch_PP-OCRv3_rec_infer.onnx

rec_img_shape: [3, 48, 320]
rec_batch_num: 6
keys_path: resources/rec_dict/ppocr_keys_v1.txt
```

onnxruntime和openvino调用方式如下:

# 基于onnxruntime引擎推理
from rapidocr_onnxruntime import TextSystem

# 基于openvino引擎推理
from rapidocr_openvino import TextSystem

值得说明的是，基于openvino推理部分中ch_ppocr_v2_cls部分仍然是基于onnxruntime的，原因是openvino有bug，详情见openvino/issue

使用步骤

下载当前下的rapidocr_onnxruntime/rapidocr_openvino目录到本地

下载链接下的resources目录（包含模型和显示的字体文件）

下载链接：百度网盘 | Google Drive
resources/models下模型搭配已经为最优组合（速度和精度平衡）
```
ch_PP-OCRv3_det + ch_ppocr_mobile_v2.0_cls +  ch_ppocr_mobile_v2.0_rec
```

最终目录如下:

.
├── README.md
├── config.yaml
├── test_demo.py
├── rapidocr_onnxruntime
│   ├── __init__.py
│   ├── ch_ppocr_v2_cls
│   ├── ch_ppocr_v2_det
│   ├── ch_ppocr_v2_rec
│   └── rapid_ocr_api.py
├── rapidocr_openvino
│   ├── __init__.py
│   ├── README.md
│   ├── ch_ppocr_v2_cls
│   ├── ch_ppocr_v2_det
│   ├── ch_ppocr_v2_rec
│   └── rapid_ocr_api.py
├── requirements.txt
├── resources
│    ├── fonts
│    │   └── msyh.ttc
│    ├── models
│    │   ├── ch_PP-OCRv3_det_infer.onnx
│    │   ├── ch_ppocr_mobile_v2.0_cls_infer.onnx
│    │   └── ch_PP-OCRv3_rec_infer.onnx
│    └── rec_dict
│        └── ppocr_keys_v1.txt
└── test_images
    ├── ch_en_num.jpg
    └── single_line_text.jpg

安装运行环境

基于onnxruntime推理所需环境安装：

pip install onnxruntime>=1.7.0

pip install -r requirements.txt -i https://pypi.douban.com/simple/

基于openvino推理所需环境安装：

# Windows端
pip install openvino==2022.1.0

pip install -r requirements.txt -i https://pypi.douban.com/simple/

Note: 在Windows端，Shapely库可能自动安装会有问题，解决方案参见Q15

运行示例

运行单元测试
```
cd tests
pytest test_*.py
```

接口调用

import cv2

# 基于onnxruntime引擎推理
from rapidocr_onnxruntime import TextSystem

# 基于openvino引擎推理
# from rapidocr_openvino import TextSystem

config_path = 'config.yaml'
text_sys = TextSystem(config_path)

image_path = r'test_images/det_images/ch_en_num.jpg'
img = cv2.imread(image_path)
dt_boxes, rec_res = text_sys(img)
print(rec_res)

直接运行test_demo.py，可直接可视化查看结果。
```
python test_demp.py
```

`config.yaml`中常用参数介绍

Global部分

参数名称	取值范围	默认值	作用
`text_score`	[0, 1]	0.5	文本识别结果置信度，值越大，把握越大
`use_angle_cls`	`bool`	`true`	是否使用文本行的方向分类
`print_verbose`	`bool`	`true`	是否打印各个部分耗时信息
`min_height`	`int`	30	图像最小高度（单位是像素）低于这个值，会跳过文本检测阶段，直接进行后续识别

min_height是用来过滤只有一行文本的图像（如下图），这类图像不会进入文本检测模块，直接进入后续过程。

Det部分

参数名称	取值范围	默认值	作用
`use_cuda`	`bool`	`false`	是否使用CUDA，加速推理
`limit_side_len`	-	736	限制图像边的长度的像素值
`limit_type`	`[min, max]`	`min`	限制图像的最小边长度还是最大边为`limit_side_len` 示例解释：当`limit_type=min`和`limit_side_len=736`时，图像最小边小于736时，会将图像最小边拉伸到736，另一边则按图像原始比例等比缩放。
`thresh`	[0, 1]	0.3	图像中文字部分和背景部分分割阈值值越大，文字部分会越小
`box_thresh`	[0, 1]	0.5	文本检测所得框是否保留的阈值，值越大，召回率越低
`max_candidates`	-	1000	图像中最大可检测到的文本框数目，一般够用
`unclip_ratio`	[1.6, 2.0]	1.6	控制文本检测框的大小，值越大，检测框整体越大
`use_dilation`	`bool`	`true`	是否使用形态学中的膨胀操作，一般采用默认值即可

Cls部分

参数名称	取值范围	默认值	作用
`cls_img_shape`	-	`[3, 48, 192]`	输入方向分类模型的图像Shape（CHW）
`cls_batch_num`	-	6	批次推理的batch大小，一般采用默认值即可，太大并没有明显提速，效果还可能会差
`cls_thresh`	`[0, 1]`	0.9	方向分类结果的置信度
`label_list`	-	[0, 180]	方向分类的标签，0°或者180°，该参数不能动

Rec部分

参数名称	取值范围	默认值	作用
`rec_img_shape`	-	`[3, 48, 320]`	输入文本识别模型的图像Shape（CHW）
`rec_batch_num`	-	6	批次推理的batch大小，一般采用默认值即可，太大并没有明显提速，效果还可能会差
`keys_path`	-	-	文本识别模型推理所使用字典文件，始识别哪种类型文本而定（中英、日文等）

onnxruntime-gpu版推理配置

onnxruntime-gpu需要严格按照与cuda、cudnn版本对应来安装，具体参考文档，这一步关乎后面是否可以成功调用GPU。
```
$ pip install onnxruntime-gpu==1.xxx
```

更改config.yaml中对应部分的参数即可，详细参数介绍参见官方文档。

use_cuda: true
CUDAExecutionProvider:
    device_id: 0
    arena_extend_strategy: kNextPowerOfTwo
    gpu_mem_limit: 2 * 1024 * 1024 * 1024
    cudnn_conv_algo_search: EXHAUSTIVE
    do_copy_in_default_stream: true

推理情况
1. 下载基准测试数据集（test_images_benchmark），放到tests/benchmark目录下。
  - 百度网盘 | Google Drive
  - 最终目录结构如下：
```
tests/benchmark/
    ├── benchmark.py
    ├── config_gpu.yaml
    ├── config.yaml
    └── test_images_benchmark
```
2. 运行以下代码（python目录下运行）：
```
# CPU
python tests/benchmark/benchmark.py --yaml_path config.yaml

# GPU
python tests/benchmark/benchmark.py --yaml_path config_gpu.yaml
```
3. 运行相关信息汇总：（以下仅为个人测试情况，具体情况请自行测试）
  - 来自zhsunlight的测试，感谢
    - 设备型号：宏碁(Acer) 暗影骑士·威N50-N93游戏台式机
    - CPU型号：十代i5-10400F 16G 512G SSD
    - GPU型号：NVIDIA GeForce GTX 1660Super 6G
    - onnxruntime-gpu: 1.11.0
    - 耗时情况：
      
      设备总耗时(s) 平均耗时(s/img)
      
      CPU 296.8841 1.18282
      
      GPU 646.14667 2.57429
  - 来自SWHL的测试
    - 设备型号：Docker
    - CPU型号：-
    - GPU型号：NVIDIA V100S 16G
    - onnxruntime-gpu: 1.7.0
    - 耗时情况：
      
      设备总耗时(s) 平均耗时(s/img)
      
      CPU 1079.4726 4.3001
      
      GPU 525.8244 2.0989

设备	总耗时(s)	平均耗时(s/img)
CPU	296.8841	1.18282
GPU	646.14667	2.57429

设备	总耗时(s)	平均耗时(s/img)
CPU	1079.4726	4.3001
GPU	525.8244	2.0989

Project details

These details have not been verified by PyPI

Project links

Homepage

GitHub Statistics

View statistics for this project via Libraries.io, or by using our public dataset on Google BigQuery

Release history Release notifications | RSS feed

1.3.19

May 15, 2024

1.3.18

May 12, 2024

1.3.17

Apr 19, 2024

1.3.16

Apr 7, 2024

1.3.15

Mar 7, 2024

1.3.14

Mar 5, 2024

1.3.13

Feb 28, 2024

1.3.12

Feb 27, 2024

1.3.11

Feb 4, 2024

1.3.10

Jan 30, 2024

1.3.9

Dec 28, 2023

1.3.8

Oct 25, 2023

1.3.7

Sep 21, 2023

1.3.6

Sep 20, 2023

1.3.5

Sep 20, 2023

1.3.4

Sep 19, 2023

1.3.3 yanked

Sep 18, 2023

Reason this release was yanked:

error parameter

1.3.2

Sep 6, 2023

1.3.1

Aug 29, 2023

1.3.0

Aug 27, 2023

1.2.13

Jul 12, 2023

1.2.12

Jul 12, 2023

1.2.11

Jun 30, 2023

1.2.10

Jun 24, 2023

1.2.9

Jun 18, 2023

1.2.8

May 13, 2023

1.2.7

Apr 24, 2023

1.2.6

Apr 10, 2023

1.2.5

Apr 7, 2023

1.2.4

Mar 28, 2023

1.2.3

Mar 11, 2023

1.2.2

Mar 11, 2023

1.2.1

Mar 10, 2023

1.2.0

Mar 7, 2023

1.1.30

Mar 7, 2023

1.1.29

Feb 22, 2023

1.1.28

Feb 15, 2023

1.1.27

Feb 15, 2023

1.1.26

Feb 13, 2023

1.1.25

Feb 12, 2023

1.1.24

Jan 13, 2023

1.1.23

Jan 9, 2023

1.1.22

Jan 4, 2023

1.1.21

Jan 4, 2023

1.1.20

Dec 19, 2022

1.1.19

Dec 19, 2022

1.1.18

Dec 15, 2022

1.1.17

Dec 14, 2022

1.1.16

Dec 14, 2022

1.1.15

Dec 14, 2022

1.1.14

Dec 14, 2022

1.1.13

Nov 23, 2022

1.1.12

Nov 20, 2022

1.1.11

Nov 15, 2022

1.1.10

Nov 14, 2022

1.1.9

Oct 11, 2022

1.1.8

Sep 30, 2022

1.1.7

Sep 30, 2022

1.1.6

Sep 25, 2022

1.1.5

Sep 16, 2022

1.1.4

Sep 1, 2022

1.1.3

Aug 17, 2022

1.1.2

Aug 17, 2022

1.1.1

Aug 17, 2022

1.1.0

Aug 17, 2022

1.0.9

Aug 17, 2022

1.0.8

Aug 17, 2022

1.0.7

Aug 14, 2022

1.0.6

Jul 14, 2022

1.0.5

Jul 13, 2022

1.0.4

Jul 12, 2022

1.0.3

Jul 11, 2022

1.0.2

Jul 10, 2022

This version

1.0.1

Jul 10, 2022

0.0.0

Jul 10, 2022

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distributions

No source distribution files available for this release.See tutorial on generating distribution archives.

Built Distribution

rapidocr_onnxruntime-1.0.1-py3-none-any.whl (22.0 kB view hashes)

Uploaded Jul 10, 2022 Python 3

Hashes for rapidocr_onnxruntime-1.0.1-py3-none-any.whl

Hashes for rapidocr_onnxruntime-1.0.1-py3-none-any.whl
Algorithm	Hash digest
SHA256	`6abc9556bee954204e6819698bf91b446f99ee51ac7ad9932333f848db050622`
MD5	`8dbf8ca08c2d92d48d1705575a367a89`
BLAKE2b-256	`66e7af05d833d87ddb216b5cf4384d1fcb44cd68ff0f2d59b8ce6cde638a0d45`

rapidocr-onnxruntime 1.0.1

Navigation

Verified details

Maintainers

Unverified details

Project links

GitHub Statistics

Meta

Project description

Python版RapidOCR

简介和说明

使用步骤

`config.yaml`中常用参数介绍

onnxruntime-gpu版推理配置

Project details

Verified details

Maintainers

Unverified details

Project links

GitHub Statistics

Meta

Release history Release notifications | RSS feed

Download files

Source Distributions

Built Distribution

rapidocr-onnxruntime 1.0.1

Navigation

Verified details

Maintainers

Unverified details

Project links

GitHub Statistics

Meta

Project description

Python版RapidOCR

简介和说明

使用步骤

config.yaml中常用参数介绍

onnxruntime-gpu版推理配置

Project details

Verified details

Maintainers

Unverified details

Project links

GitHub Statistics

Meta

Release history Release notifications | RSS feed

Download files

Source Distributions

Built Distribution

`config.yaml`中常用参数介绍