使用 pinyin-data 和 phrase-pinyin-data 中的拼音数据文件覆盖 pypinyin 中的自带拼音数据,实现只使用某个或某些拼音数据文件中的拼音数据的需求
Project description
pypinyin-dict
使用 pinyin-data 和 phrase-pinyin-data 中的拼音数据文件覆盖 pypinyin 中的内置拼音数据, 实现只使用某个或某些拼音数据文件中的拼音数据的需求。
安装
$ pip install pypinyin-dict
使用示例
>>> from pypinyin import pinyin
>>> pinyin('枯萎')
[['kū'], ['wēi']]
# 使用 phrase-pinyin-data 项目中 cc_cedict.txt 文件中的拼音数据优化结果
>>> from pypinyin_dict.phrase_pinyin_data import cc_cedict
>>> cc_cedict.load()
>>> pinyin('枯萎')
[['kū'], ['wěi']]
>>> pinyin('扔', heteronym=True)
[['rēng', 'rèng']]
# 使用 pinyin-data 项目中 kXHC1983.txt 文件中的拼音数据优化结果
>>> from pypinyin_dict.pinyin_data import kxhc1983
>>> kxhc1983.load()
>>> pinyin('扔', heteronym=True)
[['rēng']]
模块介绍
各个模块与数据文件关系如下(所有模块中都有一个 load() 函数用于导入对应的拼音数据,使用方法详见上方【使用示例】):
# pinyin-data/kTGHZ2013.txt
>> from pypinyin_dict.pinyin_data import ktghz2013
# pinyin-data/kHanyuPinyin.txt
>> from pypinyin_dict.pinyin_data import khanyupinyin
# pinyin-data/kXHC1983.txt
>> from pypinyin_dict.pinyin_data import kxhc1983
# pinyin-data/kHanyuPinlu.txt
>> from pypinyin_dict.pinyin_data import khanyupinlu
# pinyin-data/kMandarin_8105.txt
>> from pypinyin_dict.pinyin_data import kmandarin_8105
# pinyin-data/pinyin.txt
>> from pypinyin_dict.pinyin_data import pinyin
# pinyin-data/zdic.txt
>> from pypinyin_dict.pinyin_data import zdic
# pinyin-data/cc_cedict.txt
>> from pypinyin_dict.pinyin_data import cc_cedict
# phrase-pinyin-data/pinyin.txt
>> from pypinyin_dict.phrase_pinyin_data import pinyin
# phrase-pinyin-data/zdic_cibs.txt
>> from pypinyin_dict.phrase_pinyin_data import zdic_cibs
# phrase-pinyin-data/zdic_cybs.txt
>> from pypinyin_dict.phrase_pinyin_data import zdic_cybs
# phrase-pinyin-data/cc_cedict.txt
>> from pypinyin_dict.phrase_pinyin_data import cc_cedict
# phrase-pinyin-data/di.txt
>> from pypinyin_dict.phrase_pinyin_data import di
# phrase-pinyin-data/large_pinyin.txt
>> from pypinyin_dict.phrase_pinyin_data import large_pinyin
生成文件
依赖 Python 3 环境。
clone 子模块:
git submodule init
git submodule update
生成 py 文件:
make generate
编辑数据文件然后生成 py 文件(可选):
- 进入 tools 目录
- 编辑 pinyin-data 或 phrase-pinyin-data 里的 txt 文件(tools 目录中 Makefile 里涉及的 txt 文件)
- 执行
make generate生成最新的 py 文件 - 可以通过
git diff查看变更内容
Project details
Release history Release notifications | RSS feed
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
Built Distribution
Filter files by name, interpreter, ABI, and platform.
If you're not sure about the file name format, learn more about wheel file names.
Copy a direct link to the current filters
File details
Details for the file pypinyin_dict-0.9.0.tar.gz.
File metadata
- Download URL: pypinyin_dict-0.9.0.tar.gz
- Upload date:
- Size: 9.3 MB
- Tags: Source
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/6.0.1 CPython/3.12.3
File hashes
| Algorithm | Hash digest | |
|---|---|---|
| SHA256 |
8c491396baa1567311f2ec759cbc154638f3bcefdc711d34e53e373e3a429fa5
|
|
| MD5 |
247c1d7cd0d0b5c4a0ab5ef0a5fcd128
|
|
| BLAKE2b-256 |
647af56b7096cde930a65f8d5dc8cb726136d53c23175148f6aa1daa75419126
|
File details
Details for the file pypinyin_dict-0.9.0-py2.py3-none-any.whl.
File metadata
- Download URL: pypinyin_dict-0.9.0-py2.py3-none-any.whl
- Upload date:
- Size: 9.5 MB
- Tags: Python 2, Python 3
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/6.0.1 CPython/3.12.3
File hashes
| Algorithm | Hash digest | |
|---|---|---|
| SHA256 |
10cfbe40af87d704b867533177be8cd72837da9e224755dd275798e88097067a
|
|
| MD5 |
31df43a092f49e06f49b817a94ff8584
|
|
| BLAKE2b-256 |
418fadd772a61256a9ac91d95bf5ec3dffc1de97c8e5da53d40655044b2e1509
|