hcgf · PyPI

Humanable ChatGPT/GLM Fine-tuning.

These details have not been verified by PyPI

Project links

Homepage

GitHub Statistics

View statistics for this project via Libraries.io, or by using our public dataset on Google BigQuery

Project description

先clone仓库或pip安装：

pip install hcgf

需要的依赖在requirements.txt中，通过下面命令安装：

pip install -r requirements.txt

注意：不支持PyTorch2.0，历史版本请参考下面链接安装：

https://pytorch.org/get-started/previous-versions/

微调

准备数据

每一行一个json，必须包含prompt和completion两个字段。示例如下：

{"prompt": "你是谁？\n", "completion": "不告诉你。"}

正常微调

至少需要一张16G显存的卡。如果不指定显卡，默认为cuda。

#===== 微调 =====#
import hcgf
gl = hcgf.GlmLora("THUDM/chatglm-6b", device="cuda:0")
gl.load_data("/path/to/data.json").tune()

#===== 推理 =====#
gl = hcgf.GlmLora("THUDM/chatglm-6b", device="cuda:0")
gl.load_pretrained("/path/to/lora_pt").eval()
gl.chat("你是谁?")

#===== 切换模式 =====#
gl = hcgf.GlmLora("THUDM/chatglm-6b", device="cuda:0")
gl.load_data("/path/to/data.json").tune()
# 切换到推理模式
gl.eval()
gl.chat("你是谁？")
# 切换回微调模式，还是用原来的数据重新跑
gl.tune()
# 如果有新的数据集，参考上面的写法，先加载数据
gl.load_data("/path/to/new_data.json").tune()

8bit微调

至少需要一张12G显存的卡。不指定device。

需要安装依赖: bitsandbytes

只需要初始化时改一下即可，其他操作和上面正常微调一样。

gl = hcgf.GlmLora("THUDM/chatglm-6b", load_in_8bit=True)

继续微调

先加载之前的pt文件，然后加载数据微调。

gl.load_pretrained("/path/to/lora_pt").load_data("/path/to/new_data").tune()

参数说明

主要有三个方法的参数，有值的表示默认值。

load_data(
    data_path: str, 
    max_seq_len: int = 512, # 句子最大长度，超过会截断
)
tune(
    batch_size: int = 1,
    lr: float = 2e-4,
    num_epochs: int = 10,
    warmup_steps: Optional[int] = None,     # 为None时会用第一个Epoch进行warmup
    accumulate_steps: Optional[int] = 32,
    out_dir: str = "./output/",
    print_every: int = 10,                  # 每隔多少个Step打印一次输出（Step、Loss、LearningRate）
)
chat(
    inp: str, 
    history: List[Tuple[str, str]] = None,  # (问，答)Pair对
    max_len: int = 512,                     # 上下文的最大长度，超过就不生成了
    stop: List[str] = []                    # 停止文本，可以是标点、特定词或句子等
)

配置

有几个影响显存的参数可以配置：max_seq_len，batch_size。

(
gl
.load_data("./data/chatgpt_finetune_faq.json", max_seq_len=128)
.tune(batch_size=1)
)

不同配置 8bit 资源占用：

max_seq_len	batch_size	memory
`64`	1	11G
`128`	1	12G
`512`	1	22G
128	`2`	15G
128	`4`	21G

不同配置正常资源占用：

max_seq_len	batch_size	memory
`64`	1	15G
`128`	1	16G
`512`	1	30G
128	`2`	19G
128	`4`	25G

RM

使用小模型（如BERT等）训练。

训练

准备数据

需要pair对数据，计算logits过程和普通预训练模型一样（一个Batch多个pair对）；计算loss时属于同一个pair对的logits放一块算。

推理时直接用logits就行。

推理

测试

# 全部测试
python -m pytest
# 测试训练和推理，比较慢
python -m pytest -s -m slow
# 测试其他的
python -m pytest -m "not slow"

Project details

These details have not been verified by PyPI

Project links

Homepage

GitHub Statistics

View statistics for this project via Libraries.io, or by using our public dataset on Google BigQuery

Release history Release notifications | RSS feed

0.4.2

Sep 19, 2023

0.4.1

Sep 15, 2023

0.4.0

Sep 10, 2023

0.2.1

May 13, 2023

0.2.0

May 13, 2023

0.1.0

Apr 11, 2023

This version

0.0.7

Apr 4, 2023

0.0.6

Apr 3, 2023

0.0.5

Apr 2, 2023

0.0.4

Mar 27, 2023

0.0.3

Mar 25, 2023

0.0.2

Mar 25, 2023

0.0.1

Mar 25, 2023

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

hcgf-0.0.7.tar.gz (46.0 kB view hashes)

Uploaded Apr 4, 2023 Source

Built Distribution

hcgf-0.0.7-py3-none-any.whl (45.4 kB view hashes)

Uploaded Apr 4, 2023 Python 3

Hashes for hcgf-0.0.7.tar.gz

Hashes for hcgf-0.0.7.tar.gz
Algorithm	Hash digest
SHA256	`980036c6ebc29ecc2a5da5bbf3867767e1279d00e740a08f81e4cecf148bed31`
MD5	`f9a8eea5f91555e54fef18b1507bfd8c`
BLAKE2b-256	`064eb1f59e6250af2049abd6b460836fb5945ca577c7093e3dc5b4f94ac99200`

Hashes for hcgf-0.0.7-py3-none-any.whl

Hashes for hcgf-0.0.7-py3-none-any.whl
Algorithm	Hash digest
SHA256	`a52dc29e5bc840a52740bd8bfa8c19ec22b5ed5cc1f331f8fe97838ef85d13e0`
MD5	`70980d5f78d5dc287483b85e4ee20f03`
BLAKE2b-256	`efb9d50e8db43882a39feecee89ad5646e6465374f3ac68bc2e026189619b4f7`