Re-implementation of the self-instruct paper, with updated prompts, models, etc.

These details have not been verified by PyPI

Project links

Homepage

GitHub Statistics

View statistics for this project via Libraries.io, or by using our public dataset on Google BigQuery

Development Status
- 3 - Alpha
Intended Audience
- Developers
License
- OSI Approved :: Apache Software License
Operating System
- POSIX :: Linux
Programming Language
- Python :: 3.10

Project description

AIroboros: Using large language models to fine-tune large language models.

This is my take on implementing the Self-Instruct paper. The approach is quite heavily modified, and uses the human generated seeds provided by Databricks Dolly Project

This updated implementation supports either the /v1/completions endpoint or /v1/chat/completions, which is particularly useful in that it supports gpt-4 and gpt-3.5-turbo (which is 1/10 the cost of text-davinci-003).

Generating instructions

See available options via:

airoboros generate-instructions --help

Example output:

usage: self_instruct.py [-h] [--model MODEL]
                        [--organization-id ORGANIZATION_ID]
                        [--openai-api-key OPENAI_API_KEY]
                        [--instruction-count INSTRUCTION_COUNT]
                        [--seed-tasks-path SEED_TASKS_PATH]
                        [--output-path OUTPUT_PATH] [--overwrite]
                        [--default-prompt-prefix DEFAULT_PROMPT_PREFIX]
                        [--classification-prompt-prefix CLASSIFICATION_PROMPT_PREFIX]
                        [--contextual-prompt-prefix CONTEXTUAL_PROMPT_PREFIX]
                        [--skip-instruction-re SKIP_INSTRUCTION_RE]
                        [--code-gen-re CODE_GEN_RE]
                        [--min-instruction-length MIN_INSTRUCTION_LENGTH]
                        [--max-instruction-length MAX_INSTRUCTION_LENGTH]
                        [--max-tokens MAX_TOKENS] [--temperature TEMPERATURE]
                        [--top-p TOP_P]
                        [--frequency-penalty FREQUENCY_PENALTY]
                        [--presence-penalty PRESENCE_PENALTY]
                        [--max-usage-tokens MAX_USAGE_TOKENS]

options:
  -h, --help            show this help message and exit
  --model MODEL         OpenAI model/engine to use for prompt generation,
                        which can be either part of the /v1/completions or
                        /v1/chat/completions endpoints
  --organization-id ORGANIZATION_ID
                        organization ID to include in the request to OpenAI,
                        defaults to organization ID tied to the API key
  --openai-api-key OPENAI_API_KEY
                        OpenAI API key to use, defaults to the OPENAI_API_KEY
                        environment variable
  --instruction-count INSTRUCTION_COUNT
                        number of instructions to generate, not including the
                        seed instructions
  --seed-tasks-path SEED_TASKS_PATH
                        path to an input seed instructions JSONL file
  --output-path OUTPUT_PATH
                        path to store all generated instructions in
  --overwrite           overwrite output path if it exists
  --default-prompt-prefix DEFAULT_PROMPT_PREFIX
                        prompt prefix to use for generating open, non-
                        classification tasks
  --classification-prompt-prefix CLASSIFICATION_PROMPT_PREFIX
                        prompt prefix to use for generating classification
                        tasks
  --contextual-prompt-prefix CONTEXTUAL_PROMPT_PREFIX
                        prompt prefix to use for generating tasks with
                        context, e.g. closed Q&A
  --skip-instruction-re SKIP_INSTRUCTION_RE
                        regular expression used to filter low-quality/unusable
                        instructions
  --code-gen-re CODE_GEN_RE
                        regular expression used to filter coding/programming
                        tasks
  --min-instruction-length MIN_INSTRUCTION_LENGTH
                        minimum instruction length
  --max-instruction-length MAX_INSTRUCTION_LENGTH
                        maximum instruction length
  --max-tokens MAX_TOKENS
                        maximum number of tokens in an instruction
  --temperature TEMPERATURE
                        temperature parameter to use in OpenAI requests
  --top-p TOP_P         top-p parameter to use in OpenAI requests
  --frequency-penalty FREQUENCY_PENALTY
                        frequency penalty to use in OpenAI requests
  --presence-penalty PRESENCE_PENALTY
                        presence penalty to use in OpenAI requests
  --max-usage-tokens MAX_USAGE_TOKENS
                        Maximum token usage, calculated as sum of total_tokens
                        from responses

Coming soon

Scripts for fine-tuning various models using the self-instruct (and human-generated) prompts.

Project details

These details have not been verified by PyPI

Project links

Homepage

GitHub Statistics

View statistics for this project via Libraries.io, or by using our public dataset on Google BigQuery

Development Status
- 3 - Alpha
Intended Audience
- Developers
License
- OSI Approved :: Apache Software License
Operating System
- POSIX :: Linux
Programming Language
- Python :: 3.10

Release history Release notifications | RSS feed

2.2.2

Mar 7, 2024

2.2.1

Dec 6, 2023

2.2.0

Sep 4, 2023

2.1.6

Aug 25, 2023

2.1.4

Aug 23, 2023

2.1.3

Aug 23, 2023

2.1.2

Aug 23, 2023

2.1.1

Aug 22, 2023

2.0.24

Aug 12, 2023

2.0.23

Aug 12, 2023

2.0.22

Aug 12, 2023

2.0.21

Aug 11, 2023

2.0.20

Aug 10, 2023

2.0.19

Aug 9, 2023

2.0.18

Aug 6, 2023

2.0.16

Aug 3, 2023

2.0.15

Aug 3, 2023

2.0.14

Aug 3, 2023

2.0.13

Jul 25, 2023

2.0.12

Jul 25, 2023

2.0.11

Jul 24, 2023

2.0.10

Jul 24, 2023

2.0.9

Jul 24, 2023

2.0.8

Jul 24, 2023

2.0.7

Jul 22, 2023

2.0.6

Jul 22, 2023

2.0.5

Jul 22, 2023

2.0.4

Jul 22, 2023

2.0.3

Jul 22, 2023

2.0.2

Jul 18, 2023

2.0.1

Jul 18, 2023

0.2.8

Jun 25, 2023

0.2.7

May 16, 2023

0.2.6

May 13, 2023

0.2.5

May 13, 2023

0.2.4

May 13, 2023

0.2.3

May 13, 2023

0.2.2

May 12, 2023

0.2.0

May 11, 2023

0.1.0

May 10, 2023

0.0.6

May 4, 2023

0.0.5

May 2, 2023

This version

0.0.4

May 2, 2023

0.0.3

May 2, 2023

0.0.2

May 1, 2023

0.0.1

May 1, 2023

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distributions

No source distribution files available for this release.See tutorial on generating distribution archives.

Built Distribution

airoboros-0.0.4-py3-none-any.whl (15.4 kB view hashes)

Uploaded May 2, 2023 Python 3

Hashes for airoboros-0.0.4-py3-none-any.whl

Hashes for airoboros-0.0.4-py3-none-any.whl
Algorithm	Hash digest
SHA256	`4b924c057ca9c437fc31805b5dcf2525a7ef550598c342bb4f63ee13c9d952b6`
MD5	`9cc9f94c69add0b0ec50bcd5dc2160b2`
BLAKE2b-256	`290ac93da2105e01817e3a4cfc708befcfbd5332d28d3040f9f8ae66f310be85`