Tools for testing, debugging, and evaluating LLM features.

Baserun

Baserun is the testing and observability platform for LLM apps.

Quick Start

1. Install Baserun

pip install baserun

2. Set up Baserun in your application

Set the Baserun API key

Create an account at https://baserun.ai. Then generate an API key for your project in the settings tab. Set it as an environment variable:

export BASERUN_API_KEY="your_api_key_here"

Or set baserun.api_key to its value:

baserun.api_key = "br-..."
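
If you would rather validate the key yourself before startup, a small check along these lines may help. This is a minimal sketch using only the standard library; the helper name `resolve_baserun_api_key` is hypothetical and is not part of the baserun package. It only reads the BASERUN_API_KEY variable described above.

```python
import os
from typing import Mapping, Optional


def resolve_baserun_api_key(env: Optional[Mapping[str, str]] = None) -> str:
    """Return the Baserun API key from the environment or raise a clear error.

    Hypothetical helper for illustration; it is not part of the baserun package.
    """
    env = os.environ if env is None else env
    key = env.get("BASERUN_API_KEY", "").strip()
    if not key:
        raise RuntimeError("BASERUN_API_KEY is not set")
    return key
```

The returned value could then be assigned to baserun.api_key if you prefer setting the key in code.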

Initialize Baserun

Call baserun.init() during your application's startup. This sets up the observability system and enables Baserun. If init is not called, Baserun is disabled.

3. Set up your traces

A trace comprises a series of events executed within your application. Tracing enables Baserun to display an LLM chain’s entire lifecycle, whether synchronous or asynchronous.

To start tracing, add the @baserun.trace decorator to the function you want to observe (e.g. a request/response handler or your main function).

Here is a simple example. In this case, Baserun is initialized at application startup and the answer_question function is traced. The LLM call within that function will now be traced.

import sys
from openai import OpenAI
import baserun


@baserun.trace
def answer_question(question: str) -> str:
    client = OpenAI()
    response = client.chat.completions.create(
        model="gpt-3.5-turbo",
        messages=[{"role": "user", "content": question}],
    )
    # Responses from the OpenAI v1 client use attribute access, not dict indexing.
    return response.choices[0].message.content


if __name__ == "__main__":
    baserun.init()
    print(answer_question(sys.argv[-1]))
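
To make the idea of a trace as "a series of events" concrete, here is a toy decorator that records a start and an end event around a function call. It is only an illustration of the concept and is not how Baserun implements @baserun.trace; the names `toy_trace` and `EVENTS` are made up for this sketch.

```python
import functools
import time
from typing import Any, Callable, Dict, List

# In-memory event log standing in for a real tracing backend (illustrative only).
EVENTS: List[Dict[str, Any]] = []


def toy_trace(func: Callable[..., Any]) -> Callable[..., Any]:
    """Record a start event and an end event around each call to func."""

    @functools.wraps(func)
    def wrapper(*args: Any, **kwargs: Any) -> Any:
        EVENTS.append({"name": func.__name__, "event": "start", "at": time.time()})
        try:
            return func(*args, **kwargs)
        finally:
            # The end event is recorded even if func raises.
            EVENTS.append({"name": func.__name__, "event": "end", "at": time.time()})

    return wrapper


@toy_trace
def answer_question(question: str) -> str:
    # Placeholder standing in for an LLM call.
    return f"echo: {question}"
```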

4. (Optional) Set up User Sessions

If your application interacts with users and you want to associate logs and traces with a particular user, you can use User Sessions. You can do this using with_session:

from openai import OpenAI
import baserun

@baserun.trace
def use_sessions(prompt="What is the capital of the US?") -> str:
    client = OpenAI()
    with baserun.with_session(user_identifier="example@test.com"):
        completion = client.chat.completions.create(
            model="gpt-3.5-turbo",
            messages=[{"role": "user", "content": prompt}],
        )
        content = completion.choices[0].message.content
        return content

5. (Optional) Set up your test suite

Use our pytest plugin to start testing with Baserun immediately. By default, all OpenAI and Anthropic requests are logged automatically.

# test_module.py

from openai import OpenAI


def test_paris_trip():
    client = OpenAI()
    response = client.chat.completions.create(
        model="gpt-3.5-turbo",
        temperature=0.7,
        messages=[
            {
                "role": "user",
                "content": "What are three activities to do in Paris?"
            }
        ],
    )

    # Responses from the OpenAI v1 client use attribute access, not dict indexing.
    assert "Eiffel Tower" in response.choices[0].message.content

To run the test and log to Baserun:

pytest --baserun test_module.py
...
========================Baserun========================
Test results available at: https://baserun.ai/runs/<id>
=======================================================
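
Because model output varies between runs (the test above uses temperature=0.7), an exact substring assertion can be brittle. One option, sketched here with a hypothetical helper `mentions_any` that is not part of Baserun or pytest, is to ignore case and accept any of several expected phrases.

```python
from typing import Iterable, Optional


def mentions_any(text: Optional[str], phrases: Iterable[str]) -> bool:
    """Return True if text contains at least one phrase, ignoring case."""
    if not text:
        return False
    lowered = text.casefold()
    return any(phrase.casefold() in lowered for phrase in phrases)
```

In a test this might be used as `assert mentions_any(content, ["Eiffel Tower", "Louvre"])`, where `content` is the model's reply.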

6. (Optional) Set up checks

Baserun supports checks (more broadly known as "evaluations"). These are assertions that an LLM response meets the criteria you require. To record a check, call baserun.check:

from openai import OpenAI
import baserun

client = OpenAI()
completion = client.chat.completions.create(
    model="gpt-3.5-turbo",
    messages=[{"role": "user", "content": "What is the capital of the United States?"}],
)
content = completion.choices[0].message.content
baserun.check(name="capital_answer", result="Washington" in content)
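
When one response has to satisfy several criteria, it can help to compute them as named booleans first and report each one afterwards. The sketch below is plain Python; the function name `evaluate_capital_answer` and the check names are hypothetical, and the commented loop assumes the baserun.check(name=..., result=...) call shown above.

```python
from typing import Dict, Optional


def evaluate_capital_answer(content: Optional[str]) -> Dict[str, bool]:
    """Compute named pass/fail results for a model response."""
    text = content or ""  # message.content can be None
    return {
        "non_empty": bool(text.strip()),
        "mentions_washington": "Washington" in text,
        "is_concise": len(text) <= 200,
    }


results = evaluate_capital_answer("The capital is Washington, D.C.")
# Each entry could then be reported, for example:
# for name, passed in results.items():
#     baserun.check(name=name, result=passed)
```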

Further Documentation

For a deeper dive on all capabilities and more advanced usage, please refer to our Documentation.

License

MIT License

Release history

2.0.9

Jun 26, 2024

2.0.8

Jun 24, 2024

2.0.7

Jun 24, 2024

2.0.6

Jun 20, 2024

2.0.5

Jun 18, 2024

2.0.4

Jun 17, 2024

2.0.3

Jun 10, 2024

2.0.2

Jun 6, 2024

2.0.0

Jun 6, 2024

1.0.0b11 pre-release

Jun 5, 2024

1.0.0b10 pre-release

Jun 5, 2024

1.0.0b9 pre-release

Jun 4, 2024

1.0.0b8 pre-release

May 31, 2024

1.0.0b7 pre-release

May 29, 2024

1.0.0b6 pre-release

May 29, 2024

1.0.0b5 pre-release

May 29, 2024

1.0.0b4 pre-release

May 28, 2024

1.0.0b2 pre-release

May 22, 2024

1.0.0b1 pre-release

May 22, 2024

1.0.0b0 pre-release

May 22, 2024

0.9.36

May 7, 2024

0.9.35

Apr 30, 2024

0.9.34

Apr 22, 2024

0.9.33

Mar 25, 2024

0.9.32

Mar 21, 2024

0.9.31

Mar 20, 2024

0.9.30

Mar 7, 2024

0.9.29

Mar 7, 2024

0.9.28

Mar 6, 2024

0.9.27

Mar 6, 2024

0.9.26

Mar 5, 2024

0.9.25

Mar 4, 2024

0.9.24

Mar 1, 2024

0.9.23

Feb 26, 2024

0.9.22

Feb 26, 2024

0.9.21

Feb 21, 2024

0.9.20

Feb 20, 2024

0.9.19

Feb 20, 2024

0.9.17

Feb 16, 2024

0.9.16

Feb 15, 2024

0.9.15

Feb 12, 2024

0.9.14

Feb 12, 2024

0.9.13

Feb 7, 2024

0.9.12

Feb 6, 2024

0.9.11

Feb 5, 2024

0.9.10

Feb 2, 2024

0.9.9

Jan 29, 2024

0.9.8

Jan 23, 2024

0.9.8b2 pre-release

Jan 17, 2024

0.9.8b1 pre-release

Jan 16, 2024

0.9.7b1 pre-release

Jan 10, 2024

0.9.6b2 pre-release

Dec 7, 2023

0.9.6b1 pre-release

Dec 7, 2023

0.9.5

Dec 6, 2023

0.9.4

Dec 6, 2023

0.9.3

Dec 1, 2023

0.9.3b2 pre-release

Dec 1, 2023

0.9.3b1 pre-release

Dec 1, 2023

0.9.2

Nov 30, 2023

0.9.1

Nov 30, 2023

This version

0.9.0

Nov 30, 2023

0.9.0b1 pre-release

Nov 28, 2023

0.8.1

Nov 10, 2023

0.8.0

Nov 10, 2023

0.8.0b1 pre-release

Nov 7, 2023

0.7.0b3 pre-release

Nov 3, 2023

0.7.0b2 pre-release

Nov 3, 2023

0.6.2

Oct 31, 2023

0.6.1

Oct 31, 2023

0.6.0

Oct 20, 2023

0.6.0b3 pre-release

Oct 11, 2023

0.6.0b2 pre-release

Oct 4, 2023

0.6.0b1 pre-release

Oct 2, 2023

0.5.6

Sep 18, 2023

0.5.5

Sep 12, 2023

0.5.4

Sep 11, 2023

0.5.3

Sep 8, 2023

0.5.2

Aug 31, 2023

0.5.1

Aug 23, 2023

0.5.0

Aug 23, 2023

0.4.2

Aug 16, 2023

0.4.1

Aug 14, 2023

0.4.0

Aug 13, 2023

0.3.1

Aug 10, 2023

0.3

Aug 4, 2023

0.2

Aug 2, 2023

0.1

Aug 1, 2023

Download files

Download the file for your platform.

Source Distribution

baserun-0.9.0.tar.gz (39.0 kB)

Uploaded Nov 30, 2023 Source

Built Distribution

baserun-0.9.0-py3-none-any.whl (43.2 kB)

Uploaded Nov 30, 2023 Python 3

File details

Details for the file baserun-0.9.0.tar.gz.

File metadata

Download URL: baserun-0.9.0.tar.gz
Upload date: Nov 30, 2023
Size: 39.0 kB
Tags: Source
Uploaded using Trusted Publishing? No
Uploaded via: twine/4.0.2 CPython/3.10.3

File hashes

Hashes for baserun-0.9.0.tar.gz
Algorithm	Hash digest
SHA256	`25c64d9ab73f026d31ef193a054a70a4d71f8d3d976f3d0c749af7a7a35a391e`
MD5	`ea34a089e845cb2c5eeba5ead5e1c4f5`
BLAKE2b-256	`d802f61eef5f2d5e04e8ff06c98c1e47683be80f789a67cb76c54934da675479`
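
The published digests can be used to confirm that a downloaded file is intact. Here is a minimal sketch using Python's standard hashlib module; the helper name `sha256_matches` is an assumption for illustration, and the path you pass in is wherever you saved the download.

```python
import hashlib
from pathlib import Path


def sha256_matches(path: Path, expected_hex: str, chunk_size: int = 1 << 20) -> bool:
    """Return True if the file's SHA256 digest equals expected_hex."""
    digest = hashlib.sha256()
    with path.open("rb") as handle:
        # Read in chunks so large files are not loaded into memory at once.
        for chunk in iter(lambda: handle.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest() == expected_hex.strip().lower()
```

For a file listed on this page, expected_hex would be the SHA256 value from its hash table.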

File details

Details for the file baserun-0.9.0-py3-none-any.whl.

File metadata

Download URL: baserun-0.9.0-py3-none-any.whl
Upload date: Nov 30, 2023
Size: 43.2 kB
Tags: Python 3
Uploaded using Trusted Publishing? No
Uploaded via: twine/4.0.2 CPython/3.10.3

File hashes

Hashes for baserun-0.9.0-py3-none-any.whl
Algorithm	Hash digest
SHA256	`f383af25b3d070fd5194a866f196a68fe9d8ba6db0b107e4931348f47f9915f7`
MD5	`48bae18922aafc34b55bbb86da391059`
BLAKE2b-256	`e567662dd0946127a21e40f798879fb372404161dea1472d447a84269a9eeab6`
