Python client library and CLI for using Redis as a vector database
Redis Vector Library
The AI-native Redis Python client
Introduction
Redis Vector Library (RedisVL) is the production-ready Python client for AI applications built on Redis. Lightning-fast vector search meets enterprise-grade reliability.
Perfect for building RAG pipelines with real-time retrieval, AI agents with memory and semantic routing, and recommendation systems with fast search and reranking.
| 🎯 Core Capabilities | 🚀 AI Extensions | 🛠️ Dev Utilities |
|---|---|---|
| **Index Management** Schema design, data loading, CRUD ops | **Semantic Caching** Reduce LLM costs & boost throughput | **CLI** Index management from the terminal |
| **Vector Search** Similarity search with metadata filters | **LLM Memory** Agentic AI context management | **Async Support** Async indexing and search for improved performance |
| **Complex Filtering** Combine multiple filter types | **Semantic Routing** Intelligent query classification | **Vectorizers** 8+ embedding provider integrations |
| **Hybrid Search** Combine semantic & full-text signals | **Embedding Caching** Cache embeddings for efficiency | **Rerankers** Improve search result relevancy |
💪 Getting Started
Installation
Install redisvl into your Python (>=3.9) environment using pip:
```bash
pip install redisvl
```
For more detailed instructions, visit the installation guide.
Redis
Choose from multiple Redis deployment options:
Redis Cloud - Managed cloud database (free tier available)
Redis Cloud offers a fully managed Redis service with a free tier, perfect for getting started quickly.
Docker - Local development
Run Redis locally using Docker:
```bash
docker run -d --name redis -p 6379:6379 redis:latest
```
This runs Redis 8+ with built-in vector search capabilities.
Redis Enterprise - Commercial, self-hosted database
Redis Enterprise provides enterprise-grade features for production deployments.
Redis Sentinel - High availability with automatic failover
Configure Redis Sentinel for high availability:
```python
# Connect via Sentinel
redis_url = "redis+sentinel://sentinel1:26379,sentinel2:26379/mymaster"
```
Azure Managed Redis - Fully managed Redis Enterprise on Azure
Azure Managed Redis provides fully managed Redis Enterprise on Microsoft Azure.
💡 Tip: Enhance your experience and observability with the free Redis Insight GUI.
Overview
Index Management
- Design a schema for your use case that models your dataset with built-in Redis indexable fields (e.g. text, tags, numerics, geo, and vectors).
Load schema from YAML file
```yaml
index:
  name: user-idx
  prefix: user
  storage_type: json

fields:
  - name: user
    type: tag
  - name: credit_score
    type: tag
  - name: job_title
    type: text
    attrs:
      sortable: true
      no_index: false  # Index for search (default)
      unf: false       # Normalize case for sorting (default)
  - name: embedding
    type: vector
    attrs:
      algorithm: flat
      dims: 4
      distance_metric: cosine
      datatype: float32
```
```python
from redisvl.schema import IndexSchema

schema = IndexSchema.from_yaml("schemas/schema.yaml")
```
Load schema from Python dictionary
```python
from redisvl.schema import IndexSchema

schema = IndexSchema.from_dict({
    "index": {
        "name": "user-idx",
        "prefix": "user",
        "storage_type": "json"
    },
    "fields": [
        {"name": "user", "type": "tag"},
        {"name": "credit_score", "type": "tag"},
        {
            "name": "job_title",
            "type": "text",
            "attrs": {
                "sortable": True,
                "no_index": False,  # Index for search
                "unf": False        # Normalize case for sorting
            }
        },
        {
            "name": "embedding",
            "type": "vector",
            "attrs": {
                "algorithm": "flat",
                "datatype": "float32",
                "dims": 4,
                "distance_metric": "cosine"
            }
        }
    ]
})
```
📚 Learn more about schema design and schema creation.
- Create a SearchIndex class with an input schema to perform admin and search operations on your index in Redis:

```python
from redis import Redis
from redisvl.index import SearchIndex

# Define the index
index = SearchIndex(schema, redis_url="redis://localhost:6379")

# Create the index in Redis
index.create()
```
An async-compatible index class is also available: AsyncSearchIndex.
- Load and fetch data to/from your Redis instance:

```python
data = {"user": "john", "credit_score": "high", "embedding": [0.23, 0.49, -0.18, 0.95]}

# load a list of dictionaries, specifying the "id" field
index.load([data], id_field="user")

# fetch by "id"
john = index.fetch("john")
```
Retrieval
Define queries and perform advanced searches over your indices, including vector search, complex filtering, and hybrid search combining semantic and full-text signals.
Quick Reference: Query Types
| Query Type | Use Case | Description |
|---|---|---|
| `VectorQuery` | Semantic similarity search | Find similar vectors with optional filters |
| `RangeQuery` | Distance-based search | Vector search within a defined distance range |
| `FilterQuery` | Metadata filtering | Filter and search using metadata fields |
| `TextQuery` | Full-text search | BM25-based keyword search with field weighting |
| `HybridQuery` | Combined search | Combine semantic + full-text signals (Redis 8.4.0+) |
| `CountQuery` | Counting records | Count documents matching filter criteria |
Vector Search
- VectorQuery - Flexible vector queries with customizable filters enabling semantic search:

```python
from redisvl.query import VectorQuery

query = VectorQuery(
    vector=[0.16, -0.34, 0.98, 0.23],
    vector_field_name="embedding",
    num_results=3,
    # Optional: tune search performance with runtime parameters
    ef_runtime=100  # HNSW: higher for better recall
)

# run the vector search query against the embedding field
results = index.query(query)
```
- RangeQuery - Vector search within a defined distance range, paired with customizable filters
Complex Filtering
Build complex filtering queries by combining multiple filter types (tags, numerics, text, geo, timestamps) using logical operators:
```python
from redisvl.query import VectorQuery
from redisvl.query.filter import Tag, Num
# Combine multiple filter types
tag_filter = Tag("user") == "john"
price_filter = Num("price") >= 100
# Create complex filtering query with combined filters
query = VectorQuery(
vector=[0.16, -0.34, 0.98, 0.23],
vector_field_name="embedding",
filter_expression=tag_filter & price_filter,
num_results=10
)
results = index.query(query)
```
- FilterQuery - Standard search using filters and full-text search
- CountQuery - Count the number of indexed records given attributes
- TextQuery - Full-text search with support for field weighting and BM25 scoring
Learn more about building complex filtering queries.
Hybrid Search
Combine semantic (vector) search with full-text (BM25) search signals for improved search quality:
- HybridQuery - Native hybrid search combining text and vector similarity (Redis 8.4.0+):

```python
from redisvl.query import HybridQuery

hybrid_query = HybridQuery(
    text="running shoes",
    text_field_name="description",
    vector=[0.1, 0.2, 0.3],
    vector_field_name="embedding",
    combination_method="LINEAR",  # or "RRF"
    num_results=10
)

results = index.query(hybrid_query)
```
- AggregateHybridQuery - Hybrid search using aggregation (compatible with earlier Redis versions)
Learn more about hybrid search.
Dev Utilities
Vectorizers
Integrate with popular embedding providers to greatly simplify the process of vectorizing unstructured data for your index and queries.
Supported Vectorizer Providers
```python
from redisvl.utils.vectorize import CohereTextVectorizer

# set COHERE_API_KEY in your environment
co = CohereTextVectorizer()

embedding = co.embed(
    text="What is the capital city of France?",
    input_type="search_query"
)

embeddings = co.embed_many(
    texts=["my document chunk content", "my other document chunk content"],
    input_type="search_document"
)
```
Learn more about using vectorizers in your embedding workflows.
Rerankers
Integrate with popular reranking providers to improve the relevancy of the initial search results from Redis.
Extensions
RedisVL Extensions provide production-ready modules implementing best practices and design patterns for working with LLM memory and agents. These extensions encapsulate learnings from our user community and enterprise customers.
💡 Have an idea for another extension? Open a PR or reach out to us at applied.ai@redis.com. We're always open to feedback.
Semantic Caching
Increase application throughput and reduce the cost of using LLM models in production by leveraging previously generated knowledge with the SemanticCache.
Example: Semantic Cache Usage
```python
from redisvl.extensions.cache.llm import SemanticCache

# init cache with TTL and semantic distance threshold
llmcache = SemanticCache(
    name="llmcache",
    ttl=360,
    redis_url="redis://localhost:6379",
    distance_threshold=0.1  # Redis COSINE distance [0-2], lower is stricter
)

# store user queries and LLM responses in the semantic cache
llmcache.store(
    prompt="What is the capital city of France?",
    response="Paris"
)

# quickly check the cache with a slightly different prompt (before invoking an LLM)
response = llmcache.check(prompt="What is France's capital city?")
print(response[0]["response"])
# >>> Paris
```
Learn more about semantic caching for LLMs.
Embedding Caching
Reduce computational costs and improve performance by caching embedding vectors with their associated text and metadata using the EmbeddingsCache.
Example: Embedding Cache Usage
```python
from redisvl.extensions.cache.embeddings import EmbeddingsCache
from redisvl.utils.vectorize import HFTextVectorizer

# Initialize embedding cache
embed_cache = EmbeddingsCache(
    name="embed_cache",
    redis_url="redis://localhost:6379",
    ttl=3600  # 1 hour TTL
)

# Initialize vectorizer with cache
vectorizer = HFTextVectorizer(
    model="sentence-transformers/all-MiniLM-L6-v2",
    cache=embed_cache
)

# First call computes and caches the embedding
embedding = vectorizer.embed("What is machine learning?")

# Subsequent calls retrieve from cache (much faster!)
cached_embedding = vectorizer.embed("What is machine learning?")
# >>> Cache hit! Retrieved from Redis in <1ms
```
Learn more about embedding caching for improved performance.
LLM Memory
Improve personalization and accuracy of LLM responses by providing user conversation context. Manage access to memory data using recency or relevancy, powered by vector search with the MessageHistory.
Example: Message History Usage
```python
from redisvl.extensions.message_history import SemanticMessageHistory

history = SemanticMessageHistory(
    name="my-session",
    redis_url="redis://localhost:6379",
    distance_threshold=0.7
)

# Supports roles: system, user, llm, tool
# Optional metadata field for additional context
history.add_messages([
    {"role": "user", "content": "hello, how are you?"},
    {"role": "llm", "content": "I'm doing fine, thanks."},
    {"role": "user", "content": "what is the weather going to be today?"},
    {"role": "llm", "content": "I don't know", "metadata": {"model": "gpt-4"}}
])

# Get recent chat history
history.get_recent(top_k=1)
# >>> [{"role": "llm", "content": "I don't know", "metadata": {"model": "gpt-4"}}]

# Get relevant chat history (powered by vector search)
history.get_relevant("weather", top_k=1)
# >>> [{"role": "user", "content": "what is the weather going to be today?"}]

# Filter messages by role
history.get_recent(role="user")              # Get only user messages
history.get_recent(role=["user", "system"])  # Or multiple roles
```
Learn more about LLM memory.
Semantic Routing
Build fast decision models that run directly in Redis and route user queries to the nearest "route" or "topic".
Example: Semantic Router Usage
```python
from redisvl.extensions.router import Route, SemanticRouter

routes = [
    Route(
        name="greeting",
        references=["hello", "hi"],
        metadata={"type": "greeting"},
        distance_threshold=0.3,
    ),
    Route(
        name="farewell",
        references=["bye", "goodbye"],
        metadata={"type": "farewell"},
        distance_threshold=0.3,
    ),
]

# build semantic router from routes
router = SemanticRouter(
    name="topic-router",
    routes=routes,
    redis_url="redis://localhost:6379",
)

router("Hi, good morning")
# >>> RouteMatch(name='greeting', distance=0.273891836405)
```
Learn more about semantic routing.
Command Line Interface
Create, destroy, and manage Redis index configurations from a purpose-built CLI interface: rvl.
```bash
$ rvl -h

usage: rvl <command> [<args>]

Commands:
        index       Index manipulation (create, delete, etc.)
        version     Obtain the version of RedisVL
        stats       Obtain statistics about an index
```
Read more about using the CLI.
🚀 Why RedisVL?
Redis is a proven, high-performance database that excels at real-time workloads. With RedisVL, you get a production-ready Python client that makes Redis's vector search, caching, and session management capabilities easily accessible for AI applications.
Built on the Redis Python client, RedisVL provides an intuitive interface for vector search, LLM caching, and conversational AI memory - all the core components needed for modern AI workloads.
😁 Helpful Links
For additional help, check out the following resources:
🫱🏼🫲🏽 Contributing
Please help us by contributing PRs, opening GitHub issues for bugs or new feature ideas, improving documentation, or increasing test coverage. Read more about how to contribute!
🚧 Maintenance
This project is supported by Redis, Inc on a good faith effort basis. To report bugs, request features, or receive assistance, please file an issue.
File details
Details for the file redisvl-0.15.0.tar.gz.
File metadata
- Download URL: redisvl-0.15.0.tar.gz
- Upload date:
- Size: 861.5 kB
- Tags: Source
- Uploaded using Trusted Publishing? No
- Uploaded via: uv/0.7.13
File hashes
| Algorithm | Hash digest |
|---|---|
| SHA256 | `0e382e9b6cd8378dfe1515b18f92d125cfba905f6f3c5fe9b8904b3ca840d1ca` |
| MD5 | `7be012011b7a2e287bf217e67e328469` |
| BLAKE2b-256 | `721af1f0ff963622c34a9e9a9f2a0c6ad82bfbd05c082ecc89e38e092e3e9069` |
File details
Details for the file redisvl-0.15.0-py3-none-any.whl.
File metadata
- Download URL: redisvl-0.15.0-py3-none-any.whl
- Upload date:
- Size: 197.9 kB
- Tags: Python 3
- Uploaded using Trusted Publishing? No
- Uploaded via: uv/0.7.13
File hashes
| Algorithm | Hash digest |
|---|---|
| SHA256 | `aff716b9a9c4aef9c81de9a12d9939a0170ff3b3a1fe9d4164e94b131a754290` |
| MD5 | `dfeb94cbc0f0625d62283943e5023eb1` |
| BLAKE2b-256 | `cc235c5263a3cfc66957fa3bb154ef9441fbbcfb2f4eae910eb18e316db168b1` |