How to Perform Hybrid Search with Multiple Vector Stores#

Ragbits comes with a special type of vector store called HybridSearchVectorStore, which allows you to combine multiple vector stores into a single search index. It acts as a single vector store but internally manages querying and updating multiple vector stores during operations like storing, searching, and deleting entries.

The main use cases for using a hybrid vector store are:

Combining Different Modalities: You can combine multiple vector stores that store different types of data, like text and images. This allows you to store multiple modality-specific vectors for the same entry (for example, an image embedding and a text embedding of a description of the image) and search them together.
Combining Different Types of Embeddings: You can combine multiple vector stores that store different types of embeddings, like dense and sparse embeddings. This allows you to store multiple embeddings for the same entry and search them simultaneously.

Using a Hybrid Vector Store with Different Modalities#

To create a hybrid vector store, you need to pass a list of vector stores to the constructor of the HybridSearchVectorStore class. For example, this creates two in-memory vector stores—one for text and one for images:

from ragbits.core.vector_stores.hybrid import HybridSearchVectorStore
from ragbits.core.vector_stores.in_memory import InMemoryVectorStore
from ragbits.core.embeddings.dense.vertex_multimodal import VertexAIMultimodelEmbedder

embedder = VertexAIMultimodelEmbedder()

vector_store_text = InMemoryVectorStore(embedder=embedder, embedding_type=EmbeddingType.TEXT)
vector_store_image = InMemoryVectorStore(embedder=embedder, embedding_type=EmbeddingType.IMAGE)

vector_store_hybrid = HybridSearchVectorStore(vector_store_text, vector_store_image)

You can then use the vector_store_hybrid object to store, search, and delete entries, just as you would use a regular vector store, or pass it to Ragbits' Document Search. When you store an entry in the hybrid vector store, it will be stored in all the vector stores it contains. In this case, one will store the text embedding and the other will store the image embedding.

Using a Hybrid Vector Store with Different Types of Embeddings#

You can create a hybrid vector store with different types of embeddings, including combining dense and sparse embeddings for improved search performance. Here's an example that creates two in-memory vector stores—one using a dense embedder and one using a sparse embedder:

from ragbits.core.vector_stores.hybrid import HybridSearchVectorStore
from ragbits.core.vector_stores.in_memory import InMemoryVectorStore
from ragbits.core.embeddings.dense import LiteLLMEmbedder
from ragbits.core.embeddings.sparse.fastembed import FastEmbedSparseEmbedder

# Create a dense vector store using OpenAI embeddings
vector_store_dense = InMemoryVectorStore(
    embedder=LiteLLMEmbedder(model="text-embedding-3-small")
)

# Create a sparse vector store using sparse embeddings
vector_store_sparse = InMemoryVectorStore(
    embedder=FastEmbedSparseEmbedder(model_name="BAAI/bge-small-en-v1.5")
)

# Combine them into a hybrid search vector store
vector_store_hybrid = HybridSearchVectorStore(vector_store_dense, vector_store_sparse)

You can then use the vector_store_hybrid object to store, search, and delete entries, just as you would use a regular vector store, or pass it to Ragbits' Document Search. When you store an entry in the hybrid vector store, it will be stored in all the vector stores it contains. In this case, one will store the dense embedding and the other will store the sparse embedding.

For more details about using sparse vectors with vector stores, see How to Use Sparse Vectors with Vector Stores.

Note that you can pass an arbitrary number of vector stores to the HybridSearchVectorStore constructor, and they can be of any type as long as they implement the VectorStore interface. For example, this combines three vector stores—one Chroma vector store, one Qdrant vector store, and one PgVector vector store:

import asyncpg
from chromadb import EphemeralClient
from qdrant_client import AsyncQdrantClient

from ragbits.core.vector_stores.hybrid import HybridSearchVectorStore
from ragbits.core.vector_stores.chroma import ChromaVectorStore
from ragbits.core.vector_stores.qdrant import QdrantVectorStore
from ragbits.core.vector_stores.pgvector import PgVectorStore
from ragbits.core.embeddings.dense import LiteLLMEmbedder

postgres_pool = await asyncpg.create_pool("postgresql://user:password@localhost/db")

vector_store_hybrid = HybridSearchVectorStore(
    ChromaVectorStore(
        client=EphemeralClient(),
        index_name="chroma_example",
        embedder=LiteLLMEmbedder(),
    ),
    QdrantVectorStore(
        client=AsyncQdrantClient(location=":memory:"),
        index_name="qdrant_example",
        embedder=LiteLLMEmbedder(),
    ),
    PgVectorStore(
        client=pool,
        table_name="postgres_example",
        vector_size=1536,
        embedder=LiteLLMEmbedder(),
    ),
)

# The entry will be stored in all three vector stores
await vector_store_hybrid.store([VectorStoreEntry(id=uuid.uuid4(), text="Example entry")])

Specifying the Retrieval Strategy for a Hybrid Vector Store#

When you search a hybrid vector store, you can specify a retrieval strategy to determine how the results from the different vector stores are combined. Ragbits comes with the following retrieval strategies:

OrderedHybridRetrivalStrategy: This strategy returns the results from the vector stores ordered by their score. If the same entry is found in multiple vector stores, either the highest score is used or if the sum_scores parameter is set to True, the scores are summed. This is the default strategy.
ReciprocalRankFusion: This strategy combines the results from the vector stores using the Reciprocal Rank Fusion algorithm, which prioritizes entries that appear at the top of the results from individual vector stores. If the same entry is found in multiple vector stores, the scores are summed by default, or if the sum_scores parameter is set to False, the highest score is used.
DistributionBasedScoreFusion: This strategy combines the results from the vector stores using the Distribution-Based Score Fusion algorithm, which normalizes the scores from the individual vector stores so they can be compared and combined sensibly. If the same entry is found in multiple vector stores, either the highest score is used or if the sum_scores parameter is set to True, the scores are summed.

Note that summing the scores from individual stores boosts the entries found in multiple stores. This can be useful when searching through multiple types of embeddings but may not be desirable when searching through multiple modalities since entries containing both text and image embeddings would have an advantage over those containing only one.

To specify a retrieval strategy when searching a hybrid vector store, you can pass it as the retrieval_strategy parameter to the constructor of the HybridSearchVectorStore class. For example, this creates a hybrid vector store with the DistributionBasedScoreFusion retrieval strategy:

from ragbits.core.vector_stores.hybrid import HybridSearchVectorStore
from ragbits.core.vector_stores.in_memory import InMemoryVectorStore
from ragbits.core.vector_stores.hybrid_strategies import DistributionBasedScoreFusion
from ragbits.core.embeddings.dense import LiteLLMEmbedder

embedder = LiteLLMEmbedder()

vector_store_text = InMemoryVectorStore(embedder=embedder, embedding_type=EmbeddingType.TEXT)
vector_store_image = InMemoryVectorStore(embedder=embedder, embedding_type=EmbeddingType.IMAGE)

vector_store_hybrid = HybridSearchVectorStore(
    vector_store_text,
    vector_store_image,
    retrieval_strategy=DistributionBasedScoreFusion(),
)