Adding a vector database to an agent application

For agent applications that need grounding in proprietary documents, this guide shows one approach. Use a knowledge base, internal documentation, or a collection of PDFs to give the agent retrieval-based context.

This guide walks through adding a vector database (VDB) to an agent application. A folder of documents becomes searchable semantic knowledge the agent uses for retrieval.

Prerequisites

An App Framework recipe with the Agentic Starter Application or a similar setup.
The LLM component already applied.

The setup

First, configure the infrastructure to use the LLM configuration that supports vector databases. In .env:

INFRA_ENABLE_LLM="blueprint_with_llm_gateway.py"

This switches to an LLM Blueprint that supports VDB retrieval alongside the LLM Gateway.

Knowledge base

Create a knowledgebase/ folder at the project root and add documents, such as PDFs, text files, or Markdown files:

your-project/
├── knowledgebase/
│   ├── important-doc.pdf
│   ├── team-guidelines.md
│   └── product-specs.txt
├── infra/
└── ...

This folder is version-controlled alongside the application code. When documents change, redeploy.

The VDB infrastructure

Create infra/infra/vdb.py:

"""
Vector DB Infrastructure: Dataset, Vector Database, and Deployment.
"""
import os
import shutil
import tempfile
from pathlib import Path

import pulumi
import pulumi_datarobot
from datarobot_pulumi_utils.pulumi import export
from datarobot_pulumi_utils.pulumi.stack import PROJECT_NAME

from . import project_dir, use_case

__all__ = [
    "knowledgebase_dataset",
    "vector_database",
]

KNOWLEDGEBASE_FOLDER_NAME = "knowledgebase"


def create_knowledgebase_zip() -> str:
    """Create a zip file of the knowledgebase folder."""
    knowledgebase_dir = project_dir.parent / "knowledgebase"

    if not knowledgebase_dir.exists():
        raise FileNotFoundError(
            f"Knowledgebase directory not found: {knowledgebase_dir}"
        )

    temp_dir = tempfile.mkdtemp()
    zip_path = Path(temp_dir) / KNOWLEDGEBASE_FOLDER_NAME
    shutil.make_archive(str(zip_path), "zip", knowledgebase_dir)
    return f"{zip_path}.zip"


knowledgebase_zip_path = create_knowledgebase_zip()

knowledgebase_dataset = pulumi_datarobot.DatasetFromFile(
    resource_name=f"Agentic Starter App Knowledgebase [{PROJECT_NAME}]",
    file_path=knowledgebase_zip_path,
    use_case_ids=[use_case.id],
    opts=pulumi.ResourceOptions(depends_on=[use_case]),
)

# Chunking parameters are all configurable via environment variables.
chunking_method = os.environ.get("VDB_CHUNKING_METHOD", "recursive")
chunk_size = int(os.environ.get("VDB_CHUNK_SIZE", "512"))
chunk_overlap = int(os.environ.get("VDB_CHUNK_OVERLAP_PERCENTAGE", "10"))
embedding_model = os.environ.get("VDB_EMBEDDING_MODEL", "intfloat/e5-large-v2")

vector_database = pulumi_datarobot.VectorDatabase(
    resource_name=f"Agentic Starter Application VDB [{PROJECT_NAME}]",
    use_case_id=use_case.id,
    dataset_id=knowledgebase_dataset.id,
    chunking_parameters=pulumi_datarobot.VectorDatabaseChunkingParametersArgs(
        chunking_method=chunking_method,
        chunk_size=chunk_size,
        chunk_overlap_percentage=chunk_overlap,
        embedding_model=embedding_model,
    ),
    opts=pulumi.ResourceOptions(depends_on=[use_case, knowledgebase_dataset]),
)

export("AGENTIC_STARTER_DATASET_ID", knowledgebase_dataset.id)
export("AGENTIC_STARTER_VECTOR_DATABASE_ID", vector_database.id)

This code:

Zips up the knowledgebase/ folder.
Creates a DataRobot dataset from that zip.
Builds a vector database with configurable chunking (512 token chunks, 10% overlap by default).
Uses intfloat/e5-large-v2 as the embedding model.

All chunking parameters are configurable through environment variables in .env, so no code changes are required.

Wire it up

Open infra/infra/llm.py and add the import at the top:

from .vdb import vector_database

Then locate where the llm_blueprint is created and add the vector_database_id parameter:

llm_blueprint = datarobot.LlmBlueprint(
    resource_name="LLM Blueprint " + llm_resource_name,
    playground_id=playground.id,
    llm_id=default_llm_id,
    vector_database_id=vector_database.id,  # add this
    llm_settings=datarobot.LlmBlueprintLlmSettingsArgs(
        max_completion_length=2048,
        temperature=0.1,
        top_p=None,
    ),
)

The agent is now grounded in the knowledge base documents.

Deploy and test

task deploy

Once deployed, the agent pulls relevant context from the knowledge base instead of hallucinating. Ask questions about the documents to validate the retrieval flow.

Why this approach works

Version-controlled — The knowledge base lives in git alongside the application code.
Tunable without code changes — Chunking parameters are env vars.
Automatically rebuilt — Updating documents means adding files and redeploying.
Infrastructure-as-code — The whole stack is reproducible.

This works everywhere

This same approach works for any App template, including Talk to My Docs and Talk to My Data. The pattern stays the same: create vdb.py, wire it into llm.py, and redeploy.

Pro tips

Chunk sizes:

Smaller chunks (256–512 tokens) — better for precise, targeted retrieval.
Larger chunks (1024+ tokens) — better when more context per retrieval hit is important.

Embedding models:

The default intfloat/e5-large-v2 is solid for general use, but domain-specific embedding models may work better for specialized content (legal, medical, technical). Set VDB_EMBEDDING_MODEL in .env to experiment without touching code.

Multiple knowledge bases:

Fork this pattern and create multiple vdb.py files — one per knowledge base. Implement smart routing between VDBs in agent logic to serve different document sets for different query types.