Blocks
RAG Agent
Retrieval-Augmented Generation agent
Create a RAG (Retrieval-Augmented Generation) agent that retrieves information from document sources to provide accurate, grounded responses.
Block Preview
RAG Agent
Retrieval-Augmented Generation agent
Usage
- Add the block to your workflow and connect it to the upstream step.
- Configure any required credentials or tokens in the inputs.
- Fill in required inputs and optional parameters for the run.
- Run a test execution, inspect outputs, and iterate before deploying.
- Deploy the rag_agent block with monitoring enabled in production.
Inputs (UI)
System Prompt
long-inputPlaceholder: Enter system prompt for the RAG agent...
Layout: full
Query
long-inputPlaceholder: Enter the query to search for information...
Layout: full
Model
dropdownLayout: half
Options: Dynamic options
Temperature
sliderLayout: half
Condition: model = ["gpt-4o","gpt-4o-mini","gpt-4-turbo","gpt-4","gpt-3.5-turbo","o3-mini","gpt-5-mini","gpt-5","o1-mini","o1-preview","gemini-2.5-pro-preview-05-06","gemini-2.5-flash-preview-05-20","gemini-2.0-flash","gemini-2.0-flash-lite","deepseek-chat","deepseek-reasoner","llama-4-scout-17b-16e-instruct","llama-4-maverick-17b-128e-instruct","llama-3.3-70b-versatile","qwen-qwq-32b","deepseek-r1-distill-llama-70b"]
Temperature
sliderLayout: half
Condition: model = ["claude-opus-4-5-20251022","claude-sonnet-4-20250514","claude-opus-4-20250514","claude-haiku-4-5-20251001","claude-3-7-sonnet-20250219","grok-3-latest","grok-3-fast-latest"]
API Key
short-inputPlaceholder: Enter your API key
Layout: full
Condition: model = ["qwen2.5:0.5b","qwen2.5:1.5b","qwen2.5:3b","llama3.2:1b","llama3.2:3b","phi3.5:3.8b"]
Data Source
dropdownLayout: half
Options: Pinecone, Supabase, MongoDB, Airtable
Data Source Configuration
codePlaceholder: { "connectionString": "your-connection-string", "collection": "your-collection-name" }
Layout: full
Retrieval Options
codePlaceholder: { "topK": 5, "similarityThreshold": 0.7, "includeMetadata": true }
Layout: full
Cite Sources
switchLayout: half
Enable Memory
switchLayout: half
Memory Limit
sliderLayout: half
Condition: memoryEnabled = true
Importance Threshold
sliderLayout: half
Condition: memoryEnabled = true
Memory Tags
short-inputPlaceholder: customer-support, billing, technical
Layout: half
Condition: memoryEnabled = true
Inputs (API)
systemPrompt
stringOptional
query
stringRequired
model
stringRequired
apiKey
stringRequired
dataSource
stringRequired
dataSourceConfig
jsonRequired
retrievalOptions
jsonOptional
temperature
numberOptional
citeSources
booleanOptional
memoryEnabled
booleanOptional
memoryLimit
numberOptional
memoryImportance
numberOptional
memoryTags
stringOptional
Outputs
Primary response type:
{
"content": "string",
"model": "string",
"sources": "json",
"tokens": "json"
}