Blocks

RAG Agent

Retrieval-Augmented Generation agent

Create a RAG (Retrieval-Augmented Generation) agent that retrieves information from document sources to provide accurate, grounded responses.

Back to Catalog
Agentsrag_agent

Block Preview

RAG Agent

Retrieval-Augmented Generation agent

Usage

  1. Add the block to your workflow and connect it to the upstream step.
  2. Configure any required credentials or tokens in the inputs.
  3. Fill in required inputs and optional parameters for the run.
  4. Run a test execution, inspect outputs, and iterate before deploying.
  5. Deploy the rag_agent block with monitoring enabled in production.

Inputs (UI)

System Prompt

long-input

Placeholder: Enter system prompt for the RAG agent...

Layout: full

Query

long-input

Placeholder: Enter the query to search for information...

Layout: full

Model

dropdown

Layout: half

Options: Dynamic options

Temperature

slider

Layout: half

Condition: model = ["gpt-4o","gpt-4o-mini","gpt-4-turbo","gpt-4","gpt-3.5-turbo","o3-mini","gpt-5-mini","gpt-5","o1-mini","o1-preview","gemini-2.5-pro-preview-05-06","gemini-2.5-flash-preview-05-20","gemini-2.0-flash","gemini-2.0-flash-lite","deepseek-chat","deepseek-reasoner","llama-4-scout-17b-16e-instruct","llama-4-maverick-17b-128e-instruct","llama-3.3-70b-versatile","qwen-qwq-32b","deepseek-r1-distill-llama-70b"]

Temperature

slider

Layout: half

Condition: model = ["claude-opus-4-5-20251022","claude-sonnet-4-20250514","claude-opus-4-20250514","claude-haiku-4-5-20251001","claude-3-7-sonnet-20250219","grok-3-latest","grok-3-fast-latest"]

API Key

short-input

Placeholder: Enter your API key

Layout: full

Condition: model = ["qwen2.5:0.5b","qwen2.5:1.5b","qwen2.5:3b","llama3.2:1b","llama3.2:3b","phi3.5:3.8b"]

Data Source

dropdown

Layout: half

Options: Pinecone, Supabase, MongoDB, Airtable

Data Source Configuration

code

Placeholder: { "connectionString": "your-connection-string", "collection": "your-collection-name" }

Layout: full

Retrieval Options

code

Placeholder: { "topK": 5, "similarityThreshold": 0.7, "includeMetadata": true }

Layout: full

Cite Sources

switch

Layout: half

Enable Memory

switch

Layout: half

Memory Limit

slider

Layout: half

Condition: memoryEnabled = true

Importance Threshold

slider

Layout: half

Condition: memoryEnabled = true

Memory Tags

short-input

Placeholder: customer-support, billing, technical

Layout: half

Condition: memoryEnabled = true

Inputs (API)

systemPrompt

string

Optional

query

string

Required

model

string

Required

apiKey

string

Required

dataSource

string

Required

dataSourceConfig

json

Required

retrievalOptions

json

Optional

temperature

number

Optional

citeSources

boolean

Optional

memoryEnabled

boolean

Optional

memoryLimit

number

Optional

memoryImportance

number

Optional

memoryTags

string

Optional

Outputs

Primary response type:

{
  "content": "string",
  "model": "string",
  "sources": "json",
  "tokens": "json"
}

Tool Access

openai_chatanthropic_chatgoogle_chatxai_chatdeepseek_chatpineconesupabasemongodbairtable
RAG Agent | NowFlow Blocks