# FAQ

Common questions about RubyLLM::Agents.
## What is RubyLLM::Agents?

RubyLLM::Agents is a Rails engine for building, managing, and monitoring LLM-powered AI agents. It provides:
- Clean DSL for agent configuration
- Automatic execution tracking
- Cost analytics and budget controls
- Reliability features (retries, fallbacks, circuit breakers)
- Real-time dashboard
## How does it compare to LangChain?

| Aspect | RubyLLM::Agents | LangChain |
|---|---|---|
| Language | Ruby/Rails | Python/JS |
| Integration | Rails-native | Framework-agnostic |
| Focus | Production operations | Rapid prototyping |
| Observability | Built-in dashboard | Requires add-ons |
| Cost tracking | Automatic | Manual |
## Which providers are supported?

Through RubyLLM, we support:
- OpenAI (GPT-4, GPT-4o, GPT-3.5)
- Anthropic (Claude 3.5, Claude 3)
- Google (Gemini 2.0, Gemini 1.5)
- And more via RubyLLM
## What are the requirements?

- Ruby >= 3.1.0
- Rails >= 7.0
## How do I configure API keys?

Unified configuration (recommended, v2.1+):

```ruby
# config/initializers/ruby_llm_agents.rb
RubyLLM::Agents.configure do |config|
  config.openai_api_key = ENV["OPENAI_API_KEY"]
  config.anthropic_api_key = ENV["ANTHROPIC_API_KEY"]
  config.gemini_api_key = ENV["GOOGLE_API_KEY"]
end
```

Or use environment variables (auto-detected by RubyLLM):

```bash
OPENAI_API_KEY=sk-...
ANTHROPIC_API_KEY=sk-ant-...
GOOGLE_API_KEY=...
```

## Do I need to configure RubyLLM separately?

No. As of v2.1.0, all RubyLLM provider settings (API keys, custom endpoints, Bedrock credentials, etc.) can be configured directly in `RubyLLM::Agents.configure`. They are automatically forwarded to `RubyLLM.config`. See Configuration for the full list of forwarded attributes.
## How do I set a default model?

```ruby
# config/initializers/ruby_llm_agents.rb
RubyLLM::Agents.configure do |config|
  config.default_model = "gpt-4o"
end
```

## How do I cache agent responses?

```ruby
class MyAgent < ApplicationAgent
  cache 1.hour # Cache for 1 hour
end
```

## How do I secure the dashboard?

```ruby
config.dashboard_auth = ->(controller) {
  controller.current_user&.admin?
}
```

## How do I read a result?

```ruby
result = MyAgent.call(query: "test")
result.content    # The response
result.total_cost # Cost in USD
```

## How do I stream responses?

```ruby
class StreamingAgent < ApplicationAgent
  streaming true
end

StreamingAgent.call(user: "Write a story") do |chunk|
  print chunk
end
```

## How do I send images?

```ruby
result = VisionAgent.call(
  question: "Describe this image",
  with: "photo.jpg"
)
```

## How do I get structured output?

```ruby
def schema
  @schema ||= RubyLLM::Schema.create do
    string :title
    array :tags, of: :string
  end
end
```

## How do I preview prompts without making API calls?

```ruby
result = MyAgent.call(query: "test", dry_run: true)
# Shows prompts without making an API call
```

## How are costs calculated?

Costs are calculated based on:
- Input tokens × model input price
- Output tokens × model output price
Prices are from RubyLLM's model pricing data.
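As a concrete illustration, the formula above can be applied by hand. The per-token prices below are hypothetical placeholders, not RubyLLM's actual pricing data:

```ruby
# Hypothetical per-million-token prices in USD; real values come
# from RubyLLM's model pricing data.
INPUT_PRICE_PER_M  = 2.50
OUTPUT_PRICE_PER_M = 10.00 # output tokens are typically pricier

def cost_usd(input_tokens, output_tokens)
  (input_tokens / 1_000_000.0) * INPUT_PRICE_PER_M +
    (output_tokens / 1_000_000.0) * OUTPUT_PRICE_PER_M
end

cost_usd(1_200, 350) # 1,200 input + 350 output tokens => 0.0065
```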
## How do I set budget limits?

```ruby
config.budgets = {
  global_daily: 100.0,    # $100/day
  global_monthly: 2000.0, # $2000/month
  enforcement: :hard      # Block when exceeded
}
```

## How do I check budget status?

```ruby
RubyLLM::Agents::BudgetTracker.status
# => { global_daily: { limit: 100, current: 45.50, ... } }
```

Check if a budget is exceeded:

```ruby
RubyLLM::Agents::BudgetTracker.exceeded?(:global, :daily)
```

## How do retries work?

```ruby
retries max: 3, backoff: :exponential
```

Failed requests are automatically retried with increasing delays.
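The exponential schedule can be sketched in plain Ruby. The 1-second base delay is an assumption for illustration; the gem's actual timing may differ:

```ruby
# Delay before retry attempt n: base * 2^(n - 1)
def backoff_delays(max_retries, base: 1.0)
  (1..max_retries).map { |attempt| base * 2**(attempt - 1) }
end

backoff_delays(3) # => [1.0, 2.0, 4.0] seconds
```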
## How do fallback models work?

```ruby
model "gpt-4o"
fallback_models "gpt-4o-mini", "claude-3-haiku"
```

If the primary model fails, the fallbacks are tried in order.
## What are circuit breakers?

Circuit breakers prevent cascading failures by temporarily blocking requests to failing services.

```ruby
circuit_breaker errors: 10, within: 60, cooldown: 300
```

After 10 errors within 60 seconds, requests are blocked for 5 minutes (300 seconds).
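To make the mechanics concrete, here is a minimal plain-Ruby sketch of the circuit-breaker idea. It illustrates the concept only and is not the gem's actual implementation:

```ruby
# Trips after `threshold` errors within `window` seconds,
# then blocks calls for `cooldown` seconds.
class CircuitBreakerSketch
  def initialize(threshold:, window:, cooldown:)
    @threshold = threshold
    @window = window
    @cooldown = cooldown
    @errors = []     # timestamps of recent failures
    @opened_at = nil # when the circuit tripped, nil if closed
  end

  # True while the circuit is blocking requests.
  def open?(now = Time.now.to_f)
    return true if @opened_at && (now - @opened_at) < @cooldown

    @opened_at = nil # cooldown elapsed: close the circuit again
    false
  end

  def record_error(now = Time.now.to_f)
    @errors << now
    @errors.reject! { |t| now - t > @window } # drop errors outside the window
    @opened_at = now if @errors.size >= @threshold
  end
end
```

With `threshold: 10, window: 60, cooldown: 300` this mirrors the DSL example above.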
## How can I improve performance?

- Enable caching: `cache 1.hour`
- Use streaming: `streaming true`
- Use faster models: `model "gpt-4o-mini"`
- Enable async logging: `config.async_logging = true`
## How can I reduce costs?

- Enable caching
- Use cheaper models for simple tasks
- Set budget limits
- Optimize prompts (shorter = cheaper)
## Why is the dashboard slow?

- Too much data: set `config.retention_period = 30.days`
- Missing indexes: run `rails generate ruby_llm_agents:upgrade`
- Complex queries: reduce `config.dashboard_per_page`
## What data is stored?

By default, each execution records:
- Agent type, model, status
- Token counts, costs, duration
- Parameters
- Prompts (optional)
- Responses (optional)
Disable storage of sensitive content:

```ruby
config.persist_prompts = false
config.persist_responses = false
```

## How long is data retained?

```ruby
config.retention_period = 30.days
```

Run cleanup regularly to delete old data.
## How do I compose multiple agents?

Compose agents by calling one from another's result:

```ruby
# Sequential composition
intent_result = IntentAgent.call(query: user_input)
response_result = ResponseAgent.call(
  query: user_input,
  intent: intent_result.content[:intent]
)
```

For complex orchestration patterns (pipelines, parallel execution, routing), use a dedicated workflow library like Temporal or Sidekiq.
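The sequential pattern generalizes to a simple reduce over steps. This is a hypothetical plain-Ruby sketch; the lambda "agents" and context-hash shape are placeholders, not the gem's API:

```ruby
# Thread each step's output into the next via a shared context hash.
def pipeline(context, steps)
  steps.reduce(context) { |ctx, step| ctx.merge(step.call(ctx)) }
end

# Stand-ins for agent calls:
classify = ->(ctx) { { intent: ctx[:query].include?("buy") ? :purchase : :other } }
respond  = ->(ctx) { { reply: "Handling #{ctx[:intent]} request" } }

pipeline({ query: "I want to buy shoes" }, [classify, respond])
# => { query: "I want to buy shoes", intent: :purchase, reply: "Handling purchase request" }
```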
## Why are my agent's responses wrong or empty?

- Check for errors: `result.success?`
- Check that the schema matches the response
- Try `dry_run: true` to inspect the prompts

## Why aren't executions being logged?

- Check async logging: is the job processor running?
- Try sync logging: `config.async_logging = false`
- Check for database errors

## How do I handle rate limits?

- Add retries with backoff
- Add fallback models
- Implement request queuing

## How do I keep the database small?

- Disable response storage
- Set a retention period
- Use streaming for large responses
## Where can I get help?

- GitHub Issues: https://github.com/adham90/ruby_llm-agents/issues
- GitHub Discussions: https://github.com/adham90/ruby_llm-agents/discussions

## How can I contribute?

See the Contributing guide.
## See also

- Troubleshooting - Detailed solutions
- Configuration - Full config reference
- API Reference - Class documentation