See and control your entire AI system

Refario gives you full visibility into runs, costs, tools, and failures across your AI workflows.

  • Trace every request end-to-end
  • Understand cost across models and workflows
  • Catch failures before they reach users
No credit card required.
Command Center

Operate your AI system from one command surface with natural-language analysis.

Ask Command Center
Natural-language analysis
Results are based on verified metrics so you can trust what you see.
Start with a question

Submit a prompt to preview how Command Center turns a question into a guided next step in the web app.

Suggested inquiries
System snapshot
  • Spend trend: +12% vs yesterday. Daily cost aggregates show the increase concentrated in refund-policy and support-triage.
  • Latency health: 2 workflows spiking. Command Center surfaces the slowest workflows and links the result back to the source page.
  • Error watch: 8 recent failures. Recent failed runs are clustered around tool retries and one provider fallback path.
Recent threads
  • What changed in spend since yesterday?
  • Show recent errors in the last 24 hours
  • What were the slowest workflows today?
Explore the System

Explore a real AI system in motion

Click any step to inspect runs, tools, and decisions

Hover over a step to highlight its path. Click it to reveal step telemetry.
Selected step

Tool execution

healthy

Connected tool completed on first attempt and returned structured data to the agent.

Cost: €0.003
Latency: 280ms
Errors: 0
No errors on this step in the selected scenario.
Tool input
{ tool: 'policy.lookup', order_id: 'rf-8841' }
Tool output
{ status: 'eligible', refund_window_days: 30 }
Why it matters

From system visibility to clear outcomes

Translate what you see in the system map into operational value for engineering, product, and finance.

Debug instantly

Find exactly where requests fail

Control costs

Track spend across models and workflows

Prevent issues

Catch failures and guardrail violations

Understand behavior

See how agents and tools interact

System Layers

Depth without complexity

Three focused layers, each tied back to the same system map.

Runs & Traces

Every request, fully visible

Timeline view of spans, tool calls, and outputs connected to one request path.

  1. request.received
  2. agent.route.selected
  3. llm.response.generated
  4. tool.policy.lookup
  5. output.delivered
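
As a rough illustration of how a timeline like the one above can roll up into per-request numbers, here is a minimal Python sketch. The `Span` class, its field names, and the `summarize` helper are hypothetical and are not Refario's SDK or data model. The timings and the LLM cost are invented sample values; only the tool step's 280ms latency and €0.003 cost come from the example on this page.

```python
from dataclasses import dataclass


@dataclass
class Span:
    """One step on a request path (hypothetical shape, not Refario's schema)."""
    name: str
    start_ms: int
    end_ms: int
    cost_eur: float = 0.0
    error: bool = False


def summarize(spans):
    """Roll a list of spans up into request-level latency, cost, and error count."""
    ordered = sorted(spans, key=lambda s: s.start_ms)
    return {
        "steps": [s.name for s in ordered],
        "latency_ms": max(s.end_ms for s in ordered) - min(s.start_ms for s in ordered),
        "cost_eur": round(sum(s.cost_eur for s in ordered), 6),
        "errors": sum(1 for s in ordered if s.error),
    }


# Illustrative sample values for the five steps listed above.
trace = [
    Span("request.received", 0, 5),
    Span("agent.route.selected", 5, 40),
    Span("llm.response.generated", 40, 900, cost_eur=0.012),
    Span("tool.policy.lookup", 900, 1180, cost_eur=0.003),
    Span("output.delivered", 1180, 1200),
]

summary = summarize(trace)
print(summary)
```

Keeping every span tied to one request is what lets a single view answer both "where did the time go" and "what did this request cost".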
Cost & Usage

Know exactly where your spend goes

See model and workflow spend split without leaving the operating context.

Monthly spend: €12,420
Breakdown: GPT, Claude, Gemini, Other
Guardrails & Reliability

Catch issues before users do

Failures and violation signals surface early with clear ownership and status.

Violations: 12
Blocked: 4
Passed: 2,641
Failure detected: tool timeout increased output latency to 6.1s.
How It Works

Get started in minutes

Remove friction and go from instrumentation to operational insight quickly.

  1. Send your AI runs via SDK
  2. Instantly see traces, costs, and tools
  3. Optimize performance and reliability
Why Refario

Built for real AI systems

Designed for teams running production AI, not isolated demos.

Works across LLM apps, agents, and MCP tools

Combines observability, cost, and operations

Built for engineering, product, and finance teams

Pricing

Start free. Get full visibility from day one.

Low-friction entry with a clear free plan and no credit card required.

Free plan available. No credit card required.
Free
For evaluating Refario with real traffic
€0/mo
  • 10k runs / month
  • 1 project
  • 1 user
  • 7-day retention
  • Runs, traces, dashboards, workflow views, and SDK ingestion
Start Free
Starter
For small teams that need proactive monitoring
€19/mo
  • 100k runs / month
  • 3 projects
  • 2 users
  • 30-day retention
  • Anomaly detection, guardrails, alerts, and workflow analytics
Upgrade to Starter
Team
Most popular
For teams running shared AI operations
€59/mo
  • 1M runs / month
  • 10 projects
  • 10 users
  • 90-day retention
  • Finance dashboard, cost attribution, scheduled reports, and Slack/webhook alerts
Upgrade to Team
Enterprise
For scale and custom rollout needs
Custom
  • Higher negotiated limits
  • SSO, audit logs, and SLA
  • Custom retention
  • Dedicated onboarding
  • Custom deployment options
  • Commercial terms for larger rollouts
Contact Us
FAQ

Answers to common questions

Current capabilities teams ask about most.

Which integrations are available right now?

Provider options in-app include OpenAI (API key), Claude (Anthropic) (API key), Gemini (Google) (OAuth or API key), Slack AI (OAuth, API key, or MCP), Notion AI (OAuth or MCP), Cursor (API key), and Custom MCP (MCP). Live usage sync flows are currently active for OpenAI, Anthropic, Cursor, and Slack.

Can we analyze specific models?

Yes. Model breakdown views show provider/model rows with calls, error rate, average latency, token totals, and cost. Typical examples include GPT-5.3-Codex, GPT-5.2-Codex, claude-sonnet-4-6, and gemini-3.1-pro-preview.
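
To make the row fields concrete, the sketch below groups raw call records by provider and model and derives calls, error rate, average latency, token totals, and cost. It is an illustration under stated assumptions: the record keys, the provider labels, and the `model_breakdown` function are hypothetical, and the sample numbers are invented. It is not how Refario computes its views.

```python
from collections import defaultdict


def model_breakdown(calls):
    """Aggregate call records into one row per (provider, model) pair."""
    groups = defaultdict(list)
    for call in calls:
        groups[(call["provider"], call["model"])].append(call)

    rows = []
    for (provider, model), items in sorted(groups.items()):
        n = len(items)
        rows.append({
            "provider": provider,
            "model": model,
            "calls": n,
            "error_rate": sum(1 for i in items if i["error"]) / n,
            "avg_latency_ms": sum(i["latency_ms"] for i in items) / n,
            "tokens": sum(i["tokens"] for i in items),
            "cost_eur": round(sum(i["cost_eur"] for i in items), 6),
        })
    return rows


# Invented sample records; the model names are the ones mentioned above.
calls = [
    {"provider": "anthropic", "model": "claude-sonnet-4-6",
     "latency_ms": 800, "tokens": 1200, "cost_eur": 0.01, "error": False},
    {"provider": "anthropic", "model": "claude-sonnet-4-6",
     "latency_ms": 1200, "tokens": 800, "cost_eur": 0.02, "error": True},
    {"provider": "google", "model": "gemini-3.1-pro-preview",
     "latency_ms": 500, "tokens": 600, "cost_eur": 0.005, "error": False},
]

rows = model_breakdown(calls)
for row in rows:
    print(row)
```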

Do you support tools and guardrails?

Yes. The current web app includes a Tools page for the MCP registry, tool-call volume, success rate, and latency monitoring, with alerts for newly detected tools and reliability drops. It also includes a Guardrails page for rule coverage, violation logs, and time series.

Do you support budget and reporting workflows?

Yes. Budget Guardrails support shared thresholds by scope, Spend Forecast projects monthly cost, and Scheduled Reports can be delivered by email or webhook.
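
The page does not describe how Spend Forecast arrives at its projection. As a simple mental model only, the sketch below uses a naive run-rate: month-to-date spend divided by days elapsed, multiplied by days in the month, then compared against a budget threshold. The function names, the date, and the amounts are assumptions chosen for illustration.

```python
import calendar
from datetime import date


def forecast_month_spend(spend_to_date, today):
    """Naive run-rate projection: average daily spend so far times days in the month."""
    days_in_month = calendar.monthrange(today.year, today.month)[1]
    return spend_to_date / today.day * days_in_month


def exceeds_budget(projected, threshold):
    """True when the projected monthly spend is above the budget threshold."""
    return projected > threshold


# Illustrative numbers: €6,210 spent by 15 June (a 30-day month).
projected = forecast_month_spend(6210.0, date(2025, 6, 15))
print(projected)
print(exceeds_budget(projected, threshold=10000.0))
```

A run-rate projection ignores weekly seasonality and traffic spikes, so a production forecast would likely weight recent days or model the trend.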

Why pick Refario over a tracing-only tool?

Choose Refario when the hard production question is no longer just what happened inside one model call, but which workflow, tool, or guardrail changed outcomes and what that shift cost.

Start understanding your AI system today

See runs, costs, and performance in minutes.

Start free