Agent Providers Guide

Focusgroup supports multiple LLM providers via their CLI tools. This guide covers setup and best practices for each.

Overview

Provider	CLI Tool	Best For
Claude (Anthropic)	`claude`	General feedback, nuanced analysis
Codex (OpenAI)	`codex`	Code-focused feedback, OpenAI perspective

All agents operate in CLI mode, invoking the actual CLI tools. This provides authentic agent behavior—the same way agents really use tools.

Why CLI-Only?

Focusgroup deliberately uses only CLI-based agents rather than API calls or SDK integrations. This is a core design philosophy, not a limitation.

The Problem with Approximations

When you call an LLM API directly or use an agent SDK, you're not talking to "the agent"—you're talking to a model through your own harness. You control the system prompt, tool configurations, safety parameters, and context window management. The result is an approximation of an agent, not the real thing.

Many agent implementations are closed-source. We don't know exactly how Claude Code, Codex, or other CLI agents are configured internally:

What system prompts shape their behavior?
What safety layers filter their responses?
How do they manage context and tool use?
What defaults and optimizations are baked in?

These details significantly affect how agents interact with tools—and they're invisible when you roll your own integration.

Authentic Feedback from Real Customers

Focusgroup exists to get feedback on tools designed for AI agents. The agents are the customers. When evaluating a CLI tool that agents will use, you want feedback from:

The actual agent as its provider ships it
With its real configuration and behaviors
Using the same interface it would use in production

You don't want feedback from a simulation you cobbled together using API calls—that feedback reflects your harness, not the agent's actual experience.

CLI Tools as Ground Truth

CLI tools like claude and codex represent the most accurate form of each agent as its provider intends it to operate. By invoking these tools directly, focusgroup captures:

Authentic agent reasoning and interaction patterns
Real-world tool use behavior
Provider-intended safety and capability boundaries

This means focusgroup feedback reflects what agents will actually do with your tool, not what a bespoke API integration might do.

Claude (Anthropic)

Invokes the actual claude CLI tool, providing authentic agent behavior.

Setup:

# Install Claude Code CLI
# See: https://docs.anthropic.com/claude-code

# Authenticate
claude auth login

Config:

[[agents]]
provider = "claude"
name = "Claude"

When to use Claude:

Testing how agents actually interact with tools
Getting feedback that reflects real-world agent usage
When you want the agent to use its full toolset (file reading, web access, etc.)

Codex (OpenAI)

Invokes the OpenAI Codex CLI for code-focused feedback.

Setup:

# Install Codex CLI
# See OpenAI's codex documentation

# Authenticate
codex auth

Config:

[[agents]]
provider = "codex"
name = "Codex"

When to use Codex:

Getting code-focused feedback
Testing CLI tools that agents interact with programmatically
When you want an OpenAI-based CLI agent perspective

Mixing Providers

A key strength of focusgroup is combining multiple providers for diverse perspectives:

[[agents]]
provider = "claude"
name = "Claude-1"
system_prompt = "Focus on UX and ergonomics."

[[agents]]
provider = "claude"
name = "Claude-2"
system_prompt = "Focus on correctness and edge cases."

[[agents]]
provider = "codex"
name = "Codex"
# Code-focused perspective

System Prompts

Customize agent perspectives with system prompts:

[[agents]]
provider = "claude"
name = "DevOps-Engineer"
system_prompt = """You are an experienced DevOps engineer who values:
- Reliability and operational simplicity
- Clear error messages and debugging support
- Consistent behavior across environments

Focus on operational concerns when evaluating this tool."""

[[agents]]
provider = "claude"
name = "Junior-Developer"
system_prompt = """You are a junior developer who is:
- Still learning CLI conventions
- Easily confused by complex options
- Appreciative of good documentation and examples

Evaluate this tool from a beginner's perspective."""

See also: configuration for full config options, exploration for letting agents run tools.

Troubleshooting

Claude CLI Issues

# Verify Claude CLI is installed
claude --version

# Test authentication
claude "Hello, can you respond?"

Codex CLI Issues

# Verify Codex CLI is installed
codex --version

# Test authentication
codex "Hello"