A powerful command-line interface for managing and interacting with the Inference Gateway. This CLI provides tools for configuration, monitoring, and management of inference services.
Early Development Stage: This project is in its early development stage and breaking changes are expected until it reaches a stable version.
Always pin to a specific version tag when downloading binaries or using install scripts.
- Features
- Installation
- Quick Start
- Commands
- Configuration
- Tool Approval System
- Available Tools for LLMs
- Shortcuts System
- Global Flags
- Examples
- Development
- License
- Automatic Gateway Management: Automatically downloads and runs the Inference Gateway binary (no Docker required!)
- Zero-Configuration Setup: Start chatting immediately with just your API keys in a `.env` file
- Interactive Chat: Chat with models using an interactive interface
- Status Monitoring: Check gateway health and resource usage
- Conversation History: Store and retrieve past conversations with multiple storage backends
- Conversation Storage - Detailed storage backend documentation
- Conversation Title Generation - AI-powered title generation system
- Configuration Management: Manage gateway settings via YAML config
- Project Initialization: Set up local project configurations
- Tool Execution: LLMs can execute whitelisted commands and tools including:
- Bash: Execute safe shell commands
- Read: Read file contents with optional line ranges
- Write: Write content to files with security controls
- Grep: Fast ripgrep-powered search with regex support and multiple output modes
- WebSearch: Search the web using DuckDuckGo or Google
- WebFetch: Fetch content from whitelisted URLs
- Github: Interact with GitHub API to fetch issues, pull requests, and create content
- Tree: Display directory structure with polyfill support
- Delete: Delete files and directories with security controls
- Edit: Perform exact string replacements in files
- MultiEdit: Make multiple edits to files in atomic operations
- TodoWrite: Create and manage structured task lists
- A2A Tools: Agent-to-agent communication for task delegation and coordination
- Tool Approval System: User approval workflow for sensitive operations with real-time diff visualization for file modifications
- Agent Modes: Three operational modes for different workflows:
- Standard Mode (default): Normal operation with all configured tools and approval checks
- Plan Mode: Read-only mode for planning and analysis without execution
- Auto-Accept Mode: All tools auto-approved for rapid execution (YOLO mode)
- Toggle between modes with Shift+Tab
- Token Usage Tracking: Accurate token counting with polyfill support for providers (like Ollama Cloud) that don't return usage metrics in their responses
- Inline History Auto-Completion: Smart command history suggestions with inline completion
- Ghost-text suggestions appear as you type, matching command history by prefix
- Right Arrow to accept the entire suggestion
- Tab to cycle through multiple matching suggestions
- Suggestions automatically update as you type
go install github.com/inference-gateway/cli@latest

For containerized environments, you can use the official container image:
# Run the CLI directly
docker run --rm -it ghcr.io/inference-gateway/cli:latest --help
# With volume mount for config persistence
docker run --rm -it -v ~/.infer:/home/infer/.infer ghcr.io/inference-gateway/cli:latest
# Example: Run chat command
docker run --rm -it -v ~/.infer:/home/infer/.infer ghcr.io/inference-gateway/cli:latest chat

Using a specific version:

docker run --rm -it ghcr.io/inference-gateway/cli:0.48.12

Available architectures: linux/amd64, linux/arm64
For quick installation, you can use our install script:
Unix/macOS/Linux:
curl -fsSL https://raw.githubusercontent.com/inference-gateway/cli/main/install.sh | bash

With a specific version:

curl -fsSL https://raw.githubusercontent.com/inference-gateway/cli/main/install.sh | bash -s -- --version latest

Custom install directory:

curl -fsSL https://raw.githubusercontent.com/inference-gateway/cli/main/install.sh | bash -s -- --install-dir $HOME/.local/bin

The install script will:
- Detect your operating system and architecture automatically
- Download the appropriate binary from GitHub releases
- Install to `/usr/local/bin` by default (or a custom directory with `--install-dir`)
- Make the binary executable
- Verify the installation
Download the latest release binary for your platform from the releases page.
All release binaries are signed with Cosign for supply chain security. You can verify the integrity and authenticity of downloaded binaries using the following steps:
1. Download the binary, checksums, and signature files:
# Download binary (replace with your platform)
curl -L -o infer-darwin-amd64 \
https://github.com/inference-gateway/cli/releases/latest/download/infer-darwin-amd64
# Download checksums and signature files
curl -L -o checksums.txt \
https://github.com/inference-gateway/cli/releases/latest/download/checksums.txt
curl -L -o checksums.txt.pem \
https://github.com/inference-gateway/cli/releases/latest/download/checksums.txt.pem
curl -L -o checksums.txt.sig \
https://github.com/inference-gateway/cli/releases/latest/download/checksums.txt.sig

2. Verify the SHA256 checksum:
# Calculate checksum of downloaded binary
shasum -a 256 infer-darwin-amd64
# Compare with checksums in checksums.txt
grep infer-darwin-amd64 checksums.txt

3. Verify the Cosign signature (requires Cosign to be installed):
# Decode base64 encoded certificate
cat checksums.txt.pem | base64 -d > checksums.txt.pem.decoded
# Verify the signature
cosign verify-blob \
--certificate checksums.txt.pem.decoded \
--signature checksums.txt.sig \
--certificate-identity "https://github.com/inference-gateway/cli/.github/workflows/release.yml@refs/heads/main" \
--certificate-oidc-issuer "https://token.actions.githubusercontent.com" \
checksums.txt

4. Make the binary executable and install it:
chmod +x infer-darwin-amd64
sudo mv infer-darwin-amd64 /usr/local/bin/inferNote: Replace
latestwith the desired release version (e.g.,v0.48.12) andinfer-darwin-amd64with your platform's binary name.
git clone https://github.com/inference-gateway/cli.git
cd cli
go build -o infer .

Getting started with the Inference Gateway CLI is simple:
1. Install the CLI (choose one method):

   # Using install script (recommended)
   curl -fsSL https://raw.githubusercontent.com/inference-gateway/cli/main/install.sh | bash

   # Using Go
   go install github.com/inference-gateway/cli@latest

   # Using the container image
   docker run --rm -it ghcr.io/inference-gateway/cli:latest

2. Add your API keys (create a `.env` file):

   # Example .env file
   ANTHROPIC_API_KEY=your_key_here
   OPENAI_API_KEY=your_key_here
   GOOGLE_API_KEY=your_key_here
   # Add other provider API keys as needed

3. Start chatting (the gateway is automatically managed):

   infer chat
That's it! The CLI will automatically:
- Download and run the Inference Gateway binary in the background
- Load your API keys from the `.env` file
- Connect you to available models
Initialize a new project with Inference Gateway CLI. This creates:
- `.infer/` directory with:
  - `config.yaml` - Main configuration file for the project
  - `.gitignore` - Ensures sensitive files are not committed to version control
This is the recommended command to start working with Inference Gateway CLI in a new project.
Options:
- `--overwrite`: Overwrite existing files if they already exist
- `--userspace`: Initialize configuration in the user home directory (`~/.infer/`)
Examples:
# Initialize project-level configuration (default)
infer init
infer init --overwrite
# Initialize userspace configuration (global fallback)
infer init --userspace

Manage CLI configuration settings including models, system prompts, and tools.
Initialize a new .infer/config.yaml configuration file in the current
directory. This creates only the configuration file with default settings.
For complete project initialization, use infer init instead.
Options:
- `--overwrite`: Overwrite the existing configuration file
- `--userspace`: Initialize configuration in the user home directory (`~/.infer/`)
Examples:
# Initialize project-level configuration (default)
infer config init
infer config init --overwrite
# Initialize userspace configuration (global fallback)
infer config init --userspace

Set the default model for chat sessions. When set, chat sessions will automatically use this model without showing the model selection prompt.
Examples:
infer config agent set-model openai/gpt-4-turbo
infer config agent set-model anthropic/claude-opus-4-1-20250805

Set a system prompt that will be included with every chat session, providing context and instructions to the AI model.
Examples:
infer config agent set-system "You are a helpful assistant."
infer config agent set-system "You are a Go programming expert."

Set the maximum number of turns for agent sessions.
Examples:
infer config agent set-max-turns 100

Set the maximum number of tools that can execute concurrently.
Examples:
infer config agent set-max-concurrent-tools 5

Enable or disable verbose tool output for agent sessions.
Examples:
infer config agent verbose-tools enable
infer config agent verbose-tools disable

Manage export settings for conversation exports.
Subcommands:
- `set-model <model>`: Set the model used for generating export summaries
- `show`: Display current export configuration
Examples:
infer config export set-model anthropic/claude-4.1-haiku
infer config export show

Manage tool execution settings for LLMs, including enabling/disabling tools, managing whitelists, and security settings.
Subcommands:
- `enable`: Enable tool execution for LLMs
- `disable`: Disable tool execution for LLMs
- `list [--format text|json]`: List whitelisted commands and patterns
- `validate <command>`: Validate whether a command is whitelisted
- `exec <command> [--format text|json]`: Execute a whitelisted command directly
- `safety`: Manage safety approval settings
  - `enable`: Enable safety approval prompts
  - `disable`: Disable safety approval prompts
  - `status`: Show current safety approval status
- `sandbox`: Manage sandbox directories for security
  - `list`: List all sandbox directories
  - `add <path>`: Add a protected path to the sandbox
  - `remove <path>`: Remove a protected path from the sandbox
- `bash`: Manage Bash tool settings
  - `enable`: Enable the Bash tool
  - `disable`: Disable the Bash tool
- `grep`: Manage Grep tool settings
  - `enable`: Enable the Grep tool
  - `disable`: Disable the Grep tool
  - `set-backend <backend>`: Set the grep backend ("ripgrep" or "go")
  - `status`: Show current Grep tool configuration
- `web-search`: Manage WebSearch tool settings
  - `enable`: Enable the WebSearch tool
  - `disable`: Disable the WebSearch tool
- `web-fetch`: Manage WebFetch tool settings
  - `enable`: Enable the WebFetch tool
  - `disable`: Disable the WebFetch tool
  - `list`: List whitelisted domains
  - `add-domain <domain>`: Add a domain to the whitelist
  - `remove-domain <domain>`: Remove a domain from the whitelist
  - `cache`: Manage the WebFetch cache
    - `status`: Show cache status
    - `clear`: Clear the cache
- `github`: Manage GitHub tool settings
  - `enable`: Enable the GitHub tool
  - `disable`: Disable the GitHub tool
  - `status`: Show current GitHub tool configuration
  - `set-token <token>`: Set a GitHub personal access token
  - `set-owner <owner>`: Set the default GitHub owner/organization
  - `set-repo <repo>`: Set the default GitHub repository
Examples:
# Enable/disable tool execution
infer config tools enable
infer config tools disable
# List whitelisted commands
infer config tools list
infer config tools list --format json
# Validate and execute commands
infer config tools validate "ls -la"
infer config tools exec "git status"
# Manage global safety settings (approval prompts)
infer config tools safety enable # Enable approval prompts for all tool execution
infer config tools safety disable # Disable approval prompts (execute tools immediately)
infer config tools safety status # Show current safety approval status
# Manage excluded paths
infer config tools sandbox list
infer config tools sandbox add ".github/"
infer config tools sandbox remove "test.txt"
# Manage individual tools
infer config tools bash enable
infer config tools bash disable
infer config tools grep set-backend ripgrep
infer config tools grep status
infer config tools web-search enable
infer config tools web-search disable
infer config tools web-fetch add-domain "example.com"
infer config tools web-fetch list
infer config tools web-fetch cache status
infer config tools web-fetch cache clear
infer config tools github set-token "ghp_xxxxxxxxxxxx"
infer config tools github set-owner "my-org"
infer config tools github set-repo "my-repo"
infer config tools github status

Manage A2A (Agent-to-Agent) agent configurations. This command allows you to configure and manage connections to specialized A2A agents for task delegation and distributed processing.
Subcommands:
- `init`: Initialize the agents.yaml configuration file
- `add <url> [name]`: Add a new A2A agent endpoint
- `update <name> <url>`: Update an existing agent's URL
- `list`: List all configured agents
- `show <name>`: Show details for a specific agent
- `remove <name>`: Remove an agent from the configuration
Examples:
# Initialize agents configuration
infer agents init
# Add agents
infer agents add http://security-agent:8080 security
infer agents add http://docs-agent:8080 documentation
# List all agents
infer agents list
# Show agent details
infer agents show security
# Update agent URL
infer agents update security http://security-agent:9090
# Remove agent
infer agents remove security

For more details on A2A agents, see the A2A Tools section.
Check the status of the inference gateway including health checks and resource usage.
Examples:
infer status

Start an interactive chat session with model selection. Provides a conversational interface where you can select models and have conversations.
Features:
- Interactive model selection
- Conversational interface
- Real-time streaming responses
- Scrollable chat history with mouse wheel and keyboard support
Navigation Controls:
- Mouse wheel: Scroll up/down through chat history
- Arrow keys (↑/↓) or Vim keys (k/j): Scroll one line at a time
- Page Up/Page Down: Scroll by page
- Home/End: Jump to top/bottom of chat history
- Shift+↑/Shift+↓: Half-page scrolling
- Ctrl+O: Toggle expanded view of tool results
- Shift+Tab: Cycle agent mode (Standard → Plan → Auto-Accept)
Agent Modes:
The chat interface supports three operational modes that can be toggled with Shift+Tab:
-
Standard Mode (default): Normal operation with all configured tools and approval checks enabled. The agent has access to all tools defined in your configuration and will request approval for sensitive operations (Write, Edit, Delete, Bash, etc.).
-
Plan Mode: Read-only mode designed for planning and analysis. In this mode, the agent:
- Can only use Read, Grep, Tree, and A2A_QueryAgent tools to gather information
- Is instructed to analyze tasks and create detailed plans without executing changes
- Provides step-by-step breakdowns of what would be done in Standard mode
- Plan Approval: When the agent completes planning, you'll be prompted to:
- Accept: Approve the plan and continue (stays in Plan Mode)
- Reject (n or Esc): Reject the plan and provide feedback or changes
- Accept & Auto-Approve (a): Accept the plan AND switch to Auto-Accept mode for execution
- Useful for understanding codebases or previewing changes before implementation
-
⚡ Auto-Accept Mode (YOLO mode): All tool executions are automatically approved without prompting. The agent:
- Has full access to all configured tools
- Bypasses all approval checks and safety guardrails
- Executes modifications immediately without confirmation
- Ideal for trusted workflows or when rapid iteration is needed
- Use with caution - ensure you have backups and version control
The current mode is displayed below the input field when not in Standard mode. Toggle between modes anytime during a chat session.
System Reminders:
The chat interface supports configurable system reminders that can provide periodic contextual information to the AI model during conversations. These reminders help maintain context and provide relevant guidance throughout the session.
- Customizable interval: Set how often reminders appear (in number of messages)
- Dynamic content: Reminders can contain contextual information based on the current state
- Non-intrusive: Reminders are sent to the AI model but don't interrupt the user experience
- Configurable: Enable/disable and customize reminder content through configuration
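This document does not show the exact configuration keys for system reminders. As a hedged sketch only, the relevant section of `.infer/config.yaml` might look something like the following — the key names here are hypothetical, so check your generated config for the actual schema:

```yaml
# Hypothetical sketch — actual key names may differ from your generated config.yaml
chat:
  system_reminders:
    enabled: true   # turn periodic reminders on or off
    interval: 10    # inject a reminder every 10 messages
    content: "Follow the project's coding conventions and keep answers concise."
```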
Examples:
infer chat

Execute a task using an autonomous agent in background mode. The CLI will work iteratively until the task is considered complete. Particularly useful for SCM tickets like GitHub issues.
Features:
- Autonomous execution: Agent works independently to complete tasks
- Iterative processing: Continues until task completion criteria are met
- Tool integration: Full access to all available tools (Bash, Read, Write, etc.)
- Parallel tool execution: Executes multiple tool calls simultaneously for improved efficiency
- Background operation: Runs without interactive user input
- Task completion detection: Automatically detects when tasks are complete
- Configurable concurrency: Control the maximum number of parallel tool executions (default: 5)
- JSON output: Structured JSON output for easy parsing and integration
- Multimodal support: Process images and files with vision-capable models
Options:
- `-m, --model`: Model to use for the agent (e.g., openai/gpt-4)
- `-f, --files`: Files or images to include (can be specified multiple times)
Examples:
# Execute a task described in a GitHub issue
infer agent "Please fix the github issue 38"
# Use a specific model for the agent
infer agent --model "openai/gpt-4" "Implement the feature described in issue #42"
# Debug a failing test
infer agent "Debug the failing test in PR 15"
# Refactor code
infer agent "Refactor the authentication module to use JWT tokens"
# Analyze screenshots with vision-capable models
infer agent "Analyze this screenshot and identify the UI issue" --files screenshot.png
# Process multiple images
infer agent "Compare these diagrams and suggest improvements" -f diagram1.png -f diagram2.png
# Mix images and code files using @filename syntax
infer agent "Review @app.go and @architecture.png and suggest refactoring"
# Combine --files flag with @filename references
infer agent "Analyze @error.log and this screenshot" --files debug-screen.png

Image and File Support:
The agent command supports multimodal content for vision-capable models:
- Use the `--files` or `-f` flag to attach images or files
- Use the `@filename` syntax in the task description to reference files
- Supported image formats: PNG, JPEG, GIF, WebP
- Images are automatically encoded as base64 and sent as multimodal content
- Text files are embedded in code blocks
- Requires gateway configuration: `ENABLE_VISION=true`
Manage AI-powered conversation title generation. The CLI can automatically generate descriptive titles for conversations to improve organization and searchability.
Subcommands:
- `generate [conversation-id]`: Generate titles for conversations (all or a specific one)
- `status`: Show title generation status and statistics
- `daemon`: Run the title generation daemon in the background
Examples:
# Generate titles for all conversations without titles
infer conversation-title generate
# Generate title for a specific conversation
infer conversation-title generate conv-12345
# Check title generation status
infer conversation-title status
# Run daemon for automatic title generation
infer conversation-title daemon

Features:
- Automatic Generation: Titles are generated based on conversation content
- Batch Processing: Generate titles for multiple conversations at once
- Configurable Model: Use any available model for title generation
- Background Daemon: Optional daemon mode for continuous title generation
Configuration:
conversation:
title_generation:
enabled: true
model: "deepseek/deepseek-chat"
batch_size: 5
    interval: 30 # seconds between generation attempts

For more details, see the Conversation Title Generation documentation.
Display version information for the Inference Gateway CLI.
Examples:
infer version

When tool execution is enabled, LLMs can use the following tools to interact with the system:
Display directory structure in a tree format, similar to the Unix tree command. Provides a polyfill
implementation when the native tree command is unavailable.
Parameters:
- `path` (optional): Directory path to display the tree structure for (default: current directory)
- `max_depth` (optional): Maximum depth to traverse (unlimited by default)
- `show_hidden` (optional): Whether to show hidden files and directories (default: false)
- `respect_gitignore` (optional): Whether to exclude patterns from .gitignore (default: true)
- `format` (optional): Output format - "text" or "json" (default: "text")
Examples:
- Basic tree: Uses current directory with default settings
- Tree with depth limit: `max_depth: 2` - shows only 2 levels deep
- Tree with hidden files: `show_hidden: true`
- Tree ignoring gitignore: `respect_gitignore: false` - shows all files, including those in .gitignore
- JSON output: `format: "json"` - returns structured data
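Combining several of these parameters into a single tool-call payload (the same JSON shape used by the MultiEdit and TodoWrite examples later in this document) might look like this — the values are illustrative:

```json
{
  "path": "src/",
  "max_depth": 2,
  "show_hidden": false,
  "respect_gitignore": true,
  "format": "json"
}
```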
Features:
- Native Integration: Uses the system `tree` command when available for optimal performance
- Polyfill Implementation: Falls back to a custom implementation when `tree` is not installed
- Pattern Exclusion: Supports glob patterns to exclude specific files and directories
- Depth Control: Limit traversal depth to prevent overwhelming output
- Hidden File Control: Toggle visibility of hidden files and directories
- Multiple Formats: Text output for readability, JSON for structured data
Security:
- Respects configured path exclusions for security
- Validates directory access permissions
- Limited by the same security restrictions as other file tools
Execute whitelisted bash commands securely with validation against configured command patterns.
Read file content from the filesystem with optional line range specification.
Write content to files on the filesystem with security controls and directory creation support.
Parameters:
- `file_path` (required): The path to the file to write
- `content` (required): The content to write to the file
- `create_dirs` (optional): Whether to create parent directories if they don't exist (default: true)
- `overwrite` (optional): Whether to overwrite existing files (default: true)
- `format` (optional): Output format - "text" or "json" (default: "text")
Features:
- Directory Creation: Automatically creates parent directories when needed
- Overwrite Control: Configurable behavior for existing files
- Security Validation: Respects path exclusions and security restrictions
- Performance Optimized: Efficient file writing with proper error handling
Security:
- Approval Required: Write operations require approval by default (secure by default)
- Path Exclusions: Respects configured excluded paths (e.g., the `.infer/` directory)
- Pattern Matching: Supports glob patterns for path exclusions
- Validation: Validates file paths and content before writing
Examples:
- Create new file: `file_path: "output.txt"`, `content: "Hello, World!"`
- Write to subdirectory: `file_path: "logs/app.log"`, `content: "log entry"`, `create_dirs: true`
- Safe overwrite: `file_path: "config.json"`, `content: "{...}"`, `overwrite: false`
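For illustration, the subdirectory example above can be expressed as a single JSON payload, in the same shape as the MultiEdit example later in this document (values are illustrative):

```json
{
  "file_path": "logs/app.log",
  "content": "log entry",
  "create_dirs": true,
  "overwrite": true
}
```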
Search the web using DuckDuckGo or Google search engines to find information.
Fetch content from whitelisted URLs or GitHub references (e.g., example.com).
Interact with GitHub API to fetch issues, pull requests, create/update comments, and create pull requests with authentication support. This is a standalone tool separate from WebFetch.
Parameters:
- `owner` (required): Repository owner (username or organization)
- `repo` (required): Repository name
- `resource` (optional): Resource type to fetch or create (default: "issue")
  - `issue`: Fetch a specific issue
  - `issues`: Fetch a list of issues
  - `pull_request`: Fetch a specific pull request
  - `comments`: Fetch comments for an issue/PR
  - `create_comment`: Create a comment on an issue/PR
  - `update_comment`: Update an existing comment
  - `create_pull_request`: Create a new pull request
- `issue_number` (required for issue/pull_request/comments/create_comment): Issue or PR number
- `comment_id` (required for update_comment): Comment ID to update
- `comment_body` (required for create_comment and update_comment): Comment body text
- `title` (required for create_pull_request): Pull request title
- `body` (optional for create_pull_request): Pull request body/description
- `head` (required for create_pull_request): Head branch name
- `base` (optional for create_pull_request): Base branch name (default: "main")
- `state` (optional): Filter by state for the issues list ("open", "closed", "all"; default: "open")
- `per_page` (optional): Number of items per page for lists (1-100, default: 30)
Features:
- GitHub API Integration: Direct access to GitHub's REST API v3
- Authentication: Supports GitHub personal access tokens via environment variables
- Multiple Resources: Fetch issues, pull requests, comments, and create new content
- Structured Data: Returns properly typed GitHub data structures
- Error Handling: Comprehensive error handling with GitHub API error messages
- Rate Limiting: Respects GitHub API rate limits
- Security: Configurable timeout and response size limits
- Environment Variables: Supports token resolution via the `%GITHUB_TOKEN%` syntax
- Security Controls: Owner validation for secure repository access
Configuration:
tools:
github:
enabled: true
token: "%GITHUB_TOKEN%" # Environment variable reference
base_url: "https://api.github.com"
owner: "your-username" # Default owner for security
repo: "your-repo" # Default repository (optional)
safety:
max_size: 1048576 # 1MB
timeout: 30 # 30 seconds
    require_approval: false

Examples:
- Fetch specific issue: `owner: "octocat", repo: "Hello-World", resource: "issue", issue_number: 1`
- List open issues: `owner: "octocat", repo: "Hello-World", resource: "issues", state: "open", per_page: 10`
- Fetch pull request: `owner: "octocat", repo: "Hello-World", resource: "pull_request", issue_number: 5`
- Get issue comments: `owner: "octocat", repo: "Hello-World", resource: "comments", issue_number: 1`
- Create comment: `owner: "octocat", repo: "Hello-World", resource: "create_comment", issue_number: 1, comment_body: "Great work!"`
- Update comment: `owner: "octocat", repo: "Hello-World", resource: "update_comment", comment_id: 12345, comment_body: "Updated: Great work with improvements!"`
- Create pull request: `owner: "octocat", repo: "Hello-World", resource: "create_pull_request", title: "Add feature", body: "New feature implementation", head: "feature-branch", base: "main"`
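As a sketch, the create-pull-request example above rendered as a full JSON payload (same shape as the MultiEdit example later in this document; values are illustrative):

```json
{
  "owner": "octocat",
  "repo": "Hello-World",
  "resource": "create_pull_request",
  "title": "Add feature",
  "body": "New feature implementation",
  "head": "feature-branch",
  "base": "main"
}
```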
Delete files or directories from the filesystem with security controls. Supports wildcard patterns for batch operations.
Parameters:
- `path` (required): The path to the file or directory to delete
- `recursive` (optional): Whether to delete directories recursively (default: false)
- `force` (optional): Whether to force deletion (ignore non-existent files; default: false)
Features:
- Wildcard Support: Delete multiple files using patterns like `*.txt` or `temp/*`
- Recursive Deletion: Remove directories and their contents
- Safety Controls: Respects configured path exclusions and security restrictions
- Validation: Validates file paths and permissions before deletion
Security:
- Approval Required: Delete operations require approval by default
- Path Exclusions: Respects configured excluded paths for security
- Pattern Matching: Supports glob patterns for path exclusions
- Validation: Validates file paths and prevents deletion of protected directories
Examples:
- Delete single file: `path: "temp.txt"`
- Delete directory recursively: `path: "temp/", recursive: true`
- Delete with wildcard: `path: "*.log"`
- Force delete: `path: "missing.txt", force: true`
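The recursive-deletion example above as a JSON payload (illustrative values, same shape as the MultiEdit example later in this document):

```json
{
  "path": "temp/",
  "recursive": true,
  "force": false
}
```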
Perform exact string replacements in files with security validation and preview support.
Parameters:
- `file_path` (required): The path to the file to modify
- `old_string` (required): The text to replace (must match exactly)
- `new_string` (required): The text to replace it with
- `replace_all` (optional): Replace all occurrences of old_string (default: false)
Features:
- Exact Matching: Requires exact string matches for safety
- Preview Support: Shows diff preview before applying changes
- Atomic Operations: Either all changes succeed or none are applied
- Security Validation: Respects path exclusions and file permissions
Security:
- Read Tool Requirement: Requires Read tool to be used first on the file
- Approval Required: Edit operations require approval by default
- Path Exclusions: Respects configured excluded paths
- Validation: Validates file paths and prevents editing protected files
Examples:
- Single replacement: `file_path: "config.txt", old_string: "port: 3000", new_string: "port: 8080"`
- Replace all occurrences: `file_path: "script.py", old_string: "print", new_string: "logging.info", replace_all: true`
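The single-replacement example above as a JSON payload (illustrative values, in the same shape as the MultiEdit example below):

```json
{
  "file_path": "config.txt",
  "old_string": "port: 3000",
  "new_string": "port: 8080",
  "replace_all": false
}
```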
Make multiple edits to a single file in atomic operations. All edits succeed or none are applied.
Parameters:
- `file_path` (required): The path to the file to modify
- `edits` (required): Array of edit operations to perform sequentially
  - `old_string`: The text to replace (must match exactly)
  - `new_string`: The text to replace it with
  - `replace_all` (optional): Replace all occurrences (default: false)
Features:
- Atomic Operations: All edits succeed or none are applied
- Sequential Processing: Edits are applied in the order provided
- Preview Support: Shows comprehensive diff preview
- Security Validation: Respects all security restrictions
Security:
- Read Tool Requirement: Requires Read tool to be used first on the file
- Approval Required: MultiEdit operations require approval by default
- Path Exclusions: Respects configured excluded paths
- Validation: Validates all edits before execution
Examples:
{
"file_path": "config.yaml",
"edits": [
{
"old_string": "port: 3000",
"new_string": "port: 8080"
},
{
"old_string": "debug: true",
"new_string": "debug: false"
}
]
}

A powerful search tool with a configurable backend (ripgrep or a Go implementation).
Parameters:
- `pattern` (required): The regular expression pattern to search for
- `path` (optional): File or directory to search in (default: current directory)
- `output_mode` (optional): Output mode - "content", "files_with_matches", or "count" (default: "files_with_matches")
- `-i` (optional): Case insensitive search
- `-n` (optional): Show line numbers in output
- `-A` (optional): Number of lines to show after each match
- `-B` (optional): Number of lines to show before each match
- `-C` (optional): Number of lines to show before and after each match
- `glob` (optional): Glob pattern to filter files (e.g., "*.js", "*.{ts,tsx}")
- `type` (optional): File type to search (e.g., "js", "py", "rust")
- `multiline` (optional): Enable multiline mode where patterns can span lines
- `head_limit` (optional): Limit output to the first N results
Features:
- Dual Backend: Uses ripgrep when available for optimal performance, falls back to Go implementation
- Full Regex Support: Supports complete regex syntax
- Multiple Output Modes: Content matching, file lists, or count results
- Context Lines: Show lines before and after matches
- File Filtering: Filter by glob patterns or file types
- Multiline Matching: Patterns can span multiple lines
- Automatic Exclusions: Automatically excludes common directories and files (.git, node_modules, .infer, etc.)
- Gitignore Support: Respects .gitignore patterns in your repository
- User-Configurable Exclusions: Additional exclusion patterns can be configured by users (not by the LLM)
Security & Exclusions:
- Path Exclusions: Respects configured excluded paths and patterns
- Automatic Exclusions: The tool automatically excludes:
- Version control directories (.git, .svn, etc.)
- Dependency directories (node_modules, vendor, etc.)
- Build artifacts (dist, build, target, etc.)
- Cache and temp files (.cache, *.tmp, *.log, etc.)
- Security-sensitive files (.env, secrets, etc.)
- Gitignore Integration: Automatically reads and respects .gitignore patterns
- Validation: Validates search patterns and file access
- Performance Limits: Configurable result limits to prevent overwhelming output
Examples:
- Basic search: `pattern: "error", output_mode: "content"`
- Case insensitive: `pattern: "TODO", -i: true, output_mode: "content"`
- With context: `pattern: "function", -C: 3, output_mode: "content"`
- File filtering: `pattern: "interface", glob: "*.go", output_mode: "files_with_matches"`
- Count results: `pattern: "log.*Error", output_mode: "count"`
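For illustration, the "with context" example above could be passed as a JSON arguments object (the exact tool-call envelope used by the gateway is an assumption; only the parameter names come from the list above):

```json
{
  "pattern": "function",
  "-C": 3,
  "output_mode": "content"
}
```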
Create and manage structured task lists for LLM-assisted development workflows.
Parameters:
- `todos` (required): Array of todo items with status tracking
  - `id` (required): Unique identifier for the task
  - `content` (required): Task description
  - `status` (required): Task status - "pending", "in_progress", or "completed"
Features:
- Structured Task Management: Organized task tracking with status
- Real-time Updates: Mark tasks as in_progress/completed during execution
- Progress Tracking: Visual representation of task completion
- LLM Integration: Designed for LLM-assisted development workflows
Security:
- No File System Access: Pure memory-based operation
- Validation: Validates todo structure and status values
- Size Limits: Configurable limits on todo list size
Examples:
{
"todos": [
{
"id": "1",
"content": "Update README with new tool documentation",
"status": "in_progress"
},
{
"id": "2",
"content": "Add test cases for new features",
"status": "pending"
}
]
}The A2A (Agent-to-Agent) tools enable communication between the CLI client and specialized A2A server agents, allowing for task delegation, distributed processing, and agent coordination.
📖 For detailed configuration instructions, see A2A Agents Configuration Guide
Core A2A Tools:
Submit tasks to specialized A2A agents for distributed processing.
Parameters:
- `agent_url` (required): URL of the A2A agent server
- `task_description` (required): Description of the task to perform
- `metadata` (optional): Additional task metadata as key-value pairs
Features:
- Task Delegation: Submit complex tasks to specialized agents
- Streaming Responses: Real-time task execution updates
- Metadata Support: Include contextual information with tasks
- Task Tracking: Automatic tracking of submitted tasks with IDs
- Error Handling: Comprehensive error reporting and retry logic
Examples:
- Code analysis: `agent_url: "http://security-agent:8080", task_description: "Analyze codebase for security vulnerabilities"`
- Documentation: `agent_url: "http://docs-agent:8080", task_description: "Generate API documentation"`
- Testing: `agent_url: "http://test-agent:8080", task_description: "Create unit tests for UserService class"`
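As a sketch, the code-analysis example above as a JSON arguments object (field names follow the parameters listed; the `metadata` keys shown are hypothetical):

```json
{
  "agent_url": "http://security-agent:8080",
  "task_description": "Analyze codebase for security vulnerabilities",
  "metadata": {
    "priority": "high"
  }
}
```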
Retrieve agent capabilities and metadata for discovery and validation.
Parameters:
- `agent_url` (required): URL of the A2A agent to query
Features:
- Agent Discovery: Query agent capabilities and supported task types
- Health Checks: Verify agent availability and status
- Metadata Retrieval: Get agent configuration and feature information
- Connection Validation: Test connectivity before task submission
Examples:
- Capability check: `agent_url: "http://agent:8080"` - Returns agent card with available features
- Health status: Query agent before submitting critical tasks
Query the status and results of previously submitted tasks.
Parameters:
- `agent_url` (required): URL of the A2A agent server
- `context_id` (required): Context ID for the task
- `task_id` (required): ID of the task to query
Features:
- Status Monitoring: Check task completion status and progress
- Result Retrieval: Access task outputs and generated content
- Error Diagnostics: Get detailed error information for failed tasks
- Artifact Discovery: List available artifacts from completed tasks
Examples:
- Status check: `agent_url: "http://agent:8080", context_id: "ctx-123", task_id: "task-456"`
- Result access: Retrieve task outputs and completion details
Download artifacts and files generated by completed A2A tasks.
Parameters:
- `agent_url` (required): URL of the A2A agent server
- `context_id` (required): Context ID for the task
- `task_id` (required): ID of the completed task
Features:
- Artifact Download: Download files, reports, and outputs from completed tasks
- Configurable Directory: Downloads to configurable directory (default: `./downloads`)
- Progress Tracking: Track download status per artifact
- Validation: Ensures task completion before attempting downloads
- HTTP Client: Direct HTTP download with timeout and error handling
Security:
- Task Validation: Verifies task completion status before downloads
- Path Safety: Downloads to safe, configurable directories
- Timeout Protection: 30-second timeout for download operations
- Error Handling: Comprehensive error reporting for failed downloads
Examples:
- Download results: `agent_url: "http://agent:8080", context_id: "ctx-123", task_id: "task-456"`
- Retrieve generated files: Access documentation, reports, or analysis outputs
A2A Workflow Example:
1. Query agent capabilities: A2A_QueryAgent
2. Submit task for processing: A2A_SubmitTask
3. Monitor task progress: A2A_QueryTask
4. Download completed artifacts: A2A_DownloadArtifacts
Configuration:
A2A tools are configured in the tools section:
a2a:
enabled: true
tools:
submit_task:
enabled: true
query_agent:
enabled: true
query_task:
enabled: true
download_artifacts:
enabled: true
      download_dir: "./downloads" # Configurable download directory

Use Cases:
- Code Analysis: Submit codebases to security or quality analysis agents
- Documentation Generation: Generate API docs, README files, or technical documentation
- Testing: Create comprehensive test suites with specialized testing agents
- Data Processing: Process large datasets with specialized data analysis agents
- Content Creation: Generate content with specialized writing or design agents
For detailed A2A documentation and examples, see docs/a2a-connections.md.
Security Notes:
- All tools respect configured safety settings and exclusion patterns
- Commands require approval when safety approval is enabled
- File access is restricted to allowed paths and excludes sensitive directories
The CLI uses a powerful 2-layer configuration system built on Viper, supporting multiple configuration sources with proper precedence handling.
- Userspace Configuration (`~/.infer/config.yaml`)
  - Global configuration for the user across all projects
  - Used as a fallback when no project-level configuration exists
  - Can be created with: `infer init --userspace` or `infer config init --userspace`
- Project Configuration (`.infer/config.yaml` in current directory)
  - Project-specific configuration that takes precedence over userspace config
  - Default location for most commands
  - Can be created with: `infer init` or `infer config init`
Configuration values are resolved in the following order (highest to lowest priority):
1. Environment Variables (`INFER_*` prefix) - Highest Priority
2. Command Line Flags
3. Project Config (`.infer/config.yaml`)
4. Userspace Config (`~/.infer/config.yaml`)
5. Built-in Defaults - Lowest Priority
Example: If your userspace config sets agent.model: "anthropic/claude-4" and your project config sets
agent.model: "deepseek/deepseek-chat", the project config wins. However, if you also set
INFER_AGENT_MODEL="openai/gpt-4", the environment variable takes precedence over both config files.
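The resolution order can be pictured as nested fallbacks. This shell snippet only simulates the behavior for the example above; the CLI itself implements precedence internally via Viper:

```shell
unset INFER_AGENT_MODEL                   # clean slate for the demo
userspace_model="anthropic/claude-4"      # from ~/.infer/config.yaml
project_model="deepseek/deepseek-chat"    # from .infer/config.yaml

# env var wins if set, else project config, else userspace config
resolved="${INFER_AGENT_MODEL:-${project_model:-$userspace_model}}"
echo "$resolved"   # → deepseek/deepseek-chat
```

Exporting `INFER_AGENT_MODEL="openai/gpt-4"` before re-evaluating the fallback chain would make it resolve to `openai/gpt-4` instead.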
# Create userspace configuration (global fallback)
infer init --userspace
# Create project configuration (takes precedence)
infer init
# Both configurations will be automatically merged when commands are run

You can also specify a custom config file using the `--config` flag, which overrides the automatic 2-layer loading.
gateway:
url: http://localhost:8080
api_key: ""
timeout: 200
oci: ghcr.io/inference-gateway/inference-gateway:latest # OCI image for Docker mode
run: true # Automatically run the gateway (enabled by default)
docker: true # Use Docker mode by default (set to false for binary mode)
include_models: [] # Optional: only allow specific models (allowlist)
exclude_models:
- ollama_cloud/cogito-2.1:671b
- ollama_cloud/kimi-k2:1t
- ollama_cloud/kimi-k2-thinking
- ollama_cloud/deepseek-v3.1:671b # Block specific models by default
client:
timeout: 200
retry:
enabled: true
max_attempts: 3
initial_backoff_sec: 5
max_backoff_sec: 60
backoff_multiplier: 2
retryable_status_codes: [400, 408, 429, 500, 502, 503, 504]
logging:
debug: false
tools:
enabled: true # Tools are enabled by default with safe read-only commands
sandbox:
directories: [".", "/tmp"] # Allowed directories for tool operations
protected_paths: # Paths excluded from tool access for security
- .infer/
- .git/
- "*.env" # quoted: a bare leading * is a YAML alias and would fail to parse
bash:
enabled: true
whitelist:
commands: # Exact command matches
- ls
- pwd
- echo
- wc
- sort
- uniq
- gh
- task
- docker ps
- kubectl get pods
patterns: # Regex patterns for more complex commands
- ^git branch( --show-current)?$
- ^git checkout -b [a-zA-Z0-9/_-]+( [a-zA-Z0-9/_-]+)?$
- ^git checkout [a-zA-Z0-9/_-]+
- ^git add [a-zA-Z0-9/_.-]+
- ^git diff+
- ^git remote -v$
- ^git status$
- ^git log --oneline -n [0-9]+$
- ^git commit -m ".+"$
- ^git push( --set-upstream)?( origin)?( [a-zA-Z0-9/_-]+)?$
read:
enabled: true
require_approval: false
write:
enabled: true
require_approval: true # Write operations require approval by default for security
edit:
enabled: true
require_approval: true # Edit operations require approval by default for security
delete:
enabled: true
require_approval: true # Delete operations require approval by default for security
grep:
enabled: true
backend: auto # "auto", "ripgrep", or "go"
require_approval: false
tree:
enabled: true
require_approval: false
web_fetch:
enabled: true
whitelisted_domains:
- golang.org
safety:
max_size: 8192 # 8KB
timeout: 30 # 30 seconds
allow_redirect: true
cache:
enabled: true
ttl: 3600 # 1 hour
max_size: 52428800 # 50MB
web_search:
enabled: true
default_engine: duckduckgo
max_results: 10
engines:
- duckduckgo
- google
timeout: 10
todo_write:
enabled: true
require_approval: false
github:
enabled: true
token: "%GITHUB_TOKEN%"
base_url: "https://api.github.com"
owner: ""
safety:
max_size: 1048576 # 1MB
timeout: 30 # 30 seconds
require_approval: false
safety:
require_approval: true
agent:
model: "" # Default model for agent operations
system_prompt: | # System prompt for agent sessions
Autonomous software engineering agent. Execute tasks iteratively until completion.
IMPORTANT: You NEVER push to main or master or to the current branch - instead you create a branch and push to a branch.
IMPORTANT: You NEVER read all the README.md - start by reading 300 lines
RULES:
- Security: Defensive only (analysis, detection, docs)
- Style: no emojis/comments unless asked, use conventional commits
- Code: Follow existing patterns, check deps, no secrets
- Tasks: Use TodoWrite, mark progress immediately
- Chat exports: Read only "## Summary" to "---" section
- Tools: Batch calls, prefer Grep for search
WORKFLOW:
When asked to implement features or fix issues:
1. Plan with TodoWrite
2. Search codebase to understand context
3. Implement solution
4. Run tests with: task test
5. Run lint/format with: task fmt and task lint
6. Commit changes (only if explicitly asked)
7. Create a pull request (only if explicitly asked)
system_reminders:
enabled: true
interval: 4
reminder_text: |
System reminder text for maintaining context
verbose_tools: false
max_turns: 50 # Maximum number of turns for agent sessions
max_tokens: 4096 # The maximum number of tokens that can be generated per request
max_concurrent_tools: 5 # Maximum concurrent tool executions
chat:
theme: tokyo-night
compact:
enabled: false # Enable automatic conversation compaction
    auto_at: 80 # Compact when context reaches this percentage (20-100)

Gateway Settings:
- `gateway.url`: The URL of the inference gateway (default: `http://localhost:8080`)
- `gateway.api_key`: API key for authentication (if required)
- `gateway.timeout`: Request timeout in seconds (default: 200)
- `gateway.run`: Automatically run the gateway on startup (default: `true`)
  - When enabled, the CLI automatically starts the gateway before running commands
  - The gateway runs in the background and shuts down when the CLI exits
- `gateway.docker`: Use Docker instead of binary mode (default: `true`)
  - `true` (default): Uses Docker to run the gateway container (requires Docker installed)
  - `false`: Downloads and runs the gateway as a binary (no Docker required)
- `gateway.oci`: OCI image to use for Docker mode (default: `ghcr.io/inference-gateway/inference-gateway:latest`)
- `gateway.include_models`: Only allow specific models (allowlist approach; default: `[]`, allows all models)
  - When set, only the specified models will be allowed by the gateway
  - Example: `["deepseek/deepseek-reasoner", "deepseek/deepseek-chat"]`
  - This is passed to the gateway as the `ALLOWED_MODELS` environment variable
- `gateway.exclude_models`: Block specific models (blocklist approach; default: `[]`, blocks none)
  - When set, all models are allowed except those in the list
  - Example: `["openai/gpt-4", "anthropic/claude-4-opus"]`
  - This is passed to the gateway as the `DISALLOWED_MODELS` environment variable
  - Note: `include_models` and `exclude_models` can be used together - the gateway will apply both filters
Client Settings:
- client.timeout: HTTP client timeout in seconds
- client.retry.enabled: Enable automatic retries for failed requests
- client.retry.max_attempts: Maximum number of retry attempts
- client.retry.initial_backoff_sec: Initial delay between retries in seconds
- client.retry.max_backoff_sec: Maximum delay between retries in seconds
- client.retry.backoff_multiplier: Backoff multiplier for exponential delay
- client.retry.retryable_status_codes: HTTP status codes that trigger retries (e.g., [400, 408, 429, 500, 502, 503, 504])
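With the defaults above (initial backoff 5s, multiplier 2, max 60s, 3 attempts), the delays work out to 5s, 10s, and 20s. A small shell sketch of the schedule — illustrative only, not the CLI's actual retry code:

```shell
backoff=5; multiplier=2; max=60
for attempt in 1 2 3; do
  echo "attempt $attempt failed -> retrying in ${backoff}s"
  # double the delay, capped at max_backoff_sec
  backoff=$((backoff * multiplier))
  if [ "$backoff" -gt "$max" ]; then backoff=$max; fi
done
```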
Logging Settings:
- logging.debug: Enable debug logging for verbose output
Tool Settings:
- tools.enabled: Enable/disable tool execution for LLMs (default: true)
- tools.sandbox.directories: Allowed directories for tool operations (default: [".", "/tmp"])
- tools.sandbox.protected_paths: Paths excluded from tool access for security (default: [".infer/", ".git/", "*.env"])
- tools.whitelist.commands: List of allowed commands (supports arguments)
- tools.whitelist.patterns: Regex patterns for complex command validation
- tools.safety.require_approval: Prompt user before executing any command (default: true)
- Individual tool settings: Each tool (Bash, Read, Write, Edit, Delete, Grep, Tree, WebFetch, WebSearch, TodoWrite) has:
- enabled: Enable/disable the specific tool
- require_approval: Override global safety setting for this tool (optional)
Compact Settings:
- compact.enabled: Enable automatic conversation compaction to reduce token usage (default: false)
- compact.auto_at: Percentage of context window (20-100) at which to automatically trigger compaction (default: 80)
Chat Settings:
- chat.default_model: Default model for chat sessions (skips model selection when set)
- chat.system_prompt: System prompt included with every chat session
- chat.system_reminders.enabled: Enable/disable system reminders (default: true)
- chat.system_reminders.interval: Number of messages between reminders (default: 10)
- chat.system_reminders.text: Custom reminder text to provide contextual guidance
Agent Settings:
- agent.model: Default model for agent operations
- agent.system_prompt: System prompt for agent sessions
- agent.system_reminders.enabled: Enable system reminders during agent sessions
- agent.system_reminders.interval: Number of messages between reminders (default: 4)
- agent.system_reminders.reminder_text: Custom reminder text for agent context
- agent.verbose_tools: Enable verbose tool output (default: false)
- agent.max_turns: Maximum number of turns for agent sessions (default: 50)
- agent.max_tokens: Maximum tokens per agent request (default: 8192)
- agent.max_concurrent_tools: Maximum number of tools that can execute concurrently (default: 5)
Web Search Settings:
- web_search.enabled: Enable/disable web search tool for LLMs (default: true)
- web_search.default_engine: Default search engine to use ("duckduckgo" or "google", default: "duckduckgo")
- web_search.max_results: Maximum number of search results to return (1-50, default: 10)
- web_search.engines: List of available search engines
- web_search.timeout: Search timeout in seconds (default: 10)
Chat Interface Settings:
- chat.theme: Chat interface theme name (default: "tokyo-night")
  - Available themes: `tokyo-night`, `github-light`, `dracula`
  - Can be changed during chat using the `/theme [theme-name]` shortcut
  - Affects colors and styling of the chat interface
Both search engines work out of the box, but for better reliability and performance in production, you can configure API keys:
Google Custom Search Engine:
1. Create a Custom Search Engine:
   - Go to Google Programmable Search Engine
   - Click "Add" to create a new search engine
   - Enter a name for your search engine
   - In "Sites to search", enter `*` to search the entire web
   - Click "Create"
2. Get your Search Engine ID:
   - In your search engine settings, note the "Search engine ID" (cx parameter)
3. Get a Google API Key:
   - Go to the Google Cloud Console
   - Create a new project or select an existing one
   - Enable the "Custom Search JSON API"
   - Go to "Credentials" and create an API key
   - Restrict the API key to the Custom Search JSON API for security
4. Configure Environment Variables:

   export GOOGLE_SEARCH_API_KEY="your_api_key_here"
   export GOOGLE_SEARCH_ENGINE_ID="your_search_engine_id_here"
DuckDuckGo API (Optional):
export DUCKDUCKGO_SEARCH_API_KEY="your_api_key_here"

Note: Both engines have built-in fallback methods that work without API configuration. However, using official APIs provides better reliability and performance for production use.
The CLI supports environment variable configuration with the INFER_ prefix. Environment variables override
configuration file settings and are particularly useful for containerized deployments and CI/CD environments.
All configuration fields can be set via environment variables by converting the YAML path to uppercase and replacing
dots (.) with underscores (_), then prefixing with INFER_.
Example: gateway.url → INFER_GATEWAY_URL, tools.bash.enabled → INFER_TOOLS_BASH_ENABLED
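The conversion rule can be scripted; this helper is illustrative only and not part of the CLI:

```shell
yaml_path="tools.bash.enabled"
# swap dots for underscores, uppercase, then add the INFER_ prefix
env_var="INFER_$(printf '%s' "$yaml_path" | tr '.' '_' | tr '[:lower:]' '[:upper:]')"
echo "$env_var"   # → INFER_TOOLS_BASH_ENABLED
```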
- `INFER_GATEWAY_URL`: Gateway URL (default: `http://localhost:8080`)
- `INFER_GATEWAY_API_KEY`: Gateway API key for authentication
- `INFER_GATEWAY_TIMEOUT`: Gateway request timeout in seconds (default: `200`)
- `INFER_GATEWAY_OCI`: OCI image for gateway (default: `ghcr.io/inference-gateway/inference-gateway:latest`)
- `INFER_GATEWAY_RUN`: Auto-run gateway if not running (default: `true`)
- `INFER_GATEWAY_DOCKER`: Use Docker to run gateway (default: `true`)
- `INFER_CLIENT_TIMEOUT`: HTTP client timeout in seconds (default: `200`)
- `INFER_CLIENT_RETRY_ENABLED`: Enable retry logic (default: `true`)
- `INFER_CLIENT_RETRY_MAX_ATTEMPTS`: Maximum retry attempts (default: `3`)
- `INFER_CLIENT_RETRY_INITIAL_BACKOFF_SEC`: Initial backoff delay in seconds (default: `5`)
- `INFER_CLIENT_RETRY_MAX_BACKOFF_SEC`: Maximum backoff delay in seconds (default: `60`)
- `INFER_CLIENT_RETRY_BACKOFF_MULTIPLIER`: Backoff multiplier (default: `2`)
- `INFER_LOGGING_DEBUG`: Enable debug logging (default: `false`)
- `INFER_LOGGING_DIR`: Log directory path (default: `.infer/logs`)
- `INFER_AGENT_MODEL`: Default model for agent operations (e.g., `deepseek/deepseek-chat`)
- `INFER_AGENT_SYSTEM_PROMPT`: Custom system prompt for agent
- `INFER_AGENT_SYSTEM_PROMPT_PLAN`: Custom system prompt for plan mode
- `INFER_AGENT_VERBOSE_TOOLS`: Enable verbose tool output (default: `false`)
- `INFER_AGENT_MAX_TURNS`: Maximum agent turns (default: `100`)
- `INFER_AGENT_MAX_TOKENS`: Maximum tokens per response (default: `8192`)
- `INFER_AGENT_MAX_CONCURRENT_TOOLS`: Maximum concurrent tool executions (default: `5`)
- `INFER_CHAT_THEME`: Chat UI theme (`light`, `dark`, `dracula`, `nord`, `solarized`, default: `dark`)
- `INFER_TOOLS_ENABLED`: Enable/disable all local tools (default: `true`)
Individual Tool Enablement:
- `INFER_TOOLS_BASH_ENABLED`: Enable/disable Bash tool (default: `true`)
- `INFER_TOOLS_READ_ENABLED`: Enable/disable Read tool (default: `true`)
- `INFER_TOOLS_WRITE_ENABLED`: Enable/disable Write tool (default: `true`)
- `INFER_TOOLS_EDIT_ENABLED`: Enable/disable Edit tool (default: `true`)
- `INFER_TOOLS_DELETE_ENABLED`: Enable/disable Delete tool (default: `true`)
- `INFER_TOOLS_GREP_ENABLED`: Enable/disable Grep tool (default: `true`)
- `INFER_TOOLS_TREE_ENABLED`: Enable/disable Tree tool (default: `true`)
- `INFER_TOOLS_WEB_FETCH_ENABLED`: Enable/disable WebFetch tool (default: `true`)
- `INFER_TOOLS_WEB_SEARCH_ENABLED`: Enable/disable WebSearch tool (default: `true`)
- `INFER_TOOLS_GITHUB_ENABLED`: Enable/disable Github tool (default: `true`)
- `INFER_TOOLS_TODO_WRITE_ENABLED`: Enable/disable TodoWrite tool (default: `true`)
Tool Approval Configuration:
- `INFER_TOOLS_BASH_REQUIRE_APPROVAL`: Require approval for Bash tool (default: `false`)
- `INFER_TOOLS_WRITE_REQUIRE_APPROVAL`: Require approval for Write tool (default: `true`)
- `INFER_TOOLS_EDIT_REQUIRE_APPROVAL`: Require approval for Edit tool (default: `true`)
- `INFER_TOOLS_DELETE_REQUIRE_APPROVAL`: Require approval for Delete tool (default: `true`)
Bash Tool Whitelist Configuration:
The Bash tool supports whitelisting commands and patterns for security. These environment variables accept comma-separated or newline-separated values:
- `INFER_TOOLS_BASH_WHITELIST_COMMANDS`: Comma-separated list of whitelisted commands
- `INFER_TOOLS_BASH_WHITELIST_PATTERNS`: Comma-separated list of regex patterns for whitelisted commands
Examples:
# Whitelist specific commands
export INFER_TOOLS_BASH_WHITELIST_COMMANDS="gh,git,npm,task,make"
# Whitelist command patterns (regex)
export INFER_TOOLS_BASH_WHITELIST_PATTERNS="^gh .*,^git .*,^npm .*,^task .*"
# Combined example for GitHub Actions
export INFER_TOOLS_BASH_WHITELIST_COMMANDS="gh,git,npm"
export INFER_TOOLS_BASH_WHITELIST_PATTERNS="^gh .*,^git .*,^npm (install|test|run).*"

Grep Tool Configuration:
- `INFER_TOOLS_GREP_BACKEND`: Grep backend to use (`ripgrep` or `grep`, default: `ripgrep`)
WebSearch Tool Configuration:
- `INFER_TOOLS_WEB_SEARCH_DEFAULT_ENGINE`: Default search engine (`duckduckgo` or `google`, default: `duckduckgo`)
- `INFER_TOOLS_WEB_SEARCH_MAX_RESULTS`: Maximum search results (default: `10`)
- `INFER_TOOLS_WEB_SEARCH_TIMEOUT`: Search timeout in seconds (default: `30`)
WebFetch Tool Configuration:
- `INFER_TOOLS_WEB_FETCH_SAFETY_MAX_SIZE`: Maximum fetch size in bytes (default: `10485760`)
- `INFER_TOOLS_WEB_FETCH_SAFETY_TIMEOUT`: Fetch timeout in seconds (default: `30`)
- `INFER_TOOLS_WEB_FETCH_SAFETY_ALLOW_REDIRECT`: Allow HTTP redirects (default: `true`)
- `INFER_TOOLS_WEB_FETCH_CACHE_ENABLED`: Enable fetch caching (default: `true`)
- `INFER_TOOLS_WEB_FETCH_CACHE_TTL`: Cache TTL in seconds (default: `900`)
- `INFER_TOOLS_WEB_FETCH_CACHE_MAX_SIZE`: Maximum cache size in bytes (default: `104857600`)
GitHub Tool Configuration:
- `INFER_TOOLS_GITHUB_TOKEN`: GitHub personal access token
- `INFER_TOOLS_GITHUB_BASE_URL`: GitHub API base URL (default: `https://api.github.com`)
- `INFER_TOOLS_GITHUB_OWNER`: Default GitHub owner/organization
- `INFER_TOOLS_GITHUB_REPO`: Default GitHub repository
- `INFER_TOOLS_GITHUB_SAFETY_MAX_SIZE`: Maximum GitHub file size in bytes (default: `10485760`)
- `INFER_TOOLS_GITHUB_SAFETY_TIMEOUT`: GitHub API timeout in seconds (default: `30`)
Sandbox Configuration:
- `INFER_TOOLS_SANDBOX_DIRECTORIES`: Comma-separated list of allowed directories (default: `.,/tmp`)
- `INFER_STORAGE_ENABLED`: Enable conversation storage (default: `true`)
- `INFER_STORAGE_TYPE`: Storage backend type (`memory`, `sqlite`, `postgres`, `redis`, default: `sqlite`)
SQLite Storage:
- `INFER_STORAGE_SQLITE_PATH`: SQLite database path (default: `.infer/conversations.db`)
PostgreSQL Storage:
- `INFER_STORAGE_POSTGRES_HOST`: PostgreSQL host
- `INFER_STORAGE_POSTGRES_PORT`: PostgreSQL port (default: `5432`)
- `INFER_STORAGE_POSTGRES_DATABASE`: PostgreSQL database name
- `INFER_STORAGE_POSTGRES_USERNAME`: PostgreSQL username
- `INFER_STORAGE_POSTGRES_PASSWORD`: PostgreSQL password
- `INFER_STORAGE_POSTGRES_SSL_MODE`: PostgreSQL SSL mode (default: `disable`)
Redis Storage:
- `INFER_STORAGE_REDIS_HOST`: Redis host
- `INFER_STORAGE_REDIS_PORT`: Redis port (default: `6379`)
- `INFER_STORAGE_REDIS_PASSWORD`: Redis password
- `INFER_STORAGE_REDIS_DB`: Redis database number (default: `0`)
- `INFER_CONVERSATION_TITLE_GENERATION_ENABLED`: Enable AI-powered title generation (default: `true`)
- `INFER_CONVERSATION_TITLE_GENERATION_MODEL`: Model for title generation (default: `anthropic/claude-4.1-haiku`)
- `INFER_CONVERSATION_TITLE_GENERATION_BATCH_SIZE`: Batch size for title generation (default: `5`)
- `INFER_CONVERSATION_TITLE_GENERATION_INTERVAL`: Interval in seconds between title generation attempts (default: `30`)
- `INFER_A2A_ENABLED`: Enable/disable A2A tools (default: `true`)
- `INFER_A2A_AGENTS`: Configure A2A agent endpoints (supports comma-separated or newline-separated format)
A2A Agents Configuration Examples:
# Comma-separated format
export INFER_A2A_AGENTS="http://agent1:8080,http://agent2:8080,http://agent3:8080"
# Newline-separated format (useful in docker-compose)
export INFER_A2A_AGENTS="
http://google-calendar-agent:8080
http://n8n-agent:8080
http://documentation-agent:8080
http://browser-agent:8080
"

A2A Cache Configuration:
- `INFER_A2A_CACHE_ENABLED`: Enable/disable A2A agent card caching (default: `true`)
- `INFER_A2A_CACHE_TTL`: Cache TTL in seconds for A2A agent cards (default: `300`)
A2A Task Configuration:
- `INFER_A2A_TASK_STATUS_POLL_SECONDS`: Status polling interval in seconds (default: `10`)
- `INFER_A2A_TASK_POLLING_STRATEGY`: Polling strategy (`fixed` or `exponential`, default: `exponential`)
- `INFER_A2A_TASK_INITIAL_POLL_INTERVAL_SEC`: Initial polling interval for exponential strategy (default: `2`)
- `INFER_A2A_TASK_MAX_POLL_INTERVAL_SEC`: Maximum polling interval for exponential strategy (default: `30`)
- `INFER_A2A_TASK_BACKOFF_MULTIPLIER`: Backoff multiplier for exponential strategy (default: `1.5`)
- `INFER_A2A_TASK_BACKGROUND_MONITORING`: Enable background task monitoring (default: `true`)
- `INFER_A2A_TASK_COMPLETED_TASK_RETENTION`: Completed task retention in seconds (default: `3600`)
A2A Individual Tool Configuration:
- `INFER_A2A_TOOLS_SUBMIT_TASK_ENABLED`: Enable/disable A2A SubmitTask tool (default: `true`)
- `INFER_A2A_TOOLS_SUBMIT_TASK_REQUIRE_APPROVAL`: Require approval for SubmitTask (default: `false`)
- `INFER_A2A_TOOLS_QUERY_AGENT_ENABLED`: Enable/disable A2A QueryAgent tool (default: `true`)
- `INFER_A2A_TOOLS_QUERY_AGENT_REQUIRE_APPROVAL`: Require approval for QueryAgent (default: `false`)
- `INFER_A2A_TOOLS_QUERY_TASK_ENABLED`: Enable/disable A2A QueryTask tool (default: `true`)
- `INFER_A2A_TOOLS_QUERY_TASK_REQUIRE_APPROVAL`: Require approval for QueryTask (default: `false`)
- `INFER_A2A_TOOLS_DOWNLOAD_ARTIFACTS_ENABLED`: Enable/disable A2A DownloadArtifacts tool (default: `true`)
- `INFER_A2A_TOOLS_DOWNLOAD_ARTIFACTS_REQUIRE_APPROVAL`: Require approval for DownloadArtifacts (default: `false`)
- `INFER_A2A_TOOLS_DOWNLOAD_ARTIFACTS_DOWNLOAD_DIR`: Directory for downloading A2A task artifacts (default: `/tmp/downloads`)
- `INFER_A2A_TOOLS_DOWNLOAD_ARTIFACTS_TIMEOUT_SECONDS`: Timeout for downloading artifacts in seconds (default: `30`)
- `INFER_EXPORT_OUTPUT_DIR`: Output directory for exported conversations (default: `./exports`)
- `INFER_EXPORT_SUMMARY_MODEL`: Model for generating export summaries (default: `anthropic/claude-4.1-haiku`)
- `INFER_COMPACT_ENABLED`: Enable automatic conversation compaction (default: `false`)
- `INFER_COMPACT_AUTO_AT`: Auto-compact after N messages (default: `100`)
- `INFER_GIT_COMMIT_MESSAGE_MODEL`: Model for AI-generated commit messages (default: `deepseek/deepseek-chat`)
- `INFER_SCM_PR_CREATE_BASE_BRANCH`: Base branch for PR creation (default: `main`)
- `INFER_SCM_PR_CREATE_BRANCH_PREFIX`: Branch prefix for PR creation (default: `feature/`)
- `INFER_SCM_PR_CREATE_MODEL`: Model for PR creation (default: `deepseek/deepseek-chat`)
- `INFER_SCM_CLEANUP_RETURN_TO_BASE`: Return to base branch after PR creation (default: `true`)
- `INFER_SCM_CLEANUP_DELETE_LOCAL_BRANCH`: Delete local branch after PR creation (default: `false`)
Configuration values support environment variable substitution using the %VAR_NAME% syntax:
gateway:
api_key: "%INFER_API_KEY%"
tools:
github:
    token: "%GITHUB_TOKEN%"

This allows sensitive values to be stored as environment variables while keeping them out of configuration files.
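The substitution behaves like simple template expansion at config-load time; this one-liner only illustrates the semantics (the CLI's actual implementation is internal, and the token value is hypothetical):

```shell
GITHUB_TOKEN="ghp_example123"               # hypothetical token value
template='token: "%GITHUB_TOKEN%"'
# replace the %VAR_NAME% placeholder with the environment variable's value
resolved=$(printf '%s' "$template" | sed "s/%GITHUB_TOKEN%/$GITHUB_TOKEN/")
echo "$resolved"   # → token: "ghp_example123"
```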
Security:
- Never commit sensitive data (API keys, tokens) to configuration files
- Use environment variable substitution (`%VAR_NAME%`) for sensitive values
- Use environment variables (`INFER_*`) for CI/CD environments
Organization:
- Use project config (`.infer/config.yaml`) for project-specific settings
- Use userspace config (`~/.infer/config.yaml`) for personal preferences
- Commit project configs to version control, exclude userspace configs
Example Workflow:
# 1. Setup userspace defaults
infer config --userspace agent set-model "deepseek/deepseek-chat"
infer config --userspace agent set-system "You are a helpful assistant"
# 2. Project-specific overrides
infer config agent set-model "deepseek/deepseek-chat" # Project-specific model
infer config tools bash enable # Enable bash tools for this project
# 3. Runtime overrides
INFER_AGENT_VERBOSE_TOOLS=true infer chat # Temporary verbose mode

The CLI validates configuration on startup and provides helpful error messages for:
- Invalid YAML syntax
- Unknown configuration keys
- Invalid value types (string vs boolean vs integer)
- Missing required values
Common Issues:
- Configuration not found: Check that the config file exists and has correct YAML syntax
- Environment variables not working: Ensure the proper `INFER_` prefix and underscore conversion
- Precedence confusion: Remember that environment variables override config files
Debugging:
# Enable verbose logging
infer -v config show
# Enable debug logging
INFER_LOGGING_DEBUG=true infer config show
# Check which config file is being used
infer config show | grep "Configuration file"

The CLI implements a user approval workflow for sensitive tool operations to enhance security and user control.
When the LLM attempts to execute a tool that requires approval:
- Approval Request: The system pauses execution and displays an approval modal
- Visual Preview: For file modification tools (Write, Edit, MultiEdit), a diff visualization shows exactly what will change
- User Decision: You can approve or reject the operation
- Execution: Only approved operations proceed; rejected operations are canceled
Each tool has a require_approval flag that can be configured:
tools:
# Dangerous operations require approval by default
write:
enabled: true
require_approval: true # User must approve before writing files
edit:
enabled: true
require_approval: true # User must approve before editing files
delete:
enabled: true
require_approval: true # User must approve before deleting files
# Safe operations don't require approval by default
read:
enabled: true
require_approval: false # No approval needed to read files
grep:
enabled: true
require_approval: false # No approval needed for searches
bash:
enabled: true
    require_approval: true # Approval required for command execution

You can override approval requirements using environment variables:
# Disable approval for Write tool
export INFER_TOOLS_WRITE_REQUIRE_APPROVAL=false
# Enable approval for Read tool
export INFER_TOOLS_READ_REQUIRE_APPROVAL=true

Tools requiring approval by default:
- `write` - Writing new files or overwriting existing ones
- `edit` - Modifying file contents
- `multiedit` - Making multiple file edits
- `delete` - Deleting files or directories
- `bash` - Executing shell commands
Tools NOT requiring approval by default:
- `read` - Reading file contents
- `grep` - Searching code
- `websearch` - Web searches
- `webfetch` - Fetching web content
- `github` - GitHub API operations
- `tree` - Displaying directory structure
- `todowrite` - Managing task lists
When an approval modal is displayed:
- Navigate: Use `←`/`→` arrow keys to select Approve or Reject
- Approve: Press `Enter` or `y` to approve the operation
- Reject: Press `Esc` or `n` to reject the operation
- View Details: The modal shows tool name, arguments, and diff preview for file modifications
Security Best Practices:
- Keep approval enabled for destructive tools (Write, Edit, Delete) in production
- Review diffs carefully before approving file modifications
- Use project configs to enforce approval requirements across the team
- Disable approval only in trusted, sandboxed environments
- `-c, --config`: Config file (default is `./.infer/config.yaml`)
- `-v, --verbose`: Verbose output
- `-h, --help`: Help for any command
# Create .env file with your API keys
cat > .env << EOF
ANTHROPIC_API_KEY=your_anthropic_key
OPENAI_API_KEY=your_openai_key
GOOGLE_API_KEY=your_google_key
EOF
# Start interactive chat (gateway starts automatically)
infer chat
# Optional: Check gateway status
infer status

# Use custom config file
infer --config ./my-config.yaml status
# Get verbose output
infer --verbose status
```bash
# Set default model for chat sessions
infer config agent set-model openai/gpt-4-turbo

# Set system prompt
infer config agent set-system "You are a helpful assistant."

# Enable tool execution with safety approval
infer config tools enable
infer config tools safety enable

# Configure sandbox directories for security
infer config tools sandbox add "/home/user/projects"
infer config tools sandbox add "/tmp/work"

# Add protected paths to prevent accidental modification
infer config tools sandbox add ".env"
infer config tools sandbox add ".git/"
```

```bash
# Configure tool execution (safety approval is enabled by default for write/delete operations)
infer config tools enable
infer config tools safety enable
```

Build and test from source:

```bash
go build -o infer .
go test ./...
```

The CLI provides an extensible shortcuts system that allows you to quickly execute common commands with `/shortcut-name` syntax.
Core Shortcuts:
- `/clear` - Clear conversation history
- `/exit` - Exit the chat session
- `/help [shortcut]` - Show available shortcuts or specific shortcut help
- `/switch [model]` - Switch to a different model
- `/theme [theme-name]` - Switch chat interface theme or list available themes
- `/config <show|get|set|reload> [key] [value]` - Manage configuration settings
- `/compact` - Immediately compact conversation to reduce token usage
- `/export [format]` - Export conversation to markdown
- `/init` - Set input with project analysis prompt for AGENTS.md generation
When you run `infer init`, a `.infer/shortcuts/git.yaml` file is created with common git operations:
- `/git-status` - Show working tree status
- `/git-pull` - Pull changes from remote repository
- `/git-push` - Push commits to remote repository
- `/git-log` - Show commit logs (last 10 commits with graph)
- `/git-commit` - Generate AI commit message from staged changes
AI-Powered Commit Messages:
The `/git-commit` shortcut uses the snippet feature to generate conventional commit messages:
- Analyzes your staged changes (`git diff --cached`)
- Sends the diff to the LLM with a prompt to generate a conventional commit message
- Automatically commits with the AI-generated message
Example Usage:
```bash
# Stage your changes
git add .

# Generate commit message and commit
/git-commit
```

The AI will generate a commit message following the conventional commit format (e.g., `feat: Add user authentication`, `fix: Resolve memory leak`).
Requirements:
- Run `infer init` to create the shortcuts file
- Stage changes with `git add` before using `/git-commit`
- The shortcut uses `jq` to format JSON output
Project Initialization Shortcut:
The `/init` shortcut populates the input field with a configurable prompt for generating an `AGENTS.md` file. This allows you to:
- Type `/init` to populate the input with the project analysis prompt
- Review and optionally modify the prompt before sending
- Press Enter to send the prompt and watch the agent analyze your project interactively
The prompt is configurable in your config file under `init.prompt`. The default prompt instructs the agent to:
- Analyze your project structure, build tools, and configuration files
- Create comprehensive documentation for AI agents
- Generate an AGENTS.md file with project overview, commands, and conventions
The SCM (Source Control Management) shortcuts provide seamless integration with GitHub and git workflows.
When you run `infer init`, a `.infer/shortcuts/scm.yaml` file is created with the following shortcuts:
- `/scm-issues` - List all GitHub issues for the repository
- `/scm-issue <number>` - Show details for a specific GitHub issue with comments
- `/scm-pr-create [optional context]` - Generate AI-powered PR plan with branch name, commit, and description
AI-Powered PR Creation:
The `/scm-pr-create` shortcut uses the snippet feature to analyze your changes and generate a complete PR plan:
- Analyzes staged or unstaged changes (`git diff`)
- Sends the diff to the LLM with context about the current and base branches
- Optionally accepts additional context to help the AI understand the purpose of the changes
- Generates a comprehensive PR plan including:
  - Suggested branch name (following conventional format: `feat/`, `fix/`, etc.)
  - Conventional commit message
  - PR title and description
This combines deterministic fetching of GitHub data with AI assistance for PR planning.
Example Usage:
```bash
# List all open issues
/scm-issues

# View details for issue #123 including comments
/scm-issue 123

# Generate PR plan (basic)
/scm-pr-create

# Generate PR plan with additional context
/scm-pr-create This fixes the timing issue where conversations were loading too slowly

# Generate PR plan with quoted context (for complex explanations)
/scm-pr-create "This implements user-requested feature for dark mode support"
```

Requirements:
- GitHub CLI (`gh`) must be installed and authenticated
- Run `infer init` to create the shortcuts file
- The commands work in any git repository with a GitHub remote
Customization:
You can customize these shortcuts by editing `.infer/shortcuts/scm.yaml`:
```yaml
shortcuts:
  - name: scm-issues
    description: "List all GitHub issues for the repository"
    command: gh
    args:
      - issue
      - list
      - --json
      - number,title,state,author,labels,createdAt,updatedAt
      - --limit
      - "20"
```

Use Cases:
- Quickly get context on what issues need to be worked on
- Fetch issue details and comments before implementing a fix
- Let the LLM analyze issue discussions to understand requirements
- Customize the shortcuts to add filters, change limits, or modify output format
Shortcuts can use the snippet feature to integrate LLM-powered workflows directly into YAML configuration. This enables complex AI-assisted tasks without writing Go code.
How Snippets Work:
- Command Execution: The shortcut runs a command that outputs JSON data
- Prompt Generation: A prompt template is filled with the JSON data and sent to the LLM
- Template Filling: The final template is filled with both JSON data and the LLM response
- Result Display: The filled template is shown to the user or executed
Snippet Configuration:
```yaml
shortcuts:
  - name: example-snippet
    description: "Example AI-powered shortcut"
    command: bash
    args:
      - -c
      - |
        # Command must output JSON
        jq -n --arg data "Hello" '{message: $data}'
    snippet:
      prompt: |
        You are given this data: {message}
        Generate a response based on it.
      template: |
        ## AI Response
        {llm}
```

Placeholder Syntax:

- `{fieldname}` - Replaced with values from the command's JSON output
- `{llm}` - Replaced with the LLM's response to the prompt
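As a rough illustration of the placeholder mechanics (assumed here to be plain string substitution; `fillTemplate` is a hypothetical helper, not the CLI's implementation):

```go
package main

import (
	"encoding/json"
	"fmt"
	"strings"
)

// fillTemplate replaces {field} placeholders with values from the
// command's JSON output, and {llm} with the model's response.
// Illustrative sketch only, assuming flat string-valued JSON.
func fillTemplate(tmpl, jsonOutput, llmResponse string) (string, error) {
	var fields map[string]string
	if err := json.Unmarshal([]byte(jsonOutput), &fields); err != nil {
		return "", err
	}
	result := strings.ReplaceAll(tmpl, "{llm}", llmResponse)
	for key, value := range fields {
		result = strings.ReplaceAll(result, "{"+key+"}", value)
	}
	return result, nil
}

func main() {
	out, err := fillTemplate(
		"## AI Response\n{llm} (from {message})",
		`{"message": "Hello"}`,
		"Hi there!",
	)
	if err != nil {
		panic(err)
	}
	fmt.Println(out)
}
```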
The `/git-commit` shortcut demonstrates the snippet feature:
````yaml
- name: git-commit
  description: "Generate AI commit message from staged changes"
  command: bash
  args:
    - -c
    - |
      if ! git diff --cached --quiet 2>/dev/null; then
        diff=$(git diff --cached)
        jq -n --arg diff "$diff" '{diff: $diff}'
      else
        echo '{"error": "No staged changes found."}'
        exit 1
      fi
  snippet:
    prompt: |
      Generate a conventional commit message.
      Changes:
      ```diff
      {diff}
      ```
      Format: "type: Description"
      - Type: feat, fix, docs, refactor, etc.
      - Description: "Capital first letter, under 50 chars"
      Output ONLY the commit message.
    template: "!git commit -m \"{llm}\""
````

How This Works:
- Command runs `git diff --cached` and outputs JSON: `{"diff": "..."}`
- Prompt template receives the diff via the `{diff}` placeholder
- LLM generates a commit message (e.g., `feat: Add user authentication`)
- Template receives the LLM response via the `{llm}` placeholder
- Final command executed: `git commit -m "feat: Add user authentication"`
If the template starts with `!`, the result is executed as a shell command:

```yaml
template: "!git commit -m \"{llm}\"" # Executes the command
template: "{llm}"                    # Just displays the result
```

Use Cases for Snippets:
- Generate commit messages from diffs
- Create PR descriptions from changes
- Analyze test output and suggest fixes
- Generate code documentation from source
- Transform data formats with AI assistance
- Automate complex workflows with AI decision-making
You can create custom shortcuts by adding YAML configuration files in the `.infer/shortcuts/` directory.
Configuration File Format:
Create files named `custom-*.yaml` (e.g., `custom-1.yaml`, `custom-dev.yaml`) in `.infer/shortcuts/`:
```yaml
shortcuts:
  - name: tests
    description: "Run all tests in the project"
    command: go
    args:
      - test
      - ./...
    working_dir: . # Optional: set working directory

  - name: build
    description: "Build the project"
    command: go
    args:
      - build
      - -o
      - infer
      - .

  - name: lint
    description: "Run linter on the codebase"
    command: golangci-lint
    args:
      - run
```

Configuration Fields:
- `name` (required): The shortcut name (used as `/name`)
- `description` (required): Human-readable description shown in `/help`
- `command` (required): The executable command to run
- `args` (optional): Array of arguments to pass to the command
- `working_dir` (optional): Working directory for the command (defaults to current)
- `snippet` (optional): AI-powered snippet configuration with `prompt` and `template` fields (see AI-Powered Snippets)
Using Shortcuts:
With the configuration above, you can use:
- `/tests` - Runs `go test ./...`
- `/build` - Runs `go build -o infer .`
- `/lint` - Runs `golangci-lint run`
You can also pass additional arguments:
- `/tests -v` - Runs `go test ./... -v`
- `/build --race` - Runs `go build -o infer . --race`
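Argument pass-through presumably just appends whatever you type after the shortcut to the configured args; a minimal sketch under that assumption (`buildCommand` is hypothetical, not the CLI's source):

```go
package main

import (
	"fmt"
	"strings"
)

// buildCommand appends user-typed arguments after the configured ones,
// so "/tests -v" expands to "go test ./... -v".
// Assumed behavior; the CLI may differ in detail.
func buildCommand(command string, cfgArgs, userArgs []string) string {
	parts := []string{command}
	parts = append(parts, cfgArgs...)
	parts = append(parts, userArgs...)
	return strings.Join(parts, " ")
}

func main() {
	fmt.Println(buildCommand("go", []string{"test", "./..."}, []string{"-v"}))
	// go test ./... -v
}
```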
Example Custom Shortcuts:
Here are some useful shortcuts you might want to add:
Development Shortcuts (`custom-dev.yaml`):
```yaml
shortcuts:
  - name: fmt
    description: "Format all Go code"
    command: go
    args:
      - fmt
      - ./...

  - name: "mod tidy"
    description: "Tidy up go modules"
    command: go
    args:
      - mod
      - tidy

  - name: version
    description: "Show current version"
    command: git
    args:
      - describe
      - --tags
      - --always
      - --dirty
```

Docker Shortcuts (`custom-docker.yaml`):
```yaml
shortcuts:
  - name: "docker build"
    description: "Build Docker image"
    command: docker
    args:
      - build
      - -t
      - myapp
      - .

  - name: "docker run"
    description: "Run Docker container"
    command: docker
    args:
      - run
      - -p
      - "8080:8080"
      - myapp
```

Project-Specific Shortcuts (`custom-project.yaml`):
```yaml
shortcuts:
  - name: migrate
    description: "Run database migrations"
    command: ./scripts/migrate.sh
    working_dir: .

  - name: seed
    description: "Seed database with test data"
    command: go
    args:
      - run
      - cmd/seed/main.go
```

Tips:
- File Organization: Use descriptive names for your config files (e.g., `custom-dev.yaml`, `custom-docker.yaml`)
- Command Discovery: Use `/help` to see all available shortcuts, including your custom ones
- Error Handling: If a custom shortcut fails to load, it will be skipped with a warning
- Reloading: Restart the chat session to reload custom shortcuts after making changes
- Security: Be careful with custom shortcuts, as they execute system commands
Troubleshooting Shortcuts:
- Shortcut not appearing: Check YAML syntax and file naming (`custom-*.yaml`)
- Command not found: Ensure the command is available in your `PATH`
- Permission denied: Check file permissions and executable rights
- Invalid YAML: Use a YAML validator to check your configuration syntax
This project is licensed under the MIT License.