Agent-Gantry Documentation

Universal Tool Orchestration Platform for LLM-Based Agent Systems

Context is precious. Execution is sacred. Trust is earned.

Welcome

Agent-Gantry is a Python library that solves three critical problems in LLM-based agent systems:

✨ What Agent-Gantry Does

1. **Context Window Tax**: Reduces token costs by ~90% through semantic routing instead of sending all tools in every prompt
2. **Tool/Protocol Fragmentation**: Write Once, Run Anywhere - supports OpenAI, Claude, Gemini, A2A agents, and MCP clients
3. **Operational Blindness**: Zero-trust security with policies, capabilities, and circuit breakers

Quick Links

🚀 Get Started

Install and run your first example in 5 minutes

Quick Start Guide →

📚 Guides

Learn key concepts and advanced patterns

Browse Guides →

📖 API Reference

Complete API documentation and examples

API Docs →

🏗️ Architecture

Understand the system design and best practices

Architecture →

Installation

# Basic installation
pip install agent-gantry

# With all LLM providers (quotes keep shells such as zsh from globbing the brackets)
pip install "agent-gantry[llm-providers]"

# With local persistence (LanceDB + Nomic embeddings)
pip install "agent-gantry[lancedb,nomic]"

# Everything
pip install "agent-gantry[all]"

5-Minute Quick Start

Transform your existing LLM code into a semantically-aware agent system:

import asyncio

from openai import AsyncOpenAI
from agent_gantry import AgentGantry, with_semantic_tools, set_default_gantry

# Initialize
client = AsyncOpenAI()
gantry = AgentGantry()
set_default_gantry(gantry)

# Register tools
@gantry.register(tags=["weather"])
def get_weather(city: str) -> str:
    """Get the current weather for a city."""
    return f"The weather in {city} is 72°F and sunny."

# Apply decorator - tools are automatically injected!
@with_semantic_tools(limit=3)
async def ask_llm(prompt: str, *, tools=None):
    return await client.chat.completions.create(
        model="gpt-4o",
        messages=[{"role": "user", "content": prompt}],
        tools=tools  # Agent-Gantry injects relevant tools here
    )

# Just call it - semantic routing happens automatically
async def main():
    response = await ask_llm("What's the weather in San Francisco?")
    print(response.choices[0].message)

# In a notebook or async REPL, use `await main()` instead
asyncio.run(main())

That’s it! Agent-Gantry automatically:

🎯 Selects only relevant tools based on the query (~79% fewer prompt tokens in the benchmark below)
🔄 Converts tool schemas to any LLM provider format
🛡️ Executes tools with circuit breakers and security policies

Key Features

Semantic Routing

Intelligent tool selection using vector similarity, reducing context window usage by ~90%
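To make the idea concrete, here is a minimal sketch of top-k tool selection by vector similarity. It is not Agent-Gantry's implementation: the `TOOLS` catalog, the `embed` function (a toy bag-of-words count standing in for a real embedding model), and `select_tools` are all hypothetical names chosen for illustration.

```python
import math
from collections import Counter

# Hypothetical tool catalog: name -> description (not Agent-Gantry's registry).
TOOLS = {
    "get_weather": "Get the current weather forecast for a city",
    "convert_currency": "Convert an amount of money between two currencies",
    "send_email": "Send an email message to a recipient",
    "search_flights": "Search for airline flights between two airports",
}


def embed(text: str) -> Counter:
    # Toy embedding: lowercase bag-of-words counts. A real system would
    # call an embedding model here instead.
    cleaned = text.lower().replace("?", " ").replace("'", " ")
    return Counter(cleaned.split())


def cosine(a: Counter, b: Counter) -> float:
    # Cosine similarity between two sparse count vectors.
    dot = sum(a[t] * b[t] for t in a.keys() & b.keys())
    norm_a = math.sqrt(sum(v * v for v in a.values()))
    norm_b = math.sqrt(sum(v * v for v in b.values()))
    return dot / (norm_a * norm_b) if norm_a and norm_b else 0.0


def select_tools(query: str, limit: int = 2) -> list[str]:
    # Rank every tool description against the query and keep the top `limit`.
    q = embed(query)
    ranked = sorted(TOOLS, key=lambda name: cosine(q, embed(TOOLS[name])), reverse=True)
    return ranked[:limit]


selected = select_tools("What's the weather in San Francisco?", limit=1)
print(selected)
```

Only the schemas of the selected tools would then be sent to the model, which is where the prompt-token savings come from.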

Multi-Protocol Support

Native support for:

- MCP (Model Context Protocol) - Client and Server
- A2A (Agent-to-Agent Protocol)
- OpenAI, Anthropic, Google Gemini, Mistral, Groq
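As an illustration of what per-provider schema conversion involves, the sketch below maps one provider-neutral tool definition to the OpenAI Chat Completions and Anthropic Messages tool formats. The neutral `tool` dictionary and the `to_openai` / `to_anthropic` helpers are hypothetical and are not Agent-Gantry's API; only the two output shapes follow the providers' documented formats.

```python
from typing import Any

# Provider-neutral tool definition (hypothetical shape, for illustration only).
tool: dict[str, Any] = {
    "name": "get_weather",
    "description": "Get the current weather for a city.",
    "parameters": {
        "type": "object",
        "properties": {"city": {"type": "string"}},
        "required": ["city"],
    },
}


def to_openai(t: dict[str, Any]) -> dict[str, Any]:
    # OpenAI Chat Completions wraps the definition in a "function" object.
    return {
        "type": "function",
        "function": {
            "name": t["name"],
            "description": t["description"],
            "parameters": t["parameters"],
        },
    }


def to_anthropic(t: dict[str, Any]) -> dict[str, Any]:
    # Anthropic Messages carries the same JSON Schema under "input_schema".
    return {
        "name": t["name"],
        "description": t["description"],
        "input_schema": t["parameters"],
    }


openai_tool = to_openai(tool)
anthropic_tool = to_anthropic(tool)
```

The JSON Schema for the arguments is shared; only the envelope around it differs, which is why a single registration can serve several providers.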

Production-Ready

- Circuit breakers and health tracking
- Retries with exponential backoff
- Structured logging and telemetry
- Zero-trust security with capability-based permissions
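For readers unfamiliar with the circuit-breaker pattern, here is a minimal, generic sketch. It assumes nothing about Agent-Gantry's internals: `CircuitBreaker`, `CircuitOpenError`, and the `failure_threshold` / `reset_after` parameters are hypothetical names used only to show the idea of rejecting calls to a tool that keeps failing.

```python
import time


class CircuitOpenError(RuntimeError):
    """Raised when a call is rejected because the circuit is open."""


class CircuitBreaker:
    def __init__(self, failure_threshold=3, reset_after=30.0, clock=time.monotonic):
        self.failure_threshold = failure_threshold
        self.reset_after = reset_after  # cool-down in seconds
        self._clock = clock
        self._failures = 0
        self._opened_at = None  # None means the circuit is closed

    def call(self, func, *args, **kwargs):
        if self._opened_at is not None:
            if self._clock() - self._opened_at < self.reset_after:
                raise CircuitOpenError("circuit open; skipping call")
            # Cool-down elapsed: allow one trial call (half-open state).
            # A single failure now re-opens the circuit immediately.
            self._opened_at = None
            self._failures = self.failure_threshold - 1
        try:
            result = func(*args, **kwargs)
        except Exception:
            self._failures += 1
            if self._failures >= self.failure_threshold:
                self._opened_at = self._clock()
            raise
        self._failures = 0  # any success closes the circuit fully
        return result


def flaky_tool():
    raise TimeoutError("upstream timed out")


breaker = CircuitBreaker(failure_threshold=2, reset_after=60.0)
outcomes = []
for _ in range(3):
    try:
        breaker.call(flaky_tool)
    except CircuitOpenError:
        outcomes.append("rejected")
    except TimeoutError:
        outcomes.append("failed")

print(outcomes)
```

After two consecutive failures the third call is rejected without touching the tool, which protects both the failing dependency and the agent's latency budget.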

Framework Agnostic

Works seamlessly with:

- LangChain
- AutoGen
- CrewAI
- LlamaIndex
- Semantic Kernel
- Custom agents

What’s New in v0.3.0

✨ Microsoft Agent Framework 1.3.0 + Mistral via OpenAI SDK

v0.3.0 lands two ecosystem updates and a regression-guarded routing test suite:

- **Mistral integration switched to the OpenAI SDK.** The official `mistralai` package was quarantined on PyPI on 2026-05-12; Gantry's `provider="mistral"` now drives `AsyncOpenAI` against `base_url="https://api.mistral.ai/v1"`. Public behaviour is preserved: `LLMConfig(provider="mistral")` works unchanged.
- **`agent-framework` floor bumped to `1.3.0`.** Picks up `ClassSkill`, `allowed_tools` for OpenAI and Gemini tool-choice, and the function-approval flow in Foundry hosted agents.
- **`anthropic` floor bumped to `>=0.101.0`.**
- **New end-to-end routing tests** (`tests/test_agent_framework_orchestration.py`) drive a `ScriptedChatClient` through the full `function_call → Gantry-execute → function_result → final-text` loop, proving Gantry's tool surface adapts per chat round.

```python
# Mistral via the OpenAI-compatible endpoint (0.3.0+)
from openai import AsyncOpenAI

client = AsyncOpenAI(
    api_key="...",
    base_url="https://api.mistral.ai/v1",
)

tools = await gantry.retrieve_tools("Translate hello to French", limit=1)
```

Learn more about Dynamic MCP Selection →
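The `function_call → execute → function_result → final-text` loop mentioned in the release notes can be sketched roughly as follows. This is not the code in `tests/test_agent_framework_orchestration.py`: the `ScriptedClient` class, the simplified message dictionaries, and `run_loop` are hypothetical stand-ins, and the sketch leaves out per-round tool retrieval.

```python
import json


def get_weather(city: str) -> str:
    return f"The weather in {city} is 72°F and sunny."


# Hypothetical local tool table used by the loop below.
TOOLS = {"get_weather": get_weather}


class ScriptedClient:
    """Returns pre-scripted replies in order, standing in for a real chat client."""

    def __init__(self, replies):
        self._replies = iter(replies)

    def complete(self, messages):
        return next(self._replies)


def run_loop(client, prompt, max_rounds=5):
    messages = [{"role": "user", "content": prompt}]
    for _ in range(max_rounds):
        reply = client.complete(messages)
        if reply["type"] == "text":
            # Final text: the loop ends here.
            return reply["content"], messages
        # function_call: execute the tool, then append the function_result
        # so the next round can see it.
        arguments = json.loads(reply["arguments"])
        result = TOOLS[reply["name"]](**arguments)
        messages.append({"role": "tool", "name": reply["name"], "content": result})
    raise RuntimeError("no final text within max_rounds")


client = ScriptedClient([
    {"type": "function_call", "name": "get_weather", "arguments": json.dumps({"city": "Paris"})},
    {"type": "text", "content": "It is 72°F and sunny in Paris."},
])
final_text, transcript = run_loop(client, "What's the weather in Paris?")
print(final_text)
```

Scripting the replies keeps such a test deterministic: no network call is made, and every round of the loop can be asserted on.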

Context Window Savings

Agent-Gantry significantly reduces token usage by dynamically surfacing only the most relevant tools.

Benchmark Results:

| Scenario | Tools Passed | Prompt Tokens | Cost Reduction |
| --- | --- | --- | --- |
| Standard (All Tools) | 15 | 366 | - |
| Agent-Gantry (Top 2) | 2 | 78 | ~79% |

Measured using gpt-3.5-turbo with provider-reported token usage.
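The reduction figure follows directly from the two prompt-token counts in the table. The short calculation below reproduces it; `token_reduction` is a hypothetical helper, and the result covers prompt tokens only, so overall cost savings would also depend on completion tokens and pricing.

```python
def token_reduction(baseline_tokens: int, routed_tokens: int) -> float:
    # Fraction of prompt tokens saved relative to sending every tool.
    return 1 - routed_tokens / baseline_tokens


reduction = token_reduction(baseline_tokens=366, routed_tokens=78)
print(f"{reduction:.1%}")  # prints 78.7%
```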

Stress Test: 100 Tools

| Metric | Value |
| --- | --- |
| Total Tools | 100 |
| Retrieval Limit | Top 2 |
| Accuracy | 100% (10/10 queries) |
| Embedder | Nomic (`nomic-embed-text-v1.5`) |

Documentation Structure

- Getting Started: Installation, quick start, and first steps
- Guides: Topic-specific tutorials and patterns
- Reference: API documentation and configuration
- Architecture: System design and best practices
  - System Overview
  - Best Practices
- Troubleshooting: Common issues and solutions

Community & Support

- GitHub Repository: Source code, issues, and contributions
- Report a Bug: Found an issue? Let us know
- Feature Requests: Suggest improvements

License

Agent-Gantry is open-source software licensed under the MIT License.

### Ready to Get Started?

Transform your LLM agent system with semantic tool orchestration

Get Started Now →