Format Pipeline Configuration
The format pipeline works out of the box with sensible defaults. All configuration is optional and goes into your .devboy.toml file.
Quick Start
No configuration is needed. By default, the pipeline uses the TOON format with an 8,000-token budget.
Typical savings on real projects (kubernetes, vscode, rust-lang, golang):
- TOON Full: 3-17% fewer tokens than JSON
- TOON Standard (with budget trimming): ~44% savings
- TOON Minimal (with budget trimming): ~92% savings
Run devboy benchmark --owner <owner> --repo <repo> to measure savings on your project.
See Format Pipeline Architecture for detailed benchmarks.
Full Configuration Reference
Default Strategy Mapping
These built-in mappings are applied automatically:
Proxy Tools
When proxy tools (from upstream MCP servers) are used, the pipeline automatically strips the prefix to find a matching strategy:
You can override this by explicitly mapping proxy tool names:
Or disable automatic matching entirely:
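The matching rules described in this section (automatic prefix stripping, explicit overrides, and switching the automatic matching off) can be sketched in Python as follows. This only illustrates the lookup logic and is not the .devboy.toml syntax. The strategy table, the `__` separator, the tool names, and the `resolve_strategy` function are assumptions made for the example, not devboy's actual identifiers.

```python
# Hypothetical strategy table; the real built-in mapping is not reproduced here.
STRATEGIES = {
    "get_issues": "issues",
    "get_merge_requests": "merge_requests",
}


def resolve_strategy(tool_name, overrides=None, auto_match=True, separator="__"):
    """Return the strategy name for a tool, or None if nothing matches.

    `separator` is an assumed prefix delimiter for proxy tool names.
    """
    overrides = overrides or {}
    # 1. An explicit mapping for the full (prefixed) name always wins.
    if tool_name in overrides:
        return overrides[tool_name]
    # 2. A direct match on a built-in tool name.
    if tool_name in STRATEGIES:
        return STRATEGIES[tool_name]
    # 3. Optionally strip the upstream prefix and retry the lookup.
    if auto_match and separator in tool_name:
        _, bare_name = tool_name.split(separator, 1)
        return STRATEGIES.get(bare_name)
    return None
```

With these assumptions, `resolve_strategy("cloud__get_issues")` resolves to the same strategy as `get_issues`, while passing `auto_match=False` leaves the proxy tool unmatched unless an override names it explicitly.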
Common Scenarios
Increase budget for large projects
If your project has many issues/MRs and you want more data per response:
Use JSON for CI/CD integration
If you're processing tool output programmatically:
Custom strategy for a proxy tool
If you have a proxy tool that returns issue-like data:
Disable budget trimming
Set a very high budget to effectively disable trimming:
Chunk-Based Behavior
When tool output exceeds the budget, the pipeline automatically splits the response into chunks. The first response includes chunk 1 (highest-value items based on the active trimming strategy) and a chunk index describing all available chunks. Agents use offset and limit parameters in subsequent tool calls to fetch specific chunks on demand, without needing to read all data sequentially.
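As a rough illustration of the chunking described above, the following Python sketch packs items into budget-sized chunks and then fetches one chunk by offset and limit. The `Chunk` fields, the greedy packing rule, and the per-item token costs are assumptions for the example. The actual chunk index format is defined in the architecture document, and the items are assumed to be already ordered by the active trimming strategy.

```python
from dataclasses import dataclass


@dataclass
class Chunk:
    index: int   # 1-based chunk number
    offset: int  # position of the chunk's first item in the full result
    limit: int   # number of items in the chunk
    tokens: int  # estimated token cost of the chunk


def build_chunk_index(item_tokens, budget):
    """Greedily pack items (given by their token costs) into budget-sized chunks.

    A single item larger than the budget still gets a chunk of its own.
    """
    chunks = []
    start, used = 0, 0
    for i, cost in enumerate(item_tokens):
        # Close the current chunk when the next item would exceed the budget.
        if used + cost > budget and i > start:
            chunks.append(Chunk(len(chunks) + 1, start, i - start, used))
            start, used = i, 0
        used += cost
    if start < len(item_tokens):
        chunks.append(Chunk(len(chunks) + 1, start, len(item_tokens) - start, used))
    return chunks


def fetch(items, offset, limit):
    """Return one chunk's items, as a follow-up call with offset/limit would."""
    return items[offset:offset + limit]


# Five items of 3,000 tokens each against the default 8,000-token budget.
index = build_chunk_index([3000] * 5, budget=8000)
second = fetch(["a", "b", "c", "d", "e"], index[1].offset, index[1].limit)
```

Here the index holds three chunks (two, two, and one item), and `second` contains the third and fourth items. An agent that only needs the later items can request them directly instead of reading chunk 1 first.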
See Format Pipeline Architecture — Chunk-Based Lazy Loading for details on the chunk index format and data flow.
Provider Result Metadata
When providers return list data, pagination and sort metadata from the upstream API (e.g., GitLab X-Total headers, Jira total/startAt/maxResults) is captured in ProviderResult<T> and flows through to FormatMetadata. This allows agents to understand the total dataset size and available sort options without additional API calls.
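To make the metadata flow more concrete, here is a minimal Python sketch of a result wrapper that carries pagination data from the two upstream APIs mentioned above. Only the `X-Total` header and the Jira `total`/`startAt`/`maxResults` fields come from the text. The field names on `ProviderResult` and `Pagination`, and the helper functions, are hypothetical and do not mirror devboy's actual types.

```python
from dataclasses import dataclass, field
from typing import Generic, List, Optional, TypeVar

T = TypeVar("T")


@dataclass
class Pagination:
    total: Optional[int]  # total items upstream, if the API reports it
    offset: int           # index of the first returned item
    limit: int            # page size that was requested


@dataclass
class ProviderResult(Generic[T]):
    items: List[T]
    pagination: Optional[Pagination] = None
    sort_options: List[str] = field(default_factory=list)


def from_gitlab(items, headers, page, per_page):
    """Build a result from a GitLab list response and its X-Total header."""
    total = headers.get("X-Total")
    return ProviderResult(
        items,
        Pagination(
            total=int(total) if total is not None else None,
            offset=(page - 1) * per_page,
            limit=per_page,
        ),
    )


def from_jira(body):
    """Build a result from a Jira search response body."""
    return ProviderResult(
        body["issues"],
        Pagination(body["total"], body["startAt"], body["maxResults"]),
    )


def remaining(result):
    """Items not yet fetched, or None when the total is unknown."""
    p = result.pagination
    if p is None or p.total is None:
        return None
    return max(p.total - (p.offset + len(result.items)), 0)
```

With a wrapper of this shape, an agent can tell from a single response how much data is left, for example two of five GitLab items fetched leaves three remaining, without issuing another request just to count.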
See Format Pipeline Architecture — Provider Metadata for details.