Eviction¶

The eviction capability automatically saves large tool outputs to files and replaces them with a preview + file reference. This prevents context pollution from tools that return massive results (e.g., grep or read_file on large files).

Quick Start¶

Eviction is enabled by default with a 20,000 token threshold:

Python

from pydantic_deep import create_deep_agent

agent = create_deep_agent()  # eviction_token_limit=20_000 by default

# Custom threshold
agent = create_deep_agent(eviction_token_limit=50_000)

# Disable
agent = create_deep_agent(eviction_token_limit=None)

How It Works¶

The EvictionCapability uses the after_tool_execute hook to intercept large tool results before they enter the conversation history. This means the full output never bloats the message list in memory.

After each tool call, after_tool_execute checks the result size
If the result exceeds the token limit (chars / 4), it:
Saves the full output to a file in the backend (/large_tool_results/{id})
Returns a preview showing head/tail lines + file path instead
The agent can then use read_file with offset/limit to access the full output

Why a capability hook instead of a history processor?

The previous EvictionProcessor (history processor) ran after the large output was already stored in the message list — the full content sat in memory until the next model call. The EvictionCapability intercepts via after_tool_execute, so the large output never enters the history.

Before Eviction¶

Text Only

Tool result: [50,000 characters of grep output]

After Eviction¶

Text Only

Tool result too large, saved to: /large_tool_results/call_abc123

Read the result using read_file with offset and limit parameters.
Example: read_file(path="/large_tool_results/call_abc123", offset=0, limit=100)

Preview (head/tail):

[first 5 lines]

... [990 lines truncated] ...

[last 5 lines]

Configuration¶

Parameter	Type	Default	Description
`eviction_token_limit`	`int \\| None`	`None`	Token threshold for eviction. `None` disables eviction.

Standalone Usage¶

For custom agent setups, use EvictionProcessor directly:

Python

from pydantic_ai import Agent
from pydantic_ai_backends import StateBackend
from pydantic_deep.processors.eviction import EvictionProcessor

processor = EvictionProcessor(
    backend=StateBackend(),
    token_limit=20000,
    eviction_path="/large_tool_results",
    head_lines=5,
    tail_lines=5,
)

agent = Agent("anthropic:claude-sonnet-4-6", history_processors=[processor])

Factory Function¶

Python

from pydantic_ai_backends import StateBackend
from pydantic_deep import create_eviction_processor

processor = create_eviction_processor(
    backend=StateBackend(),
    token_limit=20000,
    eviction_path="/large_tool_results",
    head_lines=10,
    tail_lines=10,
)

Multi-User Applications

Evicted files are written to the backend. In multi-user apps, use separate backends per user to avoid mixing evicted outputs. See Multi-User Guide.

Runtime Backend Resolution¶

When used via create_deep_agent(), the processor resolves the backend from ctx.deps.backend at runtime. This ensures evicted files are written to the same backend that read_file, grep, and other console tools use. It falls back to the backend passed at initialization for standalone usage.

Components¶

Component	Description
[`EvictionCapability`][pydantic_deep.processors.eviction.EvictionCapability]	Capability that intercepts large outputs via `after_tool_execute` (default)
[`EvictionProcessor`][pydantic_deep.processors.eviction.EvictionProcessor]	Legacy history processor (for standalone/backward-compatible use)
[`create_eviction_processor`][pydantic_deep.processors.eviction.create_eviction_processor]	Factory function for the legacy processor
[`create_content_preview`][pydantic_deep.processors.eviction.create_content_preview]	Create head/tail preview
`DEFAULT_TOKEN_LIMIT`	Default threshold: 20,000 tokens
`DEFAULT_EVICTION_PATH`	Default path: `/large_tool_results`

Next Steps¶

History Processors — Summarization and sliding window
Cost Tracking — Token and cost monitoring
Agents — Full agent configuration