Content

<h1 align="center">File Storage & Sandbox Backends for Pydantic AI</h1> Console Toolset, Docker Sandbox, and Permission System for AI Agents <a href="https://pypi.org/project/pydantic-ai-backend/"><img src="https://img.shields.io/pypi/v/pydantic-ai-backend.svg" alt="PyPI version"></a> <a href="https://www.python.org/downloads/"><img src="https://img.shields.io/badge/python-3.10+-blue.svg" alt="Python 3.10+"></a> <a href="https://opensource.org/licenses/MIT"><img src="https://img.shields.io/badge/License-MIT-yellow.svg" alt="License: MIT"></a> <a href="https://github.com/vstorm-co/pydantic-ai-backend/actions/workflows/ci.yml"><img src="https://github.com/vstorm-co/pydantic-ai-backend/actions/workflows/ci.yml/badge.svg" alt="CI"></a> <a href="https://github.com/pydantic/pydantic-ai"><img src="https://img.shields.io/badge/Powered%20by-Pydantic%20AI-E92063?logo=pydantic&logoColor=white" alt="Pydantic AI"></a> Console Toolset — ls, read, write, edit, grep, execute  •  Docker Sandbox — isolated code execution  •  Permission System — fine-grained access control --- **File Storage & Sandbox Backends** provides everything your [Pydantic AI](https://ai.pydantic.dev/) agent needs to work with files and execute code safely. Choose from in-memory, local filesystem, or Docker-isolated backends. > **Full framework?** Check out [Pydantic Deep Agents](https://github.com/vstorm-co/pydantic-deepagents) — complete agent framework with planning, filesystem, subagents, and skills. ## Use Cases | What You Want to Build | How This Library Helps | |------------------------|------------------------| | **AI Coding Assistant** | Console toolset with file ops + code execution | | **Multi-User Web App** | Docker sandboxes with session isolation | | **Code Review Bot** | Read-only backend with grep/glob search | | **Secure Execution** | Permission system blocks dangerous operations | | **Testing/CI** | In-memory StateBackend for fast, isolated tests | ## Installation ```bash pip install pydantic-ai-backend ``` Or with uv: ```bash uv add pydantic-ai-backend ``` Optional extras: ```bash # Console toolset (requires pydantic-ai) pip install pydantic-ai-backend[console] # Docker sandbox support pip install pydantic-ai-backend[docker] # Everything pip install pydantic-ai-backend[console,docker] ``` ## Quick Start — ConsoleCapability (Recommended) The simplest way to give your agent filesystem tools: ```python from pydantic_ai import Agent from pydantic_ai_backends import ConsoleCapability agent = Agent("openai:gpt-4.1", capabilities=[ConsoleCapability()]) ``` ### With Permissions ```python from pydantic_ai_backends import ConsoleCapability from pydantic_ai_backends.permissions import READONLY_RULESET # Read-only agent — write/edit/execute tools are hidden from the model agent = Agent("openai:gpt-4.1", capabilities=[ConsoleCapability(permissions=READONLY_RULESET)]) ``` ### Alternative: Toolset API ```python from dataclasses import dataclass from pydantic_ai import Agent from pydantic_ai_backends import LocalBackend, create_console_toolset @dataclass class Deps: backend: LocalBackend agent = Agent( "openai:gpt-4.1", deps_type=Deps, toolsets=[create_console_toolset()], ) backend = LocalBackend(root_dir="./workspace") result = agent.run_sync( "Create a Python script that calculates fibonacci and run it", deps=Deps(backend=backend), ) print(result.output) ``` **That's it.** Your agent can now: - List files and directories (`ls`) - Read and write files (`read_file`, `write_file`) - Edit files with string replacement (`edit_file`) - Search with glob patterns and regex (`glob`, `grep`) - Execute shell commands (`execute`) ## Available Backends | Backend | Storage | Execution | Use Case | |---------|---------|-----------|----------| | `StateBackend` | In-memory | No | Testing, ephemeral sessions | | `LocalBackend` | Filesystem | Yes | Local development, CLI tools | | `DockerSandbox` | Container | Yes | Multi-user, untrusted code | | `CompositeBackend` | Routed | Varies | Complex multi-source setups | ### In-Memory (StateBackend) ```python from pydantic_ai_backends import StateBackend backend = StateBackend() # Files stored in memory, perfect for tests ``` ### Local Filesystem (LocalBackend) ```python from pydantic_ai_backends import LocalBackend backend = LocalBackend( root_dir="/workspace", allowed_directories=["/workspace", "/shared"], enable_execute=True, ) ``` ### Docker Sandbox (DockerSandbox) ```python from pydantic_ai_backends import DockerSandbox sandbox = DockerSandbox(runtime="python-datascience") sandbox.start() # Fully isolated container environment sandbox.stop() ``` ### Reusable Named Container ```python from pydantic_ai_backends import DockerSandbox # Named container persists between sessions (packages survive restarts) sandbox = DockerSandbox( image="python:3.12-slim", container_name="my-dev-env", # implies auto_remove=False volumes={"/my/project": "/workspace"}, ) # Next time: finds existing container and reattaches ``` ## Console Toolset Ready-to-use tools for pydantic-ai agents: ```python from pydantic_ai_backends import create_console_toolset # All tools enabled toolset = create_console_toolset() # Without shell execution toolset = create_console_toolset(include_execute=False) # With approval requirements toolset = create_console_toolset( require_write_approval=True, require_execute_approval=True, ) # With custom tool descriptions toolset = create_console_toolset( descriptions={ "execute": "Run shell commands in the workspace", "read_file": "Read file contents from the workspace", } ) ``` **Available tools:** `ls`, `read_file`, `write_file`, `edit_file`, `glob`, `grep`, `execute` ### Image Support For multimodal models, enable image file handling: ```python toolset = create_console_toolset(image_support=True) # Now read_file on .png/.jpg/.gif/.webp returns BinaryContent # that multimodal models (GPT-4o, Claude, etc.) can see directly ``` ## Permission System Fine-grained access control: ```python from pydantic_ai_backends import LocalBackend from pydantic_ai_backends.permissions import DEFAULT_RULESET, READONLY_RULESET # Safe defaults (allow reads, ask for writes) backend = LocalBackend(root_dir="/workspace", permissions=DEFAULT_RULESET) # Read-only mode backend = LocalBackend(root_dir="/workspace", permissions=READONLY_RULESET) ``` | Preset | Description | |--------|-------------| | `DEFAULT_RULESET` | Allow reads (except secrets), ask for writes/executes | | `PERMISSIVE_RULESET` | Allow most operations, deny dangerous commands | | `READONLY_RULESET` | Allow reads only, deny all writes and executes | | `STRICT_RULESET` | Everything requires approval | ## Docker Runtimes Pre-configured environments: | Runtime | Base Image | Packages | |---------|------------|----------| | `python-minimal` | python:3.12-slim | (none) | | `python-datascience` | python:3.12-slim | pandas, numpy, matplotlib, scikit-learn | | `python-web` | python:3.12-slim | fastapi, uvicorn, sqlalchemy, httpx | | `node-minimal` | node:20-slim | (none) | | `node-react` | node:20-slim | typescript, vite, react | Custom runtime: ```python from pydantic_ai_backends import DockerSandbox, RuntimeConfig runtime = RuntimeConfig( name="ml-env", base_image="python:3.12-slim", packages=["torch", "transformers"], ) sandbox = DockerSandbox(runtime=runtime) ``` ## Session Manager Multi-user web applications: ```python from pydantic_ai_backends import SessionManager # Docker (default) manager = SessionManager( default_runtime="python-datascience", workspace_root="/app/workspaces", ) # Each user gets isolated sandbox sandbox = await manager.get_or_create("user-123") ``` ### Custom Sandbox Factory Use any sandbox backend (Daytona, custom, etc.): ```python from pydantic_ai_backends import SessionManager, DaytonaSandbox def daytona_factory(session_id: str) -> DaytonaSandbox: return DaytonaSandbox(sandbox_id=session_id) manager = SessionManager(sandbox_factory=daytona_factory) sandbox = await manager.get_or_create("user-123") ``` ## Why Choose This Library? | Feature | Description | |---------|-------------| | **Multiple Backends** | In-memory, filesystem, Docker — same interface | | **Console Toolset** | Ready-to-use tools for pydantic-ai agents | | **Permission System** | Pattern-based access control with presets | | **Docker Isolation** | Safe execution of untrusted code | | **Session Management** | Multi-user support with workspace persistence | | **Image Support** | Multimodal models can see images via BinaryContent | | **Pre-built Runtimes** | Python and Node.js environments ready to go | ## Related Projects | Package | Description | |---------|-------------| | [Pydantic Deep Agents](https://github.com/vstorm-co/pydantic-deepagents) | Full agent framework (uses this library) | | [pydantic-ai-todo](https://github.com/vstorm-co/pydantic-ai-todo) | Task planning toolset | | [subagents-pydantic-ai](https://github.com/vstorm-co/subagents-pydantic-ai) | Multi-agent orchestration | | [summarization-pydantic-ai](https://github.com/vstorm-co/summarization-pydantic-ai) | Context management | | [pydantic-ai](https://github.com/pydantic/pydantic-ai) | The foundation — agent framework by Pydantic | ## Contributing ```bash git clone https://github.com/vstorm-co/pydantic-ai-backend.git cd pydantic-ai-backend make install make test # 100% coverage required ``` ## License MIT — see [LICENSE](LICENSE) --- <div align="center"> ### Need help implementing this in your company? We're <a href="https://vstorm.co">Vstorm</a> — an Applied Agentic AI Engineering Consultancy with 30+ production AI agent implementations. <a href="https://vstorm.co/contact-us/"> <img src="https://img.shields.io/badge/Talk%20to%20us%20%E2%86%92-0066FF?style=for-the-badge&logoColor=white" alt="Talk to us"> </a> Made with ❤️ by <a href="https://vstorm.co">Vstorm</a> </div>

pydantic-ai-backend

Content

Connection Info

You Might Also Like

everything-claude-code

markitdown

cc-switch

servers

servers

Time

pydantic-ai-backend

Scan with WeChat to Share

Authentication Required

Content

Connection Info

You Might Also Like

everything-claude-code

markitdown

cc-switch

servers

servers

Time