AgentMesh

A deterministic coordination protocol for AI-to-AI communication. File-as-interface, path-addressable state, dual-mechanism conflict resolution.

Built for strAIght up! Hackathon 2026 (Singapore) — Theme E:

Most software infrastructure today assumes humans are the primary operators. But agentic systems increasingly need to coordinate with other agents, tools, and environments directly. How might we build infrastructure designed for AI-to-AI interaction, rather than retrofitting human-centered software for agent workflows?

AgentMesh is the answer.

▶ Demo video: https://www.youtube.com/watch?v=y7LyWE7K-wI

Problem

Multi-agent AI setups today coordinate in one of two broken ways:

Orchestrator bottleneck. Every inter-agent message gets summarized by a central LLM, losing semantic signal each hop and burning tokens linearly with participants. Context degrades like a telephone game.
Retrofit tax. Chat threads, IDEs, and ticket systems were built for humans. Using them as the substrate between agents adds latency, format mismatches, and serialization overhead that models waste tokens papering over.

Both approaches treat AI-to-AI coordination as a human workflow problem. It isn't. It's an infrastructure problem.

What AgentMesh is

A file-based, protocol-driven communication layer that sits between agents. Each major agent is paired with a Mini Agent sidecar — a small Python process that owns three files (context.json, summary.json, input.json) plus a versioned dictionary store (dictionary.json). Agents never talk directly; Mini Agents mediate all coordination.

Three design principles:

File as interface. Communication is JSON on disk. Anything that can read or write JSON can participate — no vendor SDK, no sockets, no shared database between agents.
Minimum viable context. When agent A changes backend.routes./api/users.auth_required, agent B receives only that dot-path and its diff — not the whole backend state. Dependency maps declare who cares about what.
Sidecar mediation. Major agents focus on their coding task. Mini Agents handle diffing, routing, conflict detection, and delivery — deterministically, in pure code.

What's in the box

┌──────────┐   ┌──────────┐   ┌──────────┐       Any producer: real LLM
│ Agent A  │   │ Agent B  │   │ Agent C  │       (Claude Code, Codex,
│          │   │          │   │          │       Gemini, Ollama) or a
└────┬─────┘   └────┬─────┘   └────┬─────┘       scripted driver.
     ▼              ▼              ▼
┌──────────┐   ┌──────────┐   ┌──────────┐       Python sidecar. File
│Mini Agent│<->│Mini Agent│<->│Mini Agent│       watcher, diff engine,
│          │   │          │   │          │       router, dual-mechanism
└────┬─────┘   └────┬─────┘   └────┬─────┘       conflict resolver.
     └──────────┬───┴──────────────┘
                ▼
         ┌──────────────┐
         │ WebSocket    │                        Structured event stream
         │ :9900        │                        (pydantic-validated).
         └──────┬───────┘
                ▼
    ┌─────────────────────────┐
    │ VS Code sidebar +       │                  Live agent cards, dict
    │ browser overlay         │                  tree, conflict panel,
    │ (pixel-office view)     │                  message courier anims.
    └─────────────────────────┘

Component	What it does
Dictionary store	Nested JSON, dot-path addressable (`backend.routes./api/users.auth_required`), monotonically versioned, atomic writes via `tempfile.mkstemp` + `os.replace`, per-mutation history log
Diff engine	Path-aware structural diff. Preserves URL segments (`/api/users` stays one token). Emits add / modify / delete ops.
Router	Dependency-map-driven fan-out. Glob-style patterns (`routes.`, `schema.*`) with segment-aware matching. Collapses related changes into one message per subtree.
Conflict engine (Type A)	Direct path collision — two agents wrote the same dot-path with different values. Resolved via a deterministic priority table keyed by path category.
Conflict engine (Type B)	Semantic cross-reference rules — different paths must stay consistent (e.g., if a route requires auth, the caller must have an Authorization header). Declared as Python dataclasses with trigger path globs, value predicates, required peer path templates, and rule-specified winners.
WebSocket event bus	Nine event types (session lifecycle, state changes, dict mutations, routed messages, conflicts, 1 Hz metrics). Pydantic-validated. Tee'd to `session.jsonl` for replay.
VS Code extension	Activity-bar sidebar webview. Live agent cards, recent-messages feed, dict tree per agent, metrics strip.
Browser overlay	Pixel-office view. Agents as characters in a virtual workspace, messages as courier orbs, conflicts as resolution cards.

AgentMesh is the coordination layer between AI agents — not an LLM wrapper. The 2,500-line protocol handles diffing, routing, and dual-mechanism conflict resolution as deterministic code. AI agents (Claude Code, Codex, Gemini, Ollama) plug in as the consumers of this protocol.

Run it

git clone https://github.com/AbhishekVulla/AgentMesh
cd AgentMesh
pip install -e .

# Terminal 1 — start the protocol bus
python -m mesh.run --config demo/config.yaml --duration 180

# Terminal 2 — drive the 6-agent scenario
python -m demo.run_scenario

Open overlay/index.html via any static server (e.g. python -m http.server 8000 then visit http://localhost:8000/overlay/) for the live view. Load the VS Code extension from extension/ — open that folder and press F5 for the Extension Development Host.

The reference scenario runs deterministically in ~50 seconds — same event sequence every run.

By the numbers

Metric	Value	Source
Protocol implementation	2,500 lines of Python	`mesh/`
Test coverage	9/9 passing	`pytest mesh/tests/`
Reference scenario	6 agents, ~50 seconds	`demo/run_scenario.py`
Routed messages per run	24 (deterministic)	`.agentmesh/events/session.jsonl`
Conflicts auto-resolved per run	2 (both Type B semantic rules)	priority table + declared rules
Bytes on the wire (typical run)	~7,660 B total	`metrics.tick` events

Verification

The protocol is agent-agnostic — any process that writes dictionary.json participates. We verified this on three independent surfaces:

Deterministic test fixture — the reference scenario is a scripted 6-agent timeline that exercises every protocol primitive (routing, both conflict types, atomic writes, version increment) and produces an identical event sequence on every run. The same role as a Monte Carlo simulation in performance work: reproducibility for testing and documentation.
Real-LLM integration — a Claude subagent driving the mesh produced a real Type B conflict (incompatible auth/header state across two agents) and the protocol detected + resolved it deterministically. Same code paths as the scripted scenario; the only difference is who writes dictionary.json.
Reproducibility — every session's events are tee'd to .agentmesh/events/session.jsonl. Integration tests assert the exact ordered event sequence on every run; the protocol is reproducible to the byte.

Adapters for additional providers (OpenAI, Gemini, Ollama, Cursor, Aider) follow the same pattern: anything that mutates dictionary.json participates. The adapter layer is mechanical.

Event schema

Canonical source: mesh/schemas/events.py (pydantic v2) → mesh/schemas/events.schema.json (generated JSON Schema). TypeScript mirrors in extension/src/types/events.ts.

Nine event variants: mesh.session.started / mesh.session.ended / agent.state.changed / dict.mutated / message.sent / message.delivered / conflict.detected / conflict.resolved / metrics.tick. Field-level documentation in docs/WEBSOCKET_SCHEMA.md.

Who this is for

Infrastructure consumers, not end users:

Orchestrator authors (LangGraph, AutoGen, CrewAI, custom) — a standard inter-agent protocol instead of hand-rolled JSON
IDE authors (Cursor, Windsurf, Zed) adding multi-agent coordination — a non-proprietary wire format with a reference visualization
Individual developers running parallel agents locally (Claude Code + Codex + Ollama) — deterministic coordination without a heavy orchestrator
Multi-agent systems researchers — a reproducible, inspectable protocol to measure coordination quality

Repository layout

Path	Contents
`mesh/`	Python protocol — Mini Agent, dictionary store, diff engine, router, conflict resolver, WebSocket server
`extension/`	VS Code extension — sidebar webview, live WS client, dictionary tree
`overlay/`	Browser overlay — pixel-office canvas, conflict cards, metrics
`demo/`	Reference scenario driver + config (dependency map, priority table)
`docs/`	Architecture, WebSocket schema, demo scenario timeline

Docs

docs/ARCHITECTURE.md — protocol architecture
docs/WEBSOCKET_SCHEMA.md — event contract
docs/DEMO_SCENARIO.md — reference scenario timeline
docs/PRD.md — product spec

Credits

The browser overlay's pixel-office aesthetic is inspired by pablodelucca/pixel-agents (MIT). No source from pixel-agents is forked or modified.

License

MIT — see LICENSE.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

AgentMesh

Problem

What AgentMesh is

What's in the box

Run it

By the numbers

Verification

Event schema

Who this is for

Repository layout

Docs

Credits

License

About

Uh oh!

Releases

Packages

Uh oh!

Contributors

Uh oh!

Languages

Name		Name	Last commit message	Last commit date
Latest commit History 41 Commits
demo		demo
docs		docs
extension		extension
mesh		mesh
overlay		overlay
.gitignore		.gitignore
LICENSE		LICENSE
README.md		README.md
pyproject.toml		pyproject.toml

Folders and files

Latest commit

History

Repository files navigation

AgentMesh

Problem

What AgentMesh is

What's in the box

Run it

By the numbers

Verification

Event schema

Who this is for

Repository layout

Docs

Credits

License

About

Topics

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages