Nexus Agents Documentation

Canonical Documentation Index | Last Updated: 2026-06-17 (the skills under skills/, 46 MCP tools, 12 expert types, 11 workflow templates)

This is the single source of truth for all nexus-agents documentation. All documentation must be indexed here to be considered valid.

Nexus-agents is an autonomic control plane for AI coding agents. This index is organized around that model: docs are grouped by the control-plane role they describe — scheduler, admission control, event log, data plane — and by the MAPE-K self-* loop they implement, rather than as a flat catalog. If you know which part of the control plane you care about, you can find its docs directly. See the root README for the framing and ADR-0017 for the authority model the loops are bounded by.

Quick Start by Role

Role	Start Here	Then Read
New User	Your First Task	Installation
Contributor	Contributing	Development Guide
Operator	ENTRYPOINTS.md	Configuration

AI agents working in this repo (Claude Code, Cursor, etc.) — see CLAUDE.md for project instructions, governance protocols, and canonical paths. CLAUDE.md is the rule book the agents follow, not a user-facing surface.

The Control-Plane Map

The docs below are grouped by where they sit in the control plane. Each classic control-plane role maps to a shipped component (see the root README’s control-plane mapping); the docs that describe each role are gathered under it.

Control-plane role	What it is	Jump to
Entry point	One door in — install, verify, first task, the CLI/MCP/REST surfaces	§ Entry Point
Scheduler	`run` / MetaOrchestrator picks and routes the right strategy	§ Scheduler
Admission control	Gates — adversarial review, consensus, quality gates — decide what ships	§ Admission Control
Event log	Tamper-evident hash-chained audit of every decision	§ Event Log
Data plane	The engineering CLIs that do the file edits, tests, PRs	§ Data Plane
*Self- loops**	MAPE-K self-configuring / healing / optimizing / protecting	§ Self-* Loops
Governance	Authority ladder, claims registry, the rules behind the loops	§ Governance
Build & operate	Contributor guides, ops runbooks, distribution	§ Build & Operate
Reference & history	Interfaces, generated references, research, ADRs, archive	§ Reference & History

Pipeline Terminology

The word “pipeline” gets used for several distinct things. This table disambiguates the namespace — read it once and the rest of the docs are unambiguous:

Term	What it is	Example entry point(s)
Research loop	Deliberately gather and synthesize evidence before deciding. Discovers papers/repos, clusters findings, surfaces them into planning context.	`research_discover`, `research_synthesize`, `research_query`
Consensus loop	A multi-agent vote on a proposal/plan (approve/reject with a quorum + strategy). Used as a gate, not a producer.	`consensus_vote` · `createConsensusEngine`
Dev pipeline	The batteries-included build flow: research → plan → vote → decompose → implement → QA → security. The common path for “build feature X”.	`run_dev_pipeline` · `run_pipeline`
Workflow template	A reusable, declarative (YAML) sequence of stages you can run by name — a saved pipeline shape.	`run_workflow` · `list_workflows` (WORKFLOW_TEMPLATES)
Composition / orchestration	Wiring the orthogonal primitives (spec→graph→execute, `GraphBuilder`, consensus) into a custom pipeline beyond the built-ins.	`orchestrate`, `execute_spec`, `GraphBuilder` (COMPOSITION_PATTERNS)

Mental model: a dev pipeline and workflow templates are pre-built pipelines you run; composition is how you build a new one from the research loop, consensus loop, and graph primitives. When a doc says “the full pipeline” (e.g. in CLAUDE.md’s working mode) it means the agent’s research→vote→plan→implement decision sequence — the same shape as the dev pipeline, applied to its own work.

Documentation Structure

docs/
├── README.md              # THIS FILE - Canonical index (control-plane organized)
├── INDEX.yaml             # Machine-parseable index
│
├── getting-started/       # Entry point: installation and configuration
├── architecture/          # System design — scheduler, gates, event log, data plane
├── development/           # Build & operate: contributor guides
├── governance/            # Authority ladder, loop promotion criteria
├── security/              # Self-protecting: trust tiers, audit threat model
├── research/              # Research loop tracking
├── adr/                   # Architecture Decision Records
├── guides/                # How-to guides
├── interfaces/            # Interface specifications
├── ops/                   # Operational runbooks
├── reference/             # Generated CLI/MCP/strategy references
├── metrics/               # System metrics
└── workflows/             # Workflow documentation

Tier 1: Essential (Always Current)

These documents define the system and must be kept up-to-date:

Document	Location	Description
Project Instructions	CLAUDE.md	Agent behavior, protocols, governance
Your First Task	FIRST_TASK.md	5-minute first task (canonical entry)
README	README.md	Project overview (control-plane framing)
Architecture Overview	architecture/README.md	System design hub
Development Guide	development/README.md	Contributor hub
API Reference	ENTRYPOINTS.md	CLI, MCP, REST, API docs
Troubleshooting	TROUBLESHOOTING.md	Common issues, FAQ

Entry Point — Getting In

One door in: install, verify, run your first task, then learn the full surface (CLI / MCP / REST). This is the run scheduler’s front door.

Document	Description	Status
FIRST_TASK.md	Start here. Install → verify → real vote → editor wiring (~5 min)	Canonical
COMPOSE_YOUR_FIRST_PIPELINE.md	Next step: chain MCP tools (research → vote → build) toward a goal (~15 min)	Canonical
INSTALLATION.md	Platform installation deep-dive	Canonical
CONFIGURATION.md	YAML and env configuration	Canonical
PLUGIN_INSTALL.md	Install nexus-agents as a Claude Code plugin	Canonical
SANDBOXED-USAGE.md	Docker / restricted-FS / team-distribution flows	Canonical
ENTRYPOINTS.md	The full CLI / MCP / REST entry-point reference	Canonical

Scheduler — Routing & Orchestration

The scheduler is run / MetaOrchestrator: one entry point picks (and optionally runs) the right strategy for a goal. These docs describe how strategies are selected, routed across CLIs, and composed into pipelines.

Document	Description	Status
AGENT_SYSTEM.md	Agent framework design	Canonical
ROUTING_SYSTEM.md	Model routing pipeline	Canonical
CONTEXT_LOAD_BALANCING.md	Claude/Gemini/Codex routing	Canonical
ORCHESTRATOR_WORKFLOW_ENGINE.md	Orchestrator vs WorkflowEngine	Canonical
ICTM_PATTERN.md	Dynamic sub-agent creation	Canonical
MULTI_REPO_ORCHESTRATION.md	Cross-repo task coordination	Canonical
RESEARCH_PIPELINE.md	Research-to-project runner	Canonical
MEMORY_SYSTEM.md	7-type memory architecture	Canonical

How-to (composing & routing):

Document	Description
COMPOSITION_PATTERNS.md	Compose spec pipeline + GraphBuilder + consensus into custom pipelines
WORKFLOW_TEMPLATES.md	Creating YAML workflows
RULE_PRECEDENCE.md	Per-adapter rule-loading precedence (Claude/Codex/Gemini/OpenCode)

Admission Control — Gates

Nothing ships without passing a gate. Adversarial PR review, multi-voter consensus, and quality gates decide what is allowed through. These are the loops that sit at advisory/enforce on the authority ladder.

Document	Description	Status
CONSENSUS_PROTOCOLS.md	5 consensus algorithms	Canonical

How-to (running gates):

Document	Description
PR_REVIEW_LOCAL.md	Run pr_review on your machine using subscription CLI auth

Evidence (gate evaluations):

Document	Description	Status
pr-review-experiment-results.md	pr_review #2233 baseline experiment results	Canonical
pr-review-experiment-results-v5.md	pr_review v5 — JSON-native findings; 100% bug-catch + caught a real bug	Canonical
pr-review-eval-labeling-rubric.md	pr_review eval labeling rubric v1 + v5 re-adjudication (#3846)	Canonical
pr-review-dataset-curation.md	pr_review eval dataset curation pipeline + n≥50 assessment (#3847)	Canonical
pr-review-eval-curation.md	pr_review eval candidate-mining curation pipeline (#3847)	Canonical

Event Log — Audit & Observability

Append-only, tamper-evident record of every decision, plus the observability surfaces that monitor the running system (the Monitor arm of MAPE-K).

Document	Description	Status
EVENT_BUS_BOUNDARIES.md	Observability vs messaging bus	Canonical
audit-hash-chain-threat-model	Threat model for the audit hash chain	Canonical

How-to (observability):

Document	Description
DEBUGGING_OBSERVABILITY.md	Debug logging, tracing
Claude Code Observability	Hooks, status line, MCP logging for Claude Code

Observability design:

Document	Description	Status
EXECUTION_DASHBOARD_DESIGN.md	Dashboard design	Canonical
SWARM_OBSERVER_DESIGN.md	OrchestrationObserver design	Canonical

Data Plane — Agents & Execution

The data plane is the engineering CLIs (Claude Code, Codex, Gemini, OpenCode) that do the actual file edits, tests, and PRs — and the harness/federation machinery that wires nexus-agents to them.

Document	Description	Status
MCP_PROTOCOL.md	MCP integration details	Canonical
AGENT_COMPATIBILITY.md	Per-harness federation matrix	Canonical
SWE_BENCH_HARNESS.md	SWE-Bench integration	Canonical

How-to (wiring agents & providers):

Document	Description
MCP_INTEGRATION.md	MCP server configuration
HARNESS_COMPATIBILITY.md	Wire nexus-agents from OpenCode/Codex/Cursor/Aider/Cline
CUSTOM_ENDPOINT_SETUP.md	Custom OpenAI-compatible gateway (direct SDK + OpenCode paths)
CLOUD_PROVIDERS.md	Bedrock/Vertex/Azure via OpenRouter / LiteLLM / custom-gateway
PARALLEL_AGENT_WORKTREES.md	Safe parallel agents via `isolation: "worktree"` + the empirical WorktreeCreate/WorktreeRemove hook contract

Self-* Loops (MAPE-K)

The autonomic loops. Each maps to a shipped mechanism and sits at a declared rung of the authority ladder (ADR-0017): self-configuring (setup/doctor), self-healing (circuit-breaker demotion), self-optimizing (LinUCB/TOPSIS), self-protecting (trust tiers / ClawGuard / sandbox).

Self-configuring & self-optimizing

Document	Description	Status
MEMORY_SYSTEM.md	7-type memory architecture (shared Knowledge)	Canonical
RESEARCH_PIPELINE.md	Research-to-project runner	Canonical

Self-protecting (security)

Document	Description	Status
SECURITY.md	Security model, sandboxing	Canonical
UNTRUSTED_INPUT_HARDENING.md	Input trust & sanitization	Canonical
SOFTWARE_FACTORY_REPORT.md	Factory hardening review	Canonical
security/API_KEY_BOUNDARIES.md	API key boundary documentation	Canonical
SECRETS_SETUP.md	Secrets configuration	Canonical

Note: rows in the self-* loops are not equally autonomous. Where each loop sits on the ladder (observe → suggest → advisory → enforce) and how it earns promotion is governed by ADR-0017 and the loop promotion criteria under § Governance.

Governance & Authority

The rules that bound the loops: the authority ladder, claims-drift detection, and the runbooks that keep capability changes ratified and audited rather than autonomous.

Document	Description	Status
adr/0017-authority-ladder.md	Four-tier earned-autonomy model (observe → suggest → advisory → enforce)	Accepted
governance/loop-promotion-criteria.md	Per-loop authority-ladder promotion/demotion criteria (ADR-0017, #3844)	Canonical
governance/tool-removal-runbook.md	Tool removal/consolidation: ratified, audited, never autonomous (#3853)	Canonical
CLAIMS_REGISTRY.md	Add/verify claims, drift gate	Canonical

Build & Operate

For contributors building on the control plane and operators running it.

Development (contributor guides)

Document	Description	Status
AGENT_DEVELOPMENT.md	Building new agents	Canonical
TOOL_DEVELOPMENT.md	Adding MCP tools	Canonical
MEMORY_DEVELOPMENT.md	Memory system extensions	Canonical
CLI_DELEGATION_GUIDE.md	CLI adapter patterns	Canonical
CONTRIBUTION_GUIDE.md	PR workflow, git conventions	Canonical
SHELL_TESTING_ANTI_PATTERNS.md	Shell testing pitfalls to avoid	Canonical
CLI_UX_QUALITY_GATE.md	CLI UX pre-release checklist	Canonical

Operations & runbooks

Document	Description	Status
docops-spec.md	Canonical DocOps pipeline spec	Canonical
docops-manifest.json	DocOps enforcement manifest	Canonical
docs-inventory.md	Documentation inventory	Canonical
release-changeset-race.md	Publish-race runbook (#2382)	Canonical
git-housekeeping.md	Git GC cleanup runbook (#3062)	Canonical
governed-decision-cost.md	Governed-decision cost (#3857)	Canonical

CI gates & coverage

Document	Description	Status
REGISTRY_COVERAGE.md	Wiring-completeness CI gate	Canonical
SCHEMA_FANOUT_COVERAGE.md	Schema-fan-out CI check	Canonical
IMPORT_GRAPH_ORPHANS.md	Import-graph orphan detection	Canonical

Distribution

Document	Description
distribution/LISTING_SUBMISSIONS.md	MCP registry & directory submission tracker
distribution/PUBLISHING_GUIDE.md	Step-by-step publishing commands

Reference & History

Interface specs, generated references, the research loop’s catalog, ADRs, and historical/archived material.

Interfaces

Document	Description
README.md	Interface documentation hub
agent.md	Agent interface spec
model-adapter.md	Model adapter spec
orchestrator.md	Orchestrator interface spec
tool.md	Tool interface spec
workflow-engine.md	Workflow engine spec

Generated references

Owned by the docs-site single-sourcing epics (#3532/#3688/#3763): these pages are emitted by their generators and live where the generators write them. This IA pass indexes them but does not own their bodies.

Document	Description	Status
capabilities.md	CLI/MCP/Workflow capability index	Generated
MCP tool reference	Per-tool MCP reference (name, description, input schema)	Generated
Strategy reference	Force-strategy escape hatches: entrypoint tool, when-to-force, tiers, executor	Generated
skills-index.md	LLM context loading index	Canonical
dependency-graph.md	Module dependency diagram	Generated
ARCHITECTURE_MAP.json	Machine-readable component map	Generated

System maps & design

Document	Description	Status
system-map.md	System component map	Canonical
invocation-matrix.md	Component invocation map	Canonical
redundancy-analysis.md	Code redundancy analysis	Canonical
deprecation-pipeline.md	Deprecation tracking & v3.0 migration guide	Canonical
components.md	Component inventory	Canonical
interfaces.md	Key interfaces and contracts	Canonical
flows.md	Dataflow traces	Canonical
scaling-coordination-predictor.md	Scaling design	Canonical

Research loop

Document	Description	Status
RESEARCH_INDEX.md	Research tracking hub	Canonical
CONTRIBUTING.md	Adding research	Canonical
registry/papers.yaml	Paper metadata	Canonical
registry/techniques.yaml	Implementation status	Canonical
cli-first-adapter-strategy.md	CLI-first adapter research	Canonical
mcp-tool-distinctness-v1.md	MCP tool-description pairwise similarity report (#2650)	Canonical
fitness-stratified-v1.md	Stratified runtime-outcome report — per adapter / task-type / role (#2662)	Canonical
defending-code-harness-eval.md	Eval of Anthropic defending-code-reference-harness (#3574)	Canonical
fork-session-spike.md	Spike: fork_session / branch-comparison on the graph builder (#2665)	Canonical

Strategy & alignment

Document	Description	Status
ALIGNMENT_ROADMAP.md	Strategic alignment, north star, gaps	Canonical

Maintainer reference (ADRs, V2 architecture, design proposals)

For maintainers — ADRs, V2 architecture artifacts, design proposals (expand if you're editing the codebase)

Architecture Decision Records (ADRs)

ADR	Title	Status
0001	ADR Template	Template
0002	Orchestrator Interface	Accepted
0003	Quorum Validator	Accepted
0004	SharedTaskAnalyzer	Accepted
0005	Router Consolidation	Accepted
0006	Determinism Providers	Accepted
0007	Utility Consolidation	Accepted
0008	Routing Storage	Accepted
0009	Error Class Hierarchy	Accepted
0010	Memory Helpers Separation	Superseded
0011	Orchestrator Interface Defer	Superseded
0012	Registry API Unification	Accepted
0013	Memory Helpers Consolidation	Accepted
0014	Orchestrator Interface Unification	Accepted
0015	Multi-Repo Orchestration	Proposed
0016	Multi-Round Consensus Voting	Accepted
0017	Authority Ladder	Accepted
0018	Org/Scope Naming	Accepted
0019	Governance-Record Signing	Accepted

Design Documents (archived V1)

Document	Description	Status
as-is.md	Current system state assessment	Archived
v2-proposal.md	V2 pipeline OS architecture	Archived
gaps.md	Intended vs actual gaps	Archived

V2 Rearchitecture (Pipeline OS)

Document	Description	Status
04-v2-architecture-pipeline-os.md	Final V2 specification	Approved
00-executive-summary.md	Executive summary	Approved
01-as-is-architecture.md	Current architecture analysis	Approved
02-system-goals-non-goals.md	Goals and non-goals	Approved
03-user-story-user-journey.md	User stories and journeys	Approved
05-plugin-system-spec.md	Plugin system specification	Approved
06-graph-execution-model.md	Graph execution model	Approved
07-policy-governance-gates.md	Policy engine and governance gates	Approved
08-observability-eventing.md	Event bus and observability	Approved
09-context-store-turn-dag-option.md	Context store options	Approved
10-migration-plan-v1-to-v2.md	Migration plan	Approved
api-contracts.md	TypeScript API contracts	Approved
threat-model.md	V2 threat model	Approved
epics-and-issues.md	Work breakdown with acceptance criteria	Approved
ADR-0001	Pipeline OS as orchestration primitive	Proposed
ADR-0002	Unified TaskContract + PlanContract	Proposed
ADR-0003	Extend GraphBuilder (LangGraph-aligned)	Proposed
ADR-0004	Structural plugin flags	Proposed
ADR-0005	Unified adapter boundary	Proposed
ADR-0006	Observability as first-class	Proposed

Proposals (archived)

Document	Description	Status
cli-pr-review-workflow.md	PR review workflow	Archived

Workflows

Document	Description
SELF_DEVELOPMENT_WORKFLOW.md	Historical pointer — engine deleted in #2402, see epic notes

Deprecated / Historical

Documents kept for historical reference only:

Document	Reason	Replacement
archive/system-reviews-2026-01.md	Historical system review transcripts	Current system reviews
archive/REVIEW_2026-01-23.md	Archived system review	Current system reviews
archive/SECURITY_AUDIT_2026-01-23.md	Archived security audit	Current security docs
archive/consensus-vote-2026-01-17.md	Archived consensus vote	Current consensus protocols
archive/system-review-2026-05-31.md	Full 13-domain system review (epic #3143)	ALIGNMENT_ROADMAP.md

Previously Archived/Removed:

PROJECT_PLAN.md, plan docs — archived/removed
RESEARCH_SUMMARY.md — content in research/topics/agent-skills/
SECURITY_AUDIT_2026-01-23.md — moved to archive/
Proposal docs (implemented) — removed after completion

Root-Level Documents

Documents at repository root (for discoverability):

Document	Description	Canonical Location
CLAUDE.md	Agent instructions	Root (canonical)
README.md	Project overview	Root (canonical)
QUICK_START.md	Getting started	Root (canonical)
CONTRIBUTING.md	Contribution guide	Root (canonical)
CODING_STANDARDS.md	Code standards	Root (canonical)
ARCHITECTURE.md	Architecture summary	Root (canonical)
SECURITY.md	Security policy	Root (canonical)
CHANGELOG.md	Version history	Root (canonical)

Machine-Parseable Index

For programmatic access, see INDEX.yaml.

Governance Rules

Canonical Index: This file is the single documentation entry point
Indexing Required: New documentation must be added to this index
No Parallel Indexes: This is the only documentation index permitted
Update on Change: Update this index when any documentation changes
Classification Required: All docs must have a classification (Canonical/Supporting/Deprecated)

Non-conflict with the docs-site epic (#3532)

This IA pass owns the narrative-doc grouping in this index — the control-plane sections above. It does not own the generated API/MCP/strategy reference pages (see § Generated references), which are emitted by the docs-site single-sourcing generators (#3532/#3688/#3763) and live where those generators write them. The two are coordinated: this index links the generated pages by their canonical generator-emitted paths; the generators are free to regenerate those bodies without touching this index’s structure.

Files Not Indexed (Intentionally Excluded)

The following are excluded from this index:

.rules/ - Agent-specific configuration (loaded automatically)
.claude/skills/ - Agent skill definitions (loaded automatically)
coverage/ - Test coverage reports (generated)
node_modules/ - Dependencies
packages/nexus-agents/docs/api/ - Generated TypeDoc output

This index follows the documentation governance defined in CLAUDE.md.