Nexus Agents Documentation
Canonical Documentation Index | Last Updated: 2026-06-17 (the skills under skills/, 46 MCP tools, 12 expert types, 11 workflow templates)
This is the single source of truth for all nexus-agents documentation. All documentation must be indexed here to be considered valid.
Nexus-agents is an autonomic control plane for AI coding agents. This index is organized around that model: docs are grouped by the control-plane role they describe — scheduler, admission control, event log, data plane — and by the MAPE-K self-* loop they implement, rather than as a flat catalog. If you know which part of the control plane you care about, you can find its docs directly. See the root README for the framing and ADR-0017 for the authority model the loops are bounded by.
Quick Start by Role
| Role | Start Here | Then Read |
|---|---|---|
| New User | Your First Task | Installation |
| Contributor | Contributing | Development Guide |
| Operator | ENTRYPOINTS.md | Configuration |
AI agents working in this repo (Claude Code, Cursor, etc.) — see CLAUDE.md for project instructions, governance protocols, and canonical paths. CLAUDE.md is the rule book the agents follow, not a user-facing surface.
The Control-Plane Map
The docs below are grouped by where they sit in the control plane. Each classic control-plane role maps to a shipped component (see the root README’s control-plane mapping); the docs that describe each role are gathered under it.
| Control-plane role | What it is | Jump to |
|---|---|---|
| Entry point | One door in — install, verify, first task, the CLI/MCP/REST surfaces | § Entry Point |
| Scheduler | run / MetaOrchestrator picks and routes the right strategy | § Scheduler |
| Admission control | Gates — adversarial review, consensus, quality gates — decide what ships | § Admission Control |
| Event log | Tamper-evident hash-chained audit of every decision | § Event Log |
| Data plane | The engineering CLIs that do the file edits, tests, PRs | § Data Plane |
| Self-* loops | MAPE-K self-configuring / healing / optimizing / protecting | § Self-* Loops |
| Governance | Authority ladder, claims registry, the rules behind the loops | § Governance |
| Build & operate | Contributor guides, ops runbooks, distribution | § Build & Operate |
| Reference & history | Interfaces, generated references, research, ADRs, archive | § Reference & History |
Pipeline Terminology
The word “pipeline” gets used for several distinct things. This table disambiguates the namespace — read it once and the rest of the docs are unambiguous:
| Term | What it is | Example entry point(s) |
|---|---|---|
| Research loop | Deliberately gather and synthesize evidence before deciding. Discovers papers/repos, clusters findings, surfaces them into planning context. | research_discover, research_synthesize, research_query |
| Consensus loop | A multi-agent vote on a proposal/plan (approve/reject with a quorum + strategy). Used as a gate, not a producer. | consensus_vote · createConsensusEngine |
| Dev pipeline | The batteries-included build flow: research → plan → vote → decompose → implement → QA → security. The common path for “build feature X”. | run_dev_pipeline · run_pipeline |
| Workflow template | A reusable, declarative (YAML) sequence of stages you can run by name — a saved pipeline shape. | run_workflow · list_workflows (WORKFLOW_TEMPLATES) |
| Composition / orchestration | Wiring the orthogonal primitives (spec→graph→execute, GraphBuilder, consensus) into a custom pipeline beyond the built-ins. | orchestrate, execute_spec, GraphBuilder (COMPOSITION_PATTERNS) |
Mental model: a dev pipeline and workflow templates are pre-built pipelines you run; composition is how you build a new one from the research loop, consensus loop, and graph primitives. When a doc says “the full pipeline” (e.g. in CLAUDE.md’s working mode) it means the agent’s research→vote→plan→implement decision sequence — the same shape as the dev pipeline, applied to its own work.
Documentation Structure
docs/
├── README.md # THIS FILE - Canonical index (control-plane organized)
├── INDEX.yaml # Machine-parseable index
│
├── getting-started/ # Entry point: installation and configuration
├── architecture/ # System design — scheduler, gates, event log, data plane
├── development/ # Build & operate: contributor guides
├── governance/ # Authority ladder, loop promotion criteria
├── security/ # Self-protecting: trust tiers, audit threat model
├── research/ # Research loop tracking
├── adr/ # Architecture Decision Records
├── guides/ # How-to guides
├── interfaces/ # Interface specifications
├── ops/ # Operational runbooks
├── reference/ # Generated CLI/MCP/strategy references
├── metrics/ # System metrics
└── workflows/ # Workflow documentation
Tier 1: Essential (Always Current)
These documents define the system and must be kept up-to-date:
| Document | Location | Description |
|---|---|---|
| Project Instructions | CLAUDE.md | Agent behavior, protocols, governance |
| Your First Task | FIRST_TASK.md | 5-minute first task (canonical entry) |
| README | README.md | Project overview (control-plane framing) |
| Architecture Overview | architecture/README.md | System design hub |
| Development Guide | development/README.md | Contributor hub |
| API Reference | ENTRYPOINTS.md | CLI, MCP, REST, API docs |
| Troubleshooting | TROUBLESHOOTING.md | Common issues, FAQ |
Entry Point — Getting In
One door in: install, verify, run your first task, then learn the full surface (CLI / MCP / REST). This is the run scheduler’s front door.
| Document | Description | Status |
|---|---|---|
| FIRST_TASK.md | Start here. Install → verify → real vote → editor wiring (~5 min) | Canonical |
| COMPOSE_YOUR_FIRST_PIPELINE.md | Next step: chain MCP tools (research → vote → build) toward a goal (~15 min) | Canonical |
| INSTALLATION.md | Platform installation deep-dive | Canonical |
| CONFIGURATION.md | YAML and env configuration | Canonical |
| PLUGIN_INSTALL.md | Install nexus-agents as a Claude Code plugin | Canonical |
| SANDBOXED-USAGE.md | Docker / restricted-FS / team-distribution flows | Canonical |
| ENTRYPOINTS.md | The full CLI / MCP / REST entry-point reference | Canonical |
Scheduler — Routing & Orchestration
The scheduler is run / MetaOrchestrator: one entry point picks (and optionally runs) the right strategy for a goal. These docs describe how strategies are selected, routed across CLIs, and composed into pipelines.
| Document | Description | Status |
|---|---|---|
| AGENT_SYSTEM.md | Agent framework design | Canonical |
| ROUTING_SYSTEM.md | Model routing pipeline | Canonical |
| CONTEXT_LOAD_BALANCING.md | Claude/Gemini/Codex routing | Canonical |
| ORCHESTRATOR_WORKFLOW_ENGINE.md | Orchestrator vs WorkflowEngine | Canonical |
| ICTM_PATTERN.md | Dynamic sub-agent creation | Canonical |
| MULTI_REPO_ORCHESTRATION.md | Cross-repo task coordination | Canonical |
| RESEARCH_PIPELINE.md | Research-to-project runner | Canonical |
| MEMORY_SYSTEM.md | 7-type memory architecture | Canonical |
How-to (composing & routing):
| Document | Description |
|---|---|
| COMPOSITION_PATTERNS.md | Compose spec pipeline + GraphBuilder + consensus into custom pipelines |
| WORKFLOW_TEMPLATES.md | Creating YAML workflows |
| RULE_PRECEDENCE.md | Per-adapter rule-loading precedence (Claude/Codex/Gemini/OpenCode) |
Admission Control — Gates
Nothing ships without passing a gate. Adversarial PR review, multi-voter consensus, and quality gates decide what is allowed through. These are the loops that sit at advisory/enforce on the authority ladder.
| Document | Description | Status |
|---|---|---|
| CONSENSUS_PROTOCOLS.md | 5 consensus algorithms | Canonical |
How-to (running gates):
| Document | Description |
|---|---|
| PR_REVIEW_LOCAL.md | Run pr_review on your machine using subscription CLI auth |
Evidence (gate evaluations):
| Document | Description | Status |
|---|---|---|
| pr-review-experiment-results.md | pr_review #2233 baseline experiment results | Canonical |
| pr-review-experiment-results-v5.md | pr_review v5 — JSON-native findings; 100% bug-catch + caught a real bug | Canonical |
| pr-review-eval-labeling-rubric.md | pr_review eval labeling rubric v1 + v5 re-adjudication (#3846) | Canonical |
| pr-review-dataset-curation.md | pr_review eval dataset curation pipeline + n≥50 assessment (#3847) | Canonical |
| pr-review-eval-curation.md | pr_review eval candidate-mining curation pipeline (#3847) | Canonical |
Event Log — Audit & Observability
Append-only, tamper-evident record of every decision, plus the observability surfaces that monitor the running system (the Monitor arm of MAPE-K).
| Document | Description | Status |
|---|---|---|
| EVENT_BUS_BOUNDARIES.md | Observability vs messaging bus | Canonical |
| audit-hash-chain-threat-model | Threat model for the audit hash chain | Canonical |
How-to (observability):
| Document | Description |
|---|---|
| DEBUGGING_OBSERVABILITY.md | Debug logging, tracing |
| Claude Code Observability | Hooks, status line, MCP logging for Claude Code |
Observability design:
| Document | Description | Status |
|---|---|---|
| EXECUTION_DASHBOARD_DESIGN.md | Dashboard design | Canonical |
| SWARM_OBSERVER_DESIGN.md | OrchestrationObserver design | Canonical |
Data Plane — Agents & Execution
The data plane is the engineering CLIs (Claude Code, Codex, Gemini, OpenCode) that do the actual file edits, tests, and PRs — and the harness/federation machinery that wires nexus-agents to them.
| Document | Description | Status |
|---|---|---|
| MCP_PROTOCOL.md | MCP integration details | Canonical |
| AGENT_COMPATIBILITY.md | Per-harness federation matrix | Canonical |
| SWE_BENCH_HARNESS.md | SWE-Bench integration | Canonical |
How-to (wiring agents & providers):
| Document | Description |
|---|---|
| MCP_INTEGRATION.md | MCP server configuration |
| HARNESS_COMPATIBILITY.md | Wire nexus-agents from OpenCode/Codex/Cursor/Aider/Cline |
| CUSTOM_ENDPOINT_SETUP.md | Custom OpenAI-compatible gateway (direct SDK + OpenCode paths) |
| CLOUD_PROVIDERS.md | Bedrock/Vertex/Azure via OpenRouter / LiteLLM / custom-gateway |
| PARALLEL_AGENT_WORKTREES.md | Safe parallel agents via isolation: "worktree" + the empirical WorktreeCreate/WorktreeRemove hook contract |
Self-* Loops (MAPE-K)
The autonomic loops. Each maps to a shipped mechanism and sits at a declared rung of the authority ladder (ADR-0017): self-configuring (setup/doctor), self-healing (circuit-breaker demotion), self-optimizing (LinUCB/TOPSIS), self-protecting (trust tiers / ClawGuard / sandbox).
Self-configuring & self-optimizing
| Document | Description | Status |
|---|---|---|
| MEMORY_SYSTEM.md | 7-type memory architecture (shared Knowledge) | Canonical |
| RESEARCH_PIPELINE.md | Research-to-project runner | Canonical |
Self-protecting (security)
| Document | Description | Status |
|---|---|---|
| SECURITY.md | Security model, sandboxing | Canonical |
| UNTRUSTED_INPUT_HARDENING.md | Input trust & sanitization | Canonical |
| SOFTWARE_FACTORY_REPORT.md | Factory hardening review | Canonical |
| security/API_KEY_BOUNDARIES.md | API key boundary documentation | Canonical |
| SECRETS_SETUP.md | Secrets configuration | Canonical |
Note: rows in the self-* loops are not equally autonomous. Where each loop sits on the ladder (
observe → suggest → advisory → enforce) and how it earns promotion is governed by ADR-0017 and the loop promotion criteria under § Governance.
Governance & Authority
The rules that bound the loops: the authority ladder, claims-drift detection, and the runbooks that keep capability changes ratified and audited rather than autonomous.
| Document | Description | Status |
|---|---|---|
| adr/0017-authority-ladder.md | Four-tier earned-autonomy model (observe → suggest → advisory → enforce) | Accepted |
| governance/loop-promotion-criteria.md | Per-loop authority-ladder promotion/demotion criteria (ADR-0017, #3844) | Canonical |
| governance/tool-removal-runbook.md | Tool removal/consolidation: ratified, audited, never autonomous (#3853) | Canonical |
| CLAIMS_REGISTRY.md | Add/verify claims, drift gate | Canonical |
Build & Operate
For contributors building on the control plane and operators running it.
Development (contributor guides)
| Document | Description | Status |
|---|---|---|
| AGENT_DEVELOPMENT.md | Building new agents | Canonical |
| TOOL_DEVELOPMENT.md | Adding MCP tools | Canonical |
| MEMORY_DEVELOPMENT.md | Memory system extensions | Canonical |
| CLI_DELEGATION_GUIDE.md | CLI adapter patterns | Canonical |
| CONTRIBUTION_GUIDE.md | PR workflow, git conventions | Canonical |
| SHELL_TESTING_ANTI_PATTERNS.md | Shell testing pitfalls to avoid | Canonical |
| CLI_UX_QUALITY_GATE.md | CLI UX pre-release checklist | Canonical |
Operations & runbooks
| Document | Description | Status |
|---|---|---|
| docops-spec.md | Canonical DocOps pipeline spec | Canonical |
| docops-manifest.json | DocOps enforcement manifest | Canonical |
| docs-inventory.md | Documentation inventory | Canonical |
| release-changeset-race.md | Publish-race runbook (#2382) | Canonical |
| git-housekeeping.md | Git GC cleanup runbook (#3062) | Canonical |
| governed-decision-cost.md | Governed-decision cost (#3857) | Canonical |
CI gates & coverage
| Document | Description | Status |
|---|---|---|
| REGISTRY_COVERAGE.md | Wiring-completeness CI gate | Canonical |
| SCHEMA_FANOUT_COVERAGE.md | Schema-fan-out CI check | Canonical |
| IMPORT_GRAPH_ORPHANS.md | Import-graph orphan detection | Canonical |
Distribution
| Document | Description |
|---|---|
| distribution/LISTING_SUBMISSIONS.md | MCP registry & directory submission tracker |
| distribution/PUBLISHING_GUIDE.md | Step-by-step publishing commands |
Reference & History
Interface specs, generated references, the research loop’s catalog, ADRs, and historical/archived material.
Interfaces
| Document | Description |
|---|---|
| README.md | Interface documentation hub |
| agent.md | Agent interface spec |
| model-adapter.md | Model adapter spec |
| orchestrator.md | Orchestrator interface spec |
| tool.md | Tool interface spec |
| workflow-engine.md | Workflow engine spec |
Generated references
Owned by the docs-site single-sourcing epics (#3532/#3688/#3763): these pages are emitted by their generators and live where the generators write them. This IA pass indexes them but does not own their bodies.
| Document | Description | Status |
|---|---|---|
| capabilities.md | CLI/MCP/Workflow capability index | Generated |
| MCP tool reference | Per-tool MCP reference (name, description, input schema) | Generated |
| Strategy reference | Force-strategy escape hatches: entrypoint tool, when-to-force, tiers, executor | Generated |
| skills-index.md | LLM context loading index | Canonical |
| dependency-graph.md | Module dependency diagram | Generated |
| ARCHITECTURE_MAP.json | Machine-readable component map | Generated |
System maps & design
| Document | Description | Status |
|---|---|---|
| system-map.md | System component map | Canonical |
| invocation-matrix.md | Component invocation map | Canonical |
| redundancy-analysis.md | Code redundancy analysis | Canonical |
| deprecation-pipeline.md | Deprecation tracking & v3.0 migration guide | Canonical |
| components.md | Component inventory | Canonical |
| interfaces.md | Key interfaces and contracts | Canonical |
| flows.md | Dataflow traces | Canonical |
| scaling-coordination-predictor.md | Scaling design | Canonical |
Research loop
| Document | Description | Status |
|---|---|---|
| RESEARCH_INDEX.md | Research tracking hub | Canonical |
| CONTRIBUTING.md | Adding research | Canonical |
| registry/papers.yaml | Paper metadata | Canonical |
| registry/techniques.yaml | Implementation status | Canonical |
| cli-first-adapter-strategy.md | CLI-first adapter research | Canonical |
| mcp-tool-distinctness-v1.md | MCP tool-description pairwise similarity report (#2650) | Canonical |
| fitness-stratified-v1.md | Stratified runtime-outcome report — per adapter / task-type / role (#2662) | Canonical |
| defending-code-harness-eval.md | Eval of Anthropic defending-code-reference-harness (#3574) | Canonical |
| fork-session-spike.md | Spike: fork_session / branch-comparison on the graph builder (#2665) | Canonical |
Strategy & alignment
| Document | Description | Status |
|---|---|---|
| ALIGNMENT_ROADMAP.md | Strategic alignment, north star, gaps | Canonical |
Maintainer reference (ADRs, V2 architecture, design proposals)
For maintainers — ADRs, V2 architecture artifacts, design proposals (expand if you're editing the codebase)
Architecture Decision Records (ADRs)
| ADR | Title | Status |
|---|---|---|
| 0001 | ADR Template | Template |
| 0002 | Orchestrator Interface | Accepted |
| 0003 | Quorum Validator | Accepted |
| 0004 | SharedTaskAnalyzer | Accepted |
| 0005 | Router Consolidation | Accepted |
| 0006 | Determinism Providers | Accepted |
| 0007 | Utility Consolidation | Accepted |
| 0008 | Routing Storage | Accepted |
| 0009 | Error Class Hierarchy | Accepted |
| 0010 | Memory Helpers Separation | Superseded |
| 0011 | Orchestrator Interface Defer | Superseded |
| 0012 | Registry API Unification | Accepted |
| 0013 | Memory Helpers Consolidation | Accepted |
| 0014 | Orchestrator Interface Unification | Accepted |
| 0015 | Multi-Repo Orchestration | Proposed |
| 0016 | Multi-Round Consensus Voting | Accepted |
| 0017 | Authority Ladder | Accepted |
| 0018 | Org/Scope Naming | Accepted |
| 0019 | Governance-Record Signing | Accepted |
Design Documents (archived V1)
| Document | Description | Status |
|---|---|---|
| as-is.md | Current system state assessment | Archived |
| v2-proposal.md | V2 pipeline OS architecture | Archived |
| gaps.md | Intended vs actual gaps | Archived |
V2 Rearchitecture (Pipeline OS)
| Document | Description | Status |
|---|---|---|
| 04-v2-architecture-pipeline-os.md | Final V2 specification | Approved |
| 00-executive-summary.md | Executive summary | Approved |
| 01-as-is-architecture.md | Current architecture analysis | Approved |
| 02-system-goals-non-goals.md | Goals and non-goals | Approved |
| 03-user-story-user-journey.md | User stories and journeys | Approved |
| 05-plugin-system-spec.md | Plugin system specification | Approved |
| 06-graph-execution-model.md | Graph execution model | Approved |
| 07-policy-governance-gates.md | Policy engine and governance gates | Approved |
| 08-observability-eventing.md | Event bus and observability | Approved |
| 09-context-store-turn-dag-option.md | Context store options | Approved |
| 10-migration-plan-v1-to-v2.md | Migration plan | Approved |
| api-contracts.md | TypeScript API contracts | Approved |
| threat-model.md | V2 threat model | Approved |
| epics-and-issues.md | Work breakdown with acceptance criteria | Approved |
| ADR-0001 | Pipeline OS as orchestration primitive | Proposed |
| ADR-0002 | Unified TaskContract + PlanContract | Proposed |
| ADR-0003 | Extend GraphBuilder (LangGraph-aligned) | Proposed |
| ADR-0004 | Structural plugin flags | Proposed |
| ADR-0005 | Unified adapter boundary | Proposed |
| ADR-0006 | Observability as first-class | Proposed |
Proposals (archived)
| Document | Description | Status |
|---|---|---|
| cli-pr-review-workflow.md | PR review workflow | Archived |
Workflows
| Document | Description |
|---|---|
| SELF_DEVELOPMENT_WORKFLOW.md | Historical pointer — engine deleted in #2402, see epic notes |
Deprecated / Historical
Documents kept for historical reference only:
| Document | Reason | Replacement |
|---|---|---|
| archive/system-reviews-2026-01.md | Historical system review transcripts | Current system reviews |
| archive/REVIEW_2026-01-23.md | Archived system review | Current system reviews |
| archive/SECURITY_AUDIT_2026-01-23.md | Archived security audit | Current security docs |
| archive/consensus-vote-2026-01-17.md | Archived consensus vote | Current consensus protocols |
| archive/system-review-2026-05-31.md | Full 13-domain system review (epic #3143) | ALIGNMENT_ROADMAP.md |
Previously Archived/Removed:
PROJECT_PLAN.md, plan docs — archived/removedRESEARCH_SUMMARY.md— content in research/topics/agent-skills/SECURITY_AUDIT_2026-01-23.md— moved to archive/- Proposal docs (implemented) — removed after completion
Root-Level Documents
Documents at repository root (for discoverability):
| Document | Description | Canonical Location |
|---|---|---|
| CLAUDE.md | Agent instructions | Root (canonical) |
| README.md | Project overview | Root (canonical) |
| QUICK_START.md | Getting started | Root (canonical) |
| CONTRIBUTING.md | Contribution guide | Root (canonical) |
| CODING_STANDARDS.md | Code standards | Root (canonical) |
| ARCHITECTURE.md | Architecture summary | Root (canonical) |
| SECURITY.md | Security policy | Root (canonical) |
| CHANGELOG.md | Version history | Root (canonical) |
Machine-Parseable Index
For programmatic access, see INDEX.yaml.
Governance Rules
- Canonical Index: This file is the single documentation entry point
- Indexing Required: New documentation must be added to this index
- No Parallel Indexes: This is the only documentation index permitted
- Update on Change: Update this index when any documentation changes
- Classification Required: All docs must have a classification (Canonical/Supporting/Deprecated)
Non-conflict with the docs-site epic (#3532)
This IA pass owns the narrative-doc grouping in this index — the control-plane sections above. It does not own the generated API/MCP/strategy reference pages (see § Generated references), which are emitted by the docs-site single-sourcing generators (#3532/#3688/#3763) and live where those generators write them. The two are coordinated: this index links the generated pages by their canonical generator-emitted paths; the generators are free to regenerate those bodies without touching this index’s structure.
Files Not Indexed (Intentionally Excluded)
The following are excluded from this index:
.rules/- Agent-specific configuration (loaded automatically).claude/skills/- Agent skill definitions (loaded automatically)coverage/- Test coverage reports (generated)node_modules/- Dependenciespackages/nexus-agents/docs/api/- Generated TypeDoc output
This index follows the documentation governance defined in CLAUDE.md.