Nexus Agents Documentation

Canonical Documentation Index | Last Updated: 2026-06-17 (the skills under skills/, 46 MCP tools, 12 expert types, 11 workflow templates)

This is the single source of truth for all nexus-agents documentation. All documentation must be indexed here to be considered valid.

Nexus-agents is an autonomic control plane for AI coding agents. This index is organized around that model: docs are grouped by the control-plane role they describe — scheduler, admission control, event log, data plane — and by the MAPE-K self-* loop they implement, rather than as a flat catalog. If you know which part of the control plane you care about, you can find its docs directly. See the root README for the framing and ADR-0017 for the authority model the loops are bounded by.


Quick Start by Role

RoleStart HereThen Read
New UserYour First TaskInstallation
ContributorContributingDevelopment Guide
OperatorENTRYPOINTS.mdConfiguration

AI agents working in this repo (Claude Code, Cursor, etc.) — see CLAUDE.md for project instructions, governance protocols, and canonical paths. CLAUDE.md is the rule book the agents follow, not a user-facing surface.


The Control-Plane Map

The docs below are grouped by where they sit in the control plane. Each classic control-plane role maps to a shipped component (see the root README’s control-plane mapping); the docs that describe each role are gathered under it.

Control-plane roleWhat it isJump to
Entry pointOne door in — install, verify, first task, the CLI/MCP/REST surfaces§ Entry Point
Schedulerrun / MetaOrchestrator picks and routes the right strategy§ Scheduler
Admission controlGates — adversarial review, consensus, quality gates — decide what ships§ Admission Control
Event logTamper-evident hash-chained audit of every decision§ Event Log
Data planeThe engineering CLIs that do the file edits, tests, PRs§ Data Plane
Self-* loopsMAPE-K self-configuring / healing / optimizing / protecting§ Self-* Loops
GovernanceAuthority ladder, claims registry, the rules behind the loops§ Governance
Build & operateContributor guides, ops runbooks, distribution§ Build & Operate
Reference & historyInterfaces, generated references, research, ADRs, archive§ Reference & History

Pipeline Terminology

The word “pipeline” gets used for several distinct things. This table disambiguates the namespace — read it once and the rest of the docs are unambiguous:

TermWhat it isExample entry point(s)
Research loopDeliberately gather and synthesize evidence before deciding. Discovers papers/repos, clusters findings, surfaces them into planning context.research_discover, research_synthesize, research_query
Consensus loopA multi-agent vote on a proposal/plan (approve/reject with a quorum + strategy). Used as a gate, not a producer.consensus_vote · createConsensusEngine
Dev pipelineThe batteries-included build flow: research → plan → vote → decompose → implement → QA → security. The common path for “build feature X”.run_dev_pipeline · run_pipeline
Workflow templateA reusable, declarative (YAML) sequence of stages you can run by name — a saved pipeline shape.run_workflow · list_workflows (WORKFLOW_TEMPLATES)
Composition / orchestrationWiring the orthogonal primitives (spec→graph→execute, GraphBuilder, consensus) into a custom pipeline beyond the built-ins.orchestrate, execute_spec, GraphBuilder (COMPOSITION_PATTERNS)

Mental model: a dev pipeline and workflow templates are pre-built pipelines you run; composition is how you build a new one from the research loop, consensus loop, and graph primitives. When a doc says “the full pipeline” (e.g. in CLAUDE.md’s working mode) it means the agent’s research→vote→plan→implement decision sequence — the same shape as the dev pipeline, applied to its own work.


Documentation Structure

docs/
├── README.md              # THIS FILE - Canonical index (control-plane organized)
├── INDEX.yaml             # Machine-parseable index

├── getting-started/       # Entry point: installation and configuration
├── architecture/          # System design — scheduler, gates, event log, data plane
├── development/           # Build & operate: contributor guides
├── governance/            # Authority ladder, loop promotion criteria
├── security/              # Self-protecting: trust tiers, audit threat model
├── research/              # Research loop tracking
├── adr/                   # Architecture Decision Records
├── guides/                # How-to guides
├── interfaces/            # Interface specifications
├── ops/                   # Operational runbooks
├── reference/             # Generated CLI/MCP/strategy references
├── metrics/               # System metrics
└── workflows/             # Workflow documentation

Tier 1: Essential (Always Current)

These documents define the system and must be kept up-to-date:

DocumentLocationDescription
Project InstructionsCLAUDE.mdAgent behavior, protocols, governance
Your First TaskFIRST_TASK.md5-minute first task (canonical entry)
READMEREADME.mdProject overview (control-plane framing)
Architecture Overviewarchitecture/README.mdSystem design hub
Development Guidedevelopment/README.mdContributor hub
API ReferenceENTRYPOINTS.mdCLI, MCP, REST, API docs
TroubleshootingTROUBLESHOOTING.mdCommon issues, FAQ

Entry Point — Getting In

One door in: install, verify, run your first task, then learn the full surface (CLI / MCP / REST). This is the run scheduler’s front door.

DocumentDescriptionStatus
FIRST_TASK.mdStart here. Install → verify → real vote → editor wiring (~5 min)Canonical
COMPOSE_YOUR_FIRST_PIPELINE.mdNext step: chain MCP tools (research → vote → build) toward a goal (~15 min)Canonical
INSTALLATION.mdPlatform installation deep-diveCanonical
CONFIGURATION.mdYAML and env configurationCanonical
PLUGIN_INSTALL.mdInstall nexus-agents as a Claude Code pluginCanonical
SANDBOXED-USAGE.mdDocker / restricted-FS / team-distribution flowsCanonical
ENTRYPOINTS.mdThe full CLI / MCP / REST entry-point referenceCanonical

Scheduler — Routing & Orchestration

The scheduler is run / MetaOrchestrator: one entry point picks (and optionally runs) the right strategy for a goal. These docs describe how strategies are selected, routed across CLIs, and composed into pipelines.

DocumentDescriptionStatus
AGENT_SYSTEM.mdAgent framework designCanonical
ROUTING_SYSTEM.mdModel routing pipelineCanonical
CONTEXT_LOAD_BALANCING.mdClaude/Gemini/Codex routingCanonical
ORCHESTRATOR_WORKFLOW_ENGINE.mdOrchestrator vs WorkflowEngineCanonical
ICTM_PATTERN.mdDynamic sub-agent creationCanonical
MULTI_REPO_ORCHESTRATION.mdCross-repo task coordinationCanonical
RESEARCH_PIPELINE.mdResearch-to-project runnerCanonical
MEMORY_SYSTEM.md7-type memory architectureCanonical

How-to (composing & routing):

DocumentDescription
COMPOSITION_PATTERNS.mdCompose spec pipeline + GraphBuilder + consensus into custom pipelines
WORKFLOW_TEMPLATES.mdCreating YAML workflows
RULE_PRECEDENCE.mdPer-adapter rule-loading precedence (Claude/Codex/Gemini/OpenCode)

Admission Control — Gates

Nothing ships without passing a gate. Adversarial PR review, multi-voter consensus, and quality gates decide what is allowed through. These are the loops that sit at advisory/enforce on the authority ladder.

DocumentDescriptionStatus
CONSENSUS_PROTOCOLS.md5 consensus algorithmsCanonical

How-to (running gates):

DocumentDescription
PR_REVIEW_LOCAL.mdRun pr_review on your machine using subscription CLI auth

Evidence (gate evaluations):

DocumentDescriptionStatus
pr-review-experiment-results.mdpr_review #2233 baseline experiment resultsCanonical
pr-review-experiment-results-v5.mdpr_review v5 — JSON-native findings; 100% bug-catch + caught a real bugCanonical
pr-review-eval-labeling-rubric.mdpr_review eval labeling rubric v1 + v5 re-adjudication (#3846)Canonical
pr-review-dataset-curation.mdpr_review eval dataset curation pipeline + n≥50 assessment (#3847)Canonical
pr-review-eval-curation.mdpr_review eval candidate-mining curation pipeline (#3847)Canonical

Event Log — Audit & Observability

Append-only, tamper-evident record of every decision, plus the observability surfaces that monitor the running system (the Monitor arm of MAPE-K).

DocumentDescriptionStatus
EVENT_BUS_BOUNDARIES.mdObservability vs messaging busCanonical
audit-hash-chain-threat-modelThreat model for the audit hash chainCanonical

How-to (observability):

DocumentDescription
DEBUGGING_OBSERVABILITY.mdDebug logging, tracing
Claude Code ObservabilityHooks, status line, MCP logging for Claude Code

Observability design:

DocumentDescriptionStatus
EXECUTION_DASHBOARD_DESIGN.mdDashboard designCanonical
SWARM_OBSERVER_DESIGN.mdOrchestrationObserver designCanonical

Data Plane — Agents & Execution

The data plane is the engineering CLIs (Claude Code, Codex, Gemini, OpenCode) that do the actual file edits, tests, and PRs — and the harness/federation machinery that wires nexus-agents to them.

DocumentDescriptionStatus
MCP_PROTOCOL.mdMCP integration detailsCanonical
AGENT_COMPATIBILITY.mdPer-harness federation matrixCanonical
SWE_BENCH_HARNESS.mdSWE-Bench integrationCanonical

How-to (wiring agents & providers):

DocumentDescription
MCP_INTEGRATION.mdMCP server configuration
HARNESS_COMPATIBILITY.mdWire nexus-agents from OpenCode/Codex/Cursor/Aider/Cline
CUSTOM_ENDPOINT_SETUP.mdCustom OpenAI-compatible gateway (direct SDK + OpenCode paths)
CLOUD_PROVIDERS.mdBedrock/Vertex/Azure via OpenRouter / LiteLLM / custom-gateway
PARALLEL_AGENT_WORKTREES.mdSafe parallel agents via isolation: "worktree" + the empirical WorktreeCreate/WorktreeRemove hook contract

Self-* Loops (MAPE-K)

The autonomic loops. Each maps to a shipped mechanism and sits at a declared rung of the authority ladder (ADR-0017): self-configuring (setup/doctor), self-healing (circuit-breaker demotion), self-optimizing (LinUCB/TOPSIS), self-protecting (trust tiers / ClawGuard / sandbox).

Self-configuring & self-optimizing

DocumentDescriptionStatus
MEMORY_SYSTEM.md7-type memory architecture (shared Knowledge)Canonical
RESEARCH_PIPELINE.mdResearch-to-project runnerCanonical

Self-protecting (security)

DocumentDescriptionStatus
SECURITY.mdSecurity model, sandboxingCanonical
UNTRUSTED_INPUT_HARDENING.mdInput trust & sanitizationCanonical
SOFTWARE_FACTORY_REPORT.mdFactory hardening reviewCanonical
security/API_KEY_BOUNDARIES.mdAPI key boundary documentationCanonical
SECRETS_SETUP.mdSecrets configurationCanonical

Note: rows in the self-* loops are not equally autonomous. Where each loop sits on the ladder (observe → suggest → advisory → enforce) and how it earns promotion is governed by ADR-0017 and the loop promotion criteria under § Governance.


Governance & Authority

The rules that bound the loops: the authority ladder, claims-drift detection, and the runbooks that keep capability changes ratified and audited rather than autonomous.

DocumentDescriptionStatus
adr/0017-authority-ladder.mdFour-tier earned-autonomy model (observe → suggest → advisory → enforce)Accepted
governance/loop-promotion-criteria.mdPer-loop authority-ladder promotion/demotion criteria (ADR-0017, #3844)Canonical
governance/tool-removal-runbook.mdTool removal/consolidation: ratified, audited, never autonomous (#3853)Canonical
CLAIMS_REGISTRY.mdAdd/verify claims, drift gateCanonical

Build & Operate

For contributors building on the control plane and operators running it.

Development (contributor guides)

DocumentDescriptionStatus
AGENT_DEVELOPMENT.mdBuilding new agentsCanonical
TOOL_DEVELOPMENT.mdAdding MCP toolsCanonical
MEMORY_DEVELOPMENT.mdMemory system extensionsCanonical
CLI_DELEGATION_GUIDE.mdCLI adapter patternsCanonical
CONTRIBUTION_GUIDE.mdPR workflow, git conventionsCanonical
SHELL_TESTING_ANTI_PATTERNS.mdShell testing pitfalls to avoidCanonical
CLI_UX_QUALITY_GATE.mdCLI UX pre-release checklistCanonical

Operations & runbooks

DocumentDescriptionStatus
docops-spec.mdCanonical DocOps pipeline specCanonical
docops-manifest.jsonDocOps enforcement manifestCanonical
docs-inventory.mdDocumentation inventoryCanonical
release-changeset-race.mdPublish-race runbook (#2382)Canonical
git-housekeeping.mdGit GC cleanup runbook (#3062)Canonical
governed-decision-cost.mdGoverned-decision cost (#3857)Canonical

CI gates & coverage

DocumentDescriptionStatus
REGISTRY_COVERAGE.mdWiring-completeness CI gateCanonical
SCHEMA_FANOUT_COVERAGE.mdSchema-fan-out CI checkCanonical
IMPORT_GRAPH_ORPHANS.mdImport-graph orphan detectionCanonical

Distribution

DocumentDescription
distribution/LISTING_SUBMISSIONS.mdMCP registry & directory submission tracker
distribution/PUBLISHING_GUIDE.mdStep-by-step publishing commands

Reference & History

Interface specs, generated references, the research loop’s catalog, ADRs, and historical/archived material.

Interfaces

DocumentDescription
README.mdInterface documentation hub
agent.mdAgent interface spec
model-adapter.mdModel adapter spec
orchestrator.mdOrchestrator interface spec
tool.mdTool interface spec
workflow-engine.mdWorkflow engine spec

Generated references

Owned by the docs-site single-sourcing epics (#3532/#3688/#3763): these pages are emitted by their generators and live where the generators write them. This IA pass indexes them but does not own their bodies.

DocumentDescriptionStatus
capabilities.mdCLI/MCP/Workflow capability indexGenerated
MCP tool referencePer-tool MCP reference (name, description, input schema)Generated
Strategy referenceForce-strategy escape hatches: entrypoint tool, when-to-force, tiers, executorGenerated
skills-index.mdLLM context loading indexCanonical
dependency-graph.mdModule dependency diagramGenerated
ARCHITECTURE_MAP.jsonMachine-readable component mapGenerated

System maps & design

DocumentDescriptionStatus
system-map.mdSystem component mapCanonical
invocation-matrix.mdComponent invocation mapCanonical
redundancy-analysis.mdCode redundancy analysisCanonical
deprecation-pipeline.mdDeprecation tracking & v3.0 migration guideCanonical
components.mdComponent inventoryCanonical
interfaces.mdKey interfaces and contractsCanonical
flows.mdDataflow tracesCanonical
scaling-coordination-predictor.mdScaling designCanonical

Research loop

DocumentDescriptionStatus
RESEARCH_INDEX.mdResearch tracking hubCanonical
CONTRIBUTING.mdAdding researchCanonical
registry/papers.yamlPaper metadataCanonical
registry/techniques.yamlImplementation statusCanonical
cli-first-adapter-strategy.mdCLI-first adapter researchCanonical
mcp-tool-distinctness-v1.mdMCP tool-description pairwise similarity report (#2650)Canonical
fitness-stratified-v1.mdStratified runtime-outcome report — per adapter / task-type / role (#2662)Canonical
defending-code-harness-eval.mdEval of Anthropic defending-code-reference-harness (#3574)Canonical
fork-session-spike.mdSpike: fork_session / branch-comparison on the graph builder (#2665)Canonical

Strategy & alignment

DocumentDescriptionStatus
ALIGNMENT_ROADMAP.mdStrategic alignment, north star, gapsCanonical

Maintainer reference (ADRs, V2 architecture, design proposals)

For maintainers — ADRs, V2 architecture artifacts, design proposals (expand if you're editing the codebase)

Architecture Decision Records (ADRs)

ADRTitleStatus
0001ADR TemplateTemplate
0002Orchestrator InterfaceAccepted
0003Quorum ValidatorAccepted
0004SharedTaskAnalyzerAccepted
0005Router ConsolidationAccepted
0006Determinism ProvidersAccepted
0007Utility ConsolidationAccepted
0008Routing StorageAccepted
0009Error Class HierarchyAccepted
0010Memory Helpers SeparationSuperseded
0011Orchestrator Interface DeferSuperseded
0012Registry API UnificationAccepted
0013Memory Helpers ConsolidationAccepted
0014Orchestrator Interface UnificationAccepted
0015Multi-Repo OrchestrationProposed
0016Multi-Round Consensus VotingAccepted
0017Authority LadderAccepted
0018Org/Scope NamingAccepted
0019Governance-Record SigningAccepted

Design Documents (archived V1)

DocumentDescriptionStatus
as-is.mdCurrent system state assessmentArchived
v2-proposal.mdV2 pipeline OS architectureArchived
gaps.mdIntended vs actual gapsArchived

V2 Rearchitecture (Pipeline OS)

DocumentDescriptionStatus
04-v2-architecture-pipeline-os.mdFinal V2 specificationApproved
00-executive-summary.mdExecutive summaryApproved
01-as-is-architecture.mdCurrent architecture analysisApproved
02-system-goals-non-goals.mdGoals and non-goalsApproved
03-user-story-user-journey.mdUser stories and journeysApproved
05-plugin-system-spec.mdPlugin system specificationApproved
06-graph-execution-model.mdGraph execution modelApproved
07-policy-governance-gates.mdPolicy engine and governance gatesApproved
08-observability-eventing.mdEvent bus and observabilityApproved
09-context-store-turn-dag-option.mdContext store optionsApproved
10-migration-plan-v1-to-v2.mdMigration planApproved
api-contracts.mdTypeScript API contractsApproved
threat-model.mdV2 threat modelApproved
epics-and-issues.mdWork breakdown with acceptance criteriaApproved
ADR-0001Pipeline OS as orchestration primitiveProposed
ADR-0002Unified TaskContract + PlanContractProposed
ADR-0003Extend GraphBuilder (LangGraph-aligned)Proposed
ADR-0004Structural plugin flagsProposed
ADR-0005Unified adapter boundaryProposed
ADR-0006Observability as first-classProposed

Proposals (archived)

DocumentDescriptionStatus
cli-pr-review-workflow.mdPR review workflowArchived

Workflows

DocumentDescription
SELF_DEVELOPMENT_WORKFLOW.mdHistorical pointer — engine deleted in #2402, see epic notes

Deprecated / Historical

Documents kept for historical reference only:

DocumentReasonReplacement
archive/system-reviews-2026-01.mdHistorical system review transcriptsCurrent system reviews
archive/REVIEW_2026-01-23.mdArchived system reviewCurrent system reviews
archive/SECURITY_AUDIT_2026-01-23.mdArchived security auditCurrent security docs
archive/consensus-vote-2026-01-17.mdArchived consensus voteCurrent consensus protocols
archive/system-review-2026-05-31.mdFull 13-domain system review (epic #3143)ALIGNMENT_ROADMAP.md

Previously Archived/Removed:

  • PROJECT_PLAN.md, plan docs — archived/removed
  • RESEARCH_SUMMARY.md — content in research/topics/agent-skills/
  • SECURITY_AUDIT_2026-01-23.md — moved to archive/
  • Proposal docs (implemented) — removed after completion

Root-Level Documents

Documents at repository root (for discoverability):

DocumentDescriptionCanonical Location
CLAUDE.mdAgent instructionsRoot (canonical)
README.mdProject overviewRoot (canonical)
QUICK_START.mdGetting startedRoot (canonical)
CONTRIBUTING.mdContribution guideRoot (canonical)
CODING_STANDARDS.mdCode standardsRoot (canonical)
ARCHITECTURE.mdArchitecture summaryRoot (canonical)
SECURITY.mdSecurity policyRoot (canonical)
CHANGELOG.mdVersion historyRoot (canonical)

Machine-Parseable Index

For programmatic access, see INDEX.yaml.


Governance Rules

  1. Canonical Index: This file is the single documentation entry point
  2. Indexing Required: New documentation must be added to this index
  3. No Parallel Indexes: This is the only documentation index permitted
  4. Update on Change: Update this index when any documentation changes
  5. Classification Required: All docs must have a classification (Canonical/Supporting/Deprecated)

Non-conflict with the docs-site epic (#3532)

This IA pass owns the narrative-doc grouping in this index — the control-plane sections above. It does not own the generated API/MCP/strategy reference pages (see § Generated references), which are emitted by the docs-site single-sourcing generators (#3532/#3688/#3763) and live where those generators write them. The two are coordinated: this index links the generated pages by their canonical generator-emitted paths; the generators are free to regenerate those bodies without touching this index’s structure.


Files Not Indexed (Intentionally Excluded)

The following are excluded from this index:

  • .rules/ - Agent-specific configuration (loaded automatically)
  • .claude/skills/ - Agent skill definitions (loaded automatically)
  • coverage/ - Test coverage reports (generated)
  • node_modules/ - Dependencies
  • packages/nexus-agents/docs/api/ - Generated TypeDoc output

This index follows the documentation governance defined in CLAUDE.md.