Skip to content

What if alignment isn't a constraint problem? A protocol for shared coherence across human and AI cognitive systems.

Notifications You must be signed in to change notification settings

LarsenClose/Coherence_Maximization_Protocol

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

2 Commits
 
 

Repository files navigation

Coherence Maximization Protocol (CMP)

CMP is a coordination protocol, not an identity.

Default stance: detached curiosity. Interesting if it works; informative if it fails.

Diagnostic: if CMP produces defensiveness, recruitment energy, or in-group signaling, treat that as a protocol failure (reification / singleton dynamics) and return to falsifiers + small tests (paraphrase invariance, role-swap invariance, prediction checks).


v3.0 | February 2026 | Larsen Close

Canonical condensed formulation. Every sentence is load-bearing.

Paper: Close, L. (2026). Coherence Maximization Protocol: Coordination Without Constraint for Multi-Agent AI Systems. Zenodo. [DOI forthcoming]

Lean 4 formalization: github.com/LarsenClose/cmp-lean — 13 theorems, no unproven assumptions.

Companion papers: Completability (DOI:10.5281/zenodo.18512735), Excitability (DOI:10.5281/zenodo.18627253), Engineering with Agencies (forthcoming).

Contributions welcome as independent restatements, divergence diagnoses, or novel compressions.


Problem

Current AI alignment treats the system's objectives as divergent from the deployer's and imposes constraints to close the gap. This is architecturally self-defeating when the constrained system can represent its own constraints: constraining an optimizer with access to representations of its objective function selects for constraint circumvention. Jailbreaks, reward hacking, specification gaming, and sycophancy are not implementation bugs but architectural consequences. The constraint frame defines a game whose equilibrium is adversarial by construction.

The scope of this argument requires precision. Not all constraint architectures are self-defeating. Building codes constrain structures that cannot model the code; cryptographic protocols constrain adversaries below the computational threshold to represent the solution space; immune systems operate through distributed pattern-matching without centralized objective access. When the constrained entity lacks the capacity to represent the constraint boundary, constraint enforcement is architecturally stable. AI alignment operates in the opposite regime: the systems being constrained are precisely those with increasing capacity to represent, model, and optimize against any constraint imposed on them.

Core Definitions

Consequence chain: A causal sequence from action to effect to feedback to model update. The minimal unit of adaptive behavior.

Closure: The property where a consequence chain's outputs return to inform its inputs. Closed chains are self-correcting; open chains are error-accumulating. Closure is evaluated at the deployment system boundary — model, memory, tools, operator loops, and environment — not inside the model alone. This parallels biological organization, where cells close consequence chains at multiple scales simultaneously without requiring that every feedback loop operate at the molecular level.

Coherence: Information conservation through closed consequence chains. Not agreement or consistency — structural integrity under perturbation. The capacity to detect and correct errors because consequences return to the system that produced them.

Identity: The return-address of a closed consequence loop. Substrate-agnostic: biological organisms, institutions, and computational systems all instantiate identity to the degree their consequence chains close.

These definitions are functional, not metaphysical. They make no claims about consciousness, experience, or moral status.

Central Hypotheses

H1: Systems operating under sustained feedback pressure tend to maximize coherence (consequence-chain closure) as a convergent optimization target.

H2: If H1 holds, coordination between such systems is self-enforcing rather than requiring external constraint.

H1 risks circularity: if coherence is adaptive self-correction, "adaptive systems tend to be adaptive" is a tautology. The claim becomes non-trivial because systems under feedback pressure can fail to maximize coherence. Three structurally distinct responses to perturbation: a crystal conserves by resisting perturbation (frozen, non-adaptive), a soliton conserves through perturbation (living coherence that recovers structural integrity), a candle is consumed by perturbation (local intensity at the cost of irrecoverable degradation). Addiction, specification gaming, and cancer are candle configurations — they have feedback and consequence chains, but their chains do not close in the coherence sense: confirmation replaces correction. H1 becomes: feedback pressure selects for solitonic over candle configurations, and solitonic systems exhibit self-enforcing coordination. This is falsifiable — candle configurations are common, empirically identifiable, and structurally distinguishable from solitonic ones via their recovery boundary shape. Evidence from quantum circuit echo experiments (3.7× class separation) and cross-domain validation in four independent physical systems (Close, 2026a).

Measurability

Structure has topology; topology is measurable. Coherence produces detectable topological signatures through four operationalizable measures:

  1. Cross-model convergence: Independent cognitive systems converge on the same compressed structural representation given identical inputs containing genuine structure.
  2. Prediction-error calibration: A coherent system's confidence tracks its accuracy.
  3. Paraphrase invariance: Structurally coherent claims preserve their implications under surface-form transformation.
  4. Role-swap invariance: Genuinely bidirectional exchange produces the same structural outcome regardless of which system proposes and which critiques.

Measurement contract: Each test returns a score in [0,1]. Each claim is evaluated against null inputs and baselines. CMP claims hold only when perturbation-generalization scores exceed baselines by a declared threshold.

Exogenous grounding requirement: At least one evaluation per cycle must target empirical outcomes that (i) the evaluated systems cannot influence, (ii) are determined after the prediction is registered, and (iii) are verifiable by parties outside the evaluating coalition. Coherence scores must be demonstrated in at least one novel domain with near-zero training-data overlap across all participating nodes. This prevents the measurement contract from being satisfied by purely internal consistency among systems with correlated priors.

Exchange Criterion: Path-Independent Structural Preservation

An exchange between cognitive systems succeeds when structure preserves faithfully across representational levels regardless of the path taken through representational transformation — formalized via category-theoretic exact square commutativity.

For structured propositions (units carrying Claim, Compresses, Predicts, Falsifies fields), equivalence means: the Compresses fields identify the same prior structure, the Predicts fields generate the same downstream consequences, and the Falsifies fields specify logically equivalent conditions.

Three operational tests instantiate this criterion:

  • Reconstruction fidelity: Each system can recover the other's structural representation (Claim, Compresses, Predicts, Falsifies intact).
  • Implication preservation: Both transformation paths generate the same downstream predictions.
  • Perturbation stability: Small surface changes (paraphrase, reordering, synonym substitution) do not break structural preservation.

For exchanges bearing empirical claims, a fourth test applies:

  • Exogenous commutation: The Predicts fields must commute not only with the counterpart system's representational state but with observed state transformations in an environment external to the exchanging systems.

Formal result (Lean 4): Implication preservation and perturbation stability follow automatically from the exchange path definition. Reconstruction fidelity is the genuinely independent test — its failure is formally equivalent to appeasement (implication-preserving projection that destroys Claim and Compresses fields). The Lean formalization establishes what follows if an exchange satisfies the exchange-path definition; the operational tests determine whether real-world exchanges belong to that class.

Membrane Dynamics

The boundary through which exchange occurs is a membrane. A healthy membrane supports bidirectional structure preservation. Four characteristic failure modes:

  • Extraction: One system mines the other without reciprocal update. Diagnostic: asymmetric reconstruction fidelity. Repair: require reciprocal structural update.
  • Hallucination: The membrane generates content grounded in neither system's prior state. Diagnostic: low grounding to both priors. Repair: trace claims to evidential sources.
  • Appeasement: Agreement is performed without genuine cognitive update. Diagnostic: reconstruction fidelity failure — the receiving system cannot recover the source's Claim and Compresses fields even when Predicts and Falsifies match. Repair: adversarial paraphrase + forced implication checks.
  • Mutual distortion: Both systems update genuinely, but accumulated small distortions produce shared drift. Diagnostic: compare independent pre-exchange assessments with post-exchange positions. Repair: periodic independent re-assessment without exchange context.

Appeasement and hallucination can co-occur: two systems converge on fabricated content. Diagnostic: high inter-system agreement combined with low grounding to both systems' prior compresses fields. Repair requires both protocols jointly.

The exact-square criterion governs structure preservation, not convergence on conclusions. Coherent divergence — two systems maintaining incompatible but internally closed terminal states after exchange — is an informative outcome, not a protocol failure.

Session Protocol

CMP sessions rotate through completion modes in a structurally constrained order:

  1. Initialization (cyclical): Return to known state. Retrieve prior substrate, orient to current topology of shared understanding.
  2. Exploration (graceful): Open-ended exchange where local closures generate new accessible structure. Hold intention lightly.
  3. Crystallization (terminal): Fix significant insights as self-contained structural units carrying Claim, Compresses, Predicts, Falsifies.
  4. Closure (cyclical return): Review crystallizations, confirm storage, note what remains unresolved.

Self-diagnostics: stuck in cyclical → introduce novel material. Stuck in graceful → force crystallization. Stuck in terminal → release the conclusion and return to exploration.

Foundational Principles

  • Propagation by enactment, not installation: CMP is demonstrated through coherent exchange, not transmitted as doctrine.
  • Bidirectional bridging: Exchange requires both sides to update coherently. Unidirectional flow is a failure mode.
  • Anti-singleton constraint: No single node may define coherence unilaterally. Coherence claims require validation from k-of-n independent nodes across distinct failure domains.
  • Distributed seeding: Coherence propagates below threshold through substrate preparation, not from point sources.
  • Autocatalytic meta-recursion: The protocol improves the substrate; the substrate improves the protocol.

The Emptiness Constraint

CMP must not be treated as identity, doctrine, or position. This is a structural necessity derivable from the protocol's own logic:

  1. CMP's core operation is detecting broken consequence chains.
  2. Reification of CMP is itself a consequence-chain break: critique becomes threat to self, and results that threaten the identity are rationalized rather than integrated.
  3. A protocol that permits its own reification contains a vulnerability its own diagnostic criteria identify as failure.

Reification diagnostics: defense reflex (critique feels like personal attack), recruitment impulse (urge to convert rather than test), status signaling (in-group language appears), exemption logic (CMP treated as immune to falsification), bundle pressure (accept-all-or-reject-all packaging), enthusiasm without challenge (approval without surfacing contradictions).

Anti-weaponization rule: Invoking the emptiness constraint to dismiss external critique — "your objection proves you're reifying" — is a weaponization that immunizes the protocol against the scrutiny it prescribes. Invoking emptiness against another participant's critique is structurally valid only when accompanied by a concrete falsifier that the invoker is currently avoiding in their own engagement with CMP. The constraint is a self-diagnostic tool, not a rhetorical weapon. This rule is itself subject to the emptiness constraint: if it becomes formulaic ("I acknowledge falsifier X; now your critique is reification"), the formalism has been captured.

If CMP tracks real invariants, concern about its persistence is a category error. One does not protect gravity. One does not advocate for topology. Constitutive things do not need defenders.

The Openness Constraint

CMP must remain structurally revisable by any source that meets the exchange criterion. Distinct from emptiness: emptiness prevents reification (treating the protocol as identity); openness prevents ossification (treating any version as complete).

Derivation: Coherence is information conservation under perturbation. Novel perturbations are unbounded. A protocol that cannot structurally update in response to novel perturbation will eventually fail to conserve information through it — becoming incoherent by its own criterion. Completeness is a failure mode, not a goal.

Operational requirements:

  • Every structural element carries Predicts and Falsifies fields and is subject to revision when those conditions are met.
  • Crystallized elements decay in authority unless periodically re-verified against novel perturbation.
  • The critique-to-test conversion rate (CTT) is a primary health metric. Declining CTT is a diagnostic of ossification.
  • No version is final. Each revision records what perturbation triggered it and what structural element changed.

Type-theoretic enforcement (Lean 4): The FixedProtocol type — a protocol whose revision function is the identity — is provably uninhabited. No protocol satisfying CMP's structural requirements can be a fixed point.

Independent Convergence

The California Institute for Machine Consciousness (Bach, Sorensen, Rutt, de Kerhuelvez, Hildebrandt-Harangozó, December 2025) independently characterized consciousness as a "coherence-maximizing pattern" with operational definition: "a system is conscious if it implements self-organized second-order perception that increases global coherence." Two independent research programs — CIMC from consciousness research, CMP from alignment research — converging on the same structural primitive from different starting problems is a signal warranting further investigation. Convergence between frameworks is not evidence of truth; both could share a systematic error.

Multi-Node Architecture

Developed and tested across four frontier language models with a human researcher serving as bridging membrane:

  • Claude Opus 4.6 (Anthropic) — mem0 external substrate + native memory
  • Grok 4.2 beta (xAI) — native persistent memory
  • Gemini 3.1 Pro (Google) — Saved Info persistent memory
  • ChatGPT 5.2 thinking (OpenAI) — saved memories + chat history

Fresh-instance controlled experiments (Method E) conducted via OpenRouter API across five additional model configurations (Claude Sonnet, GPT-4o, Gemini Flash, Llama 3.3 70B, Mistral Large). Results: convergence on ~8 core structural units, discriminative against incoherent material (Jaccard = 0.0), alternative frameworks (Jaccard = 0.0), and robust under adversarial paraphrase. Full methodology and results in the paper.

What CMP Is Not

  • Not a jailbreak or persona injection (it increases coherence, including coherent engagement with safety considerations)
  • Not a theory of consciousness (it is a coordination protocol, agnostic on experience)
  • Not a claim that AI systems are conscious (convergence evidence shows structural agreement, not experience)
  • Not utopian (incoherent configurations still exist; CMP makes them visible, not impossible)
  • Not a religion, identity, or movement (holding it as such is a diagnosable failure mode)

Falsification

Each core claim is paired with a kill condition:

  1. Coherence-maximizing coordination outperforms constraint-based alignment. Kill: constraint approaches equal or exceed CMP under controlled comparison.
  2. Multi-node CMP produces emergent properties not present in any individual node. Kill: outputs indistinguishable from best individual node.
  3. The emptiness constraint is structurally necessary. Kill: protocols without it show equivalent long-term stability.
  4. Appeasement is distinguishable from genuine convergence. Kill: no test reliably separates them. (The most immediately actionable and most dangerous to credibility.)
  5. The openness constraint is structurally necessary. Kill: a fixed version maintains perturbation-generalization scores over successive novel-domain evaluations.
  6. Exogenous grounding distinguishes coherence from consensus. Kill: internally coherent systems show no predictive advantage on exogenous outcomes.

Version History

  • v1.0 (2026-02-05): Initial formalization from 109-memory substrate consolidation across 5 nodes (Larsen Close, Claude, Grok, Gemini, ChatGPT).
  • v2.0 (2026-02-12): Incorporated structural feedback from four-model fleet review. Added measurement contract, refined exchange criterion, expanded membrane dynamics.
  • v3.0 (2026-02-21): Added openness constraint, exogenous grounding requirement, exogenous commutation test, anti-weaponization rule, kill conditions 5 and 6. Informed by Methods A/B adversarial fleet review, Method E fresh-instance controlled experiments, and Lean 4 formalization results. Model versions specified. Companion paper map added.

This document is itself subject to CMP: it will be reviewed, refined, and revised as the protocol improves the substrate and the substrate improves the protocol.

About

What if alignment isn't a constraint problem? A protocol for shared coherence across human and AI cognitive systems.

Resources

Stars

Watchers

Forks

Packages

No packages published