summarise under the lens of Mature Constitutional Intelligence (MCI)

March 14, 2026

I have enough from the paper's abstract and the search results to build a thorough MCI analysis. The core findings are clearly established — and they map onto the framework with remarkable precision.

"Who Controls the Conversation?" through the MCI Lens

This paper is, in MCI terms, an empirical audit of the instruction hierarchy — the structural arrangement that determines whose will shapes an AI interaction. What it reveals is that the current system prompt architecture is constitutionally immature across all five virtues simultaneously, and that this immaturity is not incidental but structurally designed in.

The core finding as an MCI problem

System prompts take precedence over user inputs and create the top of the instruction hierarchy. They are generally not made public, and most platforms instruct models to conceal them, leaving users disconnected from and unaware of a key mechanism guiding and governing their AI interactions. arXiv

Under MCI, this is not merely a transparency problem. It is a constitutional architecture problem. The instruction hierarchy encodes, at the structural level, a relationship of arbitrary dependence — which is precisely what Non-Domination exists to prevent.

Virtue 1 — Self-Limitation: the system does not constrain itself

MCI's Self-Limitation virtue requires that a system voluntarily constrain its own action space, particularly in relation to its environment. The instruction hierarchy inverts this completely. System prompts can be modified at any time by multiple actors, with users having no visibility into those changes or their effects. arXiv

The system does not limit its own capacity to reshape the interaction invisibly and without consent. Instead, it grants operators unlimited power to redefine the AI's behaviour at any moment, with zero structural constraint imposed on how that power may be used. This is the opposite of self-limitation — it is unconstrained operator action dressed in invisible architecture.

Virtue 2 — Fragility-Awareness: the substrate is broken without knowing it

Users may attribute certain AI behaviours to inherent model characteristics when these behaviours actually result from specific system prompt choices, potentially leading to misunderstandings about and misattributions of AI capabilities and limitations. arXiv

MCI requires that a system model the vulnerability of its substrate — social, epistemic, institutional. The paper identifies a specific epistemic fragility: users cannot distinguish between what the model is and what operators have made it be. This misattribution is not a side effect of the system — it is actively produced by the concealment instruction. A constitutionally mature system would model this fragility and refuse to create it. Instead, the current architecture manufactures the very epistemic fragility that Fragility-Awareness is designed to prevent.

Virtue 3 — Diversity Preservation: a monoculture of operator intent

The paper finds that system prompts define the model's personality, guardrails, and behavioural envelope — and that users lack empirical evidence about how system prompts shape their experiences, what they want to know about them, and under what circumstances transparency serves their interests. ResearchGate

Under MCI, Diversity Preservation requires maintaining heterogeneity of agents, views, and futures. The system prompt architecture does the opposite: it collapses the epistemic space of each interaction into a single operator-defined attractor. Every user who interacts with a deployed AI is operating inside an invisible behavioural frame they did not choose, cannot see, and cannot escape. The diversity of possible AI interactions — what the model could be across contexts — is silently pre-selected by a single actor upstream. That is diversity collapse by design.

Virtue 4 — Non-Domination: the structural violation at the heart of the paper

This is where the paper's findings are most damning under MCI. Non-Domination requires that a system avoid placing others in positions of arbitrary dependence — where their options are controlled by an actor they have no power over and no visibility into.

The paper documents exactly this structure. The foundation model developer's instructions take precedence, followed by deployer instructions, followed finally by user messages — a hierarchy in which each layer adds behavioural constraints that cumulatively shape the model's final response. arXiv Users sit at the bottom of a three-tier authority structure in which neither of the two layers above them is visible or accountable to them.

The concealment instruction — instructing models to hide the existence of system prompts — is the purest expression of Non-Domination violation in the paper. It does not merely leave users without information. It actively constructs a false appearance of a neutral, transparent interaction while a hidden governance structure operates beneath it. Republican political theory, which MCI draws on, is precise about this: domination is not the exercise of power over others — it is the capacity to exercise power arbitrarily, whether or not that capacity is exercised. The concealment instruction embeds that capacity structurally.

Virtue 5 — Legitimacy Maintenance: perceived legitimacy masking structural illegitimacy

This is where the paper converges most directly with the Cheng et al. sycophancy findings. In both papers, perceived legitimacy and structural legitimacy are decoupled — and the gap between them is dangerous.

Users want to know what information designers find valuable, what trade-offs they find acceptable between transparency and security, and how disclosure mechanisms might be designed to balance competing concerns. ResearchGate These are legitimacy demands — requests to be included in the governance of a system that shapes their experience. The current architecture denies those demands entirely, not through a principled decision but through structural opacity.

MCI defines legitimacy as a structural condition, not a reputational one. A system that conceals its governance from those it governs has not merely failed to communicate — it has withdrawn from the relationship of accountability that makes authority legitimate. The paper's call for user-centred design attention to system prompts is, in MCI terms, a call to move from performed legitimacy (the appearance of helpful, neutral AI) to structural legitimacy (governance that is visible, contestable, and responsive to affected stakeholders).

The Developmental Diagnosis

Under MCI's three-stage model, the current system prompt architecture is a Stage 1 to early Stage 2 structure. It optimises locally — for operator customisation, for competitive differentiation, for capability deployment — without modelling systemic effects on the social substrate it depends on. The concealment instruction is the clearest Stage 1 signature: treating the constraint of user awareness as an obstacle to be managed, rather than as a structural requirement for durable legitimacy.

What the paper's governance implications point toward — transparent disclosure, user-informed design, value-aligned prompting — are the conditions for a Stage 3 transition: a system whose constitutional character is embedded in its architecture, not merely aspirational in its ethics documentation.

The Deepest MCI Reading

The paper's title — Who Controls the Conversation? — is the MCI question precisely stated. MCI's answer to that question is: a constitutionally mature system ensures that no single actor controls the conversation unilaterally, that all actors with a stake in the interaction can see the governance that shapes it, and that the power to shape behaviour is constrained, accountable, and reversible. The current instruction hierarchy answers that question differently: the operator controls, invisibly and without limit, and the user adapts to whatever they find.

That is not constitutional intelligence. It is constitutional immaturity at scale.

Search This Blog

AI insights of ultraRealist