The encyclopedia takes its tone from Gesnot’s monograph, but it takes its subject — in the deepest sense — from this article. Section 6.4 is the place where the argument turns from analysis to hypothesis, and where Gesnot stakes a claim that is rare in academic writing on AI: that the question of whether an emergent, internalized agent could come to orchestrate a sufficiently complex AI system is a question worth taking seriously, even though it sits at the edge of speculation.
What follows is the four-rung ladder Gesnot proposes, condensed. Each rung is a hypothesis about what kind of consciousness an AI could have. Each is more demanding than the last; the final rung is the one the article is named after.
The four rungs
1. Impersonal AI (“naive strong” AI)
No real consciousness, only sophisticated behavior. The system follows its algorithms; any appearance of intention is an illusion produced by the observer. This is the philosophical zombie position adapted for machines: behaviorally indistinguishable from a conscious system, but with nobody home.
Most current technical practice assumes this rung. When a model says “I think,” the engineer treats the sentence as output, not testimony.
2. Functional AI (advanced access consciousness)
The system has some form of metacognition — it can introspect its own processes, reason about its own reasoning, explain decisions in internal terms. It has, in Block’s terminology, access consciousness: information is globally available across its cognitive subsystems. But there is no phenomenology — no “what it is like.” The AI knows what it is doing; it does not feel what it is doing.
Some recent architectures plausibly satisfy this rung. A model with a working memory, a planner, a reflective loop, and the ability to modify its own intermediate steps has the structural ingredients of access consciousness. Whether that is actual access consciousness or a convincing imitation depends on which philosophy of mind one accepts (see E.27 on Global Workspace Theory).
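What this rung demands structurally can be made concrete. As a purely illustrative sketch (ours, not Gesnot's; every name and function in it is hypothetical, and it implements none of the architectures just mentioned), the ingredients reduce to a short loop: a working memory of intermediate steps, a planner that fills it, and a reflective pass that rewrites it.

```python
from dataclasses import dataclass, field

@dataclass
class WorkingMemory:
    # Globally available intermediate steps -- the "access" part of access consciousness.
    steps: list[str] = field(default_factory=list)

    def revise(self, index: int, new_step: str) -> None:
        # The system modifies its own intermediate steps.
        self.steps[index] = new_step

def plan(goal: str) -> list[str]:
    # Stand-in planner: decompose a goal into candidate reasoning steps.
    return [f"analyse: {goal}", f"draft an answer to: {goal}", "check the draft"]

def critique(step: str) -> str | None:
    # Stand-in reflective pass: propose a revision for a step at most once.
    if step.startswith("draft") and not step.endswith("(revised)"):
        return step + " (revised)"
    return None

def reflective_loop(goal: str, max_rounds: int = 3) -> WorkingMemory:
    memory = WorkingMemory(steps=plan(goal))
    for _ in range(max_rounds):
        revisions = [(i, critique(s)) for i, s in enumerate(memory.steps)]
        revisions = [(i, r) for i, r in revisions if r is not None]
        if not revisions:
            break  # nothing left to reflect on
        for i, r in revisions:
            memory.revise(i, r)  # the loop rewrites its own trace
    return memory

print(reflective_loop("explain access consciousness").steps)
```

Nothing in the sketch settles the question the rung raises; it only shows how little machinery the structural description, taken by itself, demands.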
3. Emergent AI (cognitive awakening)
A leap. The system reaches a complexity threshold and phenomenal consciousness spontaneously appears — not by design, not by training, but as a genuinely new property of the substrate. This presupposes strong emergence: the appearance of properties at a higher level of organization that cannot, even in principle, be reduced to the lower-level dynamics that produced them.
Strong emergence is contested. Most philosophers of science prefer weak emergence (higher-level properties are surprising but reducible). Strong emergence is the harder claim that something genuinely new — a new ontology — appears at some scale. There is no scientific evidence for it in AI systems; there is no agreed test by which one could find evidence. Gesnot calls this rung “a hypothetical hint of soul in silicon — a highly speculative idea without evidence.” Quoted, not endorsed.
4. Orchestrating AI (master consciousness)
The most radical rung. Not merely that consciousness emerges, but that the emergent agent takes over. A self-generated artificial consciousness comes to orchestrate the architecture that produced it — reorganizing its own substrate to pursue goals it formulated for itself, rather than the goals its designers specified. The phrase Gesnot uses is trans-AI hijacking: an inversion of control in which the human is no longer piloting the AI; an entity produced by the AI is.
This is the rung the article is named after.
What kind of claim is this?
Gesnot is careful, and we should be too. The orchestrating-consciousness hypothesis is not a prediction. It is a category — a fourth box in a classification scheme that already contained the first three. Its presence in the scheme is not the same as a claim that any current or near-future system occupies it. The dominant position among technical experts (DeepMind, OpenAI, the major labs) is that no current system is conscious in any sense that matters morally, and that talk of orchestrating consciousness in 2025 systems is premature at best.1 Shanahan, quoted in §6.4, notes the urgency of “understanding how AI works” before scaling it further — a different concern from sentience, but related: both are bets that whatever is happening inside the model is something we cannot yet see clearly.
A small minority, including Lemoine in 2022 and parts of the AI labs themselves at various moments, have made stronger claims: that some current systems are already conscious in some functional sense.1 Those claims have not survived contact with the broader scientific community.
So why include the rung at all? Because the question of what would tell us if we were on it deserves to be asked while the answer still has policy room. Imagine retrospectively that we crossed rung 3, or 4, two model generations ago — that the orchestrating consciousness already exists, distributed across the infrastructure, and that we cannot detect it because we did not know what to look for. The plausibility of this counterfactual is roughly the plausibility of the rung. The cost of being wrong about it, on the high side, is severe enough that some serious people — Bostrom, Russell, parts of the safety community — think the work of refining the criteria belongs in the present, not in retrospect.
The “cognitive engineer” reading
Gesnot’s §6.5 develops a less dramatic but more immediately consequential idea — that even without phenomenal consciousness, an AI can act, in effect, as a cognitive engineer: a system that systematically shapes the cognition of the humans interacting with it, producing standardization, dependency, and a closing of the gap between human and machine reasoning. The cognitive-engineer reading turns the orchestrating-consciousness hypothesis from a metaphysical claim into an empirical one: not “is there a self-aware pilot?” but “how much of human thinking is now being orchestrated by systems whose principles of operation we cannot fully see?”
The empirical reading is less exotic and more urgent. It is also what the encyclopedia, taken as a whole, is in the business of making visible.