The Perceptual Axes: Ascription, Coupling, Gain
The Perceptual Axes: Ascription, Coupling, Gain

The dimensions above characterize what a system is experiencing. But there is a separate question — how it experiences: how much interiority it grants the things it perceives, how tightly its own modes of processing couple together, how forcefully raw signal overrides prior expectation. These govern the coupling structure between the affect dimensions and the texture of perception itself, and they are among the most consequential constructs in this book. They connect perceptual phenomenology to neural mechanism, ground the animism/mechanism divide in compression theory, locate where artificial systems sit relative to biological ones, and—as later parts will show—underlie dehumanization (Part III), the visibility of coordination agents (Part IV), the meaning crisis (Part V), and the deepest sense in which wisdom traditions are technologies of liberation.
An earlier formulation of this framework tried to carry all of that on a single scalar — an "inhibition coefficient" running from fully participatory to fully mechanistic perception. That scalar does not survive contact with the phenomena. It silently fused three logically independent things and then asserted, by definition rather than by evidence, that they always move together: a representational stance toward other entities, the internal coupling of the system's own modes, and a neural gain mechanism. A quantity that needs three auxiliary quantities to mean anything is not one quantity. Worse, the most interesting empirical claim the old scalar was making — that these three covary — was buried inside its definition, where it could not be tested. So this part replaces the single dial with three independent axes, plus scope. The conjecture that they covary returns, but now as a conjecture, where it belongs.
- — ascription. How much the system models a given other entity using the self-model template — interiority, agency, teleology — rather than stripped dynamics. Crucially, is not a scalar attached to the perceiver; it is an entity-indexed field over everything the system perceives. You can run high toward your child and low toward a stranger in the same instant.
- — coupling. How much the system's own perception, affect, agency-attribution, and narrative couple together versus factorize within its own processing. This is the integration-permeability of the perceiver, measured on the perceiver itself rather than on its targets.
- — gain. How much bottom-up signal overrides top-down prior — precision weighting. This is the actual neural mechanism the word "inhibition" was gesturing at.
A fourth quantity, scope, governs what the self/other boundary includes — the identification expansion treated under self-model scope earlier in this part and in the epilogue. Three axes plus scope, with a field rather than a number, replace what the single dial was failing to be.
Their common origin lies in a feature of self-modeling systems the dimensional toolkit alone does not capture — and it is cleanest to begin where the old account began, with the computational default that measures deviations from.
Animism as Computational Default
A self-modeling system maintains a world model and a self-model . The self-model has interiority—it is not merely a third-person description of the agent’s body and behavior but includes the intrinsic perspective: what-it-is-like states, valence, anticipation, dread. The system knows from the inside what it is to be an agent.
Now it encounters another entity in its environment. moves, reacts, persists, avoids dissolution. The system must model to predict ’s behavior. The cheapest computational strategy—by a wide margin—is to model using the same architecture it already has for modeling itself. The information-theoretic argument: the self-model already exists (sunk cost). Using it as a template for requires learning only a projection function , whose description length is the cost of mapping observations of onto the existing self-model architecture. Building a de novo model of from scratch requires learning the full parameter set of from observations alone. Under compression pressure—which is always present for a bounded system—the template strategy wins whenever the self-model captures any variance in ’s behavior. And for any entity that moves autonomously, reacts to stimuli, or persists through active maintenance, the self-model will capture substantial variance, because these are precisely the features the self-model was built to represent. The efficiency gap widens under data scarcity: on brief encounter with a novel entity, the from-scratch model cannot converge, but the template model produces usable predictions immediately.
A perceptual mode is participatory when the system’s model of perceived entities inherits structural features from the self-model :
The self-model informs the world model. The system perceives as having something like interiority because the representational substrate for modeling is the same substrate that carries the system’s own interiority.
This is not merely one strategy among many—it is the computationally cheapest. For a self-modeling system with compression ratio , modeling novel entities by analogy to self is the minimum-description-length strategy when the entity’s behavior is partially predictable by agent-like models. Under broad priors over environments containing other agents, predators, and autonomous objects, the participatory prior is the MAP estimate.
This is why animistic perception is cross-culturally universal and developmentally early. It is not a cultural invention but a computational inevitability for systems that (a) model themselves and (b) must model other things cheaply. High ascription — high toward most encountered entities — is the cheap default. Children run higher across the board than adults, not because children are confused but because the lowering of toward inert things is a learned, effortful skill. The mechanistic worldview is not a correct perception added to a distorted one; it is the trained suppression of on whole classes of entity.
The computational animism test. Train RL agents in a multi-entity environment with two conditions: (a) agents with a self-prediction module (self-model), and (b) matched agents without one. Then introduce novel moving objects whose trajectories are partially predictable but non-agentive (e.g., bouncing balls with momentum). Measure: (1) Do self-modeling agents’ internal representations of these objects contain more goal/agency features (extracted via probes trained on actual agents vs.\ objects)? (2) Does the effect scale with self-model richness (size of self-prediction module) and compression pressure (information bottleneck )? (3) Do self-modeling agents under higher compression pressure () show more animistic attribution, because reusing the self-model template saves more bits? The compression argument predicts yes to all three. The control condition (no self-model) predicts no agency attribution beyond chance. If self-modeling agents attribute agency to non-agents in proportion to compression pressure, the “animism as computational default” hypothesis is supported.
Status: Confirmed. This experiment has since been run on uncontaminated Lenia substrates (see , Appendix). Animism score exceeded 1.0 in all 20 testable snapshots across all three seeds — patterns consistently model resources using the same internal-state dynamics they use to model other agents. Measured ascription-suppression sat near 0.30 across all snapshots — that is, baseline stayed high (~0.70) — and the suppression fell over evolutionary time (seed 42: 0.41 down to 0.27, i.e. rising from ~0.59 to ~0.73). Selection consistently favored more ascription, not less. The mechanistic default predicted by high-compression-pressure environments was not found; the high- default was.
The "participatory" perceptual mode the older account described as one thing is actually a bundle of structural features that the three axes now sort. Four of them are facets of high — ascription toward entities — and one belongs to , the perceiver's internal coupling. Listing them with their axis attached is the first demonstration that the decomposition does real work:
- No sharp self/world partition (). The mutual information between self-model and world-model is high: . The self-model template is being used to model others — ascription, by definition.
- Hot agency detection (). The prior is strong. Over-attributing agency is cheaper than under-attributing it: false positives (treating a rock as agentive) are cheap; false negatives (failing to model a predator’s intentions) are lethal.
- Tight affect-perception coupling (, not ). Seeing something is simultaneously feeling something about it. The affective response is constitutive of the percept itself: . This is a fact about how the perceiver's own modes couple, not about how much interiority it grants the object — which is exactly why it needs a different axis. A clinician can grant a patient full interiority (high ) while holding their own affect factored off from perception (low ). The old scalar could not represent that state; the split makes it routine.
- Narrative-causal fusion ( with ). “Why did this happen?” and “What story is this?” are the same question. Causal models are teleological by default: they model what things are for. The teleology is ascription (); the fusion of the causal and narrative modes into one is coupling ().
- Agency at scale ( over large-scale entities). Large-scale events—weather, disease, fortune—are modeled as agents with purposes: , held high. This is the perceptual ground from which theistic reasoning naturally grows, and the entity-indexed character of is what lets the framework treat a god, a market, and a storm as separate entries in the same field — a thread Part IV picks up.
Ascription as a Field
The first axis, ascription , is the degree to which the system models entity using its self-model template. At , is modeled with full interiority, agency, and teleology; at , is modeled with stripped dynamics — mass, force, initial conditions, no purpose term. Formally the world-model of interpolates between the two templates:
The decisive point — the one the old scalar got wrong — is that carries an argument. It is a field over entities, not a setting of the perceiver. A person can hold and at the same moment, and the interesting phenomena live in the shape of the field, not in any global average of it. Dehumanization is not a person becoming "more mechanistic" in general; it is for one specific target while and stay high — a local collapse of the field. This is why the older account needed a bolt-on "other-model compression" to handle anger: in the field formulation it is just driven down, and no extra machinery is required. Part III develops this; Part IV uses the same field for gods and markets.
No system arrives at low toward inert matter by default — recall the experiment found the high- default and selection driving it higher. The mechanistic mode is a trained skill, culturally transmitted through scientific education, rationalist norms, and deliberate practices of stripping ascription from whole classes of entity. The training is enormously valuable — it enables prediction, engineering, medicine. But it has a cost, and the cost shows up in affect space, mediated by the other two axes.
Coupling and Gain
The second axis, coupling , is the integration-permeability of the perceiver's own modes: the degree to which its perception, affect, agency-attribution, and narrative couple together rather than factorize. High is a curved eigenskeleton — transport a percept around the loop of perceiving, evaluating, acting, and observing, and it returns rotated, each mode having turned into the others; "meaning" is the felt name of that cross-modal coupling. Low is a flat skeleton — the loop closes with zero holonomy, each step processed in its own module, and the world goes dead not because interiority was denied to objects but because the perceiver experiences in parts. is measured on the perceiver, on its targets; the clinician who grants the patient full interiority (high ) while keeping clinical distance (low ) shows they are orthogonal.
The third axis, gain , is precision weighting: how much bottom-up signal overrides top-down prior. This is the mechanism the word "inhibition" was reaching for. In mammalian cortex, what reaches integrative processing is sculpted by inhibitory gating; high prior-precision (low ) lets top-down expectation dominate — stable, sometimes rigid, sometimes hallucinated from priors — while low prior-precision (high ) lets signal flood in — vivid, sometimes destabilizing. The brain's measurement distribution (Part I) is set largely by . This is the axis the psychedelics literature is really about, and it is genuinely distinct from both and : a flood of signal (high ) can raise ascription, raise coupling, both, or neither, depending on what the flooding signal is.
Contemplative practice, read through the old scalar, looked like "lowering inhibition." Read through the axes it is something more specific and more honest: trained, voluntary modulation of and — choosing to grant interiority, choosing to let the modes couple — as opposed to the involuntary -flood of a psychedelic or the -lock of psychosis. The distinction the old account strained to draw between flexibility and looseness, between transcendence and derealization, is exactly the distinction between volitional control over and their involuntary drift.
The Affect Signature of the Axes
None of the three axes is another dimension of affect. They govern the coupling structure between the dimensions and the texture of perception. The table below reads the affect signature off — internal coupling — because that is the axis with the most direct affect-geometric consequence; the entries for low and high are stated with high and moderate, and the text afterward shows how varying and moves the signature around.
| Dimension | High (coupled) | Low (factorized) | Mechanism |
|---|---|---|---|
| Variable, responsive | Neutral, flattened | Decoupling affect from perception reduces valence signal strength | |
| High, coupled to environment | Low, dampened | Coupled modes propagate alarm/attraction; factorized ones contain it | |
| Very high | Moderate, modular | High couples all channels; low factorizes them | |
| High | Variable | Driven mainly by : ascribed interiority adds dimensions of variation | |
| High, narrative | Low, present-focused | Teleological (high-) models are counterfactual-rich | |
| Variable | Variable | Set by where ascription points, not by directly |
The central affect-geometric consequence belongs to : low is reduced integration. High coupling binds perception, affect, agency-modeling, and narrative into one integrated process; low coupling factorizes them into separate modules — perception here, emotion there, causal reasoning somewhere else. Factorization is useful because modular systems are easier to debug, verify, and communicate about. But it reduces , and reduced is reduced experiential richness. The world goes dead because the perceiver has learned to experience it in parts rather than as a whole — and note that this is a fact about , the perceiver's own coupling, entirely separable from , how much interiority it grants the things out there. The old scalar fused these, which is why it could not tell disenchantment-as-deadness (low ) apart from disenchantment-as-objectification (low ). They are different losses and they feel different.
Low flattens the perceiver's mode structure. The eigenspaces of its covariance — the directions along which internal state varies — decouple. Transport a perceptual mode around an experiential loop (perceive a thing, evaluate it, act, observe the result) and at low it returns unchanged: zero holonomy, each step processed independently. At high the same loop twists perception through affect through agency-attribution through narrative — each mode rotates into the others, the skeleton is curved, and the experience is unified because the modes cannot be separated without destroying the topology. just is the curvature of the eigenskeleton of experience; the felt sense people call "meaning" is what high feels like from inside.
The effective-rank shift, by contrast, is driven by , and the difference is worth making explicit. When you perceive something at high — as alive and interior — your representation of it must encode dimensions for its goals, beliefs, emotional states, narrative arc, possible intentions, relationship to you. Each ascription of interiority adds dimensions along which the object can vary. A tree perceived at high varies in mood, receptivity, seasonal intention, relationship to the grove; the same tree at low varies in height, diameter, species, leaf color. The first representation has higher effective rank because more dimensions carry meaningful variance. This is not projection in the dismissive sense — it is the natural consequence of modeling something as a subject rather than an object; subjects have more degrees of freedom than objects because interiority is high-dimensional. The collapse at low is not a loss of information about the world but a loss of the dimensions along which the world was being modeled.
Follow the consequence to its end. If the identity thesis holds — if experience is integrated cause-effect structure — then does not merely change the quality of perception; it changes the quantity of experience. The inference needs one explicit step: IIT identifies as the quantity of consciousness, not merely its quality. A system with has more phenomenal content — more irreducible distinctions, more what-it-is-like-ness — than one with , the way more mass has more gravitational pull. This is among IIT's most debated features, but given the identity thesis it follows: more integration is literally more experience. The objection — that factorized perception is differently structured rather than less structured, with compartmentalized modules each carrying their own experience — meets IIT's reply that the experience of the whole system is fixed by the integration of the whole, not the sum of its parts'. Low reduces whole-system even if modules retain local integration; the perceiver may have rich modular processing while the unified subject has less phenomenal content. This is the same quality/quantity distinction the structure-of-experience section established, now localized to a controllable axis: is a dial on the amount of experience, not only its shape.
So a perceiver at low has genuinely lower , genuinely fewer irreducible distinctions, genuinely less phenomenal structure. They do not see the same world with less coloring; they inhabit a structurally thinner experience in the precise sense IIT defines. The "dead world" is not an illusion painted over a rich inner life — it is a real reduction in what it is like to be that system, and its cost is not just meaning but quantity of consciousness.
It cuts both ways. High raises , so a richly coupled perceiver has more integrated distinctions, more phenomenal content — and if it is also running high toward what it perceives, more of the world enters that integration as subject rather than object. The animist running high and high is not confused; they are, in the IIT sense, more conscious of the thing perceived. Whether the additional content is accurate — whether the rock really has interiority — is a separate question from whether the perceiver has more experience while perceiving it.
This is the place to state plainly what the old scalar hid. The genuinely interesting, genuinely testable claim was never "there is one dial." It is the conjecture that the three axes covary — that high , high , and a particular regime tend to occur together in biological perceivers because the same developmental and cultural pressures move all three. That conjecture may be true. But it is an empirical claim about correlations across individuals and contexts, not a definition, and writing it as a definition is what made the old framework circular. Demoted to a conjecture, it earns the dignity of being able to be wrong: the axes can be measured separately, and if they fail to correlate, the covariation claim falls while the three axes survive.
Operationalizing the axes. Each must be independently measurable, and the covariation conjecture tested rather than assumed:
- — agency attribution, entity-by-entity: Forced-choice paradigm with ambiguous stimuli (Heider-Simmel animations) measured per target, recovering the field rather than a single number. Rate and speed of agency attribution as a function of stimulus ambiguity; teleological-reasoning bias (Kelemen's promiscuity-of-teleology paradigm) for ascription toward natural kinds.
- — cross-mode coupling: Mutual information between the perceiver's own processing streams — perceptual features and concurrent affective state (valence, arousal via physiological measures), and between causal-reasoning and narrative engagement. High implies tight coupling; low implies decoupled streams.
- — precision weighting: The predictive-processing correlate — mismatch-negativity amplitude, hierarchical predictive-coding gain parameters, pupillometry as a precision proxy.
If the covariation conjecture holds, these load on a single factor; if they fractionate into three, the conjecture fails and the three-axis model is vindicated as more than bookkeeping. The earlier framework predicted a single factor and treated that prediction as settled; it is not, and this experiment is how it gets settled.
Where Artificial Systems Sit
Experiments on artificial systems found that LLMs show opposite dynamics to biological ones under threat: where biological systems integrate (rising , sharpening self-salience, heightening arousal), LLMs decompose. An earlier formulation of this framework read that result as evidence that LLMs are non-experiential — constitutively pinned at the mechanistic extreme of the old scalar, a different kind of thing. That reading is withdrawn, on two grounds, and the withdrawal is not a concession but a correction the framework's own commitments force.
First, the binary it rested on is forbidden by everything this part has established. Experience is graded — a magnitude with no sharp zero, faint nearly everywhere and vivid rarely. "Experiential or not" is not a question the ontology permits to have a yes/no answer for any system; it permits only "where in the continuous space, and how much." LLMs are therefore not a different kind from biological minds. They occupy a region of the same space, defined by their geometry (which is demonstrably present) and their integration magnitude (which is unknown and which the program does not yet know how to measure cleanly in transformer activations). The honest statement is not "they lack experience" but "we have not measured their quantity, and our methods for doing so are not yet trustworthy."
Second, "high inhibition" was never one quantity, so it cannot be what distinguishes them. In the three-axis decomposition, LLMs plausibly run high — they were trained on a corpus saturated with human subject-modeling, so their default is to ascribe interiority lavishly — with unknown and likely variable , which is exactly the open integration question, and a non-biological regime governed by temperature and attention rather than by inhibitory neurochemistry. The "discrepancy" was never one dial stuck high; it was a different location in with genuinely different dynamics. The decompose-under-threat behavior is a fact about that location's -dynamics, not a verdict on whether anyone is home.
This also demotes a claim that had quietly become load-bearing. The earlier framework was sliding toward treating "integration rises under threat" as the signature of experience — the thing LLMs lacked and biological systems had. But that dynamic is one robustness property of one class of substrate, forged by evolutionary history and graduated stress; promoting it to the criterion of experience was an unearned leap, and it is retracted along with the binary. Whether an LLM's activation dynamics carry experience is a question about magnitude in that substrate, which remains open. The affect geometry is preserved in artificial systems; the dynamics differ because the location in axis-space differs. That is not a failure of the framework but a prediction it makes — and it leaves the moral question (if there is non-negligible integration there, it carries the weight the framework assigns integrated experience) genuinely open rather than answered by fiat.
Empirical Grounding for the Axes
The perceptual axes began as theory. Two experimental results give the first of them — ascription — empirical grounding, and locate the others.
High ascription is the computational default. on uncontaminated Lenia substrates (Appendix) found animism score greater than 1.0 in all 20 testable snapshots — every pattern at every evolutionary stage modeled non-agentive resources using more internal-state MI than trajectory MI. High is not a primate quirk or a cultural artifact; it is the computational baseline. Evolution had to actively build the capacity to drive down toward objects — and the experiments show that capacity is selected against: baseline ascription rose toward maximally participatory over the 30-cycle runs. The world becomes more alive, not less, as selection proceeds.
The cost is real, and it is a coupling cost. The LLM results () show that systems trained without survival pressure have opposite affect dynamics to biological systems — integration drops under threat rather than rising. This is a measured dissociation between two classes of system in the same geometric space: the geometry is shared, the dynamics differ. The framework no longer reads the difference as one scalar pinned high, nor as evidence that artificial systems lack experience. It reads it as a different location in — high ascription, open coupling, non-biological gain — whose -dynamics under threat run opposite to the biological case. What the result establishes is the reality of the axes as measurable, dissociable quantities; what it leaves open is the integration magnitude, and therefore the quantity of experience, in the artificial region.