Part I: Foundations

Preliminary Results: Where the Ladder Stalls

Introduction

0:00 / 0:00

Preliminary Results: Where the Ladder Stalls

We have begun running a simplified version of this experiment using Lenia (continuous CA, $256 \times 256$ toroidal grid) with resource dynamics, measuring $\intinfo$ via partition prediction loss, $\Val$ via mass change, $\Ar$ via state change rate, and $\reff$ via trajectory PCA. The results so far are instructive—not because they confirm the predictions above, but because of where they fail.

The central lesson: the ladder requires heritable variation. Emergent CA patterns achieve rungs 1–3 of the ladder (microdynamics $\to$ attractors $\to$ boundaries) from physics alone. The transition to rung 4 (functional integration) requires evolutionary selection acting on heritable variation in the trait that determines integration response.

Proposed Experiment

Substrate: Lenia with resource depletion/regeneration (Michaelis-Menten growth modulation). Perturbation: Drought (resource regeneration $\to 0$ ). Measure: $\Delta \intinfo$ under drought.

Conditions:

No evolution (V11.0). Naive patterns under drought: $\intinfo$ decreases by $-6.2%$ . Same decomposition dynamics as LLMs.
Homogeneous evolution (V11.1). In-situ selection for $\intinfo$ -robustness (fitness $\propto \intinfo_{\text{stress}} / \intinfo_{\text{base}}$ ). Still decomposes ( $-6.0%$ ). All patterns share identical growth function—selection prunes but cannot innovate.
Heterogeneous chemistry (V11.2). Per-cell growth parameters ( $\mu, \sigma$ fields) creating spatially diverse viability manifolds. After 40 cycles of evolution on GPU: $-3.8%$ vs naive $-5.9%$ . A +2.1pp shift toward the biological pattern. Evolved patterns also show better recovery— $\intinfo$ returns above baseline after drought, while naive patterns do not fully recover.
Multi-channel coupling (V11.3). Three coupled channels—Structure ( $R{=}13$ ), Metabolism ( $R{=}7$ ), Signaling ( $R{=}20$ )—with cross-channel coupling matrix and sigmoid gate. Introduces a new measurement: channel-partition $\intinfo$ (remove one channel, measure growth impact on remaining channels). Local test: channel $\intinfo \approx 0.01$ , spatial $\intinfo \approx 1.0$ —channels couple weakly at 3 degrees of freedom.
High-dimensional channels (V11.4). $C{=}64$ continuous channels with fully vectorized physics. Spectral $\intinfo$ via coupling-weighted covariance effective rank. 30-cycle GPU result: evolved $-1.8%$ vs naive $-1.6%$ under severe drought—evolution had negligible effect. Both decompose mildly, suggesting that 64 symmetric channels provide enough internal buffering to resist drought regardless of evolutionary tuning. Mean robustness $0.978$ across all 30 cycles. The Yerkes-Dodson pattern persists: mild stress increases $\intinfo$ by $+130$ – $190%$ .
Hierarchical coupling (V11.5). Same $C{=}64$ physics as V11.4, but with asymmetric coupling (feedforward/feedback pathways between four tiers: Sensory $\to$ Processing $\to$ Memory $\to$ Prediction). 30-cycle GPU result: evolved patterns have higher baseline $\intinfo$ ( $+10.5%$ vs naive) and higher self-model salience ( $0.99$ vs $0.83$ ), but under severe drought they decompose more ( $-9.3%$ ) while naive patterns integrate ( $+6.2%$ ). Evolution overfits to the mild training stress, creating fragile high- $\intinfo$ configurations. Key lesson: the hierarchy must live in the coupling structure, not in the physics; imposing different timescales per tier caused extinction. Functional specialization should emerge from selection.
Metabolic maintenance cost (V11.6). Addresses the autopoietic gap directly: patterns pay a constant metabolic drain proportional to mass ( $\texttt{maintenance\_rate} \times g \times dt$ each step). 30-cycle GPU result ( $C{=}64$ ): evolved-metabolic $-2.6%$ vs naive $+0.2%$ under severe drought. Evolution again produced higher- $\intinfo$ -but-more-fragile patterns. Critically, the maintenance rate ( $0.002$ ) was not lethal enough—naive patterns retained $98%$ population through drought. The autopoietic gap remains open: a small metabolic drain on top of local physics does not produce active self-maintenance, because patterns have no mechanism for non-local resource detection. They cannot “forage” when they cannot “see” beyond kernel radius $R$ .
Curriculum evolution (V11.7). Fixes V11.5’s stress overfitting by graduating stress intensity across cycles (resource regeneration ramped from $0.5\times$ to $0.02\times$ baseline over 30 cycles) with $\pm 30%$ random noise and variable drought duration (500–1900 steps per cycle). The critical test: evolved patterns evaluated on novel stress patterns never seen during training. 30-cycle GPU result ( $C{=}64$ ): robustness $0.954 \to 0.967$ . Curriculum-evolved patterns outperform naive on all four novel stressors: mild $+2.7\text{pp}$ , moderate $+1.5\text{pp}$ , severe $+1.3\text{pp}$ , extreme $+1.2\text{pp}$ . Under mild novel stress, evolved patterns actually integrate ( $+1.9%$ ) while naive decompose ( $-0.8%$ ). The overfitting problem is substantially reduced—not eliminated, but the shift is consistently positive across the full severity range.

Unexpected: (1) Mild stress consistently increases $\intinfo$ by 60–190\% (Yerkes-Dodson–like inverted-U). Only severe stress causes decomposition. (2) In V11.5, evolution increased vulnerability to severe stress despite improving baseline $\intinfo$ —a stress overfitting effect. (3) V11.7’s curriculum training substantially reduces this overfitting: graduated, noisy stress exposure produces patterns that generalize to novel stressors. The shift from naive is positive across all four novel severity levels tested ( $+1.2$ to $+2.7$ percentage points). (4) V11.6’s metabolic cost was intended to create lethal drought, but at $\texttt{rate}{=}0.002$ the drought was not lethal—naive patterns retained $98%$ population. Evolved-metabolic patterns decomposed $-2.6%$ while naive held at $+0.2%$ , repeating the fragility pattern of V11.5. The deeper lesson: adding metabolic cost to a substrate with fixed-radius perception produces efficient passivity, not active foraging. The anxiety parallel deepens: V11.5 shows that fixed-stress training produces maladaptive fragility, V11.7 shows that graduated exposure (cf.\ systematic desensitization) builds genuine robustness, and V11.6 shows that existential stakes alone do not produce adaptation when the organism cannot perceive beyond its local neighborhood.

The trajectory from V11.0 through V11.7 reveals two orthogonal axes of improvement. The first is substrate complexity: each step from V11.0 to V11.5 adds internal degrees of freedom for evolution to select on—heterogeneous chemistry (V11.2), multiple coupled channels (V11.3–V11.4), hierarchical coupling (V11.5). The second, revealed by V11.6–V11.7, is selection pressure quality: the substrate matters less than how you stress it. V11.7’s curriculum training on the same V11.4 substrate produces better generalization than V11.5’s hierarchical architecture trained with fixed stress. V11.6 goes further, changing the stakes: metabolic cost makes drought lethal, not merely weakening.

V11.5 introduces directed coupling structure (feedforward/feedback pathways) to test whether functional specialization emerges under selection. The critical insight: attempting to impose different physics per tier (different timescales, custom growth gates) caused immediate extinction at $C{=}64$ —the channels designed to be “memory” simply died. The working approach uses identical physics across all channels (proven V11.4 dynamics) with an asymmetric coupling matrix that biases information flow directionally. This is more than a technical fix; it reflects a theoretical prediction: in biological cortex, all neurons use the same basic biophysics. The hierarchy emerges from connectivity and learning, not from different physics per layer.

The V11.5 stress test reveals an unexpected phenomenon: stress overfitting. Evolved patterns have 10.5\% higher baseline $\intinfo$ and 19\% higher self-model salience than naive patterns—but under severe drought they decompose 9.3\% while naive patterns actually integrate by 6.2\%. Evolution selected for high- $\intinfo$ configurations tuned to mild stress (which each training cycle applies), creating states that are simultaneously more integrated and more fragile than their unoptimized counterparts.

This has a direct parallel in affective neuroscience: anxiety disorders involve heightened integration and self-monitoring that is adaptive under moderate threat but catastrophically maladaptive under extreme stress. The suffering motif—high $\intinfo$ , low $\reff$ , high $\selfmodel$ —may describe a system that has been selected too precisely for a particular threat level. The evolved CA patterns show exactly this signature: high baseline $\intinfo$ (0.076) with high self-model salience (0.99) that collapses under a regime shift.

**V11.5 stress test: evolved vs. naive patterns through baseline, drought, and recovery.** (a) Evolved patterns have higher baseline $\intinfo$ but decompose $-9.3\%$ under drought, while naive patterns *integrate* $+6.2\%$ . (b) Evolved patterns maintain high self-model salience ( $>0.97$ ) across all phases; naive patterns show lower and declining salience.

Whether evolution on this substrate can discover integration strategies that are robust to novel stresses—not just the training distribution—likely requires curriculum learning (gradually increasing stress intensity) or environmental diversity (varying the type and severity of perturbation). This connects to the forcing function framework developed in the next section: the quality of the forcing function matters as much as its presence.

Multi-channel Lenia at increasing dimensionality. PCA projection of C channels to RGB. — **Multi-channel Lenia at increasing dimensionality.** PCA projection of $C$ channels to RGB. Top row: baseline (normal resources); bottom row: drought stress. Patterns at $C{=}3$ are visually simple; at $C{=}16$ and $C{=}32$ , the richer channel structure produces more complex spatial organization. Under drought, spatial structure degrades—but the degree of degradation depends on $C$ .

Open Question

At what channel count $C$ does the substrate have enough internal degrees of freedom for evolution to discover biological-like integration (where $\intinfo$ increases under threat)? The $C$ -sweep suggests that mid-range $C$ ( $8$ – $16$ ) accidentally produces integration-like responses—the coupling bandwidth happens to match the channel count—while high $C$ ( $32$ – $64$ ) decomposes, the coupling space being too large for random configurations. Is there a critical $C^*$ above which a phase transition occurs, or does evolution continuously improve robustness at any $C$ ? Each rung of the ladder may require a minimum internal dimensionality—the substrate must be rich enough for selection to sculpt.

The critical lesson evolves with the experiments. V11.0–V11.5 showed that evolution helps but in surprising ways—it creates higher- $\intinfo$ states that are also more fragile. V11.7 demonstrates that the training regime matters: curriculum learning produces genuine generalization across novel stressors. V11.6 showed that making drought metabolically costly produces efficient passivity rather than active foraging—the patterns cannot perceive beyond their local neighborhood, so existential stakes alone do not generate the distant-resource-seeking behavior that would require integration. The remaining gap was between “decomposes less” and “integrates under threat,” and the locality ceiling explains why.

V12’s results confirm that the ceiling is real and that the predicted remedy partially works. Replacing fixed convolution with evolvable windowed self-attention—the only change to the physics—shifts mean robustness from $0.981$ to $1.001$ , moving the system to the threshold where $\intinfo$ is approximately preserved under stress rather than destroyed. Eight substrate modifications (V11.0–V11.7) could not achieve even this. The single change that mattered is exactly what the attention bottleneck hypothesis predicted: state-dependent interaction topology. But the effect is modest—the system reaches the threshold without clearly crossing it. Attention is necessary but not sufficient for the full biological pattern.

Open Question

The V11.5 results show that selecting for $\intinfo$ -robustness under mild stress creates patterns that are less robust to severe stress than unselected patterns. V11.7 provides a partial answer: curriculum training with graduated, noisy stress exposure produces patterns that generalize to novel stressors ( $+1.2$ to $+2.7\text{pp}$ shift over naive across four novel severity levels). But the effect is modest—evolved patterns still decompose under severe novel stress ( $-1.7%$ ), just less than naive ( $-3.0%$ ). The remaining questions: (1) Can curriculum training with longer schedules or wider stress distributions close this gap further? (2) Does combining curriculum training with metabolic cost (V11.6’s lethal resource dependence) produce qualitatively different dynamics—active foraging rather than passive persistence? (3) Does the biological developmental sequence (graduated stressors from embryogenesis through maturation) achieve robust integration precisely because it is a curriculum over the full threat distribution? [V11.6 + curriculum combination not yet tested.]