Experiments

Falsification Map

Falsification Map

ExperimentPredictionOutcome
V10 (MARL)Forcing functions create geometryContradicted. All conditions show alignment; removal increases it.
Exp 2 (World Model)Cwm\mathcal{C}_{\text{wm}} increases with evolutionPartial. 100x at bottleneck, flat in general population.
Exp 3 (Representation)Compression and modeling co-emergePartial. Co-emerge under bottleneck only. Compression is cheap.
Exp 4 (Language)Compositional communicationNot confirmed. Chemical commons but ρtopo0\rho_{\text{topo}} \approx 0.
Exp 5 (Counterfactual)Reactive-to-detached transitionNull. Wall at ρsync0\rho_{\text{sync}} \approx 0.
Exp 6 (Self-Model)SM emergence with Φ\intinfo jumpWeak. n=1 event at bottleneck.
Exp 7 (Affect Geometry)Tripartite alignmentPartial. A-C develops over evolution (0.01 to 0.38). A-B null.
Exp 8 (ι\iota)Participatory default, animismConfirmed. ι0.30\iota \approx 0.30, animism > 1.0 in all 20 snapshots.
Exp 9 (Normativity)Exploitation penaltyNull. Requires agency.
Exp 10 (Superorganism)ΦG>Φi\intinfo_G > \sum \intinfo_iNot confirmed. Ratio 1-12%, increasing.
Exp 11 (Entanglement)Co-emergence clustersNot confirmed. Different cluster structure.
Exp 12 (Capstone)Seven criteria for identity thesisAll met (moderate/weak). Geometry confirmed.
V19 (Furnace)Selection vs creationCreation confirmed 2/3 seeds.
V20 (ρ\rho wall)ρsync>0.1\rho_{\text{sync}} > 0.1Confirmed. 0.21 from cycle 0.
V22-V24 (Prediction)Prediction integrationNot confirmed. Linear readout always decomposable.
V27 (MLP)Nonlinear head Φ\intinfo \uparrowConfirmed (seed 7: 0.245). Seed-dependent.
V28 (Width)Bottleneck width mattersNot confirmed. Mechanism is gradient coupling.
V29/V31 (Social)Social target lifts Φ\intinfoNot confirmed. 3-seed fluke; 10-seed: p=0.93p = 0.93.
V30 (Dual)Self+social > eitherNegative. Gradient imbalance; self colonizes.
V31 (Seeds)Seed distributionConfirmed: 30/30/40 split. Post-drought bounce r=0.997r = 0.997.
V32 (Autopsy)First bounce predicts categoryRevised: First bounce NOT predictive (p=0.60p = 0.60). Mean bounce across all droughts IS (ρ=0.60,p<105\rho = 0.60, p < 10^{-5}). Trajectory, not event.
V35 (Language)Referential communication emergesConfirmed: 10/10 seeds (100%). But does NOT lift Φ\intinfo. Language is cheap.
VLM Conv.VLMs recognize affect in protocells (RSA > 0.3)Confirmed: GPT-4o ρ=0.72\rho = 0.72, Claude ρ=0.54\rho = 0.54. Raw numbers: 0.78, 0.72.
Falsification Scoreboard. 7 confirmed, 7 contradicted, 1 revised. The framework survives not by being right everywhere, but by being wrong in specific, informative ways. Each contradiction sharpened the theory — the forcing function failure led to the geometry/dynamics distinction; the social prediction failure revealed the gradient interference pattern; the language failure established the rung 4-5 / rung 8 boundary.