Experiments

V27: Nonlinear MLP Head

Period: 2026-02-19. Substrate: V22 + 2-layer MLP prediction head (tanh activation).

The key insight: a nonlinear readout forces gradient coupling across all hidden units. Through the chain rule via the shared nonlinearity, $\partial L / \partial h_i$ depends on all $h_j$. No single unit can independently satisfy the objective.

$$\frac{\partial L}{\partial h} = 2(\hat{y} - y) \cdot W_2^\top \cdot \text{diag}\left(1 - \tanh^2(W_1 h + b_1)\right) \cdot W_1$$
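
The gradient expression can be checked numerically. The sketch below is a minimal illustration, not the experiment's code: it assumes a scalar prediction, a squared-error loss $L = (\hat{y} - y)^2$ (suggested by the $2(\hat{y} - y)$ factor), and $W_2$ stored as a column vector (named `w2` here). The output bias `b2`, the layer sizes, and all numeric values are hypothetical. It compares the closed form against central finite differences, then perturbs one hidden unit $h_1$ and recomputes $\partial L / \partial h_0$ to show the coupling.

```python
import math

def mlp_head(h, W1, b1, w2, b2):
    """Scalar prediction y_hat = w2 . tanh(W1 h + b1) + b2; also returns activations."""
    a = [math.tanh(sum(w * hj for w, hj in zip(row, h)) + bi)
         for row, bi in zip(W1, b1)]
    return sum(w * ai for w, ai in zip(w2, a)) + b2, a

def loss(h, y, params):
    """Assumed squared-error loss L = (y_hat - y)^2."""
    y_hat, _ = mlp_head(h, *params)
    return (y_hat - y) ** 2

def analytic_grad(h, y, params):
    """dL/dh = 2 (y_hat - y) * w2^T diag(1 - tanh^2(W1 h + b1)) W1."""
    W1, b1, w2, b2 = params
    y_hat, a = mlp_head(h, W1, b1, w2, b2)
    # Row vector w2^T diag(1 - a^2): one entry per unit of the MLP's inner layer.
    v = [w * (1.0 - ai * ai) for w, ai in zip(w2, a)]
    return [2.0 * (y_hat - y) * sum(v[k] * W1[k][i] for k in range(len(v)))
            for i in range(len(h))]

def numeric_grad(h, y, params, eps=1e-6):
    """Central finite differences, used only as a cross-check."""
    grad = []
    for i in range(len(h)):
        hp, hm = list(h), list(h)
        hp[i] += eps
        hm[i] -= eps
        grad.append((loss(hp, y, params) - loss(hm, y, params)) / (2 * eps))
    return grad

# Hypothetical toy values: 3 hidden units feeding a 2-unit tanh layer.
W1 = [[0.5, -0.3, 0.8], [-0.6, 0.9, 0.2]]
b1 = [0.1, -0.2]
w2 = [1.2, -0.7]
b2 = 0.05
params = (W1, b1, w2, b2)
h = [0.4, -0.1, 0.3]
y = 0.5

g_analytic = analytic_grad(h, y, params)
g_numeric = numeric_grad(h, y, params)

# Coupling: change only h[1], then look at the gradient with respect to h[0].
h_shifted = [h[0], h[1] + 0.5, h[2]]
g_shifted = analytic_grad(h_shifted, y, params)

print("analytic:", g_analytic)
print("numeric: ", g_numeric)
print("dL/dh_0 before and after shifting h_1:", g_analytic[0], g_shifted[0])
```

With these toy values the two gradients should agree to within finite-difference error, and $\partial L / \partial h_0$ changes when only $h_1$ is moved, because both the residual and the $\tanh$ derivative terms depend on the full vector $h$.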
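
| Seed | Mean $\Phi$ | Max $\Phi$ | Eff Rank | Silhouette |
|------|-------------|------------|----------|------------|
| 42   | 0.079       | 0.128      | 8.24     | 0.325      |
| 123  | 0.071       | 0.091      | 6.94     | 0.343      |
| 7    | 0.119       | 0.245      | 11.34    | 0.112      |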

Seed 7's maximum of $\Phi = 0.245$ is the highest integration ever observed, 2.5x V22's maximum. The nonlinear readout can force genuine cross-component coordination, but the effect is seed-dependent: the architecture creates the possibility space; evolution selects whether to exploit it.

New observable: behavioral modes. Silhouette scores of 0.11-0.34 (lowest for seed 7, highest for seed 123) indicate distinct clusters in hidden state space. No previous experiment showed this.
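
For reference, a silhouette score can be computed from hidden-state vectors and cluster labels as sketched below. This uses the standard definition (per point, $s = (b - a) / \max(a, b)$, with $a$ the mean distance to the point's own cluster and $b$ the smallest mean distance to any other cluster) under assumptions not stated in the source: Euclidean distance, and labels supplied by some clustering step. The function name and the toy points are hypothetical, and the log does not say which clustering method produced the reported scores.

```python
import math

def silhouette_score(points, labels):
    """Mean silhouette coefficient over all points, using Euclidean distance."""
    clusters = {}
    for idx, lab in enumerate(labels):
        clusters.setdefault(lab, []).append(idx)
    if len(clusters) < 2:
        raise ValueError("silhouette needs at least two clusters")

    scores = []
    for i, lab in enumerate(labels):
        own = clusters[lab]
        if len(own) == 1:
            scores.append(0.0)  # usual convention for singleton clusters
            continue
        # a: mean distance to the other members of the point's own cluster.
        a = sum(math.dist(points[i], points[j]) for j in own if j != i) / (len(own) - 1)
        # b: smallest mean distance to the members of any other cluster.
        b = min(
            sum(math.dist(points[i], points[j]) for j in members) / len(members)
            for other, members in clusters.items() if other != lab
        )
        denom = max(a, b)
        scores.append((b - a) / denom if denom > 0 else 0.0)
    return sum(scores) / len(scores)

# Hypothetical toy "hidden states": two tight, well-separated groups.
states = [(0, 0), (0, 1), (1, 0), (10, 10), (10, 11), (11, 10)]
good_labels = [0, 0, 0, 1, 1, 1]
mixed_labels = [0, 1, 0, 1, 0, 1]  # labels that ignore the geometry

score_good = silhouette_score(states, good_labels)
score_mixed = silhouette_score(states, mixed_labels)
print("well-separated labels:", score_good)
print("mixed labels:         ", score_mixed)
```

Scores near 1 mean tight, well-separated clusters; scores near 0 mean overlapping clusters; negative scores mean points sit closer to another cluster than to their own.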
