← Back to Hutter

Archive 2026-02-15

The extended event space: injecting lexical structure into H. Formalizing the correct tock for lexicon injection.

Key insight: The the_inject experiments went wrong by mixing an external predictor with the RNN instead of extending the event space E. KN dominated because the lexical trie was compensating for the RNN’s weakness (6.43 bpc on new data), not adding genuine structure. The correct tock adds lexical events to H—position, accumulator, bag-of-letters, word identity—as first-class events participating in UM patterns.
The hourglass: Bytes → words → bytes (Prop 16 of tock protocol). Extended: I′ = bytes × position × accumulator. H′ = hidden × bag-of-letters × graded word support. O′ = future bytes at multiple offsets. Fully-connected within bounded word context subsumes both RNN-like and transformer-like patterns.

Papers

Navigation

← Previous: 20260214
Lexemes as binary event spaces. Neutral “the” factorization. 31 injection experiments.
Next: 20260216 →
Tock phase empirical validation. Word injection curve, bigram grammar, P-program evaluation.