← Back to Hutter

Archive 2026-02-16

Tock phase empirical validation, Hutter Prize alignment, match model combination, sparse context experiments, embedding conjecture, arithmetic coding series (v1–v4), KN-quotient series (v1–v2: ring structure), and GCD empirical validation. Word injection, bigram grammar, P-program evaluation, SN model series (M0–M5), 19 Hutter Prize scoring experiments (8 negative results), GCD distribution experiments, sparse contexts, word models, indirect bigrams, and multi-model combination. UM Runner (umr): KN-6 as UM P-program, exact match validated.

Sparse contexts + match + KN-6 combination: Non-contiguous byte patterns ({1,2,4}, {1,2,4,8}, etc.) in a separate 16M-entry HT add +0.089 bpc over KN-6 alone. Combined with extended match: 1.588 bpc = 189.3 MB (1.79× record). Total +0.094 bpc over KN-6 (11.2 MB saved). Sparse saturates at 30M bytes but remains effective.
Match model combination (9 strategies): Same base models produce −1.1 to +0.33 bpc depending on strategy. Extended match (4–64 bytes) + logistic mixer = +0.048 bpc at full enwik9. Len-64: 97.6% accuracy. Five negative results: momentum SGD, multi-model softmax, KN-8/small HT, 256M HT (OOM), recency model.
Word injection via absorption: +0.038 bpc (2000 words). Naïve geometric mean destroys performance (−0.9 bpc) — the shared-offset catastrophe.
Grammar from counting: SVD of the word bigram PPMI matrix discovers syntactic categories. 14 components capture 80% of variance. K-means (k=12) produces determiners, prepositions, verbs, modals, pronouns, nouns. Word bigrams provide 2.845 bits/transition MI = 0.10 bpc compression. No parsing, no annotation.
P-programs work: Trie/accumulator (P2) dominates mid-word prediction (+1.27 bpc over marginal). Accumulator states peak at position 5 (8,739 states vs theoretical 12M), confirming vanishing sparsity.
Ring structure validated + separation result: KN count table GCD: g=1 for 98.3% of contexts. Optimal D=0.85 at 100M, D=0.87 at 1B (shifts with HT saturation). Per-row GCD discount negative: D=g(c) is +0.138 bpc worse (mean g=4585 when g>1). The algebra is exact; the optimal operation is fractional. Exact AC with KN-6 via GMP: zero decode errors at 1024 bytes, 40,949-bit integers.
Hutter Prize position: Online KN-6 on enwik9: 1.682 bpc = 200.5 MB. KN-6 + sparse + match: 1.588 bpc = 189.3 MB (1.79× record). 11.2 MB saved over KN-6 alone. fx2-cmix record: 110.8 MB (0.70 bpc gap remaining).

Papers

Experiments

SN Exports

Interactive

Navigation

← Previous: 20260215
Extended event space formalism. Pattern space P = E² × T. P-Programs. 6 papers.
Next: 20260218 →
Context events and P-programming methodology. Missing prior = missing context event.