
Add classical 4-gram non-record submission#701

Open
Muhtasham wants to merge 1 commit into openai:main from Muhtasham:muhtasham/classical-4gram-submit

Conversation

@Muhtasham

Summary

This PR adds a non-record classical submission under records/track_non_record_16mb/2026-03-25_classical_4gram_10m_eval.

The submission is fully non-neural:

  • discounted hashed 4-gram model with unigram/bigram backoff
  • artifact built from the official fineweb10B_sp1024 train export
  • exact evaluation on the official fineweb_val_* split
  • no training-data access during evaluation; final eval uses the saved artifact via --load-state only
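The core technique named above, an absolute-discounted hashed n-gram model with backoff to lower orders, can be sketched as follows. This is a minimal illustrative implementation, not the PR's actual `train_gpt.py`; the class name, hashing scheme, and table layout are assumptions, and only the order (4) and discount (0.75) come from the description.

```python
from collections import defaultdict
import hashlib

def ctx_hash(ctx, table_size=1 << 20):
    # Hash a context tuple into a fixed-size slot (illustrative table size).
    digest = hashlib.blake2b(repr(ctx).encode(), digest_size=8).digest()
    return int.from_bytes(digest, "big") % table_size

class DiscountedNgramLM:
    """Sketch of an absolute-discounted hashed n-gram LM with backoff."""

    def __init__(self, order=4, discount=0.75):
        self.order = order
        self.discount = discount
        # counts[n]: hashed context of length n -> {next token: count}
        self.counts = [defaultdict(lambda: defaultdict(int)) for _ in range(order)]

    def train(self, tokens):
        for i, tok in enumerate(tokens):
            for n in range(self.order):  # context lengths 0 .. order-1
                if i >= n:
                    self.counts[n][ctx_hash(tuple(tokens[i - n:i]))][tok] += 1

    def prob(self, ctx, tok, n=None):
        if n is None:
            n = min(len(ctx), self.order - 1)
        table = self.counts[n].get(ctx_hash(tuple(ctx[len(ctx) - n:])), {})
        total = sum(table.values())
        if n == 0:
            # Unigram base case: maximum-likelihood estimate.
            return table.get(tok, 0) / total if total else 0.0
        if total == 0:
            return self.prob(ctx, tok, n - 1)  # unseen context: back off fully
        d = self.discount
        discounted = max(table.get(tok, 0) - d, 0.0) / total
        # Probability mass freed by discounting is redistributed via the lower order.
        backoff_mass = d * len(table) / total
        return discounted + backoff_mass * self.prob(ctx, tok, n - 1)
```

Because the discounted mass is handed exactly to the backoff distribution, the conditional distribution stays normalized at every order.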

Exact Results

  • val_bpb: 1.91070694
  • validation tokens: 62,021,846
  • evaluation wallclock: 571.97s
  • model artifact bytes: 14,310,783
  • code bytes: 57,801
  • total bytes: 14,368,584
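The byte accounting above is internally consistent and under the cap, which a one-liner confirms:

```python
artifact_bytes = 14_310_783
code_bytes = 57_801
total_bytes = artifact_bytes + code_bytes

assert total_bytes == 14_368_584      # matches the reported total
assert artifact_bytes < 16_000_000    # under the decimal 16 MB artifact cap
```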

Included Files

  • README.md
  • submission.json
  • train.log
  • eval.log
  • train_gpt.py

Notes

This is a non-record submission, not a SOTA claim.

The final packaged run is the faster classical path:

  • first 10,000,000 train tokens used to build the artifact
  • --absolute-discount 0.75
  • --ngram-contexts 3
  • --mix-backoff-experts 0
  • no cache experts
  • no copy experts
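For concreteness, here is a hypothetical `argparse` stub wiring up the flags listed above. The flag names come from this PR description; the real `train_gpt.py` interface is not shown here and may differ in defaults, types, and help text.

```python
import argparse

# Illustrative stub only; the actual CLI in train_gpt.py may differ.
parser = argparse.ArgumentParser(description="classical n-gram eval (sketch)")
parser.add_argument("--absolute-discount", type=float, default=0.75)
parser.add_argument("--ngram-contexts", type=int, default=3)
parser.add_argument("--mix-backoff-experts", type=int, default=0)
parser.add_argument("--load-state",
                    help="path to saved artifact; final eval reads only this")

# The packaged run as described above:
args = parser.parse_args(
    ["--absolute-discount", "0.75", "--ngram-contexts", "3",
     "--mix-backoff-experts", "0"]
)
```

Note that `argparse` translates the hyphenated flags into underscored attributes, e.g. `args.absolute_discount`.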

I verified locally that:

  • the artifact is under the decimal 16,000,000 byte cap
  • the final full-validation run is performed on the official fineweb_val_* split
  • the final evaluation does not read training shards
  • the records folder compiles and runs from within the folder

