Record: Curriculum Learning + LeakyReLU(0.9)² + 7-gram Backoff (val_bpb=0.9633) by ndokutovich · Pull Request #764 · openai/parameter-golf

ndokutovich · 2026-03-25T20:26:04Z

Summary

val_bpb = 0.9633 (seed 42, additional seeds pending compute grant) | 15.56 MB | 8xH100 SXM, 600s

Built on PR #753 (Podracing II) with two novel additions:

1. Curriculum Learning (Shard Reordering)

Training shards reordered by model perplexity — hardest shards first. Based on PR #650 (-0.003 BPB). Zero code change, environment variable only.

2. LeakyReLU(0.9)² Slope Optimization

Following @MatoTeziTanka's controlled sweep (issue #140): slope 0.9 gives -0.013 BPB vs standard 0.5. One parameter change.

Results

Eval Method	BPB
Sliding window (stride=64)	1.1216
Sliding + 7-gram backoff	0.9633
Legal TTT (score-first, 3ep)	1.1216

Artifact: 15,560,351 bytes (< 16MB)
Steps: 6,647 at 90.3ms/step
GPTQ calibration within training budget (issue #677 compliant)

Reproduction

SEED=42 bash run.sh

Acknowledgments

@newjordan (PR #753), @abaybektursun (PR #650), @MatoTeziTanka (slope sweep), @Asukabot0 (n-gram backoff)

Status

1 seed submitted. 2 additional seeds pending OpenAI compute grant.
Previously PR #486 (formerly #2 on leaderboard, TrigramHash originator). $339 personal compute spent.

Test plan

1 seed (42) validated on 8xH100 SXM
Seed 1337 (pending compute)
Seed 2024 (pending compute)

…bpb=0.9633, 1 seed)

Record: Curriculum Learning + LeakyReLU(0.9)^2 + 7-gram Backoff (val_…

36e9649

…bpb=0.9633, 1 seed)

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Record: Curriculum Learning + LeakyReLU(0.9)² + 7-gram Backoff (val_bpb=0.9633)#764

Record: Curriculum Learning + LeakyReLU(0.9)² + 7-gram Backoff (val_bpb=0.9633)#764
ndokutovich wants to merge 1 commit intoopenai:mainfrom
ndokutovich:submission-v7-curriculum-ngram

ndokutovich commented Mar 25, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

Conversation

ndokutovich commented Mar 25, 2026

Summary

1. Curriculum Learning (Shard Reordering)

2. LeakyReLU(0.9)² Slope Optimization

Results

Reproduction

Acknowledgments

Status

Test plan

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant