Skip to main content

Practice

Practice and Projects

Last updated: April 22, 2026

The standard

QuestionOutput
State the assumptionsWritten assumptions, limits, and conditions
Build or derive itCode, derivation, or worked calculation
Compare with a baselineMetric, sanity check, or baseline model
Break it onceFailure case, ablation, or counterexample
Write down the resultShort note with results and limitations

Code

Implement the object with tests, shape checks, seed control, and enough structure that another person can run it.

Paper study

Identify the claim, assumptions, comparison point, result, limitations, and one small reproduction or ablation when possible.

Systems

Measure memory, throughput, latency, data movement, or failure modes. Make system behavior visible with measurements.

Explanations

Turn a concept into a diagram, derivation, misconception check, or teaching note that exposes where the idea can fail.

Language models from first principles

The first flagship project path connects mathematical foundations, implementation, evaluation, and systems in a small language-model build that can be run and inspected.

StageOutput
1. TokenizerTests + benchmark
2. Bigram baselineBaseline loss/perplexity
3. TransformerShape/mask/attention checks
4. Training loopCheckpointing + sampling
5. Data pipelineFiltering + dedup notes
6. EvaluationBaselines + uncertainty
7. AblationOne controlled change
8. Systems profileMemory/throughput/KV cache
9. ReportResults, limits, next experiment
Open LabsRead MethodologyView Paths