Parameter Golf - 16MB Language Model Challenge

⭐ Advanced

Project Overview

Contributed to the Parameter Golf challenge, which focuses on training highly efficient language models that fit within a strict 16 MB model-size budget.

Key Work

  • Built and tested lightweight training configurations for compact transformer-style architectures (an illustrative parameter-budget sketch follows this list).
  • Explored trade-offs between model size, context window, and validation loss.
  • Improved reproducibility by documenting hyperparameter choices and run comparisons.
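
To make the 16 MB budget concrete, here is a minimal sketch of a parameter-budget check for a small decoder-only transformer. The ModelConfig name, every hyperparameter value, and the fp16 weight assumption are illustrative only; they are not the configurations actually used in the challenge.

```python
from dataclasses import dataclass


@dataclass
class ModelConfig:
    """Illustrative compact decoder-only transformer configuration."""
    vocab_size: int = 8192
    d_model: int = 256
    n_layers: int = 6
    d_ff: int = 1024
    bytes_per_param: int = 2  # fp16 weights; use 4 for fp32

    def param_count(self) -> int:
        # Token embedding (assumed tied with the output projection)
        embed = self.vocab_size * self.d_model
        # Per layer: Q/K/V/output projections plus a 2-layer MLP
        attn = 4 * self.d_model * self.d_model
        mlp = 2 * self.d_model * self.d_ff
        # Two LayerNorms per layer (scale + bias each)
        norms = 4 * self.d_model
        return embed + self.n_layers * (attn + mlp + norms)

    def size_mb(self) -> float:
        # Convert the raw weight footprint to mebibytes
        return self.param_count() * self.bytes_per_param / (1024 ** 2)


if __name__ == "__main__":
    cfg = ModelConfig()
    print(f"{cfg.param_count():,} parameters ≈ {cfg.size_mb():.1f} MB")
    assert cfg.size_mb() <= 16, "configuration exceeds the 16 MB budget"
```

With these illustrative defaults the model comes to roughly 6.8M parameters, about 13 MB in fp16, leaving some headroom under the 16 MB ceiling; the real trade-off work involves spending that headroom across vocabulary size, depth, and width against validation loss.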

Skills Demonstrated

  • Efficient ML experimentation
  • Resource-constrained model design
  • Open-source collaboration and benchmarking
