Ashwath V A 〣 MysticSlice

About
Projects
Blogs
Resume

LLM

Learning GRPO: Experiments and Insights from Fine-Tuning an LLM

This blog documents a series of experiments and results on training an LLM on a toy arithmetic task called Countdown.

November 28, 2025

© 2025 Ashwath V A 〣 MysticSlice Powered by Hugo & PaperMod