Unrolling thread...

    nanochat now trains GPT-2 capability model in just 2 ho