Code for the the paper "Emergence in non-neural models: grokking modular arithmetic via average gradient outer product" which can be found here.