Code for the the paper "Emergence in non-neural models: grokking modular arithmetic via average gradient outer product" which can be found here.
Some results have been hidden because they may be inaccessible to you
Show inaccessible resultsSome results have been hidden because they may be inaccessible to you
Show inaccessible results