In theory you could do the pairing any way you want; if you train from scratch, it wouldn't matter. But if you use pretrained weights, you'll have to be consistent: either you change your RoPE ...
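To make the consistency point concrete, here is a minimal NumPy sketch. It assumes "pairing" means the two common RoPE layouts: interleaved pairs `(x[2i], x[2i+1])` and half-split pairs `(x[i], x[i + d/2])`. All function names are hypothetical and not taken from any particular library. The sketch only shows that the two layouts agree once the head dimensions are permuted the same way for both queries and keys.

```python
import numpy as np

def rope_angles(pos, dim, base=10000.0):
    # One rotation angle per pair of dimensions (dim/2 angles in total).
    inv_freq = base ** (-np.arange(0, dim, 2) / dim)
    return pos * inv_freq

def rope_interleaved(x, pos):
    # Pairs are (x[0], x[1]), (x[2], x[3]), ...
    theta = rope_angles(pos, x.shape[-1])
    cos, sin = np.cos(theta), np.sin(theta)
    x_even, x_odd = x[0::2], x[1::2]
    out = np.empty_like(x)
    out[0::2] = x_even * cos - x_odd * sin
    out[1::2] = x_even * sin + x_odd * cos
    return out

def rope_half_split(x, pos):
    # Pairs are (x[0], x[d/2]), (x[1], x[d/2 + 1]), ...
    half = x.shape[-1] // 2
    theta = rope_angles(pos, x.shape[-1])
    cos, sin = np.cos(theta), np.sin(theta)
    x1, x2 = x[:half], x[half:]
    return np.concatenate([x1 * cos - x2 * sin, x1 * sin + x2 * cos])

def interleaved_to_half_perm(dim):
    # Index order that moves even dims to the first half, odd dims to the second.
    return np.concatenate([np.arange(0, dim, 2), np.arange(1, dim, 2)])

if __name__ == "__main__":
    rng = np.random.default_rng(0)
    q, k = rng.standard_normal(8), rng.standard_normal(8)
    perm = interleaved_to_half_perm(8)

    # Same attention score when q and k are permuted consistently.
    score_a = rope_interleaved(q, 5) @ rope_interleaved(k, 3)
    score_b = rope_half_split(q[perm], 5) @ rope_half_split(k[perm], 3)
    print(np.isclose(score_a, score_b))
```

With pretrained weights, the permutation would have to be applied to the output rows of the query and key projections (per head) if you switch layouts. Applying the other layout to unpermuted vectors generally gives different attention scores.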