In order to run LLMs more performantly, we need to implement fusing patterns for the following ops found in sub issues. These are the ops found in the llama3.2 architecture. More will be added to the ...