Matrix multiplication

In this tutorial, we will start with a simple NKI matrix multiplication kernel and optimize it step by step. In doing so, we will learn about: ...
The 03-matrix-multiplication.py tutorial is a good target to verify that the raising pass is capable of handling real use cases. The -triton-raise-block-pointer pass should be run on the ...