[Pallas/Mosaic GPU] Implement a more comprehensive matmul kernel to see what we're still missing #62482
Job | Run time |
---|---|
1m 7s | |
9m 20s | |
6m 20s | |
1m 22s | |
6m 37s | |
44s | |
1m 11s | |
26m 41s |
Job | Run time |
---|---|
1m 7s | |
9m 20s | |
6m 20s | |
1m 22s | |
6m 37s | |
44s | |
1m 11s | |
26m 41s |