[Pallas/Mosaic GPU] Implement a more comprehensive matmul kernel to see what we're still missing #62476
| Job | Run time |
|---|---|
| | 6m 49s |
| | 42s |
| | 1m 22s |
| | 8m 56s |
| | 1m 17s |
| | 5m 44s |
| | 59s |
| Total | 25m 49s |