[Pallas/Mosaic GPU] Implement a more comprehensive matmul kernel to see what we're still missing #23937

copybara-service · 2024-09-26T10:30:00Z

I annotated a number of issues in the test. To make the test run, I also needed to add support
for the accumulator reference allocation and discharge in the main lowering part. Ideally,
we'd defer it all to run_scoped, but run_scoped can't allocate barriers...

PiperOrigin-RevId: 679143948

copybara-service bot assigned apaszke Sep 26, 2024

copybara-service bot force-pushed the test_679076014 branch 4 times, most recently from 3c0d4ea to 832cd74 Compare September 26, 2024 14:23

copybara-service bot force-pushed the test_679076014 branch from 832cd74 to 8599dbc Compare September 26, 2024 14:40

copybara-service bot merged commit 8599dbc into main Sep 26, 2024
1 check was pending

copybara-service bot deleted the test_679076014 branch September 26, 2024 14:40
