[GPUHeuristics] Improve large GEMM intrinsic selection on CDNA4 #32079

Triggered via pull request April 15, 2026 04:52
Status Success
Total duration 42m 7s
Artifacts 3

pkgci.yml

on: pull_request
Build Packages / Linux Release (x86_64): 6m 29s
Matrix: Test ONNX / test_onnx_models
Matrix: Test ONNX / test_onnx_ops
Matrix: Test PJRT plugin / Build and test
Matrix: Test Sharktank / model-test
Matrix: Test Sharktank / tests
Unit Test / Linux (x86_64): 2m 12s
Test AMD MI325 / test_mi325: 1m 58s
Test AMD W7900 / test_w7900: 2m 58s
Test AMD R9700 / test_r9700: 2m 27s
Test Android / android_arm64: 7m 55s
Test RISC-V 64 / riscv64: 13m 28s
Test TensorFlow / Linux (x86_64): 1m 11s
Test AMD MI355 / test_mi355: 3m 19s
Matrix: Test Torch / test_torch_ops
Matrix: Test Torch / tests
pkgci_summary / summary: 5s

Annotations

2 errors
Test AMD MI355 / test_mi355
WARNING: The directory '/github/home/.cache/pip' or its parent directory is not owned or is not writable by the current user. The cache has been disabled. Check the permissions and owner of that directory. If executing pip with sudo, you should use sudo's -H flag.
Test AMD MI355 / test_mi355
WARNING: Running pip as the 'root' user can result in broken permissions and conflicting behaviour with the system package manager. It is recommended to use a virtual environment instead: https://pip.pypa.io/warnings/venv

Artifacts

Produced during runtime
Name | Size | Digest
linux_x86_64_release_packages | 86.8 MB | sha256:5658654f4a85d45d95a6c1da507e717d6b0e50c4e88ae202370d6c2c41197372
torch_models_amdgpu_mi325_summary.json | 776 Bytes | sha256:092861d91583c545eee02f267831d20484c984ba7c7e2d286f88e76357efa1ae
torch_models_cpu_task_summary.json | 410 Bytes | sha256:0d6b57981c67a94ddc6474f14014236f5e41a44cb22fd0cfbd4cb273829b5853