AMD is implementing device-side Profile-Guided Optimization (PGO) for ROCm/HIP to improve GPU kernel performance. The approach includes uniformity-aware PGO, which detects at runtime whether a GPU branch is uniform (every thread in a wavefront takes the same path) or divergent, and so avoids the performance regressions that standard CPU PGO techniques can cause on GPUs.
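The uniform-versus-divergent distinction can be illustrated with a small host-side simulation. This is only a sketch under stated assumptions: `BranchProfile`, `record`, `is_uniform`, and the threshold are hypothetical names and values chosen for illustration, and nothing here reflects how AMD's implementation actually collects or consumes profile data.

```python
from dataclasses import dataclass


@dataclass
class BranchProfile:
    """Hypothetical per-branch counters (not the ROCm/HIP profile format)."""

    executions: int = 0  # times a wavefront reached this branch
    uniform: int = 0     # times every active lane agreed on the outcome

    def record(self, lane_predicates):
        """Record one wavefront-level execution of the branch.

        lane_predicates holds the branch outcome (True/False) of each
        active lane. An empty list means no lane was active and is ignored.
        """
        if not lane_predicates:
            return
        self.executions += 1
        taken = sum(1 for p in lane_predicates if p)
        # Uniform means all lanes took the branch or none did.
        if taken == 0 or taken == len(lane_predicates):
            self.uniform += 1

    def is_uniform(self, threshold=1.0):
        """Classify the branch as uniform if enough executions were uniform.

        The threshold is an assumption for this sketch. A branch that was
        never executed is conservatively reported as not uniform.
        """
        if self.executions == 0:
            return False
        return self.uniform / self.executions >= threshold


# Example: two wavefronts of 64 lanes each.
always_agree = BranchProfile()
always_agree.record([True] * 64)    # all lanes take the branch
always_agree.record([False] * 64)   # all lanes skip the branch

split = BranchProfile()
split.record([True] * 32 + [False] * 32)  # lanes disagree: divergent
```

In this toy model, `always_agree` is classified as uniform even though its taken/not-taken counts are evenly split, while `split` is divergent. That is the kind of information a plain taken/not-taken count, as used in CPU PGO, does not capture.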