AMD's MI300X accelerator outperforms NVIDIA's H100 for AI inference workloads, achieving 33% higher throughput in real-world chat use cases. The MI300X proves to be a formidable competitor in the AI market, offering competitive cost, hardware availability, and impressive performance. Further optimization is expected to increase AMD's performance advantage even more.
Table of contents
Inference BenchmarksOffline ResultsOnline Results for Chat Data DistributionConclusionSort: