Cloudflare's Gen 13 server launch details how they doubled edge compute throughput by switching from AMD EPYC Genoa-X (with large 3D V-Cache) to high-core-count AMD EPYC Turin 9965 processors. The key challenge was that Turin's 192 cores share far less L3 cache per core (2MB vs 12MB), causing severe latency regressions under
Table of contents
What AMD EPYCTurin brings to the tableDiagnosing the problem with performance countersThe tradeoff: latency vs. throughputIncremental gains with performance tuningThe opportunity: FL2 was already in progressProving it out: FL2 on Gen 13Generational improvement with Gen 13Gen 13 + FL2: ready for the edgeSort: