An optimization introduced in the .NET 9.0 runtime that combines two loads into a single load may cause a performance regression on Apple hardware. The optimization relies on the ARM64 ldp instruction, which loads two consecutive values from memory into two registers at once. On Apple silicon, ldp may incur a penalty when it reads memory that was recently written.
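
To make the access pattern concrete, here is a minimal sketch in C rather than .NET. The names `pair_t`, `write_pair`, and `pair_sum` are hypothetical and do not come from the original post. It shows a store to two adjacent 64-bit fields followed by a read of both, which is the kind of code where an ARM64 compiler may choose a single ldp. Whether ldp is actually emitted depends on the compiler, its flags, and inlining decisions.

```c
#include <stdint.h>

/* Two adjacent 64-bit fields: 16 contiguous bytes that an ARM64
   compiler may read with one ldp instead of two separate ldr
   instructions. (Illustrative type, not from the original post.) */
typedef struct {
    uint64_t first;
    uint64_t second;
} pair_t;

/* Writes both fields, so the memory counts as recently written. */
void write_pair(pair_t *p, uint64_t a, uint64_t b) {
    p->first = a;
    p->second = b;
}

/* Reads both fields back. If this call is not inlined into the
   writer, the two loads go to memory and are candidates for being
   combined into a single paired load. */
uint64_t pair_sum(const pair_t *p) {
    return p->first + p->second;
}
```

Calling `write_pair` and then `pair_sum` on the same object reproduces the write-then-paired-read sequence described above. Measuring any penalty would require a real benchmark on Apple silicon, with care taken to keep the compiler from inlining the reader and forwarding the stored values through registers.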

From lemire.me