Glibc has merged a LoongArch-specific optimization that enables transparent hugepage (THP) aligned ELF load segments by default for LoongArch64. This reduces TLB pressure and improves instruction fetch efficiency for large binaries. Benchmarks on a Loongson 3A6000 show instruction TLB misses dropping by 72%, CPU cycles reduced by ~4.7%, and wall time savings of ~4.2% when compiling Rust's Cargo. Compiling the Linux kernel with LLD yielded ~12% wall time reduction. The patch series also introduces a new glibc.elf.thp tunable for THP-aware segment alignment.
Sort: