UTF-16 encoding is significantly simpler to process than UTF-8 when finding the next Unicode code point in C++ strings. While UTF-8 requires 1-4 byte units per code point with complex parsing logic (84 lines of code), UTF-16 uses only 1-2 code units with much simpler implementation (34 lines). The author questions whether
Sort: