Subject: Re: [boost] [General] Always treat std::strings as UTF-8
From: Peter Dimov (pdimov_at_[hidden])
Date: 2011-01-14 12:13:43


Jens Finkhäuser wrote:

> Please excuse me if I'm stating the obvious, but I feel I should mention
> that binary sorting is not collation.

Yes, you're right. Sorting UTF-8 lexicographically as sequences of 8-bit
unsigned integers gives the same result as sorting the UTF-32 equivalents
as sequences of 32-bit unsigned integers.
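
To make the claim concrete, here is a minimal sketch that compares the two
orderings. The names to_utf8 and same_order are made up for illustration,
and the encoder assumes its input holds only valid Unicode scalar values
(no surrogates, nothing above U+10FFFF).

```cpp
#include <string>
#include <vector>

// Hypothetical helper: encode code points as UTF-8 bytes.
// Assumes every element is a valid Unicode scalar value.
std::vector<unsigned char> to_utf8(const std::u32string& s)
{
    std::vector<unsigned char> out;
    for (char32_t c : s) {
        if (c < 0x80) {
            // 1 byte: 0xxxxxxx
            out.push_back(static_cast<unsigned char>(c));
        } else if (c < 0x800) {
            // 2 bytes: 110xxxxx 10xxxxxx
            out.push_back(static_cast<unsigned char>(0xC0 | (c >> 6)));
            out.push_back(static_cast<unsigned char>(0x80 | (c & 0x3F)));
        } else if (c < 0x10000) {
            // 3 bytes: 1110xxxx 10xxxxxx 10xxxxxx
            out.push_back(static_cast<unsigned char>(0xE0 | (c >> 12)));
            out.push_back(static_cast<unsigned char>(0x80 | ((c >> 6) & 0x3F)));
            out.push_back(static_cast<unsigned char>(0x80 | (c & 0x3F)));
        } else {
            // 4 bytes: 11110xxx 10xxxxxx 10xxxxxx 10xxxxxx
            out.push_back(static_cast<unsigned char>(0xF0 | (c >> 18)));
            out.push_back(static_cast<unsigned char>(0x80 | ((c >> 12) & 0x3F)));
            out.push_back(static_cast<unsigned char>(0x80 | ((c >> 6) & 0x3F)));
            out.push_back(static_cast<unsigned char>(0x80 | (c & 0x3F)));
        }
    }
    return out;
}

// Hypothetical check: true if the byte-wise order of the UTF-8 forms
// agrees with the code-point order of the UTF-32 forms.
bool same_order(const std::u32string& a, const std::u32string& b)
{
    // std::vector's operator< is a lexicographical comparison of
    // unsigned bytes; std::u32string's operator< compares code points.
    return (to_utf8(a) < to_utf8(b)) == (a < b);
}
```

Calling same_order(U"\u00E9", U"z") or same_order(U"\U0001F600", U"\uFFFD")
should return true. The bytes are held as unsigned char on purpose:
comparing them as plain signed char values would put the lead bytes of
multi-byte sequences before ASCII and break the equivalence.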