$include_dir="/home/hyper-archives/boost/include"; include("$include_dir/msg-header.inc") ?>
Subject: Re: [boost] [rfc] Unicode GSoC project
From: Phil Endecott (spam_from_boost_dev_at_[hidden])
Date: 2009-05-13 06:55:17
Hi Mathias,
Mathias Gaunard wrote:
> I have been working on range adaptors to iterate over code points in
> an UTF-x string as well as converting back those code points to UTF-y
> for the past week
I would be interested to see this code. I encourage you to share what
you have done as soon as possible, to get prompt feedback.
> short documentation
> http://mathias.gaunard.emi.u-bordeaux1.fr/unicode/doc/html/
Some feedback based on that document:
UTF-16
....
This is the recommended encoding for dealing with Unicode.
Recommended by who? It's not the encoding that I would normally recommend.
make_utf8(Range&& range);
Assumes range range is a properly encoded UTF-8 range in
Normalization Form C.
Iterating the range may throw an exception if it isn't.
as_utf8(Range&& range);
Return type is a model of UnicodeRange whose value type is uchar8_t.
To me, the word "make" suggests that the former is actually doing a
conversion. But it's the latter, "as", that does that. Can we think
of something better? (Can anyone suggest any precidents?)
Regards, Phil.