$include_dir="/home/hyper-archives/boost/include"; include("$include_dir/msg-header.inc") ?>
From: Rogier van Dalen (rogiervd_at_[hidden])
Date: 2005-03-18 16:41:51
On Thu, 17 Mar 2005 17:52:25 +0100, Erik Wien <wien_at_[hidden]> wrote:
> What exactly do mean by the term "character"? Abstract characters?
I really need to remember the correct terminology - what I mean is the
thing "a user thinks of as a character", a "grapheme cluster", of
which the Unicode standard says:
"[T]here is a core concept of "characters that should be kept
together" that can be defined for the Unicode Standard in a
language-independent way. This core concept is known as a grapheme
cluster, and it consists of any combining character sequence that
contains only nonspacing combining marks, or any sequence of
characters that constitutes a Hangul syllable (possibly followed by
one or more nonspacing marks)."
I believe this is what a Unicode library should use as its basic unit.
Sorry for any confusion caused,
Rogier