UTF8 Squeak

Alan Lovejoy squeak-dev.sourcery at forum-mail.net
Thu Jun 7 18:55:02 UTC 2007


Each String object should specify its encoding scheme.  UTF-8 should be the
default, but all commonly-encounterd encodings should be supported, and
should all be useable at once (in different String instances.) When a
Character is reified from a String, it should use the Unicode code point
values (full 32-bit value.)  Ideally, the encoding of a String should be a
function of an associated Strategy object, and not be based on having
different subclasses of String.






More information about the Squeak-dev mailing list