[squeak-dev] how to create an UTF-8 character

stephane ducasse stephane.ducasse at free.fr
Mon Sep 29 16:53:06 UTC 2008


>>>> Am I the only one using the generic en/decoding functionality in
>>> Squeak in the form of #convertTo/FromEncoding?
>>>
>>> Convert from "Squeak" to UTF-8
>>> aString convertToEncoding: 'utf-8'
>>
>>
>> do I understand correctly that such an aString is a sequence of
>> Unicode codepoints?
>>>
> First of all, UTF-8 text is a sequence of bytes. These bytes are a
> space-optimized encoding (UTF-8) of code points. If you decode those
> bytes you get your code points (Unicode). From a sequence of code
> points you can derive a character. In most cases (for us westerners)
> a character will be a single code point AFAIK.
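
The layering described in the quoted reply (bytes, then code points, then a character) can be sketched roughly as follows. This is only an illustration in Python, not Squeak code, and it says nothing about how Squeak's Character or String classes are implemented. The variable names are made up for the example, and "é" is picked as a character that can be written either as one code point or as two.

```python
import unicodedata

# One user-perceived character, written two ways (illustrative names).
precomposed = "\u00e9"   # é as a single code point, U+00E9
decomposed = "e\u0301"   # e followed by U+0301 COMBINING ACUTE ACCENT

# Encoding turns code points into UTF-8 bytes.
utf8_bytes = precomposed.encode("utf-8")
print(list(utf8_bytes))                    # [195, 169]

# Decoding those bytes gives the code point back.
decoded = utf8_bytes.decode("utf-8")
print([hex(ord(c)) for c in decoded])      # ['0xe9']

# The decomposed form needs two code points (three UTF-8 bytes)
# for what a reader sees as the same character.
print([hex(ord(c)) for c in decomposed])   # ['0x65', '0x301']
print(len(decomposed.encode("utf-8")))     # 3

# NFC normalization composes the pair into the single code point.
print(unicodedata.normalize("NFC", decomposed) == precomposed)  # True
```

The point of the second half is that "one character" and "one code point" coincide for most Western text but are not the same thing in general.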

I'm really trying to understand this in Squeak. :)
What is the thing we call a character, then?
Is it a codepoint, or the glyph looked up in a font table?

Stef



