Unicode support

ohshima at is.titech.ac.jp ohshima at is.titech.ac.jp
Sun Sep 26 23:43:48 UTC 1999


  Hi Hannes,

> Could you elaborate on this?
> Which other encodings are you considering?

  I'm thinking of something like an aggregation of domestic
"encodings."  The domestic encodings include GB 2312, JIS X
0208, etc.  In the current implementation, they are encoded
in 30 bit space.  the 6 bit of the 30 is used for what I
call "encoding tag" and the 24 bit is used for the
characters in the encoding.  (The boundary may have to be
changed.)

  The "European part" (the characters come from ISO-8859-*
family except 8859-6) could have (and should have) the same
encoding tag.

  The display system (in Squeak) switches the behavior
according to the encoding tag of the character.

  This explanation might be too rough and too vague, but
there are many TBDs.

  Thank you.

                                             OHSHIMA Yoshiki
                Dept. of Mathematical and Computing Sciences
                               Tokyo Institute of Technology 





More information about the Squeak-dev mailing list