Unicode patch

Damien Cassou damien.cassou at gmail.com
Thu Jun 14 01:18:55 UTC 2007

Hi Janko,

Did you try loading your changeset in a Squeak 3.10 image? What is the
status of the tests?

If your changeset is good enough and you write unit tests for it, it may
be worth including in 3.10.


2007/6/12, Janko Mivšek <janko.mivsek at eranova.si>:
> Dear Squeakers,
> Please find attached a Unicode patch, which improves the
> internal representation of Unicode characters. It:
> 1. introduces a new class, TwoByteString
> 2. changes at:put: on ByteString and other such methods to "scale" the string
>     to TwoByteString or FourByteString, depending on the width of the character
> 3. renames WideString to FourByteString for consistency, and also
>     renames all related methods
> 4. adds the category CollectionTests-Unicode with tests
> 5. adds the class UnicodeBenchmarking for measuring the speed of
>     Unicode handling, such as at:put: speed and UTF8 conversions, on the included
>     English, French, Slovenian, Russian and Chinese texts.
> ByteString and TwoByteString also include UTF8 conversion methods, which
> will probably be moved to UTF8TextConverter later.
> I hope this patch will help improve Squeak's Unicode support a bit.
> Best regards
> Janko
> --
> Janko Mivšek
> AIDA/Web
> Smalltalk Web Application Server
> http://www.aidaweb.si
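
For readers who have not opened the changeset, the "scaling" idea in item 2 can be sketched roughly as follows. This is only an illustration in Python, not the patch's Smalltalk code: the class name ScalingString, the method at_put, and the width limits (0xFF for one byte, 0xFFFF for two) are assumptions made for the example, and indexing here is 0-based, unlike Smalltalk.

```python
from array import array

# Hypothetical sketch; none of these names come from the patch itself.
# Each level: (logical width in bytes, array typecode, largest code point that fits).
_LEVELS = [(1, "B", 0xFF), (2, "H", 0xFFFF), (4, "L", 0x10FFFF)]


def _level_for(code_point):
    """Return the index of the narrowest level that can hold code_point."""
    for index, (_, _, limit) in enumerate(_LEVELS):
        if code_point <= limit:
            return index
    raise ValueError("not a Unicode code point")


class ScalingString:
    """A mutable string that widens its storage only when a character needs it."""

    def __init__(self, text=""):
        codes = [ord(ch) for ch in text]
        self._level = max((_level_for(c) for c in codes), default=0)
        self._data = array(_LEVELS[self._level][1], codes)

    @property
    def width(self):
        """Logical bytes per character: 1, 2 or 4."""
        return _LEVELS[self._level][0]

    def at_put(self, index, char):
        """Store char at index (0-based), widening the storage first if required."""
        code = ord(char)
        needed = _level_for(code)
        if needed > self._level:
            # Copy the existing characters into a wider array, then continue.
            self._level = needed
            self._data = array(_LEVELS[needed][1], list(self._data))
        self._data[index] = code

    def __str__(self):
        return "".join(chr(c) for c in self._data)
```

In this sketch a string only ever grows wider; it never shrinks back when the wide character is overwritten. Whether the patch behaves the same way is not stated in the message.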

Damien Cassou
