UTF8 Squeak (was Re: New Win32 VM [m17n testers needed])

Janko Mivšek janko.mivsek at eranova.si
Thu Jun 7 16:30:32 UTC 2007


I don't know details but I hope that UTF8 Squeak means full Unicode in 
image and UTF-8 just on the "borders", to OS, to files etc?

Best regards
Janko

subbukk wrote:
> On Wednesday 06 June 2007 5:54 pm, Lex Spoon wrote:
>> Yes, it would seem to simplify matters to use UTF-8 consistently for
>> interfacing between the image and the VM.  Instead of the VM picking
>> an encoding and telling the image which one it picked, it could go
>> ahead and convert it to UTF-8.
>>
>> This applies not just to filenames, but every place where text is
>> exchanged between the Smalltalk world and the VM, for example keyboard
>> events and the clipboard.
> This is not an easy job as the assumption of ASCII pervades Squeak. The only 
> system that I am aware of that bit the bullet and went the whole hog is Plan 
> 9. The team got the kernel, library and utilities to work with UTF8 as basic 
> character unit and wrote about experience:
>    http://plan9.bell-labs.com/sys/doc/utf.html
> 
> Is there a kernel image that just contains basic Squeak and VMMaker where one 
> could try building a UTF-8 Squeak? Smaller the better.
> 
> Regards .. Subbu
> 
> 

-- 
Janko Mivšek
AIDA/Web
Smalltalk Web Application Server
http://www.aidaweb.si



More information about the Squeak-dev mailing list