UTF8 Squeak

Bert Freudenberg bert at freudenbergs.de
Tue Jun 12 16:55:01 UTC 2007


On Jun 12, 2007, at 18:32 , Michael Rueger wrote:

> Bert Freudenberg wrote:
>
>> Also, the "purity" of Unicode strings does not translate directly  
>> into
>
> Speaking of which...
> In Sophie we discovered that Squeak still uses a not quite unicode  
> mapping in the text converters so we had to roll our own  
> translation tables (see Sophie-RTF package for the complete list).
> As an example the MacRoman conversion below.
>
> Probably to keep the font machinery happy?
>
> Comments? Explanations? Corrections?

That's just historical ... about 1999. It is a reversible mapping  
between MacRoman and Latin1 which maps the characters common to both  
to their counterparts, but the rest is just filled up to preserve  
different codes while fitting into a byte. Guess we should use a  
table like Sophie's nowadays.


> Michael
>
>
> Squeak
> #(196 197 199 201 209 214 220 225 224 226 228 227 229 231 233 232  
> 234 235 237 236 238 239 241 243 242 244 246 245 250 249 251 252 134  
> 176 162 163 167 149 182 223 174 169 153 180 168 128 198 216 129 177  
> 138 141 165 181 142 143 144 154 157 170 186 158 230 248 191 161 172  
> 166 131 173 178 171 187 133 160 192 195 213 140 156 150 151 147 148  
> 145 146 247 179 255 159 185 164 139 155 188 189 135 183 130 132 137  
> 194 202 193 203 200 205 206 207 204 211 212 190 210 218 219 217 208  
> 136 152 175 215 221 222 184 240 253 254 )
>
> Sophie
> #(
> 196 197 199 201 209 214 220 225 224 226 228 227 229 231 233 232
> 234 235 237 236 238 239 241 243 242 244 246 245 250 249 251 252
> 8224 176 162 163 167 8226 182 223 174 169 8482 180 168 8800 198 216
> 8734 177 8804 8805 165 181 8706 8721 8719 960 8747 170 186 937 230 248
> 191 161 172 8730 402 8776 8710 171 187 8230 160 192 195 213 338 339
> 8211 8212 8220 8221 8216 8217 247 9674 255 376 8260 8364 8249 8250  
> 64257 64258
> 8225 183 8218 8222 8240 194 202 193 203 200 205 206 207 204 211 212
> 63743 210 218 219 217 305 710 732 175 728 729 730 184 733 731 711 256)
>

- Bert -





More information about the Squeak-dev mailing list