lists.squeakfoundation.org
Sign In
Sign Up
Sign In
Sign Up
Manage this list
×
Keyboard Shortcuts
Thread View
j
: Next unread message
k
: Previous unread message
j a
: Jump to all threads
j l
: Jump to MailingList overview
2023
September
August
July
June
May
April
March
February
January
2022
December
November
October
September
August
July
June
May
April
March
February
January
2021
December
November
October
September
August
July
June
May
April
March
February
January
2020
December
November
October
September
August
July
June
May
April
March
February
January
2019
December
November
October
September
August
July
June
May
April
March
February
January
2018
December
November
October
September
August
July
June
May
April
March
February
January
2017
December
November
October
September
August
July
June
May
April
March
February
January
2016
December
November
October
September
August
July
June
May
April
March
February
January
2015
December
November
October
September
August
July
June
May
April
March
February
January
2014
December
November
October
September
August
July
June
May
April
March
February
January
2013
December
November
October
September
August
July
June
May
April
March
February
January
2012
December
November
October
September
August
July
June
May
April
March
February
January
2011
December
November
October
September
August
July
June
May
April
March
February
January
2010
December
November
October
September
August
July
June
May
April
March
February
January
2009
December
November
October
September
August
July
June
May
April
March
February
January
2008
December
November
October
September
August
July
June
May
April
March
February
January
2007
December
November
October
September
August
July
June
May
April
March
February
January
2006
December
November
October
September
August
July
June
May
April
March
February
January
2005
December
November
October
September
August
July
June
May
April
March
February
List overview
Download
Packages
May 2014
----- 2023 -----
September 2023
August 2023
July 2023
June 2023
May 2023
April 2023
March 2023
February 2023
January 2023
----- 2022 -----
December 2022
November 2022
October 2022
September 2022
August 2022
July 2022
June 2022
May 2022
April 2022
March 2022
February 2022
January 2022
----- 2021 -----
December 2021
November 2021
October 2021
September 2021
August 2021
July 2021
June 2021
May 2021
April 2021
March 2021
February 2021
January 2021
----- 2020 -----
December 2020
November 2020
October 2020
September 2020
August 2020
July 2020
June 2020
May 2020
April 2020
March 2020
February 2020
January 2020
----- 2019 -----
December 2019
November 2019
October 2019
September 2019
August 2019
July 2019
June 2019
May 2019
April 2019
March 2019
February 2019
January 2019
----- 2018 -----
December 2018
November 2018
October 2018
September 2018
August 2018
July 2018
June 2018
May 2018
April 2018
March 2018
February 2018
January 2018
----- 2017 -----
December 2017
November 2017
October 2017
September 2017
August 2017
July 2017
June 2017
May 2017
April 2017
March 2017
February 2017
January 2017
----- 2016 -----
December 2016
November 2016
October 2016
September 2016
August 2016
July 2016
June 2016
May 2016
April 2016
March 2016
February 2016
January 2016
----- 2015 -----
December 2015
November 2015
October 2015
September 2015
August 2015
July 2015
June 2015
May 2015
April 2015
March 2015
February 2015
January 2015
----- 2014 -----
December 2014
November 2014
October 2014
September 2014
August 2014
July 2014
June 2014
May 2014
April 2014
March 2014
February 2014
January 2014
----- 2013 -----
December 2013
November 2013
October 2013
September 2013
August 2013
July 2013
June 2013
May 2013
April 2013
March 2013
February 2013
January 2013
----- 2012 -----
December 2012
November 2012
October 2012
September 2012
August 2012
July 2012
June 2012
May 2012
April 2012
March 2012
February 2012
January 2012
----- 2011 -----
December 2011
November 2011
October 2011
September 2011
August 2011
July 2011
June 2011
May 2011
April 2011
March 2011
February 2011
January 2011
----- 2010 -----
December 2010
November 2010
October 2010
September 2010
August 2010
July 2010
June 2010
May 2010
April 2010
March 2010
February 2010
January 2010
----- 2009 -----
December 2009
November 2009
October 2009
September 2009
August 2009
July 2009
June 2009
May 2009
April 2009
March 2009
February 2009
January 2009
----- 2008 -----
December 2008
November 2008
October 2008
September 2008
August 2008
July 2008
June 2008
May 2008
April 2008
March 2008
February 2008
January 2008
----- 2007 -----
December 2007
November 2007
October 2007
September 2007
August 2007
July 2007
June 2007
May 2007
April 2007
March 2007
February 2007
January 2007
----- 2006 -----
December 2006
November 2006
October 2006
September 2006
August 2006
July 2006
June 2006
May 2006
April 2006
March 2006
February 2006
January 2006
----- 2005 -----
December 2005
November 2005
October 2005
September 2005
August 2005
July 2005
June 2005
May 2005
April 2005
March 2005
February 2005
packages@lists.squeakfoundation.org
1 participants
201 discussions
Start a n
N
ew thread
The Trunk: Multilingual-nice.198.mcz
by commits@source.squeak.org
29 May '14
29 May '14
Nicolas Cellier uploaded a new version of Multilingual to project The Trunk:
http://source.squeak.org/trunk/Multilingual-nice.198.mcz
==================== Summary ==================== Name: Multilingual-nice.198 Author: nice Time: 29 May 2014, 4:56:27.693 pm UUID: 1229d8e0-8203-43a9-bea1-c7f155db6d21 Ancestors: Multilingual-nice.197 Oops, correct my very fresh bug in convertToUnicode: =============== Diff against Multilingual-nice.197 =============== Item was changed: ----- Method: EncodedCharSet class>>convertToUnicode: (in category 'class methods') ----- convertToUnicode: aCode "Translate aCode in our encoding, into equivalent unicode encoding" | table v | (table := self ucsTable) ifNil: [^ 16rFFFD]. + (v := table at: 1 + aCode) = -1 ifTrue: [^ 16rFFFD]. - (v := table at: 1 + self charCode) = -1 ifTrue: [^ 16rFFFD]. ^ v!
1
0
0
0
The Trunk: Multilingual-nice.198.mcz
by commits@source.squeak.org
29 May '14
29 May '14
Nicolas Cellier uploaded a new version of Multilingual to project The Trunk:
http://source.squeak.org/trunk/Multilingual-nice.198.mcz
==================== Summary ==================== Name: Multilingual-nice.198 Author: nice Time: 29 May 2014, 4:56:27.693 pm UUID: 1229d8e0-8203-43a9-bea1-c7f155db6d21 Ancestors: Multilingual-nice.197 Oops, correct my very fresh bug in convertToUnicode: =============== Diff against Multilingual-nice.197 =============== Item was changed: ----- Method: EncodedCharSet class>>convertToUnicode: (in category 'class methods') ----- convertToUnicode: aCode "Translate aCode in our encoding, into equivalent unicode encoding" | table v | (table := self ucsTable) ifNil: [^ 16rFFFD]. + (v := table at: 1 + aCode) = -1 ifTrue: [^ 16rFFFD]. - (v := table at: 1 + self charCode) = -1 ifTrue: [^ 16rFFFD]. ^ v!
1
0
0
0
The Trunk: Multilingual-nice.198.mcz
by commits@source.squeak.org
29 May '14
29 May '14
Nicolas Cellier uploaded a new version of Multilingual to project The Trunk:
http://source.squeak.org/trunk/Multilingual-nice.198.mcz
==================== Summary ==================== Name: Multilingual-nice.198 Author: nice Time: 29 May 2014, 4:56:27.693 pm UUID: 1229d8e0-8203-43a9-bea1-c7f155db6d21 Ancestors: Multilingual-nice.197 Oops, correct my very fresh bug in convertToUnicode: =============== Diff against Multilingual-nice.197 =============== Item was changed: ----- Method: EncodedCharSet class>>convertToUnicode: (in category 'class methods') ----- convertToUnicode: aCode "Translate aCode in our encoding, into equivalent unicode encoding" | table v | (table := self ucsTable) ifNil: [^ 16rFFFD]. + (v := table at: 1 + aCode) = -1 ifTrue: [^ 16rFFFD]. - (v := table at: 1 + self charCode) = -1 ifTrue: [^ 16rFFFD]. ^ v!
1
0
0
0
The Trunk: Multilingual-nice.197.mcz
by commits@source.squeak.org
29 May '14
29 May '14
Nicolas Cellier uploaded a new version of Multilingual to project The Trunk:
http://source.squeak.org/trunk/Multilingual-nice.197.mcz
==================== Summary ==================== Name: Multilingual-nice.197 Author: nice Time: 29 May 2014, 3:42:06.965 pm UUID: bfac7dc8-5362-4332-876f-39a24924c19e Ancestors: Multilingual-nice.196 Cleanup: - Remove now unused isCharset. Anyway, the meaning was strange (more like isEastAsianCharset). - Simplify CompoundTextConverter>>toUnicode: to use new convertToUnicode: rather than duplicate the job. - Add possibly missing EncodedCharSet class>>unicodeLeadingChar. - Provides a fast Latin1 class>>charFromUnicode:. =============== Diff against Multilingual-nice.196 =============== Item was changed: ----- Method: CompoundTextConverter>>toUnicode: (in category 'private') ----- toUnicode: aChar + | charset v | - | table charset v | aChar leadingChar = 0 ifTrue: [^ aChar]. + charset := (EncodedCharSet charsetAt: aChar leadingChar) charsetClass. + v := charset convertToUnicode: aChar charCode. + ^ Character leadingChar: charset unicodeLeadingChar code: v! - charset := EncodedCharSet charsetAt: aChar leadingChar. - charset isCharset ifFalse: [^ aChar]. - table := charset ucsTable. - table isNil ifTrue: [^ Character value: 16rFFFD]. - - v := table at: aChar charCode + 1. - v = -1 ifTrue: [^ Character value: 16rFFFD]. - - ^ Character leadingChar: charset unicodeLeadingChar code: v.! Item was removed: - ----- Method: EncodedCharSet class>>isCharset (in category 'class methods') ----- - isCharset - - ^ true. - ! Item was added: + ----- Method: EncodedCharSet class>>unicodeLeadingChar (in category 'class methods') ----- + unicodeLeadingChar + ^Unicode leadingChar! Item was removed: - ----- Method: LanguageEnvironment class>>isCharset (in category 'accessing') ----- - isCharset - - ^ false. - ! Item was added: + ----- Method: Latin1 class>>charFromUnicode: (in category 'class methods') ----- + charFromUnicode: uniCode + + ^ Character leadingChar: self leadingChar code: uniCode! Item was removed: - ----- Method: Unicode class>>isCharset (in category 'class methods') ----- - isCharset - - ^ false. - !
1
0
0
0
The Trunk: Multilingual-nice.197.mcz
by commits@source.squeak.org
29 May '14
29 May '14
Nicolas Cellier uploaded a new version of Multilingual to project The Trunk:
http://source.squeak.org/trunk/Multilingual-nice.197.mcz
==================== Summary ==================== Name: Multilingual-nice.197 Author: nice Time: 29 May 2014, 3:42:06.965 pm UUID: bfac7dc8-5362-4332-876f-39a24924c19e Ancestors: Multilingual-nice.196 Cleanup: - Remove now unused isCharset. Anyway, the meaning was strange (more like isEastAsianCharset). - Simplify CompoundTextConverter>>toUnicode: to use new convertToUnicode: rather than duplicate the job. - Add possibly missing EncodedCharSet class>>unicodeLeadingChar. - Provides a fast Latin1 class>>charFromUnicode:. =============== Diff against Multilingual-nice.196 =============== Item was changed: ----- Method: CompoundTextConverter>>toUnicode: (in category 'private') ----- toUnicode: aChar + | charset v | - | table charset v | aChar leadingChar = 0 ifTrue: [^ aChar]. + charset := (EncodedCharSet charsetAt: aChar leadingChar) charsetClass. + v := charset convertToUnicode: aChar charCode. + ^ Character leadingChar: charset unicodeLeadingChar code: v! - charset := EncodedCharSet charsetAt: aChar leadingChar. - charset isCharset ifFalse: [^ aChar]. - table := charset ucsTable. - table isNil ifTrue: [^ Character value: 16rFFFD]. - - v := table at: aChar charCode + 1. - v = -1 ifTrue: [^ Character value: 16rFFFD]. - - ^ Character leadingChar: charset unicodeLeadingChar code: v.! Item was removed: - ----- Method: EncodedCharSet class>>isCharset (in category 'class methods') ----- - isCharset - - ^ true. - ! Item was added: + ----- Method: EncodedCharSet class>>unicodeLeadingChar (in category 'class methods') ----- + unicodeLeadingChar + ^Unicode leadingChar! Item was removed: - ----- Method: LanguageEnvironment class>>isCharset (in category 'accessing') ----- - isCharset - - ^ false. - ! Item was added: + ----- Method: Latin1 class>>charFromUnicode: (in category 'class methods') ----- + charFromUnicode: uniCode + + ^ Character leadingChar: self leadingChar code: uniCode! Item was removed: - ----- Method: Unicode class>>isCharset (in category 'class methods') ----- - isCharset - - ^ false. - !
1
0
0
0
The Trunk: Multilingual-nice.197.mcz
by commits@source.squeak.org
29 May '14
29 May '14
Nicolas Cellier uploaded a new version of Multilingual to project The Trunk:
http://source.squeak.org/trunk/Multilingual-nice.197.mcz
==================== Summary ==================== Name: Multilingual-nice.197 Author: nice Time: 29 May 2014, 3:42:06.965 pm UUID: bfac7dc8-5362-4332-876f-39a24924c19e Ancestors: Multilingual-nice.196 Cleanup: - Remove now unused isCharset. Anyway, the meaning was strange (more like isEastAsianCharset). - Simplify CompoundTextConverter>>toUnicode: to use new convertToUnicode: rather than duplicate the job. - Add possibly missing EncodedCharSet class>>unicodeLeadingChar. - Provides a fast Latin1 class>>charFromUnicode:. =============== Diff against Multilingual-nice.196 =============== Item was changed: ----- Method: CompoundTextConverter>>toUnicode: (in category 'private') ----- toUnicode: aChar + | charset v | - | table charset v | aChar leadingChar = 0 ifTrue: [^ aChar]. + charset := (EncodedCharSet charsetAt: aChar leadingChar) charsetClass. + v := charset convertToUnicode: aChar charCode. + ^ Character leadingChar: charset unicodeLeadingChar code: v! - charset := EncodedCharSet charsetAt: aChar leadingChar. - charset isCharset ifFalse: [^ aChar]. - table := charset ucsTable. - table isNil ifTrue: [^ Character value: 16rFFFD]. - - v := table at: aChar charCode + 1. - v = -1 ifTrue: [^ Character value: 16rFFFD]. - - ^ Character leadingChar: charset unicodeLeadingChar code: v.! Item was removed: - ----- Method: EncodedCharSet class>>isCharset (in category 'class methods') ----- - isCharset - - ^ true. - ! Item was added: + ----- Method: EncodedCharSet class>>unicodeLeadingChar (in category 'class methods') ----- + unicodeLeadingChar + ^Unicode leadingChar! Item was removed: - ----- Method: LanguageEnvironment class>>isCharset (in category 'accessing') ----- - isCharset - - ^ false. - ! Item was added: + ----- Method: Latin1 class>>charFromUnicode: (in category 'class methods') ----- + charFromUnicode: uniCode + + ^ Character leadingChar: self leadingChar code: uniCode! Item was removed: - ----- Method: Unicode class>>isCharset (in category 'class methods') ----- - isCharset - - ^ false. - !
1
0
0
0
The Trunk: Collections-nice.572.mcz
by commits@source.squeak.org
29 May '14
29 May '14
Nicolas Cellier uploaded a new version of Collections to project The Trunk:
http://source.squeak.org/trunk/Collections-nice.572.mcz
==================== Summary ==================== Name: Collections-nice.572 Author: nice Time: 29 May 2014, 3:09:56.261 pm UUID: 387e24c8-d4ec-4a93-8bbd-cb72a293fb0b Ancestors: Collections-eem.571 Let asUppercase and asLowercase use the unicode tables for wide strings/characters. Care is also taken to correctly handle characters with east asian encoding, but I'm not sure how healthy is this support in trunk... Remove Character>>basicSqueakToIso which is totally obsolete (does not the right thing) and is not sent. =============== Diff against Collections-eem.571 =============== Item was changed: ----- Method: Character>>asLowercase (in category 'converting') ----- asLowercase "If the receiver is uppercase, answer its matching lowercase Character." "A tentative implementation. Eventually this should consult the Unicode table." | v | v := self charCode. (((8r101 <= v and: [v <= 8r132]) or: [16rC0 <= v and: [v <= 16rD6]]) or: [16rD8 <= v and: [v <= 16rDE]]) + ifTrue: [^ Character value: v + 8r40]. + v < 256 ifTrue: [^self]. + ^self class value: ((value < 16r400000 + ifTrue: [Unicode] + ifFalse: [(EncodedCharSet charsetAt: self leadingChar) charsetClass]) + toLowercaseCode: v)! - ifTrue: [^ Character value: value + 8r40] - ifFalse: [^ self]! Item was changed: ----- Method: Character>>asUnicode (in category 'converting') ----- asUnicode + "Answer the unicode encoding of the receiver" - | table charset v | self leadingChar = 0 ifTrue: [^ value]. + ^(EncodedCharSet charsetAt: self leadingChar) charsetClass convertToUnicode: self charCode - (charset := EncodedCharSet charsetAt: self leadingChar) - isCharset ifFalse: [^ self charCode]. - (table := charset ucsTable) - ifNil: [^ 16rFFFD]. - (v := table at: 1 + self charCode) - = -1 ifTrue: [^ 16rFFFD]. - ^ v. ! Item was changed: ----- Method: Character>>asUppercase (in category 'converting') ----- asUppercase "If the receiver is lowercase, answer its matching uppercase Character." "A tentative implementation. Eventually this should consult the Unicode table." | v | v := self charCode. (((8r141 <= v and: [v <= 8r172]) or: [16rE0 <= v and: [v <= 16rF6]]) or: [16rF8 <= v and: [v <= 16rFE]]) + ifTrue: [^ Character value: v - 8r40]. + v < 256 ifTrue: [^self]. + ^self class value: ((value < 16r400000 + ifTrue: [Unicode] + ifFalse: [(EncodedCharSet charsetAt: self leadingChar) charsetClass]) + toUppercaseCode: v)! - ifTrue: [^ Character value: value - 8r40] - ifFalse: [^ self] - ! Item was removed: - ----- Method: Character>>basicSqueakToIso (in category 'converting') ----- - basicSqueakToIso - | asciiValue | - - value < 128 ifTrue: [^ self]. - value > 255 ifTrue: [^ self]. - asciiValue := #(196 197 199 201 209 214 220 225 224 226 228 227 229 231 233 232 234 235 237 236 238 239 241 243 242 244 246 245 250 249 251 252 134 176 162 163 167 149 182 223 174 169 153 180 168 128 198 216 129 177 138 141 165 181 142 143 144 154 157 170 186 158 230 248 191 161 172 166 131 173 178 171 187 133 160 192 195 213 140 156 150 151 147 148 145 146 247 179 253 159 185 164 139 155 188 189 135 183 130 132 137 194 202 193 203 200 205 206 207 204 211 212 190 210 218 219 217 208 136 152 175 215 221 222 184 240 254 255 256 ) at: self asciiValue - 127. - ^ Character value: asciiValue. - ! Item was added: + ----- Method: WideString>>asLowercase (in category 'converting') ----- + asLowercase + ^self collect: [:e | e asLowercase]! Item was added: + ----- Method: WideString>>asUppercase (in category 'converting') ----- + asUppercase + ^self collect: [:e | e asUppercase]!
1
0
0
0
The Trunk: Collections-nice.572.mcz
by commits@source.squeak.org
29 May '14
29 May '14
Nicolas Cellier uploaded a new version of Collections to project The Trunk:
http://source.squeak.org/trunk/Collections-nice.572.mcz
==================== Summary ==================== Name: Collections-nice.572 Author: nice Time: 29 May 2014, 3:09:56.261 pm UUID: 387e24c8-d4ec-4a93-8bbd-cb72a293fb0b Ancestors: Collections-eem.571 Let asUppercase and asLowercase use the unicode tables for wide strings/characters. Care is also taken to correctly handle characters with east asian encoding, but I'm not sure how healthy is this support in trunk... Remove Character>>basicSqueakToIso which is totally obsolete (does not the right thing) and is not sent. =============== Diff against Collections-eem.571 =============== Item was changed: ----- Method: Character>>asLowercase (in category 'converting') ----- asLowercase "If the receiver is uppercase, answer its matching lowercase Character." "A tentative implementation. Eventually this should consult the Unicode table." | v | v := self charCode. (((8r101 <= v and: [v <= 8r132]) or: [16rC0 <= v and: [v <= 16rD6]]) or: [16rD8 <= v and: [v <= 16rDE]]) + ifTrue: [^ Character value: v + 8r40]. + v < 256 ifTrue: [^self]. + ^self class value: ((value < 16r400000 + ifTrue: [Unicode] + ifFalse: [(EncodedCharSet charsetAt: self leadingChar) charsetClass]) + toLowercaseCode: v)! - ifTrue: [^ Character value: value + 8r40] - ifFalse: [^ self]! Item was changed: ----- Method: Character>>asUnicode (in category 'converting') ----- asUnicode + "Answer the unicode encoding of the receiver" - | table charset v | self leadingChar = 0 ifTrue: [^ value]. + ^(EncodedCharSet charsetAt: self leadingChar) charsetClass convertToUnicode: self charCode - (charset := EncodedCharSet charsetAt: self leadingChar) - isCharset ifFalse: [^ self charCode]. - (table := charset ucsTable) - ifNil: [^ 16rFFFD]. - (v := table at: 1 + self charCode) - = -1 ifTrue: [^ 16rFFFD]. - ^ v. ! Item was changed: ----- Method: Character>>asUppercase (in category 'converting') ----- asUppercase "If the receiver is lowercase, answer its matching uppercase Character." "A tentative implementation. Eventually this should consult the Unicode table." | v | v := self charCode. (((8r141 <= v and: [v <= 8r172]) or: [16rE0 <= v and: [v <= 16rF6]]) or: [16rF8 <= v and: [v <= 16rFE]]) + ifTrue: [^ Character value: v - 8r40]. + v < 256 ifTrue: [^self]. + ^self class value: ((value < 16r400000 + ifTrue: [Unicode] + ifFalse: [(EncodedCharSet charsetAt: self leadingChar) charsetClass]) + toUppercaseCode: v)! - ifTrue: [^ Character value: value - 8r40] - ifFalse: [^ self] - ! Item was removed: - ----- Method: Character>>basicSqueakToIso (in category 'converting') ----- - basicSqueakToIso - | asciiValue | - - value < 128 ifTrue: [^ self]. - value > 255 ifTrue: [^ self]. - asciiValue := #(196 197 199 201 209 214 220 225 224 226 228 227 229 231 233 232 234 235 237 236 238 239 241 243 242 244 246 245 250 249 251 252 134 176 162 163 167 149 182 223 174 169 153 180 168 128 198 216 129 177 138 141 165 181 142 143 144 154 157 170 186 158 230 248 191 161 172 166 131 173 178 171 187 133 160 192 195 213 140 156 150 151 147 148 145 146 247 179 253 159 185 164 139 155 188 189 135 183 130 132 137 194 202 193 203 200 205 206 207 204 211 212 190 210 218 219 217 208 136 152 175 215 221 222 184 240 254 255 256 ) at: self asciiValue - 127. - ^ Character value: asciiValue. - ! Item was added: + ----- Method: WideString>>asLowercase (in category 'converting') ----- + asLowercase + ^self collect: [:e | e asLowercase]! Item was added: + ----- Method: WideString>>asUppercase (in category 'converting') ----- + asUppercase + ^self collect: [:e | e asUppercase]!
1
0
0
0
The Trunk: Collections-nice.572.mcz
by commits@source.squeak.org
29 May '14
29 May '14
Nicolas Cellier uploaded a new version of Collections to project The Trunk:
http://source.squeak.org/trunk/Collections-nice.572.mcz
==================== Summary ==================== Name: Collections-nice.572 Author: nice Time: 29 May 2014, 3:09:56.261 pm UUID: 387e24c8-d4ec-4a93-8bbd-cb72a293fb0b Ancestors: Collections-eem.571 Let asUppercase and asLowercase use the unicode tables for wide strings/characters. Care is also taken to correctly handle characters with east asian encoding, but I'm not sure how healthy is this support in trunk... Remove Character>>basicSqueakToIso which is totally obsolete (does not the right thing) and is not sent. =============== Diff against Collections-eem.571 =============== Item was changed: ----- Method: Character>>asLowercase (in category 'converting') ----- asLowercase "If the receiver is uppercase, answer its matching lowercase Character." "A tentative implementation. Eventually this should consult the Unicode table." | v | v := self charCode. (((8r101 <= v and: [v <= 8r132]) or: [16rC0 <= v and: [v <= 16rD6]]) or: [16rD8 <= v and: [v <= 16rDE]]) + ifTrue: [^ Character value: v + 8r40]. + v < 256 ifTrue: [^self]. + ^self class value: ((value < 16r400000 + ifTrue: [Unicode] + ifFalse: [(EncodedCharSet charsetAt: self leadingChar) charsetClass]) + toLowercaseCode: v)! - ifTrue: [^ Character value: value + 8r40] - ifFalse: [^ self]! Item was changed: ----- Method: Character>>asUnicode (in category 'converting') ----- asUnicode + "Answer the unicode encoding of the receiver" - | table charset v | self leadingChar = 0 ifTrue: [^ value]. + ^(EncodedCharSet charsetAt: self leadingChar) charsetClass convertToUnicode: self charCode - (charset := EncodedCharSet charsetAt: self leadingChar) - isCharset ifFalse: [^ self charCode]. - (table := charset ucsTable) - ifNil: [^ 16rFFFD]. - (v := table at: 1 + self charCode) - = -1 ifTrue: [^ 16rFFFD]. - ^ v. ! Item was changed: ----- Method: Character>>asUppercase (in category 'converting') ----- asUppercase "If the receiver is lowercase, answer its matching uppercase Character." "A tentative implementation. Eventually this should consult the Unicode table." | v | v := self charCode. (((8r141 <= v and: [v <= 8r172]) or: [16rE0 <= v and: [v <= 16rF6]]) or: [16rF8 <= v and: [v <= 16rFE]]) + ifTrue: [^ Character value: v - 8r40]. + v < 256 ifTrue: [^self]. + ^self class value: ((value < 16r400000 + ifTrue: [Unicode] + ifFalse: [(EncodedCharSet charsetAt: self leadingChar) charsetClass]) + toUppercaseCode: v)! - ifTrue: [^ Character value: value - 8r40] - ifFalse: [^ self] - ! Item was removed: - ----- Method: Character>>basicSqueakToIso (in category 'converting') ----- - basicSqueakToIso - | asciiValue | - - value < 128 ifTrue: [^ self]. - value > 255 ifTrue: [^ self]. - asciiValue := #(196 197 199 201 209 214 220 225 224 226 228 227 229 231 233 232 234 235 237 236 238 239 241 243 242 244 246 245 250 249 251 252 134 176 162 163 167 149 182 223 174 169 153 180 168 128 198 216 129 177 138 141 165 181 142 143 144 154 157 170 186 158 230 248 191 161 172 166 131 173 178 171 187 133 160 192 195 213 140 156 150 151 147 148 145 146 247 179 253 159 185 164 139 155 188 189 135 183 130 132 137 194 202 193 203 200 205 206 207 204 211 212 190 210 218 219 217 208 136 152 175 215 221 222 184 240 254 255 256 ) at: self asciiValue - 127. - ^ Character value: asciiValue. - ! Item was added: + ----- Method: WideString>>asLowercase (in category 'converting') ----- + asLowercase + ^self collect: [:e | e asLowercase]! Item was added: + ----- Method: WideString>>asUppercase (in category 'converting') ----- + asUppercase + ^self collect: [:e | e asUppercase]!
1
0
0
0
The Trunk: Multilingual-nice.196.mcz
by commits@source.squeak.org
29 May '14
29 May '14
Nicolas Cellier uploaded a new version of Multilingual to project The Trunk:
http://source.squeak.org/trunk/Multilingual-nice.196.mcz
==================== Summary ==================== Name: Multilingual-nice.196 Author: nice Time: 29 May 2014, 2:59:48.329 pm UUID: 07f39804-8e14-49be-a55f-e8d2b809fddd Ancestors: Multilingual-nice.195 Generalized what I did to Unicode: provide an API dealing with codes Add charsetClass conveniency for this reason: (EncodedCharSet charsetAt: leadingChar) is not always a charset, it can also be a LanguageEnvironment. (EncodedCharSet charsetAt: leadingChar) charsetClass will always be a charset (an EncodedCharSet subclass) =============== Diff against Multilingual-nice.195 =============== Item was added: + ----- Method: EncodedCharSet class>>charsetClass (in category 'class methods') ----- + charsetClass + ^self! Item was added: + ----- Method: EncodedCharSet class>>convertToUnicode: (in category 'class methods') ----- + convertToUnicode: aCode + "Translate aCode in our encoding, into equivalent unicode encoding" + | table v | + (table := self ucsTable) ifNil: [^ 16rFFFD]. + (v := table at: 1 + self charCode) = -1 ifTrue: [^ 16rFFFD]. + ^ v! Item was changed: ----- Method: EncodedCharSet class>>isDigit: (in category 'character classification') ----- isDigit: char + "Answer whether char has the code of a digit in this encoding." + ^self isDigitCode: char charCode - "Answer whether the receiver is a digit." - - | value | - value := char asciiValue. - ^ value >= 48 and: [value <= 57]. ! Item was added: + ----- Method: EncodedCharSet class>>isDigitCode: (in category 'character classification') ----- + isDigitCode: anInteger + "Answer whether anInteger is the code of a digit." + + ^ anInteger >= 48 and: [anInteger <= 57]. + ! Item was changed: ----- Method: EncodedCharSet class>>isLetter: (in category 'character classification') ----- + isLetter: char + "Answer whether char has the code of a letter in this encoding." + ^self isLetterCode: char charCode! - isLetter: char - "Answer whether the receiver is a letter." - - | value | - value := char asciiValue. - ^ (8r141 <= value and: [value <= 8r172]) or: [8r101 <= value and: [value <= 8r132]]. - ! Item was added: + ----- Method: EncodedCharSet class>>isLetterCode: (in category 'character classification') ----- + isLetterCode: anInteger + "Answer whether anInteger is the code of a letter." + + ^ (8r141 <= anInteger and: [anInteger <= 8r172]) or: [8r101 <= anInteger and: [anInteger <= 8r132]]. + ! Item was changed: ----- Method: EncodedCharSet class>>isLowercase: (in category 'character classification') ----- + isLowercase: char + "Answer whether char has the code of a lowercase letter in this encoding." + ^self isLowercaseCode: char charCode! - isLowercase: char - "Answer whether the receiver is a lowercase letter. - (The old implementation answered whether the receiver is not an uppercase letter.)" - - | value | - value := char asciiValue. - ^ 8r141 <= value and: [value <= 8r172]. - ! Item was added: + ----- Method: EncodedCharSet class>>isLowercaseCode: (in category 'character classification') ----- + isLowercaseCode: anInteger + "Answer whether anInteger is the code of a lowercase letter." + + ^ 8r141 <= anInteger and: [anInteger <= 8r172]. + ! Item was changed: ----- Method: EncodedCharSet class>>isUppercase: (in category 'character classification') ----- + isUppercase: char + "Answer whether char has the code of an uppercase letter in this encoding." + ^self isUppercaseCode: char charCode! - isUppercase: char - "Answer whether the receiver is an uppercase letter. - (The old implementation answered whether the receiver is not a lowercase letter.)" - - | value | - value := char asciiValue. - ^ 8r101 <= value and: [value <= 8r132]. - ! Item was added: + ----- Method: EncodedCharSet class>>isUppercaseCode: (in category 'character classification') ----- + isUppercaseCode: anInteger + "Answer whether anInteger is the code of an uppercase letter." + + ^ 8r101 <= anInteger and: [anInteger <= 8r132]. + ! Item was removed: - ----- Method: GB2312 class>>isLetter: (in category 'character classification') ----- - isLetter: char - - | value leading | - - leading := char leadingChar. - value := char charCode. - - leading = 0 ifTrue: [^ super isLetter: char]. - - value := value // 94 + 1. - ^ 1 <= value and: [value < 84]. - ! Item was added: + ----- Method: GB2312 class>>isLetterCode: (in category 'character classification') ----- + isLetterCode: anInteger + | value | + value := anInteger // 94 + 1. + ^ 1 <= value and: [value < 84]. + ! Item was removed: - ----- Method: JISX0208 class>>isLetter: (in category 'character classification') ----- - isLetter: char - - | value leading | - - leading := char leadingChar. - value := char charCode. - - leading = 0 ifTrue: [^ super isLetter: char]. - - value := value // 94 + 1. - ^ 1 <= value and: [value < 84]. - ! Item was added: + ----- Method: JISX0208 class>>isLetterCode: (in category 'character classification') ----- + isLetterCode: anInteger + | value | + value := anInteger // 94 + 1. + ^ 1 <= value and: [value < 84]. + ! Item was removed: - ----- Method: KSX1001 class>>isLetter: (in category 'character classification') ----- - isLetter: char - - | value leading | - - leading := char leadingChar. - value := char charCode. - - leading = 0 ifTrue: [^ super isLetter: char]. - - value := value // 94 + 1. - ^ 1 <= value and: [value < 84]. - ! Item was added: + ----- Method: KSX1001 class>>isLetterCode: (in category 'character classification') ----- + isLetterCode: anInteger + | value | + value := anInteger // 94 + 1. + ^ 1 <= value and: [value < 84]. + ! Item was added: + ----- Method: LanguageEnvironment class>>charsetClass (in category 'accessing') ----- + charsetClass + ^Unicode! Item was changed: ----- Method: LanguageEnvironment class>>digitValueOf: (in category 'accessing') ----- digitValueOf: char "Answer 0-9 if the receiver is $0-$9, 10-35 if it is $A-$Z, and < 0 otherwise. This is used to parse literal numbers of radix 2-36." + ^ self charsetClass digitValueOf: char. - ^ Unicode digitValueOf: char. ! Item was changed: ----- Method: LanguageEnvironment class>>isDigit: (in category 'accessing') ----- isDigit: char + ^ self charsetClass isDigit: char. - ^ Unicode isDigit: char. ! Item was changed: ----- Method: LanguageEnvironment class>>isLetter: (in category 'accessing') ----- isLetter: char + ^ self charsetClass isLetter: char. - ^ Unicode isLetter: char. ! Item was changed: ----- Method: LanguageEnvironment class>>isLowercase: (in category 'accessing') ----- isLowercase: char + ^ self charsetClass isLowercase: char. - ^ Unicode isLowercase: char. ! Item was changed: ----- Method: LanguageEnvironment class>>isUppercase: (in category 'accessing') ----- isUppercase: char + ^ self charsetClass isUppercase: char. - ^ Unicode isUppercase: char. ! Item was added: + ----- Method: Latin1 class>>convertToUnicode: (in category 'class methods') ----- + convertToUnicode: aCode + ^aCode! Item was removed: - ----- Method: Latin1 class>>isLetter: (in category 'character classification') ----- - isLetter: char - "Answer whether the receiver is a letter." - - ^ Unicode isLetter: char. - - ! Item was added: + ----- Method: Latin1 class>>isLetterCode: (in category 'character classification') ----- + isLetterCode: anInteger + ^ Unicode isLetterCode: anInteger + + ! Item was added: + ----- Method: Unicode class>>convertToUnicode: (in category 'class methods') ----- + convertToUnicode: aCode + ^aCode! Item was removed: - ----- Method: Unicode class>>isDigit: (in category 'character classification') ----- - isDigit: char - ^self isDigitCode: char charCode! Item was removed: - ----- Method: Unicode class>>isLetter: (in category 'character classification') ----- - isLetter: char - ^self isLetterCode: char charCode! Item was removed: - ----- Method: Unicode class>>isLowercase: (in category 'character classification') ----- - isLowercase: char - ^self isLowercaseCode: char charCode! Item was removed: - ----- Method: Unicode class>>isUppercase: (in category 'character classification') ----- - isUppercase: char - ^self isUppercaseCode: char charCode!
1
0
0
0
← Newer
1
2
3
4
...
21
Older →
Jump to page:
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
Results per page:
10
25
50
100
200