[Cuis] Sorting Unicode strings (Re: [Unicode] collation sequences (Re: [squeak-dev] Unicode Support))

Martin Bähr mbaehr at email.archlab.tuwien.ac.at
Wed Dec 9 01:43:35 UTC 2015


Excerpts from EuanM's message of 2015-12-09 01:59:43 +0100:
> http://www.unicode.org/reports/tr15/#Stable_Code_Points
> Table 7, the discussion of Ligatures, (which uses the ligature of
> "ffi" as its example)

ß is not a ligature of ss, but is a different character.
historically it evolved from a ligature of long s (ſ) and round s
but it is no no longer a true ligature that can be decomposed without sideeffects.

they are pronounced differently and there are german words where the difference
of using ß vs ss results in different meaning of the word: (eg Buße vs Busse:
penance vs busses)

https://en.wikipedia.org/wiki/ß

it is allowed to use ss as a replacement for ß only when ß itself is not available.

this is similar to the german umlauts: ä,ö and ü which can be decomposed into
ae, oe and ue, but those forms are a mere approximation, not equivalent. in a
medium where umlauts are available, using a decomposed form can be considered
an error.

greetings, martin.

-- 
eKita                   -   the online platform for your entire academic life
-- 
chief engineer                                                       eKita.co
pike programmer      pike.lysator.liu.se    caudium.net     societyserver.org
secretary                                                      beijinglug.org
mentor                                                           fossasia.org
foresight developer  foresightlinux.org                            realss.com
unix sysadmin
Martin Bähr          working in china        http://societyserver.org/mbaehr/


More information about the Squeak-dev mailing list