[Cuis] Sorting Unicode strings (Re: [Unicode] collation sequences (Re: [squeak-dev] Unicode Support))

Martin Bähr mbaehr at email.archlab.tuwien.ac.at
Wed Dec 9 02:36:53 UTC 2015


Excerpts from Martin Bähr's message of 2015-12-09 02:43:35 +0100:
> Excerpts from EuanM's message of 2015-12-09 01:59:43 +0100:
> > http://www.unicode.org/reports/tr15/#Stable_Code_Points
> > Table 7, the discussion of Ligatures, (which uses the ligature of
> > "ffi" as its example)
> 
> ß is not a ligature of ss, but is a different character.

rereading this, i think i was wrong: that section has nothing to do with ß vs ss.

looking at the standard i also don't understand your conclusion.

what the standard seems to say is that the ffi ligature is not equivalent to
plain ffi: a string written with the ligature is a different string from
plain "ffi", because the two forms are distinct when printed.
on the other hand, ä and a" (two different encodings of ä) are the same,
because their printed forms are always identical.
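
to make the distinction concrete, here is a minimal sketch in python (my choice of language, not something from the thread) using the standard unicodedata module. the helper names canonically_equal and compatibility_equal are made up for illustration. it assumes "equivalent" above means canonical equivalence (compared via NFC), while the ligature only matches plain ffi under compatibility normalization (NFKC).

```python
import unicodedata

LIGATURE_FFI = "\uFB03"   # the single-code-point ffi ligature
PRECOMPOSED = "\u00E4"    # ä as one code point
DECOMPOSED = "a\u0308"    # a followed by a combining diaeresis

def canonically_equal(a, b):
    # hypothetical helper: compare strings after canonical composition (NFC)
    return unicodedata.normalize("NFC", a) == unicodedata.normalize("NFC", b)

def compatibility_equal(a, b):
    # hypothetical helper: compare strings after compatibility composition (NFKC)
    return unicodedata.normalize("NFKC", a) == unicodedata.normalize("NFKC", b)

print(canonically_equal(PRECOMPOSED, DECOMPOSED))   # the two encodings of ä
print(canonically_equal(LIGATURE_FFI, "ffi"))       # ligature vs plain letters
print(compatibility_equal(LIGATURE_FFI, "ffi"))     # same pair, compatibility form
```

the two encodings of ä compare equal under NFC, the ligature does not compare equal to plain "ffi" under NFC, and it does under NFKC.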

however, that doesn't mean that they are sorted differently.
german sorting rules, for example, explicitly state that ß and ss are sorted the same.
(at least according to wikipedia :-)
and surely the ffi ligature and plain ffi are sorted the same too.
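
as a rough illustration of "different strings, same sort position", here is a sketch in python. rough_sort_key is a hypothetical helper, not a real collation algorithm: it only combines compatibility decomposition (NFKD) with case folding, which happens to map ß to ss and the ligature to ffi. a real implementation would use proper locale-aware collation rules instead, and this key says nothing useful about where ä should sort.

```python
import unicodedata

def rough_sort_key(s):
    # hypothetical helper, NOT real collation:
    # NFKD splits compatibility characters such as the ffi ligature,
    # casefold() lowercases and maps ß to "ss"
    return unicodedata.normalize("NFKD", s).casefold()

words = ["Strasse", "Straße", "o\uFB03ce", "office"]

# the strings themselves differ ...
print("Straße" == "Strasse")
# ... but their rough sort keys are identical
print(rough_sort_key("Straße") == rough_sort_key("Strasse"))
print(rough_sort_key("o\uFB03ce") == rough_sort_key("office"))

# sorting by the key keeps each pair adjacent
print(sorted(words, key=rough_sort_key))
```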

greetings, martin.

-- 
eKita                   -   the online platform for your entire academic life
-- 
chief engineer                                                       eKita.co
pike programmer      pike.lysator.liu.se    caudium.net     societyserver.org
secretary                                                      beijinglug.org
mentor                                                           fossasia.org
foresight developer  foresightlinux.org                            realss.com
unix sysadmin
Martin Bähr          working in china        http://societyserver.org/mbaehr/

