[squeak-dev] a diacritics free version of a string

Philippe Marschall philippe.marschall at gmail.com
Mon Jun 1 19:36:20 UTC 2009


2009/6/1 Sebastian Sastre <ssastre at seaswork.com>:
> ah, thanks for the info.
> And, in case it can be enough for the moment, what would be the poor's man version of this goal?

Build the map of characters with diacritics to ones without "by hand".

The Unicode solution would be to do normalization with full
decomposition and then a regex on \p{InCombiningDiacriticalMarks} and
replace it with an empty string or something similar.

Cheers
Philippe



More information about the Squeak-dev mailing list