[squeak-dev] The Inbox: Regex-Core-ct.68
christoph.thiede at student.hpi.uni-potsdam.de
christoph.thiede at student.hpi.uni-potsdam.de
Mon Aug 23 19:22:57 UTC 2021
So unfortunately there was no notification about this version, once again, because I have inserted some too special characters in its summary. As an alternative, let me announce my changes here again:
Name: Regex-Core-ct.68
Author: ct
Time: 23 August 2021, 9:21:12.58334 pm
UUID: 6159117b-a67f-bd4a-b30a-82fe1b4abb09
Ancestors: Regex-Core-mt.61
Adds support for unicode backslash atoms.
Some examples:
'Squeak is the perfect language' allRegexMatches: '\w*\u{61}\w*'. "--> #('Squeak' 'language')"
'Squeak is beautiful' allRegexMatches: '\w*\x75\w*'. "--> #('Squeak' 'beautiful')"
(WebUtils jsonDecode: '"$1.00 = \u20AC0.85 = \u00A30.73"' readStream) allRegexMatches: '\p{Sc}\d+\.\d+'. "--> ('$1.00' '?0.85' '?0.73')"
'Carpe Squeak!' allRegexMatches: '\p{L}+'. "--> #('Carpe' 'Squeak')"
(WebUtils jsonDecode: '" get rid of \u2007all these nonsense separators"' readStream) allRegexMatches: '\P{Z}+'. "--> ('get' 'rid' 'of' 'all' 'these' 'nonsense' 'separators')"
Requires Multilingual-ct.259.
Tests are in Regex-Tests-Core-ct.24. Looking forward to all your feedback! :-)
Best,
Christoph
---
Sent from Squeak Inbox Talk
-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://lists.squeakfoundation.org/pipermail/squeak-dev/attachments/20210823/ad1b2d86/attachment.html>
More information about the Squeak-dev
mailing list
|