[squeak-dev] The Inbox: Regex-Core-ct.68

christoph.thiede at student.hpi.uni-potsdam.de
Mon Aug 23 19:22:57 UTC 2021


Unfortunately, there was once again no notification about this version because I put some overly special characters into its summary. As an alternative, let me announce my changes here:

    Name: Regex-Core-ct.68
    Author: ct
    Time: 23 August 2021, 9:21:12.58334 pm
    UUID: 6159117b-a67f-bd4a-b30a-82fe1b4abb09
    Ancestors: Regex-Core-mt.61

    Adds support for Unicode backslash atoms.

    Some examples:

        'Squeak is the perfect language' allRegexMatches: '\w*\u{61}\w*'. "--> #('Squeak' 'language')"
        'Squeak is beautiful' allRegexMatches: '\w*\x75\w*'. "--> #('Squeak' 'beautiful')"
        (WebUtils jsonDecode: '"$1.00 = \u20AC0.85 = \u00A30.73"' readStream) allRegexMatches: '\p{Sc}\d+\.\d+'. "--> ('$1.00' '€0.85' '£0.73')"
        'Carpe Squeak!' allRegexMatches: '\p{L}+'. "--> #('Carpe' 'Squeak')"
        (WebUtils jsonDecode: '" get rid of \u2007all these nonsense separators"' readStream) allRegexMatches: '\P{Z}+'. "--> ('get' 'rid' 'of' 'all' 'these' 'nonsense' 'separators')"

    Requires Multilingual-ct.259.
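
For readers who do not use this notation every day, here is a minimal workspace sketch of what the atoms in the examples above stand for. It assumes an image with this version loaded. The variable name euro is only illustrative, and the answers noted for the regex lines are inferred from the examples above, not taken from the test suite.

```smalltalk
| euro |
"Code point escapes: \u{61} and \x75 name characters by their hexadecimal code."
Character value: 16r61. "--> $a"
Character value: 16r75. "--> $u"

"Build the euro sign (U+20AC) from its code point to avoid encoding problems in the source."
euro := (Character value: 16r20AC) asString.

"Assumption: \p{Sc} matches any character in the Unicode category Sc (currency symbol).
The third example above suggests that this answers true."
euro matchesRegex: '\p{Sc}'.

"Assumption: the upper-case form \P{...} negates the category, as \P{Z} does in the
last example above, so this should answer false."
euro matchesRegex: '\P{Sc}'.
```

Evaluate the lines one at a time with print-it to see each answer.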

Tests are in Regex-Tests-Core-ct.24. Looking forward to all your feedback! :-)

Best,
Christoph

---
Sent from Squeak Inbox Talk