[squeak-dev] fastest #isUtf8

Jakob Reschke forums.jakob at resfarm.de
Fri Jan 24 21:15:23 UTC 2020


Hi Chris,

I don't have an implementation ready.
But here a start for the research:
https://lemire.me/blog/2018/05/09/how-quickly-can-you-check-that-a-string-is-valid-unicode-utf-8/
Not Smalltalk, but a few approaches.

Is there any chance to do vectorized computation (using SIMD registers and
instructions) from Squeak? Can the JIT compiler generate such code?

Kind regards,
Jakob

Am Do., 23. Jan. 2020 um 01:46 Uhr schrieb Chris Muller <
ma.chris.m at gmail.com>:

> For the GraphQL engine, I have to validate String inputs as being valid
> UTF-8.  Before researching it, I thought I'd check whether anyone has
> already done it and willing to share their implementation.
>
> I see we have conversions for this, but I need the boolean response
> whether its valid UTF8, as fast as possible.
>
> Thanks!
>
>
>
-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://lists.squeakfoundation.org/pipermail/squeak-dev/attachments/20200124/2e77c072/attachment.html>


More information about the Squeak-dev mailing list