[squeak-dev] Unicode Ranges you might want to copy

Eric Gade eric.gade at gmail.com
Mon Feb 1 13:57:32 UTC 2021


Hi Timothy,

I spent several (fruitless) days last year trying to get cuneiform fonts to
render properly in Squeak (see
<https://www.hethport.uni-wuerzburg.de/cuneifont/> for examples of the
fonts). From what I recall, there seems to be an issue with rendering glyphs
above a certain code point. I am definitely interested in your work and in
what you end up finding out.

On Sun, Jan 31, 2021 at 7:20 AM gettimothy via Squeak-dev <
squeak-dev at lists.squeakfoundation.org> wrote:

> Hi Folks,
>
> You might find the below handy.
>
> FWIW, I have coded the UnicodeRangeBrowser Seaside app to display all of the
> below (it times out before everything displays; I will be refactoring it and
> developing it further).
>
> I am also coding a utility class to provide information such as below in a
> variety of ways.
>
> FWIW, here are my current to-do notes for this font information.
>
> unifont
>
> provide link to the Unicode spec.
> provide browser fonts list.
> provide Squeak fonts list.
> provide Pharo fonts list ^self squeak fonts list:
> what are Variation Selectors?
>
> show gaps in the ranges.
>
> provide link(s) to required fonts that will make a range display.
> displays on: browser list, Squeak, Emacs, xterm...
> status bar...broken, partial, full.
> use cases for fonts
> pairs with xyz, for example superscripts and subscripts
>
>
> The goal is to show what works and to find/test fonts that will support
> stuff.
> I am modelling the Seaside app on
> https://jrgraphix.net/research/unicode_blocks.php
> But...for each Unicode range, I intend to display Smalltalk-specific
> helpers on what/where to get fonts for both the browser side and the image
> side.
>
> --------------snip -------------------
>
> (16r000020 asCharacter to: 16r00007F asCharacter) -> 'Basic Latin'
> (16r0000A0 asCharacter to: 16r0000FF asCharacter) -> 'Latin-1 Supplement'
> (16r000100 asCharacter to: 16r00017F asCharacter) -> 'Latin Extended-A'
> (16r000180 asCharacter to: 16r00024F asCharacter) -> 'Latin Extended-B'
> (16r000250 asCharacter to: 16r0002AF asCharacter) -> 'IPA Extensions'
> (16r0002B0 asCharacter to: 16r0002FF asCharacter) -> 'Spacing Modifier Letters'
> (16r000300 asCharacter to: 16r00036F asCharacter) -> 'Combining Diacritical Marks'
> (16r000370 asCharacter to: 16r0003FF asCharacter) -> 'Greek and Coptic'
> (16r000400 asCharacter to: 16r0004FF asCharacter) -> 'Cyrillic'
> (16r000500 asCharacter to: 16r00052F asCharacter) -> 'Cyrillic Supplementary'
> (16r000530 asCharacter to: 16r00058F asCharacter) -> 'Armenian'
> (16r000590 asCharacter to: 16r0005FF asCharacter) -> 'Hebrew'
> (16r000600 asCharacter to: 16r0006FF asCharacter) -> 'Arabic'
> (16r000700 asCharacter to: 16r00074F asCharacter) -> 'Syriac'
> (16r000780 asCharacter to: 16r0007BF asCharacter) -> 'Thaana'
> (16r000900 asCharacter to: 16r00097F asCharacter) -> 'Devanagari'
> (16r000980 asCharacter to: 16r0009FF asCharacter) -> 'Bengali'
> (16r000A00 asCharacter to: 16r000A7F asCharacter) -> 'Gurmukhi'
> (16r000A80 asCharacter to: 16r000AFF asCharacter) -> 'Gujarati'
> (16r000B00 asCharacter to: 16r000B7F asCharacter) -> 'Oriya'
> (16r000B80 asCharacter to: 16r000BFF asCharacter) -> 'Tamil'
> (16r000C00 asCharacter to: 16r000C7F asCharacter) -> 'Telugu'
> (16r000C80 asCharacter to: 16r000CFF asCharacter) -> 'Kannada'
> (16r000D00 asCharacter to: 16r000D7F asCharacter) -> 'Malayalam'
> (16r000D80 asCharacter to: 16r000DFF asCharacter) -> 'Sinhala'
> (16r000E00 asCharacter to: 16r000E7F asCharacter) -> 'Thai'
> (16r000E80 asCharacter to: 16r000EFF asCharacter) -> 'Lao'
> (16r000F00 asCharacter to: 16r000FFF asCharacter) -> 'Tibetan'
> (16r001000 asCharacter to: 16r00109F asCharacter) -> 'Myanmar'
> (16r0010A0 asCharacter to: 16r0010FF asCharacter) -> 'Georgian'
> (16r001100 asCharacter to: 16r0011FF asCharacter) -> 'Hangul Jamo'
> (16r001200 asCharacter to: 16r00137F asCharacter) -> 'Ethiopic'
> (16r0013A0 asCharacter to: 16r0013FF asCharacter) -> 'Cherokee'
> (16r001400 asCharacter to: 16r00167F asCharacter) -> 'Unified Canadian Aboriginal Syllabics'
> (16r001680 asCharacter to: 16r00169F asCharacter) -> 'Ogham'
> (16r0016A0 asCharacter to: 16r0016FF asCharacter) -> 'Runic'
> (16r001700 asCharacter to: 16r00171F asCharacter) -> 'Tagalog'
> (16r001720 asCharacter to: 16r00173F asCharacter) -> 'Hanunoo'
> (16r001740 asCharacter to: 16r00175F asCharacter) -> 'Buhid'
> (16r001760 asCharacter to: 16r00177F asCharacter) -> 'Tagbanwa'
> (16r001780 asCharacter to: 16r0017FF asCharacter) -> 'Khmer'
> (16r001800 asCharacter to: 16r0018AF asCharacter) -> 'Mongolian'
> (16r001900 asCharacter to: 16r00194F asCharacter) -> 'Limbu'
> (16r001950 asCharacter to: 16r00197F asCharacter) -> 'Tai Le'
> (16r0019E0 asCharacter to: 16r0019FF asCharacter) -> 'Khmer Symbols'
> (16r001D00 asCharacter to: 16r001D7F asCharacter) -> 'Phonetic Extensions'
> (16r001E00 asCharacter to: 16r001EFF asCharacter) -> 'Latin Extended Additional'
> (16r001F00 asCharacter to: 16r001FFF asCharacter) -> 'Greek Extended'
> (16r002000 asCharacter to: 16r00206F asCharacter) -> 'General Punctuation'
> (16r002070 asCharacter to: 16r00209F asCharacter) -> 'Superscripts and Subscripts'
> (16r0020A0 asCharacter to: 16r0020CF asCharacter) -> 'Currency Symbols'
> (16r0020D0 asCharacter to: 16r0020FF asCharacter) -> 'Combining Diacritical Marks for Symbols'
> (16r002100 asCharacter to: 16r00214F asCharacter) -> 'Letterlike Symbols'
> (16r002150 asCharacter to: 16r00218F asCharacter) -> 'Number Forms'
> (16r002190 asCharacter to: 16r0021FF asCharacter) -> 'Arrows'
> (16r002200 asCharacter to: 16r0022FF asCharacter) -> 'Mathematical Operators'
> (16r002300 asCharacter to: 16r0023FF asCharacter) -> 'Miscellaneous Technical'
> (16r002400 asCharacter to: 16r00243F asCharacter) -> 'Control Pictures'
> (16r002440 asCharacter to: 16r00245F asCharacter) -> 'Optical Character Recognition'
> (16r002460 asCharacter to: 16r0024FF asCharacter) -> 'Enclosed Alphanumerics'
> (16r002500 asCharacter to: 16r00257F asCharacter) -> 'Box Drawing'
> (16r002580 asCharacter to: 16r00259F asCharacter) -> 'Block Elements'
> (16r0025A0 asCharacter to: 16r0025FF asCharacter) -> 'Geometric Shapes'
> (16r002600 asCharacter to: 16r0026FF asCharacter) -> 'Miscellaneous Symbols'
> (16r002700 asCharacter to: 16r0027BF asCharacter) -> 'Dingbats'
> (16r0027C0 asCharacter to: 16r0027EF asCharacter) -> 'Miscellaneous Mathematical Symbols-A'
> (16r0027F0 asCharacter to: 16r0027FF asCharacter) -> 'Supplemental Arrows-A'
> (16r002800 asCharacter to: 16r0028FF asCharacter) -> 'Braille Patterns'
> (16r002900 asCharacter to: 16r00297F asCharacter) -> 'Supplemental Arrows-B'
> (16r002980 asCharacter to: 16r0029FF asCharacter) -> 'Miscellaneous Mathematical Symbols-B'
> (16r002A00 asCharacter to: 16r002AFF asCharacter) -> 'Supplemental Mathematical Operators'
> (16r002B00 asCharacter to: 16r002BFF asCharacter) -> 'Miscellaneous Symbols and Arrows'
> (16r002E80 asCharacter to: 16r002EFF asCharacter) -> 'CJK Radicals Supplement'
> (16r002F00 asCharacter to: 16r002FDF asCharacter) -> 'Kangxi Radicals'
> (16r002FF0 asCharacter to: 16r002FFF asCharacter) -> 'Ideographic Description Characters'
> (16r003000 asCharacter to: 16r00303F asCharacter) -> 'CJK Symbols and Punctuation'
> (16r003040 asCharacter to: 16r00309F asCharacter) -> 'Hiragana'
> (16r0030A0 asCharacter to: 16r0030FF asCharacter) -> 'Katakana'
> (16r003100 asCharacter to: 16r00312F asCharacter) -> 'Bopomofo'
> (16r003130 asCharacter to: 16r00318F asCharacter) -> 'Hangul Compatibility Jamo'
> (16r003190 asCharacter to: 16r00319F asCharacter) -> 'Kanbun'
> (16r0031A0 asCharacter to: 16r0031BF asCharacter) -> 'Bopomofo Extended'
> (16r0031F0 asCharacter to: 16r0031FF asCharacter) -> 'Katakana Phonetic Extensions'
> (16r003200 asCharacter to: 16r0032FF asCharacter) -> 'Enclosed CJK Letters and Months'
> (16r003300 asCharacter to: 16r0033FF asCharacter) -> 'CJK Compatibility'
> (16r003400 asCharacter to: 16r004DBF asCharacter) -> 'CJK Unified Ideographs Extension A'
> (16r004DC0 asCharacter to: 16r004DFF asCharacter) -> 'Yijing Hexagram Symbols'
> (16r004E00 asCharacter to: 16r009FFF asCharacter) -> 'CJK Unified Ideographs'
> (16r00A000 asCharacter to: 16r00A48F asCharacter) -> 'Yi Syllables'
> (16r00A490 asCharacter to: 16r00A4CF asCharacter) -> 'Yi Radicals'
> (16r00AC00 asCharacter to: 16r00D7AF asCharacter) -> 'Hangul Syllables'
> (16r00D800 asCharacter to: 16r00DB7F asCharacter) -> 'High Surrogates'
> (16r00DB80 asCharacter to: 16r00DBFF asCharacter) -> 'High Private Use Surrogates'
> (16r00DC00 asCharacter to: 16r00DFFF asCharacter) -> 'Low Surrogates'
> (16r00E000 asCharacter to: 16r00F8FF asCharacter) -> 'Private Use Area'
> (16r00F900 asCharacter to: 16r00FAFF asCharacter) -> 'CJK Compatibility Ideographs'
> (16r00FB00 asCharacter to: 16r00FB4F asCharacter) -> 'Alphabetic Presentation Forms'
> (16r00FB50 asCharacter to: 16r00FDFF asCharacter) -> 'Arabic Presentation Forms-A'
> (16r00FE00 asCharacter to: 16r00FE0F asCharacter) -> 'Variation Selectors'
> (16r00FE20 asCharacter to: 16r00FE2F asCharacter) -> 'Combining Half Marks'
> (16r00FE30 asCharacter to: 16r00FE4F asCharacter) -> 'CJK Compatibility Forms'
> (16r00FE50 asCharacter to: 16r00FE6F asCharacter) -> 'Small Form Variants'
> (16r00FE70 asCharacter to: 16r00FEFF asCharacter) -> 'Arabic Presentation Forms-B'
> (16r00FF00 asCharacter to: 16r00FFEF asCharacter) -> 'Halfwidth and Fullwidth Forms'
> (16r00FFF0 asCharacter to: 16r00FFFF asCharacter) -> 'Specials'
> (16r010000 asCharacter to: 16r01007F asCharacter) -> 'Linear B Syllabary'
> (16r010080 asCharacter to: 16r0100FF asCharacter) -> 'Linear B Ideograms'
> (16r010100 asCharacter to: 16r01013F asCharacter) -> 'Aegean Numbers'
> (16r010300 asCharacter to: 16r01032F asCharacter) -> 'Old Italic'
> (16r010330 asCharacter to: 16r01034F asCharacter) -> 'Gothic'
> (16r010380 asCharacter to: 16r01039F asCharacter) -> 'Ugaritic'
> (16r010400 asCharacter to: 16r01044F asCharacter) -> 'Deseret'
> (16r010450 asCharacter to: 16r01047F asCharacter) -> 'Shavian'
> (16r010480 asCharacter to: 16r0104AF asCharacter) -> 'Osmanya'
> (16r010800 asCharacter to: 16r01083F asCharacter) -> 'Cypriot Syllabary'
> (16r01D000 asCharacter to: 16r01D0FF asCharacter) -> 'Byzantine Musical Symbols'
> (16r01D100 asCharacter to: 16r01D1FF asCharacter) -> 'Musical Symbols'
> (16r01D300 asCharacter to: 16r01D35F asCharacter) -> 'Tai Xuan Jing Symbols'
> (16r01D400 asCharacter to: 16r01D7FF asCharacter) -> 'Mathematical Alphanumeric Symbols'
> (16r020000 asCharacter to: 16r02A6DF asCharacter) -> 'CJK Unified Ideographs Extension B'
> (16r02F800 asCharacter to: 16r02FA1F asCharacter) -> 'CJK Compatibility Ideographs Supplement'
> (16r0E0000 asCharacter to: 16r0E007F asCharacter) -> 'Tags'
>
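
The "show gaps in the ranges" item in the notes above could be sketched
roughly as follows. This is only a workspace snippet under my own
assumptions, not how the UnicodeRangeBrowser does it: it uses plain integer
intervals rather than the Character collections in the list, it covers only
three of the ranges, it assumes the ranges are sorted and do not overlap,
and the variable names (blocks, gaps, previousLast) are made up for
illustration.

```smalltalk
"Workspace sketch: collect the code point gaps between consecutive ranges.
 Assumes the ranges are sorted by start and do not overlap."
| blocks gaps previousLast |
blocks := {
	(16r0700 to: 16r074F) -> 'Syriac'.
	(16r0780 to: 16r07BF) -> 'Thaana'.
	(16r0900 to: 16r097F) -> 'Devanagari' }.
gaps := OrderedCollection new.
previousLast := nil.
blocks do: [:each |
	"A gap exists when this range starts more than one past the previous end."
	(previousLast notNil and: [each key first > (previousLast + 1)])
		ifTrue: [gaps add: (previousLast + 1 to: each key first - 1)].
	previousLast := each key last].
"Print each gap as start-end in hexadecimal."
gaps do: [:gap |
	Transcript
		show: (gap first printStringBase: 16);
		show: '-';
		show: (gap last printStringBase: 16);
		cr].
```

For these three ranges the gaps are 16r0750 to 16r077F and 16r07C0 to
16r08FF. A gap only means the list above has no entry there; it does not
say whether those code points are assigned in the current Unicode standard.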

-- 
Eric