Hi all.
I am starting the task of parsing Wikimedia interlanguage links per : https://en.wikipedia.org/wiki/Help:Wikitext#Redirects
stuff like:
[[fr:Plancton]] [[de:Plankton]] [[ru:Планктон]] [[simple:Plankton]]
when I paste this into a Workspace, I get:
[[es:Plancton]]
[[fr:Plancton]]
[[de:Plankton]]
[[ru:????????]]
[[simple:Plankton]]
What should I do?
thanks in advance.
tty
Hi tty
On 12.09.2019, at 19:09, gettimothy gettimothy@zoho.com wrote:
Hi all.
I am starting the task of parsing Wikimedia interlanguage links per : https://en.wikipedia.org/wiki/Help:Wikitext#Redirects
stuff like:
[[fr:Plancton]] [[de:Plankton]] [[ru:Планктон]] [[simple:Plankton]]
when I paste this into a Workspace, I get:
[[es:Plancton]] [[fr:Plancton]] [[de:Plankton]] [[ru:????????]] [[simple:Plankton]]
What should I do?
This is a font problem. The default font in Squeak lacks Cyrillic characters. If you paste the Russian version and make it a string and inspect it, as in '[[ru:Планктон]]' inspect
you will see that from index 6 on, the Unicode codepoints of the Cyrillic characters are actually correct, but just cannot be displayed.
best regards -Tobias
thanks in advance.
tty
Beginners mailing list Beginners@lists.squeakfoundation.org http://lists.squeakfoundation.org/mailman/listinfo/beginners
Tobias
Thank you.
These will eventually make it on to an html page via Seaside, so hopefully it will display there correctly.
Thank you for your time.
tty
---- On Thu, 12 Sep 2019 13:26:35 -0400 Tobias Pape Das.Linux@gmx.de wrote ----
Hi tty
On 12.09.2019, at 19:09, gettimothy mailto:gettimothy@zoho.com wrote:
Hi all.
I am starting the task of parsing Wikimedia interlanguage links per : https://en.wikipedia.org/wiki/Help:Wikitext#Redirects
stuff like:
[[fr:Plancton]] [[de:Plankton]] [[ru:Планктон]] [[simple:Plankton]]
when I paste this into a Workspace, I get:
[[es:Plancton]] [[fr:Plancton]] [[de:Plankton]] [[ru:????????]] [[simple:Plankton]]
What should I do?
This is a font problem. The default font in Squeak lacks Cyrillic characters. If you paste the Russian version and make it a string and inspect it, as in '[[ru:Планктон]]' inspect
you will see that from index 6 on, the Unicode codepoints of the Cyrillic characters are actually correct, but just cannot be displayed.
best regards -Tobias
thanks in advance.
tty
Beginners mailing list mailto:Beginners@lists.squeakfoundation.org http://lists.squeakfoundation.org/mailman/listinfo/beginners
_______________________________________________ Beginners mailing list mailto:Beginners@lists.squeakfoundation.org http://lists.squeakfoundation.org/mailman/listinfo/beginners
Just tested it on my local copy of SeasideDoc and it does not display correctly.
I will figure it out another day.
Is it a big job to incorporate other Character Sets into Squeak?
thanks
---- On Thu, 12 Sep 2019 13:26:35 -0400 Tobias Pape Das.Linux@gmx.de wrote ----
Hi tty
On 12.09.2019, at 19:09, gettimothy mailto:gettimothy@zoho.com wrote:
Hi all.
I am starting the task of parsing Wikimedia interlanguage links per : https://en.wikipedia.org/wiki/Help:Wikitext#Redirects
stuff like:
[[fr:Plancton]] [[de:Plankton]] [[ru:Планктон]] [[simple:Plankton]]
when I paste this into a Workspace, I get:
[[es:Plancton]] [[fr:Plancton]] [[de:Plankton]] [[ru:????????]] [[simple:Plankton]]
What should I do?
This is a font problem. The default font in Squeak lacks Cyrillic characters. If you paste the Russian version and make it a string and inspect it, as in '[[ru:Планктон]]' inspect
you will see that from index 6 on, the Unicode codepoints of the Cyrillic characters are actually correct, but just cannot be displayed.
best regards -Tobias
thanks in advance.
tty
Beginners mailing list mailto:Beginners@lists.squeakfoundation.org http://lists.squeakfoundation.org/mailman/listinfo/beginners
_______________________________________________ Beginners mailing list mailto:Beginners@lists.squeakfoundation.org http://lists.squeakfoundation.org/mailman/listinfo/beginners
On 12.09.2019, at 23:52, gettimothy gettimothy@zoho.com wrote:
Just tested it on my local copy of SeasideDoc and it does not display correctly.
You most probably have to tell you string to be converted to utf8, because that is typically what is being served via Seaside and rendered by browsers.
IIRC, you can do that by setting the UTF8 Codec for the Seaside Adatpor or simply by converting the string:
'[[ru:Планктон]]' squeakToUtf8
Best regards -Tobias
I will figure it out another day.
Is it a big job to incorporate other Character Sets into Squeak?
thanks
---- On Thu, 12 Sep 2019 13:26:35 -0400 Tobias Pape Das.Linux@gmx.de wrote ----
Hi tty
On 12.09.2019, at 19:09, gettimothy gettimothy@zoho.com wrote:
Hi all.
I am starting the task of parsing Wikimedia interlanguage links per : https://en.wikipedia.org/wiki/Help:Wikitext#Redirects
stuff like:
[[fr:Plancton]] [[de:Plankton]] [[ru:Планктон]] [[simple:Plankton]]
when I paste this into a Workspace, I get:
[[es:Plancton]] [[fr:Plancton]] [[de:Plankton]] [[ru:????????]] [[simple:Plankton]]
What should I do?
This is a font problem. The default font in Squeak lacks Cyrillic characters. If you paste the Russian version and make it a string and inspect it, as in '[[ru:Планктон]]' inspect
you will see that from index 6 on, the Unicode codepoints of the Cyrillic characters are actually correct, but just cannot be displayed.
best regards -Tobias
thanks in advance.
tty
Beginners mailing list Beginners@lists.squeakfoundation.org http://lists.squeakfoundation.org/mailman/listinfo/beginners
Beginners mailing list Beginners@lists.squeakfoundation.org http://lists.squeakfoundation.org/mailman/listinfo/beginners
Beginners mailing list Beginners@lists.squeakfoundation.org http://lists.squeakfoundation.org/mailman/listinfo/beginners
Tobias, thank you. I will verify that Monday
Is bringing Cyrillic character support to squeak workspace a big job? If it's just tore grunt work, I can contribute
---- On Fri, 13 Sep 2019 01:47:57 -0400 Das.Linux@gmx.de wrote ----
On 12.09.2019, at 23:52, gettimothy gettimothy@zoho.com wrote:
Just tested it on my local copy of SeasideDoc and it does not display correctly.
You most probably have to tell you string to be converted to utf8, because that is typically what is being served via Seaside and rendered by browsers.
IIRC, you can do that by setting the UTF8 Codec for the Seaside Adatpor or simply by converting the string:
'[[ru:Планктон]]' squeakToUtf8
Best regards -Tobias
I will figure it out another day.
Is it a big job to incorporate other Character Sets into Squeak?
thanks
---- On Thu, 12 Sep 2019 13:26:35 -0400 Tobias Pape Das.Linux@gmx.de wrote ----
Hi tty
On 12.09.2019, at 19:09, gettimothy gettimothy@zoho.com wrote:
Hi all.
I am starting the task of parsing Wikimedia interlanguage links per : https://en.wikipedia.org/wiki/Help:Wikitext#Redirects
stuff like:
[[fr:Plancton]] [[de:Plankton]] [[ru:Планктон]] [[simple:Plankton]]
when I paste this into a Workspace, I get:
[[es:Plancton]] [[fr:Plancton]] [[de:Plankton]] [[ru:????????]] [[simple:Plankton]]
What should I do?
This is a font problem. The default font in Squeak lacks Cyrillic characters. If you paste the Russian version and make it a string and inspect it, as in '[[ru:Планктон]]' inspect
you will see that from index 6 on, the Unicode codepoints of the Cyrillic characters are actually correct, but just cannot be displayed.
best regards -Tobias
thanks in advance.
tty
Beginners mailing list Beginners@lists.squeakfoundation.org http://lists.squeakfoundation.org/mailman/listinfo/beginners
Beginners mailing list Beginners@lists.squeakfoundation.org http://lists.squeakfoundation.org/mailman/listinfo/beginners
Beginners mailing list Beginners@lists.squeakfoundation.org http://lists.squeakfoundation.org/mailman/listinfo/beginners
_______________________________________________ Beginners mailing list Beginners@lists.squeakfoundation.org http://lists.squeakfoundation.org/mailman/listinfo/beginners
Hi
On 13.09.2019, at 11:53, gettimothy gettimothy@zoho.com wrote:
Tobias, thank you. I will verify that Monday
Is bringing Cyrillic character support to squeak workspace a big job? If it's just tore grunt work, I can contribute
Basically, it is "just" a font problem, things display if one uses a font with the necessary characters...
---- On Fri, 13 Sep 2019 01:47:57 -0400 Das.Linux@gmx.de wrote ----
On 12.09.2019, at 23:52, gettimothy gettimothy@zoho.com wrote:
Just tested it on my local copy of SeasideDoc and it does not display correctly.
You most probably have to tell you string to be converted to utf8, because that is typically what is being served via Seaside and rendered by browsers.
IIRC, you can do that by setting the UTF8 Codec for the Seaside Adatpor or simply by converting the string:
'[[ru:Планктон]]' squeakToUtf8
Best regards -Tobias
I will figure it out another day.
Is it a big job to incorporate other Character Sets into Squeak?
thanks
---- On Thu, 12 Sep 2019 13:26:35 -0400 Tobias Pape Das.Linux@gmx.de wrote ----
Hi tty
On 12.09.2019, at 19:09, gettimothy gettimothy@zoho.com wrote:
Hi all.
I am starting the task of parsing Wikimedia interlanguage links per : https://en.wikipedia.org/wiki/Help:Wikitext#Redirects
stuff like:
[[fr:Plancton]] [[de:Plankton]] [[ru:Планктон]] [[simple:Plankton]]
when I paste this into a Workspace, I get:
[[es:Plancton]] [[fr:Plancton]] [[de:Plankton]] [[ru:????????]] [[simple:Plankton]]
What should I do?
This is a font problem. The default font in Squeak lacks Cyrillic characters. If you paste the Russian version and make it a string and inspect it, as in '[[ru:Планктон]]' inspect
you will see that from index 6 on, the Unicode codepoints of the Cyrillic characters are actually correct, but just cannot be displayed.
best regards -Tobias
thanks in advance.
tty
Beginners mailing list Beginners@lists.squeakfoundation.org http://lists.squeakfoundation.org/mailman/listinfo/beginners
Beginners mailing list Beginners@lists.squeakfoundation.org http://lists.squeakfoundation.org/mailman/listinfo/beginners
Beginners mailing list Beginners@lists.squeakfoundation.org http://lists.squeakfoundation.org/mailman/listinfo/beginners
Beginners mailing list Beginners@lists.squeakfoundation.org http://lists.squeakfoundation.org/mailman/listinfo/beginners
Beginners mailing list Beginners@lists.squeakfoundation.org http://lists.squeakfoundation.org/mailman/listinfo/beginners
Got it to display in a workspace per your example.
http://slackbuilds.org/repository/14.2/system/webcore-fonts/
Now, I need to make that the default font.
thanks again
-- Sent from: http://forum.world.st/Squeak-Beginners-f107673.html
Hi Tobias
I am on linux and I do not see any unicode font choices available on my System Faults menu options.
Is there someplace I can get and install them?
thanks in advance.
t
you can try the font importer from the apps menu
On 19.09.2019, at 20:25, gettimothy gettimothy@zoho.com wrote:
Hi Tobias
I am on linux and I do not see any unicode font choices available on my System Faults menu options.
Is there someplace I can get and install them?
thanks in advance.
t
Beginners mailing list Beginners@lists.squeakfoundation.org http://lists.squeakfoundation.org/mailman/listinfo/beginners
I will do that Monday, thank you...
---- On Fri, 20 Sep 2019 03:08:47 -0400 Das.Linux@gmx.de wrote ----
you can try the font importer from the apps menu
On 19.09.2019, at 20:25, gettimothy gettimothy@zoho.com wrote:
Hi Tobias
I am on linux and I do not see any unicode font choices available on my System Faults menu options.
Is there someplace I can get and install them?
thanks in advance.
t
Beginners mailing list Beginners@lists.squeakfoundation.org http://lists.squeakfoundation.org/mailman/listinfo/beginners
_______________________________________________ Beginners mailing list Beginners@lists.squeakfoundation.org http://lists.squeakfoundation.org/mailman/listinfo/beginners
I have no idea how, but it renders correctly in Seaside. Here is a copy-n-paste from the webpage rendered via Seaside:Планктон
here is the URL for that element: Планктон http://ru.Wikipedia.org/wiki/Планктон
Yet, in the Squeak workspaces, I still get the ????? stuff.
strange!
-- Sent from: http://forum.world.st/Squeak-Beginners-f107673.html
beginners@lists.squeakfoundation.org