<html>
<head>
<meta http-equiv="Content-Type" content="text/html; charset=utf-8">
</head>
<body text="#000000" bgcolor="#FFFFFF">
<p>I had found the same stackover flow question. It is the only
place I found that mentions that 0x0C is the tag for it.</p>
<p>I am currently encoding thus: <br>
</p>
<p>aString squeakToUtf8 asByteArray.</p>
<p>and decoding:</p>
<p>bytes asByteArray asString utf8ToSqueak.<br>
</p>
Do you think this lays out the bytes as specified in this page? I
gather from the stackoverflow that this would be the encoded form of
utf8 for asn1.<br>
<br>
<a class="moz-txt-link-freetext" href="https://en.wikipedia.org/wiki/UTF-8#Description">https://en.wikipedia.org/wiki/UTF-8#Description</a><br>
<br>
Alan<br>
<br>
<div class="moz-cite-prefix">On 09/18/2017 01:46 AM, Jakob Reschke
wrote:<br>
</div>
<blockquote type="cite"
cite="mid:CA+ovNC6eWS3c9b=DG+mKGwF5wTgNfTKpGB2rpxMsLYWDcVHagw@mail.gmail.com">
<div dir="auto">I just did a quick search on the web and it seems
like ASN.1 has a UTF8String type (with tag 12) that just
contains the sequence of bytes of the UTF-8-encoded string. Can
you use that? See also this question on stackoverflow: <a
href="https://stackoverflow.com/q/28929809" target="_blank"
moz-do-not-send="true">https://<wbr>stackoverflow.com/q/28929809</a>
<div dir="auto"><br>
</div>
<div dir="auto">In Squeak, you can convert between UTF-8-encoded
byte strings and decoded (Squeak-encoded) character strings
with the help of UTF8TextConverter. Have a look at its
class-side methods. Also, there are conversion methods in
String, IIRC. Try to filter its instance-side methods by
"utf8".</div>
<div dir="auto"><br>
</div>
<div dir="auto">Does this answer your question or are you in
search of something else?</div>
<div dir="auto"><br>
</div>
<div dir="auto">Kind regards,</div>
<div dir="auto">Jakob</div>
</div>
<div class="gmail_extra"><br>
<div class="gmail_quote">Am 18.09.2017 03:49 schrieb "Alan
Pinch" <<a href="mailto:alan.c.pinch@gmail.com"
target="_blank" moz-do-not-send="true">alan.c.pinch@gmail.com</a>>:<br
type="attribution">
<blockquote class="gmail_quote" style="margin:0 0 0
.8ex;border-left:1px #ccc solid;padding-left:1ex">I am
trying to map utf8 into an ASN1 encoding, where the UTF8 is<br>
specified to perhaps extend past one byte in value. I am
also interested<br>
in retaining this UTF8 characters in squeak to interoperate
well. What<br>
would be my best approach to this, mapping to/from these
bytes on a stream?<br>
<br>
Alan<br>
<br>
<br>
</blockquote>
</div>
</div>
<br>
<fieldset class="mimeAttachmentHeader"></fieldset>
<br>
<pre wrap="">
</pre>
</blockquote>
<br>
</body>
</html>