[squeak-dev] XMLTokenizer problem with ampersand

karl ramberg karlramberg at gmail.com
Thu Jun 4 16:23:54 UTC 2015


Hi,
I tested three different HTML parsers and found SOUP to work best for my
needs.
Thank you all.

Karl

On Tue, Jun 2, 2015 at 6:17 PM, Chris Muller <asqueaker at gmail.com> wrote:

> On Mon, Jun 1, 2015 at 9:10 PM, Levente Uzonyi <leves at elte.hu> wrote:
> > XMLTokenizer is not suitable to parse HTML documents. XML and HTML may
> look
> > similar, but are very different.
> > We used to use Soup[1] to parse HTML pages.
>
> Have you used Todd Blanchard's "HTML & CSS Validating Parser" [1], if
> so how does it compare to Soup?
>
> [1] -- http://www.squeaksource.com/htmlcssparser.html
>
>
-------------- next part --------------
An HTML attachment was scrubbed...
URL: http://lists.squeakfoundation.org/pipermail/squeak-dev/attachments/20150604/6e270a39/attachment.htm


More information about the Squeak-dev mailing list