[squeak-dev] Re: [Pharo-project] HTML parser (again)

Mariano Martinez Peck marianopeck at gmail.com
Wed Aug 18 17:14:57 UTC 2010


On Wed, Aug 18, 2010 at 5:55 PM, Andrei Stebakov <lispercat at gmail.com>wrote:

> I tried to load Scamper's Network-HTML, I got a Syntax Error during
> reloading:
> HtmlTokenizer private-initialization initialize:
> initialize: s
>        text _ s withSqueakLineEndings.
>        pos _ Nothing more expected ->1.
>        textAreaLevel _ 0.
>
>
That code is using underscore as assigment, don't allowed anymore in Pharo
1.1 unless you explicity set a specific setting.

So....or set that setting or update the code (in another image)

cheers

mariano



> On Wed, Aug 18, 2010 at 2:34 AM, laurent laffont
> <laurent.laffont at gmail.com> wrote:
> >
> >
> > On Wed, Aug 18, 2010 at 7:50 AM, Andrei Stebakov <lispercat at gmail.com>
> > wrote:
> >>
> >> I've been looking for a nice and fast HTML parser.
> >> I've found Zulq Alam's Soup
> >> (http://www.squeaksource.com/@vHckXt8_6gVtXFxy/XMrjDbIs) it looks nice
> >> but it's way too slow for me (takes 5 sec to parse the page, my
> >> current lisp parser takes about 1 sec for that.)
> >> I found another one, Todd Blanchard's HTML and CSS parser
> >> (http://www.squeaksource.com/@iMgHmTKVxU00wEdz/A0jkqk71) but I
> >> couldn't load it into Pharo 1.1 or Squeak 4.1.
> >> It complains about some syntax error and leaves the progress bar which
> >> I can't kill...
> >> I wonder if anyone (Todd?) can take a look at the parser and figure
> >> out how to fix it?
> >>
> >> What other options I have for an HTML parser?
> >> Looking at Pharo speed I wonder if there is any way to optimize it? Is
> >> JIT or some other speed optimization in plans for Pharo/Squeak?
> >
> >
> > What do you need to do ?
> > There's XMLSupport http://www.squeaksource.com/XMLSupport.html
> > Scamper might have a standalone HTML
> > parser http://www.squeaksource.com/Scamper.html
> > The CogVM has JIT.
> > Laurent.
> >
> >>
> >> Thank you,
> >> Andrei
> >>
> >> _______________________________________________
> >> Pharo-project mailing list
> >> Pharo-project at lists.gforge.inria.fr
> >> http://lists.gforge.inria.fr/cgi-bin/mailman/listinfo/pharo-project
> >
> >
> >
> >
> >
>
>
-------------- next part --------------
An HTML attachment was scrubbed...
URL: http://lists.squeakfoundation.org/pipermail/squeak-dev/attachments/20100818/ea8b77f4/attachment.htm


More information about the Squeak-dev mailing list