[Box-Admins] [FWD: ** PROBLEM Service Alert: squeak box2/Squeak website is CRITICAL **]

Ken Causey ken at kencausey.com
Mon Feb 28 23:15:14 UTC 2011


VM dump?  I'm not sure what you mean.  The VM does not crash, the image
is simply locked up.  It's probable that I could have interrupted it
with alt-. but that doesn't seem to work over VNC.

No website process active?  You are mistaken and have created a second
one now:

box2:~# ps auwx | grep website
website   8850  0.0  0.5  6800 5020 ?        S    22:30   0:00 Xtightvnc
:1 -desktop X -auth /home/website/.Xauthority -geometry 1024x768 -depth
24 -rfbwait 120000 -rfbauth /home/website/.vnc/passwd -rfbport 5901 -fp
/usr/X11R6/lib/X11/fonts/Type1/,/usr/X11R6/lib/X11/fonts/Speedo/,/usr/X11R6/lib/X11/fonts/misc/,/usr/X11R6/lib/X11/fonts/75dpi/,/usr/X11R6/lib/X11/fonts/100dpi/
-co /usr/X11R6/lib/X11/rgb
website   8855  2.6  8.1 1052604 79196 ?     S    22:30   1:07
/usr/local/lib/squeak/3.11.3-2135/squeakvm -pathenc UTF-8 -encoding
UTF-8 -plugins /usr/local/lib/squeak/3.11.3-2135
/home/website/website/squeaksite.image
website   9044  0.0  0.6  8044 5920 ?        S    22:32   0:00 Xtightvnc
:4 -desktop X -auth /home/website/.Xauthority -geometry 1024x768 -depth
24 -rfbwait 120000 -rfbauth /home/website/.vnc/passwd -rfbport 5904 -fp
/usr/X11R6/lib/X11/fonts/Type1/,/usr/X11R6/lib/X11/fonts/Speedo/,/usr/X11R6/lib/X11/fonts/misc/,/usr/X11R6/lib/X11/fonts/75dpi/,/usr/X11R6/lib/X11/fonts/100dpi/
-co /usr/X11R6/lib/X11/rgb
website   9049  1.6  7.7 1052616 74836 ?     S    22:32   0:39
/usr/local/lib/squeak/3.11.3-2135/squeakvm -pathenc UTF-8 -encoding
UTF-8 -plugins /usr/local/lib/squeak/3.11.3-2135
/home/website/website/squeaksite.image

That's two VNC servers running now and two website processes active. 
Did you think I didn't test the website after I restarted it?

Ken

> -------- Original Message --------
> Subject: Re: [Box-Admins] [FWD: ** PROBLEM Service Alert: squeak
> box2/Squeak website is CRITICAL **]
> From: Janko Mivšek <janko.mivsek at eranova.si>
> Date: Mon, February 28, 2011 4:52 pm
> To: Squeak Hosting Support <box-admins at lists.squeakfoundation.org>
> 
> 
> Hi Ken,
> 
> I checked too and see no website process active, so I restarted it and
> now I'm connected with VNC to the image.
> 
> Today seems that image crashed, but snapshoted correctly at 9pm GMT last
> time.
> 
> Can we see some vm dump somewhere?
> 
> Best regards
> Janko
> 
> On 28. 02. 2011 23:47, Ken Causey wrote:
> > After I received this notice I checked and the website process had the
> > CPU pegged with normal memory usage.  I tried to connect with VNC and
> > got connected but the image was locked up.  There was a debugger open
> > and I took a screenshot which can be found at
> > 
> > http://users.squeak.org/~kencausey/website_locked.png
> > 
> > I chatted in the IRC channel as I was fiddling with it:
> > 
> > 2011-02-28 16:24:21     kencausey       JankoMivsek: website process is
> > flipping out again
> > 2011-02-28 16:26:14     kencausey       the memory usage is normal this
> > time, it just has the CPU pegged
> > 2011-02-28 16:26:41     kencausey       looking at the logs, the last
> > successful hit was the nagios check oddly enough, 2 hits before google
> > hit the stats page again
> > 2011-02-28 16:26:56     kencausey       I don't see anything suspicious
> > like the last time
> > 2011-02-28 16:29:41     kencausey       there is a debugger open on a
> > send of #bottomContext to UndefinedObject
> > 2011-02-28 16:29:50     kencausey       I can't interact with it
> > 2011-02-28 16:30:54     kencausey       it's in a call to
> > Process>>terminate
> > 2011-02-28 16:32:38     kencausey       restarting it now
> > 2011-02-28 16:34:09     kencausey       website is back up
> > 
> >>From the apache logs:
> > 
> > this is when it went down:
> > 
> > 80.81.242.100 - - [28/Feb/2011:21:41:03 +0000] "GET
> > /stats.html?view=main&year=1684&month=8 HTTP/1.1" 200 24288 "-"
> > "Mozilla/5.0 (compatible; Googlebot/2.1; +http://www.
> > google.com/bot.html)"
> > 173.255.225.4 - - [28/Feb/2011:21:42:13 +0000] "GET /favicon.ico
> > HTTP/1.1" 200 1406 "-" "Safari/6533.19.4 CFNetwork/454.11.5
> > Darwin/10.5.0 (i386) (MacBook3%2C1)"
> > 89.212.16.244 - - [28/Feb/2011:21:42:18 +0000] "GET /ping.html HTTP/1.1"
> > 200 - "-" "check_http/v1.4.14 (nagios-plugins 1.4.14)"
> > 38.99.97.225 - - [28/Feb/2011:21:42:49 +0000] "GET /Smalltalk/ HTTP/1.1"
> > 502 399 "-" "Mozilla/5.0 (compatible; ScoutJet;
> > +http://www.scoutjet.com/)"
> > 67.195.112.235 - - [28/Feb/2011:21:43:15 +0000] "GET
> > /Merchandise/?version=3 HTTP/1.0" 502 403 "-" "Mozilla/5.0 (compatible;
> > Yahoo! Slurp; http://help.yahoo.com/help/us/
> > ysearch/slurp)"
> > 
> > We have a googlebot hit to the stats page (relevant?), an irrelevant
> > favicon request, a successful nagios ping which is I assume Janko's and
> > not relevant, then hits start failing.  Before that I see nothing
> > suspicious and no flood of requests.
> > 
> > Ken
> > 
> >> -------- Original Message --------
> >> Subject: ** PROBLEM Service Alert: squeak box2/Squeak website is
> >> CRITICAL **
> >> From: nagios at mivsek.eranova.si (User for Nagios)
> >> Date: Mon, February 28, 2011 3:49 pm
> >> To: ken at kencausey.com
> >>
> >>
> >> ***** Nagios *****
> >>
> >> Notification Type: PROBLEM
> >>
> >> Service: Squeak website
> >> Host: squeak box2
> >> Address: 85.10.195.197
> >> State: CRITICAL
> >>
> >> Date/Time: Mon Feb 28 22:49:47 CET 2011
> >>
> >> Additional Info:
> >>
> >> CRITICAL - Socket timeout after 10 seconds
> > 
> > 
> 
> -- 
> Janko Mivšek
> Svetovalec za informatiko
> Eranova d.o.o.
> Ljubljana, Slovenija
> www.eranova.si
> tel:  01 514 22 55
> faks: 01 514 22 56
> gsm: 031 674 565



More information about the Box-Admins mailing list