[Seaside] I need success stories
mlucas-smith at cincom.com
Fri Feb 22 03:38:14 UTC 2008
> So, does anyone on this list have experience using Seaside in a high
> traffic scenario. I'm talking 200+ hits per second at least. How did
> it handle the load? What sort of hardware did you choose? What load
> balancing? What database?
I've seen this question pop up from time to time and unfortunately there
is no easy answer to it. However, there are some things you can take
away that should make you feel more confident about any web framework
you choose to use.
The 200+ requests per second figure is the part I like to analyze the
most, because I've always wondered when this sort of traffic actually
happens. A prime example of an unexpectedly high traffic spike is the
Digg effect / Slashdot effect / various other names. Usually it hits a
site that is generally low on traffic, say 2,000 hits a day, and
suddenly boosts it to 40k hits in a single hour.
The important thing to see here is that this 'extreme', where for a few
short minutes you suddenly get the kind of attention a front page on
Google or Yahoo gets, is exactly the load you actually have to handle.
In the example above, the site witnessed 78,000 hits per hour during its
first slashdotting, peaking at 1,930 hits per minute.
They don't give us any higher resolution than that, so we can't know
exactly how many hits per second they were seeing, but a naive average
over the peak minute gives roughly 32 hits per second. If we say that
minute's traffic arrived compressed into a 20-second burst, we get up to
about 96 hits per second.
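For what it's worth, the back-of-envelope arithmetic is easy to check;
the burst window is my own pessimistic assumption, chosen so the peak
minute is squeezed into a third of its length:

```python
# Back-of-envelope rates from the slashdotting figures quoted above.
hits_per_hour = 78_000
peak_hits_per_minute = 1_930

avg_per_second = hits_per_hour / 3600        # ~21.7 hits/sec over the whole hour
peak_per_second = peak_hits_per_minute / 60  # ~32 hits/sec in the worst minute

# Pessimistic assumption: the whole peak minute lands in a 20-second burst.
burst_per_second = peak_hits_per_minute / 20  # ~96 hits/sec

print(round(avg_per_second, 1), round(peak_per_second), burst_per_second)
```

Even the pessimistic burst figure is the number to design for, and it is
smaller than people usually guess.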
Can Seaside handle that? James Robertson did some rather coarse
statistics comparing Seaside across Smalltalks and between Seaside 2.8
and Seaside 2.7. Admittedly, he could have done more, but it serves as a
nice rough guide: his results show 79.4 hits per second.
So without real hard data to know exactly what would happen, it appears
on the surface that you would actually survive a slashdotting without
any extra hardware or any extra caching strategies.
With that out of the way, we can now talk about what you would do to
scale beyond that. You can, of course, put caching strategies in front
of your application, distribute it across machines, serve it up on
Amazon's cloud, use one of the distributed web cache engines, and so on.
Applications that lend themselves to this sort of distribution are
rarely transaction based, though, so you can almost certainly rule out a
requirement for writing to a database as part of that 'massive hit'.
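As a sketch of the simplest such strategy, a short-TTL page cache in
front of the renderer absorbs nearly all of a read-only spike. This is
illustrative Python, not Seaside code, and the names are my own:

```python
import time

_cache = {}  # path -> (expires_at, rendered_html)

def cached_page(path, render, ttl=30):
    """Serve `render()`'s output from memory for `ttl` seconds per path."""
    entry = _cache.get(path)
    if entry and entry[0] > time.time():
        return entry[1]            # cache hit: no rendering, no database
    html = render()                # cache miss: render once, then reuse
    _cache[path] = (time.time() + ttl, html)
    return html
```

During a slashdotting nearly every hit is an anonymous read of the same
few pages, so even a 30-second TTL turns thousands of renders into a
handful.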
Even if we say the high hit rate is the norm for your application - and
that would require you to really hit something big like Twitter (and
Twitter maxed out at a mere 5 hits per second until they started
optimizing themselves) - then running two servers at once immediately
doubles your capacity.. capacity growth is near linear as you add
servers.
I wish I had stats for you on running two Seaside images simultaneously
on the same machine with dual cores, but I don't just yet.. sorry about
that.
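Such a two-image setup would typically sit behind a small front-end
proxy that round-robins requests between the images. As an illustrative
sketch of the routing idea (the ports and path are hypothetical):

```python
import itertools

# Hypothetical: two Seaside images behind one front end, e.g. listening
# on ports 8080 and 8081 of the same dual-core machine.
backends = itertools.cycle(["http://localhost:8080",
                            "http://localhost:8081"])

def route(request_path):
    """Round-robin each incoming request to the next image in turn."""
    return next(backends) + request_path
```

In practice you'd let Apache or a similar front end do this, with sticky
sessions so a user's session state stays on the image that created it.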
So we know that Seaside itself isn't really a major bottleneck to
performance, but there are other obvious factors that you've listed
above. The big one is the database: whether it's an object database or a
relational database, you end up with the same sorts of bottlenecks..
number of connections, speed of the connection to the database,
unintentional callback blocking, accidentally reading everything into
memory instead of streaming... that sort of stuff is quite commonly
missed.
Luckily, all of those things have been solved when working with Seaside
and various databases: GemStone is multithreaded, Glorp's interfaces can
do connection pooling, the VisualWorks VM is non-blocking on callbacks,
and most database drivers are streaming based.
This is in contrast to the way Ruby on Rails is initially set up: it
comes with one database connection with a global lock on it. In fact, it
was this design flaw in RoR that caused Twitter to falter so badly. The
biggest change they made to their site was not to stop producing pages
dynamically (although they did in the end) but to use a pool of multiple
database connections at once.
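That fix, a pool of connections borrowed per request instead of one
globally locked handle, can be sketched like this (illustrative Python;
`connect` stands in for whatever your driver's connect call is):

```python
import queue

class ConnectionPool:
    """Hold N open connections; requests borrow and return them."""

    def __init__(self, connect, size=5):
        self._idle = queue.Queue()
        for _ in range(size):
            self._idle.put(connect())   # open all connections up front

    def acquire(self, timeout=None):
        # Blocks (up to `timeout`) only when every connection is in use,
        # instead of serializing all requests behind one global lock.
        return self._idle.get(timeout=timeout)

    def release(self, conn):
        self._idle.put(conn)
```

With a pool of size N, up to N requests can talk to the database
concurrently; with a single locked connection, the N-th request waits
for all the others no matter how idle the database is.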
Another thing that can drastically cripple your application is excessive
memory use, by your application or by other applications on the server,
forcing memory to be paged out to disk. This is an absolute show stopper
and can be caused by excessive state storage in your application. It is
rarely caused by Seaside's continuations; more often it comes from large
object domains read from a database and cached in memory for each user,
or something similar.
Conversely, creating too many objects and throwing them away immediately
(or worse, putting them in a weak set and throwing them away
immediately) can tax the server by putting too much pressure on the
garbage collector. This can easily halve the number of hits per second
you can support. But again, this is a rather rare scenario to be in and
it's easy to diagnose with profiling tools.
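The weak-set variant is easy to demonstrate. In this Python sketch the
object is collected the moment its last strong reference dies, so the
'cache' buys nothing yet still costs an allocation and collection per
request:

```python
import gc
import weakref

class Session:
    pass

recently_seen = weakref.WeakSet()

def handle_request():
    s = Session()            # allocated on every request
    recently_seen.add(s)     # but held only weakly
    # `s` is the only strong reference; it dies when this frame returns

handle_request()
gc.collect()                 # (CPython frees it immediately anyway)
print(len(recently_seen))    # 0: the entry vanished with the object
```

The same pattern in any Smalltalk puts identical pressure on the garbage
collector, which is why a profiler run is the first thing to reach for.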
A few years back the company I worked for did an online federal election
polling site. It was surprisingly popular and experienced regular
'slashdotting' effects. The machine and network were over-specced: quad
core, four servers, load balanced, a huge RAIDed multicore database
server on a 1 Gb network... none of the servers batted an eyelid. We
could easily have taken a hundredfold traffic increase and still
probably have had plenty of CPU time left to play Civ4 on a huge map.
Sorry, this has been a long email, but I think my point was: Seaside is
unlikely to be a major source of bottlenecks if you do start getting a
lot of traffic. The usual strategies for load balancing and removing
bottlenecks on the back end work the same as in any other web framework.