On Feb 2, 2005, at 6:18 PM, jm wrote:

> Processing flow data from a router. The script caches user information 
> from a database then processes a 25-30MB flow file captured from a 
> router using flow-tools. This takes about 5 minutes on a 2.4GHz 
> Pentium 4 with no other load, running at 98% utilisation 
> continuously. To put this into context, each flow file is only 15 
> minutes worth of data and the current perl version does it in a bit 
> over 2 minutes. This perl version is showing its lack of design in a 
> variety of ways including the nightmare of trying to add features it 
> was never designed to support. So this was a good opportunity to 
> rewrite it in ruby to make it more maintainable, etc.
>
> While 5 minutes is within the time constraint, that is on an unloaded 
> machine, and the machine it's destined for has other processes sharing 
> the CPU.

I have an open bug for the postgres dbi handler. It causes about a 7x 
slowdown in performance for even basic queries. I now use the raw 
postgres library to do all my work and the speed is actually fairly 
good.
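
If you want to check whether the database layer is what's eating your time, it may help to time the user-cache phase separately from the flow processing. Here is a minimal sketch, not your actual script: the method names and the record layouts (IP/user pairs, IP/byte-count pairs) are hypothetical, and a small in-memory array stands in for the real query so the example runs without a database.

```ruby
require 'benchmark'

# Hypothetical stand-in for the database query that builds the user cache.
# In a real script this is where the DBI or raw postgres call would go.
def load_user_cache(rows)
  cache = {}
  rows.each { |ip, user| cache[ip] = user }
  cache
end

# Hypothetical flow record layout: [source_ip, byte_count].
# Sums bytes per user, using the cache instead of a query per record.
def total_bytes_per_user(flows, cache)
  totals = Hash.new(0)
  flows.each do |ip, bytes|
    user = cache[ip] || 'unknown'
    totals[user] += bytes
  end
  totals
end

# Made-up sample data, for illustration only.
rows  = [['10.0.0.1', 'alice'], ['10.0.0.2', 'bob']]
flows = [['10.0.0.1', 1500], ['10.0.0.2', 400],
         ['10.0.0.1', 100],  ['10.0.0.9', 64]]

cache  = nil
totals = nil

# Time each phase on its own so the slow one stands out.
db_time   = Benchmark.realtime { cache = load_user_cache(rows) }
flow_time = Benchmark.realtime { totals = total_bytes_per_user(flows, cache) }

printf("cache: %.6fs  flows: %.6fs\n", db_time, flow_time)
```

If the cache phase dominates, swapping the driver is likely the cheaper fix; if the flow loop dominates, the database layer probably isn't your problem.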

--
ryand-ruby / zenspider.com - Seattle.rb - 
http://www.zenspider.com/seattle.rb
http://blog.zenspider.com/ - http://rubyforge.org/projects/ruby2c