On Jul 26, 2007, at 6:25 PM, David Rush wrote:

> Hi y'all,
>
> This may end up being regarded as an incendiary posting, but it's not
> meant to be. This is just an observation from a relative Ruby (in
> general) and Rails (in particular) newb.
>
> So I'm beavering away at my lovely little start-up desk and really
> rather enjoying Ruby (in between the moments of utter frustration :)
> and I start coding up some ETL processes to load and merge masses of
> data into my bouncing baby web-system. And all is relatively good
> until I get to my first tricky merge process where I have to
> disambiguate names and otherwise harmonize my various data sources.
>
> The process takes over 12 hours to run using ActiveRecord to provide
> my DB access. For 5500 records.
<snip>
>
> david rush
> --
> http://cyber-rush.org/drr -- a very messy web^Wconstruction site

Although it may be too late, might I suggest that

   ActiveWarehouse ETL

could be a good place to (re-)start?

The Rubyforge site for it is:
http://rubyforge.org/frs/?group_id=2435

-Rob

Rob Biedenharn		http://agileconsultingllc.com
Rob / AgileConsultingLLC.com