On Jul 26, 2007, at 6:25 PM, David Rush wrote:
> Hi y'all,
>
> This may end up being regarded as an incendiary posting, but it's not
> meant to be. This is just an observation from a relative Ruby (in
> general) and Rails (in particular) newb.
>
> So I'm beavering away at my lovely little start-up desk and really
> rather enjoying Ruby (in between the moments of utter frustration :)
> and I start coding up some ETL processes to load and merge masses of
> data into my bouncing baby web-system. And all is relatively good
> until I get to my first tricky merge process where I have to
> disambiguate names and otherwise harmonize my various data sources.
>
> The process takes over 12 hours to run using ActiveRecord to provide
> my DB access. For 5500 records.
<snip>
>
> david rush
> --
> http://cyber-rush.org/drr -- a very messy web^Wconstruction site

Although it may be too late, might I suggest that ActiveWarehouse ETL
could be a good place to (re-)start? The Rubyforge site for it is:

http://rubyforge.org/frs/?group_id=2435

-Rob

Rob Biedenharn    http://agileconsultingllc.com
Rob / AgileConsultingLLC.com