At Fri, 06 Feb 2004 10:29:43 +0900 wrote Harry Ohlsen: > Some background ... > > I have an application where there are many identical strings > (the data consists of huge chunks of XML, with a lot of > duplication in both the tag names and the CDATA content). > > I've written a tiny XML parser in C, because trying to load these > documents using REXML ran all night and was still running the next day, > presumably due to the size (hundreds of thousands of tags). Have you already tried xmlparser (wrapper around expat)? It's quite fast. I use it for huge XML documents where rexml and nqxml are way too slow. Ralf.