At Fri, 06 Feb 2004 10:29:43 +0900 wrote Harry Ohlsen:

> Some background ...
> 
> I have an application where there are many identical strings 
> (the data consists of huge chunks of XML, with a lot of 
> duplication in both the tag names and the CDATA content).
> 
> I've written a tiny XML parser in C, because trying to load these
> documents using REXML ran all night and was still running the next day,
> presumably due to the size (hundreds of thousands of tags).

Have you already tried xmlparser (wrapper around
expat)? It's quite fast. I use it for huge XML documents where
rexml and nqxml are way too slow.

Ralf.