Ned Konz wrote:


> Parsing HTML with regexes is not a good idea (that is, it's more complex
> than you'd think). For specific files you could use something like
> 
> <TR>.*?</TR>
> 
> the following question mark makes it non-greedy.

Ah yes. Thanks.
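
For the archives, a minimal sketch of the difference. The sample string is made up for illustration, and I've added the /m flag so that . also matches newlines inside a row:

```ruby
# Hypothetical sample input; not taken from the real page.
html = "<TABLE><TR><TD>one</TD></TR>\n<TR><TD>two</TD></TR></TABLE>"

# Greedy: .* runs on to the last </TR>, so both rows come back as one match.
greedy = html.scan(/<TR>.*<\/TR>/m)

# Non-greedy: .*? stops at the first </TR> it can, giving one match per row.
rows = html.scan(/<TR>.*?<\/TR>/m)

puts greedy.length   # prints 1
puts rows.length     # prints 2
```

This still breaks on nested tables or on rows written as <tr ...> with attributes, which is presumably the "more complex than you'd think" part.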

> But you should also look at the html-parser module.

At RAA? Will do. I guess I could also extract the table and parse it as XML,
since, give or take a little filtering of the odd <br> tag and unquoted
attributes, it appears to be pseudo-XML.
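
Roughly what I have in mind, as an untested-on-real-data sketch. It assumes REXML is available as the XML parser; the sample fragment and the tidy helper are hypothetical, and the filtering covers only the two problems mentioned above:

```ruby
require "rexml/document"

# Hypothetical fragment showing both problems: a bare <br> and unquoted attributes.
raw = "<table border=1><tr><td>one<br>two</td><td align=left>three</td></tr></table>"

# tidy is a made-up helper name: it closes <br> tags and quotes bare
# attribute values, touching only text that sits inside a tag.
def tidy(html)
  closed = html.gsub(/<br\s*>/i, "<br/>")
  closed.gsub(/<[^>]+>/) do |tag|
    tag.gsub(/(\s[\w:-]+)=([^\s"'>]+)/, '\1="\2"')
  end
end

doc = REXML::Document.new(tidy(raw))

# Collect the text of each cell, joining the pieces on either side of a <br/>.
cells = REXML::XPath.match(doc, "//td").map do |td|
  td.texts.map(&:value).join(" ")
end

p cells
```

XML is case-sensitive, so a page that mixes <TR> and <tr> would need its tag names normalised first. Anything messier than this (unclosed <td> or <p>, stray ampersands) would still make the parser raise.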

> I've written a module that knows about HTML structure and am writing one
> that builds a tree.

Look forward to seeing it.

Thanks for the reply.

Aidan