In article <003c01c1d2bc$48c35510$0e01a8c0@fightclub>, Max Maischein wrote:

> (and also some other weird stuff that I can think off). The best way IMO is
> to simply parse the HTML with an HTML parser - I've ported the HTML::Parser
> from Perl to Ruby, so if there is no other parser, I'll post it somewhere
> for people to review/criticise.

I have seen, but not used
http://www.jin.gr.jp/~nahi/Ruby/html-parser/README.html

Mike

-- 
mike / stok.co.uk                    |           The "`Stok' disclaimers" apply.
http://www.stok.co.uk/~mike/       | GPG PGP Key      1024D/059913DA 
mike / starnix.com                   | Fingerprint      0570 71CD 6790 7C28 3D60
http://www.starnix.com/            |                  75D2 9EC4 C1C0 0599 13DA