On Friday, February 6, 2004, 5:39:15 AM, Dave wrote:


> Martin Hart wrote:
>> What do people use to parse this into something useful?  Is REXML an option
>> (although the html is not likely to be valid xml)?  I have looked at the
>> html-parser on RAA but do not seem to be able to individually access the
>> components of the returned page (for example I need to see what the contents
>> of a text control are - or what the caption of the <h2> tag is.

> see http://ruby-htmltools.rubyforge.org/

> I used this library about a year ago, and found it pretty buggy.

For the OP: you can use the above library to convert HTML into a
REXML::Document, then pull it apart as you please.

Gavin