------art_69805_851537.1181299014118
Content-Type: text/plain; charset=ISO-8859-1; format=flowed
Content-Transfer-Encoding: 7bit
Content-Disposition: inline

Hi Peter and George,

I appreciate, though in the end I went with:

# load the Securityfocus home page (id 715 to start)
doc  pricot(open("http://www.securityfocus.com/bid/715"))

# get the content of the div id ulnerability"
p 
oc/'#vulnerability').inner_html

# prints div id  vulnerability'
puts p
However, this is 'only'for the record with the id 715. What do I need to add
to fetch the content of all the different id's (1.... 25000) on
securityfocus.com? Once I have them I'll have an xml created for each using:
securityfocus_data.to_xml.write($stdout, 1)

Thanks,
Tom

--


On 6/8/07, Peter Szinek <peter / rubyrailways.com> wrote:
>
> Sorry, I have meant
>
> p doc/"//div[@idulnerability']"
>
> Cheers,
> Peter
> __
> http://www.rubyrailways.com :: Ruby and Web2.0 blog
> http://scrubyt.org :: Ruby web scraping framework
> http://rubykitchensink.ca/ :: The indexed archive of all things Ruby.
>
>
>
>

------art_69805_851537.1181299014118--