------ art_69805_851537.1181299014118
Content-Type: text/plain; charset=ISO-8859-1; format=flowed
Content-Transfer-Encoding: 7bit
Content-Disposition: inline
Hi Peter and George,
I appreciate, though in the end I went with:
# load the Securityfocus home page (id 715 to start)
doc pricot(open("http://www.securityfocus.com/bid/715"))
# get the content of the div id ulnerability"
p
oc/'#vulnerability').inner_html
# prints div id vulnerability'
puts p
However, this is 'only'for the record with the id 715. What do I need to add
to fetch the content of all the different id's (1.... 25000) on
securityfocus.com? Once I have them I'll have an xml created for each using:
securityfocus_data.to_xml.write($stdout, 1)
Thanks,
Tom
--
On 6/8/07, Peter Szinek <peter / rubyrailways.com> wrote:
>
> Sorry, I have meant
>
> p doc/"//div[@id ulnerability']"
>
> Cheers,
> Peter
> __
> http://www.rubyrailways.com :: Ruby and Web2.0 blog
> http://scrubyt.org :: Ruby web scraping framework
> http://rubykitchensink.ca/ :: The indexed archive of all things Ruby.
>
>
>
>
------ art_69805_851537.1181299014118--