Jano Svitok wrote:
> On 9/20/07, Chuck Dawit <chuckdawit / gmail.com> wrote:
>> I want to run a check to see which pages have forms and which ones don't
>> from a file with url's. I'm using the size of the form to make that
>> determination. But after I get to the 13 url in the file I get an error
>> and the script exists. Does anyone know why?
> 
> The error means mechanize could not read the webpage. Find out if it's
> really the 13th url, no matter in what order they are, or whether is
> it some particular url that makes problems.
> (find the offending url and try that on its own).
> 
> If it's some particular url, try accessing the page from browser.
> Otherwise, it might be a problem with mechanize and/or Net::Http or
> anything that they use.
> 
> Finally few changes/enhancements, not related to your problem:
> 
> File.open("eliminate.txt") do |f|
>    noformfile = File.new("noform.txt", "w+")
>    formfile = File.new("form.txt" , "w+")
> 
>    agent = WWW::Mechanize.new
> 
>    f.each do |line|
>       page = agent.get(line)
>       forms = page.forms
>       if forms.size > 0 then
>          formfile.puts line
>       else
>          noformfile.puts line
>       end
>   end
> end




This is the error message I'm getting. It's not related to the 13th url 
its more like a buf overflow problem. It will crash on anyones pc.

c:/ruby/lib/ruby/1.8/net/protocol.rb:133:in `sysread': end of file
reached (EOFError)
        from c:/ruby/lib/ruby/1.8/net/protocol.rb:133:in `rbuf_fill'
        from c:/ruby/lib/ruby/1.8/timeout.rb:56:in `timeout'
        from c:/ruby/lib/ruby/1.8/timeout.rb:76:in `timeout'
        from c:/ruby/lib/ruby/1.8/net/protocol.rb:132:in `rbuf_fill'
        from c:/ruby/lib/ruby/1.8/net/protocol.rb:116:in `readuntil'
        from c:/ruby/lib/ruby/1.8/net/protocol.rb:126:in `readline'
        from c:/ruby/lib/ruby/1.8/net/http.rb:2017:in `read_status_line'
        from c:/ruby/lib/ruby/1.8/net/http.rb:2006:in `read_new'
        from c:/ruby/lib/ruby/1.8/net/http.rb:1047:in `request'
        from
c:/ruby/lib/ruby/gems/1.8/gems/mechanize-0.6.10/lib/mechanize.rb:514:in
`fetch_page'
        from
c:/ruby/lib/ruby/gems/1.8/gems/mechanize-0.6.10/lib/mechanize.rb:600:in
`fetch_page'
        from
c:/ruby/lib/ruby/gems/1.8/gems/mechanize-0.6.10/lib/mechanize.rb:185:in
`get'
        from ciscoScrape.rb:120
        from ciscoScrape.rb:118:in `each'
        from ciscoScrape.rb:118
-- 
Posted via http://www.ruby-forum.com/.