On Sun, 6 Jun 2004 05:02:25 +0900, Dick Davies
<rasputnik / hellooperator.net> wrote:
> 
> I was trying to use RSSscraper to pull some web forums, and something
> low-level went bang in the Net::* libraries.
> 
> I found some old references to this error from last year, and I
> got the impression it was platform specific?
> 
> Can anyone else let me know if this causes problems for them?
> 
> It's obviously site-specific: url = 'http://www.google.com' has no problems...
> 
> Here's the minimal code (open(url)... is 'line 6' in the code below):
> 
>   require 'open-uri'
> 
>   url = 'http://p218.ezboard.com/fdebatingukfrm9'
>   page = open(url).readlines
> 
> If I run this I get:
> 
> rasputin@lb:rss$ ./regex.rb
> /data/ruby/lib/ruby/1.9/net/protocol.rb:135:in `sysread': End of file reached (EOFError)
>         from /data/ruby/lib/ruby/1.9/net/protocol.rb:135:in `rbuf_fill'
>         from /data/ruby/lib/ruby/1.9/net/protocol.rb:116:in `readuntil'
>         from /data/ruby/lib/ruby/1.9/net/protocol.rb:126:in `readline'
>         from /data/ruby/lib/ruby/1.9/net/http.rb:1850:in `read_status_line'
>         from /data/ruby/lib/ruby/1.9/net/http.rb:1839:in `read_new'
>         from /data/ruby/lib/ruby/1.9/net/http.rb:934:in `request'
>         from /data/ruby/lib/ruby/1.9/net/http.rb:834:in `request_get'
>         from /data/ruby/lib/ruby/1.9/open-uri.rb:545:in `proxy_open'
>          ... 7 levels...
>         from /data/ruby/lib/ruby/1.9/open-uri.rb:134:in `open_uri'
>         from /data/ruby/lib/ruby/1.9/open-uri.rb:424:in `open'
>         from /data/ruby/lib/ruby/1.9/open-uri.rb:85:in `open'
>         from ./regex.rb:6
> 
> This is exactly the error I was getting on the front of RSSscraper.
> If it helps narrow it down, through a proxy I get:
> 
> rasputin@lb:rss$ ./regex.rb
> /data/ruby/lib/ruby/1.9/open-uri.rb:574:in `proxy_open': 503 Service Unavailable (OpenURI::HTTPError)
>         from /data/ruby/lib/ruby/1.9/open-uri.rb:167:in `open_loop'
>         from /data/ruby/lib/ruby/1.9/open-uri.rb:164:in `catch'
>         from /data/ruby/lib/ruby/1.9/open-uri.rb:164:in `open_loop'
>         from /data/ruby/lib/ruby/1.9/open-uri.rb:134:in `open_uri'
>         from /data/ruby/lib/ruby/1.9/open-uri.rb:424:in `open'
>         from /data/ruby/lib/ruby/1.9/open-uri.rb:85:in `open'
>         from ./regex.rb:6
> 


It appears to me that this site refuses to respond unless the request
header carries a User-Agent it recognizes.  That's probably why the
plain open-uri request fails with the EOFError.
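
If that guess is right, passing a User-Agent header through open-uri
should be enough to test it. Here is a minimal sketch, not a confirmed
fix: the USER_AGENT string and the request_headers and fetch_lines
helpers are made up for illustration, and I don't know which agent
strings this site actually accepts. It uses URI.open, which recent
Rubies require; on older versions, open(url, headers) takes the same
header hash.

```ruby
require 'open-uri'

# Hypothetical browser-like agent string; an assumption, not a value
# the site is known to accept.
USER_AGENT = 'Mozilla/5.0 (compatible; rss-test)'

# Build the extra request headers that open-uri should send.
def request_headers(user_agent = USER_AGENT)
  { 'User-Agent' => user_agent }
end

# Fetch the page and return its lines, sending the User-Agent header.
# open-uri treats String keys in the options hash as HTTP header fields.
def fetch_lines(url, user_agent = USER_AGENT)
  URI.open(url, request_headers(user_agent)) { |page| page.readlines }
end

# Usage (needs network access):
#   page = fetch_lines('http://p218.ezboard.com/fdebatingukfrm9')
```

If the request still dies with the same EOFError once the header is
set, then the User-Agent isn't the cause and something else is going
on.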

Chad