Chad Layton wrote:
> James Britt wrote:
> 
>> Chad Layton wrote:
>>
>>> I'm rather new to both web programming and ruby so forgive me if my 
>>> question is ill formed.
>>>
>>> I'm trying to do some screen scraping on a website that requires a 
>>> login  . What I would like to have happen is for the user to login to 
>>> the website normally, then run my script which uses the existing 
>>> login session to grab the page and do whatever to it.
>>>
>>> To illustrate my problem: If I use 
>>> Net::HTTP.get_response(URI.parse("http://foo.bar/baz.php")).body, 
>>> then it  serves up the index asking for a login. How do I get 
>>> contents of baz.php?
>>
>>
>>
>> I suspect that the user agent (i.e., the code, as opposed to a 
>> browser) needs to include site cookies in the request headers.
>>
>> After you sign in using a browser, you'll need to find the cookie left 
>> by the site, or inspect a session cookie if the browser is not writing 
>> it to disk.  Most browsers have a way to show cookies sent by a site.
>>
>>
>>
>> James
>>
>>
> 
> Thank you, James. I see that when I login to the site 4 cookies are set, 
>   how would I include them in the request headers?

I *think* you pass a hash into the Net::HTTP initializer, or perhaps as 
a parameter to 'get' but I can't find docs or examples to prove this.



James
> 
> 


-- 

http://www.ruby-doc.org       - Ruby Help & Documentation
http://www.artima.com/rubycs/ - Ruby Code & Style: Writers wanted
http://www.rubystuff.com      - The Ruby Store for Ruby Stuff
http://www.jamesbritt.com     - Playing with Better Toys
http://www.30secondrule.com   - Building Better Tools