--000e0cd1186a0922ab0481a2d149 Content-Type: text/plain; charset=ISO-8859-1 On Fri, Mar 12, 2010 at 3:54 PM, Ben Bleything <ben / bleything.net> wrote: > On Fri, Mar 12, 2010 at 1:22 PM, Josh Cheek <josh.cheek / gmail.com> wrote: > > The problem is that the path is relative. > > I've done a little googling, queried my ruby and rails ML archives, > glanced > > at hpricot code, and looked through the method lists for open-uri and > > hpricot. > > So far, I don't see anything that looks very useful. > > Is there a way to have it give me the absolute path so that I can > reference > > the picture later? > > Hpricot is just telling you what's in the HTML. Munging the > document's contents are your responsibility, not the parser's :) > > > The only thing I've found that works so far involves string manipulation, > > which seems like a brittle workaround to replace something that probably > > exists if I could just find it. > > Look into the URI library. > > require 'uri' > > uri RI.parse( "http://yfrog.com/03gssacj" ) > uri.path your hpricot magic to get the image path goes here > > Ben > > Thanks, this is what I am using now: page open url image_path URI.parse page.base_uri.to_s.sub( %r(/$) , '' ) image_path.path (Hpricot(page)%"#main_image").attributes['src'] image_path.to_s It still seems a little excessive, but it's a lot better than what I had before. --000e0cd1186a0922ab0481a2d149--