On Sat, Oct 16, 2010 at 3:50 PM, Corey Watts <cwatts / westmont.edu> wrote: > > Hey there everyone. ¨ֲ§ם טבצימח ףליחטפ נעןגלוץףימֽודטבמיתו® ¨ֲ§ם > trying to scrape the yellowpages.com, and extract information about each > business listing. ¨ֲ§ם ורפעבדפימח בלפטימזןעםבפיןקבמפורדונפ זן> one small portion: the business's website. ¨ֲיף פטטעוז ימףיהו ןז > link that I am trying to scrape. ¨ֲזבבף כמןקֹ§זןללןקימפט> correct xpath rules, but I can't seem to get the part I want. ¨ֲמו > tricky thing that I've had to deal with is that not every listing has a > website. ¨ֲטו קוגףיפלימכ במפט¢לובעמ םןעולימכ בעצועש ףיםילבע¬ > xpath-wise, so I have to use an if statement to check the inner text of > both of them to make sure that I'm extracting the xpath one. > > I'm scraping from > http://yellowpages.com/santa-barbara-ca/restaurants?page=1 and my code > is attached. Your xpath: website = website.search("/a/@href") should be: website = website.search("./a/@href") a leading "/" means that you want the xpath search to begin from the root of the document. "./" means to start from the context node, in this case `website`. > > Thanks so much for your help! > > -- > Posted via http://www.ruby-forum.com/. >