Nicolas Cavigneaux <bounga / altern.org> wrote:
> Some time ago I wrote some Ruby code that lets me follow web
> links and easily retrieve interesting files. This little program
> works well. To extract the links from a downloaded web page I use
> URI.extract, and I've noticed that URI.extract misses a lot of links. In
> fact, URI.extract doesn't understand (resolve?) relative links (for
> example link). Am I wrong? If not,
> what would you advise so that I can be sure to retrieve all the relative links?


I guess you are using Ruby 1.9 from CVS?

I just read this in Oniguruma's ChangeLog:
2004/05/25: [bug]  (thanks Masahiro Sakai) [ruby-dev:23560]
                   ruby -ruri -ve 'URI::ABS_URI =~
                                    "http://example.org/Andr\xC3\xA9"'
                   nested STK_REPEAT type stack can't backtrack repeat_stk[].
                   add OP_REPEAT_INC_SG and OP_REPEAT_INC_NG_SG.

I don't know exactly what that problem was, only that it was URI-related.


Does it work on Ruby 1.8.1 or 1.8.2?
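
As for the relative links themselves: as far as I know, URI.extract only
picks up strings that look like absolute URIs (scheme included), so a
relative href would be skipped whatever the regexp engine does. One
possible workaround, as a rough sketch only: pull the href values out of
the page yourself and resolve them against the page's URL with URI.join.
The helper name extract_links, the sample HTML and the example.org URLs
below are made up for illustration, and the href regexp is deliberately
naive (a real HTML parser would be more robust).

```ruby
require 'uri'

# Hypothetical helper (not part of the uri library): collect the href
# attribute values found in +html+ and resolve each one against +base_url+.
def extract_links(html, base_url)
  # Naive pattern: only matches quoted href="..." / href='...' attributes.
  hrefs = html.scan(/href\s*=\s*["']([^"']+)["']/i).flatten
  # URI.join resolves relative references against the base URL and leaves
  # absolute URLs untouched. It raises on malformed input.
  hrefs.map { |href| URI.join(base_url, href).to_s }
end

# Made-up sample page and base URL, for illustration only.
html = '<a href="http://example.org/a.html">a</a> ' \
       '<a href="docs/b.html">b</a> ' \
       '<a href="/c.html">c</a>'
base = 'http://example.org/dir/index.html'

# For comparison: what URI.extract finds when limited to http/https.
absolute_only = URI.extract(html, %w[http https])

links = extract_links(html, base)
puts links
```

URI.join can raise (for example URI::InvalidURIError) on hrefs that are
not valid URI references, so a real crawler would probably want to rescue
that and skip the offending link.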

-- 
Simon Strandgaard