On Mar 22, 2009, at 23:49 , Arun Kumar wrote:

> Hi,
> I know that what I'm going to ask about is a simple problem, but as
> I'm new to Ruby I have not yet learnt much about regular expressions
> in Ruby.
>
> Can anybody tell me how to extract all the contents which are included
> inside the '<html>' and '</html>' tags, and also how to extract the
> text between the '<a>' and '</a>' tags, using regular expressions? I
> know it can be done with the 'scan' method, but I don't know what the
> matching patterns or expressions should be. Can anybody please help me?

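Regexps are about the worst tool for this job: HTML nests, and real
pages are rarely well-formed, so a pattern that works on one page tends
to break on the next. Use an HTML parser instead. Mechanize is a good
place to start: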

   http://mechanize.rubyforge.org/files/GUIDE_txt.html
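
To give a rough idea of what the parser approach looks like, here is a
minimal sketch. It uses REXML rather than Mechanize, only because REXML
ships with Ruby and needs no extra gem. The sample page and the variable
names are made up for illustration, and REXML only accepts well-formed
markup, so for pages fetched from the web a tolerant HTML parser (such
as the one Mechanize uses) is the safer choice.

```ruby
require 'rexml/document'

# Hypothetical sample page. REXML is an XML parser, so the markup
# must be well-formed (every tag closed, attributes quoted).
html = <<~HTML
  <html>
    <body>
      <p>See <a href="http://example.com/one">first link</a> and
      <a href="http://example.com/two">second link</a>.</p>
    </body>
  </html>
HTML

doc = REXML::Document.new(html)

# Everything between <html> and </html>, serialised back to markup.
html_contents = doc.root.children.map(&:to_s).join

# The text between each <a> and </a>, in document order.
link_texts = REXML::XPath.match(doc, '//a').map(&:text)

puts html_contents
p link_texts
```

The point is that the parser tracks nesting and attributes for you, so
the two extractions are one line each and do not depend on how the tags
happen to be written in the page.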