On Fri, Nov 9, 2012 at 7:09 AM, <sto.mar / web.de> wrote:

> Am 09.11.2012 03:28, schrieb Mario Trento:
>
> Depending on the formatting etc. of the real data, this could
> get pretty difficult, but for your simple example data
> iterating over the lines and using a regular expression
> (with a named capture group) would work:
>

I find processing tag structures with line oriented tools pretty uncool.
:-)  In fact it's also error prone like you state yourself:

But it certainly is not the "most cool way" and will break when
> the html is formatted differently, like e.g.
>
> <tr>
>   <td>...</td><td>...</td><td>..**.</td>
> </tr>
>
> or
>
> <tr>
>   <td id='whatever'>...</td>
>   ...
> </tr>
>

Nokogiri rules!

Kind regards

robert

-- 
remember.guy do |as, often| as.you_can - without end
http://blog.rubybestpractices.com/