Marcio Francisco wrote:
> Hello
>
> Does anyone know of an HTML parser in Ruby?
> One that separates all the tags...
>
> thx
>
> --
> Posted via http://www.ruby-forum.com/.

I'm time-trialling Python and Ruby parsers right now, so I'm going to
spew a list of links at you for:

html-parser-2
htree
ymHTML module
htmltools: requires patched html-parser (gem)

Rubyful Soup
WWW::Mechanize: built on htmltools, xmltree
htmltokenizer: handles mismatched tags (gem)

REXML: Tree & stream parsing
(Yeah, that's a lot of libs)

Here's the spew:

http://raa.ruby-lang.org/project/html-parser-2/
http://diveintopython.org/html_processing/index.html
http://cvs.m17n.org/~akr/htree/

http://www.yoshidam.net/Ruby.html#ymHTML

http://ruby-htmltools.rubyforge.org/
http://ruby-htmltools.rubyforge.org/doc/
http://bike-nomad.com/ruby/

http://rubyforge.org/projects/wee/
http://neurogami.com/cafe-fetcher/
http://rubyforge.org/projects/htmltokenizer/

http://www.germane-software.com/software/rexml/
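
Since the original question was about separating out all the tags, here is a rough sketch of the two REXML styles mentioned in the list (tree and stream parsing). Big caveat: REXML is an XML parser, so this assumes well-formed, XHTML-ish input and will raise on real-world tag soup; for that you'd want one of the HTML-specific libs above. SAMPLE, walk_tags, TagCollector and tag_events are names I made up for illustration, not part of REXML.

```ruby
require 'rexml/document'
require 'rexml/streamlistener'

# Hypothetical sample input. It must be well-formed, because REXML
# parses XML and does not repair mismatched tags.
SAMPLE = '<html><body><p class="intro">Hello</p>' \
         '<p>World <b>again</b></p></body></html>'

# Tree parsing: build the whole document in memory, then walk it
# depth-first and collect every element name in document order.
def walk_tags(element, names = [])
  names << element.name
  element.elements.each { |child| walk_tags(child, names) }
  names
end

# Stream parsing: REXML calls back into a listener as it reads,
# so no tree is ever built.
class TagCollector
  include REXML::StreamListener

  attr_reader :events

  def initialize
    @events = []
  end

  # Called for each opening tag; attrs is a Hash of attribute name => value.
  def tag_start(name, attrs)
    @events << [:start, name, attrs]
  end

  # Called for each closing tag.
  def tag_end(name)
    @events << [:end, name]
  end

  # Called for character data; whitespace-only runs are skipped here.
  def text(data)
    @events << [:text, data] unless data.strip.empty?
  end
end

# Convenience wrapper: run the stream parser and return the collected events.
def tag_events(markup)
  listener = TagCollector.new
  REXML::Document.parse_stream(markup, listener)
  listener.events
end
```

To try it, call walk_tags(REXML::Document.new(SAMPLE).root) for the tree version, or tag_events(SAMPLE) for the stream version. The tree style is handier when you need to navigate around the document; the stream style should use less memory on big pages.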