Marcio Francisco wrote:
> Hello
>
> Does anyone knows about an html parser in ruby?
> One that separate all tags...
>
> thx
>
> --
> Posted via http://www.ruby-forum.com/.

I'm time trialling Python and Ruby parsers right now, so I'm going to spew you a list of links for:

html-parser-2
htree
ymHTML module
htmltools: requires patched html-parser (gem)
rubyful soup:
WWW::Mechanize # built on htmltools, xmltree,
htmltokenizer: handles mismatched tags (gem)
REXML: Tree & stream parsing

(Yeah, that's a lot of libs)

Here's the spew:

http://raa.ruby-lang.org/project/html-parser-2/
http://diveintopython.org/html_processing/index.html
http://cvs.m17n.org/~akr/htree/
http://www.yoshidam.net/Ruby.html#ymHTML
http://ruby-htmltools.rubyforge.org/
http://ruby-htmltools.rubyforge.org/doc/
http://bike-nomad.com/ruby/
http://rubyforge.org/projects/wee/
http://neurogami.com/cafe-fetcher/
http://rubyforge.org/projects/htmltokenizer/
http://www.germane-software.com/software/rexml/
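
For what it's worth, here is a minimal sketch of the REXML tree-parsing route from the list above, since "separate all tags" is easy to show with it. Assumptions on my part: the input is well-formed XHTML (REXML is an XML parser and raises REXML::ParseException on mismatched tags), and the SAMPLE markup and the tag_names / collect_names helpers are made up for illustration, not part of any library.

```ruby
require "rexml/document"

# Hypothetical sample input; it must be well-formed for REXML to accept it.
SAMPLE = <<~MARKUP
  <html>
    <body>
      <h1>Title</h1>
      <p>Some <b>bold</b> text</p>
    </body>
  </html>
MARKUP

# Hypothetical helper: walks the element tree depth-first and appends
# each element's tag name to +names+.
def collect_names(element, names)
  names << element.name
  element.elements.each { |child| collect_names(child, names) }
  names
end

# Hypothetical helper: parses +markup+ and returns every tag name
# in document order.
def tag_names(markup)
  doc = REXML::Document.new(markup)
  collect_names(doc.root, [])
end

p tag_names(SAMPLE)  # prints ["html", "body", "h1", "p", "b"]
```

For real-world pages with sloppy markup this will fall over, which is where something from the list that handles mismatched tags would be the better pick.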