According to Nokogiri's doc, it works internally in UTF-8.
Running this :

# encoding: utf-8

require 'nokogiri'

class MyDoc < Nokogiri::XML::SAX::Document
  def characters(string)
    puts string.encoding
    puts string
  end
end

puts RUBY_VERSION
puts Encoding.default_external

parser = Nokogiri::XML::SAX::Parser.new(MyDoc.new, 'UTF-8')
parser.parse('<foo>épée</foo>')

gives :

1.9.2
UTF-8
UTF-8
¥Æ¦Ä¥¥p¥Æ¦Ä¥¥e

Why ?
_md
-- 
Posted via http://www.ruby-forum.com/.