On Tue, Mar 29, 2011 at 9:45 PM, ctdev <ctdev421 / gmail.com> wrote:
> However now I'm trying to do simple string substitution with gsub()
> and am getting the error:
>
> =A0invalid byte sequence in UTF-8
>
> An example of where this is bombing is the word "PROT\xC9G=C9" as parsed
> by Nokogiri.

What is the encoding of your input HTML file?