Issue #10097 has been updated by Martin Drst.


Nobuyoshi Nakada wrote:
> Is this correct?
> https://github.com/nobu/ruby/compare/windows-1252

Sorry for the very slow response. Please commit. Thanks!

----------------------------------------
Bug #10097: Case-insensitive Regexp matching for Windows-1252 not working for 
https://bugs.ruby-lang.org/issues/10097#change-55458

* Author: Martin Drst
* Status: Open
* Priority: Normal
* Assignee: 
* ruby -v: 1.9.3p545
* Backport: 2.0.0: UNKNOWN, 2.1: UNKNOWN
----------------------------------------
By chance I had a look at enc/iso_8859_1.c and found

~~~C
ENC_REPLICATE("Windows-1252", "ISO-8859-1")
~~~
on line 288. But this does not work for case folding:

~~~ruby
# http://en.wikipedia.org/wiki/Windows-1252
s1 = "\u0160".encode 'windows-1252' # ''
r1 = Regexp.new("\u0161".encode('windows-1252'), Regexp::IGNORECASE) # //i
s1 =~ r1
   # => nil
s2 = "\u0178".encode 'windows-1252' # ''
r2 = Regexp.new("\u00FF".encode('windows-1252'), Regexp::IGNORECASE) # //i
s2 =~ r2
   # => nil
s3 = "\u00C0".encode 'windows-1252' # ''
r3 = Regexp.new("\u00E0".encode('windows-1252'), Regexp::IGNORECASE) # //i
s3 =~ r3
   # => 0
~~~

So case-insensitive matching works when both characters are in iso-8859-1, but not when one () or both () characters are not in iso-8859-1.



-- 
https://bugs.ruby-lang.org/