Issue #13588 has been updated by phluid61 (Matthew Kerwin).


haines (Andrew Haines) wrote:
> 
> ~~~ ruby
>   until char.size == max_char_size || eof?
>     char << read(min_char_size)
> ~~~

I hope there are no encodings where valid characters might not be a multiple of the minimum size.

----------------------------------------
Feature #13588: Add Encoding#min_char_size, #max_char_size, #minmax_char_size
https://bugs.ruby-lang.org/issues/13588#change-65043

* Author: haines (Andrew Haines)
* Status: Feedback
* Priority: Normal
* Assignee: 
* Target version: 
----------------------------------------
When implementing an IO-like object, I'd like to handle encoding correctly. To do so, I need to know the minimum and maximum character sizes for the encoding of the stream I'm reading. However, I can't find a way to access this information from Ruby (I ended up writing a gem with a native extension [1] to do so).

I'd like to propose adding instance methods `min_char_size`, `max_char_size`, and `minmax_char_size` to the `Encoding` class to expose the information stored in the `OnigEncodingType` struct's `min_enc_len` and `max_enc_len` fields.

~~~ ruby
Encoding::UTF_8.min_char_size     # => 1
Encoding::UTF_8.max_char_size     # => 6
Encoding::UTF_8.minmax_char_size  # => [1, 6]
~~~

[1] https://github.com/haines/char_size



-- 
https://bugs.ruby-lang.org/

Unsubscribe: <mailto:ruby-core-request / ruby-lang.org?subject=unsubscribe>
<http://lists.ruby-lang.org/cgi-bin/mailman/options/ruby-core>