[ruby-core:89386] [Ruby trunk Feature#14839] How to deal with capitalizing Georgian in Unicode 11.0.0

From: labulashvili@...
Date: 2018-10-12 10:50:53 UTC
List: ruby-core #89386
Issue #14839 has been updated by webzorg (Lasha Abulashvili).


duerst (Martin Dürst) wrote:
> @mame
> 
> mame (Yusuke Endoh) wrote:
> > Just FYI.  Python 3.7 supports Unicode11, and behaves as follows.
> > 
> > ```
> > $ ./local/bin/python3
> > Python 3.7.0 (default, Oct 12 2018, 11:29:22) 
> > [GCC 7.3.0] on linux
> > Type "help", "copyright", "credits" or "license" for more information.
> > >>> 'იანვარი'.upper()
> > 'ᲘᲐᲜᲕᲐᲠᲘ'
> > >>> 'იანვარი'.title()
> > 'იანვარი'
> > ```
> 
> Many thanks for checking Python. The results make sense given the Unicode data, and align with my current proposal.
> 
> Can you please try `'იანვარი'.upper().title()`? I'm really interested in what result Python produces is that case. A straightforward implementation would produce 'Იანვარი', but as I said above, I'm not sure this is acceptable.

Yes that produced 'Იანვარი', which I would agree with you, is going against the language rules.

----------------------------------------
Feature #14839: How to deal with capitalizing Georgian in Unicode 11.0.0
https://bugs.ruby-lang.org/issues/14839#change-74426

* Author: duerst (Martin Dürst)
* Status: Feedback
* Priority: Normal
* Assignee: duerst (Martin Dürst)
* Target version: 
----------------------------------------
This is a request for feedback. In particular if you are from Georgia (the country, not the US state), or if you know somebody (who knows somebody,...) from Georgia, feedback on this issue is very much appreciated. If I don't get any feedback, I'll precede as explained below.

Unicode 11.0.0 introduces an upper-case version of present-day Georgian letters called Mtavruli (the lower case letters are called Mkhedruli). Mtavruli letters are only used to empthasize whole words; there is no initial-letter capitalization in Georgian. Therefore, the Mkhedruli letters do not have Mtavruli letters as their titlecase, but are explicitly mapped to themselves. This means that in Ruby, `mkhedruli.capitalize` would be a no-op although `mkhedruli.upcase` would convert to Mtavruli letters.

Additional pointers:
http://www.unicode.org/versions/Unicode11.0.0/#Migration
http://www.unicode.org/charts/PDF/Unicode-11.0/U110-1C90.pdf
http://www.unicode.org/versions/Unicode11.0.0/ch07.pdf (Section 7.7, Georgian, pp. 320-321)




-- 
https://bugs.ruby-lang.org/

Unsubscribe: <mailto:ruby-core-request@ruby-lang.org?subject=unsubscribe>
<http://lists.ruby-lang.org/cgi-bin/mailman/options/ruby-core>

In This Thread

Prev Next