Re: Unicode width data inconsistent/outdated
- Date: Wed, 26 Jul 2017 23:43:43 +0200
- From: Thomas Wolff <towo@xxxxxxxx>
- Subject: Re: Unicode width data inconsistent/outdated
Am 26.07.2017 um 11:50 schrieb Corinna Vinschen:
On Jul 26 03:16, Yaakov Selkowitz wrote:
On 2017-07-26 03:08, Corinna Vinschen wrote:
On Jul 26 08:49, Thomas Wolff wrote:
It would be good to keep wcwidth/wcswidth in sync with the installed
Unicode data version (package unicode-ucd).
Currently it seems to be hard-coded (in newlib/libc/string/wcwidth.c);
it refers to Unicode 5.0 while installed Unicode data suggest 9.0 would
I can provide some scripts to generate the respective tables if desired.
If you can update the newlib files this way and send matching patches
to the newlib list, this would be highly appreciated.
Thomas, I just updated unicode-ucd to 10.0 for this purpose.
Oh, a number of other embedded tables. To make the tow* and isw*
functions more easily adaptable to Unicode updates, there will be some
revisions to do here. And the to* and is* ones (without 'w') even refer
to locales in a way I do not understand. Maybe I'll restrict my effort
to wcwidth first...
Oh, and, btw, the comment in wcwidth.c isn't quite correct. The
cwstate in newlib is on Unicode 5.2, see newlib/libc/ctype/towupper.c.
Problem reports: http://cygwin.com/problems.html
Unsubscribe info: http://cygwin.com/ml/#unsubscribe-simple