Web lists-archives.com

Re: Need help with multibyte UTF-8 characters




On 2017-12-04 18:23, Thomas Taylor wrote:
> I want to use multibyte UTF-8 characters in 64-bit Cygwin under Windows 7.  The
> "vim" editor running in mintty displays the two-byte characters correctly, but
> not the three- (and I assume four-) byte characters, which instead display as
> rectangular filled-in blocks.  The "less" program doesn't even display two-byte
> characters correctly, but instead displays them as <A1> to <FF>, depending on
> the character in question, in reverse color in the terminal window.  The "cat"
> program is even worse, replacing every two-byte character with a character that
> looks like three horizontal bars stacked one above the other.  I've read the
> "Internationalization" page in the Cygwin online manual, but am still baffled. 
> My LANG environment variable is set to "en_US.UTF-8".  Can anyone help?

Check mintty/Options/Text/Locale[en_US]/Character set[UTF-8]/Apply/Save.
Then exit and restart mintty and your shell.

To see what locale Cygwin thinks you are set to, run:
	$ locale

To check all Windows locale settings, you can run:
	$ for o in -s -u -n -i -f ''; do locale $o; done

The first two should show your Windows install locale, the rest should show
anything you have set up, or the same locale.
If any settings don't match LANG, you may have to set LC_ALL=$LANG to force the
setting.
I use the following profile stanza across all systems for consistency:

# Set user-defined locale - use regional settings if available
locale -fU > /dev/null 2>&1     \
        && LC_ALL=`locale -fU`  \
        || LC_ALL=`locale |	\
		/bin/sed '/^LANG=\|^LC_CTYPE=\|^LC_ALL=/{s///;h};$!d;x;s/"//g'`
export LC_ALL

-- 
Take care. Thanks, Brian Inglis, Calgary, Alberta, Canada

--
Problem reports:       http://cygwin.com/problems.html
FAQ:                   http://cygwin.com/faq/
Documentation:         http://cygwin.com/docs.html
Unsubscribe info:      http://cygwin.com/ml/#unsubscribe-simple