Created attachment 207263 [details] Dirty fix Hello! Uxterm puts UTF-8 encoded string to WM_NAME instead of _NET_WM_NAME. If you have non-ASCII characters in window title, it will not be showed correctly by the window manager. If you define IsSetUtf8Title(xw) macro to 1, window titles will be showed correctly. So how it can be fixed correctly? Is it related to bug #229682?
Just to make sure I understand the issue, could you please provide me with a reproducible use case?
For example, you can run ncmpcpp in uxterm and play any song with Russian letters in its title(or just letters like ä or ü) and look at the title of the uxterm window. Most likely it will contain unreadable garbage (as in e16) or will be empty (as in StumpWM).
Answering to comment #1. Another example (you do not have to use ncmpcpp for this one). 1) Check that your locale ends with ".UTF-8". 2) Open uxterm and run python 3) Run a command print("\033]0;Übermensch\a") 4) Run another xterm and xprop | grep WM_NAME in it. 5) Click on xterm with python 6) Get garbage (WM_NAME(STRING) = "Ã?bermensch") in xprop output 7) Apply dirty fix and do the test again. You will get this time: vasily@vonbraun:~ % xprop | grep WM_NAME _NET_WM_NAME(UTF8_STRING) = "Übermensch" WM_NAME(STRING) = "Übermensch"
It's been a while (and perhaps some problem to fix), but the place to start is with the utf8Title and titleModes resources: https://invisible-island.net/xterm/manpage/xterm.html#VT100-Widget-Resources:utf8Title and https://invisible-island.net/xterm/manpage/xterm.html#VT100-Widget-Resources:titleModes (that is, it's configurable, defaulting to the standard behaviour).
That's from patch #210, by the way: https://invisible-island.net/xterm/xterm.log.html#xterm_210
So... if your script writes a UTF-8 string, what do you suppose should be in WM_NAME?
If you write UTF-8 encoded string in WM_NAME, then this property must have its type set to "UTF8_STRING", not "STRING", no matter how xterm is configured. Moreover EWMH suggests, that application should set _NET_WM_NAME to the title of the window in UTF-8 encoding. Then, _NET_WM_NAME has priority over WM_NAME. Setting WM_NAME of type STRING to UTF-8 encoded string is clearly an application error. Checked this on Debian and got the same result. So I, by mistake, assumed that this is FreeBSD specific. Maybe I should report a bug to xterm developers.
See here, for example: https://github.com/stumpwm/stumpwm/issues/641 I had StumpWM crashing over and over, until I figured out that that was uxterm setting wrongly encoded WM_NAME property (and I think user can not know about utf8Title resource). Imagine, that there is another window manager, that will crash because of this behavior. I do not think, that uxterm behaves correctly in this situation. The question for me is the following: should this behavior be fixed in FreeBSD ports (as a patch) or should I report it to xterm developers?
I am rereading xterm manual. > However, some users may wish to write a title string encoded in > UTF-8. The window manager is responsible for drawing window > titles. Some window managers (not all) support UTF-8 encoding > of window titles. Set this resource to "true" to allow UTF-8 > encoded title strings. That cancels the translation to UTF-8, > allowing UTF-8 strings to be displayed as is. If really not all window managers support UTF-8 window titles nowadays, then maybe all must be left as is, I really don't know.
xterm doesn't set WM_NAME directly: it sets _NET_WM_NAME directly. It uses the XtNtitle resource, tell libXt to set the title, in turn libXt calls XSetWMName, which sets WM_NAME. Interestingly, libXt has a titleEncoding resource which predates the UTF8_STRING format. If xterm were to set WM_NAME using UTF8_STRING, that would have to be by setting the X property directly, and probably confuse any application which uses the value. It would be useful to have a table showing what different window managers do with UTF-8 strings in WM_NAME and/or _NET_WM_NAME.
By the way, a UTF-8 string is (probably) a valid ISO-8859-1 string, so (unless xterm is told differently), it just passes the data off via libXt / XSetWMName to handle it.
Looking to see where it's actually setting the UTF8_STRING type, I don't see it directly, but it's based on a XUT8StringStyle value handled in src/xlibi18n/lcTxtPr.c This q/a appears relevant: https://stackoverflow.com/questions/7296302/x11-xm-name-type-is-utf-8-rather-than-string-utf8 saying that your locale settings trigger the behavior, telling libX11 + libXt to create the UTF8_STRING value.
In the past week, I wrote a test-driver for the titleModes resource and the title-stacking feature. At first glance, it might seem to "work" to just use the UTF-8 string as suggested. But there's a problem with that: some of the UTF-8 bytes lie inside the C1 controls, and an earlier stage suppresses those. If you happened to find one of those, you'd see a "?" in the title. There's a command-line option which disables most of the C1 control behavior, but that particular part of the code is not affected. Past that, this report deals with a special case: the UTF-8 can be translated into ISO-8859-1 (what STRING holds) without loss. Most of the UTF-8 possibilities can't do that. That's the reason for the titleModes, to allow for those possibilities (by hex-encoding). You don't see this with other terminals of course, since they emulate VT100 (or some subset of that), while xterm emulates VT220/VT420. The former are 7-bit controls only, while the latter are 7-bit or 8-bit controls.
As I can see this, my issue is not a bug, but just misconfigured xterm. Maybe you can help me. As I understand, I need to set X resource XTerm.vt100.titleModes to 2 to make xterm store window titles as UTF8_STRING. From manual: > titleModes (class TitleModes) > Tells xterm whether to accept or return window- and icon-labels > in ISO-8859-1 (the default) or UTF-8. Either can be encoded in > hexadecimal. The default for this resource is “0”. > 2 Set window/icon labels using UTF-8 (overrides utf8Title resource). I am unfamiliar with X resources. I create a file ~/.Xresources with the following content: > XTerm*vt100.titleModes: 2 and run xrdb -merge ~/.Xresources. But I still get the followin with xprop: > WM_NAME(STRING) = "Ð?оÑ?олÑ? и ШÑ?Ñ? - СмелÑ?Ñ?ак и веÑ?еÑ?" I want something like this: > _NET_WM_NAME(UTF8_STRING) = "Беломорс - Излучатель СВЧ" Now I use my dirty patch, but how can I do it with X resources? Can you give me an instruction? Thanks)
It's a little more complicated than that. The titleModes bit #2 for UTF-8 is only useful if you first hex-encode the string (bit #0 of titleModes). I'll add a screenshot from my test-driver to illustrate.
Created attachment 207592 [details] screenshot showing hex-encoded UTF-8 The title-string uses double-width codes from the FF00 code block.
I'm starting to look into whether I can (cleanly) provide a way using the "+k8" option to accept UTF-8 strings in a way that would let you more easily script this stuff.
The following .Xresources works for me: > xterm*vt100.utf8Title: true I tried different capital letters (like XTerm* or True), nothing else seems to be working. I was completely unfamiliar with X resources and the experience with them is very unpleasant. At least now all works as expected. IMHO, all this X resources stuff is completely redundant and adds more complexity, because we have good old command line arguments ;) Big thanks for the pointing me to the solution.
no problem... Actually that particular choice is a special case (which I'll try to explain in the manual page). Once #349 is ready, the preferred solution would use the allowC1Printable resource - see https://invisible-island.net/xterm/manpage/xterm.html#VT100-Widget-Resources:allowC1Printable (but as I commented earlier, that route doesn't work yet).
A commit references this bug: Author: ehaupt Date: Mon Sep 23 07:14:26 UTC 2019 New revision: 512613 URL: https://svnweb.freebsd.org/changeset/ports/512613 Log: - Update to 349 - Pacify portlint PR: 240393 (ref. in changelog) Changes: head/x11/xterm/Makefile head/x11/xterm/distinfo