gh-130273: Fix traceback color output with unicode characters by grayjk · Pull Request #142529 · python/cpython

grayjk · 2025-12-10T18:32:22Z

Account for the display width of Unicode characters so that colors and underlining in traceback output are correct.

Closes #130273

Issue: Traceback colors are shifted when the line contains wide unicode characters #130273
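
To illustrate the problem the issue describes: caret and color positions are computed as character offsets, while a terminal advances two columns for each wide character. The following is a minimal sketch, assuming that East Asian Width classes `W` and `F` occupy two columns and everything else occupies one. The names `display_width` and `caret_line` are hypothetical and are not the PR's API.

```python
import unicodedata

def display_width(text):
    """Approximate number of terminal columns occupied by text."""
    return sum(
        2 if unicodedata.east_asian_width(ch) in ("W", "F") else 1
        for ch in text
    )

def caret_line(line, start, end):
    """Build an underline for line[start:end] measured in columns, not characters."""
    padding = " " * display_width(line[:start])
    return padding + "^" * display_width(line[start:end])

line = 'print("你好" + 1)'
start = line.index("1")
print(line)
print(caret_line(line, start, start + 1))
```

Padding by `start` characters instead of by `display_width(line[:start])` would place the caret two columns too far left here, which is the kind of shift reported in the issue.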

python-cla-bot · 2025-12-10T18:32:27Z

All commit authors signed the Contributor License Agreement.

Misc/NEWS.d/next/Library/2025-12-10-15-15-09.gh-issue-130273.iCfiY5.rst

vstinner · 2025-12-11T17:10:36Z

@serhiy-storchaka: Here is a PR about text width and Unicode characters :-)

Closes python#130273

grayjk · 2026-01-28T23:08:03Z

updated to use @serhiy-storchaka's recently added unicodedata.iter_graphemes

grayjk · 2026-02-18T20:57:09Z

@pablogsal @hauntsaninja as recent reviewers of traceback.py, would you mind taking a look?

StanFromIreland · 2026-02-18T21:22:09Z

There are conflicts, please fix them.

grayjk · 2026-02-18T22:21:49Z

@StanFromIreland conflicts resolved

vstinner · 2026-02-19T10:39:47Z

Lib/traceback.py

-        2 if unicodedata.east_asian_width(char) in _WIDE_CHAR_SPECIFIERS else 1
-        for char in line[:offset]
-    )
+    from _pyrepl.utils import wlen


I would prefer to not depend on _pyrepl in the traceback module. I would prefer to move wlen() here, and modify _pyrepl.utils to get it from traceback.

I've moved wlen/str_width in commit 467656e and made them private (prefixed with _) to avoid putting them in traceback.__all__ but mypy isn't happy about that. Should I make them public?

Are # type: ignore comments okay in this case?

Alternatively, I could move wlen to a new support file with a name prefixed with _.

StanFromIreland · 2026-03-31T17:12:31Z

There are conflicts again I'm afraid, and mypy isn't happy either.

StanFromIreland · 2026-03-31T18:48:40Z

Lib/traceback.py

+    return 2
+
+
+ANSI_ESCAPE_SEQUENCE = re.compile(r"\x1b\[[ -@]*[A-~]")


It should also be private.

vstinner · 2026-04-01T10:22:35Z

Lib/traceback.py

+    import unicodedata
+    if ord(c) < 128:
+        return 1


There is no need to import unicodedata for ASCII characters:

Suggested change

-    import unicodedata
-    if ord(c) < 128:
-        return 1
+    if ord(c) < 128:
+        return 1
+    import unicodedata

vstinner · 2026-04-01T10:25:34Z

Lib/traceback.py

+def _zip_display_width(line, carets):
+    import unicodedata
+    carets = iter(carets)
+    for char in unicodedata.iter_graphemes(line):
+        char = str(char)
+        char_width = _display_width(char)
+        yield char, "".join(itertools.islice(carets, char_width))


Would it be possible to avoid the heavy unicodedata import for ASCII line?

Suggested change

-def _zip_display_width(line, carets):
-    import unicodedata
-    carets = iter(carets)
-    for char in unicodedata.iter_graphemes(line):
-        char = str(char)
-        char_width = _display_width(char)
-        yield char, "".join(itertools.islice(carets, char_width))
+def _zip_display_width(line, carets):
+    carets = iter(carets)
+    if line.isascii():
+        for char in line:
+            yield char, next(carets, "")
+    else:
+        import unicodedata
+        for char in unicodedata.iter_graphemes(line):
+            char = str(char)
+            char_width = _display_width(char)
+            yield char, "".join(itertools.islice(carets, char_width))

I'm not sure that my code is correct :-)
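
One way to gain confidence in the suggested fast path is to check that it agrees with the general path on ASCII input. The sketch below does that under two stated assumptions: it iterates code points rather than calling `unicodedata.iter_graphemes`, so it runs on interpreters that lack that function, and `_width` is a simplified stand-in for the PR's `_display_width`. The names `zip_general` and `zip_ascii` are hypothetical.

```python
import itertools
import unicodedata

def _width(ch):
    # Simplified stand-in for the PR's _display_width (assumption).
    return 2 if unicodedata.east_asian_width(ch) in ("W", "F") else 1

def zip_general(line, carets):
    """General path; iterates code points instead of grapheme clusters."""
    carets = iter(carets)
    for ch in line:
        yield ch, "".join(itertools.islice(carets, _width(ch)))

def zip_ascii(line, carets):
    """ASCII fast path from the suggestion: one caret per character."""
    carets = iter(carets)
    for ch in line:
        yield ch, next(carets, "")

# On ASCII input both paths should pair characters and carets identically,
# including when the caret string is shorter than the line.
for line, carets in [("x = 1 / 0", "    ~~^~~"), ("foo(bar)", "^^^"), ("", "")]:
    assert list(zip_ascii(line, carets)) == list(zip_general(line, carets))

wide = list(zip_general("你x", "^^~"))
print(wide)
```

For a width-1 character, `"".join(itertools.islice(carets, 1))` yields one caret or an empty string once the carets are exhausted, which matches `next(carets, "")`. This does not cover differences that grapheme clustering could introduce on non-ASCII lines.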

vstinner · 2026-04-01T10:26:28Z

Misc/NEWS.d/next/Library/2025-12-10-15-15-09.gh-issue-130273.iCfiY5.rst

@@ -0,0 +1 @@
+Fix traceback color output with unicode characters


Suggested change

-Fix traceback color output with unicode characters
+Fix traceback color output with Unicode characters.

bedevere-app bot mentioned this pull request Dec 10, 2025

Traceback colors are shifted when the line contains wide unicode characters #130273

Open

bedevere-app bot added the awaiting review label Dec 10, 2025

StanFromIreland reviewed Dec 10, 2025

grayjk added 3 commits January 28, 2026 17:45

Fix traceback color output with unicode characters

306a690

Closes python#130273

mv news blurb

8edad11

use unicodedata.iter_graphemes

7947033

grayjk force-pushed the issue-130273 branch from 59ca7f0 to 7947033 Compare January 28, 2026 23:05

grayjk requested a review from StanFromIreland January 28, 2026 23:19

Merge branch 'main' into issue-130273

e8d23cd

vstinner reviewed Feb 19, 2026

mv wlen/str_width to traceback

467656e

grayjk requested review from ambv, lysnikolaou and pablogsal as code owners February 19, 2026 16:57

StanFromIreland requested a review from serhiy-storchaka March 31, 2026 17:11

StanFromIreland reviewed Mar 31, 2026

vstinner reviewed Apr 1, 2026
