
untokenize() does not round-trip for tab characters #128031

Closed as not planned

Description

@tomasr8

Bug report

Bug description:

Tab characters used as whitespace do not round-trip through tokenize.untokenize(). Here's an example:

import tokenize, io

source = "a +\tb"

tokens = list(tokenize.generate_tokens(io.StringIO(source).readline))
x = tokenize.untokenize(tokens)
print(x)
# a + b
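The whitespace itself never appears as a token: it survives only in each token's `line` attribute and in the start/end column offsets, which is why untokenize has to synthesize it. A quick way to see this (a diagnostic snippet, not part of the report's proposed fix):

```python
import io
import tokenize

source = "a +\tb"
tokens = list(tokenize.generate_tokens(io.StringIO(source).readline))

# No token string contains the tab; it only shows up in `line` and
# in the gap between one token's end column and the next one's start.
for tok in tokens:
    print(tokenize.tok_name[tok.type], repr(tok.string), tok.start, tok.end)
```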

Here, the tab is replaced with a regular space. Given that untokenize tries to match the source string exactly, I think we should fix this. We can do that by copying the whitespace characters from the source line rather than always emitting a normal space; the relevant line in Untokenizer.add_whitespace currently does:

self.tokens.append(" " * col_offset)
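The idea can be sketched as a standalone helper that rebuilds the source from the token stream, slicing inter-token whitespace out of each token's own `line` instead of padding with spaces. (`untokenize_exact` is a hypothetical illustration of the approach, not the actual patch sent upstream; it only handles straightforward sources.)

```python
import io
import tokenize

def untokenize_exact(source):
    """Rebuild `source` from its tokens, copying inter-token whitespace
    from each token's source line so tabs survive.
    Hypothetical sketch of the proposed approach, not the stdlib code."""
    out = []
    prev_row, prev_col = 1, 0
    for tok in tokenize.generate_tokens(io.StringIO(source).readline):
        _, string, (srow, scol), (erow, ecol), line = tok
        if srow > prev_row:
            # A NEWLINE/NL token already emitted the line break;
            # restart column tracking at the new line's start.
            prev_col = 0
        if scol > prev_col:
            out.append(line[prev_col:scol])  # spaces *or* tabs, verbatim
        out.append(string)
        prev_row, prev_col = erow, ecol
    return "".join(out)

print(repr(untokenize_exact("a +\tb\n")))  # the tab is preserved
```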

I'll send a fix in a moment :)

CPython versions tested on:

CPython main branch

Operating systems tested on:

Linux

Linked PRs

Metadata

Labels

stdlib (Python modules in the Lib dir), topic-parser, type-feature (A feature request or enhancement)
