token length incorrect using --latin1 option

The template files assume UTF-8 in the `alex_scan_tkn` function, even when the --latin1 option is given.

In [templates/GenericTemplate.hs](https://github.com/simonmar/alex/blob/master/templates/GenericTemplate.hs#L178):

``` haskell
alex_scan_tkn user orig_input (if c < 0x80 || c >= 0xC0 then PLUS(len,ILIT(1)) else len)
```

This leads to an incorrect length being given to the lexer actions for tokens containing bytes between 0x80 and 0xC0


Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

token length incorrect using --latin1 option #63

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

token length incorrect using --latin1 option #63

Description

Metadata

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

Issue actions