-
Notifications
You must be signed in to change notification settings - Fork 3
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
fatal error on encountering UTF-8 letters outside ASCII #5
Comments
I don't see any problems. It works.
May be problem in your libroxml build? |
Maybe. What is your libroxml version? I used latest version from their git repository, now I will test latest release (2.3.0). |
Or maybe there is some option to download libroxml package (there is no obvious source but...)? |
I tested with 2.3.0, without changes. final strace segment:
For now I have no idea what may be tested (except making sure we use the same libroxml). |
And there is possibility that different environments resulted in differences in what happens. I have 32 bit Ubuntu 14.04.4 LTS (Lubuntu distribution). |
Also, can you check whatever libroxml tests are failing for you - blunderer/libroxml#68 ? |
FWIW - that xml file may not be properly utf-8 encoded as that char exists as 1 byte in other encodings. Use a hex editor - not emacs or vim as they guess encoding - to look at that specific char's byte values. |
utf-8 is not supported by libroxml - see blunderer/libroxml#63 (comment) Potential solution is to replace libroxml by something that works on more than ASCII or to make horrible workaround like
|
In TODO list. |
Example of synthetic input, based on real causing data error:
to compare, following input differing by replacing "ą" with "footway" is not causing crash:
results in
Given that letters like żółćęśąźńŻÓŁĆĘŚĄŹŃ are appearing typically only in tag name that is not rendered in orienteering maps potential band-aid is to process input file and remove UTF-8 letters (obviously, proper solution would allow processing data also with letters beyond ASCII).
Note that such letters may also appear in user field.
The text was updated successfully, but these errors were encountered: