Replace escaped Unicode chars (`\u20ac`) in stored JSON?

While chasing [a Unicode-related bug](https://github.com/OpenTreeOfLife/phylesystem-api/pull/216), I realized that our stored JSON (on GitHub) has ugly escaped Unicode characters, e.g. [in this study](https://github.com/OpenTreeOfLife/phylesystem-0/commit/ebf015fd20c748a1a0628738bc43fa1ec9a6b272) and [this tree collection](https://github.com/OpenTreeOfLife/collections-0/commit/dcf89735869ec89f3179ea39bb57c37e97da6749).

These Unicode characters are handled gracefully in our indexing and web apps, but these escape sequences aren't strictly needed as we store all JSON as utf-8. Meanwhile, they're hideous and make it hard to read and search the stored files on GitHub. 

- Is this something we want or need to fix? 
- Would this fix apply to all document types (studies, tree collections, tax. amendments)?
- Are there other clients or use cases that would be broken by this change?

If we want to restore pretty Unicode for data saved in the future, it seems to all boil down to [a single call to `json.dump` in peyotl](https://github.com/OpenTreeOfLife/peyotl/blob/46fda92f92c7ba99d2cfbad54cfe6e209a45180c/peyotl/utility/input_output.py#L72) that's used for all JSON docs. If we add `ensure_ascii=False` to this call [as shown here](https://github.com/OpenTreeOfLife/peyotl/blob/46fda92f92c7ba99d2cfbad54cfe6e209a45180c/peyotl/utility/input_output.py#L94), it should save Unicode characters directly (sans escape) in phylesystem.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Replace escaped Unicode chars (`\u20ac`) in stored JSON? #173

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

Replace escaped Unicode chars (\u20ac) in stored JSON? #173

Description

Metadata

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

Issue actions

Replace escaped Unicode chars (`\u20ac`) in stored JSON? #173