Skip to content
This repository has been archived by the owner on Mar 10, 2023. It is now read-only.

CSV files should maintain a consistent text encoding #319

Open
dp-github opened this issue Mar 8, 2020 · 3 comments
Open

CSV files should maintain a consistent text encoding #319

dp-github opened this issue Mar 8, 2020 · 3 comments
Labels
data issue Discrepancies in the data enhancement New feature or request

Comments

@dp-github
Copy link

The CSV files in the daily reports directory use different text encodings.

$ file *csv
01-22-2020.csv: UTF-8 Unicode (with BOM) text, with CRLF line terminators
01-23-2020.csv: UTF-8 Unicode (with BOM) text, with CRLF line terminators
01-24-2020.csv: UTF-8 Unicode (with BOM) text, with CRLF line terminators
01-25-2020.csv: UTF-8 Unicode (with BOM) text, with CRLF line terminators
01-26-2020.csv: UTF-8 Unicode (with BOM) text, with CRLF line terminators
01-27-2020.csv: UTF-8 Unicode (with BOM) text, with CRLF line terminators
01-28-2020.csv: UTF-8 Unicode (with BOM) text, with CRLF line terminators
01-29-2020.csv: UTF-8 Unicode (with BOM) text, with CRLF line terminators
01-30-2020.csv: UTF-8 Unicode (with BOM) text, with CRLF line terminators
01-31-2020.csv: UTF-8 Unicode (with BOM) text, with CRLF line terminators
02-01-2020.csv: ASCII text, with CRLF line terminators
02-02-2020.csv: ASCII text, with CRLF line terminators
02-03-2020.csv: ASCII text, with CRLF line terminators
02-04-2020.csv: ASCII text, with CRLF line terminators
02-05-2020.csv: ASCII text, with CRLF line terminators
02-06-2020.csv: ASCII text, with CRLF line terminators
02-07-2020.csv: ASCII text, with CRLF line terminators
02-08-2020.csv: ASCII text, with CRLF line terminators
02-09-2020.csv: ASCII text, with CRLF line terminators
02-10-2020.csv: ASCII text, with CRLF line terminators
02-11-2020.csv: ASCII text, with CRLF line terminators
02-12-2020.csv: ASCII text, with CRLF line terminators
02-13-2020.csv: ASCII text, with CRLF line terminators
02-14-2020.csv: ASCII text
02-15-2020.csv: ASCII text
02-16-2020.csv: ASCII text
02-17-2020.csv: ASCII text, with CRLF line terminators
02-18-2020.csv: ASCII text, with CRLF line terminators
02-19-2020.csv: ASCII text
02-20-2020.csv: ASCII text
02-21-2020.csv: ASCII text
02-22-2020.csv: ASCII text
02-23-2020.csv: ASCII text
02-24-2020.csv: ASCII text
02-25-2020.csv: ASCII text
02-26-2020.csv: ASCII text
02-27-2020.csv: ASCII text
02-28-2020.csv: ASCII text
02-29-2020.csv: ASCII text
03-01-2020.csv: ASCII text
03-02-2020.csv: ASCII text
03-03-2020.csv: ASCII text
03-04-2020.csv: ASCII text
03-05-2020.csv: ASCII text
03-06-2020.csv: ASCII text
03-07-2020.csv: ASCII text
@MehranDHN
Copy link

#271

@CSSEGISandData CSSEGISandData added data issue Discrepancies in the data enhancement New feature or request labels Mar 10, 2020
@CSSEGISandData
Copy link
Owner

Thank you for the observation. We will look into this when building out future capabilities.

@SebastianHelm
Copy link

Which one is preferred? It doesn't seem like Unicode is necessary, since all text has been written using the Latin alphabet; all local names have been transcribed. Any prospect that there may arise a need to change that?

Sign up for free to subscribe to this conversation on GitHub. Already have an account? Sign in.
Labels
data issue Discrepancies in the data enhancement New feature or request
Projects
None yet
Development

No branches or pull requests

4 participants