Closed
Description
I'd like to decide on the Julia structure that CSV.read() returns. Speak now or forever hold your peace (or write your own parser, I don't care). The current candidates are:
- Matrix{Any}: easy to allocate and work with, but we lose type information and header values
- Dict{String,Vector{T}}: column_name => column_values::Vector{T}, use sentinel values for nulls
- Dict{String,NullableArray{T}}: same as above, but we naturally represent nulls
- DataFrame with NullableArray columns: we hook into DataFrames (DF), but that also means we depend on DF
- Tables.Table: this would essentially be an in-memory or on-disk SQLite database (I plan on supporting this one regardless, but maybe it could be the only option?)
- Any other recommendations out there?
I'm leaning towards Dict{String,NullableArray{T}} as it's the most straightforward.
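To make the trade-off between the first two candidates concrete, here is a minimal sketch using only Base Julia. The sample values, the `NaN` sentinel, and the variable names are assumptions for illustration, not what CSV.read() actually does. The NullableArray and DataFrame variants are left out because they depend on external packages.

```julia
# Hypothetical parsed contents of a CSV with header "id,score"
# where the second row has no score.

# Candidate: Matrix{Any}. Every cell is typed Any, and the column
# names have to be carried around separately from the data.
header = ["id", "score"]
m = Any[1 2.5; 2 nothing]
eltype(m)  # Any

# Candidate: Dict{String,Vector{T}}. Each column keeps its own
# concrete element type and is looked up by name. A sentinel value
# stands in for the null (NaN is an assumed choice for this sketch).
d = Dict{String,Vector}(
    "id"    => [1, 2],
    "score" => [2.5, NaN],
)
ids = d["id"]                        # a Vector of Int
missing_score = isnan(d["score"][2]) # detect the sentinel by hand
```

The sentinel approach only works when the column type has a spare value to sacrifice, as `Float64` does with `NaN`. An integer or string column has no such obvious value, which is the gap the NullableArray candidate is meant to close.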
@johnmyleswhite @davidagold @StefanKarpinski @jiahao @RaviMohan