Replies: 2 comments 7 replies
-
We discussed this at lunch the other day. We thought for a given project we might have a folder containing:
|
Beta Was this translation helpful? Give feedback.
-
Recent discussions we're thinking of having scripts for the processing from Raw to Model. And version-controlling the scripts. Depending how long processing takes, we might also save the "Tulipa-friendly" versions of data sources - which users can manipulate/combine for scenarios. But this runs the risk of users permanently editing these "Tulipa-friendly" files without documenting what they did. Data provenance is a hard problem. This (or something similar) might be a solution (thanks @suvayu!): |
Beta Was this translation helpful? Give feedback.
-
@suvayu asked the question in the TIO repo, so thought I'd post it here for discussion.
The thought is that while we're doing analyses, we'll have a tendency to save-off datasets, duplicating a lot of stuff.
Of course Julia+DuckDB can do database manipulations without duplicating data, but then we have to decide the workflow and when we WANT it to save.
I see creating a scenario as a building-up process. So when do we save and when do we leave something as a "run this script again" process?
Issues to think about:
@gnawin @datejada @nope82 @g-moralesespana
Beta Was this translation helpful? Give feedback.
All reactions