-
-
Notifications
You must be signed in to change notification settings - Fork 1.1k
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
DOC: from examples to tutorials #3564
Comments
In case it's helpful for inspiration, we took a similar approach with the MNE-Python package (neuro electrophysiology package): https://mne.tools/stable/index.html Maybe there are at least 3 levels in there, actually:
Does that make sense? |
@rabernat I'm going to be making a simple plasma physics-oriented xarray tutorial to give at a workshop next week. I was wondering - if we're uploading real data for these, how big can/should the files be? It might affect what dataset I use. |
https://www.divio.com/blog/documentation/ might be a useful reference for this? |
This is a good question. We need the tutorials to be able to run and build within a CI environment. That's the main constraint. For larger datasets, rather than storing them in github, a good approach is to create an archive on https://zenodo.org/ from which the data can be pulled. |
The article linked by @keewis is well worth reading in my opinion - it describes a similar breakdown of different types of documentation:
I think for xarray there is another type, like you suggest @choldgraf:
I personally think xarray in general has reference nailed, lots of good explanation, but is generally a bit weaker on tutorials and how-to guides, and doesn't have many examples of domain use-cases. I have some ideas for how-to's (maybe these should all go in a separate issue?):
So @rabernat for small datasets what might be an appropriate max filesize? I literally have no idea. ~1MB?
I'll look into that. |
Another note from MNE - we have a "datasets" sub-module that knows how to pull a few datasets from various online repositories (and in different structures). These store in a local folder (by default, For datasets that aren't gigantic it's a pretty nice system. https://mne.tools/stable/overview/datasets_index.html?highlight=datasets |
Hello everyone, is this issue still relevant?
|
Hi @apkrelling thanks for offering to help! I think we can still add more domain-specific examples for meteorology and oceanography. @rabernat had some plans for this, maybe he can describe them.
This would be totally great! |
This comment was marked as off-topic.
This comment was marked as off-topic.
This comment was marked as off-topic.
This comment was marked as off-topic.
We've started discussing how to reorganize the xarray-tutorial repository here: xarray-contrib/xarray-tutorial#53 . Comments are welcome! |
Hi folks, Just to mention that we've created a short tutorial on xarray which is meant as a gentle intro to folks coming from the malaria genetics field, who mostly have never heard of xarray before. We illustrate xarray first using outputs from a geostatistical model of how insecticide-treated bednets are used in Africa. We then give a couple of brief examples of how we use xarray for genomic data. There's video walkthroughs in French and English: https://anopheles-genomic-surveillance.github.io/workshop-5/module-1-xarray.html Please feel free to link to this in the xarray tutorial site if you'd like to :) |
@choldgraf seems like this page is down (https://predictablynoisy.com/xarray-explore-ieeg). Are these examples available elsewhere? |
Oops I think the url just changed https://chrisholdgraf.com/blog/2019/2019-10-22-xarray-neuro/ |
It's awesome to see the work we did at Scipy2019 finally hit the live docs! Thanks @keewis and @dcherian for pushing it through.
Now that we have these more detailed, realistic examples, let's think about how we can take our documentation to the next level. I think we need TUTORIALS. The examples are a good start. I think we can build on these to create tutorials which walk through most of xarray's core features with a domain-specific datasets. We could have different tutorials for different fields. For example.
Each tutorial would cover the same core elements (loading data, indexing, aligning, grouping, computations, plotting, etc.), but using a familiar, real dataset, rather than the generic, made-up ones in our current docs.
Yes, this would be a lot of work, but I think it would have a huge impact. Just raising here for discussion.
xref #2980 #2378 #3131
The text was updated successfully, but these errors were encountered: