-
Notifications
You must be signed in to change notification settings - Fork 1
Update faq.md Add Missing Data FAQ #15
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
base: master
Are you sure you want to change the base?
Changes from all commits
File filter
Filter by extension
Conversations
Jump to
Diff view
Diff view
There are no files selected for viewing
| Original file line number | Diff line number | Diff line change | ||||||||||||||
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
|
|
@@ -108,10 +108,29 @@ I have a negative control | |||||||||||||||
|
|
||||||||||||||||
| _There is no specific taxonomy terms for positive and negative controls_ | ||||||||||||||||
|
|
||||||||||||||||
| ## How do fill in negative controls? | ||||||||||||||||
| ## How do I fill in negative controls? | ||||||||||||||||
|
|
||||||||||||||||
| _TODO_ | ||||||||||||||||
|
|
||||||||||||||||
| ## How to fill in missing data | ||||||||||||||||
|
|
||||||||||||||||
| Follow the [INSDC Missing Value Reporting](https://www.insdc.org/technical-specifications/missing-value-reporting/) specifications. | ||||||||||||||||
| - Missing data is (meta)data that **can not** be reported. This can be due to various constraints, such as: | ||||||||||||||||
| - the data is truly missing, meaning it is not known and can not be known; | ||||||||||||||||
| - it can not be shared for privacy reasons; | ||||||||||||||||
| - it can not be shared due to data agreement restrictions; | ||||||||||||||||
| - it is not applicable to that particular field (e.g. it is a negative control and the field does not apply) | ||||||||||||||||
| - Missing data should only be reported for **mandatory** fields, not for **recommended** or **optional** fields. For the latter two, simply leave the field blank if the (meta)data are missing. | ||||||||||||||||
|
Comment on lines
+122
to
+123
Contributor
There was a problem hiding this comment. Choose a reason for hiding this commentThe reason will be displayed to describe this comment to others. Learn more.
Suggested change
Contributor
There was a problem hiding this comment. Choose a reason for hiding this commentThe reason will be displayed to describe this comment to others. Learn more. I think the mandatory fields sentence should go at the beginning of this section in a preamble paragraph and say more along the lines of: Fields in MIxS that are mandatory always require something filled into the given metadata entry. If you do not have this information, you must encode this using the specific 'missing information' categories as below. While optional fields in MIxS can be left blank, if you have a specific reason the information will never be able to be reported (see examples below), then it is good to use these missing data categories there also. |
||||||||||||||||
| - There are three levels at which you can report missing data, with an increasing amount of specificity for each: _**top level**_, _**lower level**_, and _**reporting level**_. Be as specific/granular as possible when reporting missing values. The _top level_ indicates that the data are missing, while the _lower-_ and _reporting_ levels give a reason (from the [controlled vocabulary](https://www.insdc.org/technical-specifications/missing-value-reporting/)) why the data are missing. | ||||||||||||||||
|
Contributor
There was a problem hiding this comment. Choose a reason for hiding this commentThe reason will be displayed to describe this comment to others. Learn more. I would break up the three level descriptions into bullets |
||||||||||||||||
| - Always report the _top level_ (i.e. "not applicable" or "missing") even when reporting at the more granular levels, in which case separate the _top level_ and _lower/reporting level_ terms with ": ". | ||||||||||||||||
| - If using terms from the most granular level (_reporting level_), then exclude the _lower level_ term, as each _reporting level_ term is a "child" of the _lower level_, which can then be inferred based on the [table](https://www.insdc.org/technical-specifications/missing-value-reporting/). | ||||||||||||||||
|
|
||||||||||||||||
|
Contributor
There was a problem hiding this comment. Choose a reason for hiding this commentThe reason will be displayed to describe this comment to others. Learn more.
Suggested change
|
||||||||||||||||
| ### Examples | ||||||||||||||||
|
Contributor
There was a problem hiding this comment. Choose a reason for hiding this commentThe reason will be displayed to describe this comment to others. Learn more.
Suggested change
|
||||||||||||||||
| - missing | ||||||||||||||||
|
Contributor
There was a problem hiding this comment. Choose a reason for hiding this commentThe reason will be displayed to describe this comment to others. Learn more. And let's provide context for each example:
|
||||||||||||||||
| - missing: not collected | ||||||||||||||||
| - not applicable: control sample | ||||||||||||||||
| - missing: endangered species | ||||||||||||||||
|
|
||||||||||||||||
| ## How to fill in sample age information | ||||||||||||||||
|
|
||||||||||||||||
| _TODO_ | ||||||||||||||||
|
|
||||||||||||||||
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.