Description
Question about the use of the dwc:datasetName field.
Scenario: different groups, as part of a formal national initiative, will each observe the same taxonomic group in their respective region. Some of the observed taxa will be collected and vouchered in various collections across this given nation.
General Puzzlement One: Each group publishes their own dataset to GBIF (observations and specimen records). How do these disparate datasets find each other? How can they be grouped "after-the-publishing step?"
- Given: each group will use the same standardized protocol, mapped to relevant standards, for both observing and collection data for this taxonomic group.
Specific Puzzlement Two: What if each group gave their own dataset the same name? Example: dwc:datasetName = Our [taxonomic group] Data
. Would this work? Say, for publishing to GBIF, does it matter if two datasets have the same datasetName?
Last Puzzlement: Is there a better strategy? What are the options (standard terms? extensions?) for ensuring (or at least improving the chances) these data can be aggregated and understood to be part of the same initiative? Would this be a use case for a term that conveyed a funding number (a grant number)? In that way, all datasets (and for that matter, data records) would be gatherable by that?
Insights, discussion, and options welcome. I'm guessing others have already solved this or at least grok current possible options as well as needs to make this a reasonable approach to a distributed national-level project.