Clarify ownership and provenance for datasets listed in data.json #296
Description
As @JoshData and @lilybradley have mentioned, the data.json from HHS includes data aggregated from State governments as well (here's an example from ny.gov). Does Project Open Data already have a clear requirement that datasets should only be those produced by the agency? If not, should that requirement be better specified?
I'm sure there are a lot of gray areas here where a local government has produced some data in partnership with a federal agency, but if we can provide better guidance on those scenarios (eg the data.json should only include data hosted on a federal .gov) then that would be helpful.
If other sources can be included, we'll want a better method to identify the source of these datasets. If each federal dataset listed the programCode
and bureauCode
as required it would be easy to filter out those that don't have them, but we'd still want a consistent and detailed way to identify those other sources.
Activity