Most other subjects have highly heterogeneous data without semantics, which holds back the creation of knowledge. There is a pressing need to make knowledge about climate available to mitigate the effects of gaseous emissions.
An important resource is the series of reports from the UN’s IPCC, published about every five years. AR6, with 10,000 pages, was released in 2021-2022. #semanticClimate is a group of young Indian science students who are developing tools and community protocols to make IPCC AR6 semantic.
The UNFCCC publishes annual reports, mainly based on the UN's COP meetings. We have scraped and analysed most of these reports from the past 25+ years.
- convert the IPCC documents from PDF into (a) HTML and (b) XML
- extract terms and explore their use and meaning
- link terms to Wikidata and create AMI-dictionaries
- create new structures for navigation, search, and display
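The steps listed above could be sketched roughly as follows. This is a minimal illustration using only the Python standard library, not the project's actual tooling: the function names and the sample HTML are hypothetical, and the Wikidata lookup is only shown as a URL for the `wbsearchentities` search action, which is built here but never called.

```python
from collections import Counter
from html.parser import HTMLParser
from urllib.parse import urlencode


class TextExtractor(HTMLParser):
    """Collect the text content of an HTML fragment, dropping the tags."""

    def __init__(self):
        super().__init__()
        self.chunks = []

    def handle_data(self, data):
        self.chunks.append(data)


def html_to_text(html):
    """Return the text of an HTML fragment with whitespace normalised."""
    parser = TextExtractor()
    parser.feed(html)
    parser.close()
    return " ".join(" ".join(parser.chunks).split())


def count_terms(text, terms):
    """Count case-insensitive occurrences of each candidate term."""
    lowered = text.lower()
    return Counter({term: lowered.count(term.lower()) for term in terms})


def wikidata_search_url(term, language="en"):
    """Build (but do not fetch) a Wikidata entity-search URL for a term."""
    query = urlencode({
        "action": "wbsearchentities",
        "search": term,
        "language": language,
        "format": "json",
    })
    return "https://www.wikidata.org/w/api.php?" + query


# Hypothetical sample text, not taken from an IPCC report.
SAMPLE_HTML = (
    "<p>Carbon pricing can reduce <b>greenhouse gas</b> emissions. "
    "Carbon pricing is one policy option.</p>"
)

if __name__ == "__main__":
    text = html_to_text(SAMPLE_HTML)
    counts = count_terms(text, ["carbon pricing", "greenhouse gas"])
    for term, n in counts.items():
        print(term, n, wikidata_search_url(term))
```

A real dictionary entry would also need a human to choose among the candidate Wikidata items returned for each term, since a plain text search often matches several entities.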
We develop tools to liberate knowledge from locked PDFs, and we host events where everybody gets a chance to explore the content of these reports through our tools. Our Technical Strategy Page gives an overview of the tools.
Check out our Events page for details about upcoming hackathons we host and other events we are part of.
We are looking for volunteers/funders to:
- run more events
- develop the code (open an Issue/PR to get started)
- develop the content (start a Discussion thread at https://github.com/petermr/petermr/discussions/)
  - dictionaries
  - semantified chapters
Hackathon planned for 2022-10-24 to 2022-10-28
- see WG3 repository
We are using the GitHub Discussions tool to keep a narrative of our work. Currently we focus on individual chapters of the IPCC/AR6/WG3 report.