A curated list of services, tools and documentation for CLARIN's Component Metadata Infrastructure
Web pages about CMDI:
- CMDI home page - An introduction to CMDI and usage description for metadata modellers, authors and repository managers.
- FAQs - Frequently asked questions about metadata in CLARIN.
- Video lecture: "CMDI explained" - Ten minute introduction to CMDI by Henk van den Heuvel.
- CMDI best practices guide - PDF with best practices for both modellers and authors, and a section on common approaches and problems.
- Best practices schematron rules - For automated best practice compliance in records and component definitions.
- CMDI and granularity - PDF with guidelines with respect to metadata hierarchies and levels of description.
Essential services that form the operational core of the Component Metadata Infrastructure:
- Component Registry - Registry and editor for CMD components and profiles.
- CLAVAS - CLARIN's vocabulary service used in CMDI.
- CCR - CLARIN's concept registry used in CMDI.
CMDI based services that are maintained and hosted centrally by CLARIN ERIC:
- VLO - The Virtual Language Observatory offers metadata based search and discovery for language resources and tools.
- Curation dashboard - Metadata quality control with up-to-date link checking.
- OAI-PMH harvest viewer - Latest harvest results for the VLO.
- CMD Toolkit - Contains the schemata, stylesheets and scripts that form the basis of CMDI.
- Component validator - A library ("CMDValidate") for validating CMD component specifications using XSD and Schematron.
- Instance validator - Java based utility for validating CMDI records.
- COMEDI - A web-based editor for CMDI records with storage and distribution facilities.
- CLARIAH CMDI Forms - A tweakable, web based editor for CMDI records (sources, runnable via Docker).
- CLARIN metadata conversion stylesheets - A repository with stylesheets for conversion from various metadata formats to CMDI.
- OAI harvest manager - Solution for harvesting a predefined set of endpoints, with support for flexible XSLT based processing pipelines and integration with the CLARIN centre registry.
- CLARIN OAI-PMH providers - List of endpoints of registered CLARIN centres that provide metadata.
- CLARIN DSpace - Adaptation of DSpace that supports CMDI and other CLARIN requirements and conventions, developed at the Institute of Formal and Applied Linguistics of the Charles University.
- TLA FLAT - Repository solution based on Islandora, developed at the Max Planck Institute for Psycholinguistics.
- XSD Schemas - Common schemas for CMDI 1.1.
- CMDI specification - Complete specification for CMDI 1.2.
- Summary of changes - Executive summary of changes in CMDI 1.2 compared to CMDI 1.1.
- ISO 24622 - Language resource management.
Component Metadata Infrastructure (CMDI)
- ISO 24622-1:2015 - Part 1: The Component Metadata Model.
- ISO 24622-2:2019 - Part 2: Component metadata specification language.
- XSD Schemas - Common schemas for CMDI 1.2.
- Component Metadata Infrastructure - Book chapter (Windhouwer, M., & Goosen, T. (2022). Component metadata infrastructure. CLARIN: The infrastructure for language resources, 191-222).
- Free digital copy - PDF version of the chapter (CC-BY).
- CMDI first aid kit - A printable booklet with helpful links for CMDI users.
Contributions welcome! Read the contribution guidelines first.