Goal: Repeat the ICD-11 E2E evaluation workflow on a real-world country-level dataset (rather than the curated sample dataset used in March), testing the pipeline at greater scale and linguistic/clinical diversity. Results inform the WHO ICD-11 connectathon preparation in May.
Demo Driver: Joe
Steps:
- Obtain and prepare the country-level input dataset (e.g. a national concept dictionary or local mapping dataset from a partner — MSF, Arogya Sri Lanka, or similar)
- Pre-process dataset as CSV for OCL Mapper input
- Start a new project in OCL Mapper:
- Target repo:
WHO / ICD-11-WHO - Algorithms: CIEL Bridge, WHO ICD-11 automatch, LLM-as-terminologist
- Reranker + AI Assistant enabled
- Target repo:
- Run Auto-match across all algorithms
- Export results and evaluate:
- Re-ranked score distributions
- AI Assistant recommendation quality
- Cases where algorithms agree/disagree
- Human expert spot-check on high-confidence and low-confidence cases
- Document findings in an evaluation summary
- Identify issues to resolve before the May connectathon
List view
0 issues of 2 selected
- Status: Open.#2387 In OpenConceptLab/ocl_issues;
- Status: Open.#2389 In OpenConceptLab/ocl_issues;