Goal: Repeat the ICD-11 E2E evaluation workflow on a real-world country-level dataset (rather than the curated sample dataset used in March), testing the pipeline at greater scale and with greater linguistic and clinical diversity. The results will inform preparation for the WHO ICD-11 connectathon in May.

Demo Driver: Joe

Steps:

1. Obtain and prepare the country-level input dataset (e.g. a national concept dictionary or local mapping dataset from a partner such as MSF, Arogya Sri Lanka, or similar)
2. Pre-process dataset as CSV for OCL Mapper input
3. Start a new project in OCL Mapper:
   - Target repo: WHO / ICD-11-WHO
   - Algorithms: CIEL Bridge, WHO ICD-11 automatch, LLM-as-terminologist
   - Reranker + AI Assistant enabled
4. Run Auto-match across all algorithms
5. Export results and evaluate:
   - Re-ranked score distributions
   - AI Assistant recommendation quality
   - Cases where algorithms agree/disagree
   - Human expert spot-check on high-confidence and low-confidence cases
6. Document findings in an evaluation summary
7. Identify issues to resolve before the May connectathon
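
For the "Export results and evaluate" step, a rough sketch of how the agree/disagree cases and the high/low-confidence spot-check buckets could be pulled out of an export. Everything specific here is an assumption for illustration, not the actual OCL Mapper export format: the column names (row_id, algorithm, target_code, rerank_score), the algorithm labels, the 0.85 / 0.40 thresholds, and the sample rows are all made up.

```python
import csv
import io
from collections import Counter, defaultdict

# Hypothetical export layout (NOT the real OCL Mapper format):
# one row per (source row, algorithm) holding that algorithm's top candidate.
SAMPLE_EXPORT = """row_id,algorithm,target_code,rerank_score
1,ciel_bridge,1A00,0.95
1,who_automatch,1A00,0.91
1,llm,1A00,0.88
2,ciel_bridge,CA40,0.62
2,who_automatch,CA40.0,0.58
2,llm,CA40,0.71
3,ciel_bridge,5A11,0.30
3,who_automatch,5A10,0.35
3,llm,BA00,0.28
"""


def load_candidates(csv_text):
    """Group top candidates by source row: {row_id: {algorithm: (code, score)}}."""
    by_row = defaultdict(dict)
    for rec in csv.DictReader(io.StringIO(csv_text)):
        by_row[rec["row_id"]][rec["algorithm"]] = (
            rec["target_code"],
            float(rec["rerank_score"]),
        )
    return dict(by_row)


def classify_agreement(candidates):
    """Label a row by how many algorithms proposed the same target code."""
    code_counts = Counter(code for code, _ in candidates.values())
    top_count = code_counts.most_common(1)[0][1]
    if top_count == len(candidates):
        return "all_agree"
    if top_count > 1:
        return "majority"
    return "all_disagree"


def confidence_bucket(candidates, high=0.85, low=0.40):
    """Bucket a row by its best re-ranked score (thresholds are placeholders)."""
    best = max(score for _, score in candidates.values())
    if best >= high:
        return "high"
    if best < low:
        return "low"
    return "medium"


def summarize(csv_text):
    """Return {row_id: (agreement_label, confidence_bucket)} for every row."""
    rows = load_candidates(csv_text)
    return {
        row_id: (classify_agreement(cands), confidence_bucket(cands))
        for row_id, cands in rows.items()
    }


if __name__ == "__main__":
    for row_id, labels in summarize(SAMPLE_EXPORT).items():
        print(row_id, *labels)
```

Rows labelled "high" or "low" would be the ones to hand to the human expert for the spot-check, and "all_disagree" rows are likely the most useful to look at before the connectathon. The thresholds should be set after looking at the actual re-ranked score distribution.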

[CIEL Bridge] ICD11 E2E evaluation workflow on country dataset

Real-world country content for ICD-11 transition

Provide a separate manual-mapping helper tool for validation (not the evaluation platform)
