Posology Structuration Task

This repository holds the test data and the evaluation script for evaluating the predictions of model on the posology structuration task.

How to install

You can install using uv with:

uv sync

Otherwise, you can use other installation methods with the requirements.txt.

How to run

Provided that you have a JSON file with predictions (see data format below), you can run the following script:

uv run evaluate.py --predictions <YOUR FILE>

There is a --output_file option to store errors in a txt file.

Expected data format

The expected data format is a list of dictionaries containing two keys:

query: the same query as the corresponding sample in the test set
entities: the prediction of the model, it can be a JSON itself as in the test set or the raw text LLM prediction that the evaluation script will try to parse.

The structuration schema contains the following fields:

as_needed: dict (optional), contains information about whether the drug must be taken only under certain condition (e.g. if "as needed during headache" is written)
- as_needed: bool, True if the drug must be taken under certain condition
- as_needed_for: str, the specific condition (e.g. "during headache")
designation: str, the minimal string to which the dosage corresponds, it does not contain the whole instruction, but rather the unit and the type of intake (e.g. "1 comprimé"), this allows to differentiate several posology instructions when needed
max_dose_per_period: dict (optional), contains information about a potential maximum dosing
- dose: int, the maximum amount of units
- dose_unit: str (optional), the type of unit (e.g. "comprimé")
- code: str (optional), the SNOMED code of the dose_unit
quantity_and_rate: dict, contains the amount of intake units to take at once
- value: float, the amount of unit to take at once
- unit: str (optional), the type of unit
- code: str (optional), the SNOMED code for the unit
timing: dict, contains the the timing with which each intake must take place
- bounds_duration: dict (optional), the duration during which the treatment occurs
  - max_value: float (optional), the maximum amount of time units the treatment must last
  - value: float, the amount of time units the treatment must last (if max_value is not null, value reperesent the minimal value)
  - unit: str, the unit for measuring the treatment duration ("hours", "day", "week", or "month")
- bounds_duration_text: str, the text from which bounds_duration is inferred
- bounds_period: dict (optional), an alternative to bounds_duration where exact dates have been provided
  - start_date: str, start date in format YYYY/MM/DD
  - end_date: str, end date in format YYYY/MM/DD
- day_of_week: list (optional), list of weekday diminutives when the treatment must be taken (e.g. "mon" or "sat")
- frequency: int, an integer for the number of intake for the given period (e.g. the 2 in "twice a day")
- frequency_max: int (optional), an integer for the maximum amount fo intake for the given period (frequency is then the minimum)
- frequency_texts: list, a list of elements from the original text from which the timing was inferred
- number_repeats_allows: int (optional), the number of times the treatment can be renewed
- offset: str (optional), substring extracted from the query that indicates what is the offset of the intake ("30 minutes" in "30 minutes before meals")
- period: int, the number of time periods to observe before repeating the intake (6 in "3 tablets every 6 hours")
- period_unit: str, the time unit used for measuring the period between intakes ("hours" in "3 tablets every 6 hours")
- sequence: int (optional), when several posology instructions are given, sequence indicates their order
- time_of_day: list (optional), indicates precise intake hours in the format HH:mm:ss. time_of_day and when cannot be simultanously non-empty
- when: list (optional), indicates the moment of intake relative to the patient activity (e.g. "AC" for before a meal), the list of possible values is given in the FHIR standard

Name		Name	Last commit message	Last commit date
Latest commit History 6 Commits
data		data
.gitignore		.gitignore
.python-version		.python-version
README.md		README.md
annotation_processing.py		annotation_processing.py
evaluate.py		evaluate.py
pyproject.toml		pyproject.toml
requirements.txt		requirements.txt
stats.py		stats.py
uv.lock		uv.lock

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Posology Structuration Task

How to install

How to run

Expected data format

About

Uh oh!

Releases

Packages

Uh oh!

Languages

posos-tech/posology-structuration-task

Folders and files

Latest commit

History

Repository files navigation

Posology Structuration Task

How to install

How to run

Expected data format

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Languages

Packages