Added functionality for adding metadata using validate api #72
Conversation
- Introduced `process_score_metadata` function to standardize score metadata.
- Updated `validate` method to include optional logging of internally processed metadata.
- Adjusted `_remediate` method to accept the updated metadata structure.
also don't forget to bump the version and update release notes so that the changes can go live!
```python
metadata[metric] = score_data["score"]

# Add is_bad flags with standardized naming
is_bad_key = score_to_is_bad_key.get(metric, f"is_not_{metric}")
```
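The lookup-with-fallback pattern being discussed can be sketched in isolation as follows (a minimal illustration; the `score_to_is_bad_key` entries and metric names here are assumptions, not the project's actual mapping):

```python
# Hypothetical mapping from metric names to their standardized flag keys.
score_to_is_bad_key = {
    "trustworthiness": "is_not_trustworthy",
    "response_helpfulness": "is_not_helpful",
}

def is_bad_key_for(metric: str) -> str:
    """Return the flag key for a metric, falling back to a generated
    name for arbitrary user-defined evals (e.g. 'query_ease_customized')."""
    return score_to_is_bad_key.get(metric, f"is_not_{metric}")
```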
do we need to use `f"is_not_{metric}"` as a fallback, or can we define all expected keys in `score_to_is_bad_key`?
This is intended to support arbitrary evals passed into the Validator. For example, a user-defined eval like `query_ease_customized` would require a corresponding top-level key fallback. We automatically run and threshold that eval, so it needs to be handled explicitly. The fallback approach of using `f"is_not_{metric}"` feels awkward to me, which is probably the main reason I'm hesitant about relying on top-level keys to store the nested `is_bad` flags.
Would it be cleaner to restructure it like this instead?
```python
metadata = {
    "is_bad": {
        "trustworthiness": True,
        "response_helpfulness": False,
        ...
    }
}
```
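For concreteness, one way to produce that nested shape is a small helper like the following (a sketch only; the function name and the shape of the per-metric `score_data` dicts are hypothetical, not code from this PR):

```python
def nest_is_bad_flags(scores: dict) -> dict:
    """Group per-metric boolean flags under a single 'is_bad' key
    instead of scattering is_not_<metric> keys at the top level."""
    return {
        "is_bad": {
            metric: bool(score_data["is_bad"])
            for metric, score_data in scores.items()
        }
    }
```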
I do think that could be cleaner. Is the metadata already expected in a certain format by the frontend? If not, I'd structure it in the more straightforward, easier-to-understand format you're suggesting.
```python
    thresholds_dict[metric] = thresholds.get_threshold(metric)
metadata["thresholds"] = thresholds_dict

return metadata
```
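Pieced together, the diff fragments above suggest a metadata-assembly routine roughly like this (a sketch under assumptions, not the actual implementation: `thresholds` is modeled as a plain dict rather than the project's thresholds object, and the mapping contents are invented):

```python
def process_score_metadata(scores: dict, thresholds: dict) -> dict:
    """Flatten raw eval scores into metadata, add is_bad flags with
    standardized naming, and record the thresholds that were applied."""
    score_to_is_bad_key = {"trustworthiness": "is_not_trustworthy"}  # assumed mapping
    metadata = {}
    thresholds_dict = {}
    for metric, score_data in scores.items():
        metadata[metric] = score_data["score"]
        # Add is_bad flags with standardized naming (fallback for custom evals)
        is_bad_key = score_to_is_bad_key.get(metric, f"is_not_{metric}")
        metadata[is_bad_key] = score_data["score"] < thresholds[metric]
        thresholds_dict[metric] = thresholds[metric]
    metadata["thresholds"] = thresholds_dict
    return metadata
```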
do you plan to add `label` to the metadata that's passed to `project.query`?
Not long term, no. But I've added it in e0141b3 for now.
Key Info

What changed?

Two new arguments added to `validate()` in the Validator API:
- `metadata: dict` for user-provided metadata
- `log_results: bool` for including internally computed metadata, such as scores from TrustworthyRAG
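How the two new arguments might combine can be sketched with a stand-in function (hypothetical, not the real Validator method; the internal score dict is a placeholder for what TrustworthyRAG would compute):

```python
from typing import Optional

def validate(query: str, response: str, metadata: Optional[dict] = None,
             log_results: bool = False) -> dict:
    """Stand-in sketch: merge user-provided metadata with internally
    computed scores, including the latter only when log_results is True."""
    internal_scores = {"trustworthiness": 0.9}  # placeholder for TrustworthyRAG output
    combined = dict(metadata or {})
    if log_results:
        combined.update(internal_scores)
    return combined
```

Usage: calling `validate(query, response, metadata={"source": "ui"})` passes only the user metadata through, while `log_results=True` additionally logs the internally computed scores.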