This repository has been archived by the owner on Oct 5, 2023. It is now read-only.

Display low-scoring examples controlling for some other feature #819

Open
neubig opened this issue Jun 5, 2023 · 2 comments

Comments

@neubig
Contributor

neubig commented Jun 5, 2023

One common use case that I have noticed is

"I want to see examples that are lower-scoring than other examples of similar length"

This matters because some metrics are sensitive to the length of the output, so if you select all low-scoring examples you might get only short examples, which is not very informative.

Generalizing this a bit, it could be interesting to think of a way to say "I want examples that are low-scoring, controlling for some other feature".

This is relatively easy when controlling for a categorical feature (you could just take all examples in the bottom nth percentile within each value of that feature), but it is conceptually more complicated when the feature is a float or integer.
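To make the idea concrete, here is a minimal sketch in plain Python. It is not an existing API of this project; the function name `low_scoring_controlled`, its parameters, and the dict-per-example layout are all assumptions made for illustration. The categorical case groups by the feature value directly. For a numeric feature, the sketch takes one possible approach, splitting the feature into quantile buckets and treating each bucket as a category; other approaches, such as fitting a trend and ranking by residual, would also be reasonable.

```python
from bisect import bisect_right
from collections import defaultdict
from statistics import quantiles


def low_scoring_controlled(examples, score_key, control_key, pct=10.0, n_buckets=None):
    """Return examples whose score is in the bottom `pct` percent of their group.

    Hypothetical helper, not part of any library.
    Groups are the distinct values of `control_key` (categorical case) or,
    when `n_buckets` (>= 2) is given, quantile buckets of `control_key`
    (numeric case).
    """
    if n_buckets is None:
        # Categorical control: the feature value itself is the group.
        def group_of(ex):
            return ex[control_key]
    else:
        # Numeric control: cut the feature into n_buckets quantile buckets.
        values = [ex[control_key] for ex in examples]
        edges = quantiles(values, n=n_buckets, method="inclusive")

        def group_of(ex):
            return bisect_right(edges, ex[control_key])

    groups = defaultdict(list)
    for ex in examples:
        groups[group_of(ex)].append(ex)

    selected = []
    for members in groups.values():
        ranked = sorted(members, key=lambda ex: ex[score_key])
        # Keep at least one example per group so small groups are represented.
        k = max(1, int(len(ranked) * pct / 100))
        selected.extend(ranked[:k])
    return selected
```

For example, `low_scoring_controlled(examples, "score", "length", pct=10, n_buckets=5)` would pick the lowest-scoring tenth within each of five length buckets, so long examples that score poorly for their length are surfaced alongside short ones. The bucket count is a trade-off: more buckets control for the feature more tightly but leave fewer examples in each group to compare against.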

@cabreraalex
Member

You could do this in the table view by filtering first and then sorting by the score value.
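Outside the UI, the same filter-then-sort workflow can be sketched in a few lines of plain Python. This is only an illustration and not the tool's API; the helper name `filter_then_sort` and the dict-per-example layout are assumptions.

```python
def filter_then_sort(examples, keep, score_key):
    """Keep the examples for which `keep` returns True, sorted by ascending score.

    Hypothetical helper: `keep` is any predicate over an example dict,
    e.g. a length-range check.
    """
    return sorted((ex for ex in examples if keep(ex)), key=lambda ex: ex[score_key])
```

A call such as `filter_then_sort(examples, lambda ex: 40 <= ex["length"] <= 60, "score")` would list the lowest-scoring examples among those of similar length first. The range has to be chosen by hand for each slice you want to inspect.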

@neubig
Contributor Author

neubig commented Jun 5, 2023

Nice. It'd also be kinda nice to be able to do this without filtering though. We could brainstorm the details offline sometime.
