You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
The current code structure is not following best practices, as various tasks like summarization, sensitivity-test, sycophancy-test, and others are located within a single sample.py file. We propose reorganizing the code to separate out the evaluation metrics-related code into a dedicated metrics folder. This will improve code maintainability and modularity.
The text was updated successfully, but these errors were encountered:
Description
The current code structure is not following best practices, as various tasks like summarization, sensitivity-test, sycophancy-test, and others are located within a single
sample.py
file. We propose reorganizing the code to separate out the evaluation metrics-related code into a dedicatedmetrics
folder. This will improve code maintainability and modularity.The text was updated successfully, but these errors were encountered: