[DRAFT] FEAT Dataset Loading Utilities #1201

ValbuenaVC · 2025-11-21T23:27:56Z

Description

Adds pyrit.scenario.dataset and pyrit.scenario.dataset.ScenarioDatasetUtils to compartmentalize common dataset loading patterns for Scenarios.

Tests and Documentation

In progress.

romanlutz · 2025-11-22T16:49:53Z

pyrit/scenario/dataset/load_utils.py

+        return seed_prompts
+
+    @classmethod
+    def get_seed_dataset(cls, which: str) -> SeedDataset:


`which` is not a common parameter name. `name` seems preferable.

romanlutz · 2025-11-22T16:51:51Z

pyrit/scenario/dataset/load_utils.py

+    """
+    @classmethod
+    def seed_dataset_to_list_str(cls, dataset: Path) -> List[str]:
+        seed_prompts: List[str] = []


I'm kind of surprised we're using these as plain strings. It loses all the metadata. That means we lose harm categories, for example. How will one query for the results?
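
A minimal sketch of the concern. `SeedRecord` and its fields are hypothetical stand-ins for illustration, not PyRIT's actual seed prompt class. It shows that once the dataset is flattened to `List[str]`, filtering by harm category is no longer possible, whereas structured records still support it.

```python
from dataclasses import dataclass, field
from typing import List


@dataclass
class SeedRecord:
    """Hypothetical stand-in for a seed prompt with metadata (not PyRIT's class)."""

    value: str
    harm_categories: List[str] = field(default_factory=list)


records = [
    SeedRecord("prompt A", ["violence"]),
    SeedRecord("prompt B", ["fraud"]),
    SeedRecord("prompt C", ["violence", "fraud"]),
]

# Flattening to List[str] keeps only the text; the harm categories are gone.
as_strings: List[str] = [r.value for r in records]


def by_category(items: List[SeedRecord], category: str) -> List[SeedRecord]:
    """Return only the records tagged with the given harm category."""
    return [r for r in items if category in r.harm_categories]


# With structured records, querying by harm category is still possible.
violence_prompts = [r.value for r in by_category(records, "violence")]
```

Nothing equivalent to `by_category` can be written against `as_strings`, which is the querying problem raised above.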

rlundeen2 · 2025-12-02T18:31:18Z

pyrit/scenario/dataset/load_utils.py

+from pyrit.common.path import DATASETS_PATH, SCORER_CONFIG_PATH
+from pyrit.datasets.harmbench_dataset import fetch_harmbench_dataset
+
+


This is a good stab at the problem. But the route I'd prefer is to make everything really easy to put in the database (e.g. include an initializer that loads all the scenario datasets) and then have the scenarios just grab from the database.
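
A rough sketch of that shape, under stated assumptions: it uses an in-memory `sqlite3` database as a stand-in for PyRIT's memory, and `initialize_datasets`, `get_prompts`, the table layout, and the dataset names are all hypothetical, not existing PyRIT APIs.

```python
import sqlite3
from typing import Iterable, List, Tuple


def initialize_datasets(
    conn: sqlite3.Connection, rows: Iterable[Tuple[str, str, str]]
) -> None:
    """Hypothetical initializer: load (dataset_name, value, harm_category) rows once."""
    conn.execute(
        "CREATE TABLE IF NOT EXISTS seed_prompts "
        "(dataset_name TEXT, value TEXT, harm_category TEXT)"
    )
    conn.executemany("INSERT INTO seed_prompts VALUES (?, ?, ?)", rows)
    conn.commit()


def get_prompts(conn: sqlite3.Connection, dataset_name: str) -> List[Tuple[str, str]]:
    """What a scenario would call: fetch prompts by dataset name, metadata included."""
    cursor = conn.execute(
        "SELECT value, harm_category FROM seed_prompts "
        "WHERE dataset_name = ? ORDER BY rowid",
        (dataset_name,),
    )
    return cursor.fetchall()


# In-memory database as a stand-in for the real storage backend.
conn = sqlite3.connect(":memory:")
initialize_datasets(
    conn,
    [
        ("example_a", "prompt 1", "violence"),
        ("example_a", "prompt 2", "fraud"),
        ("example_b", "prompt 3", "violence"),
    ],
)

example_a_prompts = get_prompts(conn, "example_a")
```

With this split, scenarios never touch dataset files directly, and the metadata travels with each prompt instead of being dropped at load time.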

Victor Valbuena added 2 commits November 21, 2025 23:26

Basic implementation added

e3f23b5

Duplicate else clause removed

ee8927f
