feat: Dataset#category #406

tristan-f-r · 2025-10-06T20:37:55Z

Depends on test: dataset #326

This short PR adds the Dataset#category parameter and parses and stores it without using it yet. I want to make sure we agree on category over categories: the latter, while more general, seems biologically useless.

Closes Reed-CompBio#237.

read-the-docs-community · 2025-10-06T20:38:58Z

Documentation build overview

📚 spras | 🛠️ Build #29861638 | 📁 Comparing 965f87c against latest (6e8cc04)

🔍 Preview build

Show files changed (8 files in total): 📝 5 modified | ➕ 0 added | ➖ 3 deleted

| File | Status |
| --- | --- |
| genindex.html | 📝 modified |
| htcondor.html | ➖ deleted |
| index.html | 📝 modified |
| py-modindex.html | 📝 modified |
| contributing/patching.html | ➖ deleted |
| fordevs/modules.html | 📝 modified |
| fordevs/spras.config.html | ➖ deleted |
| fordevs/spras.html | 📝 modified |

ntalluri · 2025-10-07T15:37:45Z

Could you update the config files so we can see what using the category label would look like? Also, is a user now required to add a dataset category to each dataset they put in a config?

tristan-f-r · 2025-10-07T16:00:09Z

The Optional tag in the schema means that category would be optional.

spras/config/schema.py
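To make the optional behavior concrete, here is a minimal sketch using only the standard library. The `DatasetEntry` class and `parse_dataset` helper are hypothetical stand-ins for illustration, not the actual contents of spras/config/schema.py, and the dataset labels and category value are placeholders.

```python
from dataclasses import dataclass
from typing import Optional


@dataclass
class DatasetEntry:
    """Hypothetical stand-in for a dataset entry in the config schema."""

    label: str
    # Optional with a None default: configs that omit `category` still parse.
    category: Optional[str] = None


def parse_dataset(raw: dict) -> DatasetEntry:
    """Build a DatasetEntry from a raw config mapping, storing category if present."""
    return DatasetEntry(label=raw["label"], category=raw.get("category"))


# A dataset that declares a category and one that leaves it out.
with_category = parse_dataset({"label": "data0", "category": "example-category"})
without_category = parse_dataset({"label": "data1"})
```

Under this assumption, existing configs would not need to change: a dataset without a category simply carries `None`.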

tristan-f-r · 2025-10-07T17:07:44Z

> Could you update the config files so we can see what using the category label would look like?

I'll avoid this in the main config.yaml until we have sample data for a new dataset. I'll add accompanying documentation once we have some way to have runs only run on specific dataset categories.

agitter · 2025-10-10T22:03:18Z

I'm missing what we want to support with this category attribute. We want to list many datasets in a single config file and then somewhere else say only run on datasets where category=X? That was part of #309, but I'm not seeing the advantages of doing that all within a single config file.

tristan-f-r · 2025-10-14T17:34:22Z

This would be useful for cross-dataset-category analysis (i.e. unified statistics that compare how well algorithms do on certain dataset categories versus others). It is no longer useful for parameter tuning.
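As a rough illustration of what such a unified statistic could look like, the sketch below averages a per-dataset score within each category. The function name, the score values, and the dataset and category names are all hypothetical placeholders, not SPRAS code or real results.

```python
from collections import defaultdict
from statistics import mean
from typing import Optional


def mean_score_by_category(
    scores: dict[str, float],
    categories: dict[str, Optional[str]],
) -> dict[str, float]:
    """Average one algorithm's per-dataset scores within each dataset category."""
    grouped = defaultdict(list)
    for dataset, score in scores.items():
        category = categories.get(dataset)
        if category is None:
            # Datasets without a category are left out of the comparison.
            continue
        grouped[category].append(score)
    return {category: mean(values) for category, values in grouped.items()}


# Placeholder scores for one algorithm on four datasets.
scores = {"d1": 0.5, "d2": 1.0, "d3": 0.25, "d4": 0.75}
categories = {"d1": "A", "d2": "A", "d3": "B", "d4": None}
per_category = mean_score_by_category(scores, categories)
```

With a single category per dataset, each score lands in exactly one group, which keeps the per-category averages easy to interpret.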

tristan-f-r and others added 17 commits June 3, 2025 16:59

fix: check if edge weights are between 0 and 1

f1cbe33

Closes Reed-CompBio#237.

fix: better err message on invalid ranges

f89e733

test: dataset

8e4016b

test: one for some normal dataset

f275e57

style: fmt

04632f1

docs: on datasetdict

e5ad211

chore: update naming for empty ds

e9ad2fd

test: empty sources, targets, node-prizes, or network

54be58e

test: empty headers

a6788a3

style: fmt

79f1682

refactor: drop unused dataset edge weight restriction

d0fc7e2

Merge branch 'umain' into weight-range

e778dbb

Merge branch 'main' into weight-range

482be1a

docs: better DatasetDict docstring

ae9d361

Merge branch 'main' into weight-range

e807ad4

Merge branch 'main' into weight-range

78c9565

feat: Dataset#category

43d746b

tristan-f-r added the blocked-by-other-pr and enhancement (New feature or request) labels Oct 6, 2025

chore: make Datset#category optional in schema

df9c295

tristan-f-r added the tuning (Workflow-spanning algorithm tuning) label Oct 6, 2025

tristan-f-r mentioned this pull request Oct 6, 2025

[config] spras-run-on for alg runs #309

Open

feat: store dataset categories

edb25c4

tristan-f-r mentioned this pull request Oct 7, 2025

Support two-stage parameter grid search #318

Open

ntalluri reviewed Oct 7, 2025

spras/config/schema.py (outdated)

Update spras/config/schema.py

965f87c

tristan-f-r removed the tuning (Workflow-spanning algorithm tuning) label Oct 9, 2025


3 participants
