Skip to content

Comments

Webapp file ingestion changes#206

Open
Mesh-ach wants to merge 7 commits intodevelopfrom
WebappFileIngestionChanges
Open

Webapp file ingestion changes#206
Mesh-ach wants to merge 7 commits intodevelopfrom
WebappFileIngestionChanges

Conversation

@Mesh-ach
Copy link
Collaborator

@Mesh-ach Mesh-ach commented Feb 19, 2026

Functionality to list available bronze datasets and upload selected CSVs into GCS bucket without manual uploads.

changes

  • Added GET /institutions/{inst_id}/input/bronze-datasets to list available .csv datasets in the institution’s Databricks bronze volume root (no subfolders), filtered by VALID_BRONZE_FILE_RE.
  • Added POST /institutions/{inst_id}/input/import-from-bronze to import a selected bronze dataset into the institution’s GCS bucket under unvalidated/ using the existing signed-upload URL flow (same overwrite checks as manual uploads).
  • Added unit tests covering the new endpoints and mocked Databricks/GCS interactions.

context

  • Today users must upload input datasets from the UI into GCS before they can run inference.
  • This change enables a “dataset select” workflow where datasets already present in Databricks bronze can be selected and pulled into the existing GCS validation/batching/inference pipeline, eliminating manual uploads while keeping downstream behaviour unchanged.

questions

No questions at this time


@Mesh-ach Mesh-ach requested a review from chapmanhk February 19, 2026 18:15
@Mesh-ach Mesh-ach marked this pull request as ready for review February 19, 2026 18:19
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

1 participant