Skip to content

Conversation

@nathanthorpe
Copy link
Contributor

@nathanthorpe nathanthorpe commented Jul 24, 2025

  • Add file validate method which compares the checksum of the local file with a Cirro file.
  • Replace deprecated MD5 with CRC64, upload files using that.

Example:

portal = DataPortal()
dataset = portal.get_dataset(project_id, dataset_id)

file = dataset.get_file('nucleus_boundaries.parquet')
local_file = Path('~/Downloads/nucleus_boundaries.parquet').expanduser()

file.validate(local_file)

@nathanthorpe nathanthorpe marked this pull request as draft July 24, 2025 21:30
miniwdl = {version="^1.13.0", optional=true}
referencing = "^0.36.2"
nf-core = {version = "^3.3.2", optional=true}
awscrt = "==0.27.4"
Copy link
Contributor Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Thoughts on if this should be optional?

Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

I'd see uploading data via the SDK/client as a core functionalty, warranting keeping this dependency as installed by default.

@nathanthorpe nathanthorpe marked this pull request as ready for review July 25, 2025 14:11
@sonarqubecloud
Copy link

@nathanthorpe nathanthorpe requested a review from a team July 25, 2025 14:19
@nathanthorpe nathanthorpe merged commit 995a98a into main Jul 29, 2025
10 checks passed
@nathanthorpe nathanthorpe deleted the checksum-validate branch July 29, 2025 00:03
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

3 participants