Skip to content

Conversation

@jmarshall
Copy link
Contributor

Production-pipelines etc workflows generator-side implementation for SET-565.

Add _pack_attribute() to compress attribute values if necessary. (Suppress annoying ruff warnings about "incomprehensible" numeric constants.)

Pop 'sequencing_groups' from the incoming attributes as we may not replace the existing item later. Retain or [] to continue to guard against the unlikely event of the raw incoming value being None rather than [].

This compresses sequencing_groups (and is flexible enough for other attributes in future if necessary) for transport through the Hail database to the billing aggregator. populationgenomics/cpg-infrastructure#308 processes any sequencing_groups_gzip attributes it sees, and should be merged and deployed before this PR.

jmarshall added 2 commits June 9, 2025 14:06
Add _pack_attribute() to compress attribute values if necessary.
Suppress annoying ruff warnings about "incomprehensible" numeric constants.

Pop 'sequencing_groups' from the incoming attributes as we may not replace
the existing item later. Retain `or []` to continue to guard against the
unlikely event of the raw incoming value being None rather than [].
Copy link
Contributor

@milo-hyben milo-hyben left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Looking good to me!

@jmarshall jmarshall merged commit 5919879 into main Jun 19, 2025
6 of 7 checks passed
@jmarshall jmarshall deleted the compress-large-attributes branch June 19, 2025 08:41
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

2 participants