feat: move clinvar to new pattern #5100

bpblanken · 2025-10-27T22:01:50Z

No description provided.

hanars · 2025-10-29T14:32:42Z

seqr/management/commands/reload_clinvar_all_variants.py

                        if existing_version_obj and ClinvarAllVariantsSnvIndel.objects.filter(version=existing_version_obj.version).exists():
                            clinvar_run_sql(
-                                Template(f"ALTER TABLE `$reference_genome/$dataset_type/clinvar_all_variants` DROP PARTITION '{new_version}';")
+                                Template(f"ALTER TABLE `$reference_genome/$dataset_type/reference_data/clinvar/all_variants` DROP PARTITION '{new_version}';")


You can actually get the table name off the model's meta field, which would make this code more resilient: just use ClinvarAllVariantsSnvIndel._meta.db_table. It would also remove the need to template out variables.
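A minimal sketch of that suggestion, under stated assumptions. Django models expose their table name as `Model._meta.db_table`; to keep the example runnable without Django, the class below is only a stand-in for the real model, and its table name and the `drop_partition_sql` helper are hypothetical.

```python
from types import SimpleNamespace


class ClinvarAllVariantsSnvIndel:
    """Stand-in for the real Django model (assumption, not the seqr class)."""

    # A real Django model provides `_meta.db_table` automatically.
    # The table name here is a hypothetical value for illustration only.
    _meta = SimpleNamespace(
        db_table="GRCh38/SNV_INDEL/reference_data/clinvar/all_variants"
    )


def drop_partition_sql(model, version: str) -> str:
    """Build the DROP PARTITION statement from the model's own table name."""
    return f"ALTER TABLE `{model._meta.db_table}` DROP PARTITION '{version}';"


sql = drop_partition_sql(ClinvarAllVariantsSnvIndel, "2025-10-27")
```

Because the table name comes from the model, a later rename only has to change the model definition, and the per-genome template variables are no longer needed for this statement.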

hanars

This looks good. I just want to confirm that, after the migration runs, the new materialized views will automatically populate with the correct clinvar data without needing to run the reload clinvar command or a 'SYSTEM REFRESH VIEW'.

hanars · 2025-11-07T20:28:47Z

seqr/management/commands/reload_clinvar_all_variants.py

            ('GRCh38', 'SNV_INDEL'),
            ('GRCh38', 'MITO'),
        ]:
            cursor.execute(sql.substitute(reference_genome=reference_genome, dataset_type=dataset_type))


Not particularly related to this PR, but this file should really be in the clickhouse_search/management/commands folder, not the seqr commands folder.

bpblanken · 2025-11-10T16:11:09Z

clinvar/all_variants and the clinvar Join table are just renames, so they will have all of the data, leaving seqr search untouched. The seqr_variants table will be empty, though, until either the job runs or someone intervenes manually.

I think this is fine, but there is a potential edge case where the two materialized views we create here run concurrently upon creation; the second one (the seqr_variants -> clinvar Join) will have an empty source table until the first completes, resulting in an empty Join table.

One hacky resolution would be to add a RunPython that runs 'SYSTEM REFRESH VIEW' in the correct order at the end of this migration.

hanars · 2025-11-10T16:40:43Z

Having an empty clinvar join table at the end of this is not really an acceptable situation, so we will need this to deterministically end with all the views populated. I feel like the best way to do this would be to add a RunSQL after all the CLINVAR_ALL_TO_SEQR_MV commands that waits until those views are populated before running the CLINVAR_SEQR_TO_SEARCH_MV commands.
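The ordering concern could be sketched roughly as follows, assuming the views are ClickHouse refreshable materialized views, for which `SYSTEM REFRESH VIEW` and `SYSTEM WAIT VIEW` are available. The helper name, the view names, and the recording cursor are hypothetical and only illustrate the sequencing, not the migration's actual code.

```python
from typing import List


def refresh_views_in_order(cursor, view_names: List[str]) -> None:
    """Refresh each view, blocking until it finishes before starting the next."""
    for view in view_names:
        # Trigger an immediate refresh of this view.
        cursor.execute(f"SYSTEM REFRESH VIEW `{view}`")
        # Block until the running refresh completes, so that the next view
        # reads from a populated source table.
        cursor.execute(f"SYSTEM WAIT VIEW `{view}`")


class RecordingCursor:
    """Test double that records statements instead of contacting ClickHouse."""

    def __init__(self) -> None:
        self.statements: List[str] = []

    def execute(self, sql: str) -> None:
        self.statements.append(sql)


# Hypothetical view names: upstream view first, dependent view second.
cursor = RecordingCursor()
refresh_views_in_order(cursor, ["clinvar_all_to_seqr_mv", "clinvar_seqr_to_search_mv"])
```

Issuing the wait immediately after each refresh keeps the two views from running concurrently, which is the case that would leave the Join table empty.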

bpblanken · 2025-11-10T17:45:38Z

clickhouse_search/migrations/0020_clinvar_new_reference_data.py

+FROM `$reference_genome/$dataset_type/reference_data/clinvar/seqr_variants`
+""")
+
+def build_materialized_view(reference_genome: str, dataset_type: str, materialized_view: str):


Added a mechanism for refreshing and waiting for each view.

clinvar

70df895

bpblanken changed the title ~~clinvar~~ feat: move clinvar to new pattern Oct 27, 2025

bpblanken added 4 commits October 27, 2025 18:04

bad line

88bf41d

merge

ab20680

table names

0eddfb0

Merge branch 'dev' of github.com:broadinstitute/seqr into benb/clinvar

73ee09a

bpblanken marked this pull request as ready for review October 28, 2025 19:39

bpblanken requested a review from hanars October 28, 2025 19:39

hanars reviewed Oct 29, 2025

bpblanken added 5 commits November 4, 2025 16:04

Merge branch 'dev' of github.com:broadinstitute/seqr into benb/clinvar

5027a99

better templating

6cb3b70

fix migration

60d39c6

Merge branch 'dev' of github.com:broadinstitute/seqr into benb/clinvar

98bbd04

quotes

0cb6581

bpblanken mentioned this pull request Nov 7, 2025

chore: update pipeline to support new reference data pattern broadinstitute/seqr-loading-pipelines#1179

Merged

bpblanken requested a review from hanars November 7, 2025 16:23

hanars reviewed Nov 7, 2025

bpblanken added 3 commits November 10, 2025 11:54

Merge branch 'dev' of github.com:broadinstitute/seqr into benb/clinvar

e470ffe

add wait

8c527d2

tweaks

d90822a

bpblanken requested a review from hanars November 10, 2025 17:45

bpblanken commented Nov 10, 2025

hanars approved these changes Nov 10, 2025

bpblanken merged commit ee311c0 into dev Nov 10, 2025
8 checks passed

bpblanken added a commit that referenced this pull request Nov 10, 2025

Revert "feat: move clinvar to new pattern (#5100)"

a0f792e

This reverts commit ee311c0.

bpblanken deleted the benb/clinvar branch November 10, 2025 19:35
