Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

feat: Add Pharmcat and Pangu #44

Closed
wants to merge 19 commits into from
Closed

feat: Add Pharmcat and Pangu #44

wants to merge 19 commits into from

Conversation

gconcepcion
Copy link
Contributor

@gconcepcion gconcepcion commented May 22, 2023

Add Pharmcat & pangu to humanWGS workflow

gconcepcion and others added 4 commits May 22, 2023 15:50
- added to inputs
  - still needs filepaths
- updated README
  - still needs descriptions for output
- added pharmcat_positions to ReferenceData
- updated main and sample_analysis

Validates with womtool and miniwdl
README.md Outdated
Comment on lines 253 to 264
| Array[File] | hificnv_copynum_bedgraphs | Copy number values calculated for each region | |
| Array[File] | hificnv_depth_bws | Bigwig file containing the depth measurements from HiFiCNV | |
| Array[File] | hificnv_maf_bws | Bigwig file containing the minor allele frequency measurements from DeepVariant, generated by HiFiCNV | |
| Array[File] | pangu_jsons | | |
| Array[File] | pangu_tsvs | | |
| Array[File] | fixed_pangu_tsvs | | |
| Array[File?] | pharmcat_missing_pgx_vcfs | | |
| Array[File] | pharmcat_preprocessed_filtered_vcfs | | |
| Array[File] | pharmcat_match_jsons | | |
| Array[File] | pharmcat_phenotype_jsons | | |
| Array[File] | pharmcat_report_htmls | | |
| Array[File] | pharmcat_report_jsons | | |
Copy link
Collaborator

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Ask @jrharting to provide brief documentation for each Pharmcat output file.

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

for pharmcat, is it ok to refer to the tool website? or do you need a short blurb here too?
https://pharmcat.org/examples/#output

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

The pangu_tsvs are intermediates for passing pangu outputs to pharmcat. these could be deleted after running and not returned.

The pangu json is a detailed description of the CYP2D6 star allele call. More info here : https://github.com/PacificBiosciences/pangu/tree/main#caller-outputs

Copy link
Collaborator

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

We'll link out to pangu and pharmcat documentation the first time they're mentioned in the outputs, but we do want to have a brief description. It could be as simple as "PGx report from Pharmcat, html format".

Copy link

@jrharting jrharting May 23, 2023

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

| Array[File] | pangu_jsons | Detailed CYP2D6 calling report containing calls and coverage| |
| Array[File] | pangu_tsvs |  CYP2D6 call for input to pharmCAT reports | |
| Array[File] | fixed_pangu_tsvs | (fixed)  CYP2D6 call for input to pharmCAT reports | |
| Array[File?] | pharmcat_missing_pgx_vcfs | PGx-cataloged variants not found in sample vcf | |
| Array[File] | pharmcat_preprocessed_filtered_vcfs | VCF with filled reference calls at all PGx variants with sample coverage above minimum | |
| Array[File] | pharmcat_match_jsons | JSON files with detailed information about how data in the sample VCF matches up with haplotype definitions| |
| Array[File] | pharmcat_phenotype_jsons | Match data and combined with outside call data with assigned function and metabolizer values| |
| Array[File] | pharmcat_report_htmls | HTML file with matched phenotype to information found in CPIC guideline data| |
| Array[File] | pharmcat_report_jsons |  JSON file with matched phenotype to information found in CPIC guideline data| |

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

i can fill in a different template if it would be easier ...

Comment on lines 61 to 62
"data": "<local_path_prefix>/dataset/GRCh38/pharmcat/#TODO",
"data_index": "<local_path_prefix>/dataset/GRCh38/pharmcat/#TODO"
Copy link
Collaborator

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Update all backend templates with paths to pharmcat_positions VCF/index.

Comment on lines 21 to 47
"workflows/wdl-common/wdl/tasks/pharmcat.wdl": {
"key": "workflows/wdl-common/wdl/tasks/pharmcat.wdl",
"name": "",
"description": "",
"tasks": {
"pharmcat_preprocess": {
"key": "pharmcat_preprocess",
"digest": "",
"tests": []
},
"filter_preprocessed_vcf": {
"key": "filter_preprocessed_vcf",
"digest": "",
"tests": []
},
"pangu_cyp2d6": {
"key": "pangu_cyp2d6",
"digest": "",
"tests": []
},
"run_pharmcat": {
"key": "run_pharmcat",
"digest": "",
"tests": []
}
}
},
Copy link
Collaborator

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Add Pharmcat tests.

Copy link
Collaborator

@williamrowell williamrowell left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Remaining changes:

  • remove pangu_tsvs and fixed_pangu_tsvs from sample_analysis.wdl and main.wdl outputs, as recommended by JH; also remove from README.md
  • update output documentation in README.md using notes from JH; link out to Pangu/Pharmcat documentation when necessary; we should cut/paste these updates into wdl-targetenrichment README.md as well
  • add entry for Pharmcat in "Tool versions and Docker images" table in README.md
  • add pharmcat_positions to data bundle, sync to Azure/AWS/GCP/Zenodo; update backend inputs.json templates with paths
  • run on HPC using HG005/6/7 Sequel II trio and HG002/3/4 Revio trio datasets
  • create minimal test dataset to be used in wdl-ci, upload to where test data lives in CoA blob; update wdl-ci.config.json with real tests

Base automatically changed from williamrowell/documentation to main May 25, 2023 19:11
@williamrowell williamrowell changed the base branch from main to develop May 26, 2023 05:33
Base automatically changed from develop to main July 28, 2023 16:37
@williamrowell williamrowell changed the base branch from main to develop October 4, 2023 21:42
- moved Pharcat subworkflow from tasks to workflows
- removed the internal scatter, so Pharmcat workflow runs once per sample
- broke out the pharmcat inputs and removed some inputs that were no longer used
- updated parameter_meta
@williamrowell
Copy link
Collaborator

I think we're good to go except for:

  • tests
  • potentially making pharmcat optional based on whether the sites are available

@williamrowell
Copy link
Collaborator

Uses this image now: PacificBiosciences/wdl-dockerfiles#32

Base automatically changed from develop to main November 16, 2023 00:18
@williamrowell williamrowell deleted the pharmcat branch December 15, 2023 00:24
@williamrowell williamrowell restored the pharmcat branch December 15, 2023 00:28
@williamrowell williamrowell reopened this Dec 15, 2023
@williamrowell williamrowell changed the title pharmcat added to workflow feat: Add Pharmcat and Pangu Dec 15, 2023
@williamrowell williamrowell changed the base branch from main to develop December 15, 2023 05:36
@williamrowell williamrowell deleted the pharmcat branch June 13, 2024 18:29
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

3 participants