From 6c6cdc6819441ac8bd75734683f87c99071c610e Mon Sep 17 00:00:00 2001
From: pichuan
Date: Tue, 27 Jul 2021 11:57:10 -0700
Subject: [PATCH] Update deeptrio training data.

PiperOrigin-RevId: 387162875
---
 docs/deeptrio-details-training-data.md | 21 +++++++++++++++------
 1 file changed, 15 insertions(+), 6 deletions(-)

diff --git a/docs/deeptrio-details-training-data.md b/docs/deeptrio-details-training-data.md
index bf4f7109..ce5667d3 100644
--- a/docs/deeptrio-details-training-data.md
+++ b/docs/deeptrio-details-training-data.md
@@ -1,13 +1,15 @@
 # DeepTrio training data
 
-### WGS models
+### WGS models[(1)](#vfootnote1)
 
 | version | Replicates | #examples |
 | ------------ | ---------------------------------- | ----------- |
 | Child model | | |
-| 1.1.0 | 4 HG001/NA12891/NA12892 trios<br>7 HG005/HG006/HG007 trios<br>3 HG002/HG004/HG004 trios[(1)](#vfootnote1)| 566,589,652 |
+| 1.1.0 | 4 HG001/NA12891/NA12892 trios<br>7 HG005/HG006/HG007 trios<br>3 HG002/HG004/HG004 trios| 566,589,652 |
+| 1.2.0 | (Same model as 1.1.0) | |
 | Parent model | | |
-| 1.1.0 | 7 HG005/HG006/HG007 trios<br>3 HG002/HG004/HG004 trios[(1)](#vfootnote1) | 315,847,934 |
+| 1.1.0 | 7 HG005/HG006/HG007 trios<br>3 HG002/HG004/HG004 trios | 315,847,934 |
+| 1.2.0 | (Same model as 1.1.0) | |
 
 ### WES models
 
@@ -15,17 +17,21 @@
 | ------------ | ----------------------------------------------- | ---------- |
 | Child model | | |
 | 1.1.0 | 27 HG001/NA12891/NA12892 trios<br>6 HG005/HG006/HG007 trios<br>7 HG002/HG004/HG004 trios | 18,002,596 |
+| 1.2.0 | (Same model as 1.1.0) | |
 | Parent model | | |
 | 1.1.0 | 6 HG005/HG006/HG007 trios<br>6 HG002/HG004/HG004 trios | 4,131,018 |
+| 1.2.0 | (Same model as 1.1.0) | |
 
-### PACBIO models
+### PACBIO models[(2)](#vfootnote2)[(3)](#vfootnote3)
 
 | version | Replicates | #examples |
 | ------------ | ---------------------------------- | ----------- |
 | Child model | | |
-| 1.1.0 | 1 HG005/HG006/HG007 trio<br>8 HG002/HG004/HG004 trios[(2)](#vfootnote2) | 397,610,700 |
+| 1.1.0 | 1 HG005/HG006/HG007 trio<br>8 HG002/HG004/HG004 trios | 397,610,700 |
+| 1.2.0 | 1 HG005/HG006/HG007 trio<br>8 HG002/HG004/HG004 trios | 406,893,180[(4)](#vfootnote4) |
 | Parent model | | |
-| 1.1.0 | 1 HG005/HG006/HG007 trio<br>8 HG002/HG004/HG004 trios[(3)](#vfootnote3) | 386,418,918 |
+| 1.1.0 | 1 HG005/HG006/HG007 trio<br>8 HG002/HG004/HG004 trios | 386,418,918 |
+| 1.2.0 | 1 HG005/HG006/HG007 trio<br>8 HG002/HG004/HG004 trios | 392,749,204[(4)](#vfootnote4) |
 
 (1): We include HG002/HG003/HG004 for training WGS
@@ -37,3 +43,6 @@ PacBio model training.
 
 (3): PacBio training data contains training examples with
 haplotag sorted images and unsorted images.
+
+(4): In v1.2.0, we updated the NIST truth versions we
+used for training.