Skip to content

Commit 3238a8b

Browse files
committed
wording changes
1 parent d1672c1 commit 3238a8b

File tree

1 file changed

+4
-3
lines changed

1 file changed

+4
-3
lines changed

tex/bgt.tex

+4-3
Original file line numberDiff line numberDiff line change
@@ -39,9 +39,10 @@ \section{Contact:} hengli@broadinstitute.org
3939
\section{Introduction}
4040

4141
VCF/BCF~\citep{Danecek:2011qy} is the primary format for storing and analyzing
42-
genotypes of multiple samples. It however has a few issues. Firstly, as a
43-
streaming format, VCF compresses all types of information together. Retrieving
44-
site annotations or the genotypes of a few samples usually requires to decode
42+
genotypes of multiple samples. It however has a few issues. Firstly, VCF
43+
is a site-oriented format. While accessing a site and all the associated
44+
genotypes is efficient with indexing, retrieving
45+
site annotations or the genotypes of a few samples always requires to decode
4546
the genotypes of all samples, which is unnecessarily expensive. Secondly, VCF
4647
does not take advantage of linkage disequilibrium (LD), while using this
4748
information can dramatically improve compression ratio~\citep{Durbin:2014yq}.

0 commit comments

Comments
 (0)