Skip to content

Raise warning when zero inference sites provided. #683

Closed
@evolgenomics

Description

@evolgenomics

Hi team,

I was trying to run tsinfer on the 1000G GBR samples, focusing on a single 1Mbp region.

I imported using a VCF input copied off the tutorial. Essentially:

vcf = cyvcf2.VCF('GBR.variable.vcf.gz')
with tsinfer.SampleData(
path="GBR.samples", sequence_length=chromosome_length(vcf)
) as samples:
add_diploid_sites(vcf, samples)

Do the inference

ts = tsinfer.infer(samples)

ts.dump("GBR.trees")

$ python tsinfer_convert.py

Sample file created for 182 samples (182 individuals) with 262165 variable sites.
Inferred tree sequence: 1 trees over 63.02552 Mb (182 edges)

$ tskit info GBR.trees
╔═════════════════════════╗
║TreeSequence ║
╠═══════════════╤═════════╣
║Trees │ 1║
╟───────────────┼─────────╢
║Sequence Length│ 63025520║
╟───────────────┼─────────╢
║Time Units │ unknown║
╟───────────────┼─────────╢
║Sample Nodes │ 182║
╟───────────────┼─────────╢
║Total Size │237.6 MiB║
╚═══════════════╧═════════╝
╔═══════════╤═══════╤══════════╤════════════╗
║Table │Rows │Size │Has Metadata║
╠═══════════╪═══════╪══════════╪════════════╣
║Edges │ 182│ 5.7 KiB│ No║
╟───────────┼───────┼──────────┼────────────╢
║Individuals│ 182│ 5.4 KiB│ Yes║
╟───────────┼───────┼──────────┼────────────╢
║Migrations │ 0│ 8 Bytes│ No║
╟───────────┼───────┼──────────┼────────────╢
║Mutations │6314841│ 222.8 MiB│ No║
╟───────────┼───────┼──────────┼────────────╢
║Nodes │ 183│ 5.0 KiB│ No║
╟───────────┼───────┼──────────┼────────────╢
║Populations│ 0│ 8 Bytes│ No║
╟───────────┼───────┼──────────┼────────────╢
║Provenances│ 2│1008 Bytes│ No║
╟───────────┼───────┼──────────┼────────────╢
║Sites │ 262165│ 14.8 MiB│ Yes║
╚═══════════╧═══════╧══════════╧════════════╝

I tried doing the whole chromosome, or a single 1Mbp, and got the same results.

I put some of the files on an ftp server. Please see if you could replicate the issue

http://ftp.tuebingen.mpg.de/fml/ag-chan/tskit/

Thanks

Metadata

Metadata

Assignees

No one assigned

    Labels

    No labels
    No labels

    Type

    No type

    Projects

    No projects

    Milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions