Open
Description
@DomNelson and I were just discussing the possibility of adding support for multiple chromosomes to tskit. One possibility which seems like it might be a reasonably smooth path forward is the following:
- Add a
chromosome
table with an ID,name
,length
andmetadata
. - Add a
chromosome
column to the edge table. Within a chromosome, coordinates must be from 0 to the chromosome's length. - Either deprecate the
sequence_length
property (probably best), or make it equal to the sum of the length of all chromosomes.
For things like trees()
we could add an optional chromosome
argument. If the tree sequence contains multiple chromosomes which would raise an error if it's not specified. For tree sequences with a single chromosome, things would continue to work as now.
Any thoughts @petrelharp, @hyanwong, @bhaller, @molpopgen?