Open
Description
If the dataset has sample_family_id
, sample_paternal_id
, sample_maternal_id
fields (e.g. from read_plink
), then we can use those to write family information in write_plink
. (See https://www.cog-genomics.org/plink/1.9/formats#fam)
Otherwise we should set FID to "0" (missing) and IID to sample_id
. The father and mother IDs should either be set to missing, or set from the parent_id
variable if it is present.
(Or can we not assume anything about paternal/materal ordering in the parent_id
variable? Thoughts @timothymillar, @jeromekelleher?)