You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
TERRA Reference team coordinate implementation of a basic genome seq pipeline (e.g. Process Reads —> Map & Assemble Reads —> Call Variants —> Annotate Variants) described in Proposed formats and databases for genomics data reference-data#19 and summarized in fig. below
Most of our 400 lines will be resequenced, but ~40 lines will be sequenced for de novo assembly, so the pipeline(s) will need to accommodate both of these pipeline paths.
Mike Gore will lead the system architecture
Questions to address:
Resources?
How much computing resources are required?
What are the data sizes?
sequencing coverage
number of samples, libraries, lanes
expected rate of data production over time
do we have sample datasets so that we can set up the pipeline prior to receiving data? (perhaps re-create Maize pipeline)
Overview
Questions to address:
Resources?
Division of Labor