Skip to content

Commit

Permalink
Add assembly submission details
Browse files Browse the repository at this point in the history
  • Loading branch information
corneliusroemer authored and anna-parker committed Jun 28, 2024
1 parent 4c3ad01 commit 8455cb2
Showing 1 changed file with 62 additions and 1 deletion.
63 changes: 62 additions & 1 deletion docs/ena_submission.md
Original file line number Diff line number Diff line change
Expand Up @@ -207,8 +207,69 @@ Every submission must be associated with a study. The study can't be created usi
</RECEIPT>
```

and contains the accession number(s).
and contains the sample accession number(s).

## Submitting sequences (assembly)

[Docs](https://ena-docs.readthedocs.io/en/latest/submit/assembly/genome.html)

1. Create [manifest file](https://ena-docs.readthedocs.io/en/latest/submit/assembly/genome.html#manifest-files) for assembly metadata and file specification (manifest.tsv). The metadata rows are (relevant ones for us):

- STUDY: Study accession - mandatory
- SAMPLE: Sample accession - mandatory
- ASSEMBLYNAME: Unique assembly name, user-provided - mandatory
- ASSEMBLY_TYPE: ‘clone or isolate’ - mandatory
- COVERAGE: The estimated depth of sequencing coverage - mandatory
- PROGRAM: The assembly program - mandatory
- PLATFORM: The sequencing platform, or comma-separated list of platforms - mandatory

The file specification rows are:

- FASTA: sequences in fasta format
- CHROMOSOME_LIST: list of chromosomes

An example manifest.tsv:

```tsv
STUDY PRJEBxxxxxx
SAMPLE SAMEAxxxxxx
ASSEMBLYNAME Standard assembly
ASSEMBLY_TYPE isolate
COVERAGE 100
PROGRAM iVar
PLATFORM Illumina
FASTA genome.fasta.gz
CHROMOSOME_LIST chromosome_list.tsv
```

2. Create chromosome list file (chromosome_list.tsv)

```tsv
sequence_id chromosome_name chromosome_type
ha ha segmented
na na segmented
```

3. Create fasta file (genome.fasta) containing all the sequences in fasta format:

```fasta
>ha
ACGT
>na
ACGT
```

4. Submit the files using the webin-cli:

```bash
webin-cli -[validate|submit] \
-context genome \
-manifest manifest.tsv \
-username Webin-XXXXX \
-password YYYYYY
```

5. Save accession numbers (these will be returned by the webin-cli)

## Promises made to ENA

Expand Down

0 comments on commit 8455cb2

Please sign in to comment.