Skip to content

Commit e6927b6

Browse files
authored
Update README.md
1 parent 0c755bb commit e6927b6

File tree

1 file changed

+37
-20
lines changed

1 file changed

+37
-20
lines changed

README.md

Lines changed: 37 additions & 20 deletions
Original file line numberDiff line numberDiff line change
@@ -10,15 +10,15 @@
1010

1111
# INSTALLATION #
1212

13-
**export2graphlan** is available in GitHub here: [export2graphlan repository](hhttps://github.com/SegataLab/export2graphlan) and can be obtained using:
13+
**export2graphlan** is available in GitHub here: [export2graphlan repository](https://github.com/SegataLab/export2graphlan) and can be obtained using:
1414

15-
1. [Bioconda](https://bioconda.github.io/)
15+
1. [Bioconda](https://bioconda.github.io/recipes/export2graphlan/README.html)
1616

1717
```
1818
$ conda install export2graphlan
1919
```
2020

21-
2. [Mercurial](http://mercurial.selenic.com/)
21+
2. Repository
2222

2323
```
2424
$ git clone git@github.com:SegataLab/export2graphlan.git
@@ -32,7 +32,6 @@ Adding the above line into the bash configuration file will make the path additi
3232

3333
# USAGE #
3434
```
35-
usage: export2graphlan.py [-h] [-i LEFSE_INPUT] [-o LEFSE_OUTPUT] -t TREE -a
3635
usage: export2graphlan.py [-h] [-i LEFSE_INPUT] [-o LEFSE_OUTPUT] -t TREE -a
3736
ANNOTATION [--annotations ANNOTATIONS]
3837
[--external_annotations EXTERNAL_ANNOTATIONS]
@@ -50,17 +49,18 @@ usage: export2graphlan.py [-h] [-i LEFSE_INPUT] [-o LEFSE_OUTPUT] -t TREE -a
5049
[--abundance_threshold ABUNDANCE_THRESHOLD]
5150
[--most_abundant MOST_ABUNDANT]
5251
[--least_biomarkers LEAST_BIOMARKERS]
53-
[--discard_otus] [--internal_levels] [--sep SEP]
52+
[--discard_otus] [--internal_levels]
53+
[--biomarkers2colors BIOMARKERS2COLORS] [--sep SEP]
5454
[--out_table OUT_TABLE] [--fname_row FNAME_ROW]
5555
[--sname_row SNAME_ROW]
5656
[--metadata_rows METADATA_ROWS]
5757
[--skip_rows SKIP_ROWS] [--sperc SPERC]
5858
[--fperc FPERC] [--stop STOP] [--ftop FTOP]
5959
[--def_na DEF_NA]
6060
61-
export2graphlan.py (ver. 0.17 of 21th August 2014). Convert MetaPhlAn, LEfSe,
61+
export2graphlan.py (ver. 0.2.1 of 27 October 2018). Convert MetaPhlAn, LEfSe,
6262
and/or HUMAnN output to GraPhlAn input format. Authors: Francesco Asnicar
63-
(francesco.asnicar@gmail.com)
63+
(f.asnicar@unitn.it)
6464
6565
optional arguments:
6666
-h, --help show this help message and exit
@@ -75,22 +75,28 @@ optional arguments:
7575
--background_levels BACKGROUND_LEVELS
7676
List which levels should be highlight with a shaded
7777
background. Use a comma separate values form, e.g.,
78-
--background_levels 1,2,3
78+
--background_levels 1,2,3. Default is None
7979
--background_clades BACKGROUND_CLADES
8080
Specify the clades that should be highlight with a
8181
shaded background. Use a comma separate values form
82-
and surround the string with " if it contains spaces.
82+
and surround the string with " if there are spaces.
8383
Example: --background_clades "Bacteria.Actinobacteria,
8484
Bacteria.Bacteroidetes.Bacteroidia,
85-
Bacteria.Firmicutes.Clostridia.Clostridiales"
85+
Bacteria.Firmicutes.Clostridia.Clostridiales". Default
86+
is None
8687
--background_colors BACKGROUND_COLORS
8788
Set the color to use for the shaded background. Colors
8889
can be either in RGB or HSV (using a semi-colon to
8990
separate values, surrounded with ()) format. Use a
9091
comma separate values form and surround the string
9192
with " if it contains spaces. Example:
9293
--background_colors "#29cc36, (150; 100; 100), (280;
93-
80; 88)"
94+
80; 88)". To use a fixed set of colors associated to a
95+
fixed set of clades, you can specify a mapping file in
96+
a tab-separated format, where the first column is the
97+
clade (using the same format as for the "--
98+
background_clades" param) and the second colum is the
99+
color associated. Default is None
94100
--title TITLE If specified set the title of the GraPhlAn plot.
95101
Surround the string with " if it contains spaces,
96102
e.g., --title "Title example"
@@ -120,25 +126,35 @@ optional arguments:
120126
--most_abundant MOST_ABUNDANT
121127
When only lefse_input is provided, you can specify how
122128
many clades highlight. Since the biomarkers are
123-
missing, they will be chosen from the most abundant
129+
missing, they will be chosen from the most abundant.
130+
Default is 10
124131
--least_biomarkers LEAST_BIOMARKERS
125132
When only lefse_input is provided, you can specify the
126133
minimum number of biomarkers to extract. The taxonomy
127134
is parsed, and the level is choosen in order to have
128-
at least the specified number of biomarkers
135+
at least the specified number of biomarkers. Default
136+
is 3
129137
--discard_otus If specified the OTU ids will be discarde from the
130-
taxonmy. Default behavior keep OTU ids in taxonomy
138+
taxonmy. Default is True, i.e. keep OTUs IDs in
139+
taxonomy
131140
--internal_levels If specified sum-up from leaf to root the abundances
132-
values. Default behavior do not sum-up abundances on
133-
the internal nodes
141+
values. Default is False, i.e. do not sum-up
142+
abundances on the internal nodes
143+
--biomarkers2colors BIOMARKERS2COLORS
144+
Mapping file that associates biomarkers to a specific
145+
color... I'll define later the specific format of this
146+
file!
134147
135148
input parameters:
136149
You need to provide at least one of the two arguments
137150
138151
-i LEFSE_INPUT, --lefse_input LEFSE_INPUT
139-
LEfSe input data
152+
LEfSe input data. A file that can be given to LEfSe
153+
for biomarkers analysis. It can be the result of a
154+
MetaPhlAn or HUMAnN analysis
140155
-o LEFSE_OUTPUT, --lefse_output LEFSE_OUTPUT
141-
LEfSe output result data
156+
LEfSe output result data. The result of LEfSe analysis
157+
performed on the lefse_input file
142158
143159
output parameters:
144160
-t TREE, --tree TREE Output filename where save the input tree for GraPhlAn
@@ -173,6 +189,7 @@ Input data matrix parameters:
173189
percentile specified by --fperc)
174190
--def_na DEF_NA Set the default value for missing values [default None
175191
which means no replacement]
192+
176193
```
177194

178195
*Note*: the last input parameters (``Input data matrix parameters``) refer to the **DataMatrix** class contained in the [hclust2](https://github.com/SegataLab/hclust2) repository.
@@ -195,5 +212,5 @@ If everything goes well you should find in the same folder of the example six ne
195212
* ``outtree.txt``: is the annotated tree produced by graphlan_annotate.py
196213
* ``tree.txt``: is the tree produced by the export2graphlan.py script
197214

198-
# CONTACTS #
199-
Francesco Asnicar ([f.asnicar@unitn.it](mailto:f.asnicar@unitn.it))
215+
# Support #
216+
If you should find problems in using **export2graphlan** please report them in [The bioBakery help forum](https://forum.biobakery.org/).

0 commit comments

Comments
 (0)