Replies: 3 comments 5 replies
-
Hi Jonathan You ask
I am not sure what you mean by "resulting psExtra"? I'll assume you mean the output of dist_calc(dist = "aitchison"). Correct me if I'm wrong? Now some possibly helpful pieces of info:
hope this helps? |
Beta Was this translation helpful? Give feedback.
-
Hi David, thanks for the quick reply. Yes, I meant the dist_calc ouptut. Since aitchison distances are computed on clr data I was acutally wondering why the dist_calc output contained sequence counts instead of clr-transformed table. About filtering taxa; I'm asking about this because I neeed to remove unassigned genera after agglomerating to that taxrank, and in some samples this represents up to 15% of my estimated sequence counts. Given that CLR uses the mean of sequence counts to center the ratios, filtering before transformation would yield otu_table values that are different to the CLR-transformed original table (for the remaining taxa). I'm not sure how large that effect would be, but my understanding is that data should be CLR-transformed before being subset, in order to preserve the ratio between taxa (subcompositional coherence, as far as I understand it). Intuitively, I would therefore first apply CLR, then measure Aitchison distance on that filtered table. This is why I am confused by the dist_calc documentation, which requires the "identity"-tranformed table as input and therefore precludes filtering post CLR but pre dist_calc. Let me know what you think ! cheers p.s. unassigned genera that were assigned a species might sound weird; the output from Sourmash taxonomic assignment contains a lot of unknown taxa where the species is assigned as e.g. Streptococcaceae_bacterium, therefore containing unassigned at the genus level. |
Beta Was this translation helpful? Give feedback.
-
Ok that sounds great thanks for your help! |
Beta Was this translation helpful? Give feedback.
-
Hi,
Following the recommendations in the doc for PERMANOVA using aitchison distances (option 2):
pass an object with untransformed (or 'identity' transformed) taxa to the data argument of dist_calc() and specify dist = "aitchison".
is it ok to subset the resulting psExtra object, i-e remove certain low abundance taxa, before running dist_permanova ? I thought it would, but I realise that this procedure doesnt rcclr transform the data. I thought aitchison distance calculations were based on log-ratio transformed compositions, so I'm a bit confused. I was counting on subcompositional coherence to subset, but my (very basic) understanding of the underlying maths of compositional data may be causing my confusion.
Looking forward to your input
Beta Was this translation helpful? Give feedback.
All reactions