Clarifying questions on published materials #2

Open
maxnest opened this issue Jan 13, 2023 · 6 comments

@maxnest

maxnest commented Jan 13, 2023

Hello! I am very interested in the approach developed by your team, but after reading the available materials, several questions remain:

  1. The results of correction of yeast data were given as an example, but do I understand correctly that there are no restrictions on working with data from multicellular organisms?
  2. Is MultiBaC applicable for correcting the batch effect in one data set (i.e., in one matrix)?
  3. Given the presented example, in the case of analysis of RNA-seq data, tables with normalized expression values should be submitted to the input (yeast data were after TMM normalization). Is it possible to use raw data (to further carry out analysis with edgeR or DESeq2)? What about Transcripts-Per-Million (TPM) values? Is there any additional data preparation required in each of these cases?
    Thanks!

@AnaConesa

Hi,
Thanks for your interest in MultiBaC. Please see my responses below.

  1. The results of correction of yeast data were given as an example, but do I understand correctly that there are no restrictions on working with data from multicellular organisms?

Correct. You can use MultiBaC with any organism.

  2. Is MultiBaC applicable for correcting the batch effect in one data set (i.e., in one matrix)?

MultiBaC is designed to correct batch effects across omics types to support multiomics data integration. You are expected to integrate several omics matrices, not just one. The simplest scenario is three different omics types distributed across two batches, with one of the omics types shared between the two batches.
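The minimal layout described above can be sketched as a small data-structure check. This is an illustration in Python, not MultiBaC code: the batch names, the omics labels `omics_B` and `omics_C`, and the helper functions are all hypothetical, and the matrices are left as placeholders.

```python
# Hypothetical layout: batch name -> {omics type -> data matrix}.
# The matrices are placeholders (None); only the design is inspected.

def shared_omics(batches):
    """Return the omics types that are present in every batch."""
    omics_sets = [set(omics) for omics in batches.values()]
    return set.intersection(*omics_sets) if omics_sets else set()


def describe_design(batches):
    """Summarize how many batches and omics types a design contains."""
    all_omics = set().union(*(set(omics) for omics in batches.values()))
    common = shared_omics(batches)
    return {
        "n_batches": len(batches),
        "n_omics": len(all_omics),
        "shared": sorted(common),
        "batch_specific": sorted(all_omics - common),
    }


# Three omics types in two batches, with RNA-seq measured in both.
design = {
    "batch_1": {"RNA-seq": None, "omics_B": None},
    "batch_2": {"RNA-seq": None, "omics_C": None},
}
summary = describe_design(design)
print(summary)
```

A design with an empty `shared` list has no omics type common to all batches, which is the situation the reply says the method is not meant for.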

  3. Given the presented example, in the case of analysis of RNA-seq data, tables with normalized expression values should be submitted to the input (yeast data were after TMM normalization). Is it possible to use raw data (to further carry out analysis with edgeR or DESeq2)? What about Transcripts-Per-Million (TPM) values? Is there any additional data preparation required in each of these cases?

Normalized data are required because the batch correction process returns corrected values that are no longer counts. Using TPM values is fine. If you see large distribution differences among samples within the same batch, then TMM normalization is also recommended; if that is not a problem, TPM alone is fine.
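For reference, a minimal sketch of the standard TPM calculation for a single sample, written in plain Python. The counts and gene lengths are made-up numbers, the function name `tpm` is hypothetical, and this is not part of the MultiBaC package; TMM normalization is not shown.

```python
def tpm(counts, lengths_kb):
    """Convert raw read counts for one sample to Transcripts-Per-Million.

    counts:     per-gene read counts
    lengths_kb: per-gene lengths in kilobases (same order as counts)
    """
    if len(counts) != len(lengths_kb):
        raise ValueError("counts and lengths_kb must have the same length")
    # Length-normalize first: reads per kilobase for each gene.
    rates = [c / l for c, l in zip(counts, lengths_kb)]
    total = sum(rates)
    if total == 0:
        raise ValueError("sample has no reads")
    # Rescale so that the values of the sample sum to one million.
    return [r / total * 1e6 for r in rates]


# Illustrative (made-up) data: three genes in one sample.
counts = [100, 200, 300]
lengths_kb = [1.0, 2.0, 3.0]
values = tpm(counts, lengths_kb)
print(values)
```

The resulting values are continuous rather than integer counts, which is why count-based tools such as edgeR or DESeq2 would still need the raw count table as their own input.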

Hope this helps

Ana

@maxnest

maxnest commented Jan 14, 2023

@AnaConesa, thank you for your detailed and quick response!
Do I understand correctly that if, for example, biological replicates are collected at different times within one study, ARSyNbac (the module I was referring to in my previous post) cannot be used to correct the batch effect in a single data set? I ask because I have already tried several approaches, first of all ComBat-seq, which removes some of the artificial differences but behaves extremely strangely on large data sets.

@AnaConesa

Hi

In this case you should use the ARSyN function of the MultiBaC package. You can run it with or without the batch information. It also works better when you have a multifactorial design.

@maxnest

maxnest commented Jan 16, 2023

@AnaConesa, thank you!
One more question: the ARSyN method was originally intended for the analysis of microarrays. Has it been adapted in the MultiBaC package to analyze RNA-seq data?

@AnaConesa

Yes, provided that you use normalized data, not count data.

@maxnest

maxnest commented Jan 16, 2023

Thank you!
