to run DoubletFinder[1]
- If you have the cell annotation file for your counts data, use DoubletFinder_w.annot.sh
- If you don't have the cell annotation file * for your counts data, use DoubletFinder_wo.annot.sh
to run Scrublet[2]
use Scrublet.sh
[1] McGinnis, C. S., Murrow, L. M., & Gartner, Z. J. (2019). DoubletFinder: doublet detection in single-cell RNA sequencing data using artificial nearest neighbors. Cell systems, 8(4), 329-337.
[2] Wolock, S. L., Lopez, R., & Klein, A. M. (2019). Scrublet: computational identification of cell doublets in single-cell transcriptomic data. Cell systems, 8(4), 281-291.
The input "cell annotation" should be in the format of tab-delimited plain text file, looking like so:
Cell | CellType |
---|---|
Cell.1 | Celltype1 |
Cell.2 | Celltype2 |
Cell.3 | Celltype3 |
Cell.4 | Celltype1 |
... | ... |
Jeff:
- DoubletFinder looks for possible pairs of cells that were merged when preparing the 10x samples. Thus, you should run the doublet detection only on cells from a single sample. It doesn't really make sense to run on cells from different samples since they can't form doublets if they prepared together. If cells are from multiple samples, you can supply a "sample_file" that tells BETSY which sample each cell is from.
- You can leave sample_file out if all the cells in your gene expression file are from the same sample.
The input "sample_file" should be in the format of tab-delimited plain text file, looking like so:
Cell | Sample |
---|---|
Cell.1 | SampleA |
Cell.2 | SampleB |
Cell.3 | SampleC |
Cell.4 | SampleA |
... | ... |
Here's an example of the sample_file.