Subclustering and using analysis of subgroups #750
Replies: 1 comment 2 replies
-
We get asked this question all the time and its important to note that the answer depends on your application and you should think about why you are subsetting the project. For example, if you have PBMC data and you want to delve into T cell subtypes, maybe you identify T cells using a broad dim red. and clustering of all PBMCs (called projAll) and create a subsetted project consisting only of T cells (call projT). In this new projT, the position in lower dimensional space has been dictated by all PBMCs and may not provide the most robust distinction between T cell subtypes. So you would want to re-perform dimensionality reduction and clustering using just the cells in projT. You should not re-run QC steps / doublet inference, that is done on a per sample basis. On the other hand, if you have subsetted your project for another reason - for example removing a small number of unwanted cells - you might not need to re-perform dim. reduction.
No, not necessarily. The UMAP coordinates stored in your subsetted project would correspond to the original UMAP and sometimes this is what you want to visualize.
Re-running addClusters without re-running LSI does not make sense to me.
This is entirely application specific. I encourage you to think about what this function is doing and why the groupBy parameter is important. |
Beta Was this translation helpful? Give feedback.
-
I have project proj_tumor with 22 different samples. getAvailableMatrices(proj_tumor) returns the matrices listed below after LSI, reduction, and other QC steps, then scoring mat risk, peak and tile were run as recommended on ArchR website.
"GeneScoreMatrix" "MotifMatrix" "PeakMatrix" "TileMatrix"
If I want to slice the ArchR project into multiple ways, one example is subset 10 out of the 22 samples and focus on differential analysis of these 10 samples:
idxSample <- BiocGenerics::which(proj_tumor$Samples %in% c(listThe10SampleNames))
cellsSample <- proj_tumor$cellNames[idxSample]
new_projec <-subsetArchRProject(
ArchRProj = proj_tumor,
cells = cellsSample,
outputDirectory = "ArchRSubset10Samples",
dropCells = TRUE,
logFile = NULL,
threads = getArchRThreads(),
force = FALSE
)
Question is:
Beta Was this translation helpful? Give feedback.
All reactions