A clustering method for small scRNA-seq data based on subspace and weighted distance
A clustering method for small scRNA-seq data based on subspace and weighted distance
Background Identifying the cell types using unsupervised methods is essential for scRNA-seq research. However, conventional similarity measures introduce challenges to single-cell data clustering because of the high dimensional, high noise, and high dropout. Methods We proposed a clustering method for small S cRNA-seq data based on S ubspace and W eighted D istance (SSWD), which follows the assumption that the sets of gene subspace composed of similar density-distributing genes can better distinguish cell groups. To accurately capture the intrinsic relationship among cells or genes, a new distance metric that combines Euclidean and Pearson distance through a weighting strategy was proposed. The relative Calinski-Harabasz (CH) index was used to estimate the cluster numbers instead of the CH index because it is comparable across degrees of freedom. Results We compared SSWD with seven prevailing methods on eight publicly scRNA-seq datasets. The experimental results show that the SSWD has better clustering accuracy and the partitioning ability of cell groups. SSWD can be downloaded at https://github.com/ningzilan/SSWD .
- Hunan Agricultural University China (People's Republic of)
- Yunnan Agricultural University China (People's Republic of)
- Henan Agricultural University China (People's Republic of)
Subspace, QH301-705.5, Bioinformatics, Sequence Analysis, RNA, Gene Expression Profiling, R, EP_dis, Marker gene, Single-Cell Gene Expression Analysis, scRNA-seq, Medicine, Cluster Analysis, Consensus clustering, Biology (General), Single-Cell Analysis
Subspace, QH301-705.5, Bioinformatics, Sequence Analysis, RNA, Gene Expression Profiling, R, EP_dis, Marker gene, Single-Cell Gene Expression Analysis, scRNA-seq, Medicine, Cluster Analysis, Consensus clustering, Biology (General), Single-Cell Analysis
4 Research products, page 1 of 1
- IsRelatedTo
- IsRelatedTo
- IsRelatedTo
- IsRelatedTo
citations This is an alternative to the "Influence" indicator, which also reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically).3 popularity This indicator reflects the "current" impact/attention (the "hype") of an article in the research community at large, based on the underlying citation network.Top 10% influence This indicator reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically).Average impulse This indicator reflects the initial momentum of an article directly after its publication, based on the underlying citation network.Average
