Article ; Online: Supervised capacity preserving mapping: a clustering guided visualization method for scRNA-seq data.
Bioinformatics (Oxford, England)
2022 Volume 38, Issue 9, Page(s) 2496–2503
Abstract: Motivation: The rapid development of scRNA-seq technologies enables us to explore the transcriptome at the cell level on a large scale. Recently, various computational methods have been developed to analyze the scRNAseq data, such as clustering and ... ...
Abstract | Motivation: The rapid development of scRNA-seq technologies enables us to explore the transcriptome at the cell level on a large scale. Recently, various computational methods have been developed to analyze the scRNAseq data, such as clustering and visualization. However, current visualization methods, including t-SNE and UMAP, are challenged by the limited accuracy of rendering the geometric relationship of populations with distinct functional states. Most visualization methods are unsupervised, leaving out information from the clustering results or given labels. This leads to the inaccurate depiction of the distances between the bona fide functional states. In particular, UMAP and t-SNE are not optimal to preserve the global geometric structure. They may result in a contradiction that clusters with near distance in the embedded dimensions are in fact further away in the original dimensions. Besides, UMAP and t-SNE cannot track the variance of clusters. Through the embedding of t-SNE and UMAP, the variance of a cluster is not only associated with the true variance but also is proportional to the sample size. Results: We present supCPM, a robust supervised visualization method, which separates different clusters, preserves the global structure and tracks the cluster variance. Compared with six visualization methods using synthetic and real datasets, supCPM shows improved performance than other methods in preserving the global geometric structure and data variance. Overall, supCPM provides an enhanced visualization pipeline to assist the interpretation of functional transition and accurately depict population segregation. Availability and implementation: The R package and source code are available at https://zenodo.org/record/5975977#.YgqR1PXMJjM. Supplementary information: Supplementary data are available at Bioinformatics online. |
---|---|
MeSH term(s) | Single-Cell Analysis/methods ; Sequence Analysis, RNA/methods ; Gene Expression Profiling/methods ; Algorithms ; Cluster Analysis |
Language | English |
Publishing date | 2022-02-24 |
Publishing country | England |
Document type | Journal Article ; Research Support, U.S. Gov't, Non-P.H.S. ; Research Support, N.I.H., Extramural |
ZDB-ID | 1422668-6 |
ISSN | 1367-4811 ; 1367-4803 |
ISSN (online) | 1367-4811 |
ISSN | 1367-4803 |
DOI | 10.1093/bioinformatics/btac131 |
Database | MEDical Literature Analysis and Retrieval System OnLINE |
More links
Kategorien
In stock of ZB MED Cologne/Königswinter
Zs.A 2374: Show issues | Location: Je nach Verfügbarkeit (siehe Angabe bei Bestand) bis Jg. 1994: Bestellungen von Artikeln über das Online-Bestellformular Jg. 1995 - 2021: Lesesall (2.OG) ab Jg. 2022: Lesesaal (EG) |
Order via subito
This service is chargeable due to the Delivery terms set by subito. Orders including an article and supplementary material will be classified as separate orders. In these cases, fees will be demanded for each order.