Optimal gene selection for cell type discrimination in single cell analyses

doi:10.1101/599654

Open AccessPosted ContentDOI

Optimal gene selection for cell type discrimination in single cell analyses

Bianca Dumitrascu, +3 more

- 04 Apr 2019 -

bioRxiv

- pp 599654

Chats0

TLDR

Given single cell RNA-seq data and a set of cellular labels to discriminate, scGene-Fit selects gene transcript markers that jointly optimize cell label recovery using label-aware compressive classification methods, resulting in a substantially more robust and less redundant set of markers.

Abstract:

Single-cell technologies characterize complex cell populations across multiple data modalities at un-precedented scale and resolution. Multi-omic data for single cell gene expression, in situ hybridization, or single cell chromatin states are increasingly available across diverse tissue types. When isolating specific cell types from a sample of disassociated cells or performing in situ sequencing in collections of heterogeneous cells, one challenging task is to select a small set of informative markers to identify and differentiate specific cell types or cell states as precisely as possible. Given single cell RNA-seq data and a set of cellular labels to discriminate, scGene-Fit selects gene transcript markers that jointly optimize cell label recovery using label-aware compressive classification methods, resulting in a substantially more robust and less redundant set of markers than existing methods. When applied to a data set given a hierarchy of cell type labels, the markers found by our method enable the recovery of the label hierarchy through a computationally efficient and principled optimization.

Citations

PDF

Open Access

More filters

Journal ArticleDOI

MarkerMap: nonlinear marker selection for single-cell studies

Nabeel Sarwar, +4 more

- 28 Jul 2022 -

arXiv.org

TL;DR: MarkerMap is introduced, a generative model for selecting minimal gene sets which are maximally informative of cell type origin and enable whole transcriptome reconstruction, and benchmark MarkerMap’s competitive performance against previously published approaches on real single cell gene expression data sets.

...read moreread less

Journal ArticleDOI

Leveraging Systems Immunology to Optimize Diagnosis and Treatment of Inborn Errors of Immunity

Andrea A. Mauracher, +1 more

- 18 Jul 2022 -

Frontiers in Systems Biology

TL;DR: How systems immunology can contribute to optimizing both diagnosis and treatment of IEI patients by focusing on identifying and quantifying key dysregulated pathways is explored, as well as providing a better understanding of basic immunology.

...read moreread less

Posted ContentDOI

InGene: Finding influential genes from embeddings of nonlinear dimension reduction techniques

Chitrita Goswami, +1 more

- 21 Jun 2023 -

bioRxiv

TL;DR: InGene as mentioned in this paper assigns an importance score to each expressed gene based on its contribution to the construction of the low-dimensional map, which can provide insight into the cellular heterogeneity of scRNA-seq data and accurately identify genes associated with cell-type populations or diseases.

...read moreread less

Challenges and opportunities to computationally deconvolve heterogeneous tissue with varying cell sizes using single cell RNA-sequencing datasets

Sean K. Maden, +5 more

- 10 May 2023 -

arXiv.org

TL;DR: In this article , the authors discuss several experimental and computational challenges in developing and implementing transcriptomics-based deconvolution approaches, especially those using a single cell/nuclei RNA-seq reference atlas, which are becoming rapidly available across many tissues.

...read moreread less

Posted ContentDOI

Gene panel design for spatial transcriptomics with prioritized gene sets

Mashrur Ahmed Yafi, +4 more

- 27 Sep 2022 -

bioRxiv

TL;DR: This work proposes scGIST– a deep neural network that designs sc-ST panels through constrained feature selection that outperformed alternative methods in terms of cell type detection accuracy and allows genes of interest to be prioritized for inclusion in the panel while staying within the size constraint.

...read moreread less

Collapse

References

PDF

Open Access

More filters

Journal ArticleDOI

Highly Parallel Genome-wide Expression Profiling of Individual Cells Using Nanoliter Droplets

Evan Z. Macosko, +26 more

- 21 May 2015 -

Cell

TL;DR: Drop-seq will accelerate biological discovery by enabling routine transcriptional profiling at single-cell resolution by separating them into nanoliter-sized aqueous droplets, associating a different barcode with each cell's RNAs, and sequencing them all together.

...read moreread less

Proceedings Article

Distance Metric Learning for Large Margin Nearest Neighbor Classification

Kilian Q. Weinberger, +2 more

TL;DR: In this article, a Mahanalobis distance metric for k-NN classification is trained with the goal that the k-nearest neighbors always belong to the same class while examples from different classes are separated by a large margin.

...read moreread less

Journal ArticleDOI

Massively parallel digital transcriptional profiling of single cells

Grace X.Y. Zheng, +34 more

- 16 Jan 2017 -

Nature Communications

TL;DR: A droplet-based system that enables 3′ mRNA counting of tens of thousands of single cells per sample is described and sequence variation in the transcriptome data is used to determine host and donor chimerism at single-cell resolution from bone marrow mononuclear cells isolated from transplant patients.

...read moreread less

Journal ArticleDOI

Distance Metric Learning for Large Margin Nearest Neighbor Classification

Kilian Q. Weinberger, +1 more

- 01 Dec 2009 -

Journal of Machine Learning Research

TL;DR: This paper shows how to learn a Mahalanobis distance metric for kNN classification from labeled examples in a globally integrated manner and finds that metrics trained in this way lead to significant improvements in kNN Classification.

...read moreread less

Highly Parallel Genome-wide Expression Profiling of Individual Cells Using Nanoliter Droplets

Evan Z. Macosko, +26 more

TL;DR: Drop-seq as discussed by the authors analyzes mRNA transcripts from thousands of individual cells simultaneously while remembering transcripts' cell of origin, and identifies 39 transcriptionally distinct cell populations, creating a molecular atlas of gene expression for known retinal cell classes and novel candidate cell subtypes.

...read moreread less