Optimizing expression quantitative trait locus mapping workflows for single-cell studies

doi:10.1186/S13059-021-02407-X

Open Access, Journal Article

Anna S E Cuomo, +7 more

- 24 Jun 2021 -

Genome Biology

- Vol. 22, Iss: 1, pp 188-188

TLDR

In this article, the role of different normalization and aggregation strategies, covariate adjustment techniques, and multiple testing correction methods to optimize single-cell expression quantitative trait locus (sc-eQTL) mapping is evaluated.

Abstract:

Background: Single-cell RNA sequencing (scRNA-seq) has enabled the unbiased, high-throughput quantification of gene expression specific to cell types and states. With the cost of scRNA-seq decreasing and techniques for sample multiplexing improving, population-scale scRNA-seq, and thus single-cell expression quantitative trait locus (sc-eQTL) mapping, is increasingly feasible. Mapping of sc-eQTL provides additional resolution to study the regulatory role of common genetic variants on gene expression across a plethora of cell types and states and promises to improve our understanding of genetic regulation across tissues in both health and disease.

Results: While previously established methods for bulk eQTL mapping can, in principle, be applied to sc-eQTL mapping, there are a number of open questions about how best to process scRNA-seq data and adapt bulk methods to optimize sc-eQTL mapping. Here, we evaluate the role of different normalization and aggregation strategies, covariate adjustment techniques, and multiple testing correction methods to establish best practice guidelines. We use both real and simulated datasets across single-cell technologies to systematically assess the impact of these different statistical approaches.

Conclusion: We provide recommendations for future single-cell eQTL studies that can yield up to twice as many eQTL discoveries as default approaches ported from bulk studies.
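To make two of the steps named in the abstract concrete, the sketch below shows one possible aggregation strategy (averaging per-cell expression within each donor) and one multiple testing correction (Benjamini-Hochberg adjustment, the procedure from the first entry in the reference list). It is a minimal illustration under stated assumptions, not the authors' pipeline: the function names `aggregate_by_donor` and `benjamini_hochberg`, the input layout, and the choice of mean aggregation are all hypothetical, and this page does not say which strategies the paper ultimately recommends.

```python
from collections import defaultdict
from statistics import mean


def aggregate_by_donor(cells):
    """Average per-cell expression values within each donor.

    `cells` is an iterable of (donor_id, expression_value) pairs for one
    gene. This input layout is an assumption made for illustration.
    """
    per_donor = defaultdict(list)
    for donor, value in cells:
        per_donor[donor].append(value)
    return {donor: mean(values) for donor, values in per_donor.items()}


def benjamini_hochberg(pvalues):
    """Return Benjamini-Hochberg adjusted p-values in the input order."""
    m = len(pvalues)
    # Indices of the p-values, sorted from smallest to largest p-value.
    order = sorted(range(m), key=lambda i: pvalues[i])
    adjusted = [0.0] * m
    running_min = 1.0
    # Walk from the largest p-value down so that adjusted values stay
    # monotone in the rank of the raw p-values.
    for rank in range(m, 0, -1):
        i = order[rank - 1]
        running_min = min(running_min, pvalues[i] * m / rank)
        adjusted[i] = running_min
    return adjusted
```

In a workflow of this kind, the donor-level values would feed a per-gene association test against genotype, and the resulting p-values would then be passed through the adjustment before calling discoveries at a chosen false discovery rate.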

Citations

Journal Article

scDALI: modeling allelic heterogeneity in single cells reveals context-specific genetic regulation

Tobias J. A. J. Heinen, +5 more

- 06 Jan 2022 -

Genome Biology

TL;DR: In this article, a computational framework that integrates information on cellular states with allelic quantifications of single-cell sequencing data to characterize cell-state-specific genetic effects is proposed.

Journal Article

Interpretable generative deep learning: an illustration with single cell gene expression data

Martin Treppner, +2 more

- 06 Jan 2022 -

Human genetics

TL;DR: In this article, the authors provide an overview of the use of deep generative models for single-cell gene expression data, and demonstrate the utility of such methods in the context of single-genome data.

Posted Content

Expression QTLs in single-cell sequencing data

Ariel D. H. Gewirtz, +2 more

- 15 Aug 2022 -

bioRxiv

TL;DR: The results demonstrate the ability of scTBLDA to identify genes involved in cell-type specific regulatory processes associated with SNPs in single-cell data.

Journal Article

Molecular quantitative trait loci

François Aguet, +6 more

- 25 Jan 2023 -

Nature Reviews Methods Primers

References

Journal Article

Controlling the false discovery rate: a practical and powerful approach to multiple testing

Yoav Benjamini, +1 more

- 01 Jan 1995 -

Journal of the Royal Statistical Society...

TL;DR: In this paper, a different approach to problems of multiple significance testing is presented, which calls for controlling the expected proportion of falsely rejected hypotheses (the false discovery rate), which is equivalent to the FWER when all hypotheses are true but is smaller otherwise.

Journal Article

STAR: ultrafast universal RNA-seq aligner

Alexander Dobin, +8 more

- 01 Jan 2013 -

Bioinformatics

TL;DR: The Spliced Transcripts Alignment to a Reference (STAR) software, based on a previously undescribed RNA-seq alignment algorithm that uses sequential maximum mappable seed search in uncompressed suffix arrays followed by a seed clustering and stitching procedure, outperforms other aligners by a factor of >50 in mapping speed.

Journal Article

edgeR: a Bioconductor package for differential expression analysis of digital gene expression data.

Mark D. Robinson, +2 more

- 01 Jan 2010 -

Bioinformatics

TL;DR: edgeR is a Bioconductor software package for examining differential expression of replicated count data. It uses an overdispersed Poisson model to account for both biological and technical variability, and empirical Bayes methods to moderate the degree of overdispersion across transcripts, improving the reliability of inference.

Journal Article

PLINK: A Tool Set for Whole-Genome Association and Population-Based Linkage Analyses

Shaun Purcell, +17 more

- 01 Sep 2007 -

American Journal of Human Genetics

TL;DR: This work introduces PLINK, an open-source C/C++ whole-genome association study (WGAS) tool set, and describes the five main domains of function: data management, summary statistics, population stratification, association analysis, and identity-by-descent estimation, which focuses on the estimation and use of identity-by-state and identity-by-descent information in the context of population-based whole-genome studies.

Journal Article

featureCounts: an efficient general-purpose program for assigning sequence reads to genomic features

Yang Liao, +2 more

- 01 Apr 2014 -

Bioinformatics

TL;DR: featureCounts is a read summarization program suitable for counting reads generated from either RNA or genomic DNA sequencing experiments. It implements highly efficient chromosome hashing and feature blocking techniques.

Related Papers (5)

f-scLVM: scalable and versatile factor analysis for single-cell RNA-seq.

Florian Buettner, +6 more

- 07 Nov 2017 -

Genome Biology

Single-cell RNA sequencing identifies celltype-specific cis-eQTLs and co-expression QTLs.

Monique G. P. van der Wijst, +5 more

- 02 Apr 2018 -

Nature Genetics

Comparison of clustering tools in R for medium-sized 10x Genomics single-cell RNA-sequencing data.

Saskia Freytag, +7 more

- 15 Aug 2018 -

F1000Research

DeepImpute: an accurate, fast, and scalable deep neural network method to impute single-cell RNA-seq data

Cedric Arisdakessian, +5 more

- 18 Oct 2019 -

Genome Biology

Integrating single-cell transcriptomic data across different conditions, technologies, and species.

Andrew Butler, +4 more

- 02 Apr 2018 -

Nature Biotechnology

Citations

Village in a dish: a model system for population-scale hiPSC studies
