A Next Generation Connectivity Map: L1000 Platform and the First 1,000,000 Profiles.
Citations
9 citations
9 citations
Cites background or methods from "A Next Generation Connectivity Map:..."
...Notably, in the original L1000 preprocessing pipeline (Subramanian et al., 2017), the control profiles were replaced by all the profiles on the plate, called population control....
[...]
...For a given cell line, suppose that there areN genes (marked as the indices 1, 2,…, N) with measured shRNA data from the LINCS L1000project (Subramanian et al., 2017)....
[...]
...Specifically, the raw data from the LINCS L1000 project were preprocessed based on the pipeline in the original paper (Subramanian et al., 2017) with minor modifications; We first directly obtained the Level 3 data from L1000, which contained the quantile normalized gene expression profiles....
[...]
...…of shRNAs and obtain a robust signature for each single gene, the z-scores obtained fromthe replicated trials of the same shRNA were first processed using an algorithm with L1000 Level 5 data (Subramanian et al., 2017), then the same protocol was used to reduce the shRNAs targeting the same gene....
[...]
...The cell-line specific gene expression profiles resulting from the shRNA knockdownexperiments in theLINCSL1000project (Subramanian et al., 2017)were used to capture the informationof cell-line specific genetic background....
[...]
9 citations
Cites background or methods from "A Next Generation Connectivity Map:..."
...Normalized transcriptome data of 14 compounds on OCI-LY3 cells were obtained from NCBI Gene Expression Omnibus under the series accession no....
[...]
...The microarray datasets of MYC siRNA experiments were deposited in the NCBI Gene Expression Omnibus under the series accession no....
[...]
...Furthermore, based on measured expression levels of landmark genes, the expression levels of ~21,000 unmeasured genes were inferred by a linear regression model, in which the weight coefficient was estimated from the substantial transcriptome data [10, 11]....
[...]
...978 landmark genes were determined as informative genes from multivariate analysis using 12063 public transcriptome microarray data catalogued in the Gene Expression Omnibus [10]....
[...]
...3 million cell conditions, consisting of compound treatments (multiple doses) and genetic perturbation treatments (knockdown by shRNA, overexpression, and ligand treatment) at multiple time points in several different cell lines [10]....
[...]
9 citations
Cites background from "A Next Generation Connectivity Map:..."
...The L1000 Next-generation Connectivity Map, for instance, contains about one million post-treatment gene expression signatures for about twenty thousand molecules [65]....
[...]
...In this line, the release of CCL screens with readouts other than growth inhibition or proliferation rate [65, 66] will help unveil the connections between the genetic background of the cells and the phenotypic outcome of drug treatment....
[...]
Summary
Identifying the chemical regulators of biological pathways is a time-consuming bottleneck in developing therapeutics and research compounds. Typically, thousands to millions of candidate small molecules are tested in target-based biochemical screens or phenotypic cell-based screens, both expensive experiments customized to each disease. Here, our uncustomized, virtual, profile-based screening approach instead identifies compounds that match to pathways based on the phenotypic information in public cell image data, created using the Cell Painting assay. Our straightforward correlation-based computational strategy retrospectively uncovered the expected, known small-molecule regulators for 32% of positive-control gene queries. In prospective, discovery mode, we efficiently identified new compounds related to three query genes and validated them in subsequent gene-relevant assays, including compounds that phenocopy or pheno-oppose YAP1 overexpression and kill a Yap1-dependent sarcoma cell line. This image-profile-based approach could replace many customized labor- and resource-intensive screens and accelerate the discovery of biologically and therapeutically useful compounds.9 citations
References
34,830 citations
30,124 citations
10,968 citations
8,326 citations
6,319 citations