A census of human RNA-binding proteins.

doi:10.1038/NRG3813

Home
/
Papers
/
A census of human RNA-binding proteins.

Journal Article•DOI•

A census of human RNA-binding proteins.

Stefanie Gerstberger¹, Markus Hafner², Thomas Tuschl¹•Institutions (2)

Howard Hughes Medical Institute¹, National Institutes of Health²

04 Nov 2014-Nature Reviews Genetics (Nat Rev Genet)-Vol. 15, Iss: 12, pp 829-845

TL;DR: This work presents a census of 1,542 manually curated RBPs that are analysed for their interactions with different classes of RNA, their evolutionary conservation, their abundance and their tissue-specific expression, a critical step towards the comprehensive characterization of proteins involved in human RNA metabolism.

read less

Abstract: Post-transcriptional gene regulation (PTGR) concerns processes involved in the maturation, transport, stability and translation of coding and non-coding RNAs. RNA-binding proteins (RBPs) and ribonucleoproteins coordinate RNA processing and PTGR. The introduction of large-scale quantitative methods, such as next-generation sequencing and modern protein mass spectrometry, has renewed interest in the investigation of PTGR and the protein factors involved at a systems-biology level. Here, we present a census of 1,542 manually curated RBPs that we have analysed for their interactions with different classes of RNA, their evolutionary conservation, their abundance and their tissue-specific expression. Our analysis is a critical step towards the comprehensive characterization of proteins involved in human RNA metabolism.

...read moreread less

Content maybe subject to copyright Report

Citations

PDF

Open Access

More filters

Journal Article•DOI•

Robust transcriptome-wide discovery of RNA-binding protein binding sites with enhanced CLIP (eCLIP)

[...]

Eric L. Van Nostrand¹, Gabriel A. Pratt, Alexander A. Shishkin², Chelsea Gelboin-Burkhart¹, Mark Y. Fang¹, Balaji Sundararaman¹, Steven M. Blue¹, Thai B. Nguyen¹, Christine Surka², Keri Elkins¹, Rebecca Stanton¹, Frank Rigo, Mitchell Guttman², Gene W. Yeo - Show less +10 more•Institutions (2)

University of California, San Diego¹, California Institute of Technology²

01 Jun 2016-Nature Methods

TL;DR: An enhanced CLIP (eCLIP) protocol is developed that decreases requisite amplification by ∼1,000-fold, decreasing discarded PCR duplicate reads by ∼60% while maintaining single-nucleotide binding resolution, and improves specificity in the discovery of authentic binding sites.

...read moreread less

Abstract: As RNA-binding proteins (RBPs) play essential roles in cellular physiology by interacting with target RNA molecules, binding site identification by UV crosslinking and immunoprecipitation (CLIP) of ribonucleoprotein complexes is critical to understanding RBP function. However, current CLIP protocols are technically demanding and yield low-complexity libraries with high experimental failure rates. We have developed an enhanced CLIP (eCLIP) protocol that decreases requisite amplification by ~1,000-fold, decreasing discarded PCR duplicate reads by ~60% while maintaining single-nucleotide binding resolution. By simplifying the generation of paired IgG and size-matched input controls, eCLIP improves specificity in the discovery of authentic binding sites. We generated 102 eCLIP experiments for 73 diverse RBPs in HepG2 and K562 cells (available at https://www.encodeproject.org), demonstrating that eCLIP enables large-scale and robust profiling, with amplification and sample requirements similar to those of ChIP-seq. eCLIP enables integrative analysis of diverse RBPs to reveal factor-specific profiles, common artifacts for CLIP and RNA-centric perspectives on RBP activity.

...read moreread less

1,027 citations

Journal Article•DOI•

A brave new world of RNA-binding proteins

[...]

Matthias W. Hentze, Alfredo Castello¹, Thomas Schwarzl, Thomas Preiss²•Institutions (2)

University of Oxford¹, Australian National University²

17 Jan 2018-Nature Reviews Molecular Cell Biology

TL;DR: The RNA targets and molecular and cellular functions of the new RBPs, as well as the possibility that some RBPs may be regulated by RNA rather than regulate RNA, are discussed.

...read moreread less

Abstract: RNA-binding proteins (RBPs) are typically thought of as proteins that bind RNA through one or multiple globular RNA-binding domains (RBDs) and change the fate or function of the bound RNAs. Several hundred such RBPs have been discovered and investigated over the years. Recent proteome-wide studies have more than doubled the number of proteins implicated in RNA binding and uncovered hundreds of additional RBPs lacking conventional RBDs. In this Review, we discuss these new RBPs and the emerging understanding of their unexpected modes of RNA binding, which can be mediated by intrinsically disordered regions, protein-protein interaction interfaces and enzymatic cores, among others. We also discuss the RNA targets and molecular and cellular functions of the new RBPs, as well as the possibility that some RBPs may be regulated by RNA rather than regulate RNA.

...read moreread less

1,013 citations

Journal Article•DOI•

Expanded encyclopaedias of DNA elements in the human and mouse genomes

[...]

Jill Moore¹, Michael J. Purcaro¹, Henry Pratt¹, Charles B. Epstein², Noam Shoresh², Jessika Adrian³, Trupti Kawli³, Carrie A. Davis⁴, Alexander Dobin⁴, Rajinder Kaul⁵, Jessica Halow, Eric L. Van Nostrand⁶, Peter Freese⁷, David U. Gorkin⁸, David U. Gorkin⁶, Yin Shen⁸, Yin Shen⁹, Yupeng He¹⁰, Mark Mackiewicz, Florencia Pauli-Behn, Brian A. Williams¹¹, Ali Mortazavi¹², Cheryl A. Keller¹³, Xiao-Ou Zhang¹, Shaimae I. Elhajjajy¹, Jack Huey¹, Diane E. Dickel¹⁴, Valentina Snetkova¹⁴, Xintao Wei¹⁵, Xiaofeng Wang¹⁶, Xiaofeng Wang¹⁷, Juan Carlos Rivera-Mulia¹⁸, Juan Carlos Rivera-Mulia¹⁹, Joel Rozowsky²⁰, Jing Zhang²⁰, Surya B. Chhetri²¹, Jialing Zhang²⁰, Alec Victorsen²², Kevin P. White, Axel Visel¹⁴, Axel Visel²³, Gene W. Yeo⁶, Christopher B. Burge⁷, Eric Lécuyer¹⁷, Eric Lécuyer¹⁶, David M. Gilbert¹⁹, Job Dekker¹, John L. Rinn²⁴, Eric M. Mendenhall²¹, Joseph R. Ecker¹⁰, Manolis Kellis⁷, Manolis Kellis², Robert J. Klein²⁵, William Stafford Noble⁵, Anshul Kundaje³, Roderic Guigó²⁶, Peggy J. Farnham²⁷, J. Michael Cherry³, Richard M. Myers, Bing Ren⁸, Bing Ren⁶, Brenton R. Graveley¹⁵, Mark Gerstein²⁰, Len A. Pennacchio²⁸, Len A. Pennacchio¹⁴, Michael Snyder³, Bradley E. Bernstein²⁹, Barbara J. Wold¹¹, Ross C. Hardison¹³, Thomas R. Gingeras⁴, John A. Stamatoyannopoulos⁵, Zhiping Weng³⁰, Zhiping Weng³¹, Zhiping Weng¹ - Show less +70 more•Institutions (31)

University of Massachusetts Medical School¹, Broad Institute², Stanford University³, Cold Spring Harbor Laboratory⁴, University of Washington⁵, University of California, San Diego⁶, Massachusetts Institute of Technology⁷, Ludwig Institute for Cancer Research⁸, University of California, San Francisco⁹, Salk Institute for Biological Studies¹⁰, California Institute of Technology¹¹, University of California, Irvine¹², Pennsylvania State University¹³, Lawrence Berkeley National Laboratory¹⁴, University of Connecticut Health Center¹⁵, Université de Montréal¹⁶, McGill University¹⁷, University of Minnesota¹⁸, Florida State University¹⁹, Yale University²⁰, University of Alabama in Huntsville²¹, University of Chicago²², University of California, Merced²³, University of Colorado Boulder²⁴, Icahn School of Medicine at Mount Sinai²⁵, Pompeu Fabra University²⁶, University of Southern California²⁷, University of California, Berkeley²⁸, Harvard University²⁹, Boston University³⁰, Tongji University³¹

29 Jul 2020-Nature

TL;DR: The authors summarize the data produced by phase III of the Encyclopedia of DNA Elements (ENCODE) project, a resource for better understanding of the human and mouse genomes, which have produced 5,992 new experimental datasets, including systematic determinations across mouse fetal development.

...read moreread less

Abstract: The human and mouse genomes contain instructions that specify RNAs and proteins and govern the timing, magnitude, and cellular context of their production. To better delineate these elements, phase III of the Encyclopedia of DNA Elements (ENCODE) Project has expanded analysis of the cell and tissue repertoires of RNA transcription, chromatin structure and modification, DNA methylation, chromatin looping, and occupancy by transcription factors and RNA-binding proteins. Here we summarize these efforts, which have produced 5,992 new experimental datasets, including systematic determinations across mouse fetal development. All data are available through the ENCODE data portal (https://www.encodeproject.org), including phase II ENCODE1 and Roadmap Epigenomics2 data. We have developed a registry of 926,535 human and 339,815 mouse candidate cis-regulatory elements, covering 7.9 and 3.4% of their respective genomes, by integrating selected datatypes associated with gene regulation, and constructed a web-based server (SCREEN; http://screen.encodeproject.org) to provide flexible, user-defined access to this resource. Collectively, the ENCODE data and registry provide an expansive resource for the scientific community to build a better understanding of the organization and function of the human and mouse genomes.

...read moreread less

999 citations

Journal Article•DOI•

Promoter-bound METTL3 maintains myeloid leukaemia by m 6 A-dependent translation control

[...]

Isaia Barbieri¹, Konstantinos Tzelepis², Luca Pandolfini¹, Junwei Shi³, Gonzalo Millán-Zambrano¹, Samuel Robson¹, Demetrios Aspris², Valentina Migliori¹, Andrew J. Bannister¹, Namshik Han¹, Etienne De Braekeleer², Hannes Ponstingl², Alan G. Hendrick, Christopher R. Vakoc³, George S. Vassiliou², Tony Kouzarides¹ - Show less +12 more•Institutions (3)

Wellcome Trust/Cancer Research UK Gurdon Institute¹, Wellcome Trust Sanger Institute², Cold Spring Harbor Laboratory³

07 Dec 2017-Nature

TL;DR: Together, these data define METTL3 as a regulator of a chromatin-based pathway that is necessary for maintenance of the leukaemic state and identify this enzyme as a potential therapeutic target for acute myeloid leukaemia.

...read moreread less

Abstract: N6-methyladenosine (m6A) is an abundant internal RNA modification in both coding and non-coding RNAs that is catalysed by the METTL3-METTL14 methyltransferase complex. However, the specific role of these enzymes in cancer is still largely unknown. Here we define a pathway that is specific for METTL3 and is implicated in the maintenance of a leukaemic state. We identify METTL3 as an essential gene for growth of acute myeloid leukaemia cells in two distinct genetic screens. Downregulation of METTL3 results in cell cycle arrest, differentiation of leukaemic cells and failure to establish leukaemia in immunodeficient mice. We show that METTL3, independently of METTL14, associates with chromatin and localizes to the transcriptional start sites of active genes. The vast majority of these genes have the CAATT-box binding protein CEBPZ present at the transcriptional start site, and this is required for recruitment of METTL3 to chromatin. Promoter-bound METTL3 induces m6A modification within the coding region of the associated mRNA transcript, and enhances its translation by relieving ribosome stalling. We show that genes regulated by METTL3 in this way are necessary for acute myeloid leukaemia. Together, these data define METTL3 as a regulator of a chromatin-based pathway that is necessary for maintenance of the leukaemic state and identify this enzyme as a potential therapeutic target for acute myeloid leukaemia.

...read moreread less

705 citations

MicroRNAs: Target Recognition and Regulatory Functions

[...]

David P. Bartel¹•Institutions (1)

Massachusetts Institute of Technology¹

01 Jan 2009

TL;DR: In this article, a review outlines the current understanding of miRNA target recognition in animals and discusses the widespread impact of miRNAs on both the expression and evolution of protein-coding genes.

...read moreread less

Abstract: MicroRNAs (miRNAs) are endogenous ∼23 nt RNAs that play important gene-regulatory roles in animals and plants by pairing to the mRNAs of protein-coding genes to direct their posttranscriptional repression. This review outlines the current understanding of miRNA target recognition in animals and discusses the widespread impact of miRNAs on both the expression and evolution of protein-coding genes.

...read moreread less

646 citations

1
2
3
4
…
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
61
62
63
64
65
66
67
68
69
70
71
72
73
74
75
76
77
78
79
80
81
82
83
84
85
86
87
88
89
90
91
92
93
94
95
96
97
98
99
100
101
102
103
104
105
106
107
108
109
110
111
112
113
114
115
116
117
118
119
120
121
122
123
124
125
126
127
128
129
130
131
132
133
134
135
136
137
138
139
140
141
142
143
144
145
146
147
148
149
150
151
152
153
154
155
156
157
158
159
160
161
162
163
164
165
166
167
168
169
170
171
172
173
174
175
176
177
178
179
180
181
182
183
184
185
186
187
188
189
190
191
192
193
194
195
196
197
198
199
200

Collapse

References

PDF

Open Access

More filters

Journal Article•DOI•

Basic Local Alignment Search Tool

[...]

Stephen F. Altschul¹, Warren Gish¹, Webb Miller², Eugene W. Myers³, David J. Lipman¹ - Show less +1 more•Institutions (3)

National Institutes of Health¹, Pennsylvania State University², University of Arizona³

01 Oct 1990-Journal of Molecular Biology

TL;DR: A new approach to rapid sequence comparison, basic local alignment search tool (BLAST), directly approximates alignments that optimize a measure of local similarity, the maximal segment pair (MSP) score.

...read moreread less

88,255 citations

"A census of human RNA-binding prote..." refers background in this paper

...database, thereby adding sequence-related insect rRNA sequences from the NCBI nucleotide database (Altschul et al., 1990)....
[...]

Journal Article•DOI•

Fast and accurate short read alignment with Burrows–Wheeler transform

[...]

Heng Li¹, Richard Durbin¹•Institutions (1)

Wellcome Trust Sanger Institute¹

01 Jul 2009-Bioinformatics

TL;DR: Burrows-Wheeler Alignment tool (BWA) is implemented, a new read alignment package that is based on backward search with Burrows–Wheeler Transform (BWT), to efficiently align short sequencing reads against a large reference sequence such as the human genome, allowing mismatches and gaps.

...read moreread less

Abstract: Motivation: The enormous amount of short reads generated by the new DNA sequencing technologies call for the development of fast and accurate read alignment programs. A first generation of hash table-based methods has been developed, including MAQ, which is accurate, feature rich and fast enough to align short reads from a single individual. However, MAQ does not support gapped alignment for single-end reads, which makes it unsuitable for alignment of longer reads where indels may occur frequently. The speed of MAQ is also a concern when the alignment is scaled up to the resequencing of hundreds of individuals. Results: We implemented Burrows-Wheeler Alignment tool (BWA), a new read alignment package that is based on backward search with Burrows–Wheeler Transform (BWT), to efficiently align short sequencing reads against a large reference sequence such as the human genome, allowing mismatches and gaps. BWA supports both base space reads, e.g. from Illumina sequencing machines, and color space reads from AB SOLiD machines. Evaluations on both simulated and real data suggest that BWA is ~10–20× faster than MAQ, while achieving similar accuracy. In addition, BWA outputs alignment in the new standard SAM (Sequence Alignment/Map) format. Variant calling and other downstream analyses after the alignment can be achieved with the open source SAMtools software package. Availability: http://maq.sourceforge.net Contact: [email protected]

...read moreread less

43,862 citations

Journal Article•DOI•

Gene Ontology: tool for the unification of biology

[...]

M Ashburner¹, Catherine A. Ball, Judith A. Blake, David Botstein, Heather Butler, J. M. Cherry, Allan Peter Davis, Kara Dolinski, Selina S. Dwight, J.T. Eppig, Midori A. Harris, David P. Hill, Laurie Issel-Tarver, Andrew Kasarskis, Suzanna E. Lewis, John C. Matese, Joel E. Richardson, M. Ringwald, Gerald M. Rubin, Gavin Sherlock - Show less +16 more•Institutions (1)

Stanford University¹

01 May 2000-Nature Genetics

TL;DR: The goal of the Gene Ontology Consortium is to produce a dynamic, controlled vocabulary that can be applied to all eukaryotes even as knowledge of gene and protein roles in cells is accumulating and changing.

...read moreread less

Abstract: Genomic sequencing has made it clear that a large fraction of the genes specifying the core biological functions are shared by all eukaryotes. Knowledge of the biological role of such shared proteins in one organism can often be transferred to other organisms. The goal of the Gene Ontology Consortium is to produce a dynamic, controlled vocabulary that can be applied to all eukaryotes even as knowledge of gene and protein roles in cells is accumulating and changing. To this end, three independent ontologies accessible on the World-Wide Web (http://www.geneontology.org) are being constructed: biological process, molecular function and cellular component.

...read moreread less

35,225 citations

Journal Article•DOI•

Systematic and integrative analysis of large gene lists using DAVID bioinformatics resources.

[...]

Da-Wei Huang¹, Brad T. Sherman¹, Richard A. Lempicki¹•Institutions (1)

Science Applications International Corporation¹

01 Jan 2009-Nature Protocols

TL;DR: By following this protocol, investigators are able to gain an in-depth understanding of the biological themes in lists of genes that are enriched in genome-scale studies.

...read moreread less

Abstract: DAVID bioinformatics resources consists of an integrated biological knowledgebase and analytic tools aimed at systematically extracting biological meaning from large gene/protein lists. This protocol explains how to use DAVID, a high-throughput and integrated data-mining environment, to analyze gene lists derived from high-throughput genomic experiments. The procedure first requires uploading a gene list containing any number of common gene identifiers followed by analysis using one or more text and pathway-mining tools such as gene functional classification, functional annotation chart or clustering and functional annotation table. By following this protocol, investigators are able to gain an in-depth understanding of the biological themes in lists of genes that are enriched in genome-scale studies.

...read moreread less

31,015 citations

"A census of human RNA-binding prote..." refers methods in this paper

...Gene Ontology [using the DAVID functional annotation database (Ashburner et al., 2000; Huang et al., 2008)] and GOrilla (Eden et al....
[...]

Journal Article•DOI•

Ultrafast and memory-efficient alignment of short DNA sequences to the human genome

[...]

Ben Langmead¹, Cole Trapnell¹, Mihai Pop¹, Steven L. Salzberg¹•Institutions (1)

University of Maryland, College Park¹

04 Mar 2009-Genome Biology

TL;DR: Bowtie extends previous Burrows-Wheeler techniques with a novel quality-aware backtracking algorithm that permits mismatches and can be used simultaneously to achieve even greater alignment speeds.

...read moreread less

Abstract: Bowtie is an ultrafast, memory-efficient alignment program for aligning short DNA sequence reads to large genomes. For the human genome, Burrows-Wheeler indexing allows Bowtie to align more than 25 million reads per CPU hour with a memory footprint of approximately 1.3 gigabytes. Bowtie extends previous Burrows-Wheeler techniques with a novel quality-aware backtracking algorithm that permits mismatches. Multiple processor cores can be used simultaneously to achieve even greater alignment speeds. Bowtie is open source http://bowtie.cbcb.umd.edu.

...read moreread less

20,335 citations