Author

Owen J. L. Rackham

Other affiliations: Hammersmith Hospital, Medical Research Council, Imperial College London, ...

Bio: Owen J. L. Rackham is an academic researcher from the National University of Singapore. The author has contributed to research on topics including reprogramming and regulation of gene expression. The author has an h-index of 26 and has co-authored 66 publications receiving 6,160 citations. Previous affiliations of Owen J. L. Rackham include Hammersmith Hospital and the Medical Research Council.

Topics: Reprogramming, Regulation of gene expression, Medicine, Biology, Stem cell, ...

Papers published on a yearly basis (chart axis: 2010 to 2023)

Papers

Journal Article

An atlas of active enhancers across human cell types and tissues

Robin Andersson¹, Claudia Gebhard², Irene Miguel-Escalada³, Ilka Hoof¹, Jette Bornholdt¹, Mette Boyd¹, Yun Chen¹, Xiaobei Zhao¹,⁴, Christian Schmidl², Takahiro Suzuki, Evgenia Ntini, Erik Arner, Eivind Valen¹,⁵, Kang Li¹, Lucia Schwarzfischer², Dagmar Glatz², Johanna Raithel², Berit Lilje¹, Nicolas Rapin¹, Frederik Otzen Bagger¹, Mette Rose Jørgensen¹, Peter Refsing Andersen⁶, Nicolas Bertin, Owen J. L. Rackham, A. Maxwell Burroughs, J Kenneth Baillie⁷, Yuri Ishizu, Yuri Shimizu, Erina Furuhata, Shiori Maeda, Yutaka Negishi, Christopher J. Mungall⁸, Terrence F. Meehan⁹, Timo Lassmann, Masayoshi Itoh, Hideya Kawaji, Naoto Kondo, Jun Kawai, Andreas Lennartsson¹⁰, Carsten O. Daub¹⁰, Peter Heutink¹¹, David A. Hume⁷, Torben Heick Jensen⁶, Harukazu Suzuki, Yoshihide Hayashizaki, Ferenc Müller³, Alistair R. R. Forrest, Piero Carninci, Michael Rehli², Albin Sandelin¹

University of Copenhagen¹, University Hospital Regensburg², University of Birmingham³, University of North Carolina at Chapel Hill⁴, Harvard University⁵, Aarhus University⁶, University of Edinburgh⁷, Lawrence Berkeley National Laboratory⁸, European Bioinformatics Institute⁹, Karolinska Institutet¹⁰, VU University Medical Center¹¹

27 Mar 2014-Nature

TL;DR: It is shown that enhancers share properties with CpG-poor messenger RNA promoters but produce bidirectional, exosome-sensitive, relatively short unspliced RNAs, the generation of which is strongly related to enhancer activity.

Abstract: Enhancers control the correct temporal and cell-type-specific activation of gene expression in multicellular eukaryotes. Knowing their properties, regulatory activity and targets is crucial to understand the regulation of differentiation and homeostasis. Here we use the FANTOM5 panel of samples, covering the majority of human tissues and cell types, to produce an atlas of active, in vivo-transcribed enhancers. We show that enhancers share properties with CpG-poor messenger RNA promoters but produce bidirectional, exosome-sensitive, relatively short unspliced RNAs, the generation of which is strongly related to enhancer activity. The atlas is used to compare regulatory programs between different cells at unprecedented depth, to identify disease-associated regulatory single nucleotide polymorphisms, and to classify cell-type-specific and ubiquitous enhancers. We further explore the utility of enhancer redundancy, which explains gene expression strength rather than expression patterns. The online FANTOM5 enhancer atlas represents a unique resource for studies on cell-type-specific enhancers and gene regulation.

2,260 citations

Journal Article

A promoter-level mammalian expression atlas

Alistair R. R. Forrest, Hideya Kawaji, Michael Rehli¹, J Kenneth Baillie² (+277 more; 63 institutions)

27 Mar 2014-Nature

TL;DR: The authors mapped transcription start sites (TSSs) and their usage in human and mouse primary cells, cell lines and tissues to produce a comprehensive overview of mammalian gene expression across the human body.

Abstract: Regulated transcription controls the diversity, developmental pathways and spatial organization of the hundreds of cell types that make up a mammal. Using single-molecule cDNA sequencing, we mapped transcription start sites (TSSs) and their usage in human and mouse primary cells, cell lines and tissues to produce a comprehensive overview of mammalian gene expression across the human body. We find that few genes are truly 'housekeeping', whereas many mammalian promoters are composite entities composed of several closely separated TSSs, with independent cell-type-specific expression profiles. TSSs specific to different cell types evolve at different rates, whereas promoters of broadly expressed genes are the most conserved. Promoter-based expression analysis reveals key transcription factors defining cell states and links them to binding-site motifs. The functions of identified novel transcripts can be predicted by coexpression and sample ontology enrichment analyses. The functional annotation of the mammalian genome 5 (FANTOM5) project provides comprehensive expression profiles and functional annotation of mammalian cell-type-specific transcriptomes with wide applications in biomedical research.

1,715 citations

Journal Article

An atlas of human long non-coding RNAs with accurate 5′ ends

Chung-Chau Hon¹, Jordan A. Ramilowski, Jayson Harshbarger, Nicolas Bertin²,³, Owen J. L. Rackham³,⁴, Julian Gough⁴, Elena Denisenko⁵, Sebastian Schmeier⁵, Thomas M. Poulsen⁶, Jessica Severin, Marina Lizio, Hideya Kawaji, Takeya Kasukawa¹, Masayoshi Itoh, A. Maxwell Burroughs⁷, Shohei Noma, Sarah Djebali²,⁸, Tanvir Alam⁹, Yulia A. Medvedeva, Alison C. Testa¹⁰, Leonard Lipovich¹¹, Chi Wai Yip¹, Imad Abugessaisa¹, Mickal Mendez², Akira Hasegawa, Dave Tang¹², Timo Lassmann¹², Peter Heutink¹³, Magda Babina¹⁴, Christine A. Wells¹⁵,¹⁶, Soichi Kojima, Yukio Nakamura¹⁷, Harukazu Suzuki, Carsten O. Daub¹⁸, Michiel J. L. de Hoon, Erik Arner, Yoshihide Hayashizaki, Piero Carninci, Alistair R. R. Forrest¹⁰

Centre for Life¹, University of Toulouse², National University of Singapore³, University of Bristol⁴, Massey University⁵, National Institute of Advanced Industrial Science and Technology⁶, National Institutes of Health⁷, Barcelona Biomedical Research Park⁸, King Abdullah University of Science and Technology⁹, Harry Perkins Institute of Medical Research¹⁰, Wayne State University¹¹, University of Western Australia¹², German Center for Neurodegenerative Diseases¹³, Charité¹⁴, University of Melbourne¹⁵, University of Queensland¹⁶, University of Tsukuba¹⁷, Karolinska Institutet¹⁸

09 Mar 2017-Nature

TL;DR: This work integrates multiple transcript collections to generate a comprehensive atlas of 27,919 human lncRNA genes with high-confidence 5′ ends and expression profiles across 1,829 samples from the major human primary cell types and tissues, identifying 19,175 potentially functional lncRNAs in the human genome.

Abstract: Long non-coding RNAs (lncRNAs) are largely heterogeneous and functionally uncharacterized. Here, using FANTOM5 cap analysis of gene expression (CAGE) data, we integrate multiple transcript collections to generate a comprehensive atlas of 27,919 human lncRNA genes with high-confidence 5' ends and expression profiles across 1,829 samples from the major human primary cell types and tissues. Genomic and epigenomic classification of these lncRNAs reveals that most intergenic lncRNAs originate from enhancers rather than from promoters. Incorporating genetic and expression data, we show that lncRNAs overlapping trait-associated single nucleotide polymorphisms are specifically expressed in cell types relevant to the traits, implicating these lncRNAs in multiple diseases. We further demonstrate that lncRNAs overlapping expression quantitative trait loci (eQTL)-associated single nucleotide polymorphisms of messenger RNAs are co-expressed with the corresponding messenger RNAs, suggesting their potential roles in transcriptional regulation. Combining these findings with conservation data, we identify 19,175 potentially functional lncRNAs in the human genome.

821 citations

Journal Article

A single-cell atlas of entorhinal cortex from individuals with Alzheimer's disease reveals cell-type-specific gene expression regulation.

Alexandra Grubman¹,²,³, Gabriel Chew⁴, John F. Ouyang⁴, Guizhi Sun¹,²,³, Xin Yi Choo, Catriona McLean, Rebecca K. Simmons⁵,⁶, Sam Buckberry⁵,⁶, Dulce B. Vargas-Landin⁵,⁶, Daniel Poppe⁵,⁶, Jahnvi Pflueger⁵,⁶, Ryan Lister⁵,⁶, Owen J. L. Rackham⁴, Enrico Petretto⁴, Jose M. Polo¹,²,³

Discovery Institute¹, Australian Regenerative Medicine Institute², Monash University, Clayton campus³, National University of Singapore⁴, Harry Perkins Institute of Medical Research⁵, University of Western Australia⁶

01 Dec 2019-Nature Neuroscience

TL;DR: Single-nucleus RNA sequencing is applied to entorhinal cortex samples from control and Alzheimer’s disease brains, providing insights into the coordinated control of Alzheimer’s disease risk genes and their cell-type-specific contribution to disease susceptibility, and identifying transcription factor networks predicted to control disease progression in a cell-subtype-specific way.

Abstract: There is currently little information available about how individual cell types contribute to Alzheimer’s disease. Here we applied single-nucleus RNA sequencing to entorhinal cortex samples from control and Alzheimer’s disease brains (n = 6 per group), yielding a total of 13,214 high-quality nuclei. We detail cell-type-specific gene expression patterns, unveiling how transcriptional changes in specific cell subpopulations are associated with Alzheimer’s disease. We report that the Alzheimer’s disease risk gene APOE is specifically repressed in Alzheimer’s disease oligodendrocyte progenitor cells and astrocyte subpopulations and upregulated in an Alzheimer’s disease-specific microglial subpopulation. Integrating transcription factor regulatory modules with Alzheimer’s disease risk loci revealed drivers of cell-type-specific state transitions towards Alzheimer’s disease. For example, transcription factor EB, a master regulator of lysosomal function, regulates multiple disease genes in a specific Alzheimer’s disease astrocyte subpopulation. These results provide insights into the coordinated control of Alzheimer’s disease risk genes and their cell-type-specific contribution to disease susceptibility. These results are available at http://adsn.ddnetbio.com. Grubman et al. generated a single-cell transcriptomic atlas of the entorhinal cortex from patients with Alzheimer’s disease and identified transcription factor networks predicted to control disease progression in a cell-subtype-specific way.

495 citations

Journal Article

IL-11 is a crucial determinant of cardiovascular fibrosis

Sebastian Schafer¹, Sivakumar Viswanathan¹, Anissa A. Widjaja¹, W. Lim, Aida Moreno-Moral¹, Daniel M. DeLaughter², Benjamin Ng, Giannino Patone³, Kingsley Chow, Ester Khin¹, Jessie Tan, Sonia Chothani¹, Lei Ye, Owen J. L. Rackham¹, Nicole S. J. Ko¹, Norliza E Sahib¹, Chee Jian Pua, Nicole T G Zhen, Chen Xie, Mao Wang¹, Henrike Maatz³, Shiqi Lim, Kathrin Saar³, Susanne Blachut³, Enrico Petretto¹, Sabine Schmidt³, Tracy L Putoczki⁴,⁵, Nuno Guimarães-Camboa⁶, Hiroko Wakimoto², Sebastiaan van Heesch³, Kristmundur Sigmundsson¹, See L Lim, Jia L Soon¹, Victor Tt Chao¹, Yeow L Chua, Teing Ee Tan, Sylvia M. Evans⁶,⁷, Yee J Loh⁸, Muhammad H. Jamal, Kim K Ong⁸, Kim C. Chua, Boon-Hean Ong, Mathew J Chakaramakkil, Jonathan G. Seidman², Christine E. Seidman²,⁹,¹⁰, Norbert Hubner, Kenny Yk Sin¹, Stuart A. Cook

National University of Singapore¹, Harvard University², Max Delbrück Center for Molecular Medicine³, Walter and Eliza Hall Institute of Medical Research⁴, University of Melbourne⁵, University of Montana⁶, University of California, San Diego⁷, Boston Children's Hospital⁸, Howard Hughes Medical Institute⁹, Brigham and Women's Hospital¹⁰

07 Dec 2017-Nature

TL;DR: It is shown that upregulation of interleukin-11 (IL-11) is the dominant transcriptional response to TGFβ1 exposure and required for its pro-fibrotic effect, and proposed that inhibition of IL-11 is a potential therapeutic strategy to treat fibrotic diseases.

Abstract: Fibrosis is a common pathology in cardiovascular disease. In the heart, fibrosis causes mechanical and electrical dysfunction and in the kidney, it predicts the onset of renal failure. Transforming growth factor β1 (TGFβ1) is the principal pro-fibrotic factor, but its inhibition is associated with side effects due to its pleiotropic roles. We hypothesized that downstream effectors of TGFβ1 in fibroblasts could be attractive therapeutic targets and lack upstream toxicity. Here we show, using integrated imaging-genomics analyses of primary human fibroblasts, that upregulation of interleukin-11 (IL-11) is the dominant transcriptional response to TGFβ1 exposure and required for its pro-fibrotic effect. IL-11 and its receptor (IL11RA) are expressed specifically in fibroblasts, in which they drive non-canonical, ERK-dependent autocrine signalling that is required for fibrogenic protein synthesis. In mice, fibroblast-specific Il11 transgene expression or Il-11 injection causes heart and kidney fibrosis and organ failure, whereas genetic deletion of Il11ra1 protects against disease. Therefore, inhibition of IL-11 prevents fibroblast activation across organs and species in response to a range of important pro-fibrotic stimuli. These results reveal a central role of IL-11 in fibrosis and we propose that inhibition of IL-11 is a potential therapeutic strategy to treat fibrotic diseases.

414 citations

Cited by

Journal Article

The Phyre2 web portal for protein modeling, prediction and analysis

Lawrence A. Kelley¹, Stefans Mezulis¹, Christopher M. Yates¹,², Mark N. Wass¹,³, Michael J.E. Sternberg¹

Imperial College London¹, University College London², University of Kent³

07 May 2015-Nature Protocols

TL;DR: An updated protocol for Phyre2, which uses advanced remote homology detection methods to build 3D models, predict ligand binding sites and analyze the effect of amino acid variants for a user's protein sequence.

Abstract: Phyre2 is a web-based tool for predicting and analyzing protein structure and function. Phyre2 uses advanced remote homology detection methods to build 3D models, predict ligand binding sites, and analyze amino acid variants in a protein sequence. Phyre2 is a suite of tools available on the web to predict and analyze protein structure, function and mutations. The focus of Phyre2 is to provide biologists with a simple and intuitive interface to state-of-the-art protein bioinformatics tools. Phyre2 replaces Phyre, the original version of the server for which we previously published a paper in Nature Protocols. In this updated protocol, we describe Phyre2, which uses advanced remote homology detection methods to build 3D models, predict ligand binding sites and analyze the effect of amino acid variants (e.g., nonsynonymous SNPs (nsSNPs)) for a user's protein sequence. Users are guided through results by a simple interface at a level of detail they determine. This protocol will guide users from submitting a protein sequence to interpreting the secondary and tertiary structure of their models, their domain composition and model quality. A range of additional available tools is described to find a protein structure in a genome, to submit large number of sequences at once and to automatically run weekly searches for proteins that are difficult to model. The server is available at http://www.sbg.bio.ic.ac.uk/phyre2 . A typical structure prediction will be returned between 30 min and 2 h after submission.

7,941 citations

Journal Article

InterProScan 5: genome-scale protein function classification

Philip Jones¹, David Binns¹, Hsin-Yu Chang¹, Matthew Fraser¹, Weizhong Li¹, Craig McAnulla¹, Hamish McWilliam¹, John Maslen¹, Alex L. Mitchell¹, Gift Nuka¹, Sebastien Pesseat¹, Antony F. Quinn¹, Amaia Sangrador-Vegas¹, Maxim Scheremetjew¹, Siew-Yit Yong¹, Rodrigo Lopez¹, Sarah Hunter¹

Wellcome Trust Sanger Institute¹

01 May 2014-Bioinformatics

TL;DR: A new Java-based architecture for the widely used protein function prediction software package InterProScan is described, resulting in a flexible and stable system that is able to use both multiprocessor machines and/or conventional clusters to achieve scalable distributed data analysis.

Abstract: Motivation: Robust, large-scale sequence analysis is a major challenge in modern genomic science, where biologists are frequently trying to characterise many millions of sequences. Here we describe a new Java-based architecture for the widely-used protein function prediction software package InterProScan. Developments include improvements and additions to the outputs of the software and the complete re-implementation of the software framework, resulting in a flexible and stable system that is able to utilise both multiprocessor machines and/or conventional clusters to achieve scalable distributed data analysis. InterProScan is freely available for download from the EMBL-EBI FTP site and the (open) source code is hosted at Google Code. Availability: InterProScan is distributed via FTP at ftp://ftp.ebi.ac.uk/pub/software/unix/iprscan/5/ and the source code is available from http://code.google.com/p/interproscan/. Contact: http://www.ebi.ac.uk/support or interhelp@ebi.ac.uk

5,434 citations

Journal Article

Heart Disease and Stroke Statistics-2018 Update: A Report From the American Heart Association.

Emelia J. Benjamin, Salim S. Virani, Clifton W. Callaway, Alanna M. Chamberlain, Alex R. Chang, Susan Cheng, Stephanie E. Chiuve, Mary Cushman, Francesca N. Delling, Rajat Deo, Sarah D. de Ferranti, Jane F. Ferguson, Myriam Fornage, Cathleen Gillespie, Carmen R. Isasi, Monik C. Jiménez, Lori C. Jordan, Suzanne E. Judd, Daniel T. Lackland, Judith H. Lichtman, Lynda D. Lisabeth, Simin Liu, Chris T. Longenecker, Pamela L. Lutsey, Jason Mackey, David B. Matchar, Kunihiro Matsushita, Michael E. Mussolino, Khurram Nasir, Martin O'Flaherty, Latha Palaniappan, Ambarish Pandey, Dilip K. Pandey, Mathew J. Reeves, Matthew D. Ritchey, Carlos J. Rodriguez, Gregory A. Roth, Wayne D. Rosamond, Uchechukwu K.A. Sampson, Gary Satou, Svati H. Shah, Nicole L. Spartano, David L. Tirschwell, Connie W. Tsao, Jenifer H. Voeks, Joshua Z. Willey, John T. Wilkins, Jason H Y Wu, Heather M. Alger, Sally S. Wong, Paul Muntner

20 Mar 2018-Circulation

TL;DR: The Statistical Update represents the most up-to-date statistics related to heart disease, stroke, and the cardiovascular risk factors listed in the AHA's My Life Check - Life’s Simple 7, which include core health behaviors and health factors that contribute to cardiovascular health.

Abstract: Each year, the American Heart Association (AHA), in conjunction with the Centers for Disease Control and Prevention, the National Institutes of Health, and other government agencies, brings together in a single document the most up-to-date statistics related to heart disease, stroke, and the cardiovascular risk factors listed in the AHA’s My Life Check - Life’s Simple 7 (Figure 1), which include core health behaviors (smoking, physical activity, diet, and weight) and health factors (cholesterol, blood pressure [BP], and glucose control) that contribute to cardiovascular health. The Statistical Update represents …

5,102 citations

Journal Article

The mutational constraint spectrum quantified from variation in 141,456 humans

Konrad J. Karczewski¹, Laurent C. Francioli¹, Grace Tiao¹, Beryl B. Cummings¹, Jessica Alföldi¹, Qingbo Wang¹, Ryan L. Collins¹, Kristen M. Laricchia¹, Andrea Ganna¹, Daniel P. Birnbaum¹, Laura D. Gauthier¹, Harrison Brand¹, Matthew Solomonson¹, Nicholas A. Watts¹, Daniel R. Rhodes², Moriel Singer-Berk¹, Eleina M. England¹, Eleanor G. Seaby¹, Jack A. Kosmicki¹, Raymond K. Walters¹, Katherine Tashman¹, Yossi Farjoun¹, Eric Banks¹, Timothy Poterba¹, Arcturus Wang¹, Cotton Seed¹, Nicola Whiffin¹, Jessica X. Chong³, Kaitlin E. Samocha⁴, Emma Pierce-Hoffman¹, Zachary Zappala¹, Anne H. O’Donnell-Luria¹, Eric Vallabh Minikel¹, Ben Weisburd¹, Monkol Lek⁵, James S. Ware¹, Christopher Vittal⁶, Irina M. Armean¹, Louis Bergelson¹, Kristian Cibulskis¹, Kristen M. Connolly¹, Miguel Covarrubias¹, Stacey Donnelly¹, Steven Ferriera¹, Stacey Gabriel¹, Jeff Gentry¹, Namrata Gupta¹, Thibault Jeandet¹, Diane Kaplan¹, Christopher Llanwarne¹, Ruchi Munshi¹, Sam Novod¹, Nikelle Petrillo¹, David Roazen¹, Valentin Ruano-Rubio¹, Andrea Saltzman¹, Molly Schleicher¹, Jose Soto¹, Kathleen Tibbetts¹, Charlotte Tolonen¹, Gordon Wade¹, Michael E. Talkowski¹, Benjamin M. Neale¹, Mark J. Daly¹, Daniel G. MacArthur¹

Broad Institute¹, Queen Mary University of London², University of Washington³, Wellcome Trust Sanger Institute⁴, Yale University⁵, Harvard University⁶

27 May 2020-Nature

TL;DR: A catalogue of predicted loss-of-function variants in 125,748 whole-exome and 15,708 whole-genome sequencing datasets from the Genome Aggregation Database (gnomAD) reveals the spectrum of mutational constraints that affect these human protein-coding genes.

Abstract: Genetic variants that inactivate protein-coding genes are a powerful source of information about the phenotypic consequences of gene disruption: genes that are crucial for the function of an organism will be depleted of such variants in natural populations, whereas non-essential genes will tolerate their accumulation. However, predicted loss-of-function variants are enriched for annotation errors, and tend to be found at extremely low frequencies, so their analysis requires careful variant annotation and very large sample sizes. Here we describe the aggregation of 125,748 exomes and 15,708 genomes from human sequencing studies into the Genome Aggregation Database (gnomAD). We identify 443,769 high-confidence predicted loss-of-function variants in this cohort after filtering for artefacts caused by sequencing and annotation errors. Using an improved model of human mutation rates, we classify human protein-coding genes along a spectrum that represents tolerance to inactivation, validate this classification using data from model organisms and engineered human cells, and show that it can be used to improve the power of gene discovery for both common and rare diseases. A catalogue of predicted loss-of-function variants in 125,748 whole-exome and 15,708 whole-genome sequencing datasets from the Genome Aggregation Database (gnomAD) reveals the spectrum of mutational constraints that affect these human protein-coding genes.

4,913 citations

Journal Article

The Pfam protein families database: towards a more sustainable future

Robert D. Finn¹, Penelope Coggill¹, Ruth Y. Eberhardt¹,², Sean R. Eddy³,⁴, Jaina Mistry¹, Alex L. Mitchell¹, Simon C. Potter¹, Marco Punta¹,⁵, Matloob Qureshi¹, Amaia Sangrador-Vegas¹, Gustavo A. Salazar¹, John Tate¹,², Alex Bateman¹

European Bioinformatics Institute¹, Wellcome Trust Sanger Institute², Harvard University³, Howard Hughes Medical Institute⁴, University of Paris⁵

04 Jan 2016-Nucleic Acids Research

TL;DR: Pfam is now primarily based on the UniProtKB reference proteomes, with the counts of matched sequences and species reported on the website restricted to this smaller set, and the facility to view the relationship between families within a clan has been improved by the introduction of a new tool.

Abstract: In the last two years the Pfam database (http://pfam.xfam.org) has undergone a substantial reorganisation to reduce the effort involved in making a release, thereby permitting more frequent releases. Arguably the most significant of these changes is that Pfam is now primarily based on the UniProtKB reference proteomes, with the counts of matched sequences and species reported on the website restricted to this smaller set. Building families on reference proteomes sequences brings greater stability, which decreases the amount of manual curation required to maintain them. It also reduces the number of sequences displayed on the website, whilst still providing access to many important model organisms. Matches to the full UniProtKB database are, however, still available and Pfam annotations for individual UniProtKB sequences can still be retrieved. Some Pfam entries (1.6%) which have no matches to reference proteomes remain; we are working with UniProt to see if sequences from them can be incorporated into reference proteomes. Pfam-B, the automatically-generated supplement to Pfam, has been removed. The current release (Pfam 29.0) includes 16 295 entries and 559 clans. The facility to view the relationship between families within a clan has been improved by the introduction of a new tool.

4,906 citations
