Home
/
Authors
/
Carol Chen

Author

Carol Chen

Bio: Carol Chen is an academic researcher from University of British Columbia. The author has contributed to research in topics: Chromatin & Data curation. The author has an hindex of 9, co-authored 13 publications receiving 4225 citations.

Papers

PDF

Open Access

More filters

Journal Article•DOI•

The MIntAct project--IntAct as a common curation platform for 11 molecular interaction databases.

[...]

Sandra Orchard, Mais G. Ammari¹, Bruno Aranda, Lionel Breuza², Leonardo Briganti³, Fiona Broackes-Carter⁴, Nancy H. Campbell⁵, Gayatri Chavali, Carol Chen⁶, Noemi del-Toro, Margaret Duesbury, Marine Dumousseau, Eugenia Galeota³, Ursula Hinz², Marta Iannuccelli³, Sruthi Jagannathan⁷, Rafael C. Jimenez, Jyoti Khadake, Astrid Lagreid⁸, Luana Licata³, Ruth C. Lovering⁵, Birgit H M Meldal, Anna N. Melidoni⁵, Mila Milagros, Daniele Peluso, Livia Perfetto³, Pablo Porras, Arathi Raghunath, Sylvie Ricard-Blum⁹, Bernd Roechert², Andre Stutz², Michael Tognolli², Kim Van Roey, Gianni Cesareni, Henning Hermjakob - Show less +31 more•Institutions (9)

University of Arizona¹, University of Geneva², University of Rome Tor Vergata³, University of Toronto⁴, University College London⁵, University of British Columbia⁶, National University of Singapore⁷, Norwegian University of Science and Technology⁸, Claude Bernard University Lyon 1⁹

01 Jan 2014-Nucleic Acids Research

TL;DR: All data manually curated by the MINT curators have been moved into the IntAct database at EMBL-EBI and are merged with the existing IntAct dataset.

...read moreread less

Abstract: IntAct (freely available at http://www.ebi.ac.uk/intact) is an open-source, open data molecular interaction database populated by data either curated from the literature or from direct data depositions. IntAct has developed a sophisticated web-based curation tool, capable of supporting both IMEx- and MIMIx-level curation. This tool is now utilized by multiple additional curation teams, all of whom annotate data directly into the IntAct database. Members of the IntAct team supply appropriate levels of training, perform quality control on entries and take responsibility for long-term data maintenance. Recently, the MINT and IntAct databases decided to merge their separate efforts to make optimal use of limited developer resources and maximize the curation output. All data manually curated by the MINT curators have been moved into the IntAct database at EMBL-EBI and are merged with the existing IntAct dataset. Both IntAct and MINT are active contributors to the IMEx consortium (http://www.imexconsortium.org).

...read moreread less

1,602 citations

Journal Article•DOI•

The IntAct molecular interaction database in 2012

[...]

Samuel Kerrien¹, Bruno Aranda², Lionel Breuza², Alan Bridge², Fiona Broackes-Carter², Carol Chen², Margaret Duesbury², Marine Dumousseau², M Feuermann², Ursula Hinz², Christine Jandrasits², Rafael C. Jimenez², Jyoti Khadake², Usha Mahadevan², Patrick Masson², Ivo Pedruzzi², Eric Pfeiffenberger², Pablo Porras², Arathi Raghunath², Bernd Roechert², Sandra Orchard², Henning Hermjakob² - Show less +18 more•Institutions (2)

European Bioinformatics Institute¹, University of British Columbia²

01 Jan 2012-Nucleic Acids Research

TL;DR: Two levels of curation are now available within the IntAct database, with both IMEx-level annotation and less detailed MIMIx-compatible entries currently supported.

...read moreread less

Abstract: IntAct is an open-source, open data molecular interaction database populated by data either curated from the literature or from direct data depositions. Two levels of curation are now available within the database, with both IMEx-level annotation and less detailed MIMIx-compatible entries currently supported. As from September 2011, IntAct contains approximately 275,000 curated binary interaction evidences from over 5000 publications. The IntAct website has been improved to enhance the search process and in particular the graphical display of the results. New data download formats are also available, which will facilitate the inclusion of IntAct's data in the Semantic Web. IntAct is an active contributor to the IMEx consortium (http://www.imexconsortium.org). IntAct source code and data are freely available at http://www.ebi.ac.uk/intact.

...read moreread less

1,345 citations

Journal Article•DOI•

InnateDB: systems biology of innate immunity and beyond—recent updates and continuing curation

[...]

Karin Breuer¹, Amir Foroushani², Matthew R. Laird², Carol Chen², Anastasia Sribnaia², Raymond Lo², Geoffrey L. Winsor², Robert E. W. Hancock², Fiona S. L. Brinkman², David J. Lynn² - Show less +6 more•Institutions (2)

Simon Fraser University¹, University of British Columbia²

01 Jan 2013-Nucleic Acids Research

TL;DR: The recent integration of bovine data makes InnateDB the first integrated network analysis platform for this agriculturally important model organism, and a range of improvements to the integrated bioinformatics solutions are reported.

...read moreread less

Abstract: InnateDB (http://www.innatedb.com) is an integrated analysis platform that has been specifically designed to facilitate systems-level analyses of mammalian innate immunity networks, pathways and genes. In this article, we provide details of recent updates and improvements to the database. InnateDB now contains >196 000 human, mouse and bovine experimentally validated molecular interactions and 3000 pathway annotations of relevance to all mammalian cellular systems (i.e. not just immune relevant pathways and interactions). In addition, the InnateDB team has, to date, manually curated in excess of 18 000 molecular interactions of relevance to innate immunity, providing unprecedented insight into innate immunity networks, pathways and their component molecules. More recently, InnateDB has also initiated the curation of allergy- and asthma-related interactions. Furthermore, we report a range of improvements to our integrated bioinformatics solutions including web service access to InnateDB interaction data using Proteomics Standards Initiative Common Query Interface, enhanced Gene Ontology analysis for innate immunity, and the availability of new network visualizations tools. Finally, the recent integration of bovine data makes InnateDB the first integrated network analysis platform for this agriculturally important model organism.

...read moreread less

958 citations

Journal Article•DOI•

Protein interaction data curation: the International Molecular Exchange (IMEx) consortium

[...]

Sandra Orchard, Samuel Kerrien, Sara Abbani¹, Bruno Aranda, Jignesh Bhate, Shelby L. Bidwell², Alan Bridge³, Leonardo Briganti⁴, Fiona S. L. Brinkman⁵, Gianni Cesareni⁴, Andrew Chatr-aryamontri⁴, Andrew Chatr-aryamontri⁶, Emilie Chautard⁷, Emilie Chautard⁸, Carol Chen⁹, Marine Dumousseau, Johannes B. Goll, Robert E. W. Hancock⁹, Linda Hannick, Igor Jurisica¹⁰, Jyoti Khadake, David J. Lynn, Usha Mahadevan, Livia Perfetto⁴, Arathi Raghunath, Sylvie Ricard-Blum¹¹, Bernd Roechert⁴, Lukasz Salwinski¹, Volker Stümpflen, Mike Tyers¹², Mike Tyers⁶, Peter Uetz¹³, Ioannis Xenarios¹⁴, Ioannis Xenarios³, Henning Hermjakob - Show less +31 more•Institutions (14)

University of California, Los Angeles¹, J. Craig Venter Institute², Swiss Institute of Bioinformatics³, University of Rome Tor Vergata⁴, Simon Fraser University⁵, University of Edinburgh⁶, Ontario Institute for Cancer Research⁷, Centre national de la recherche scientifique⁸, University of British Columbia⁹, University of Toronto¹⁰, Claude Bernard University Lyon 1¹¹, Mount Sinai Hospital, Toronto¹², Virginia Commonwealth University¹³, University of Lausanne¹⁴

01 Apr 2012-Nature Methods

TL;DR: The International Molecular Exchange consortium is an international collaboration between major public interaction data providers to share literature-curation efforts and make a nonredundant set of protein interactions available in a single search interface on a common website.

...read moreread less

Abstract: The International Molecular Exchange (IMEx) consortium is an international collaboration between major public interaction data providers to share literature-curation efforts and make a nonredundant set of protein interactions available in a single search interface on a common website (http://www.imexconsortium.org/). Common curation rules have been developed, and a central registry is used to manage the selection of articles to enter into the dataset. We discuss the advantages of such a service to the user, our quality-control measures and our data-distribution practices.

...read moreread less

490 citations

Journal Article•DOI•

An ultra-low-input native ChIP-seq protocol for genome-wide profiling of rare cell populations

[...]

Julie Brind’Amour¹, Sheng Liu¹, Matthew Hudson¹, Carol Chen¹, Mohammad M. Karimi¹, Matthew C. Lorincz¹ - Show less +2 more•Institutions (1)

University of British Columbia¹

21 Jan 2015-Nature Communications

TL;DR: This work demonstrates the utility of an ultra-low-input micrococcal nuclease-based native ChIP (ULI-NChIP) and sequencing method to generate genome-wide histone mark profiles with high resolution from as few as 10(3) cells and identifies sexually dimorphic H3K27me3 enrichment at specific genic promoters.

...read moreread less

Abstract: Combined chromatin immunoprecipitation and next-generation sequencing (ChIP-seq) has enabled genome-wide epigenetic profiling of numerous cell lines and tissue types. A major limitation of ChIP-seq, however, is the large number of cells required to generate high-quality data sets, precluding the study of rare cell populations. Here, we present an ultra-low-input micrococcal nuclease-based native ChIP (ULI-NChIP) and sequencing method to generate genome-wide histone mark profiles with high resolution from as few as 10(3) cells. We demonstrate that ULI-NChIP-seq generates high-quality maps of covalent histone marks from 10(3) to 10(6) embryonic stem cells. Subsequently, we show that ULI-NChIP-seq H3K27me3 profiles generated from E13.5 primordial germ cells isolated from single male and female embryos show high similarity to recent data sets generated using 50-180 × more material. Finally, we identify sexually dimorphic H3K27me3 enrichment at specific genic promoters, thereby illustrating the utility of this method for generating high-quality and -complexity libraries from rare cell populations.

...read moreread less

322 citations

Cited by

PDF

Open Access

More filters

Journal Article•DOI•

STRING v11: protein-protein association networks with increased coverage, supporting functional discovery in genome-wide experimental datasets.

[...]

Damian Szklarczyk¹, Annika L. Gable¹, David Lyon¹, Alexander Junge², Stefan Wyder¹, Jaime Huerta-Cepas³, Milan Simonovic¹, Nadezhda Tsankova Doncheva², John H. Morris⁴, Peer Bork, Lars Juhl Jensen², Christian von Mering¹ - Show less +8 more•Institutions (4)

Swiss Institute of Bioinformatics¹, University of Copenhagen², Technical University of Madrid³, University of California, San Francisco⁴

08 Jan 2019-Nucleic Acids Research

TL;DR: The latest version of STRING more than doubles the number of organisms it covers, and offers an option to upload entire, genome-wide datasets as input, allowing users to visualize subsets as interaction networks and to perform gene-set enrichment analysis on the entire input.

...read moreread less

Abstract: Proteins and their functional interactions form the backbone of the cellular machinery. Their connectivity network needs to be considered for the full understanding of biological phenomena, but the available information on protein-protein associations is incomplete and exhibits varying levels of annotation granularity and reliability. The STRING database aims to collect, score and integrate all publicly available sources of protein-protein interaction information, and to complement these with computational predictions. Its goal is to achieve a comprehensive and objective global network, including direct (physical) as well as indirect (functional) interactions. The latest version of STRING (11.0) more than doubles the number of organisms it covers, to 5090. The most important new feature is an option to upload entire, genome-wide datasets as input, allowing users to visualize subsets as interaction networks and to perform gene-set enrichment analysis on the entire input. For the enrichment analysis, STRING implements well-known classification systems such as Gene Ontology and KEGG, but also offers additional, new classification systems based on high-throughput text-mining as well as on a hierarchical clustering of the association network itself. The STRING resource is available online at https://string-db.org/.

...read moreread less

10,584 citations

Journal Article•DOI•

STRING v10: protein–protein interaction networks, integrated over the tree of life

[...]

Damian Szklarczyk¹, Andrea Franceschini¹, Stefan Wyder¹, Kristoffer Forslund, Davide Heller¹, Jaime Huerta-Cepas, Milan Simonovic¹, Alexander Roth¹, Alberto Santos², Kalliopi Tsafou², Michael Kuhn³, Peer Bork, Lars Juhl Jensen², Christian von Mering¹ - Show less +10 more•Institutions (3)

Swiss Institute of Bioinformatics¹, University of Copenhagen², Dresden University of Technology³

28 Jan 2015-Nucleic Acids Research

TL;DR: H hierarchical and self-consistent orthology annotations are introduced for all interacting proteins, grouping the proteins into families at various levels of phylogenetic resolution in the STRING database.

...read moreread less

Abstract: The many functional partnerships and interactions that occur between proteins are at the core of cellular processing and their systematic characterization helps to provide context in molecular systems biology. However, known and predicted interactions are scattered over multiple resources, and the available data exhibit notable differences in terms of quality and completeness. The STRING database (http://string-db.org) aims to provide a critical assessment and integration of protein-protein interactions, including direct (physical) as well as indirect (functional) associations. The new version 10.0 of STRING covers more than 2000 organisms, which has necessitated novel, scalable algorithms for transferring interaction information between organisms. For this purpose, we have introduced hierarchical and self-consistent orthology annotations for all interacting proteins, grouping the proteins into families at various levels of phylogenetic resolution. Further improvements in version 10.0 include a completely redesigned prediction pipeline for inferring protein-protein associations from co-expression data, an API interface for the R computing environment and improved statistical analysis for enrichment tests in user-provided networks.

...read moreread less

8,224 citations

Journal Article•DOI•

The STRING database in 2017: quality-controlled protein-protein association networks, made broadly accessible.

[...]

Damian Szklarczyk¹, John H. Morris², Helen Cook³, Michael Kuhn, Stefan Wyder¹, Milan Simonovic¹, Alberto Santos³, Nadezhda Tsankova Doncheva³, Alexander Roth¹, Peer Bork, Lars Juhl Jensen³, Christian von Mering¹ - Show less +8 more•Institutions (3)

Swiss Institute of Bioinformatics¹, University of California, San Francisco², University of Copenhagen³

04 Jan 2017-Nucleic Acids Research

TL;DR: In the latest version 10.5 of STRING, the biggest changes are concerned with data dissemination: the web frontend has been completely redesigned to reduce dependency on outdated browser technologies, and the database can now also be queried from inside the popular Cytoscape software framework.

...read moreread less

Abstract: A system-wide understanding of cellular function requires knowledge of all functional interactions between the expressed proteins. The STRING database aims to collect and integrate this information, by consolidating known and predicted protein-protein association data for a large number of organisms. The associations in STRING include direct (physical) interactions, as well as indirect (functional) interactions, as long as both are specific and biologically meaningful. Apart from collecting and reassessing available experimental data on protein-protein interactions, and importing known pathways and protein complexes from curated databases, interaction predictions are derived from the following sources: (i) systematic co-expression analysis, (ii) detection of shared selective signals across genomes, (iii) automated text-mining of the scientific literature and (iv) computational transfer of interaction knowledge between organisms based on gene orthology. In the latest version 10.5 of STRING, the biggest changes are concerned with data dissemination: the web frontend has been completely redesigned to reduce dependency on outdated browser technologies, and the database can now also be queried from inside the popular Cytoscape software framework. Further improvements include automated background analysis of user inputs for functional enrichments, and streamlined download options. The STRING resource is available online, at http://string-db.org/.

...read moreread less

5,569 citations

Journal Article•DOI•

UniProt: A worldwide hub of protein knowledge

[...]

Alex Bateman

01 Jan 2019-Nucleic Acids Research

5,284 citations

Journal Article•DOI•

STRING v9.1: protein-protein interaction networks, with increased coverage and integration

[...]

Andrea Franceschini¹, Damian Szklarczyk¹, Sune Frankild¹, Michael Kuhn¹, Milan Simonovic¹, Alexander Roth¹, Jianyi Lin¹, Pablo Minguez¹, Peer Bork¹, Christian von Mering¹, Lars Juhl Jensen¹ - Show less +7 more•Institutions (1)

Swiss Institute of Bioinformatics¹

29 Nov 2012-Nucleic Acids Research

TL;DR: The update to version 9.1 of STRING is described, introducing several improvements, including extending the automated mining of scientific texts for interaction information, to now also include full-text articles, and providing users with statistical information on any functional enrichment observed in their networks.

...read moreread less

Abstract: Complete knowledge of all direct and indirect interactions between proteins in a given cell would represent an important milestone towards a comprehensive description of cellular mechanisms and functions. Although this goal is still elusive, considerable progress has been made-particularly for certain model organisms and functional systems. Currently, protein interactions and associations are annotated at various levels of detail in online resources, ranging from raw data repositories to highly formalized pathway databases. For many applications, a global view of all the available interaction data is desirable, including lower-quality data and/or computational predictions. The STRING database (http://string-db.org/) aims to provide such a global perspective for as many organisms as feasible. Known and predicted associations are scored and integrated, resulting in comprehensive protein networks covering >1100 organisms. Here, we describe the update to version 9.1 of STRING, introducing several improvements: (i) we extend the automated mining of scientific texts for interaction information, to now also include full-text articles; (ii) we entirely re-designed the algorithm for transferring interactions from one model organism to the other; and (iii) we provide users with statistical information on any functional enrichment observed in their networks.

...read moreread less

3,900 citations

1
2
3
4
…
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
61
62
63
64
65
66
67
68
69
70
71
72
73
74
75
76
77
78
79
80
81
82
83
84
85
86
87
88
89
90
91
92
93
94
95
96
97
98
99
100
101
102
103
104
105
106
107
108
109
110
111
112
113
114
115
116
117
118
119
120
121
122
123
124
125
126
127
128
129
130
131
132
133
134
135
136
137
138
139
140
141
142
143
144
145
146
147
148
149
150
151
152
153
154
155
156
157
158
159
160
161
162
163
164
165
166
167
168
169
170
171
172
173
174
175
176
177
178
179
180
181
182
183
184
185
186
187
188
189
190
191
192
193
194
195
196
197
198
199
200

Collapse