Home
/
Authors
/
Simon Jupp

Author

Simon Jupp

Other affiliations: University of Manchester, Wellcome Trust, Swiss Institute of Bioinformatics

Bio: Simon Jupp is an academic researcher from European Bioinformatics Institute. The author has contributed to research in topics: Ontology (information science) & Semantic Web. The author has an hindex of 22, co-authored 70 publications receiving 2081 citations. Previous affiliations of Simon Jupp include University of Manchester & Wellcome Trust.

Papers published on a yearly basis

2023
2022
2021
2020
2019
2018
2017
2016
2015
2014
2013
2012
2011
2010
2009
2008
2007

Papers

PDF

Open Access

More filters

Journal Article•DOI•

Expression Atlas update—an integrated database of gene and protein expression in humans, animals and plants

[...]

Robert Petryszak¹, Maria Keays¹, Y. Amy Tang¹, Nuno A. Fonseca¹, Elisabet Barrera¹, Tony Burdett¹, Anja Füllgrabe¹, Alfonso Muñoz-Pomer Fuentes¹, Simon Jupp¹, Satu Koskinen¹, Oliver Mannion¹, Laura Huerta¹, Karyn Megy¹, Catherine Snow¹, Eleanor Williams¹, Mitra Barzine¹, Emma Hastings¹, Hendrik Weisser², James C. Wright², Pankaj Jaiswal³, Wolfgang Huber¹, Jyoti S. Choudhary², Helen Parkinson¹, Alvis Brazma¹ - Show less +20 more•Institutions (3)

European Bioinformatics Institute¹, Wellcome Trust Sanger Institute², Oregon State University³

04 Jan 2016-Nucleic Acids Research

TL;DR: The first proteomics study in human tissues is now displayed alongside transcriptomics data in the same tissues, and novel analyses and visualisations include: ‘enrichment’ in each differential comparison of GO terms, Reactome, Plant Reactome pathways and InterPro domains.

...read moreread less

Abstract: Expression Atlas (http://www.ebi.ac.uk/gxa) provides information about gene and protein expression in animal and plant samples of different cell types, organism parts, developmental stages, diseases and other conditions. It consists of selected microarray and RNA-sequencing studies from ArrayExpress, which have been manually curated, annotated with ontology terms, checked for high quality and processed using standardised analysis methods. Since the last update, Atlas has grown seven-fold (1572 studies as of August 2015), and incorporates baseline expression profiles of tissues from Human Protein Atlas, GTEx and FANTOM5, and of cancer cell lines from ENCODE, CCLE and Genentech projects. Plant studies constitute a quarter of Atlas data. For genes of interest, the user can view baseline expression in tissues, and differential expression for biologically meaningful pairwise comparisons—estimated using consistent methodology across all of Atlas. Our first proteomics study in human tissues is now displayed alongside transcriptomics data in the same tissues. Novel analyses and visualisations include: ‘enrichment’ in each differential comparison of GO terms, Reactome, Plant Reactome pathways and InterPro domains; hierarchical clustering (by baseline expression) of most variable genes and experimental conditions; and, for a given gene-condition, distribution of baseline expression across biological replicates.

...read moreread less

509 citations

Journal Article•DOI•

Expression Atlas update: from tissues to single cells

[...]

Irene Papatheodorou¹, Pablo Moreno¹, Jonathan R. Manning¹, Alfonso Muñoz-Pomer Fuentes¹, Nancy George¹, Silvie Fexova¹, Nuno A. Fonseca¹, Anja Füllgrabe¹, Matthew Green¹, Ni Huang², Ni Huang¹, Laura Huerta¹, Haider Iqbal¹, Monica Jianu¹, Suhaib Mohammed¹, Lingyun Zhao¹, Andrew F. Jarnuczak¹, Simon Jupp¹, John C. Marioni¹, John C. Marioni³, John C. Marioni², Kerstin B. Meyer², Robert Petryszak¹, Cesar Augusto Prada Medina¹, Carlos Talavera-López², Sarah A. Teichmann², Juan Antonio Vizcaíno¹, Alvis Brazma¹ - Show less +24 more•Institutions (3)

European Bioinformatics Institute¹, Wellcome Trust Sanger Institute², University of Cambridge³

30 Oct 2019-Nucleic Acids Research

TL;DR: Expression Atlas is extended with a new added-value service to display gene expression in single cells with the increased availability of single cell RNA-Seq datasets in the public archives.

...read moreread less

Abstract: Expression Atlas is EMBL-EBI's resource for gene and protein expression. It sources and compiles data on the abundance and localisation of RNA and proteins in various biological systems and contexts and provides open access to this data for the research community. With the increased availability of single cell RNA-Seq datasets in the public archives, we have now extended Expression Atlas with a new added-value service to display gene expression in single cells. Single Cell Expression Atlas was launched in 2018 and currently includes 123 single cell RNA-Seq studies from 12 species. The website can be searched by genes within or across species to reveal experiments, tissues and cell types where this gene is expressed or under which conditions it is a marker gene. Within each study, cells can be visualized using a pre-calculated t-SNE plot and can be coloured by different features or by cell clusters based on gene expression. Within each experiment, there are links to downloadable files, such as RNA quantification matrices, clustering results, reports on protocols and associated metadata, such as assigned cell types.

...read moreread less

347 citations

Journal Article•DOI•

Expression Atlas update—a database of gene and transcript expression from microarray- and sequencing-based functional genomics experiments

[...]

Robert Petryszak¹, Tony Burdett¹, Benedetto Fiorelli¹, Nuno A. Fonseca¹, Mar Gonzàlez-Porta¹, Emma Hastings¹, Wolfgang Huber¹, Simon Jupp¹, Maria Keays¹, Nataliya Kryvych¹, Julie A. McMurry¹, John C. Marioni¹, James Malone¹, Karyn Megy¹, Gabriella Rustici¹, Y. Amy Tang¹, Jan Taubert¹, Eleanor Williams¹, Oliver Mannion¹, Helen Parkinson¹, Alvis Brazma¹ - Show less +17 more•Institutions (1)

European Bioinformatics Institute¹

01 Jan 2014-Nucleic Acids Research

TL;DR: The new version of Expression Atlas introduces the concept of ‘baseline’ expression, i.e. gene and splice variant abundance levels in healthy or untreated conditions, such as tissues or cell types, in order to maximize the biological value provided to the user.

...read moreread less

Abstract: Expression Atlas (http://www.ebi.ac.uk/gxa) is a value-added database providing information about gene, protein and splice variant expression in different cell types, organism parts, developmental stages, diseases and other biological and experimental conditions. The database consists of selected high-quality microarray and RNA-sequencing experiments from ArrayExpress that have been manually curated, annotated with Experimental Factor Ontology terms and processed using standardized microarray and RNA-sequencing analysis methods. The new version of Expression Atlas introduces the concept of 'baseline' expression, i.e. gene and splice variant abundance levels in healthy or untreated conditions, such as tissues or cell types. Differential gene expression data benefit from an in-depth curation of experimental intent, resulting in biologically meaningful 'contrasts', i.e. instances of differential pairwise comparisons between two sets of biological replicates. Other novel aspects of Expression Atlas are its strict quality control of raw experimental data, up-to-date RNA-sequencing analysis methods, expression data at the level of gene sets, as well as genes and a more powerful search interface designed to maximize the biological value provided to the user.

...read moreread less

316 citations

Journal Article•DOI•

The EBI RDF Platform: Linked Open Data for the Life Sciences

[...]

Simon Jupp¹, James Malone¹, Jerven Bolleman¹, Marco Brandizi¹, Mark Davies¹, Leyla Garcia¹, Anna Gaulton¹, Sebastien Gehant¹, Camille Laibe¹, Nicole Redaschi¹, Sarala M. Wimalaratne¹, Maria Jesus Martin¹, Nicolas Le Novère¹, Helen Parkinson¹, Ewan Birney¹, Andrew M. Jenkinson¹ - Show less +12 more•Institutions (1)

Swiss Institute of Bioinformatics¹

01 May 2014-Bioinformatics

TL;DR: The EBI RDF platform has been developed to meet an increasing demand to coordinate RDF activities across the institute and provides a new entry point to querying and exploring integrated resources available at the EBI.

...read moreread less

Abstract: Motivation: Resource description framework (RDF) is an emerging technology for describing, publishing and linking life science data. As a major provider of bioinformatics data and services, the European Bioinformatics Institute (EBI) is committed to making data readily accessible to the community in ways that meet existing demand. The EBI RDF platform has been developed to meet an increasing demand to coordinate RDF activities across the institute and provides a new entry point to querying and exploring integrated resources available at the EBI. Availability: http://www.ebi.ac.uk/rdf Contact: jupp@ebi.ac.uk

...read moreread less

225 citations

Journal Article•DOI•

Open Targets Genetics: systematic identification of trait-associated genes using large-scale genetics and functional genomics.

[...]

Maya Ghoussaini¹, Edward Mountjoy¹, Miguel Carmona², Gareth Peat², Ellen M. Schmidt¹, Andrew Hercules², Luca Fumis², Alfredo Miranda², Denise Carvalho-Silva², Annalisa Buniello², Tony Burdett², James D. Hayhurst², Jarrod Baker², Javier Ferrer², Asier Gonzalez-Uriarte², Simon Jupp², Mohd Anisul Karim¹, Gautier Koscielny³, Sandra Machlitt-Northen³, Cinzia Malangone², Zoë May Pendlington², Paola Roncaglia², Daniel Suveges², Daniel Wright¹, Olga Vrousgou², Eliseo Papa⁴, Helen Parkinson², Jacqueline A. L. MacArthur², John A. Todd⁵, Jeffrey C. Barrett¹, Jeremy Schwartzentruber¹, David G. Hulcoop³, David Ochoa², Ellen M. McDonagh², Ellen M. McDonagh¹, Ian Dunham¹, Ian Dunham² - Show less +33 more•Institutions (5)

Wellcome Trust Sanger Institute¹, European Bioinformatics Institute², GlaxoSmithKline³, Biogen Idec⁴, University of Oxford⁵

08 Jan 2021-Nucleic Acids Research

TL;DR: Open Targets Genetics offers tools that enable users to prioritise causal variants and genes at disease-associated loci and access systematic cross-disease and disease-molecular trait colocalization analysis across 92 cell types and tissues including the eQTL Catalogue.

...read moreread less

Abstract: Open Targets Genetics (https://genetics.opentargets.org) is an open-access integrative resource that aggregates human GWAS and functional genomics data including gene expression, protein abundance, chromatin interaction and conformation data from a wide range of cell types and tissues to make robust connections between GWAS-associated loci, variants and likely causal genes. This enables systematic identification and prioritisation of likely causal variants and genes across all published trait-associated loci. In this paper, we describe the public resources we aggregate, the technology and analyses we use, and the functionality that the portal offers. Open Targets Genetics can be searched by variant, gene or study/phenotype. It offers tools that enable users to prioritise causal variants and genes at disease-associated loci and access systematic cross-disease and disease-molecular trait colocalization analysis across 92 cell types and tissues including the eQTL Catalogue. Data visualizations such as Manhattan-like plots, regional plots, credible sets overlap between studies and PheWAS plots enable users to explore GWAS signals in depth. The integrated data is made available through the web portal, for bulk download and via a GraphQL API, and the software is open source. Applications of this integrated data include identification of novel targets for drug discovery and drug repurposing.

...read moreread less

218 citations

1
2
3
4
…
5
6
7
8
9
10
11
12
13
14
15

Collapse

Cited by

PDF

Open Access

More filters

Journal Article•DOI•

limma powers differential expression analyses for RNA-sequencing and microarray studies

[...]

Matthew E. Ritchie¹, Belinda Phipson², Di Wu³, Yifang Hu¹, Charity W. Law⁴, Wei Shi¹, Gordon K. Smyth⁵, Gordon K. Smyth¹ - Show less +4 more•Institutions (5)

Walter and Eliza Hall Institute of Medical Research¹, Royal Children's Hospital², Harvard University³, University of Zurich⁴, University of Melbourne⁵

20 Apr 2015-Nucleic Acids Research

TL;DR: The philosophy and design of the limma package is reviewed, summarizing both new and historical features, with an emphasis on recent enhancements and features that have not been previously described.

...read moreread less

Abstract: limma is an R/Bioconductor software package that provides an integrated solution for analysing data from gene expression experiments. It contains rich features for handling complex experimental designs and for information borrowing to overcome the problem of small sample sizes. Over the past decade, limma has been a popular choice for gene discovery through differential expression analyses of microarray and high-throughput PCR data. The package contains particularly strong facilities for reading, normalizing and exploring such data. Recently, the capabilities of limma have been significantly expanded in two important directions. First, the package can now perform both differential expression and differential splicing analyses of RNA sequencing (RNA-seq) data. All the downstream analysis tools previously restricted to microarray data are now available for RNA-seq as well. These capabilities allow users to analyse both RNA-seq and microarray data with very similar pipelines. Second, the package is now able to go past the traditional gene-wise expression analyses in a variety of ways, analysing expression profiles in terms of co-regulated sets of genes or in terms of higher-order expression signatures. This provides enhanced possibilities for biological interpretation of gene expression differences. This article reviews the philosophy and design of the limma package, summarizing both new and historical features, with an emphasis on recent enhancements and features that have not been previously described.

...read moreread less

22,147 citations

Journal Article•DOI•

GEPIA: a web server for cancer and normal gene expression profiling and interactive analyses.

[...]

Zefang Tang¹, Chenwei Li¹, Boxi Kang¹, Ge Gao¹, Cheng Li¹, Zemin Zhang - Show less +2 more•Institutions (1)

Peking University¹

03 Jul 2017-Nucleic Acids Research

TL;DR: GEPIA (Gene Expression Profiling Interactive Analysis) fills in the gap between cancer genomics big data and the delivery of integrated information to end users, thus helping unleash the value of the current data resources.

...read moreread less

Abstract: Tremendous amount of RNA sequencing data have been produced by large consortium projects such as TCGA and GTEx, creating new opportunities for data mining and deeper understanding of gene functions. While certain existing web servers are valuable and widely used, many expression analysis functions needed by experimental biologists are still not adequately addressed by these tools. We introduce GEPIA (Gene Expression Profiling Interactive Analysis), a web-based tool to deliver fast and customizable functionalities based on TCGA and GTEx data. GEPIA provides key interactive and customizable functions including differential expression analysis, profiling plotting, correlation analysis, patient survival analysis, similar gene detection and dimensionality reduction analysis. The comprehensive expression analyses with simple clicking through GEPIA greatly facilitate data mining in wide research areas, scientific discussion and the therapeutic discovery process. GEPIA fills in the gap between cancer genomics big data and the delivery of integrated information to end users, thus helping unleash the value of the current data resources. GEPIA is available at http://gepia.cancer-pku.cn/.

...read moreread less

5,980 citations

Journal Article•DOI•

The PRIDE database and related tools and resources in 2019: improving support for quantification data.

[...]

Yasset Perez-Riverol¹, Attila Csordas¹, Jingwen Bai¹, Manuel Bernal-Llinares¹, Suresh Hewapathirana¹, Deepti J. Kundu¹, Avinash Inuganti¹, Johannes Griss², Johannes Griss¹, Gerhard Mayer³, Martin Eisenacher³, Enrique Perez¹, Julian Uszkoreit³, Julianus Pfeuffer⁴, Timo Sachsenberg⁴, Şule Yılmaz⁵, Shivani Tiwary⁵, Juergen Cox⁵, Enrique Audain, Mathias Walzer¹, Andrew F. Jarnuczak¹, Tobias Ternent¹, Alvis Brazma¹, Juan Antonio Vizcaíno¹ - Show less +20 more•Institutions (5)

European Bioinformatics Institute¹, Medical University of Vienna², Ruhr University Bochum³, University of Tübingen⁴, Max Planck Society⁵

08 Jan 2019-Nucleic Acids Research

TL;DR: Key statistics on the current data contents and volume of downloads are outlined, and how PRIDE data are starting to be disseminated to added-value resources including Ensembl, UniProt and Expression Atlas are outlined.

...read moreread less

Abstract: The PRoteomics IDEntifications (PRIDE) database (https://www.ebi.ac.uk/pride/) is the world’s largest data repository of mass spectrometry-based proteomics data, and is one of the founding members of the global ProteomeXchange (PX) consortium. In this manuscript, we summarize the developments in PRIDE resources and related tools since the previous update manuscript was published in Nucleic Acids Research in 2016. In the last 3 years, public data sharing through PRIDE (as part of PX) has definitely become the norm in the field. In parallel, data re-use of public proteomics data has increased enormously, with multiple applications. We first describe the new architecture of PRIDE Archive, the archival component of PRIDE. PRIDE Archive and the related data submission framework have been further developed to support the increase in submitted data volumes and additional data types. A new scalable and fault tolerant storage backend, Application Programming Interface and web interface have been implemented, as a part of an ongoing process. Additionally, we emphasize the improved support for quantitative proteomics data through the mzTab format. At last, we outline key statistics on the current data contents and volume of downloads, and how PRIDE data are starting to be disseminated to added-value resources including Ensembl, UniProt and Expression Atlas.

...read moreread less

5,735 citations

Journal Article•DOI•

The Ensembl Variant Effect Predictor.

[...]

William M. McLaren¹, Laurent Gil¹, Sarah E. Hunt¹, Harpreet Singh Riat¹, Graham R. S. Ritchie¹, Anja Thormann¹, Paul Flicek¹, Fiona Cunningham¹ - Show less +4 more•Institutions (1)

European Bioinformatics Institute¹

06 Jun 2016-Genome Biology

TL;DR: The Ensembl Variant Effect Predictor can simplify and accelerate variant interpretation in a wide range of study designs.

...read moreread less

Abstract: The Ensembl Variant Effect Predictor is a powerful toolset for the analysis, annotation, and prioritization of genomic variants in coding and non-coding regions. It provides access to an extensive collection of genomic annotation, with a variety of interfaces to suit different requirements, and simple options for configuring and extending analysis. It is open source, free to use, and supports full reproducibility of results. The Ensembl Variant Effect Predictor can simplify and accelerate variant interpretation in a wide range of study designs.

...read moreread less

4,658 citations

Journal Article•DOI•

The EMBL-EBI search and sequence analysis tools APIs in 2019

[...]

Fábio Madeira¹, Youngmi Park¹, Joon Lee¹, Nicola Buso¹, Tamer Gur¹, Nandana Madhusoodanan¹, Prasad Basutkar¹, Adrian R N Tivey¹, Simon C. Potter¹, Robert D. Finn¹, Rodrigo Lopez¹ - Show less +7 more•Institutions (1)

European Bioinformatics Institute¹

02 Jul 2019-Nucleic Acids Research

TL;DR: The latest improvements made to the frameworks which enhance the interconnectivity between public EMBL-EBI resources and ultimately enhance biological data discoverability, accessibility, interoperability and reusability are described.

...read moreread less

Abstract: The EMBL-EBI provides free access to popular bioinformatics sequence analysis applications as well as to a full-featured text search engine with powerful cross-referencing and data retrieval capabilities. Access to these services is provided via user-friendly web interfaces and via established RESTful and SOAP Web Services APIs (https://www.ebi.ac.uk/seqdb/confluence/display/JDSAT/EMBL-EBI+Web+Services+APIs+-+Data+Retrieval). Both systems have been developed with the same core principles that allow them to integrate an ever-increasing volume of biological data, making them an integral part of many popular data resources provided at the EMBL-EBI. Here, we describe the latest improvements made to the frameworks which enhance the interconnectivity between public EMBL-EBI resources and ultimately enhance biological data discoverability, accessibility, interoperability and reusability.

...read moreread less

3,529 citations

1
2
3
4
…
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
61
62
63
64
65
66
67
68
69
70
71
72
73
74
75
76
77
78
79
80
81
82
83
84
85
86
87
88
89
90
91
92
93
94
95
96
97
98
99
100
101
102
103
104
105
106
107
108
109
110
111
112
113
114
115
116
117
118
119
120
121
122
123
124
125
126
127
128
129
130
131
132
133
134
135
136
137
138
139
140
141
142
143
144
145
146
147
148
149
150
151
152
153
154
155
156
157
158
159
160
161
162
163
164
165
166
167
168
169
170
171
172
173
174
175
176
177
178
179
180
181
182
183
184
185
186
187
188
189
190
191
192
193
194
195
196
197
198
199
200

Collapse