Author

Olga Melnichuk

Bio: Olga Melnichuk is an academic researcher from the European Bioinformatics Institute. The author has contributed to research in topics: Data management & Data access. The author has an h-index of 4, co-authored 5 publications receiving 854 citations. Previous affiliations of Olga Melnichuk include Harvard University.
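
The h-index quoted in the bio can be checked against the citation counts listed for the five papers on this page (676, 166, 85, 6 and 1). The following is a minimal Python sketch, assuming the standard definition of the h-index (the largest h such that h papers each have at least h citations); the function name `h_index` is a hypothetical helper, not something provided by this page.

```python
def h_index(citations):
    """Return the largest h such that h papers have at least h citations each."""
    # Rank papers from most cited to least cited.
    ranked = sorted(citations, reverse=True)
    h = 0
    for rank, count in enumerate(ranked, start=1):
        if count >= rank:
            h = rank  # the top `rank` papers all have at least `rank` citations
        else:
            break
    return h


# Citation counts as listed for the five papers on this page.
listed_counts = [676, 166, 85, 6, 1]
print(h_index(listed_counts))  # prints 4
```

The fifth paper has one citation, which is below its rank of 5, so the index stops at 4, in line with the figure given in the bio.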

Papers

Journal Article•DOI•

ArrayExpress update—simplifying data submissions

Nikolay Kolesnikov¹, Emma Hastings¹, Maria Keays¹, Olga Melnichuk¹, Y. Amy Tang¹, Eleanor Williams¹, Miroslaw Dylag¹, Natalja Kurbatova¹, Marco Brandizi¹, Tony Burdett¹, Karyn Megy¹, Ekaterina Pilicheva¹, Gabriella Rustici¹, Andrew Tikhonov¹, Helen Parkinson¹, Robert Petryszak¹, Ugis Sarkans¹, Alvis Brazma¹

European Bioinformatics Institute¹

28 Jan 2015-Nucleic Acids Research

TL;DR: The main development over the last two years has been the release of a new data submission tool Annotare, which has reduced the average submission time almost 3-fold and will become the only submission route into ArrayExpress, alongside MAGE-TAB format-based pipelines in the near future.

Abstract: The ArrayExpress Archive of Functional Genomics Data (http://www.ebi.ac.uk/arrayexpress) is an international functional genomics database at the European Bioinformatics Institute (EMBL-EBI) recommended by most journals as a repository for data supporting peer-reviewed publications. It contains data from over 7000 public sequencing and 42 000 array-based studies comprising over 1.5 million assays in total. The proportion of sequencing-based submissions has grown significantly over the last few years and has doubled in the last 18 months, whilst the rate of microarray submissions is growing slightly. All data in ArrayExpress are available in the MAGE-TAB format, which allows robust linking to data analysis and visualization tools and standardized analysis. The main development over the last two years has been the release of a new data submission tool Annotare, which has reduced the average submission time almost 3-fold. In the near future, Annotare will become the only submission route into ArrayExpress, alongside MAGE-TAB format-based pipelines. ArrayExpress is a stable and highly accessed resource. Our future tasks include automation of data flows and further integration with other EMBL-EBI resources for the representation of multi-omics data.

676 citations

Journal Article•DOI•

Gene Expression Atlas update—a value-added database of microarray and sequencing-based functional genomics experiments

Misha Kapushesky¹, Tomasz Adamusiak¹, Tony Burdett¹, Aedín C. Culhane¹, Anna Farne¹, Alexey Filippov¹, Ele Holloway¹, Andrey Klebanov¹, Nataliya Kryvych¹, Natalja Kurbatova¹, Pavel Kurnosov¹, James Malone¹, Olga Melnichuk¹, Robert Petryszak¹, Nikolay Pultsin¹, Gabriella Rustici¹, Andrew Tikhonov¹, Ravensara S. Travillian¹, Eleanor Williams¹, Andrey Zorin¹, Helen Parkinson¹, Alvis Brazma¹

Harvard University¹

01 Jan 2012-Nucleic Acids Research

TL;DR: The Gene Expression Atlas is an added-value database providing information about gene expression in different cell types, organism parts, developmental stages, disease states, sample treatments and other biological/experimental conditions.

Abstract: Gene Expression Atlas (http://www.ebi.ac.uk/gxa) is an added-value database providing information about gene expression in different cell types, organism parts, developmental stages, disease states, sample treatments and other biological/experimental conditions. The content of this database derives from curation, re-annotation and statistical analysis of selected data from the ArrayExpress Archive and the European Nucleotide Archive. A simple interface allows the user to query for differential gene expression either by gene names or attributes or by biological conditions, e.g. diseases, organism parts or cell types. Since our previous report we made 20 monthly releases and, as of Release 11.08 (August 2011), the database supports 19 species, which contains expression data measured for 19,014 biological conditions in 136,551 assays from 5598 independent studies.

166 citations

Journal Article•DOI•

The BioStudies database-one stop shop for all data supporting a life sciences study.

Ugis Sarkans¹, Mikhail Gostev¹, Awais Athar¹, Ehsan Behrangi¹, Olga Melnichuk¹, Ahmed Ali¹, Jasmine Minguet¹, Juan Camillo Rada¹, Catherine Snow¹, Andrew Tikhonov¹, Alvis Brazma¹, Johanna McEntyre¹

European Bioinformatics Institute¹

04 Jan 2018-Nucleic Acids Research

TL;DR: BioStudies offers a simple way to describe the study structure, and provides flexible data deposition tools and data access interfaces, and is a resource for authors and publishers for packaging data during the manuscript preparation process.

Abstract: BioStudies (www.ebi.ac.uk/biostudies) is a new public database that organizes data from biological studies. Typically, but not exclusively, a study is associated with a publication. BioStudies offers a simple way to describe the study structure, and provides flexible data deposition tools and data access interfaces. The actual data can be stored either in BioStudies or remotely, or both. BioStudies imports supplementary data from Europe PMC, and is a resource for authors and publishers for packaging data during the manuscript preparation process. It also can support data management needs of collaborative projects. The growth in multiomics experiments and other multi-faceted approaches to life sciences research mean that studies result in a diversity of data outputs in multiple locations. BioStudies presents a solution to ensuring that all these data and the associated publication(s) can be found coherently in the longer term.

85 citations

Journal Article•DOI•

Orchestrating differential data access for translational research: a pilot implementation.

Marco Brandizi¹, Olga Melnichuk¹, Raffael Bild², Florian Kohlmayer², Benedicto Rodriguez-Castro², Helmut Spengler², Klaus A. Kuhn², Wolfgang Kuchinke³, Christian Ohmann, Timo Mustonen⁴, Mikael Linden⁴, Tommi Nyrönen⁴, Ilkka Lappalainen⁴, Alvis Brazma¹, Ugis Sarkans¹

European Bioinformatics Institute¹, Technische Universität München², University of Düsseldorf³, CSC – IT Center for Science⁴

23 Mar 2017-BMC Medical Informatics and Decision Making

TL;DR: A pilot system that uses several common open source software components in a novel combination to coordinate access to heterogeneous biomedical data repositories containing open data as well as sensitive data in the domain of biobanking and biosample research is presented.

Abstract: Translational researchers need robust IT solutions to access a range of data types, varying from public data sets to pseudonymised patient information with restricted access, provided on a case by case basis. The reason for this complication is that managing access policies to sensitive human data must consider issues of data confidentiality, identifiability, extent of consent, and data usage agreements. All these ethical, social and legal aspects must be incorporated into a differential management of restricted access to sensitive data. In this paper we present a pilot system that uses several common open source software components in a novel combination to coordinate access to heterogeneous biomedical data repositories containing open data (open access) as well as sensitive data (restricted access) in the domain of biobanking and biosample research. Our approach is based on a digital identity federation and software to manage resource access entitlements. Open source software components were assembled and configured in such a way that they allow for different ways of restricted access according to the protection needs of the data. We have tested the resulting pilot infrastructure and assessed its performance, feasibility and reproducibility. Common open source software components are sufficient to allow for the creation of a secure system for differential access to sensitive data. The implementation of this system is exemplary for researchers facing similar requirements for restricted access data. Here we report experience and lessons learnt of our pilot implementation, which may be useful for similar use cases. Furthermore, we discuss possible extensions for more complex scenarios.

6 citations

DOI•

BioMedBridges: Implementation of a pilot for the security framework

Marco Brandizi, Timo Mustonen, Tommi Nyrönen, Mikael Linden, Christian Ohmann, Wolfgang Kuchinke, Olga Melnichuk, Klaus A. Kuhn, Raffael Bild, Florian Kohlmayer, Helmut Spengler, Ugis Sarkans, Benedicto Rodriguez-Castro

11 Feb 2016

1 citation

Cited by

Journal Article•DOI•

The PRIDE database and related tools and resources in 2019: improving support for quantification data.

Yasset Perez-Riverol¹, Attila Csordas¹, Jingwen Bai¹, Manuel Bernal-Llinares¹, Suresh Hewapathirana¹, Deepti J. Kundu¹, Avinash Inuganti¹, Johannes Griss¹,², Gerhard Mayer³, Martin Eisenacher³, Enrique Perez¹, Julian Uszkoreit³, Julianus Pfeuffer⁴, Timo Sachsenberg⁴, Şule Yılmaz⁵, Shivani Tiwary⁵, Juergen Cox⁵, Enrique Audain, Mathias Walzer¹, Andrew F. Jarnuczak¹, Tobias Ternent¹, Alvis Brazma¹, Juan Antonio Vizcaíno¹

European Bioinformatics Institute¹, Medical University of Vienna², Ruhr University Bochum³, University of Tübingen⁴, Max Planck Society⁵

08 Jan 2019-Nucleic Acids Research

TL;DR: Key statistics on the current data contents and volume of downloads are outlined, along with how PRIDE data are starting to be disseminated to added-value resources including Ensembl, UniProt and Expression Atlas.

Abstract: The PRoteomics IDEntifications (PRIDE) database (https://www.ebi.ac.uk/pride/) is the world’s largest data repository of mass spectrometry-based proteomics data, and is one of the founding members of the global ProteomeXchange (PX) consortium. In this manuscript, we summarize the developments in PRIDE resources and related tools since the previous update manuscript was published in Nucleic Acids Research in 2016. In the last 3 years, public data sharing through PRIDE (as part of PX) has definitely become the norm in the field. In parallel, data re-use of public proteomics data has increased enormously, with multiple applications. We first describe the new architecture of PRIDE Archive, the archival component of PRIDE. PRIDE Archive and the related data submission framework have been further developed to support the increase in submitted data volumes and additional data types. A new scalable and fault tolerant storage backend, Application Programming Interface and web interface have been implemented, as a part of an ongoing process. Additionally, we emphasize the improved support for quantitative proteomics data through the mzTab format. At last, we outline key statistics on the current data contents and volume of downloads, and how PRIDE data are starting to be disseminated to added-value resources including Ensembl, UniProt and Expression Atlas.

5,735 citations

Journal Article•DOI•

The Reactome Pathway Knowledgebase.

Antonio Fabregat¹, Konstantinos Sidiropoulos¹, Phani V. Garapati¹, Marc Gillespie²,³, Kerstin Hausmann¹, Robin Haw², Bijay Jassal², S Jupe¹, Florian Korninger¹, Sheldon J. McKay², Lisa Matthews⁴, Bruce May², Marija Milacic², Karen Rothfels², Veronica Shamovsky⁴, Marissa Webber², Joel Weiser², Mark Williams¹, Guanming Wu², Lincoln Stein²,⁵,⁶, Henning Hermjakob¹,⁷, Peter D'Eustachio⁴

European Bioinformatics Institute¹, Ontario Institute for Cancer Research², St. John's University³, New York University⁴, Cold Spring Harbor Laboratory⁵, University of Toronto⁶, Protein Sciences⁷

01 Jan 2014-Nucleic Acids Research

TL;DR: The Reactome Knowledgebase provides molecular details of signal transduction, transport, DNA replication, metabolism and other cellular processes as an ordered network of molecular transformations—an extended version of a classic metabolic map, in a single consistent data model.

Abstract: The Reactome Knowledgebase (www.reactome.org) provides molecular details of signal transduction, transport, DNA replication, metabolism and other cellular processes as an ordered network of molecular transformations-an extended version of a classic metabolic map, in a single consistent data model. Reactome functions both as an archive of biological processes and as a tool for discovering unexpected functional relationships in data such as gene expression pattern surveys or somatic mutation catalogues from tumour cells. Over the last two years we redeveloped major components of the Reactome web interface to improve usability, responsiveness and data visualization. A new pathway diagram viewer provides a faster, clearer interface and smooth zooming from the entire reaction network to the details of individual reactions. Tool performance for analysis of user datasets has been substantially improved, now generating detailed results for genome-wide expression datasets within seconds. The analysis module can now be accessed through a RESTFul interface, facilitating its inclusion in third party applications. A new overview module allows the visualization of analysis results on a genome-wide Reactome pathway hierarchy using a single screen page. The search interface now provides auto-completion as well as a faceted search to narrow result lists efficiently.

5,065 citations

Journal Article•DOI•

Analysis Tool Web Services from the EMBL-EBI

Hamish McWilliam¹, Weizhong Li¹, Mahmut Uludag¹, Silvano Squizzato¹, Youngmi Park¹, Nicola Buso¹, Andrew Peter Cowley¹, Rodrigo Lopez¹

European Bioinformatics Institute¹

01 Jul 2013-Nucleic Acids Research

TL;DR: Since 2004 the European Bioinformatics Institute (EMBL-EBI) has provided access to a wide range of databases and analysis tools via Web Services interfaces, which allow their integration into other tools, applications, web sites, pipeline processes and analytical workflows.

Abstract: Since 2004 the European Bioinformatics Institute (EMBL-EBI) has provided access to a wide range of databases and analysis tools via Web Services interfaces. This comprises services to search across the databases available from the EMBL-EBI and to explore the network of cross-references present in the data (e.g. EB-eye), services to retrieve entry data in various data formats and to access the data in specific fields (e.g. dbfetch), and analysis tool services, for example, sequence similarity search (e.g. FASTA and NCBI BLAST), multiple sequence alignment (e.g. Clustal Omega and MUSCLE), pairwise sequence alignment and protein functional analysis (e.g. InterProScan and Phobius). The REST/SOAP Web Services (http://www.ebi.ac.uk/Tools/webservices/) interfaces to these databases and tools allow their integration into other tools, applications, web sites, pipeline processes and analytical workflows. To get users started using the Web Services, sample clients are provided covering a range of programming languages and popular Web Service tool kits, and a brief guide to Web Services technologies, including a set of tutorials, is available for those wishing to learn more and develop their own clients. Users of the Web Services are informed of improvements and updates via a range of methods.

1,562 citations

Journal Article•DOI•

g:Profiler—a web server for functional interpretation of gene lists (2016 update)

Jüri Reimand¹, Tambet Arak², Priit Adler², Liis Kolberg², Sulev Reisberg², Hedi Peterson², Jaak Vilo²

Ontario Institute for Cancer Research¹, University of Tartu²

08 Jul 2016-Nucleic Acids Research

TL;DR: The 2016 update of g:Profiler introduces several novel features, including transcription factor binding site predictions, Mendelian disease annotations, information about protein expression and complexes and gene mappings of human genetic polymorphisms.

Abstract: Functional enrichment analysis is a key step in interpreting gene lists discovered in diverse high-throughput experiments. g:Profiler studies flat and ranked gene lists and finds statistically significant Gene Ontology terms, pathways and other gene function related terms. Translation of hundreds of gene identifiers is another core feature of g:Profiler. Since its first publication in 2007, our web server has become a popular tool of choice among basic and translational researchers. Timeliness is a major advantage of g:Profiler as genome and pathway information is synchronized with the Ensembl database in quarterly updates. g:Profiler supports 213 species including mammals and other vertebrates, plants, insects and fungi. The 2016 update of g:Profiler introduces several novel features. We have added further functional datasets to interpret gene lists, including transcription factor binding site predictions, Mendelian disease annotations, information about protein expression and complexes and gene mappings of human genetic polymorphisms. Besides the interactive web interface, g:Profiler can be accessed in computational pipelines using our R package, Python interface and BioJS component. g:Profiler is freely available at http://biit.cs.ut.ee/gprofiler/.

1,122 citations

Journal Article•DOI•

The Human Urine Metabolome

Souhaila Bouatra¹, Farid Aziat¹, Rupasri Mandal¹, An Chi Guo¹, Michael Wilson¹, Craig Knox¹, Trent C. Bjorndahl¹, Ramanarayan Krishnamurthy¹, Fozia Saleem¹, Philip B. Liu¹, Zerihun T. Dame¹, Jenna Poelzer¹, Jessica Huynh¹, Faizath S. Yallou¹, Nick Psychogios², Edison Dong¹, Ralf Bogumil³, Cornelia Roehring³, David S. Wishart¹,⁴

University of Alberta¹, Harvard University², Biocrates Life Sciences AG³, National Institute for Nanotechnology⁴

04 Sep 2013-PLOS ONE

TL;DR: A comprehensive, quantitative, metabolome-wide characterization of human urine is undertaken, identifying several previously unknown urine metabolites and substantially enhancing the level of metabolome coverage.

Abstract: Urine has long been a “favored” biofluid among metabolomics researchers. It is sterile, easy-to-obtain in large volumes, largely free from interfering proteins or lipids and chemically complex. However, this chemical complexity has also made urine a particularly difficult substrate to fully understand. As a biological waste material, urine typically contains metabolic breakdown products from a wide range of foods, drinks, drugs, environmental contaminants, endogenous waste metabolites and bacterial by-products. Many of these compounds are poorly characterized and poorly understood. In an effort to improve our understanding of this biofluid we have undertaken a comprehensive, quantitative, metabolome-wide characterization of human urine. This involved both computer-aided literature mining and comprehensive, quantitative experimental assessment/validation. The experimental portion employed NMR spectroscopy, gas chromatography mass spectrometry (GC-MS), direct flow injection mass spectrometry (DFI/LC-MS/MS), inductively coupled plasma mass spectrometry (ICP-MS) and high performance liquid chromatography (HPLC) experiments performed on multiple human urine samples. This multi-platform metabolomic analysis allowed us to identify 445 and quantify 378 unique urine metabolites or metabolite species. The different analytical platforms were able to identify (quantify) a total of: 209 (209) by NMR, 179 (85) by GC-MS, 127 (127) by DFI/LC-MS/MS, 40 (40) by ICP-MS and 10 (10) by HPLC. Our use of multiple metabolomics platforms and technologies allowed us to identify several previously unknown urine metabolites and to substantially enhance the level of metabolome coverage. It also allowed us to critically assess the relative strengths and weaknesses of different platforms or technologies. The literature review led to the identification and annotation of another 2206 urinary compounds and was used to help guide the subsequent experimental studies. 
An online database containing the complete set of 2651 confirmed human urine metabolite species, their structures (3079 in total), concentrations, related literature references and links to their known disease associations is freely available at http://www.urinemetabolome.ca.

1,118 citations
