scispace - formally typeset
Search or ask a question
Author

Manuel Tardaguila

Bio: Manuel Tardaguila is an academic researcher from Wellcome Trust Sanger Institute. The author has contributed to research in topics: Genome-wide association study & Allele. The author has an hindex of 12, co-authored 22 publications receiving 562 citations. Previous affiliations of Manuel Tardaguila include Swiss Institute of Bioinformatics & University of Florida.

Papers
More filters
Journal ArticleDOI
Dragana Vuckovic1, Erik L. Bao2, Parsa Akbari1, Caleb A. Lareau2, Abdou Mousas3, Tao Jiang1, Ming-Huei Chen, Laura M. Raffield4, Manuel Tardaguila5, Jennifer E. Huffman6, Scott C. Ritchie1, Karyn Megy1, Hannes Ponstingl5, Christopher J. Penkett1, Patrick K. Albers5, Emilie M. Wigdor5, Saori Sakaue7, Arden Moscati8, Regina Manansala9, Ken Sin Lo3, Huijun Qian4, Masato Akiyama10, Traci M. Bartz11, Yoav Ben-Shlomo12, Andrew D Beswick12, Jette Bork-Jensen13, Erwin P. Bottinger8, Jennifer A. Brody11, Frank J. A. van Rooij14, Kumaraswamy Naidu Chitrala15, Peter W.F. Wilson16, Hélène Choquet17, John Danesh, Emanuele Di Angelantonio, Niki Dimou18, Jingzhong Ding19, Paul Elliott20, Tõnu Esko21, Michele K. Evans15, Stephan B. Felix22, James S. Floyd11, Linda Broer14, Niels Grarup13, Michael H. Guo23, Qi Guo24, Andreas Greinacher22, Jeffrey Haessler25, Torben Hansen13, J. M. M. Howson1, Wei Huang26, Eric Jorgenson17, Tim Kacprowski27, Mika Kähönen28, Yoichiro Kamatani29, Masahiro Kanai2, Savita Karthikeyan24, Fotios Koskeridis30, Leslie A. Lange31, Terho Lehtimäki, Allan Linneberg13, Yongmei Liu32, Leo-Pekka Lyytikäinen, Ani Manichaikul33, Koichi Matsuda29, Karen L. Mohlke4, Nina Mononen, Yoshinori Murakami29, Girish N. Nadkarni8, Kjell Nikus28, Nathan Pankratz34, Oluf Pedersen13, Michael Preuss8, Bruce M. Psaty11, Olli T. Raitakari35, Stephen S. Rich33, Benjamin Rodriguez, Jonathan D. Rosen4, Jerome I. Rotter36, Petra Schubert6, Cassandra N. Spracklen4, Praveen Surendran5, Hua Tang37, Jean-Claude Tardif3, Mohsen Ghanbari38, Uwe Völker22, Henry Völzke22, Nicholas A. Watkins39, Stefan Weiss22, VA Million Veteran Program5, Na Cai5, Kousik Kundu5, Stephen B. Watt5, Klaudia Walter5, Alan B. Zonderman15, Kelly Cho40, Yun Li4, Ruth J. F. Loos8, Julian C. Knight41, Michel Georges42, Oliver Stegle43, Evangelos Evangelou20, Yukinori Okada7, David J. Roberts44, Michael Inouye, Andrew D. Johnson, Paul L. Auer9, William J. Astle1, Alexander P. Reiner11, Adam S. Butterworth, Willem H. Ouwehand1, Guillaume Lettre3, Vijay G. Sankaran21, Vijay G. Sankaran2, Nicole Soranzo 
03 Sep 2020-Cell
TL;DR: The results show the power of large-scale blood cell trait GWAS to interrogate clinically meaningful variants across a wide allelic spectrum of human variation.

284 citations

Journal ArticleDOI
TL;DR: SQANTI allows the user to maximize the analytical outcome of long-read technologies by providing the tools to deliver quality-evaluated and curated full-length transcriptomes and shows that these new transcripts have a major impact in the correct quantification of transcript levels by state-of-the-art short-read-based quantification algorithms.
Abstract: High-throughput sequencing of full-length transcripts using long reads has paved the way for the discovery of thousands of novel transcripts, even in well-annotated mammalian species. The advances in sequencing technology have created a need for studies and tools that can characterize these novel variants. Here, we present SQANTI, an automated pipeline for the classification of long-read transcripts that can assess the quality of data and the preprocessing pipeline using 47 unique descriptors. We apply SQANTI to a neuronal mouse transcriptome using Pacific Biosciences (PacBio) long reads and illustrate how the tool is effective in characterizing and describing the composition of the full-length transcriptome. We perform extensive evaluation of ToFU PacBio transcripts by PCR to reveal that an important number of the novel transcripts are technical artifacts of the sequencing approach and that SQANTI quality descriptors can be used to engineer a filtering strategy to remove them. Most novel transcripts in this curated transcriptome are novel combinations of existing splice sites, resulting more frequently in novel ORFs than novel UTRs, and are enriched in both general metabolic and neural-specific functions. We show that these new transcripts have a major impact in the correct quantification of transcript levels by state-of-the-art short-read-based quantification algorithms. By comparing our iso-transcriptome with public proteomics databases, we find that alternative isoforms are elusive to proteogenomics detection. SQANTI allows the user to maximize the analytical outcome of long-read technologies by providing the tools to deliver quality-evaluated and curated full-length transcriptomes.

271 citations

Posted ContentDOI
Dragana Vuckovic1, Dragana Vuckovic2, Erik L. Bao3, Erik L. Bao4, Parsa Akbari, Caleb A. Lareau4, Caleb A. Lareau3, Abdou Mousas5, Tao Jiang6, Tao Jiang2, Ming-Huei Chen, Laura M. Raffield7, Manuel Tardaguila1, Jennifer E. Huffman8, Scott C. Ritchie, Karyn Megy9, Karyn Megy2, Karyn Megy10, Hannes Ponstingl1, Christopher J. Penkett10, Christopher J. Penkett2, Patrick K. Albers1, Emilie M. Wigdor1, Saori Sakaue11, Arden Moscati12, Regina Manansala13, Ken Sin Lo5, Huijun Qian7, Masato Akiyama14, Traci M. Bartz15, Yoav Ben-Shlomo16, Andrew D Beswick16, Jette Bork-Jensen17, Erwin P. Bottinger18, Erwin P. Bottinger12, Jennifer A. Brody15, Frank J. A. van Rooij19, Kumaraswamy Naidu Chitrala20, Kelly Cho21, Kelly Cho8, Kelly Cho22, Hélène Choquet23, Adolfo Correa24, John Danesh, Emanuele Di Angelantonio, Niki Dimou25, Niki Dimou26, Jingzhong Ding27, Paul Elliott, Tõnu Esko4, Michele K. Evans20, Stephan B. Felix28, James S. Floyd15, Linda Broer19, Niels Grarup17, Michael H. Guo4, Michael H. Guo29, Andreas Greinacher28, Jeffrey Haessler30, Torben Hansen17, Joanna M. M. Howson6, Joanna M. M. Howson2, Wei Huang31, Eric Jorgenson23, Tim Kacprowski32, Tim Kacprowski28, Mika Kähönen33, Yoichiro Kamatani34, Masahiro Kanai21, Savita Karthikeyan6, Fotis Koskeridis26, Leslie A. Lange35, Terho Lehtimäki, Allan Linneberg17, Allan Linneberg36, Yongmei Liu37, Leo-Pekka Lyytikäinen, Ani Manichaikul38, Koichi Matsuda34, Karen L. Mohlke7, Nina Mononen, Yoshinori Murakami34, Girish N. Nadkarni12, Kjell Nikus33, Nathan Pankratz39, Oluf Pedersen17, Michael Preuss12, Bruce M. Psaty, Olli T. Raitakari40, Olli T. Raitakari41, Stephen S. Rich38, Benjamin Rodriguez, Jonathan D. Rosen7, Jerome I. Rotter42, Petra Schubert8, Cassandra N. Spracklen7, Praveen Surendran, Hua Tang43, Jean-Claude Tardif5, Jean-Claude Tardif44, Mohsen Ghanbari19, Uwe Völker28, Henry Völzke28, Nicholas A. Watkins9, Stefan Weiss28, VA Million Veteran Program1, Na Cai1, Kousik Kundu1, Stephen B. Watt1, Klaudia Walter1, Alan B. Zonderman20, Peter W.F. Wilson45, Yun Li7, Ruth J. F. Loos12, Julian C. Knight46, Michel Georges47, Oliver Stegle48, Evangelos Evangelou26, Evangelos Evangelou49, Yukinori Okada11, David J. Roberts50, David J. Roberts51, Michael Inouye, Andrew D. Johnson, Paul L. Auer13, William J. Astle2, William J. Astle10, Alexander P. Reiner15, Adam S. Butterworth, Willem H. Ouwehand, Guillaume Lettre5, Guillaume Lettre44, Vijay G. Sankaran4, Vijay G. Sankaran3, Nicole Soranzo10, Nicole Soranzo1 
Wellcome Trust Sanger Institute1, National Institute for Health Research2, Boston Children's Hospital3, Broad Institute4, Montreal Heart Institute5, British Heart Foundation6, University of North Carolina at Chapel Hill7, VA Boston Healthcare System8, NHS Blood and Transplant9, University of Cambridge10, Osaka University11, Icahn School of Medicine at Mount Sinai12, University of Wisconsin–Milwaukee13, Kyushu University14, University of Washington15, University of Bristol16, University of Copenhagen17, Hasso Plattner Institute18, Erasmus University Medical Center19, National Institutes of Health20, Harvard University21, Brigham and Women's Hospital22, Kaiser Permanente23, University of Mississippi Medical Center24, International Agency for Research on Cancer25, University of Ioannina26, Wake Forest University27, Greifswald University Hospital28, University of Pennsylvania29, Fred Hutchinson Cancer Research Center30, Chinese National Human Genome Center31, Technische Universität München32, University of Tampere33, University of Tokyo34, University of Colorado Denver35, Frederiksberg Hospital36, Duke University37, University of Virginia38, University of Minnesota39, University of Turku40, Turku University Hospital41, Los Angeles Biomedical Research Institute42, Stanford University43, Université de Montréal44, Veterans Health Administration45, University of Oxford46, University of Liège47, European Bioinformatics Institute48, Imperial College London49, John Radcliffe Hospital50, Churchill Hospital51
03 Feb 2020-medRxiv
TL;DR: These results show the power of large-scale blood cell GWAS to interrogate clinically meaningful variants across the full allelic spectrum of human variation.
Abstract: Blood cells play essential roles in human health, underpinning physiological processes such as immunity, oxygen transport, and clotting, which when perturbed cause a significant health burden. Here we integrate data from UK Biobank and a large-scale international collaborative effort, including 563,946 European ancestry participants, and discover 5,106 new genetic variants independently associated with 29 blood cell phenotypes covering the full allele frequency spectrum of variation impacting hematopoiesis. We holistically characterize the genetic architecture of hematopoiesis, assess the relevance of the omnigenic model to blood cell phenotypes, delineate relevant hematopoietic cell states influenced by regulatory genetic variants and gene networks, identify novel splice-altering variants mediating the associations, and assess the polygenic prediction potential for blood cell traits and clinical disorders at the interface of complex and Mendelian genetics. These results show the power of large-scale blood cell GWAS to interrogate clinically meaningful variants across the full allelic spectrum of human variation.

162 citations

Journal ArticleDOI
TL;DR: The findings support the conclusion that CX3CL1 acts as a positive modifier of breast cancer in concert with ErbB receptors, and this effect was important insofar as mammary tumorigenesis was delayed and tumor multiplicity was reduced by genetic deletion in HER2/neu mice, but not in polyoma middle T-antigen oncomice.
Abstract: Chemokines are relevant molecules in shaping the tumor microenvironment, although their contributions to tumorigenesis are not fully understood. We studied the influence of the chemokine CX3CL1/fractalkine in de novo breast cancer formation using HER2/neu transgenic mice. CX3CL1 expression was downmodulated in HER2/neu tumors, yet, paradoxically, adenovirus-mediated CX3CL1 expression in the tumor milieu enhanced mammary tumor numbers in a dose-dependent manner. Increased tumor multiplicity was not a consequence of CX3CL1-induced metastatic dissemination of the primary tumor, although CX3CL1 induced epithelial-to-mesenchymal transition in breast cancer cells in vitro. Instead, CX3CL1 triggered cell proliferation by induction of ErbB receptors through the proteolytic shedding of an ErbB ligand. This effect was important insofar as mammary tumorigenesis was delayed and tumor multiplicity was reduced by genetic deletion of CX3CL1 in HER2/neu mice, but not in polyoma middle T-antigen oncomice. Our findings support the conclusion that CX3CL1 acts as a positive modifier of breast cancer in concert with ErbB receptors.

67 citations

Journal ArticleDOI
TL;DR: Corrigendum: SQANTI: extensive characterization of long-read transcript sequences for quality control in full-length transcriptome identification and quantification is presented.
Abstract: Corrigendum: SQANTI: extensive characterization of long-read transcript sequences for quality control in full-length transcriptome identification and quantification Manuel Tardaguila, Lorena de la Fuente, CristinaMarti, Cécile Pereira, Francisco Jose Pardo-Palacios, Hector del Risco, Marc Ferrell, Maravillas Mellado, Marissa Macchietto, Kenneth Verheggen, Mariola Edelmann, Iakes Ezkurdia, Jesus Vazquez, Michael Tress, Ali Mortazavi, Lennart Martens, Susana Rodriguez-Navarro, Victoria Moreno-Manzano, and Ana Conesa

65 citations


Cited by
More filters
Journal ArticleDOI
TL;DR: Key statistics on the current data contents and volume of downloads are outlined, and how PRIDE data are starting to be disseminated to added-value resources including Ensembl, UniProt and Expression Atlas are outlined.
Abstract: The PRoteomics IDEntifications (PRIDE) database (https://www.ebi.ac.uk/pride/) is the world’s largest data repository of mass spectrometry-based proteomics data, and is one of the founding members of the global ProteomeXchange (PX) consortium. In this manuscript, we summarize the developments in PRIDE resources and related tools since the previous update manuscript was published in Nucleic Acids Research in 2016. In the last 3 years, public data sharing through PRIDE (as part of PX) has definitely become the norm in the field. In parallel, data re-use of public proteomics data has increased enormously, with multiple applications. We first describe the new architecture of PRIDE Archive, the archival component of PRIDE. PRIDE Archive and the related data submission framework have been further developed to support the increase in submitted data volumes and additional data types. A new scalable and fault tolerant storage backend, Application Programming Interface and web interface have been implemented, as a part of an ongoing process. Additionally, we emphasize the improved support for quantitative proteomics data through the mzTab format. At last, we outline key statistics on the current data contents and volume of downloads, and how PRIDE data are starting to be disseminated to added-value resources including Ensembl, UniProt and Expression Atlas.

5,735 citations

Journal ArticleDOI
TL;DR: The current landscape of available tools is reviewed, the principles of error correction, base modification detection, and long-read transcriptomics analysis are focused on, and the challenges that remain are highlighted.
Abstract: Long-read technologies are overcoming early limitations in accuracy and throughput, broadening their application domains in genomics. Dedicated analysis tools that take into account the characteristics of long-read data are thus required, but the fast pace of development of such tools can be overwhelming. To assist in the design and analysis of long-read sequencing projects, we review the current landscape of available tools and present an online interactive database, long-read-tools.org, to facilitate their browsing. We further focus on the principles of error correction, base modification detection, and long-read transcriptomics analysis and highlight the challenges that remain.

1,172 citations

Journal ArticleDOI
TL;DR: StringTie2 is a reference-guided transcriptome assembler that works with both short and long reads and offers the ability to work with full-length super-reads assembled from short reads, which further improves the quality of short-read assemblies.
Abstract: RNA sequencing using the latest single-molecule sequencing instruments produces reads that are thousands of nucleotides long. The ability to assemble these long reads can greatly improve the sensitivity of long-read analyses. Here we present StringTie2, a reference-guided transcriptome assembler that works with both short and long reads. StringTie2 includes new methods to handle the high error rate of long reads and offers the ability to work with full-length super-reads assembled from short reads, which further improves the quality of short-read assemblies. StringTie2 is more accurate and faster and uses less memory than all comparable short-read and long-read analysis tools.

635 citations

Posted ContentDOI
08 Jul 2019-bioRxiv
TL;DR: StringTie2 is a reference-guided transcriptome assembler that works with both short and long reads and includes new computational methods to handle the high error rate of long-read sequencing technology, which previous assemblers could not tolerate.
Abstract: RNA sequencing using the latest single-molecule sequencing instruments produces reads that are thousands of nucleotides long. The ability to assemble these long reads can greatly improve the sensitivity of long-read analyses. Here we present StringTie2, a reference-guided transcriptome assembler that works with both short and long reads. StringTie2 includes new computational methods to handle the high error rate of long-read sequencing technology, which previous assemblers could not tolerate. It also offers the ability to work with full-length super-reads assembled from short reads, which further improves the quality of assemblies. On 33 short-read datasets from humans and two plant species, StringTie2 is 47.3% more precise and 3.9% more sensitive than Scallop. On multiple long read datasets, StringTie2 on average correctly assembles 8.3 and 2.6 times as many transcripts as FLAIR and Traphlor, respectively, with substantially higher precision. StringTie2 is also faster and has a smaller memory footprint than all comparable tools.

390 citations

Journal ArticleDOI
TL;DR: This Review discusses bioinformatics tools that have been devised to handle the numerous characteristic features of these long-range data types, with applications in genome assembly, genetic variant detection, haplotype phasing, transcriptomics and epigenomics.
Abstract: Several new genomics technologies have become available that offer long-read sequencing or long-range mapping with higher throughput and higher resolution analysis than ever before. These long-range technologies are rapidly advancing the field with improved reference genomes, more comprehensive variant identification and more complete views of transcriptomes and epigenomes. However, they also require new bioinformatics approaches to take full advantage of their unique characteristics while overcoming their complex errors and modalities. Here, we discuss several of the most important applications of the new technologies, focusing on both the currently available bioinformatics tools and opportunities for future research.

381 citations