Ensembl comparative genomics resources
Javier Herrero,Javier Herrero,Matthieu Muffato,Kathryn Beal,Stephen Fitzgerald,Leo Gordon,Miguel Pignatelli,Albert J. Vilella,Stephen M. J. Searle,M. Ridwan Amode,Simon Brent,William Spooner,Eugene Kulesha,Eugene Kulesha,Andrew D. Yates,Paul Flicek +15 more
TLDR
The consistency of the annotation across species and the focus on vertebrates makes Ensembl an ideal system to perform and support vertebrate comparative genomic analyses and it is used to produce reference comparative data and make it freely available.Abstract:
Evolution provides the unifying framework with which to understand biology. The coherent investigation of genic and genomic data often requires comparative genomics analyses based on whole-genome alignments, sets of homologous genes and other relevant datasets in order to evaluate and answer evolutionary-related questions. However, the complexity and computational requirements of producing such data are substantial: this has led to only a small number of reference resources that are used for most comparative analyses. The Ensembl comparative genomics resources are one such reference set that facilitates comprehensive and reproducible analysis of chordate genome data. Ensembl computes pairwise and multiple whole-genome alignments from which large-scale synteny, per-base conservation scores and constrained elements are obtained. Gene alignments are used to define Ensembl Protein Families, GeneTrees and homologies for both protein-coding and non-coding RNA genes. These resources are updated frequently and have a consistent informatics infrastructure and data presentation across all supported species. Specialized web-based visualizations are also available including synteny displays, collapsible gene tree plots, a gene family locator and different alignment views. The Ensembl comparative genomics infrastructure is extensively reused for the analysis of non-vertebrate species by other projects including Ensembl Genomes and Gramene and much of the information here is relevant to these projects. The consistency of the annotation across species and the focus on vertebrates makes Ensembl an ideal system to perform and support vertebrate comparative genomic analyses. We use robust software and pipelines to produce reference comparative data and make it freely available. Database URL: http://www.ensembl.org.read more
Citations
More filters
Journal ArticleDOI
OrthoFinder: phylogenetic orthology inference for comparative genomics
David M. Emms,Steven L. Kelly +1 more
TL;DR: This extends OrthoFinder’s high accuracy orthogroup inference to provide phylogenetic inference of orthologs, rooted gene trees, gene duplication events, the rooted species tree, and comparative genomics statistics.
Journal ArticleDOI
CADD: predicting the deleteriousness of variants throughout the human genome.
Philipp Rentzsch,Daniela Witten,Gregory M. Cooper,Jay Shendure,Martin Kircher,Martin Kircher +5 more
TL;DR: The latest updates to CADD are reviewed, including the most recent version, 1.4, which supports the human genome build GRCh38, and also present updates to the website that include simplified variant lookup, extended documentation, an Application Program Interface and improved mechanisms for integrating CADD scores into other tools or applications.
Journal ArticleDOI
The Human Transcription Factors.
Samuel A. Lambert,Arttu Jolma,Laura F. Campitelli,Pratyush Kumar Das,Yimeng Yin,Mihai Albu,Xiaoting Chen,Jussi Taipale,Jussi Taipale,Jussi Taipale,Timothy P. Hughes,Matthew T. Weirauch,Matthew T. Weirauch +12 more
TL;DR: This review considers how TFs are identified and functionally characterized, principally through the lens of a catalog of over 1,600 likely human TFs and binding motifs for two-thirds of them, highlighting the importance of continued effort to understand TF-mediated gene regulation.
Posted ContentDOI
OrthoFinder: phylogenetic orthology inference for comparative genomics
David M. Emms,Steven L. Kelly +1 more
TL;DR: This extends OrthoFinder’s high accuracy orthogroup inference to provide phylogenetic inference of orthologs, rooted genes trees, gene duplication events, the rooted species tree, and comparative genomic statistics.
Journal Article
A High-Resolution Map of Human Evolutionary Constraint Using 29 Mammals
Stefan Washietl,Pouya Kheradpour,Jason Ernst,Lucas D. Ward,Irwin Jungreis,Matthew D. Rasmussen,Manolis Kellis +6 more
TL;DR: The comparison of related genomes has emerged as a powerful lens for genome interpretation as mentioned in this paper, which reveals a small number of new coding exons, candidate stop codon readthrough events and over 10,000 regions of overlapping synonymous constraint within protein-coding exons.
References
More filters
Journal ArticleDOI
Basic Local Alignment Search Tool
TL;DR: A new approach to rapid sequence comparison, basic local alignment search tool (BLAST), directly approximates alignments that optimize a measure of local similarity, the maximal segment pair (MSP) score.
Journal ArticleDOI
Initial sequencing and analysis of the human genome.
Eric S. Lander,Lauren Linton,Bruce W. Birren,Chad Nusbaum,Michael C. Zody,Jennifer Baldwin,Keri Devon,Ken Dewar,Michael Doyle,William Fitzhugh,Roel Funke,Diane Gage,Katrina Harris,Andrew Heaford,John Howland,Lisa Kann,Jessica A. Lehoczky,Rosie Levine,Paul A. McEwan,Kevin McKernan,James Meldrim,Jill P. Mesirov,Cher Miranda,William Morris,Jerome Naylor,Christina Raymond,Mark Rosetti,Ralph Santos,Andrew Sheridan,Carrie Sougnez,Nicole Stange-Thomann,Nikola Stojanovic,Aravind Subramanian,Dudley Wyman,Jane Rogers,John Sulston,R Ainscough,Stephan Beck,David Bentley,John Burton,C M Clee,Nigel P. Carter,Alan Coulson,Rebecca Deadman,Panos Deloukas,Andrew Dunham,Ian Dunham,Richard Durbin,Lisa French,Darren Grafham,Simon G. Gregory,Tim Hubbard,Sean Humphray,Adrienne Hunt,Matthew Jones,Christine Lloyd,Amanda McMurray,Lucy Matthews,Simon Mercer,Sarah Milne,James C. Mullikin,Andrew J. Mungall,Robert W. Plumb,Mark T. Ross,Ratna Shownkeen,Sarah Sims,Robert H. Waterston,Richard K. Wilson,LaDeana W. Hillier,John Douglas Mcpherson,Marco A. Marra,Elaine R. Mardis,Lucinda Fulton,Asif T. Chinwalla,Kymberlie H. Pepin,Warren Gish,Stephanie L. Chissoe,Michael C. Wendl,Kim D. Delehaunty,Tracie L. Miner,Andrew Delehaunty,Jason B. Kramer,Lisa Cook,Robert S. Fulton,Douglas L. Johnson,Patrick Minx,Sandra W. Clifton,Trevor Hawkins,Elbert Branscomb,Paul Predki,Paul G. Richardson,Sarah Wenning,Tom Slezak,Norman A. Doggett,Jan Fang Cheng,Anne S. Olsen,Susan Lucas,Christopher J. Elkin,Edward Uberbacher,Marvin Frazier,Richard A. Gibbs,Donna M. Muzny,Steven E. Scherer,John Bouck,Erica Sodergren,Kim C. Worley,Catherine M. Rives,James H. Gorrell,Michael L. Metzker,Susan L. Naylor,Raju Kucherlapati,David L. Nelson,George M. Weinstock,Yoshiyuki Sakaki,Asao Fujiyama,Masahira Hattori,Tetsushi Yada,Atsushi Toyoda,Takehiko Itoh,Chiharu Kawagoe,Hidemi Watanabe,Yasushi Totoki,Todd D. Taylor,Jean Weissenbach,Roland Heilig,William Saurin,François Artiguenave,Philippe Brottier,Thomas Brüls,Eric Pelletier,Catherine Robert,Patrick Wincker,André Rosenthal,Matthias Platzer,Gerald Nyakatura,Stefan Taudien,Andreas Rump,Douglas R. Smith,Lynn Doucette-Stamm,Marc Rubenfield,Keith Weinstock,Mei Lee Hong,Joann Dubois,Huanming Yang,Jun Yu,Jian Wang,Guyang Huang,Jun Gu,Leroy Hood,Lee Rowen,Anup Madan,Shizen Qin,Ronald W. Davis,Nancy A. Federspiel,A. Pia Abola,Michael Proctor,Bruce A. Roe,Feng Chen,Huaqin Pan,Juliane Ramser,Hans Lehrach,Richard Reinhardt,W. Richard McCombie,Melissa De La Bastide,Neilay Dedhia,H. Blöcker,K. Hornischer,Gabriele Nordsiek,Richa Agarwala,L. Aravind,Jeffrey A. Bailey,Alex Bateman,Serafim Batzoglou,Ewan Birney,Peer Bork,Daniel G. Brown,Christopher B. Burge,Lorenzo Cerutti,Hsiu Chuan Chen,Deanna M. Church,Michele Clamp,Richard R. Copley,Tobias Doerks,Sean R. Eddy,Evan E. Eichler,Terrence S. Furey,James E. Galagan,James G. R. Gilbert,Cyrus L. Harmon,Yoshihide Hayashizaki,David Haussler,Henning Hermjakob,Karsten Hokamp,Wonhee Jang,L. Steven Johnson,Thomas A. Jones,Simon Kasif,Arek Kaspryzk,Scot Kennedy,W. James Kent,Paul Kitts,Eugene V. Koonin,Ian F Korf,David Kulp,Doron Lancet,Todd M. Lowe,Aoife McLysaght,Tarjei S. Mikkelsen,John V. Moran,Nicola Mulder,Victor J. Pollara,Chris P. Ponting,Greg Schuler,Jörg Schultz,Guy Slater,Arian F.A. Smit,Elia Stupka,Joseph Szustakowki,Danielle Thierry-Mieg,Jean Thierry-Mieg,Lukas Wagner,John W. Wallis,Raymond Wheeler,Alan Williams,Yuri I. Wolf,Kenneth H. Wolfe,Shiaw Pyng Yang,Ru Fang Yeh,Francis S. Collins,Mark S. Guyer,Jane Peterson,Adam Felsenfeld,Kris A. Wetterstrand,Richard M. Myers,Jeremy Schmutz,Mark Dickson,Jane Grimwood,David R. Cox,Maynard V. Olson,Rajinder Kaul,Christopher K. Raymond,Nobuyoshi Shimizu,Kazuhiko Kawasaki,Shinsei Minoshima,Glen A. Evans,Maria Athanasiou,Roger A. Schultz,Aristides Patrinos,Michael J. Morgan +248 more
TL;DR: The results of an international collaboration to produce and make freely available a draft sequence of the human genome are reported and an initial analysis is presented, describing some of the insights that can be gleaned from the sequence.
Journal ArticleDOI
RAxML-VI-HPC: maximum likelihood-based phylogenetic analyses with thousands of taxa and mixed models
TL;DR: UNLABELLED RAxML-VI-HPC (randomized axelerated maximum likelihood for high performance computing) is a sequential and parallel program for inference of large phylogenies with maximum likelihood (ML) that has been used to compute ML trees on two of the largest alignments to date.
Journal ArticleDOI
An integrated encyclopedia of DNA elements in the human genome
TL;DR: The Encyclopedia of DNA Elements project provides new insights into the organization and regulation of the authors' genes and genome, and is an expansive resource of functional annotations for biomedical research.
Journal ArticleDOI
The sequence of the human genome.
J. Craig Venter,Mark Raymond Adams,Eugene W. Myers,Peter W. Li,Richard J. Mural,Granger G. Sutton,Hamilton O. Smith,Mark Yandell,Cheryl A. Evans,Robert A. Holt,Jeannine D. Gocayne,Peter Amanatides,Richard M. Ballew,Daniel H. Huson,Jennifer R. Wortman,Qing Zhang,Chinnappa D. Kodira,Xiangqun H. Zheng,Lin Chen,Marian P. Skupski,Gangadharan Subramanian,Paul Thomas,Jinghui Zhang,George L. Gabor Miklos,Catherine R. Nelson,Samuel Broder,Andrew G. Clark,J. H. Nadeau,Victor A. McKusick,Norton D. Zinder,Arnold J. Levine,Richard J. Roberts,M. I. Simon,Carolyn W. Slayman,Michael W. Hunkapiller,Randall Bolanos,Arthur L. Delcher,Ian M. Dew,Daniel Fasulo,Michael Flanigan,Liliana Florea,Aaron L. Halpern,Sridhar Hannenhalli,Saul A. Kravitz,Samuel Levy,Clark M. Mobarry,Knut Reinert,Karin A. Remington,Jane Abu-Threideh,Ellen M. Beasley,Kendra Biddick,Vivien Bonazzi,Rhonda Brandon,Michele Cargill,Ishwar Chandramouliswaran,Rosane Charlab,Kabir Chaturvedi,Zuoming Deng,Valentina Di Francesco,Patrick Dunn,Karen Eilbeck,Carlos Evangelista,Andrei Gabrielian,Weiniu Gan,Wangmao Ge,Fangcheng Gong,Zhiping Gu,Ping Guan,Thomas J. Heiman,Maureen E. Higgins,Rui-Ru Ji,Zhaoxi Ke,Karen A. Ketchum,Zhongwu Lai,Yiding Lei,Zhenya Li,Jiayin Li,Yong Liang,Xiaoying Lin,Fu Lu,Gennady V. Merkulov,Natalia Milshina,Helen M. Moore,Ashwinikumar K Naik,Vaibhav A. Narayan,Beena Neelam,Deborah Nusskern,Douglas B. Rusch,Steven L. Salzberg,Wei Shao,Bixiong Chris Shue,Jingtao Sun,Zhen Yuan Wang,Aihui Wang,Xin Wang,Jian Wang,Ming-Hui Wei,Ron Wides,Chunlin Xiao,Chunhua Yan,Alison Yao,Jane Ye,Ming Zhan,Weiqing Zhang,Hongyu Zhang,Qi Zhao,Liansheng Zheng,Fei Zhong,Wenyan Zhong,Shiaoping C. Zhu,Shaying Zhao,Dennis A. Gilbert,Suzanna Baumhueter,Gene Spier,Christine Carter,Anibal Cravchik,Trevor Woodage,Feroze Ali,Huijin An,Aderonke Awe,Danita Baldwin,Holly Baden,Mary Barnstead,Ian Barrow,Karen Beeson,Dana A. Busam,Amy Carver,Ming Lai Cheng,Liz Curry,Steve Danaher,Lionel Davenport,Raymond Desilets,Susanne Dietz,Kristina Dodson,Lisa Doup,Steven Ferriera,Neha Garg,Andres Gluecksmann,Brit J. Hart,Jason Haynes,Charles Haynes,Cheryl Heiner,Suzanne Hladun,Damon Hostin,Jarrett Houck,Timothy Howland,Chinyere Ibegwam,Jeffery Johnson,Francis Kalush,Lesley Kline,Shashi Koduru,Amy Love,Felecia Mann,David May,Steven McCawley,Tina C. McIntosh,Ivy McMullen,Mee Moy,Linda Moy,Brian Murphy,Keith Nelson,Cynthia Pfannkoch,Eric Pratts,Vinita Puri,Hina Qureshi,Matthew Reardon,Robert Rodriguez,Yu-Hui Rogers,Deanna Romblad,Bob Ruhfel,Richard T. Scott,Cynthia Sitter,Michelle Smallwood,Erin Stewart,Renee Strong,Ellen Suh,Reginald Thomas,Ni Ni Tint,Sukyee Tse,Claire Vech,Gary Wang,Jeremy Wetter,Sherita Williams,Monica Williams,Sandra Windsor,Emily Winn-Deen,Keriellen Wolfe,Jayshree Zaveri,Karena Zaveri,Josep F. Abril,Roderic Guigó,Michael J. Campbell,Kimmen Sjölander,Brian Karlak,Anish Kejariwal,Huaiyu Mi,Betty Lazareva,Thomas Hatton,Apurva Narechania,Karen Diemer,Anushya Muruganujan,Nan Guo,Shinji Sato,Vineet Bafna,Sorin Istrail,Ross Lippert,Russell Schwartz,Brian P. Walenz,Shibu Yooseph,David Allen,Anand Basu,James Baxendale,Louis Blick,Marcelo Caminha,John Carnes-Stine,Parris Caulk,Yen-Hui Chiang,My Coyne,Carl Dahlke,Anne Deslattes Mays,Maria Dombroski,Michael Donnelly,Dale Ely,Shiva Esparham,Carl Fosler,Harold Gire,Stephen Glanowski,Kenneth Glasser,Anna Glodek,Mark Gorokhov,Ken Graham,Barry Gropman,Michael Harris,Jeremy Heil,Scott Henderson,Jeffrey Hoover,Donald Jennings,Catherine Jordan,James Jordan,John Kasha,Leonid Kagan,Cheryl L. Kraft,Alexander Levitsky,Mark Lewis,Xiangjun Liu,John Lopez,Daniel Ma,William H. Majoros,Joe McDaniel,Sean C. Murphy,Matthew Newman,Trung Hieu Nguyen,Ngoc Nguyen,Marc Nodell,Sue Pan,Jim Peck,Marshall Peterson,William Rowe,Robert Sanders,John Scott,Michael Simpson,Thomas J. Smith,Arlan Sprague,Timothy B. Stockwell,Russell Turner,Eli Venter,Mei Wang,Meiyuan Wen,David Wu,Mitchell Wu,Ashley Xia,Ali Zandieh,Xiaohong Zhu +272 more
TL;DR: Comparative genomic analysis indicates vertebrate expansions of genes associated with neuronal function, with tissue-specific developmental regulation, and with the hemostasis and immune systems are indicated.
Related Papers (5)
MAFFT Multiple Sequence Alignment Software Version 7: Improvements in Performance and Usability
Kazutaka Katoh,Daron M. Standley +1 more