Home
/
Authors
/
David W. Burt

Author

David W. Burt

Other affiliations: Brigham and Women's Hospital, University of Edinburgh, University of Leicester ...read more

Bio: David W. Burt is an academic researcher from University of Queensland. The author has contributed to research in topics: Genome & Gene. The author has an hindex of 54, co-authored 224 publications receiving 13977 citations. Previous affiliations of David W. Burt include Brigham and Women's Hospital & University of Edinburgh.

Topics: Genome, Gene, Population, Genomics, Comparative genomics ...read more

Papers published on a yearly basis

2023
2022
2021
2020
2019
2018
2017
2016
2015
2014
2013
2012
2011
2010
2009
2008
2007
2006
2005
2004
2003
2002
2001
2000
1999
1998
1997
1996
1995
1994
1992
1991
1989
1988
1987
1985
1984

Papers

PDF

Open Access

More filters

Journal Article•DOI•

Sequence and comparative analysis of the chicken genome provide unique perspectives on vertebrate evolution

[...]

LaDeana W. Hillier¹, Webb Miller², Ewan Birney, Wesley C. Warren¹ +171 more•Institutions (39)

09 Dec 2004-Nature

TL;DR: A draft genome sequence of the red jungle fowl, Gallus gallus, provides a new perspective on vertebrate genome evolution, while also improving the annotation of mammalian genomes.

...read moreread less

Abstract: We present here a draft genome sequence of the red jungle fowl, Gallus gallus. Because the chicken is a modern descendant of the dinosaurs and the first non-mammalian amniote to have its genome sequenced, the draft sequence of its genome--composed of approximately one billion base pairs of sequence and an estimated 20,000-23,000 genes--provides a new perspective on vertebrate genome evolution, while also improving the annotation of mammalian genomes. For example, the evolutionary distance between chicken and human provides high specificity in detecting functional elements, both non-coding and coding. Notably, many conserved non-coding sequences are far from genes and cannot be assigned to defined functional classes. In coding regions the evolutionary dynamics of protein domains and orthologous groups illustrate processes that distinguish the lineages leading to birds and mammals. The distinctive properties of avian microchromosomes, together with the inferred patterns of conserved synteny, provide additional insights into vertebrate chromosome architecture.

...read moreread less

2,579 citations

Journal Article•DOI•

Comparative genomics reveals insights into avian genome evolution and adaptation.

[...]

Guojie Zhang¹, Guojie Zhang², Cai Li¹, Qiye Li¹, Bo Li¹, Denis M. Larkin³, Chul Hee Lee⁴, Jay F. Storz⁵, Agostinho Antunes⁶, Matthew J. Greenwold⁷, Robert W. Meredith⁸, Anders Ödeen⁹, Jie Cui¹⁰, Qi Zhou¹¹, Luohao Xu¹, Hailin Pan¹, Zongji Wang¹², Lijun Jin¹, Pei Zhang¹, Haofu Hu¹, Wei Yang¹, Jiang Hu¹, Jin Xiao¹, Zhikai Yang¹, Yang Liu¹, Qiaolin Xie¹, Hao Yu¹, Jinmin Lian¹, Ping Wen¹, Fang Zhang¹, Hui Li¹, Yongli Zeng¹, Zijun Xiong¹, Shiping Liu¹², Long Zhou¹, Zhiyong Huang¹, Na An¹, Jie Wang¹³, Qiumei Zheng¹, Yingqi Xiong¹, Guangbiao Wang¹, Bo Wang¹, Jingjing Wang¹, Yu Fan¹⁴, Rute R. da Fonseca², Alonzo Alfaro-Núñez², Mikkel Schubert², Ludovic Orlando², Tobias Mourier², Jason T. Howard¹⁵, Ganeshkumar Ganapathy¹⁵, Andreas R. Pfenning¹⁵, Osceola Whitney¹⁵, Miriam V. Rivas¹⁵, Erina Hara¹⁵, Julia Smith¹⁵, Marta Farré³, Jitendra Narayan¹⁶, Gancho T. Slavov¹⁶, Michael N Romanov¹⁷, Rui Borges⁶, João Paulo Machado⁶, Imran Khan⁶, Mark S. Springer¹⁸, John Gatesy¹⁸, Federico G. Hoffmann¹⁹, Juan C. Opazo²⁰, Olle Håstad²¹, Roger H. Sawyer⁷, Heebal Kim⁴, Kyu-Won Kim⁴, Hyeon Jeong Kim⁴, Seoae Cho⁴, Ning Li²², Yinhua Huang²², Michael William Bruford²³, Xiangjiang Zhan¹³, Andrew Dixon, Mads F. Bertelsen²⁴, Elizabeth P. Derryberry²⁵, Wesley C. Warren²⁶, Richard K. Wilson²⁶, Shengbin Li²⁷, David A. Ray¹⁹, Richard E. Green²⁸, Stephen J. O'Brien²⁹, Darren K. Griffin¹⁷, Warren E. Johnson³⁰, David Haussler²⁸, Oliver A. Ryder, Eske Willerslev², Gary R. Graves³¹, Per Alström²¹, Jon Fjeldså³², David P. Mindell³³, Scott V. Edwards³⁴, Edward L. Braun³⁵, Carsten Rahbek³², David W. Burt³⁶, Peter Houde³⁷, Yong Zhang¹, Huanming Yang³⁸, Jian Wang¹, Erich D. Jarvis¹⁵, M. Thomas P. Gilbert³⁹, M. Thomas P. Gilbert², Jun Wang - Show less +103 more•Institutions (39)

Beijing Genomics Institute¹, University of Copenhagen², Royal Veterinary College³, Seoul National University⁴, University of Nebraska–Lincoln⁵, University of Porto⁶, University of South Carolina⁷, Montclair State University⁸, Uppsala University⁹, National University of Singapore¹⁰, University of California, Berkeley¹¹, South China University of Technology¹², Chinese Academy of Sciences¹³, Kunming Institute of Zoology¹⁴, Howard Hughes Medical Institute¹⁵, Aberystwyth University¹⁶, University of Kent¹⁷, University of California, Riverside¹⁸, Mississippi State University¹⁹, Austral University of Chile²⁰, Swedish University of Agricultural Sciences²¹, China Agricultural University²², Cardiff University²³, Copenhagen Zoo²⁴, Louisiana State University²⁵, Washington University in St. Louis²⁶, Xi'an Jiaotong University²⁷, University of California, Santa Cruz²⁸, Nova Southeastern University Oceanographic Center²⁹, Smithsonian Conservation Biology Institute³⁰, National Museum of Natural History³¹, Natural History Museum³², University of California, San Francisco³³, Harvard University³⁴, University of Florida³⁵, University of Edinburgh³⁶, New Mexico State University³⁷, Macau University of Science and Technology³⁸, Curtin University³⁹

12 Dec 2014-Science

TL;DR: This work explored bird macroevolution using full genomes from 48 avian species representing all major extant clades to reveal that pan-avian genomic diversity covaries with adaptations to different lifestyles and convergent evolution of traits.

...read moreread less

Abstract: Birds are the most species-rich class of tetrapod vertebrates and have wide relevance across many research fields. We explored bird macroevolution using full genomes from 48 avian species representing all major extant clades. The avian genome is principally characterized by its constrained size, which predominantly arose because of lineage-specific erosion of repetitive elements, large segmental deletions, and gene loss. Avian genomes furthermore show a remarkably high degree of evolutionary stasis at the levels of nucleotide sequence, gene synteny, and chromosomal structure. Despite this pattern of conservation, we detected many non-neutral evolutionary changes in protein-coding genes and noncoding regions. These analyses reveal that pan-avian genomic diversity covaries with adaptations to different lifestyles and convergent evolution of traits.

...read moreread less

872 citations

Journal Article•DOI•

The genome of a songbird

[...]

Wesley C. Warren¹, David F. Clayton², Hans Ellegren³, Arthur P. Arnold⁴, LaDeana W. Hillier¹, Axel Künstner³, Steve Searle⁵, Simon D. M. White⁵, Albert J. Vilella, Susan Fairley⁵, Andreas Heger⁶, Lesheng Kong⁶, Chris P. Ponting⁶, Erich D. Jarvis⁷, Claudio V. Mello, Patrick Minx¹, Peter V. Lovell, Tarciso A. F. Velho, Margaret Ferris², Christopher N. Balakrishnan², Saurabh Sinha², Charles Blatti², Sarah E. London², Yun Li², Ya-Chi Lin², Jimin George², Jonathan V. Sweedler², Bruce R. Southey², Preethi H. Gunaratne⁸, Michael E. Watson, Kiwoong Nam³, Niclas Backström³, Linnéa Smeds³, Benoit Nabholz³, Yuichiro Itoh⁴, Osceola Whitney⁷, Andreas R. Pfenning⁷, Jason T. Howard⁷, Martin Völker, Benjamin M. Skinner⁹, Darren K. Griffin⁹, Liang Ye¹, William M. McLaren, Paul Flicek, Víctor Quesada¹⁰, Gloria Velasco¹⁰, Carlos López-Otín¹⁰, Xose S. Puente¹⁰, Tsviya Olender¹¹, Doron Lancet¹¹, Arian F.A. Smit¹², Robert Hubley¹², Miriam K. Konkel¹³, Jerilyn A. Walker¹³, Mark A. Batzer¹³, Wanjun Gu¹⁴, David D. Pollock¹⁴, Lin Chen¹⁵, Ze Cheng¹⁵, Evan E. Eichler¹⁵, Jessica Stapley¹⁵, Jon Slate¹⁶, Robert Ekblom¹⁶, Tim R. Birkhead¹⁶, Terry Burke¹⁶, David W. Burt¹⁷, Constance Scharff¹⁸, Iris Adam¹⁹, Hugues Richard¹⁸, Marc Sultan¹⁸, Alexey Soldatov¹⁸, Hans Lehrach¹⁸, Scott V. Edwards²⁰, Shiaw-Pyng Yang²¹, XiaoChing Li¹³, Tina Graves¹, Lucinda Fulton¹, Joanne O. Nelson¹, Asif T. Chinwalla¹, Shunfeng Hou¹, Elaine R. Mardis¹, Richard K. Wilson¹ - Show less +78 more•Institutions (21)

Washington University in St. Louis¹, University of Illinois at Urbana–Champaign², Uppsala University³, University of California, Los Angeles⁴, Wellcome Trust Sanger Institute⁵, University of Oxford⁶, Duke University⁷, University of Houston⁸, University of Kent⁹, University of Oviedo¹⁰, Weizmann Institute of Science¹¹, Institute for Systems Biology¹², Louisiana State University¹³, University of Colorado Denver¹⁴, University of Washington¹⁵, University of Sheffield¹⁶, University of Edinburgh¹⁷, Max Planck Society¹⁸, Free University of Berlin¹⁹, Harvard University²⁰, Monsanto²¹

01 Apr 2010-Nature

TL;DR: This work shows that song behaviour engages gene regulatory networks in the zebra finch brain, altering the expression of long non-coding RNAs, microRNAs, transcription factors and their targets and shows evidence for rapid molecular evolution in the songbird lineage of genes that are regulated during song experience.

...read moreread less

Abstract: The zebra finch is an important model organism in several fields with unique relevance to human neuroscience. Like other songbirds, the zebra finch communicates through learned vocalizations, an ability otherwise documented only in humans and a few other animals and lacking in the chicken-the only bird with a sequenced genome until now. Here we present a structural, functional and comparative analysis of the genome sequence of the zebra finch (Taeniopygia guttata), which is a songbird belonging to the large avian order Passeriformes. We find that the overall structures of the genomes are similar in zebra finch and chicken, but they differ in many intrachromosomal rearrangements, lineage-specific gene family expansions, the number of long-terminal-repeat-based retrotransposons, and mechanisms of sex chromosome dosage compensation. We show that song behaviour engages gene regulatory networks in the zebra finch brain, altering the expression of long non-coding RNAs, microRNAs, transcription factors and their targets. We also show evidence for rapid molecular evolution in the songbird lineage of genes that are regulated during song experience. These results indicate an active involvement of the genome in neural processes underlying vocal communication and identify potential genetic substrates for the evolution and regulation of this behaviour.

...read moreread less

837 citations

Journal Article•DOI•

Multi-Platform Next-Generation Sequencing of the Domestic Turkey (Meleagris gallopavo): Genome Assembly and Analysis

[...]

Rami A. Dalloul¹, Julie A. Long², Aleksey V. Zimin³, Luqman Aslam⁴, Kathryn Beal⁵, Le Ann Blomberg², Pascal Bouffard⁶, David W. Burt⁷, Oswald Crasta⁸, Richard P. M. A. Crooijmans⁴, Kristal L. Cooper⁸, Roger A. Coulombe⁹, Supriyo De¹⁰, Mary E. Delany¹¹, Jerry B. Dodgson¹², Jennifer J Dong¹³, Clive Evans⁸, Karin M. Frederickson⁶, Paul Flicek⁵, Liliana Florea³, Otto Folkerts⁸, Martien A. M. Groenen⁴, Tim Harkins⁶, Javier Herrero⁵, Steve Hoffmann¹⁴, Hendrik-Jan Megens⁴, Andrew Jiang¹¹, Pieter J. de Jong¹⁵, Peter K. Kaiser¹⁶, Heebal Kim¹⁷, Kyu-Won Kim¹⁷, Sungwon Kim¹, David Langenberger¹⁴, Mi-Kyung Lee¹³, Taeheon Lee¹⁷, Shrinivasrao P. Mane⁸, Guillaume Marçais³, Manja Marz¹⁸, Manja Marz¹⁴, A. P. McElroy¹, Thero Modise⁸, Mikhail Nefedov¹⁵, Cedric Notredame, Ian R. Paton⁷, William S. Payne¹², Geo Pertea³, Dennis Prickett¹⁶, Daniela Puiu³, Dan Qioa¹, Emanuele Raineri, Magali Ruffier¹⁹, Steven L. Salzberg³, Michael C. Schatz³, Chantel F. Scheuring¹³, Carl J. Schmidt²⁰, Steven Schroeder², Stephen M. J. Searle¹⁹, Edward J. Smith¹, Jacqueline Smith⁷, Tad S. Sonstegard², Peter F. Stadler, Hakim Tafer¹⁴, Hakim Tafer²¹, Zhijian Jake Tu¹, Curtis P. Van Tassell², Albert J. Vilella⁵, Kelly P. Williams⁸, James A. Yorke³, Liqing Zhang¹, Hong-Bin Zhang¹³, Xiaojun Zhang¹³, Yang Zhang¹³, Kent M. Reed²² - Show less +69 more•Institutions (22)

Virginia Tech¹, United States Department of Agriculture², University of Maryland, College Park³, Wageningen University and Research Centre⁴, European Bioinformatics Institute⁵, Roche Applied Science⁶, University of Edinburgh⁷, Virginia Bioinformatics Institute⁸, Utah State University⁹, National Institutes of Health¹⁰, University of California, Davis¹¹, Michigan State University¹², Texas A&M University¹³, Leipzig University¹⁴, Children's Hospital Oakland Research Institute¹⁵, Institute for Animal Health¹⁶, Seoul National University¹⁷, University of Marburg¹⁸, Wellcome Trust Sanger Institute¹⁹, University of Delaware²⁰, University of Vienna²¹, University of Minnesota²²

07 Sep 2010-PLOS Biology

TL;DR: The combined application of next-generation sequencing platforms has provided an economical approach to unlocking the potential of the turkey genome.

...read moreread less

Abstract: A synergistic combination of two next-generation sequencing platforms with a detailed comparative BAC physical contig map provided a cost-effective assembly of the genome sequence of the domestic turkey (Meleagris gallopavo). Heterozygosity of the sequenced source genome allowed discovery of more than 600,000 high quality single nucleotide variants. Despite this heterozygosity, the current genome assembly (∼1.1 Gb) includes 917 Mb of sequence assigned to specific turkey chromosomes. Annotation identified nearly 16,000 genes, with 15,093 recognized as protein coding and 611 as non-coding RNA genes. Comparative analysis of the turkey, chicken, and zebra finch genomes, and comparing avian to mammalian species, supports the characteristic stability of avian genomes and identifies genes unique to the avian lineage. Clear differences are seen in number and variety of genes of the avian immune system where expansions and novel genes are less frequent than examples of gene loss. The turkey genome sequence provides resources to further understand the evolution of vertebrate genomes and genetic variation underlying economically important quantitative traits in poultry. This integrated approach may be a model for providing both gene and chromosome level assemblies of other species with agricultural, ecological, and evolutionary interest.

...read moreread less

415 citations

Journal Article•DOI•

A genetic variation map for chicken with 2.8 million single-nucleotide polymorphisms

[...]

Gane Ka-Shu Wong¹, Gane Ka-Shu Wong², Gane Ka-Shu Wong³, Bin Liu¹, Jun Wang³, Jun Wang¹, Yong Zhang⁴, Yong Zhang¹, Xu Yang¹, Zengjin Zhang¹, Qingshun Meng¹, Jun Zhou¹, Dawei Li¹, Jingjing Zhang¹, Peixiang Ni¹, Songgang Li⁴, Songgang Li¹, Longhua Ran, Heng Li⁵, Jianguo Zhang¹, Ruiqiang Li¹, Shengting Li¹, Hongkun Zheng¹, Wei Lin¹, Guangyuan Li¹, Xiaoling Wang¹, Wenming Zhao¹, Jun Li¹, Chen Ye¹, Mingtao Dai¹, Jue Ruan¹, Yan Zhou³, Yuanzhe Li¹, Ximiao He¹, Yunze Zhang¹, Jing Wang¹, Jing Wang⁴, Xiangang Huang¹, Wei Tong¹, Jie Chen¹, Jia Ye¹, Jia Ye³, Chen Chen¹, Ning Wei¹, Guoqing Li¹, Le Dong¹, Fengdi Lan¹, Yongqiao Sun¹, Zhenpeng Zhang¹, Zheng Yang¹, Yingpu Yu³, Yanqing Huang¹, Dandan He¹, Yan Xi¹, Dong Wei¹, Qiuhui Qi¹, Wenjie Li¹, Jianping Shi¹, Miaoheng Wang¹, Fei Xie¹, Jianjun Wang¹, Xiaowei Zhang¹, Pei Wang¹, Yiqiang Zhao⁶, Ning Li⁶, Ning Yang⁶, Wei Dong¹, Songnian Hu¹, Changqing Zeng¹, Wei-Mou Zheng⁵, Bailin Hao⁵, LaDeana W. Hillier⁷, Shiaw Pyng Yang⁷, Wesley C. Warren⁷, Richard K. Wilson⁷, Mikael Brandström⁸, Hans Ellegren⁸, Richard P. M. A. Crooijmans⁹, Jan J. van der Poel⁹, Henk Bovenhuis⁹, Martien A. M. Groenen⁹, Ivan Ovcharenko¹⁰, Laurie Gordon¹¹, Laurie Gordon¹⁰, Lisa Stubbs¹², Susan Lucas¹¹, Tijana Glavina¹¹, Andrea Aerts¹¹, Peter K. Kaiser, Lisa Rothwell, John R. Young, Sally L. Rogers, Brian A Walker, Andy van Hateren, James C. Kaufman, Nat Bumstead, Susan J. Lamont¹³, Huaijun Zhou¹³, Paul M Hocking¹⁴, David R. Morrice¹⁴, Dirk-Jan de Koning¹⁴, Andy Law¹⁴, Neil Bartley¹⁴, David W. Burt¹⁴, Henry D. Hunt¹⁵, Hans H. Cheng¹⁵, Ulrika Gunnarsson⁸, Per Wahlberg⁸, Leif Andersson⁸, Leif Andersson¹⁶, Ellen Kindlund¹⁷, Martti T. Tammi¹⁷, Martti T. Tammi¹⁸, Björn Andersson¹⁷, Caleb Webber¹⁹, Chris P. Ponting¹⁹, Ian M. Overton²⁰, Paul E. Boardman²⁰, Haizhou Tang²⁰, Simon J. Hubbard²⁰, Stuart A. Wilson²¹, Jun Yu¹, Jun Yu³, Jian Wang³, Jian Wang¹, Huanming Yang¹, Huanming Yang³ - Show less +123 more•Institutions (21)

Beijing Institute of Genomics¹, University of Washington², Zhejiang University³, Peking University⁴, Chinese Academy of Sciences⁵, China Agricultural University⁶, Washington University in St. Louis⁷, Uppsala University⁸, Wageningen University and Research Centre⁹, Lawrence Livermore National Laboratory¹⁰, United States Department of Energy¹¹, University of Illinois at Urbana–Champaign¹², Iowa State University¹³, The Roslin Institute¹⁴, United States Department of Agriculture¹⁵, Swedish University of Agricultural Sciences¹⁶, Karolinska Institutet¹⁷, National University of Singapore¹⁸, University of Oxford¹⁹, University of Manchester²⁰, University of Sheffield²¹

09 Dec 2004-Nature

TL;DR: This map is based on a comparison of the sequences of three domestic chicken breeds with that of their wild ancestor, red jungle fowl, and indicates that at least 90% of the variant sites are true SNPs, and at least 70% are common SNPs that segregate in many domestic breeds.

...read moreread less

Abstract: We describe a genetic variation map for the chicken genome containing 2.8 million single-nucleotide polymorphisms (SNPs). This map is based on a comparison of the sequences of three domestic chicken breeds (a broiler, a layer and a Chinese silkie) with that of their wild ancestor, red jungle fowl. Subsequent experiments indicate that at least 90% of the variant sites are true SNPs, and at least 70% are common SNPs that segregate in many domestic breeds. Mean nucleotide diversity is about five SNPs per kilobase for almost every possible comparison between red jungle fowl and domestic lines, between two different domestic lines, and within domestic lines--in contrast to the notion that domestic animals are highly inbred relative to their wild ancestors. In fact, most of the SNPs originated before domestication, and there is little evidence of selective sweeps for adaptive alleles on length scales greater than 100 kilobases.

...read moreread less

406 citations

1
2
3
4
…
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47

Collapse

Cited by

PDF

Open Access

More filters

Journal Article•DOI•

The sequence of the human genome.

[...]

J. Craig Venter¹, Mark Raymond Adams¹, Eugene W. Myers¹, Peter W. Li¹ +269 more•Institutions (12)

16 Feb 2001-Science

TL;DR: Comparative genomic analysis indicates vertebrate expansions of genes associated with neuronal function, with tissue-specific developmental regulation, and with the hemostasis and immune systems are indicated.

...read moreread less

Abstract: A 2.91-billion base pair (bp) consensus sequence of the euchromatic portion of the human genome was generated by the whole-genome shotgun sequencing method. The 14.8-billion bp DNA sequence was generated over 9 months from 27,271,853 high-quality sequence reads (5.11-fold coverage of the genome) from both ends of plasmid clones made from the DNA of five individuals. Two assembly strategies-a whole-genome assembly and a regional chromosome assembly-were used, each combining sequence data from Celera and the publicly funded genome effort. The public data were shredded into 550-bp segments to create a 2.9-fold coverage of those genome regions that had been sequenced, without including biases inherent in the cloning and assembly procedure used by the publicly funded group. This brought the effective coverage in the assemblies to eightfold, reducing the number and size of gaps in the final assembly over what would be obtained with 5.11-fold coverage. The two assembly strategies yielded very similar results that largely agree with independent mapping data. The assemblies effectively cover the euchromatic regions of the human chromosomes. More than 90% of the genome is in scaffold assemblies of 100,000 bp or more, and 25% of the genome is in scaffolds of 10 million bp or larger. Analysis of the genome sequence revealed 26,588 protein-encoding transcripts for which there was strong corroborating evidence and an additional approximately 12,000 computationally derived genes with mouse matches or other weak supporting evidence. Although gene-dense clusters are obvious, almost half the genes are dispersed in low G+C sequence separated by large tracts of apparently noncoding sequence. Only 1.1% of the genome is spanned by exons, whereas 24% is in introns, with 75% of the genome being intergenic DNA. Duplications of segmental blocks, ranging in size up to chromosomal lengths, are abundant throughout the genome and reveal a complex evolutionary history. Comparative genomic analysis indicates vertebrate expansions of genes associated with neuronal function, with tissue-specific developmental regulation, and with the hemostasis and immune systems. DNA sequence comparisons between the consensus sequence and publicly funded genome data provided locations of 2.1 million single-nucleotide polymorphisms (SNPs). A random pair of human haploid genomes differed at a rate of 1 bp per 1250 on average, but there was marked heterogeneity in the level of polymorphism across the genome. Less than 1% of all SNPs resulted in variation in proteins, but the task of determining which SNPs have functional consequences remains an open challenge.

...read moreread less

12,098 citations

Journal Article•DOI•

Conserved seed pairing, often flanked by adenosines, indicates that thousands of human genes are microRNA targets

[...]

Benjamin P. Lewis¹, Christopher B. Burge¹, David P. Bartel¹•Institutions (1)

Massachusetts Institute of Technology¹

14 Jan 2005-Cell

TL;DR: In a four-genome analysis of 3' UTRs, approximately 13,000 regulatory relationships were detected above the estimate of false-positive predictions, thereby implicating as miRNA targets more than 5300 human genes, which represented 30% of the gene set.

...read moreread less

11,624 citations

Journal Article•DOI•

Identification and analysis of functional elements in 1% of the human genome by the ENCODE pilot project

[...]

Ewan Birney, John A. Stamatoyannopoulos¹, Anindya Dutta², Roderic Guigó³ +317 more•Institutions (44)

14 Jun 2007-Nature

TL;DR: Functional data from multiple, diverse experiments performed on a targeted 1% of the human genome as part of the pilot phase of the ENCODE Project are reported, providing convincing evidence that the genome is pervasively transcribed, such that the majority of its bases can be found in primary transcripts.

...read moreread less

Abstract: We report the generation and analysis of functional data from multiple, diverse experiments performed on a targeted 1% of the human genome as part of the pilot phase of the ENCODE Project. These data have been further integrated and augmented by a number of evolutionary and computational analyses. Together, our results advance the collective knowledge about human genome function in several major areas. First, our studies provide convincing evidence that the genome is pervasively transcribed, such that the majority of its bases can be found in primary transcripts, including non-protein-coding transcripts, and those that extensively overlap one another. Second, systematic examination of transcriptional regulation has yielded new understanding about transcription start sites, including their relationship to specific regulatory sequences and features of chromatin accessibility and histone modification. Third, a more sophisticated view of chromatin structure has emerged, including its inter-relationship with DNA replication and transcriptional regulation. Finally, integration of these new sources of information, in particular with respect to mammalian evolution based on inter- and intra-species sequence comparisons, has yielded new mechanistic and evolutionary insights concerning the functional landscape of the human genome. Together, these studies are defining a path for pursuit of a more comprehensive characterization of human genome function.

...read moreread less

5,091 citations

Journal Article•DOI•

Combinatorial microRNA target predictions.

[...]

Azra Krek¹, Dominic Grün¹, Matthew N. Poy², Rachel Wolf¹, Lauren Rosenberg¹, Eric J Epstein², Philip MacMenamin¹, Isabelle da Piedade¹, Kristin C. Gunsalus¹, Markus Stoffel², Nikolaus Rajewsky¹ - Show less +7 more•Institutions (2)

New York University¹, Rockefeller University²

03 Apr 2005-Nature Genetics

TL;DR: PicTar, a computational method for identifying common targets of micro RNAs, is presented and widespread coordinate control executed by microRNAs is suggested, thus providing evidence for coordinate microRNA control in mammals.

...read moreread less

Abstract: MicroRNAs are small noncoding RNAs that recognize and bind to partially complementary sites in the 3' untranslated regions of target genes in animals and, by unknown mechanisms, regulate protein production of the target transcript. Different combinations of microRNAs are expressed in different cell types and may coordinately regulate cell-specific target genes. Here, we present PicTar, a computational method for identifying common targets of microRNAs. Statistical tests using genome-wide alignments of eight vertebrate genomes, PicTar's ability to specifically recover published microRNA targets, and experimental validation of seven predicted targets suggest that PicTar has an excellent success rate in predicting targets for single microRNAs and for combinations of microRNAs. We find that vertebrate microRNAs target, on average, roughly 200 transcripts each. Furthermore, our results suggest widespread coordinate control executed by microRNAs. In particular, we experimentally validate common regulation of Mtpn by miR-375, miR-124 and let-7b and thus provide evidence for coordinate microRNA control in mammals.

...read moreread less

4,660 citations

Journal Article•DOI•

Reference sequence (RefSeq) database at NCBI: current status, taxonomic expansion, and functional annotation

[...]

Nuala A. O'Leary¹, Mathew W. Wright¹, J. Rodney Brister¹, Stacy Ciufo¹, Diana Haddad¹, Richard McVeigh¹, Bhanu Rajput¹, Barbara Robbertse¹, Brian Smith-White¹, Danso Ako-adjei¹, Alexander Astashyn¹, Azat Badretdin¹, Yiming Bao¹, Olga Blinkova¹, Vyacheslav Brover¹, Vyacheslav Chetvernin¹, Jinna Choi¹, Eric Cox¹, Olga Ermolaeva¹, Catherine M. Farrell¹, Tamara Goldfarb¹, Tripti Gupta¹, Daniel H. Haft¹, Eneida L. Hatcher¹, Wratko Hlavina¹, Vinita Joardar¹, Vamsi K. Kodali¹, Wenjun Li¹, Donna Maglott¹, Patrick Masterson¹, Kelly M. McGarvey¹, Michael R. Murphy¹, Kathleen O'Neill¹, Shashikant Pujar¹, Sanjida H. Rangwala¹, Daniel Rausch¹, Lillian D. Riddick¹, Conrad L. Schoch¹, Andrei Shkeda¹, Susan S. Storz¹, Hanzhen Sun¹, Françoise Thibaud-Nissen¹, Igor Tolstoy¹, Raymond E. Tully¹, Anjana R. Vatsan¹, Craig Wallin¹, David Webb¹, Wendy Wu¹, Melissa J. Landrum¹, Avi Kimchi¹, Tatiana Tatusova¹, Michael DiCuccio¹, Paul Kitts¹, Terence Murphy¹, Kim D. Pruitt¹ - Show less +51 more•Institutions (1)

National Institutes of Health¹

04 Jan 2016-Nucleic Acids Research

TL;DR: The approach to utilizing available RNA-Seq and other data types in the authors' manual curation process for vertebrate, plant, and other species is summarized, and a new direction for prokaryotic genomes and protein name management is described.

...read moreread less

Abstract: The RefSeq project at the National Center for Biotechnology Information (NCBI) maintains and curates a publicly available database of annotated genomic, transcript, and protein sequence records (http://www.ncbi.nlm.nih.gov/refseq/). The RefSeq project leverages the data submitted to the International Nucleotide Sequence Database Collaboration (INSDC) against a combination of computation, manual curation, and collaboration to produce a standard set of stable, non-redundant reference sequences. The RefSeq project augments these reference sequences with current knowledge including publications, functional features and informative nomenclature. The database currently represents sequences from more than 55,000 organisms (>4800 viruses, >40,000 prokaryotes and >10,000 eukaryotes; RefSeq release 71), ranging from a single record to complete genomes. This paper summarizes the current status of the viral, prokaryotic, and eukaryotic branches of the RefSeq project, reports on improvements to data access and details efforts to further expand the taxonomic representation of the collection. We also highlight diverse functional curation initiatives that support multiple uses of RefSeq data including taxonomic validation, genome annotation, comparative genomics, and clinical testing. We summarize our approach to utilizing available RNA-Seq and other data types in our manual curation process for vertebrate, plant, and other species, and describe a new direction for prokaryotic genomes and protein name management.

...read moreread less

4,104 citations

1
2
3
4
…
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
61
62
63
64
65
66
67
68
69
70
71
72
73
74
75
76
77
78
79
80
81
82
83
84
85
86
87
88
89
90
91
92
93
94
95
96
97
98
99
100
101
102
103
104
105
106
107
108
109
110
111
112
113
114
115
116
117
118
119
120
121
122
123
124
125
126
127
128
129
130
131
132
133
134
135
136
137
138
139
140
141
142
143
144
145
146
147
148
149
150
151
152
153
154
155
156
157
158
159
160
161
162
163
164
165
166
167
168
169
170
171
172
173
174
175
176
177
178
179
180
181
182
183
184
185
186
187
188
189
190
191
192
193
194
195
196
197
198
199
200

Collapse