Home
/
Authors
/
Per Unneberg

Author

Per Unneberg

Other affiliations: Centre national de la recherche scientifique, Royal Institute of Technology, Uppsala University

Bio: Per Unneberg is an academic researcher from Science for Life Laboratory. The author has contributed to research in topics: Gene & Expressed sequence tag. The author has an hindex of 17, co-authored 26 publications receiving 5529 citations. Previous affiliations of Per Unneberg include Centre national de la recherche scientifique & Royal Institute of Technology.

Topics: Gene, Expressed sequence tag, Genome, Population, Candidate gene ...read more

Papers

PDF

Open Access

More filters

Journal Article•DOI•

The genome of black cottonwood, Populus trichocarpa (Torr. & Gray)

[...]

Gerald A. Tuskan¹, Gerald A. Tuskan², Stephen P. DiFazio², Stephen P. DiFazio³, Stefan Jansson⁴, Joerg Bohlmann⁵, Igor V. Grigoriev⁶, Uffe Hellsten⁶, Nicholas H. Putnam⁶, Steven G. Ralph⁵, Stephane Rombauts⁷, Asaf Salamov⁶, Jacquie Schein, Lieven Sterck⁷, Andrea Aerts⁶, Rishikeshi Bhalerao⁴, Rishikesh P. Bhalerao⁸, Damien Blaudez⁹, Wout Boerjan⁷, Annick Brun⁹, Amy M. Brunner¹⁰, Victor Busov¹¹, Malcolm M. Campbell¹², John E. Carlson¹³, Michel Chalot⁹, Jarrod Chapman⁶, G.-L. Chen², Dawn Cooper⁵, Pedro M. Coutinho¹⁴, Jérémy Couturier⁹, Sarah F. Covert¹⁵, Quentin C. B. Cronk⁵, R. Cunningham², John M. Davis¹⁶, Sven Degroeve⁷, Annabelle Déjardin⁹, Claude W. dePamphilis¹³, John C. Detter⁶, Bill Dirks¹⁷, Inna Dubchak¹⁸, Inna Dubchak⁶, Sébastien Duplessis⁹, Jürgen Ehlting⁵, Brian E. Ellis⁵, Karla C Gendler¹⁹, David Goodstein⁶, Michael Gribskov²⁰, Jane Grimwood²¹, Andrew Groover²², Lee E. Gunter², Björn Hamberger⁵, Berthold Heinze, Yrjö Helariutta²³, Yrjö Helariutta²⁴, Yrjö Helariutta⁸, Bernard Henrissat¹⁴, D. Holligan¹⁵, Robert A. Holt, Wenyu Huang⁶, N. Islam-Faridi²², Steven J.M. Jones, M. Jones-Rhoades²⁵, Richard A. Jorgensen¹⁹, Chandrashekhar P. Joshi¹¹, Jaakko Kangasjärvi²⁴, Jan Karlsson⁴, Colin T. Kelleher⁵, Robert Kirkpatrick, Matias Kirst¹⁶, Annegret Kohler⁹, Udaya C. Kalluri², Frank W. Larimer², Jim Leebens-Mack¹⁵, Jean-Charles Leplé⁹, Philip F. LoCascio², Y. Lou⁶, Susan Lucas⁶, Francis Martin⁹, Barbara Montanini⁹, Carolyn A. Napoli¹⁹, David R. Nelson²⁶, C D Nelson²², Kaisa Nieminen²⁴, Ove Nilsson⁸, V. Pereda⁹, Gary F. Peter¹⁶, Ryan N. Philippe⁵, Gilles Pilate⁹, Alexander Poliakov¹⁸, J. Razumovskaya², Paul G. Richardson⁶, Cécile Rinaldi⁹, Kermit Ritland⁵, Pierre Rouzé⁷, D. Ryaboy¹⁸, Jeremy Schmutz²¹, J. Schrader²⁷, Bo Segerman⁴, H. Shin, Asim Siddiqui, Fredrik Sterky, Astrid Terry⁶, Chung-Jui Tsai¹¹, Edward C. Uberbacher², Per Unneberg, Jorma Vahala²⁴, Kerr Wall¹³, Susan R. Wessler¹⁵, Guojun Yang¹⁵, T. Yin², Carl J. Douglas⁵, Marco A. Marra, Göran Sandberg⁸, Y. Van de Peer⁷, Daniel S. Rokhsar⁶, Daniel S. Rokhsar¹⁷ - Show less +112 more•Institutions (27)

University of Tennessee¹, Oak Ridge National Laboratory², West Virginia University³, Umeå University⁴, University of British Columbia⁵, United States Department of Energy⁶, Ghent University⁷, Swedish University of Agricultural Sciences⁸, Institut national de la recherche agronomique⁹, Virginia Tech¹⁰, Michigan Technological University¹¹, University of Toronto¹², Pennsylvania State University¹³, University of Provence¹⁴, University of Georgia¹⁵, University of Florida¹⁶, University of California, Berkeley¹⁷, Lawrence Berkeley National Laboratory¹⁸, University of Arizona¹⁹, Purdue University²⁰, Stanford University²¹, United States Department of Agriculture²², University of Turku²³, University of Helsinki²⁴, Massachusetts Institute of Technology²⁵, University of Tennessee Health Science Center²⁶, University of Tübingen²⁷

15 Sep 2006-Science

TL;DR: The draft genome of the black cottonwood tree, Populus trichocarpa, has been reported in this paper, with more than 45,000 putative protein-coding genes identified.

...read moreread less

Abstract: We report the draft genome of the black cottonwood tree, Populus trichocarpa. Integration of shotgun sequence assembly with genetic mapping enabled chromosome-scale reconstruction of the genome. More than 45,000 putative protein-coding genes were identified. Analysis of the assembled genome revealed a whole-genome duplication event; about 8000 pairs of duplicated genes from that event survived in the Populus genome. A second, older duplication event is indistinguishably coincident with the divergence of the Populus and Arabidopsis lineages. Nucleotide substitution, tandem gene duplication, and gross chromosomal rearrangement appear to proceed substantially more slowly in Populus than in Arabidopsis. Populus has more protein-coding genes than Arabidopsis, ranging on average from 1.4 to 1.6 putative Populus homologs for each Arabidopsis gene. However, the relative frequency of protein domains in the two genomes is similar. Overrepresented exceptions in Populus include genes associated with lignocellulosic wall biosynthesis, meristem development, disease resistance, and metabolite transport.

...read moreread less

4,025 citations

Journal Article•DOI•

The genomic landscape underlying phenotypic integrity in the face of gene flow in crows

[...]

Jelmer W. Poelstra¹, Nagarjun Vijay¹, Christen M. Bossu¹, Henrik Lantz², Henrik Lantz¹, Bettina Ryll², Inge Müller³, Inge Müller⁴, Vittorio Baglione⁵, Per Unneberg¹, Martin Wikelski³, Martin Wikelski⁴, Manfred Grabherr¹, Jochen B. W. Wolf¹ - Show less +10 more•Institutions (5)

Science for Life Laboratory¹, Uppsala University², University of Konstanz³, Max Planck Society⁴, University of Valladolid⁵

20 Jun 2014-Science

TL;DR: Characterization of genomic differentiation in a classic example of hybridization between all-black carrion crows and gray-coated hooded crows identified genome-wide introgression extending far beyond the morphological hybrid zone, indicating localized genomic selection can cause marked heterogeneity in introgressive landscapes while maintaining phenotypic divergence.

...read moreread less

Abstract: The importance, extent, and mode of interspecific gene flow for the evolution of species has long been debated. Characterization of genomic differentiation in a classic example of hybridization between all-black carrion crows and gray-coated hooded crows identified genome-wide introgression extending far beyond the morphological hybrid zone. Gene expression divergence was concentrated in pigmentation genes expressed in gray versus black feather follicles. Only a small number of narrow genomic islands exhibited resistance to gene flow. One prominent genomic region (<2 megabases) harbored 81 of all 82 fixed differences (of 8.4 million single-nucleotide polymorphisms in total) linking genes involved in pigmentation and in visual perception-a genomic signal reflecting color-mediated prezygotic isolation. Thus, localized genomic selection can cause marked heterogeneity in introgression landscapes while maintaining phenotypic divergence.

...read moreread less

495 citations

Journal Article•DOI•

Integrative Annotation of 21,037 Human Genes Validated by Full-Length cDNA Clones

[...]

Tadashi Imanishi¹, Takeshi Itoh¹, Yutaka Suzuki², Claire O'Donovan³ +164 more•Institutions (42)

20 Apr 2004-PLOS Biology

TL;DR: The H-InvDB as discussed by the authors is a database of 41,118 full-length cDNAs that capture the gene transcripts as complete functional cassettes, providing an unequivocal report of structural and functional diversity at the gene level.

...read moreread less

Abstract: The human genome sequence defines our inherent biological potential; the realization of the biology encoded therein requires knowledge of the function of each gene. Currently, our knowledge in this area is still limited. Several lines of investigation have been used to elucidate the structure and function of the genes in the human genome. Even so, gene prediction remains a difficult task, as the varieties of transcripts of a gene may vary to a great extent. We thus performed an exhaustive integrative characterization of 41,118 full-length cDNAs that capture the gene transcripts as complete functional cassettes, providing an unequivocal report of structural and functional diversity at the gene level. Our international collaboration has validated 21,037 human gene candidates by analysis of high-quality full-length cDNA clones through curation using unified criteria. This led to the identification of 5,155 new gene candidates. It also manifested the most reliable way to control the quality of the cDNA clones. We have developed a human gene database, called the H-Invitational Database (H-InvDB; http://www.h-invitational.jp/). It provides the following: integrative annotation of human genes, description of gene structures, details of novel alternative splicing isoforms, non-protein-coding RNAs, functional domains, subcellular localizations, metabolic pathways, predictions of protein three-dimensional structure, mapping of known single nucleotide polymorphisms (SNPs), identification of polymorphic microsatellite repeats within human genes, and comparative results with mouse full-length cDNAs. The H-InvDB analysis has shown that up to 4% of the human genome sequence (National Center for Biotechnology Information build 34 assembly) may contain misassembled or missing regions. We found that 6.5% of the human gene candidates (1,377 loci) did not have a good protein-coding open reading frame, of which 296 loci are strong candidates for non-protein-coding RNA genes. In addition, among 72,027 uniquely mapped SNPs and insertions/deletions localized within human genes, 13,215 nonsynonymous SNPs, 315 nonsense SNPs, and 452 indels occurred in coding regions. Together with 25 polymorphic microsatellite repeats present in coding regions, they may alter protein structure, causing phenotypic effects or resulting in disease. The H-InvDB platform represents a substantial contribution to resources needed for the exploration of human biology and pathology.

...read moreread less

341 citations

Journal Article•DOI•

A Populus EST resource for plant functional genomics

[...]

Fredrik Sterky¹, Rupali Bhalerao, Per Unneberg, Bo Segerman, Peter Nilsson, Amy M. Brunner, Laurence Charbonnel-Campaa, Jenny Jonsson Lindvall, Karolina Tandre, Steven H. Strauss, Björn Sundberg, Petter Gustafsson, Mathias Uhlén, Rishikesh P. Bhalerao, Ove Nilsson, Göran Sandberg, Jan Karlsson, Joakim Lundeberg, Stefan Jansson - Show less +15 more•Institutions (1)

Royal Institute of Technology¹

21 Sep 2004-Proceedings of the National Academy of Sciences of the United States of America

TL;DR: The coding content of Populus and Arabidopsis genomes shows very high similarity, indicating that differences between these annual and perennial angiosperm life forms result primarily from differences in gene regulation.

...read moreread less

Abstract: Trees present a life form of paramount importance for terrestrial ecosystems and human societies because of their ecological structure and physiological function and provision of energy and industrial materials. The genus Populus is the internationally accepted model for molecular tree biology. We have analyzed 102,019 Populus ESTs that clustered into 11,885 clusters and 12,759 singletons. We also provide >4,000 assembled full clone sequences to serve as a basis for the upcoming annotation of the Populus genome sequence. A public web-based EST database (populusdb) provides digital expression profiles for 18 tissues that comprise the majority of differentiated organs. The coding content of Populus and Arabidopsis genomes shows very high similarity, indicating that differences between these annual and perennial angiosperm life forms result primarily from differences in gene regulation. The high similarity between Populus and Arabidopsis will allow studies of Populus to directly benefit from the detailed functional genomic information generated for Arabidopsis, enabling detailed insights into tree development and adaptation. These data will also valuable for functional genomic efforts in Arabidopsis.

...read moreread less

340 citations

Journal Article•DOI•

Dominant mutations in GRHL3 cause Van der Woude syndrome and disrupt oral periderm development

[...]

Myriam Peyrard-Janvid¹, Elizabeth J. Leslie², Youssef A. Kousa³, Tiffany L. Smith², Martine Dunnwald², Måns Magnusson⁴, Brian A. Lentz², Per Unneberg⁴, Ingegerd Fransson¹, Hannele Koillinen⁵, Jorma Rautio⁵, Marie Pegelow¹, Agneta Karsten¹, Lina Basel-Vanagaite⁶, Lina Basel-Vanagaite⁷, William Gordon⁸, Bogi Andersen⁸, Thomas Svensson⁴, Jeffrey C. Murray², Robert A. Cornell², Juha Kere⁵, Juha Kere⁴, Juha Kere¹, Brian C. Schutte³ - Show less +20 more•Institutions (8)

Karolinska Institutet¹, University of Iowa², Michigan State University³, Science for Life Laboratory⁴, University of Helsinki⁵, Rabin Medical Center⁶, Tel Aviv University⁷, University of California, Irvine⁸

02 Jan 2014-American Journal of Human Genetics

TL;DR: Data demonstrated that mutations in two genes, IRF6 and GRHL3, can lead to nearly identical phenotypes of orofacial cleft and supported the hypotheses that both genes are essential for the presence of a functional oral periderm and that failure of this process contributes to VWS.

...read moreread less

Abstract: Mutations in interferon regulatory factor 6 (IRF6) account for ∼70% of cases of Van der Woude syndrome (VWS), the most common syndromic form of cleft lip and palate. In 8 of 45 VWS-affected families lacking a mutation in IRF6, we found coding mutations in grainyhead-like 3 (GRHL3). According to a zebrafish-based assay, the disease-associated GRHL3 mutations abrogated periderm development and were consistent with a dominant-negative effect, in contrast to haploinsufficiency seen in most VWS cases caused by IRF6 mutations. In mouse, all embryos lacking Grhl3 exhibited abnormal oral periderm and 17% developed a cleft palate. Analysis of the oral phenotype of double heterozygote (Irf6+/−;Grhl3+/−) murine embryos failed to detect epistasis between the two genes, suggesting that they function in separate but convergent pathways during palatogenesis. Taken together, our data demonstrated that mutations in two genes, IRF6 and GRHL3, can lead to nearly identical phenotypes of orofacial cleft. They supported the hypotheses that both genes are essential for the presence of a functional oral periderm and that failure of this process contributes to VWS.

...read moreread less

186 citations

1
2
3
4
…
5
6

Collapse

Cited by

PDF

Open Access

More filters

Journal Article•DOI•

The Carbohydrate-Active EnZymes database (CAZy): an expert resource for Glycogenomics

[...]

Brandi L. Cantarel¹, Pedro M. Coutinho², Corinne Rancurel², Thomas Bernard², Vincent Lombard², Bernard Henrissat² - Show less +2 more•Institutions (2)

University of Provence¹, Aix-Marseille University²

01 Jan 2009-Nucleic Acids Research

TL;DR: The Carbohydrate-Active Enzyme (CAZy) database is a knowledge-based resource specialized in the enzymes that build and breakdown complex carbohydrates and glycoconjugates and has been used to improve the quality of functional predictions of a number genome projects by providing expert annotation.

...read moreread less

Abstract: The Carbohydrate-Active Enzyme (CAZy) database is a knowledge-based resource specialized in the enzymes that build and breakdown complex carbohydrates and glycoconjugates. As of September 2008, the database describes the present knowledge on 113 glycoside hydrolase, 91 glycosyltransferase, 19 polysaccharide lyase, 15 carbohydrate esterase and 52 carbohydrate-binding module families. These families are created based on experimentally characterized proteins and are populated by sequences from public databases with significant similarity. Protein biochemical information is continuously curated based on the available literature and structural information. Over 6400 proteins have assigned EC numbers and 700 proteins have a PDB structure. The classification (i) reflects the structural features of these enzymes better than their sole substrate specificity, (ii) helps to reveal the evolutionary relationships between these enzymes and (iii) provides a convenient framework to understand mechanistic properties. This resource has been available for over 10 years to the scientific community, contributing to information dissemination and providing a transversal nomenclature to glycobiologists. More recently, this resource has been used to improve the quality of functional predictions of a number genome projects by providing expert annotation. The CAZy resource resides at URL: http://www.cazy.org/.

...read moreread less

6,028 citations

Journal Article•DOI•

TM4: a free, open-source system for microarray data management and analysis.

[...]

Alexander I. Saeed, Vasily Sharov, James R. White, J. Li, Wei Liang, Nirmal Bhagabati, John C. Braisted, Maria I. Klapa, T. Currier, Mathangi Thiagarajan, Alexander Sturn, Mark Snuffin, A. Rezantsev, D. Popov, A. Ryltsov, E. Kostukovich, I. Borisovsky, Z. Liu, A. Vinsavich, V. Trush, John Quackenbush¹ - Show less +17 more•Institutions (1)

George Washington University¹

01 Feb 2003-BioTechniques

TL;DR: This research presents a novel and scalable approach to genome engineering that addresses the challenge of integrating RNAseq data to provide real-time information about the “silent” response of the immune system to DNA editing.

...read moreread less

Abstract: White1, J. Li1, W. Liang1, N. Bhagabati1, J. Braisted1, M. Klapa1, T. Currier1, M. Thiagarajan1, A. Sturn1, M. Snuffin2, A. Rezantsev2, D. Popov2, A. Ryltsov2, E. Kostukovich2, I. Borisovsky2, Z. Liu3, A. Vinsavich3, V. Trush3, and J. Quackenbush1,4 1The Institute for Genomic Research, Rockville, MD, 2DataNaut, Bethesda, MD, 3Syntek Systems, Bethesda, MD, and 4Department of Biochemistry, George Washington University, Washington, D.C., USA

...read moreread less

4,756 citations

Integrative analysis of 111 reference human epigenomes

[...]

Anshul Kundaje, Wouter Meuleman, Jason Ernst, Angela Yen, Pouya Kheradpour, Zhizhuo Zhang, Jianrong Wang, Lucas D. Ward, Abhishek Sarkar, Gerald Quon, Matthew L. Eaton, Yi-Chieh Wu, Andreas R. Pfenning, Xinchen Wang, Melina Claussnitzer, Yaping Liu, Mukul S. Bansal, Soheil Feizi-Khankandi, Ah Ram Kim, Richard C Sallari, Nicholas A Sinnott-Armstrong, Laurie A. Boyer, Elizabeta Gjoneska, Li-Huei Tsai, Manolis Kellis - Show less +21 more

01 Feb 2015

TL;DR: In this article, the authors describe the integrative analysis of 111 reference human epigenomes generated as part of the NIH Roadmap Epigenomics Consortium, profiled for histone modification patterns, DNA accessibility, DNA methylation and RNA expression.

...read moreread less

Abstract: The reference human genome sequence set the stage for studies of genetic variation and its association with human disease, but epigenomic studies lack a similar reference. To address this need, the NIH Roadmap Epigenomics Consortium generated the largest collection so far of human epigenomes for primary cells and tissues. Here we describe the integrative analysis of 111 reference human epigenomes generated as part of the programme, profiled for histone modification patterns, DNA accessibility, DNA methylation and RNA expression. We establish global maps of regulatory elements, define regulatory modules of coordinated activity, and their likely activators and repressors. We show that disease- and trait-associated genetic variants are enriched in tissue-specific epigenomic marks, revealing biologically relevant cell types for diverse human traits, and providing a resource for interpreting the molecular basis of human disease. Our results demonstrate the central role of epigenomic information for understanding gene regulation, cellular differentiation and human disease.

...read moreread less

4,409 citations

Journal Article•DOI•

The genome of black cottonwood, Populus trichocarpa (Torr. & Gray)

[...]

Gerald A. Tuskan¹, Gerald A. Tuskan², Stephen P. DiFazio³, Stephen P. DiFazio², Stefan Jansson⁴, Joerg Bohlmann⁵, Igor V. Grigoriev⁶, Uffe Hellsten⁶, Nicholas H. Putnam⁶, Steven G. Ralph⁵, Stephane Rombauts⁷, Asaf Salamov⁶, Jacquie Schein, Lieven Sterck⁷, Andrea Aerts⁶, Rishikeshi Bhalerao⁴, Rishikesh P. Bhalerao⁸, Damien Blaudez⁹, Wout Boerjan⁷, Annick Brun⁹, Amy M. Brunner¹⁰, Victor Busov¹¹, Malcolm M. Campbell¹², John E. Carlson¹³, Michel Chalot⁹, Jarrod Chapman⁶, G.-L. Chen², Dawn Cooper⁵, Pedro M. Coutinho¹⁴, Jérémy Couturier⁹, Sarah F. Covert¹⁵, Quentin C. B. Cronk⁵, R. Cunningham², John M. Davis¹⁶, Sven Degroeve⁷, Annabelle Déjardin⁹, Claude W. dePamphilis¹³, John C. Detter⁶, Bill Dirks¹⁷, Inna Dubchak¹⁸, Inna Dubchak⁶, Sébastien Duplessis⁹, Jürgen Ehlting⁵, Brian E. Ellis⁵, Karla C Gendler¹⁹, David Goodstein⁶, Michael Gribskov²⁰, Jane Grimwood²¹, Andrew Groover²², Lee E. Gunter², Björn Hamberger⁵, Berthold Heinze, Yrjö Helariutta²³, Yrjö Helariutta²⁴, Yrjö Helariutta⁸, Bernard Henrissat¹⁴, D. Holligan¹⁵, Robert A. Holt, Wenyu Huang⁶, N. Islam-Faridi²², Steven J.M. Jones, M. Jones-Rhoades²⁵, Richard A. Jorgensen¹⁹, Chandrashekhar P. Joshi¹¹, Jaakko Kangasjärvi²³, Jan Karlsson⁴, Colin T. Kelleher⁵, Robert Kirkpatrick, Matias Kirst¹⁶, Annegret Kohler⁹, Udaya C. Kalluri², Frank W. Larimer², Jim Leebens-Mack¹⁵, Jean-Charles Leplé⁹, Philip F. LoCascio², Y. Lou⁶, Susan Lucas⁶, Francis Martin⁹, Barbara Montanini⁹, Carolyn A. Napoli¹⁹, David R. Nelson²⁶, C D Nelson²², Kaisa Nieminen²³, Ove Nilsson⁸, V. Pereda⁹, Gary F. Peter¹⁶, Ryan N. Philippe⁵, Gilles Pilate⁹, Alexander Poliakov¹⁸, J. Razumovskaya², Paul G. Richardson⁶, Cécile Rinaldi⁹, Kermit Ritland⁵, Pierre Rouzé⁷, D. Ryaboy¹⁸, Jeremy Schmutz²¹, J. Schrader²⁷, Bo Segerman⁴, H. Shin, Asim Siddiqui, Fredrik Sterky, Astrid Terry⁶, Chung-Jui Tsai¹¹, Edward C. Uberbacher², Per Unneberg, Jorma Vahala²³, Kerr Wall¹³, Susan R. Wessler¹⁵, Guojun Yang¹⁵, T. Yin², Carl J. Douglas⁵, Marco A. Marra, Göran Sandberg⁸, Y. Van de Peer⁷, Daniel S. Rokhsar¹⁷, Daniel S. Rokhsar⁶ - Show less +112 more•Institutions (27)

University of Tennessee¹, Oak Ridge National Laboratory², West Virginia University³, Umeå University⁴, University of British Columbia⁵, United States Department of Energy⁶, Ghent University⁷, Swedish University of Agricultural Sciences⁸, Institut national de la recherche agronomique⁹, Virginia Tech¹⁰, Michigan Technological University¹¹, University of Toronto¹², Pennsylvania State University¹³, University of Provence¹⁴, University of Georgia¹⁵, University of Florida¹⁶, University of California, Berkeley¹⁷, Lawrence Berkeley National Laboratory¹⁸, University of Arizona¹⁹, Purdue University²⁰, Stanford University²¹, United States Department of Agriculture²², University of Helsinki²³, University of Turku²⁴, Massachusetts Institute of Technology²⁵, University of Tennessee Health Science Center²⁶, University of Tübingen²⁷

15 Sep 2006-Science

TL;DR: The draft genome of the black cottonwood tree, Populus trichocarpa, has been reported in this paper, with more than 45,000 putative protein-coding genes identified.

...read moreread less

4,025 citations

Journal Article•DOI•

Genome sequence of the palaeopolyploid soybean

[...]

Jeremy Schmutz, Steven B. Cannon¹, Jessica A. Schlueter², Jessica A. Schlueter³, Jianxin Ma², Therese Mitros⁴, William Nelson⁵, David L. Hyten¹, Qijian Song⁶, Qijian Song¹, Jay J. Thelen⁷, Jianlin Cheng⁷, Dong Xu⁷, Uffe Hellsten⁸, Gregory D. May⁹, Yeisoo Yu⁵, Tetsuya Sakurai, Taishi Umezawa, Madan K. Bhattacharyya¹⁰, Devinder Sandhu¹¹, Babu Valliyodan⁷, Erika Lindquist⁸, Myron Peto¹, David Grant¹, Shengqiang Shu⁸, David Goodstein⁸, Kerrie Barry⁸, Montona Futrell-Griggs², Brian Abernathy², Jianchang Du², Zhixi Tian², Liucun Zhu², Navdeep Gill², Trupti Joshi⁷, Marc Libault⁷, Ananad Sethuraman, Xue-Cheng Zhang⁷, Kazuo Shinozaki, Henry T. Nguyen⁷, Rod A. Wing⁵, Perry B. Cregan¹, James E. Specht¹², Jane Grimwood⁸, Daniel S. Rokhsar⁸, Gary Stacey⁷, Randy C. Shoemaker¹, Scott A. Jackson² - Show less +43 more•Institutions (12)

Agricultural Research Service¹, Purdue University², University of North Carolina at Charlotte³, University of California, Berkeley⁴, University of Arizona⁵, University of Maryland, College Park⁶, University of Missouri⁷, Joint Genome Institute⁸, National Center for Genome Resources⁹, Iowa State University¹⁰, University of Wisconsin–Stevens Point¹¹, University of Nebraska–Lincoln¹²

14 Jan 2010-Nature

TL;DR: An accurate soybean genome sequence will facilitate the identification of the genetic basis of many soybean traits, and accelerate the creation of improved soybean varieties.

...read moreread less

Abstract: Soybean (Glycine max) is one of the most important crop plants for seed protein and oil content, and for its capacity to fix atmospheric nitrogen through symbioses with soil-borne microorganisms. We sequenced the 1.1-gigabase genome by a whole-genome shotgun approach and integrated it with physical and high-density genetic maps to create a chromosome-scale draft sequence assembly. We predict 46,430 protein-coding genes, 70% more than Arabidopsis and similar to the poplar genome which, like soybean, is an ancient polyploid (palaeopolyploid). About 78% of the predicted genes occur in chromosome ends, which comprise less than one-half of the genome but account for nearly all of the genetic recombination. Genome duplications occurred at approximately 59 and 13 million years ago, resulting in a highly duplicated genome with nearly 75% of the genes present in multiple copies. The two duplication events were followed by gene diversification and loss, and numerous chromosome rearrangements. An accurate soybean genome sequence will facilitate the identification of the genetic basis of many soybean traits, and accelerate the creation of improved soybean varieties.

...read moreread less

3,743 citations

1
2
3
4
…
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
61
62
63
64
65
66
67
68
69
70
71
72
73
74
75
76
77
78
79
80
81
82
83
84
85
86
87
88
89
90
91
92
93
94
95
96
97
98
99
100
101
102
103
104
105
106
107
108
109
110
111
112
113
114
115
116
117
118
119
120
121
122
123
124
125
126
127
128
129
130
131
132
133
134
135
136
137
138
139
140
141
142
143
144
145
146
147
148
149
150
151
152
153
154
155
156
157
158
159
160
161
162
163
164
165
166
167
168
169
170
171
172
173
174
175
176
177
178
179
180
181
182
183
184
185
186
187
188
189
190
191
192
193
194
195
196
197
198
199
200

Collapse