scispace - formally typeset
Search or ask a question
Author

Hiroaki Sakai

Bio: Hiroaki Sakai is an academic researcher from University of Tsukuba. The author has contributed to research in topics: Genome & Gene. The author has an hindex of 21, co-authored 40 publications receiving 4869 citations. Previous affiliations of Hiroaki Sakai include National Agriculture and Food Research Organization & National Institute of Advanced Industrial Science and Technology.
Topics: Genome, Gene, Genomics, Whole genome sequencing, Vigna

Papers
More filters
Journal ArticleDOI
06 Feb 2013-Rice
TL;DR: A revised, error-corrected, and validated assembly of the Nipponbare cultivar of rice was generated using optical map data, re-sequencing data, and manual curation that will facilitate on-going and future research in rice.
Abstract: Rice research has been enabled by access to the high quality reference genome sequence generated in 2005 by the International Rice Genome Sequencing Project (IRGSP). To further facilitate genomic-enabled research, we have updated and validated the genome assembly and sequence for the Nipponbare cultivar of Oryza sativa (japonica group). The Nipponbare genome assembly was updated by revising and validating the minimal tiling path of clones with the optical map for rice. Sequencing errors in the revised genome assembly were identified by re-sequencing the genome of two different Nipponbare individuals using the Illumina Genome Analyzer II/IIx platform. A total of 4,886 sequencing errors were identified in 321 Mb of the assembled genome indicating an error rate in the original IRGSP assembly of only 0.15 per 10,000 nucleotides. A small number (five) of insertions/deletions were identified using longer reads generated using the Roche 454 pyrosequencing platform. As the re-sequencing data were generated from two different individuals, we were able to identify a number of allelic differences between the original individual used in the IRGSP effort and the two individuals used in the re-sequencing effort. The revised assembly, termed Os-Nipponbare-Reference-IRGSP-1.0, is now being used in updated releases of the Rice Annotation Project and the Michigan State University Rice Genome Annotation Project, thereby providing a unified set of pseudomolecules for the rice community. A revised, error-corrected, and validated assembly of the Nipponbare cultivar of rice was generated using optical map data, re-sequencing data, and manual curation that will facilitate on-going and future research in rice. Detection of polymorphisms between three different Nipponbare individuals highlights that allelic differences between individuals should be considered in diversity studies.

1,551 citations

Journal ArticleDOI
Klaus F. X. Mayer, Jane Rogers, Jaroslav Doležel1, Curtis J. Pozniak2, Kellye Eversole, Catherine Feuillet3, Bikram S. Gill4, Bernd Friebe4, Adam J. Lukaszewski5, Pierre Sourdille6, Takashi R. Endo7, M. Kubaláková1, Jarmila Číhalíková1, Zdeňka Dubská1, Jan Vrána1, Romana Šperková1, Hana Šimková1, Melanie Febrer8, Leah Clissold, Kirsten McLay, Kuldeep Singh9, Parveen Chhuneja9, Nagendra K. Singh10, Jitendra P. Khurana11, Eduard Akhunov4, Frédéric Choulet6, Adriana Alberti, Valérie Barbe, Patrick Wincker, Hiroyuki Kanamori12, Fuminori Kobayashi12, Takeshi Itoh12, Takashi Matsumoto12, Hiroaki Sakai12, Tsuyoshi Tanaka12, Jianzhong Wu12, Yasunari Ogihara13, Hirokazu Handa12, P. Ron Maclachlan2, Andrew G. Sharpe14, Darrin Klassen14, David Edwards, Jacqueline Batley, Odd-Arne Olsen, Simen Rød Sandve15, Sigbjørn Lien15, Burkhard Steuernagel16, Brande B. H. Wulff16, Mario Caccamo, Sarah Ayling, Ricardo H. Ramirez-Gonzalez, Bernardo J. Clavijo, Jonathan M. Wright, Matthias Pfeifer, Manuel Spannagl, Mihaela Martis, Martin Mascher17, Jarrod Chapman18, Jesse Poland4, Uwe Scholz17, Kerrie Barry18, Robbie Waugh19, Daniel S. Rokhsar18, Gary J. Muehlbauer, Nils Stein17, Heidrun Gundlach, Matthias Zytnicki20, Véronique Jamilloux20, Hadi Quesneville20, Thomas Wicker21, Primetta Faccioli, Moreno Colaiacovo, Antonio Michele Stanca, Hikmet Budak22, Luigi Cattivelli, Natasha Glover6, Lise Pingault6, Etienne Paux6, Sapna Sharma, Rudi Appels23, Matthew I. Bellgard23, Brett Chapman23, Thomas Nussbaumer, Kai Christian Bader, Hélène Rimbert, Shichen Wang4, Ron Knox, Andrzej Kilian, Michael Alaux20, Françoise Alfama20, Loïc Couderc20, Nicolas Guilhot6, Claire Viseux20, Mikaël Loaec20, Beat Keller21, Sébastien Praud 
18 Jul 2014-Science
TL;DR: Insight into the genome biology of a polyploid crop provide a springboard for faster gene isolation, rapid genetic marker development, and precise breeding to meet the needs of increasing food demand worldwide.
Abstract: An ordered draft sequence of the 17-gigabase hexaploid bread wheat (Triticum aestivum) genome has been produced by sequencing isolated chromosome arms. We have annotated 124,201 gene loci distributed nearly evenly across the homeologous chromosomes and subgenomes. Comparative gene analysis of wheat subgenomes and extant diploid and tetraploid wheat relatives showed that high sequence similarity and structural conservation are retained, with limited gene loss, after polyploidization. However, across the genomes there was evidence of dynamic gene gain, loss, and duplication since the divergence of the wheat lineages. A high degree of transcriptional autonomy and no global dominance was found for the subgenomes. These insights into the genome biology of a polyploid crop provide a springboard for faster gene isolation, rapid genetic marker development, and precise breeding to meet the needs of increasing food demand worldwide.

1,421 citations

Journal ArticleDOI
TL;DR: The Rice Annotation Project Database (RAP-DB, http://rapdb.dna.go.jp/) has been providing a comprehensive set of gene annotations for the genome sequence of rice, Oryza sativa (japonica group) cv.
Abstract: The Rice Annotation Project Database (RAP-DB, http://rapdb.dna.affrc.go.jp/) has been providing a comprehensive set of gene annotations for the genome sequence of rice, Oryza sativa (japonica group) cv. Nipponbare. Since the first release in 2005, RAP-DB has been updated several times along with the genome assembly updates. Here, we present our newest RAP-DB based on the latest genome assembly, Os-Nipponbare-Reference-IRGSP-1.0 (IRGSP-1.0), which was released in 2011. We detected 37,869 loci by mapping transcript and protein sequences of 150 monocot species. To provide plant researchers with highly reliable and up to date rice gene annotations, we have been incorporating literature-based manually curated data, and 1,626 loci currently incorporate literature-based annotation data, including commonly used gene names or gene symbols. Transcriptional activities are shown at the nucleotide level by mapping RNA-Seq reads derived from 27 samples. We also mapped the Illumina reads of a Japanese leading japonica cultivar, Koshihikari, and a Chinese indica cultivar, Guangluai-4, to the genome and show alignments together with the single nucleotide polymorphisms (SNPs) and gene functional annotations through a newly developed browser, Short-Read Assembly Browser (S-RAB). We have developed two satellite databases, Plant Gene Family Database (PGFD) and Integrative Database of Cereal Gene Phylogeny (IDCGP), which display gene family and homologous gene relationships among diverse plant species. RAP-DB and the satellite databases offer simple and user-friendly web interfaces, enabling plant and genome researchers to access the data easily and facilitating a broad range of plant research topics.

547 citations

Journal ArticleDOI
TL;DR: It is shown that the barley (H) genome displays a mosaic of structural similarity to hexaploid bread wheat (Triticum aestivum) A, B, and D subgenomes and that orthologous genes in different grasses exhibit signatures of positive selection in different lineages.
Abstract: We used a novel approach that incorporated chromosome sorting, next-generation sequencing, array hybridization, and systematic exploitation of conserved synteny with model grasses to assign; similar to 86% of the estimated; similar to 32,000 barley (Hordeum vulgare) genes to individual chromosome arms. Using a series of bioinformatically constructed genome zippers that integrate gene indices of rice (Oryza sativa), sorghum (Sorghum bicolor), and Brachypodium distachyon in a conserved synteny model, we were able to assemble 21,766 barley genes in a putative linear order. We show that the barley (H) genome displays a mosaic of structural similarity to hexaploid bread wheat (Triticum aestivum) A, B, and D subgenomes and that orthologous genes in different grasses exhibit signatures of positive selection in different lineages. We present an ordered, information-rich scaffold of the barley genome that provides a valuable and robust framework for the development of novel strategies in cereal breeding.

448 citations

Journal ArticleDOI
Tsuyoshi Tanaka1, Baltazar A. Antonio1, Shoshi Kikuchi1, Takashi Matsumoto1, Yoshiaki Nagamura1, Hisataka Numa1, Hiroaki Sakai1, Jianzhong Wu1, Takeshi Itoh1, Takeshi Itoh2, Takuji Sasaki1, Ryo Aono, Yasuyuki Fujii3, Takuya Habara, Erimi Harada, Masako Kanno, Yoshihiro Kawahara4, Hiroaki Kawashima, Hiromi Kubooka, Akihiro Matsuya, Hajime Nakaoka, Naomi Saichi, Ryoko Sanbonmatsu, Yoshiharu Sato, Yuji Shinso, Mami Suzuki, Jun-ichi Takeda, Motohiko Tanino, Fusano Todokoro, Kaori Yamaguchi, Naoyuki Yamamoto, Chisato Yamasaki, Tadashi Imanishi2, Toshihisa Okido, Masahito Tada, Kazuho Ikeo, Yoshio Tateno, Takashi Gojobori, Yao-Cheng Lin5, Fu Jin Wei5, Yue-Ie C. Hsing5, Qiang Zhao, Bin Han, Melissa Kramer6, Richard W. McCombie6, David Lonsdale7, Claire O'Donovan7, Eleanor J. Whitfield7, Rolf Apweiler7, Kanako O. Koyanagi8, Jitendra P. Khurana9, Saurabh Raghuvanshi9, Nagendra K. Singh10, Akhilesh K. Tyagi9, Georg Haberer, Masaki Fujisawa, Satomi Hosokawa, Yukiyo Ito, Hiroshi Ikawa, Michie Shibata, Mayu Yamamoto, Richard Bruskiewich11, Douglas R. Hoen12, Thomas E. Bureau12, Nobukazu Namiki13, Hajime Ohyanagi13, Yasumichi Sakai13, Satoshi Nobushima13, Katsumi Sakata13, Roberto A. Barrero14, Yutaka Sato15, Alexandre Souvorov16, Brian Smith-White16, Tatiana Tatusova16, Suyoung An17, Gynheung An17, Satoshi Oota, Galina Fuks18, Joachim Messing, Karen R. Christie19, Damien Lieberherr20, Hyeran Kim21, Andrea Zuccolo21, Rod A. Wing, Kan Nobuta22, Pamela J. Green22, Cheng Lu22, Blake C. Meyers22, Cristian Chaparro23, Benoît Piégu23, Olivier Panaud23, Manuel Echeverria23 
TL;DR: The latest version of the RAP-DB contains a variety of annotation data as follows: clone positions, structures and functions of 31 439 genes validated by cDNAs, RNA genes detected by massively parallel signature sequencing (MPSS) technology and sequence similarity, flanking sequences of mutant lines, transposable elements, etc.
Abstract: The Rice Annotation Project Database (RAP-DB) was created to provide the genome sequence assembly of the International Rice Genome Sequencing Project (IRGSP), manually curated annotation of the sequence, and other genomics information that could be useful for comprehensive understanding of the rice biology. Since the last publication of the RAP-DB, the IRGSP genome has been revised and reassembled. In addition, a large number of rice-expressed sequence tags have been released, and functional genomics resources have been produced worldwide. Thus, we have thoroughly updated our genome annotation by manual curation of all the functional descriptions of rice genes. The latest version of the RAP-DB contains a variety of annotation data as follows: clone positions, structures and functions of 31 439 genes validated by cDNAs, RNA genes detected by massively parallel signature sequencing (MPSS) technology and sequence similarity, flanking sequences of mutant lines, transposable elements, etc. Other annotation data such as Gnomon can be displayed along with those of RAP for comparison. We have also developed a new keyword search system to allow the user to access useful information. The RAP-DB is available at: http://rapdb.dna.affrc.go.jp/ and http://rapdb.lab.nig.ac.jp/.

342 citations


Cited by
More filters
Journal ArticleDOI
TL;DR: This protocol begins with raw sequencing reads and produces a transcriptome assembly, lists of differentially expressed and regulated genes and transcripts, and publication-quality visualizations of analysis results, which takes less than 1 d of computer time for typical experiments and ∼1 h of hands-on time.
Abstract: Recent advances in high-throughput cDNA sequencing (RNA-seq) can reveal new genes and splice variants and quantify expression genome-wide in a single assay. The volume and complexity of data from RNA-seq experiments necessitate scalable, fast and mathematically principled analysis software. TopHat and Cufflinks are free, open-source software tools for gene discovery and comprehensive expression analysis of high-throughput mRNA sequencing (RNA-seq) data. Together, they allow biologists to identify new genes and new splice variants of known ones, as well as compare gene and transcript expression under two or more conditions. This protocol describes in detail how to use TopHat and Cufflinks to perform such analyses. It also covers several accessory tools and utilities that aid in managing data, including CummeRbund, a tool for visualizing RNA-seq analysis results. Although the procedure assumes basic informatics skills, these tools assume little to no background with RNA-seq analysis and are meant for novices and experts alike. The protocol begins with raw sequencing reads and produces a transcriptome assembly, lists of differentially expressed and regulated genes and transcripts, and publication-quality visualizations of analysis results. The protocol's execution time depends on the volume of transcriptome sequencing data and available computing resources but takes less than 1 d of computer time for typical experiments and ∼1 h of hands-on time.

10,913 citations

Journal ArticleDOI
29 Jan 2009-Nature
TL;DR: An initial analysis of the ∼730-megabase Sorghum bicolor (L.) Moench genome is presented, placing ∼98% of genes in their chromosomal context using whole-genome shotgun sequence validated by genetic, physical and syntenic information.
Abstract: Sorghum, an African grass related to sugar cane and maize, is grown for food, feed, fibre and fuel. We present an initial analysis of the approximately 730-megabase Sorghum bicolor (L.) Moench genome, placing approximately 98% of genes in their chromosomal context using whole-genome shotgun sequence validated by genetic, physical and syntenic information. Genetic recombination is largely confined to about one-third of the sorghum genome with gene order and density similar to those of rice. Retrotransposon accumulation in recombinationally recalcitrant heterochromatin explains the approximately 75% larger genome size of sorghum compared with rice. Although gene and repetitive DNA distributions have been preserved since palaeopolyploidization approximately 70 million years ago, most duplicated gene sets lost one member before the sorghum-rice divergence. Concerted evolution makes one duplicated chromosomal segment appear to be only a few million years old. About 24% of genes are grass-specific and 7% are sorghum-specific. Recent gene and microRNA duplications may contribute to sorghum's drought tolerance.

2,809 citations

Journal ArticleDOI
Rudi Appels1, Rudi Appels2, Kellye Eversole, Nils Stein3  +204 moreInstitutions (45)
17 Aug 2018-Science
TL;DR: This annotated reference sequence of wheat is a resource that can now drive disruptive innovation in wheat improvement, as this community resource establishes the foundation for accelerating wheat research and application through improved understanding of wheat biology and genomics-assisted breeding.
Abstract: An annotated reference sequence representing the hexaploid bread wheat genome in 21 pseudomolecules has been analyzed to identify the distribution and genomic context of coding and noncoding elements across the A, B, and D subgenomes. With an estimated coverage of 94% of the genome and containing 107,891 high-confidence gene models, this assembly enabled the discovery of tissue- and developmental stage-related coexpression networks by providing a transcriptome atlas representing major stages of wheat development. Dynamics of complex gene families involved in environmental adaptation and end-use quality were revealed at subgenome resolution and contextualized to known agronomic single-gene or quantitative trait loci. This community resource establishes the foundation for accelerating wheat research and application through improved understanding of wheat biology and genomics-assisted breeding.

2,118 citations

Journal ArticleDOI
TL;DR: The InterPro database integrates together predictive models or ‘signatures’ representing protein domains, families and functional sites from multiple, diverse source databases: Gene3D, PANTHER, Pfam, PIRSF, PRINTS, ProDom, PROSITE, SMART, SUPERFAMILY and TIGRFAMs.
Abstract: The InterPro database (http://www.ebi.ac.uk/interpro/) integrates together predictive models or 'signatures' representing protein domains, families and functional sites from multiple, diverse source databases: Gene3D, PANTHER, Pfam, PIRSF, PRINTS, ProDom, PROSITE, SMART, SUPERFAMILY and TIGRFAMs. Integration is performed manually and approximately half of the total approximately 58,000 signatures available in the source databases belong to an InterPro entry. Recently, we have started to also display the remaining un-integrated signatures via our web interface. Other developments include the provision of non-signature data, such as structural data, in new XML files on our FTP site, as well as the inclusion of matchless UniProtKB proteins in the existing match XML files. The web interface has been extended and now links out to the ADAN predicted protein-protein interaction database and the SPICE and Dasty viewers. The latest public release (v18.0) covers 79.8% of UniProtKB (v14.1) and consists of 16 549 entries. InterPro data may be accessed either via the web address above, via web services, by downloading files by anonymous FTP or by using the InterProScan search software (http://www.ebi.ac.uk/Tools/InterProScan/).

1,834 citations

Journal ArticleDOI
John P. Vogel1, David F. Garvin2, Todd C. Mockler2, Jeremy Schmutz, Daniel S. Rokhsar3, Michael W. Bevan4, Kerrie Barry5, Susan Lucas5, Miranda Harmon-Smith5, Kathleen Lail5, Hope Tice5, Jane Grimwood, Neil McKenzie4, Naxin Huo6, Yong Q. Gu6, Gerard R. Lazo6, Olin D. Anderson6, Frank M. You7, Ming-Cheng Luo7, Jan Dvorak7, Jonathan M. Wright4, Melanie Febrer4, Dominika Idziak8, Robert Hasterok8, Erika Lindquist5, Mei Wang5, Samuel E. Fox2, Henry D. Priest2, Sergei A. Filichkin2, Scott A. Givan2, Douglas W. Bryant2, Jeff H. Chang2, Haiyan Wu9, Wei Wu10, An-Ping Hsia10, Patrick S. Schnable9, Anantharaman Kalyanaraman11, Brad Barbazuk12, Todd P. Michael, Samuel P. Hazen13, Jennifer N. Bragg6, Debbie Laudencia-Chingcuanco6, Yiqun Weng14, Georg Haberer, Manuel Spannagl, Klaus F. X. Mayer, Thomas Rattei15, Therese Mitros3, Sang-Jik Lee16, Jocelyn K. C. Rose16, Lukas A. Mueller16, Thomas L. York16, Thomas Wicker17, Jan P. Buchmann17, Jaakko Tanskanen18, Alan H. Schulman18, Heidrun Gundlach, Michael W. Bevan4, Antonio Costa de Oliveira19, Luciano da C. Maia19, William R. Belknap6, Ning Jiang, Jinsheng Lai9, Liucun Zhu20, Jianxin Ma20, Cheng Sun21, Ellen J. Pritham21, Jérôme Salse, Florent Murat, Michael Abrouk, Rémy Bruggmann, Joachim Messing, Noah Fahlgren2, Christopher M. Sullivan2, James C. Carrington2, Elisabeth J. Chapman, Greg D. May22, Jixian Zhai23, Matthias Ganssmann23, Sai Guna Ranjan Gurazada23, Marcelo A German23, Blake C. Meyers23, Pamela J. Green23, Ludmila Tyler3, Jiajie Wu7, James A. Thomson6, Shan Chen13, Henrik Vibe Scheller24, Jesper Harholt25, Peter Ulvskov25, Jeffrey A. Kimbrel2, Laura E. Bartley24, Peijian Cao24, Ki-Hong Jung26, Manoj Sharma24, Miguel E. Vega-Sánchez24, Pamela C. Ronald24, Chris Dardick6, Stefanie De Bodt27, Wim Verelst27, Dirk Inzé27, Maren Heese28, Arp Schnittger28, Xiaohan Yang29, Udaya C. Kalluri29, Gerald A. Tuskan29, Zhihua Hua14, Richard D. Vierstra14, Yu Cui9, Shuhong Ouyang9, Qixin Sun9, Zhiyong Liu9, Alper Yilmaz30, Erich Grotewold30, Richard Sibout31, Kian Hématy31, Grégory Mouille31, Herman Höfte31, Todd P. Michael, Jérôme Pelloux32, Devin O'Connor3, James C. Schnable3, Scott C. Rowe3, Frank G. Harmon3, Cynthia L. Cass33, John C. Sedbrook33, Mary E. Byrne4, Sean Walsh4, Janet Higgins4, Pinghua Li16, Thomas P. Brutnell16, Turgay Unver34, Hikmet Budak34, Harry Belcram, Mathieu Charles, Boulos Chalhoub, Ivan Baxter35 
11 Feb 2010-Nature
TL;DR: The high-quality genome sequence will help Brachypodium reach its potential as an important model system for developing new energy and food crops and establishes a template for analysis of the large genomes of economically important pooid grasses such as wheat.
Abstract: Three subfamilies of grasses, the Ehrhartoideae, Panicoideae and Pooideae, provide the bulk of human nutrition and are poised to become major sources of renewable energy. Here we describe the genome sequence of the wild grass Brachypodium distachyon (Brachypodium), which is, to our knowledge, the first member of the Pooideae subfamily to be sequenced. Comparison of the Brachypodium, rice and sorghum genomes shows a precise history of genome evolution across a broad diversity of the grasses, and establishes a template for analysis of the large genomes of economically important pooid grasses such as wheat. The high-quality genome sequence, coupled with ease of cultivation and transformation, small size and rapid life cycle, will help Brachypodium reach its potential as an important model system for developing new energy and food crops.

1,603 citations