scispace - formally typeset
Search or ask a question
Author

Myron Peto

Other affiliations: Agricultural Research Service
Bio: Myron Peto is an academic researcher from Iowa State University. The author has contributed to research in topics: Genome & Genome project. The author has an hindex of 6, co-authored 7 publications receiving 3365 citations. Previous affiliations of Myron Peto include Agricultural Research Service.

Papers
More filters
Journal ArticleDOI
14 Jan 2010-Nature
TL;DR: An accurate soybean genome sequence will facilitate the identification of the genetic basis of many soybean traits, and accelerate the creation of improved soybean varieties.
Abstract: Soybean (Glycine max) is one of the most important crop plants for seed protein and oil content, and for its capacity to fix atmospheric nitrogen through symbioses with soil-borne microorganisms. We sequenced the 1.1-gigabase genome by a whole-genome shotgun approach and integrated it with physical and high-density genetic maps to create a chromosome-scale draft sequence assembly. We predict 46,430 protein-coding genes, 70% more than Arabidopsis and similar to the poplar genome which, like soybean, is an ancient polyploid (palaeopolyploid). About 78% of the predicted genes occur in chromosome ends, which comprise less than one-half of the genome but account for nearly all of the genetic recombination. Genome duplications occurred at approximately 59 and 13 million years ago, resulting in a highly duplicated genome with nearly 75% of the genes present in multiple copies. The two duplication events were followed by gene diversification and loss, and numerous chromosome rearrangements. An accurate soybean genome sequence will facilitate the identification of the genetic basis of many soybean traits, and accelerate the creation of improved soybean varieties.

3,743 citations

Journal ArticleDOI
06 May 2010-Nature
TL;DR: A comparative genomics approach between soybean and maize is used to show that a single-base mutation in chromosome 19 accounts for the duplicate recessive epistasis needed to greatly reduce phytate production in soybean seed.
Abstract: Nature 463, 178–183 (2010) During resubmission of this work, a paper was published1 that used a comparative genomics approach between soybean and maize to show that a single-base mutation in chromosome 19 accounts for the duplicate recessive epistasis needed to greatly reduce phytate production in soybean seed.

42 citations

01 Jan 2010
TL;DR: A soybean whole-genome shotgun sequence, comprised of 950megabases (Mb) of assembled and anchored sequence, representing about 85% of the predicted genes, is reported here.
Abstract: Soybean (Glycine max) is one of the most important crop plants for seed protein and oil content, and for its capacity to fix atmospheric nitrogen through symbioses with soil-borne microorganisms. We sequenced the 1.1-gigabase genome by a whole-genome shotgun approach and integrated it with physical and high-density genetic maps to create a chromosome-scale draft sequence assembly. We predict 46,430 protein-coding genes, 70% more than Arabidopsis and similar to the poplar genome which, like soybean, is an ancient polyploid (palaeopolyploid). About 78% of the predicted genes occur in chromosome ends, which comprise less than one-half of the genome but account for nearly all of the genetic recombination. Genome duplications occurred at approximately 59 and 13 million years ago, resulting in a highly duplicated genome with nearly 75% of the genes present in multiple copies. The two duplication events were followed by gene diversification and loss, and numerous chromosome rearrangements. An accurate soybean genome sequence will facilitate the identification of the genetic basis of many soybean traits, and accelerate the creation of improved soybean varieties. Legumes are an important part of world agriculture as they fix atmospheric nitrogen by intimate symbioses with microorganisms. The soybeaninparticularisimportant worldwideasapredominant plant source of both animal feed protein and cooking oil. We report here a soybean whole-genome shotgun sequence of Glycine max var. Williams 82, comprised of 950megabases (Mb) of assembled and anchored sequence (Fig. 1), representing about 85% of the predicted

23 citations

Journal ArticleDOI
TL;DR: The transfer matrix method offers a superior way of generating all conformations within a given geometry on a lattice by completely avoiding attrition and reducing this highly complicated geometrical problem to a simple algebraic problem of matrix multiplication.
Abstract: We enumerated all compact conformations within simple geometries on the two-dimensional (2D) triangular and three-dimensional (3D) face centered cubic (fcc) lattice. These compact conformations correspond mathematically to Hamiltonian paths and Hamiltonian circuits and are frequently used as simple models of proteins. The shapes that were studied for the 2D triangular lattice included m×n parallelograms, regular equilateral triangles, and various hexagons. On the 3D fcc lattice we generated conformations for a limited class of skewed parallelepipeds. Symmetries of the shape were exploited to reduce the number of conformations. We compared surface to volume ratios against protein length for compact conformations on the 3D cubic lattice and for a selected set of real proteins. We also show preliminary work in extending the transfer matrix method, previously developed by us for the 2D square and the 3D cubic lattices, to the 2D triangular lattice. The transfer matrix method offers a superior way of generatin...

15 citations

Journal ArticleDOI
TL;DR: There is a marked difference in the designability between various protein shapes, with some of them accounting for a significantly larger share of the total foldable sequences.
Abstract: One important problem in computational structural biology is protein designability, that is, why protein sequences are not random strings of amino acids but instead show regular patterns that encode protein structures. Many previous studies that have attempted to solve the problem have relied upon reduced models of proteins. In particular, the 2D square and the 3D cubic lattices together with reduced amino acid alphabet models have been examined extensively and have lead to interesting results that shed some light on evolutionary relationship among proteins. Here we perform designability studies on the 2D square lattice and explore the effects of variable overall shapes on protein designability using a binary hydrophobic-polar (HP) amino acid alphabet. Because we rely on a simple energy function that counts the total number of H-H interactions between non-sequential residues, we restrict our studies to protein shapes that have the same number of residues and also a constant number of non-bonded contacts. We have found that there is a marked difference in the designability between various protein shapes, with some of them accounting for a significantly larger share of the total foldable sequences.

9 citations


Cited by
More filters
Journal ArticleDOI
TL;DR: Machine learning addresses many of the same research questions as the fields of statistics, data mining, and psychology, but with differences of emphasis.
Abstract: Machine Learning is the study of methods for programming computers to learn. Computers are applied to a wide range of tasks, and for most of these it is relatively easy for programmers to design and implement the necessary software. However, there are many tasks for which this is difficult or impossible. These can be divided into four general categories. First, there are problems for which there exist no human experts. For example, in modern automated manufacturing facilities, there is a need to predict machine failures before they occur by analyzing sensor readings. Because the machines are new, there are no human experts who can be interviewed by a programmer to provide the knowledge necessary to build a computer system. A machine learning system can study recorded data and subsequent machine failures and learn prediction rules. Second, there are problems where human experts exist, but where they are unable to explain their expertise. This is the case in many perceptual tasks, such as speech recognition, hand-writing recognition, and natural language understanding. Virtually all humans exhibit expert-level abilities on these tasks, but none of them can describe the detailed steps that they follow as they perform them. Fortunately, humans can provide machines with examples of the inputs and correct outputs for these tasks, so machine learning algorithms can learn to map the inputs to the outputs. Third, there are problems where phenomena are changing rapidly. In finance, for example, people would like to predict the future behavior of the stock market, of consumer purchases, or of exchange rates. These behaviors change frequently, so that even if a programmer could construct a good predictive computer program, it would need to be rewritten frequently. A learning program can relieve the programmer of this burden by constantly modifying and tuning a set of learned prediction rules. Fourth, there are applications that need to be customized for each computer user separately. Consider, for example, a program to filter unwanted electronic mail messages. Different users will need different filters. It is unreasonable to expect each user to program his or her own rules, and it is infeasible to provide every user with a software engineer to keep the rules up-to-date. A machine learning system can learn which mail messages the user rejects and maintain the filtering rules automatically. Machine learning addresses many of the same research questions as the fields of statistics, data mining, and psychology, but with differences of emphasis. Statistics focuses on understanding the phenomena that have generated the data, often with the goal of testing different hypotheses about those phenomena. Data mining seeks to find patterns in the data that are understandable by people. Psychological studies of human learning aspire to understand the mechanisms underlying the various learning behaviors exhibited by people (concept learning, skill acquisition, strategy change, etc.).

13,246 citations

Journal ArticleDOI
TL;DR: Phytozome provides a view of the evolutionary history of every plant gene at the level of sequence, gene structure, gene family and genome organization, while at the same time providing access to the sequences and functional annotations of a growing number of complete plant genomes.
Abstract: The number of sequenced plant genomes and associated genomic resources is growing rapidly with the advent of both an increased focus on plant genomics from funding agencies, and the application of inexpensive next generation sequencing. To interact with this increasing body of data, we have developed Phytozome (http://www.phytozome.net), a comparative hub for plant genome and gene family data and analysis. Phytozome provides a view of the evolutionary history of every plant gene at the level of sequence, gene structure, gene family and genome organization, while at the same time providing access to the sequences and functional annotations of a growing number (currently 25) of complete plant genomes, including all the land plants and selected algae sequenced at the Joint Genome Institute, as well as selected species sequenced elsewhere. Through a comprehensive plant genome database and web portal, these data and analyses are available to the broader plant science research community, providing powerful comparative genomics tools that help to link model systems with other plants of economic and ecological importance.

3,728 citations

Journal ArticleDOI
TL;DR: The MCScanX toolkit implements an adjusted MCScan algorithm for detection of synteny and collinearity that extends the original software by incorporating 14 utility programs for visualization of results and additional downstream analyses.
Abstract: MCScan is an algorithm able to scan multiple genomes or subgenomes in order to identify putative homologous chromosomal regions, and align these regions using genes as anchors. The MCScanX toolkit implements an adjusted MCScan algorithm for detection of synteny and collinearity that extends the original software by incorporating 14 utility programs for visualization of results and additional downstream analyses. Applications of MCScanX to several sequenced plant genomes and gene families are shown as examples. MCScanX can be used to effectively analyze chromosome structural changes, and reveal the history of gene family expansions that might contribute to the adaptation of lineages and taxa. An integrated view of various modes of gene duplication can supplement the traditional gene tree analysis in specific families. The source code and documentation of MCScanX are freely available at http://chibba.pgml.uga.edu/mcscan2/.

3,388 citations

Journal ArticleDOI
TL;DR: It is becoming clear that a single WRKY transcription factor might be involved in regulating several seemingly disparate processes, and that members of the family play roles in both the repression and de-repression of important plant processes.

1,967 citations

Journal ArticleDOI
Boulos Chalhoub1, Shengyi Liu2, Isobel A. P. Parkin3, Haibao Tang4, Haibao Tang5, Xiyin Wang6, Julien Chiquet1, Harry Belcram1, Chaobo Tong2, Birgit Samans7, Margot Correa8, Corinne Da Silva8, Jérémy Just1, Cyril Falentin9, Chu Shin Koh10, Isabelle Le Clainche1, Maria Bernard8, Pascal Bento8, Benjamin Noel8, Karine Labadie8, Adriana Alberti8, Mathieu Charles9, Dominique Arnaud1, Hui Guo6, Christian Daviaud, Salman Alamery11, Kamel Jabbari12, Kamel Jabbari1, Meixia Zhao13, Patrick P. Edger14, Houda Chelaifa1, David C. Tack15, Gilles Lassalle9, Imen Mestiri1, Nicolas Schnel9, Marie-Christine Le Paslier9, Guangyi Fan, Victor Renault16, Philippe E. Bayer11, Agnieszka A. Golicz11, Sahana Manoli11, Tae-Ho Lee6, Vinh Ha Dinh Thi1, Smahane Chalabi1, Qiong Hu2, Chuchuan Fan17, Reece Tollenaere11, Yunhai Lu1, Christophe Battail8, Jinxiong Shen17, Christine Sidebottom10, Xinfa Wang2, Aurélie Canaguier1, Aurélie Chauveau9, Aurélie Bérard9, G. Deniot9, Mei Guan18, Zhongsong Liu18, Fengming Sun, Yong Pyo Lim19, Eric Lyons20, Christopher D. Town5, Ian Bancroft21, Xiaowu Wang, Jinling Meng17, Jianxin Ma13, J. Chris Pires22, Graham J.W. King23, Dominique Brunel9, Régine Delourme9, Michel Renard9, Jean-Marc Aury8, Keith L. Adams15, Jacqueline Batley24, Jacqueline Batley11, Rod J. Snowdon7, Jörg Tost, David Edwards11, David Edwards24, Yongming Zhou17, Wei Hua2, Andrew G. Sharpe10, Andrew H. Paterson6, Chunyun Guan18, Patrick Wincker25, Patrick Wincker1, Patrick Wincker8 
22 Aug 2014-Science
TL;DR: The polyploid genome of Brassica napus, which originated from a recent combination of two distinct genomes approximately 7500 years ago and gave rise to the crops of rape oilseed, is sequenced.
Abstract: Oilseed rape (Brassica napus L.) was formed ~7500 years ago by hybridization between B. rapa and B. oleracea, followed by chromosome doubling, a process known as allopolyploidy. Together with more ancient polyploidizations, this conferred an aggregate 72× genome multiplication since the origin of angiosperms and high gene content. We examined the B. napus genome and the consequences of its recent duplication. The constituent An and Cn subgenomes are engaged in subtle structural, functional, and epigenetic cross-talk, with abundant homeologous exchanges. Incipient gene loss and expression divergence have begun. Selection in B. napus oilseed types has accelerated the loss of glucosinolate genes, while preserving expansion of oil biosynthesis genes. These processes provide insights into allopolyploid evolution and its relationship with crop domestication and improvement.

1,743 citations