Author
Giovana Augusta Torres
Other affiliations: University of Wisconsin-Madison
Bio: Giovana Augusta Torres is an academic researcher from Universidade Federal de Lavras. The author has contributed to research in topics: Ploidy & Genome. The author has an hindex of 12, co-authored 30 publications receiving 2216 citations. Previous affiliations of Giovana Augusta Torres include University of Wisconsin-Madison.
Topics: Ploidy, Genome, Satellite DNA, Meiosis, Piper aduncum
Papers
More filters
••
Beijing Institute of Genomics1, Cayetano Heredia University2, Indian Council of Agricultural Research3, Russian Academy of Sciences4, University of Dundee5, Huazhong Agricultural University6, Hunan Agricultural University7, Imperial College London8, Polish Academy of Sciences9, International Potato Center10, J. Craig Venter Institute11, National University of La Plata12, Michigan State University13, James Hutton Institute14, Teagasc15, Plant & Food Research16, Aalborg University17, University of Wisconsin-Madison18, Virginia Tech19, Wageningen University and Research Centre20
TL;DR: The potato genome sequence provides a platform for genetic improvement of this vital crop and predicts 39,031 protein-coding genes and presents evidence for at least two genome duplication events indicative of a palaeopolyploid origin.
Abstract: Potato (Solanum tuberosum L.) is the world's most important non-grain food crop and is central to global food security. It is clonally propagated, highly heterozygous, autotetraploid, and suffers acute inbreeding depression. Here we use a homozygous doubled-monoploid potato clone to sequence and assemble 86% of the 844-megabase genome. We predict 39,031 protein-coding genes and present evidence for at least two genome duplication events indicative of a palaeopolyploid origin. As the first genome sequence of an asterid, the potato genome reveals 2,642 genes specific to this large angiosperm clade. We also sequenced a heterozygous diploid clone and show that gene presence/absence variants and other potentially deleterious mutations occur frequently and are a likely cause of inbreeding depression. Gene family expansion, tissue-specific expression and recruitment of genes to new pathways contributed to the evolution of tuber development. The potato genome sequence provides a platform for genetic improvement of this vital crop.
1,813 citations
••
TL;DR: The presence of two distinct types of centromeres, coupled with the boom-and-bust cycles of centromeric satellite repeats in Solanum species, suggests that repeat-based centromere evolution can rapidly evolve from neocentromeres by de novo amplification and insertion of satellite repeat repeats in the CENH3 domains.
Abstract: Centromeres in most higher eukaryotes are composed of long arrays of satellite repeats. By contrast, most newly formed centromeres (neocentromeres) do not contain satellite repeats and instead include DNA sequences representative of the genome. An unknown question in centromere evolution is how satellite repeat-based centromeres evolve from neocentromeres. We conducted a genome-wide characterization of sequences associated with CENH3 nucleosomes in potato (Solanum tuberosum). Five potato centromeres (Cen4, Cen6, Cen10, Cen11, and Cen12) consisted primarily of single- or low-copy DNA sequences. No satellite repeats were identified in these five centromeres. At least one transcribed gene was associated with CENH3 nucleosomes. Thus, these five centromeres structurally resemble neocentromeres. By contrast, six potato centromeres (Cen1, Cen2, Cen3, Cen5, Cen7, and Cen8) contained megabase-sized satellite repeat arrays that are unique to individual centromeres. The satellite repeat arrays likely span the entire functional cores of these six centromeres. At least four of the centromeric repeats were amplified from retrotransposon-related sequences and were not detected in Solanum species closely related to potato. The presence of two distinct types of centromeres, coupled with the boom-and-bust cycles of centromeric satellite repeats in Solanum species, suggests that repeat-based centromeres can rapidly evolve from neocentromeres by de novo amplification and insertion of satellite repeats in the CENH3 domains.
193 citations
••
TL;DR: It is demonstrated that the oligo-based FISH techniques are powerful new tools for chromosome identification and karyotyping research, especially for nonmodel plant species.
Abstract: Developing the karyotype of a eukaryotic species relies on identification of individual chromosomes, which has been a major challenge for most nonmodel plant and animal species. We developed a novel chromosome identification system by selecting and labeling oligonucleotides (oligos) located in specific regions on every chromosome. We selected a set of 54,672 oligos (45 nt) based on single copy DNA sequences in the potato genome. These oligos generated 26 distinct FISH signals that can be used as a "bar code" or "banding pattern" to uniquely label each of the 12 chromosomes from both diploid and polyploid (4× and 6×) potato species. Remarkably, the same bar code can be used to identify the 12 homeologous chromosomes among distantly related Solanum species, including tomato and eggplant. Accurate karyotypes based on individually identified chromosomes were established in six Solanum species that have diverged for >15 MY. These six species have maintained a similar karyotype; however, modifications to the FISH signal bar code led to the discovery of two reciprocal chromosomal translocations in Solanum etuberosum and S. caripense We also validated these translocations by oligo-based chromosome painting. We demonstrate that the oligo-based FISH techniques are powerful new tools for chromosome identification and karyotyping research, especially for nonmodel plant species.
95 citations
••
TL;DR: It is concluded that the CL34 repeat family emerged recently from the subtelomeric regions of potato chromosomes and is rapidly evolving, providing further evidence that subtelomersic domains are among the most dynamic regions in eukaryotic genomes.
Abstract: Subtelomeric domains immediately adjacent to telomeres represent one of the most dynamic and rapidly evolving regions in eukaryotic genomes. A common feature associated with subtelomeric regions in different eukaryotes is the presence of long arrays of tandemly repeated satellite sequences. However, studies on molecular organization and evolution of subtelomeric repeats are rare. We isolated two subtelomeric repeats, CL14 and CL34, from potato (Solanum tuberosum). The CL14 and CL34 repeats are organized as independent long arrays, up to 1-3 Mb, of 182 bp and 339 bp monomers, respectively. The CL14 and CL34 repeat arrays are directly connected with the telomeric repeats at some chromosomal ends. The CL14 repeat was detected at the subtelomeric regions among highly diverged Solanum species, including tomato (Solanum lycopersicum). In contrast, CL34 was only found in potato and its closely related species. Interestingly, the CL34 repeat array was always proximal to the telomeres when both CL14 and CL34 were found at the same chromosomal end. In addition, the CL34 repeat family showed more sequence variability among monomers compared with the CL14 repeat family. We conclude that the CL34 repeat family emerged recently from the subtelomeric regions of potato chromosomes and is rapidly evolving. These results provide further evidence that subtelomeric domains are among the most dynamic regions in eukaryotic genomes.
74 citations
••
TL;DR: The sequence comparison of five homoeologous centromeres in two Solanum species reveals rapid divergence of centromeric sequences among closely related species, supporting the idea thatCentromeric satellite repeats undergo boom-bust cycles before a favorable repeat is fixed in the population.
Abstract: Centromeres are composed of long arrays of satellite repeats in most multicellular eukaryotes investigated to date. The satellite repeat–based centromeres are believed to have evolved from “neocentromeres” that originally contained only single- or low-copy sequences. However, the emergence and evolution of the satellite repeats in centromeres has been elusive. Potato (Solanum tuberosum) provides a model system for studying centromere evolution because each of its 12 centromeres contains distinct DNA sequences, allowing comparative analysis of homoeologous centromeres from related species. We conducted genome-wide analysis of the centromeric sequences in Solanum verrucosum, a wild species closely related to potato. Unambiguous homoeologous centromeric sequences were detected in only a single centromere (Cen9) between the two species. Four centromeres (Cen2, Cen4, Cen7, and Cen10) in S. verrucosum contained distinct satellite repeats that were amplified from retrotransposon-related sequences. Strikingly, the same four centromeres in potato contain either different satellite repeats (Cen2 and Cen7) or exclusively single- and low-copy sequences (Cen4 and Cen10). Our sequence comparison of five homoeologous centromeres in two Solanum species reveals rapid divergence of centromeric sequences among closely related species. We propose that centromeric satellite repeats undergo boom-bust cycles before a favorable repeat is fixed in the population.
68 citations
Cited by
More filters
••
TL;DR: Phytozome provides a view of the evolutionary history of every plant gene at the level of sequence, gene structure, gene family and genome organization, while at the same time providing access to the sequences and functional annotations of a growing number of complete plant genomes.
Abstract: The number of sequenced plant genomes and associated genomic resources is growing rapidly with the advent of both an increased focus on plant genomics from funding agencies, and the application of inexpensive next generation sequencing. To interact with this increasing body of data, we have developed Phytozome (http://www.phytozome.net), a comparative hub for plant genome and gene family data and analysis. Phytozome provides a view of the evolutionary history of every plant gene at the level of sequence, gene structure, gene family and genome organization, while at the same time providing access to the sequences and functional annotations of a growing number (currently 25) of complete plant genomes, including all the land plants and selected algae sequenced at the Joint Genome Institute, as well as selected species sequenced elsewhere. Through a comprehensive plant genome database and web portal, these data and analyses are available to the broader plant science research community, providing powerful comparative genomics tools that help to link model systems with other plants of economic and ecological importance.
3,728 citations
••
TL;DR: A high-quality genome sequence of domesticated tomato is presented, a draft sequence of its closest wild relative, Solanum pimpinellifolium, is compared, and the two tomato genomes are compared to each other and to the potato genome.
Abstract: Tomato (Solanum lycopersicum) is a major crop plant and a model system for fruit development. Solanum is one of the largest angiosperm genera1 and includes annual and perennial plants from diverse habitats. Here we present a high-quality genome sequence of domesticated tomato, a draft sequence of its closest wild relative, Solanum pimpinellifolium2, and compare them to each other and to the potato genome (Solanum tuberosum). The two tomato genomes show only 0.6% nucleotide divergence and signs of recent admixture, but show more than 8% divergence from potato, with nine large and several smaller inversions. In contrast to Arabidopsis, but similar to soybean, tomato and potato small RNAs map predominantly to gene-rich chromosomal regions, including gene promoters. The Solanum lineage has experienced two consecutive genome triplications: one that is ancient and shared with rosids, and a more recent one. These triplications set the stage for the neofunctionalization of genes controlling fruit characteristics, such as colour and fleshiness.
2,687 citations
••
TL;DR: The sequencing and assembly of the oyster genome using short reads and a fosmid-pooling strategy and transcriptomes of development and stress response and the proteome of the shell are reported, showing that shell formation in molluscs is more complex than currently understood and involves extensive participation of cells and their exosomes.
Abstract: The Pacific oyster Crassostrea gigas belongs to one of the most species-rich but genomically poorly explored phyla, the Mollusca. Here we report the sequencing and assembly of the oyster genome using short reads and a fosmid-pooling strategy, along with transcriptomes of development and stress response and the proteome of the shell. The oyster genome is highly polymorphic and rich in repetitive sequences, with some transposable elements still actively shaping variation. Transcriptome studies reveal an extensive set of genes responding to environmental stress. The expansion of genes coding for heat shock protein 70 and inhibitors of apoptosis is probably central to the oyster's adaptation to sessile life in the highly stressful intertidal zone. Our analyses also show that shell formation in molluscs is more complex than currently understood and involves extensive participation of cells and their exosomes. The oyster genome sequence fills a void in our understanding of the Lophotrochozoa.
1,806 citations
••
TL;DR: The computational problems surrounding repeats are discussed and strategies used by current bioinformatics systems to solve them are described.
Abstract: Repetitive DNA sequences are abundant in a broad range of species, from bacteria to mammals, and they cover nearly half of the human genome. Repeats have always presented technical challenges for sequence alignment and assembly programs. Next-generation sequencing projects, with their short read lengths and high data volumes, have made these challenges more difficult. From a computational perspective, repeats create ambiguities in alignment and assembly, which, in turn, can produce biases and errors when interpreting results. Simply ignoring repeats is not an option, as this creates problems of its own and may mean that important biological phenomena are missed. We discuss the computational problems surrounding repeats and describe strategies used by current bioinformatics systems to solve them.
1,451 citations
••
Academy of Sciences of the Czech Republic1, University of Saskatchewan2, Bayer3, Kansas State University4, University of California, Riverside5, Blaise Pascal University6, Kyoto University7, University of Dundee8, Punjab Agricultural University9, Indian Agricultural Research Institute10, University of Delhi11, University of Tsukuba12, Yokohama City University13, National Research Council14, Norwegian University of Life Sciences15, Sainsbury Laboratory16, Leibniz Association17, United States Department of Energy18, James Hutton Institute19, Institut national de la recherche agronomique20, University of Zurich21, Sabancı University22, Murdoch University23
TL;DR: Insight into the genome biology of a polyploid crop provide a springboard for faster gene isolation, rapid genetic marker development, and precise breeding to meet the needs of increasing food demand worldwide.
Abstract: An ordered draft sequence of the 17-gigabase hexaploid bread wheat (Triticum aestivum) genome has been produced by sequencing isolated chromosome arms. We have annotated 124,201 gene loci distributed nearly evenly across the homeologous chromosomes and subgenomes. Comparative gene analysis of wheat subgenomes and extant diploid and tetraploid wheat relatives showed that high sequence similarity and structural conservation are retained, with limited gene loss, after polyploidization. However, across the genomes there was evidence of dynamic gene gain, loss, and duplication since the divergence of the wheat lineages. A high degree of transcriptional autonomy and no global dominance was found for the subgenomes. These insights into the genome biology of a polyploid crop provide a springboard for faster gene isolation, rapid genetic marker development, and precise breeding to meet the needs of increasing food demand worldwide.
1,421 citations