scispace - formally typeset
Search or ask a question
Author

Mi-Kyung Lee

Bio: Mi-Kyung Lee is an academic researcher from Texas A&M University. The author has contributed to research in topics: Genome & Contig. The author has an hindex of 17, co-authored 19 publications receiving 1870 citations. Previous affiliations of Mi-Kyung Lee include International Rice Research Institute.

Papers
More filters
Journal ArticleDOI
01 Sep 2006-Science
TL;DR: Comparison of the two species' genomes reveals a rapid expansion and diversification of many protein families associated with plant infection such as hydrolases, ABC transporters, protein toxins, proteinase inhibitors, and, in particular, a superfamily of 700 proteins with similarity to known oömycete avirulence genes.
Abstract: Draft genome sequences have been determined for the soybean pathogen Phytophthora sojae and the sudden oak death pathogen Phytophthora ramorum. Oomycetes such as these Phytophthora species share the kingdom Stramenopila with photosynthetic algae such as diatoms, and the presence of many Phytophthora genes of probable phototroph origin supports a photosynthetic ancestry for the stramenopiles. Comparison of the two species' genomes reveals a rapid expansion and diversification of many protein families associated with plant infection such as hydrolases, ABC transporters, protein toxins, proteinase inhibitors, and, in particular, a superfamily of 700 proteins with similarity to known oomycete avirulence genes.

1,016 citations

Journal ArticleDOI
TL;DR: The combined application of next-generation sequencing platforms has provided an economical approach to unlocking the potential of the turkey genome.
Abstract: A synergistic combination of two next-generation sequencing platforms with a detailed comparative BAC physical contig map provided a cost-effective assembly of the genome sequence of the domestic turkey (Meleagris gallopavo). Heterozygosity of the sequenced source genome allowed discovery of more than 600,000 high quality single nucleotide variants. Despite this heterozygosity, the current genome assembly (∼1.1 Gb) includes 917 Mb of sequence assigned to specific turkey chromosomes. Annotation identified nearly 16,000 genes, with 15,093 recognized as protein coding and 611 as non-coding RNA genes. Comparative analysis of the turkey, chicken, and zebra finch genomes, and comparing avian to mammalian species, supports the characteristic stability of avian genomes and identifies genes unique to the avian lineage. Clear differences are seen in number and variety of genes of the avian immune system where expansions and novel genes are less frequent than examples of gene loss. The turkey genome sequence provides resources to further understand the evolution of vertebrate genomes and genetic variation underlying economically important quantitative traits in poultry. This integrated approach may be a model for providing both gene and chromosome level assemblies of other species with agricultural, ecological, and evolutionary interest.

415 citations

Journal ArticleDOI
TL;DR: This map represents the first genome-wide, BAC-based physical map of the chicken genome and provides a powerful platform for many areas of chicken genomics, including targeted marker development, fine mapping of genes and QTL alleles, positional cloning, analysis of avian genome organization and evolution, chicken-mammalian comparative genomics and large-scale genome sequencing.
Abstract: A genome-wide physical map constructed with bacterial artificial chromosomes (BACs) is an essential component in linking phenotypic traits to the responsible genetic variation in the genomes of plants and animals. We have constructed a physical map of the chicken genome from 57,091 BACs (7.9-fold haploid genome coverage) by restriction fingerprint analysis using high-resolution polyacrylamide gel electrophoresis. The physical map consists of 2331 overlapping BAC contigs and is estimated to span 1510 Mb in physical length. BAC contigs were verified manually and by screening the BACs with 367 DNA markers. A total of 361 of the contigs have been anchored to the existing chicken genetic map. This map represents the first genome-wide, BAC-based physical map of the chicken genome. It provides a powerful platform for many areas of chicken genomics, including targeted marker development, fine mapping of genes and QTL alleles, positional cloning, analysis of avian genome organization and evolution, chicken-mammalian comparative genomics, and large-scale genome sequencing.

84 citations

Journal ArticleDOI
TL;DR: It is found that the size variations of both gene families are associated with organisms’ phylogeny, suggesting their roles in speciation and evolution.
Abstract: Many genes exist in the form of families; however, little is known about their size variation, evolution and biology. Here, we present the size variation and evolution of the nucleotide-binding site (NBS)-encoding gene family and receptor-like kinase (RLK) gene family in Oryza, Glycine and Gossypium. The sizes of both families vary by numeral fold, not only among species, surprisingly, also within a species. The size variations of the gene families are shown to correlate with each other, indicating their interactions, and driven by natural selection, artificial selection and genome size variation, but likely not by polyploidization. The numbers of genes in the families in a polyploid species are similar to those of one of its diploid donors, suggesting that polyploidization plays little roles in the expansion of the gene families and that organisms tend not to maintain their ‘surplus’ genes in the course of evolution. Furthermore, it is found that the size variations of both gene families are associated with organisms’ phylogeny, suggesting their roles in speciation and evolution. Since both selection and speciation act on organism’s morphological, physiological and biological variation, our results indicate that the variation of gene family size provides a source of genetic variation and evolution.

63 citations

Journal ArticleDOI
TL;DR: Chicken BAC libraries constructed with three different restriction enzyme-generated inserts (HindIII, BamHI and EcoRI) should provide nearly full coverage of the chicken genome, suitable for high-resolution physical mapping and sequence analysis.
Abstract: Source/description: Large-insert BAC libraries have been essential components in the physical mapping and sequencing of the human genome and those of other species. Crooijmans et al. constructed a chicken BAC library with a 5.5-fold representation of the chicken genome using HindIII partial digest fragments. However, this is unlikely to provide full coverage by itself, as the use of only a single enzyme will bias against regions of the genome with unusually high or low densities of restriction sites. Kato et al. recently described a HindIII-based library of similar size, but this is not publicly available. This paper describes chicken BAC libraries constructed with three different restriction enzyme-generated inserts (HindIII, BamHI and EcoRI). Together, the three libraries should provide nearly full coverage of the chicken genome, suitable for high-resolution physical mapping and sequence analysis. All three libraries are publicly available, distributed as duplicated libraries, individual BACs and high-density colony filters. A female of the Red Jungle Fowl (Gallus gallus gallus) line UCD001 was the DNA source. A single bird from an inbred line was chosen to minimize heterozygosity that could impede eventual BAC contig assembly using fingerprint analysis. In addition, comparison of the UCD001 sequence to numerous available White Leghorn accessions should generate dense single-nucleotide polymorphism coverage of the genome...

54 citations


Cited by
More filters
Journal ArticleDOI
TL;DR: This work proposes a new k-mer counting algorithm and associated implementation, called Jellyfish, which is fast and memory efficient, based on a multithreaded, lock-free hash table optimized for counting k-mers up to 31 bases in length.
Abstract: Motivation: Counting the number of occurrences of every k-mer (substring of length k) in a long string is a central subproblem in many applications, including genome assembly, error correction of sequencing reads, fast multiple sequence alignment and repeat detection. Recently, the deep sequence coverage generated by next-generation sequencing technologies has caused the amount of sequence to be processed during a genome project to grow rapidly, and has rendered current k-mer counting tools too slow and memory intensive. At the same time, large multicore computers have become commonplace in research facilities allowing for a new parallel computational paradigm. Results: We propose a new k-mer counting algorithm and associated implementation, called Jellyfish, which is fast and memory efficient. It is based on a multithreaded, lock-free hash table optimized for counting k-mers up to 31 bases in length. Due to their flexibility, suffix arrays have been the data structure of choice for solving many string problems. For the task of k-mer counting, important in many biological applications, Jellyfish offers a much faster and more memory-efficient solution. Availability: The Jellyfish software is written in C++ and is GPL licensed. It is available for download at http://www.cbcb.umd.edu/software/jellyfish. Contact: [email protected] Supplementary information:Supplementary data are available at Bioinformatics online.

2,779 citations

Journal ArticleDOI
04 Oct 2012-Nature
TL;DR: The sequencing and assembly of the oyster genome using short reads and a fosmid-pooling strategy and transcriptomes of development and stress response and the proteome of the shell are reported, showing that shell formation in molluscs is more complex than currently understood and involves extensive participation of cells and their exosomes.
Abstract: The Pacific oyster Crassostrea gigas belongs to one of the most species-rich but genomically poorly explored phyla, the Mollusca. Here we report the sequencing and assembly of the oyster genome using short reads and a fosmid-pooling strategy, along with transcriptomes of development and stress response and the proteome of the shell. The oyster genome is highly polymorphic and rich in repetitive sequences, with some transposable elements still actively shaping variation. Transcriptome studies reveal an extensive set of genes responding to environmental stress. The expansion of genes coding for heat shock protein 70 and inhibitors of apoptosis is probably central to the oyster's adaptation to sessile life in the highly stressful intertidal zone. Our analyses also show that shell formation in molluscs is more complex than currently understood and involves extensive participation of cells and their exosomes. The oyster genome sequence fills a void in our understanding of the Lophotrochozoa.

1,806 citations

Journal ArticleDOI
Erich D. Jarvis1, Siavash Mirarab2, Andre J. Aberer3, Bo Li4, Bo Li5, Bo Li6, Peter Houde7, Cai Li6, Cai Li4, Simon Y. W. Ho8, Brant C. Faircloth9, Benoit Nabholz, Jason T. Howard1, Alexander Suh10, Claudia C. Weber10, Rute R. da Fonseca11, Jianwen Li, Fang Zhang Zhang, Hui Li, Long Zhou, Nitish Narula12, Nitish Narula7, Liang Liu13, Ganesh Ganapathy1, Bastien Boussau, Shamsuzzoha Bayzid2, Volodymyr Zavidovych1, Sankar Subramanian14, Toni Gabaldón15, Salvador Capella-Gutierrez, Jaime Huerta-Cepas, Bhanu Rekepalli16, Bhanu Rekepalli17, Kasper Munch18, Mikkel H. Schierup18, Bent E. K. Lindow11, Wesley C. Warren19, David A. Ray, Richard E. Green20, Michael William Bruford21, Xiangjiang Zhan22, Xiangjiang Zhan21, Andrew Dixon, Shengbin Li5, Ning Li23, Yinhua Huang23, Elizabeth P. Derryberry24, Elizabeth P. Derryberry25, Mads F. Bertelsen26, Frederick H. Sheldon25, Robb T. Brumfield25, Claudio V. Mello27, Claudio V. Mello28, Peter V. Lovell27, Morgan Wirthlin27, Maria Paula Cruz Schneider28, Francisco Prosdocimi28, José Alfredo Samaniego11, Amhed Missael Vargas Velazquez11, Alonzo Alfaro-Núñez11, Paula F. Campos11, Bent O. Petersen29, Thomas Sicheritz-Pontén29, An Pas, Thomas L. Bailey, R. Paul Scofield30, Michael Bunce31, David M. Lambert14, Qi Zhou, Polina L. Perelman32, Amy C. Driskell33, Beth Shapiro20, Zijun Xiong, Yongli Zeng, Shiping Liu, Zhenyu Li, Binghang Liu, Kui Wu, Jin Xiao, Xiong Yinqi, Quiemei Zheng, Yong Zhang, Huanming Yang, Jian Wang, Linnéa Smeds10, Frank E. Rheindt34, Michael J. Braun35, Jon Fjeldså11, Ludovic Orlando11, F. Keith Barker6, Knud A. Jønsson6, Warren E. Johnson33, Klaus-Peter Koepfli33, Stephen J. O'Brien36, David Haussler, Oliver A. Ryder, Carsten Rahbek6, Eske Willerslev11, Gary R. Graves33, Gary R. Graves6, Travis C. Glenn13, John E. McCormack37, Dave Burt38, Hans Ellegren10, Per Alström, Scott V. Edwards39, Alexandros Stamatakis3, David P. Mindell40, Joel Cracraft6, Edward L. Braun41, Tandy Warnow42, Tandy Warnow2, Wang Jun, M. Thomas P. Gilbert6, M. Thomas P. Gilbert31, Guojie Zhang11, Guojie Zhang4 
12 Dec 2014-Science
TL;DR: A genome-scale phylogenetic analysis of 48 species representing all orders of Neoaves recovered a highly resolved tree that confirms previously controversial sister or close relationships and identifies the first divergence in Neoaves, two groups the authors named Passerea and Columbea.
Abstract: To better determine the history of modern birds, we performed a genome-scale phylogenetic analysis of 48 species representing all orders of Neoaves using phylogenomic methods created to handle genome-scale data. We recovered a highly resolved tree that confirms previously controversial sister or close relationships. We identified the first divergence in Neoaves, two groups we named Passerea and Columbea, representing independent lineages of diverse and convergently evolved land and water bird species. Among Passerea, we infer the common ancestor of core landbirds to have been an apex predator and confirm independent gains of vocal learning. Among Columbea, we identify pigeons and flamingoes as belonging to sister clades. Even with whole genomes, some of the earliest branches in Neoaves proved challenging to resolve, which was best explained by massive protein-coding sequence convergence and high levels of incomplete lineage sorting that occurred during a rapid radiation after the Cretaceous-Paleogene mass extinction event about 66 million years ago.

1,624 citations

Journal ArticleDOI
13 Nov 2008-Nature
TL;DR: Analysis of molecular divergence compared with yeasts and metazoans reveals rapid rates of gene diversification in diatoms, and documents the presence of hundreds of genes from bacteria, likely to provide novel possibilities for metabolite management and for perception of environmental signals.
Abstract: Diatoms are photosynthetic secondary endosymbionts found throughout marine and freshwater environments, and are believed to be responsible for around one- fifth of the primary productivity on Earth(1,2). The genome sequence of the marine centric diatom Thalassiosira pseudonana was recently reported, revealing a wealth of information about diatom biology(3-5). Here we report the complete genome sequence of the pennate diatom Phaeodactylum tricornutum and compare it with that of T. pseudonana to clarify evolutionary origins, functional significance and ubiquity of these features throughout diatoms. In spite of the fact that the pennate and centric lineages have only been diverging for 90 million years, their genome structures are dramatically different and a substantial fraction of genes (similar to 40%) are not shared by these representatives of the two lineages. Analysis of molecular divergence compared with yeasts and metazoans reveals rapid rates of gene diversification in diatoms. Contributing factors include selective gene family expansions, differential losses and gains of genes and introns, and differential mobilization of transposable elements. Most significantly, we document the presence of hundreds of genes from bacteria. More than 300 of these gene transfers are found in both diatoms, attesting to their ancient origins, and many are likely to provide novel possibilities for metabolite management and for perception of environmental signals. These findings go a long way towards explaining the incredible diversity and success of the diatoms in contemporary oceans.

1,500 citations

Journal ArticleDOI
Brian J. Haas1, Sophien Kamoun2, Sophien Kamoun3, Michael C. Zody1, Michael C. Zody4, Rays H. Y. Jiang5, Rays H. Y. Jiang1, Robert E. Handsaker1, Liliana M. Cano3, Manfred Grabherr1, Chinnappa D. Kodira1, Chinnappa D. Kodira6, Sylvain Raffaele3, Trudy Torto-Alalibo2, Trudy Torto-Alalibo6, Tolga O. Bozkurt3, Audrey M. V. Ah-Fong7, Lucia Alvarado1, Vicky L. Anderson8, Miles R. Armstrong9, Anna O. Avrova9, Laura Baxter10, Jim Beynon10, Petra C. Boevink9, Stephanie R. Bollmann11, Jorunn I. B. Bos2, Vincent Bulone12, Guohong Cai13, Cahid Cakir2, James C. Carrington14, Megan Chawner15, Lucio Conti16, Stefano Costanzo11, Richard Ewan16, Noah Fahlgren14, Michael A. Fischbach17, Johanna Fugelstad12, Eleanor M. Gilroy9, Sante Gnerre1, Pamela J. Green18, Laura J. Grenville-Briggs8, John Griffith15, Niklaus J. Grünwald11, Karolyn Horn15, Neil R. Horner8, Chia-Hui Hu19, Edgar Huitema2, Dong-Hoon Jeong18, Alexandra M. E. Jones3, Jonathan D. G. Jones3, Richard W. Jones11, Elinor K. Karlsson1, Sridhara G. Kunjeti20, Kurt Lamour21, Zhenyu Liu2, Li-Jun Ma1, Dan MacLean3, Marcus C. Chibucos22, Hayes McDonald23, Jessica McWalters15, Harold J. G. Meijer5, William Morgan24, Paul Morris25, Carol A. Munro8, Keith O'Neill6, Keith O'Neill1, Manuel D. Ospina-Giraldo15, Andrés Pinzón, Leighton Pritchard9, Bernard H Ramsahoye26, Qinghu Ren27, Silvia Restrepo, Sourav Roy7, Ari Sadanandom16, Alon Savidor28, Sebastian Schornack3, David C. Schwartz29, Ulrike Schumann8, Ben Schwessinger3, Lauren Seyer15, Ted Sharpe1, Cristina Silvar3, Jing Song2, David J. Studholme3, Sean M. Sykes1, Marco Thines3, Marco Thines30, Peter J. I. van de Vondervoort5, Vipaporn Phuntumart25, Stephan Wawra8, R. Weide5, Joe Win3, Carolyn A. Young2, Shiguo Zhou29, William E. Fry13, Blake C. Meyers18, Pieter van West8, Jean B. Ristaino19, Francine Govers5, Paul R. J. Birch31, Stephen C. Whisson9, Howard S. Judelson7, Chad Nusbaum1 
17 Sep 2009-Nature
TL;DR: The sequence of the P. infestans genome is reported, which at ∼240 megabases (Mb) is by far the largest and most complex genome sequenced so far in the chromalveolates and probably plays a crucial part in the rapid adaptability of the pathogen to host plants and underpins its evolutionary potential.
Abstract: Phytophthora infestans is the most destructive pathogen of potato and a model organism for the oomycetes, a distinct lineage of fungus-like eukaryotes that are related to organisms such as brown algae and diatoms. As the agent of the Irish potato famine in the mid-nineteenth century, P. infestans has had a tremendous effect on human history, resulting in famine and population displacement(1). To this day, it affects world agriculture by causing the most destructive disease of potato, the fourth largest food crop and a critical alternative to the major cereal crops for feeding the world's population(1). Current annual worldwide potato crop losses due to late blight are conservatively estimated at $6.7 billion(2). Management of this devastating pathogen is challenged by its remarkable speed of adaptation to control strategies such as genetically resistant cultivars(3,4). Here we report the sequence of the P. infestans genome, which at similar to 240 megabases (Mb) is by far the largest and most complex genome sequenced so far in the chromalveolates. Its expansion results from a proliferation of repetitive DNA accounting for similar to 74% of the genome. Comparison with two other Phytophthora genomes showed rapid turnover and extensive expansion of specific families of secreted disease effector proteins, including many genes that are induced during infection or are predicted to have activities that alter host physiology. These fast-evolving effector genes are localized to highly dynamic and expanded regions of the P. infestans genome. This probably plays a crucial part in the rapid adaptability of the pathogen to host plants and underpins its evolutionary potential.

1,341 citations