Author
Marcus A. Koch
Other affiliations: Heidelberg Institute for Theoretical Studies, Max Planck Society, Novartis ...read more
Bio: Marcus A. Koch is an academic researcher from Heidelberg University. The author has contributed to research in topics: Phylogenetic tree & Population. The author has an hindex of 57, co-authored 207 publications receiving 12133 citations. Previous affiliations of Marcus A. Koch include Heidelberg Institute for Theoretical Studies & Max Planck Society.
Topics: Phylogenetic tree, Population, Boechera, Arabis, Molecular phylogenetics
Papers published on a yearly basis
Papers
More filters
••
TL;DR: The authors analyzed sequence variation for chalcone synthase (Chs) and alcohol dehydrogenase (Adh) loci in 28 species in the genera Arabidopsis and Arabis and related taxa from tribe Arabideae.
Abstract: We analyzed sequence variation for chalcone synthase (Chs) and alcohol dehydrogenase (Adh) loci in 28 species in the genera Arabidopsis and Arabis and related taxa from tribe Arabideae. Chs was single-copy in nearly all taxa examined, while Adh duplications were found in several species. Phylogenies constructed from both loci confirmed that the closest relatives of Arabidopsis thaliana include Arabidopsis lyrata, Arabidopsis petraea, and Arabidopsis halleri (formerly in the genus Cardaminopsis). Slightly more distant are the North American n = 7 Arabis (Boechera) species. The genus Arabis is polyphyletic-some unrelated species appear within this taxonomic classification, which has little phylogenetic meaning. Fossil pollen data were used to compute a synonymous substitution rate of 1.5 x 10 substitutions per site per year for both Chs and Adh. Arabidopsis thaliana diverged from its nearest relatives about 5 MYA, and from Brassica roughly 24 MYA. Independent molecular and fossil data from several sources all provide similar estimates of evolutionary timescale in the Brassicaceae.
886 citations
••
TL;DR: Phylogenetic relationships based on the chloroplast 5'-trnL (UAA)-trnF(GAA) region and estimated divergence times based on sequence data of the chalcone synthase gene are congruent with comparative painting data and place Calepina, Conringia, and Sisymbrium outside the clade of Brassiceae species with triplicated genomes.
Abstract: We have used an approximately 8.7-Mb BAC contig of Arabidopsis thaliana Chromosome 4 to trace homeologous chromosome regions in 21 species of the family Brassicaceae. Homeologs of this segment could be identified in all tested species. Painting of pachytene chromosomes of Calepina, Conringia, and Sisymbrium species (2n = 14, 16), traditionally placed in tribe Brassiceae, showed one homeologous copy of the Arabidopsis contig, while the remaining taxa of the tribe (2n = 14-30) revealed three, and three Brassica species (2n = 34, 36, and 38) and Erucastrum gallicum (2n = 30) had six copies corresponding to the 8.7-Mb segment. The multiple homeologous copies corresponded structurally to the Arabidopsis segment or were rearranged by inversions and translocations within the diploidized genomes. These chromosome rearrangements accompanied by chromosome fusions/fissions led to the present-day chromosome number variation within the Brassiceae. Phylogenetic relationships based on the chloroplast 5'-trnL (UAA)-trnF(GAA) region and estimated divergence times based on sequence data of the chalcone synthase gene are congruent with comparative painting data and place Calepina, Conringia, and Sisymbrium outside the clade of Brassiceae species with triplicated genomes. Most likely, species containing three or six copy pairs descended from a common hexaploid ancestor with basic genomes similar to that of Arabidopsis. The presumed hexaploidization event occurred after the Arabidopsis-Brassiceae split, between 7.9 and 14.6 Mya.
619 citations
••
TL;DR: A structural classification of natural products (SCONP) is reported as organizing principle for charting the known chemical space explored by nature and provides a viable analysis- and hypothesis-generating tool for the design of natural product-derived compound collections.
Abstract: The identification of small molecules that fall within the biologically relevant subfraction of vast chemical space is of utmost importance to chemical biology and medicinal chemistry research. The prerequirement of biological relevance to be met by such molecules is fulfilled by natural product-derived compound collections. We report a structural classification of natural products (SCONP) as organizing principle for charting the known chemical space explored by nature. SCONP arranges the scaffolds of the natural products in a tree-like fashion and provides a viable analysis- and hypothesis-generating tool for the design of natural product-derived compound collections. The validity of the approach is demonstrated in the development of a previously undescribed class of selective and potent inhibitors of 11β-hydroxysteroid dehydrogenase type 1 with activity in cells guided by SCONP and protein structure similarity clustering. 11β-hydroxysteroid dehydrogenase type 1 is a target in the development of new therapies for the treatment of diabetes, the metabolic syndrome, and obesity.
508 citations
••
TL;DR: It is concluded that tribe Brassiceae might be the only monophyletic group of the traditional tribes, and tribes Lepidieae, Arabideae, and Sisymbrieae are not monophylets.
Abstract: Phylogenetic relationships were inferred using nucleotide sequence variation of the nuclear-encoded chalcone synthase gene (Chs) and the chloroplast gene matK for members of five tribes from the family Brassicaceae to analyze tribal and subtribal structures. Phylogenetic trees from individual data sets are mostly in congruence with the results from a combined matK-Chs analysis with a total of 2721 base pairs, but with greater resolution and higher statistical support for deeper branching patterns. The analysis indicates that tribes Lepidieae, Arabideae, and Sisymbrieae are not monophyletic. Among taxa under study four different lineages each were detected in tribes Arabideae and Lepidieae, interspersed with taxa from tribes Sisymbrieae, Hesperideae, and Brassiceae. It is concluded that tribe Brassiceae might be the only monophyletic group of the traditional tribes. From our data we estimated several divergence times for different lineages among cruciferous plants: 5.8 mya (million years ago) for the Arabidopsis-Cardamiiolpsis split, 20 mya for the Brassica-Arabidopsis split, and -40 mya for the age of the deepest split between the most basal crucifer Aethiolerna and remaining cruciferous taxa.
406 citations
••
TL;DR: This review examines recent outcomes from diverse botanical disciplines to synthesize for the first time a holistic view on the evolutionary history of the mustard family.
347 citations
Cited by
More filters
•
TL;DR: It is suggested that the natural selection against large insertion/deletion is so weak that a large amount of variation is maintained in a population.
11,521 citations
01 Jun 2012
TL;DR: SPAdes as mentioned in this paper is a new assembler for both single-cell and standard (multicell) assembly, and demonstrate that it improves on the recently released E+V-SC assembler and on popular assemblers Velvet and SoapDeNovo (for multicell data).
Abstract: The lion's share of bacteria in various environments cannot be cloned in the laboratory and thus cannot be sequenced using existing technologies. A major goal of single-cell genomics is to complement gene-centric metagenomic data with whole-genome assemblies of uncultivated organisms. Assembly of single-cell data is challenging because of highly non-uniform read coverage as well as elevated levels of sequencing errors and chimeric reads. We describe SPAdes, a new assembler for both single-cell and standard (multicell) assembly, and demonstrate that it improves on the recently released E+V-SC assembler (specialized for single-cell data) and on popular assemblers Velvet and SoapDeNovo (for multicell data). SPAdes generates single-cell assemblies, providing information about genomes of uncultivatable bacteria that vastly exceeds what may be obtained via traditional metagenomics studies. SPAdes is available online ( http://bioinf.spbau.ru/spades ). It is distributed as open source software.
10,124 citations
••
TL;DR: This is the first complete genome sequence of a plant and provides the foundations for more comprehensive comparison of conserved processes in all eukaryotes, identifying a wide range of plant-specific gene functions and establishing rapid systematic ways to identify genes for crop improvement.
Abstract: The flowering plant Arabidopsis thaliana is an important model system for identifying genes and determining their functions. Here we report the analysis of the genomic sequence of Arabidopsis. The sequenced regions cover 115.4 megabases of the 125-megabase genome and extend into centromeric regions. The evolution of Arabidopsis involved a whole-genome duplication, followed by subsequent gene loss and extensive local gene duplications, giving rise to a dynamic genome enriched by lateral gene transfer from a cyanobacterial-like ancestor of the plastid. The genome contains 25,498 genes encoding proteins from 11,000 families, similar to the functional diversity of Drosophila and Caenorhabditis elegans--the other sequenced multicellular eukaryotes. Arabidopsis has many families of new proteins but also lacks several common protein families, indicating that the sets of common proteins have undergone differential expansion and contraction in the three multicellular eukaryotes. This is the first complete genome sequence of a plant and provides the foundations for more comprehensive comparison of conserved processes in all eukaryotes, identifying a wide range of plant-specific gene functions and establishing rapid systematic ways to identify genes for crop improvement.
8,742 citations
01 Aug 2000
TL;DR: Assessment of medical technology in the context of commercialization with Bioentrepreneur course, which addresses many issues unique to biomedical products.
Abstract: BIOE 402. Medical Technology Assessment. 2 or 3 hours. Bioentrepreneur course. Assessment of medical technology in the context of commercialization. Objectives, competition, market share, funding, pricing, manufacturing, growth, and intellectual property; many issues unique to biomedical products. Course Information: 2 undergraduate hours. 3 graduate hours. Prerequisite(s): Junior standing or above and consent of the instructor.
4,833 citations
••
University of Tennessee1, Oak Ridge National Laboratory2, West Virginia University3, Umeå University4, University of British Columbia5, United States Department of Energy6, Ghent University7, Swedish University of Agricultural Sciences8, Institut national de la recherche agronomique9, Virginia Tech10, Michigan Technological University11, University of Toronto12, Pennsylvania State University13, University of Provence14, University of Georgia15, University of Florida16, University of California, Berkeley17, Lawrence Berkeley National Laboratory18, University of Arizona19, Purdue University20, Stanford University21, United States Department of Agriculture22, University of Turku23, University of Helsinki24, Massachusetts Institute of Technology25, University of Tennessee Health Science Center26, University of Tübingen27
TL;DR: The draft genome of the black cottonwood tree, Populus trichocarpa, has been reported in this paper, with more than 45,000 putative protein-coding genes identified.
Abstract: We report the draft genome of the black cottonwood tree, Populus trichocarpa. Integration of shotgun sequence assembly with genetic mapping enabled chromosome-scale reconstruction of the genome. More than 45,000 putative protein-coding genes were identified. Analysis of the assembled genome revealed a whole-genome duplication event; about 8000 pairs of duplicated genes from that event survived in the Populus genome. A second, older duplication event is indistinguishably coincident with the divergence of the Populus and Arabidopsis lineages. Nucleotide substitution, tandem gene duplication, and gross chromosomal rearrangement appear to proceed substantially more slowly in Populus than in Arabidopsis. Populus has more protein-coding genes than Arabidopsis, ranging on average from 1.4 to 1.6 putative Populus homologs for each Arabidopsis gene. However, the relative frequency of protein domains in the two genomes is similar. Overrepresented exceptions in Populus include genes associated with lignocellulosic wall biosynthesis, meristem development, disease resistance, and metabolite transport.
4,025 citations