scispace - formally typeset
Search or ask a question
Author

José Alfredo Samaniego

Bio: José Alfredo Samaniego is an academic researcher from University of Copenhagen. The author has contributed to research in topics: Population & Domestication. The author has an hindex of 11, co-authored 15 publications receiving 2416 citations. Previous affiliations of José Alfredo Samaniego include American Museum of Natural History.

Papers
More filters
Journal ArticleDOI
Erich D. Jarvis1, Siavash Mirarab2, Andre J. Aberer3, Bo Li4, Bo Li5, Bo Li6, Peter Houde7, Cai Li4, Cai Li6, Simon Y. W. Ho8, Brant C. Faircloth9, Benoit Nabholz, Jason T. Howard1, Alexander Suh10, Claudia C. Weber10, Rute R. da Fonseca11, Jianwen Li, Fang Zhang Zhang, Hui Li, Long Zhou, Nitish Narula7, Nitish Narula12, Liang Liu13, Ganesh Ganapathy1, Bastien Boussau, Shamsuzzoha Bayzid2, Volodymyr Zavidovych1, Sankar Subramanian14, Toni Gabaldón15, Salvador Capella-Gutierrez, Jaime Huerta-Cepas, Bhanu Rekepalli16, Bhanu Rekepalli17, Kasper Munch18, Mikkel H. Schierup18, Bent E. K. Lindow11, Wesley C. Warren19, David A. Ray, Richard E. Green20, Michael William Bruford21, Xiangjiang Zhan21, Xiangjiang Zhan22, Andrew Dixon, Shengbin Li5, Ning Li23, Yinhua Huang23, Elizabeth P. Derryberry24, Elizabeth P. Derryberry25, Mads F. Bertelsen26, Frederick H. Sheldon24, Robb T. Brumfield24, Claudio V. Mello27, Claudio V. Mello28, Peter V. Lovell28, Morgan Wirthlin28, Maria Paula Cruz Schneider27, Francisco Prosdocimi27, José Alfredo Samaniego11, Amhed Missael Vargas Velazquez11, Alonzo Alfaro-Núñez11, Paula F. Campos11, Bent O. Petersen29, Thomas Sicheritz-Pontén29, An Pas, Thomas L. Bailey, R. Paul Scofield30, Michael Bunce31, David M. Lambert14, Qi Zhou, Polina L. Perelman32, Amy C. Driskell33, Beth Shapiro20, Zijun Xiong, Yongli Zeng, Shiping Liu, Zhenyu Li, Binghang Liu, Kui Wu, Jin Xiao, Xiong Yinqi, Quiemei Zheng, Yong Zhang, Huanming Yang, Jian Wang, Linnéa Smeds10, Frank E. Rheindt34, Michael J. Braun35, Jon Fjeldså11, Ludovic Orlando11, F. Keith Barker6, Knud A. Jønsson6, Warren E. Johnson33, Klaus-Peter Koepfli33, Stephen J. O'Brien36, David Haussler, Oliver A. Ryder, Carsten Rahbek6, Eske Willerslev11, Gary R. Graves6, Gary R. Graves33, Travis C. Glenn13, John E. McCormack37, Dave Burt38, Hans Ellegren10, Per Alström, Scott V. Edwards39, Alexandros Stamatakis3, David P. Mindell40, Joel Cracraft6, Edward L. Braun41, Tandy Warnow42, Tandy Warnow2, Wang Jun, M. Thomas P. Gilbert6, M. Thomas P. Gilbert31, Guojie Zhang4, Guojie Zhang11 
12 Dec 2014-Science
TL;DR: A genome-scale phylogenetic analysis of 48 species representing all orders of Neoaves recovered a highly resolved tree that confirms previously controversial sister or close relationships and identifies the first divergence in Neoaves, two groups the authors named Passerea and Columbea.
Abstract: To better determine the history of modern birds, we performed a genome-scale phylogenetic analysis of 48 species representing all orders of Neoaves using phylogenomic methods created to handle genome-scale data. We recovered a highly resolved tree that confirms previously controversial sister or close relationships. We identified the first divergence in Neoaves, two groups we named Passerea and Columbea, representing independent lineages of diverse and convergently evolved land and water bird species. Among Passerea, we infer the common ancestor of core landbirds to have been an apex predator and confirm independent gains of vocal learning. Among Columbea, we identify pigeons and flamingoes as belonging to sister clades. Even with whole genomes, some of the earliest branches in Neoaves proved challenging to resolve, which was best explained by massive protein-coding sequence convergence and high levels of incomplete lineage sorting that occurred during a rapid radiation after the Cretaceous-Paleogene mass extinction event about 66 million years ago.

1,624 citations

Journal ArticleDOI
TL;DR: It is shown that nuclear DNA has degraded at least twice as fast as mtDNA, a baseline for predicting long-term DNA survival in bone, and considerable sample-to-sample variance in DNA preservation could not be accounted for by geologic age.
Abstract: Claims of extreme survival of DNA have emphasized the need for reliable models of DNA degradation through time. By analysing mitochondrial DNA (mtDNA) from 158 radiocarbon-dated bones of the extinct New Zealand moa, we confirm empirically a long-hypothesized exponential decay relationship. The average DNA half-life within this geographically constrained fossil assemblage was estimated to be 521 years for a 242 bp mtDNA sequence, corresponding to a per nucleotide fragmentation rate (k) of 5.50 × 10(-6) per year. With an effective burial temperature of 13.1°C, the rate is almost 400 times slower than predicted from published kinetic data of in vitro DNA depurination at pH 5. Although best described by an exponential model (R(2) = 0.39), considerable sample-to-sample variance in DNA preservation could not be accounted for by geologic age. This variation likely derives from differences in taphonomy and bone diagenesis, which have confounded previous, less spatially constrained attempts to study DNA decay kinetics. Lastly, by calculating DNA fragmentation rates on Illumina HiSeq data, we show that nuclear DNA has degraded at least twice as fast as mtDNA. These results provide a baseline for predicting long-term DNA survival in bone.

477 citations

Journal ArticleDOI
TL;DR: Four Illumina library preparation protocols are compared, including two “single‐tube” methods developed for this study with the explicit aim of improving data quality and reducing preparation time and expenses, and single‐tube protocols increase library complexity, yield more reads that map uniquely to the reference genome, reduce processing time, and may decrease laboratory costs by 90%.
Abstract: In recent years, massive parallel sequencing has revolutionised the study of degraded DNA, thus enabling the field of ancient DNA to evolve into that of paleogenomics. Despite these advances, the recovery and sequencing of degraded DNA remains challenging due to limitations in the manipulation of chemically damaged and highly fragmented DNA molecules. In particular, the enzymatic reactions and DNA purification steps during library preparation can result in DNA template loss and sequencing biases, affecting downstream analyses. The development of library preparation methods that circumvent these obstacles and enable higher throughput are therefore of interest to researchers working with degraded DNA. In this study, we compare four Illumina library preparation protocols, including two “single-tube” methods developed for this study with the explicit aim of improving data quality and reducing preparation time and expenses. The methods are tested on grey wolf (Canis lupus) museum specimens. We found single-tube protocols increase library complexity, yield more reads that map uniquely to the reference genome, reduce processing time, and may decrease laboratory costs by 90%. Given the advantages of single-tube library preparations, we anticipate these methods will be of considerable interest to the growing field of paleogenomics and other applications investigating degraded DNA.

205 citations

Journal ArticleDOI
TL;DR: It is found that the initial diffusion of maize into the Southwest about 4,000 years ago is likely to have occurred along a highland route, followed by gene flow from a lowland coastal maize beginning at least 2,500 years ago.
Abstract: The origin of maize (Zea mays mays) in the US Southwest remains contentious, with conflicting archaeological data supporting either coastal(1-4) or highland(5,6) routes of diffusion of maize into the United States. Furthermore, the genetics of adaptation to the new environmental and cultural context of the Southwest is largely uncharacterized(7). To address these issues, we compared nuclear DNA from 32 archaeological maize samples spanning 6,000 years of evolution to modern landraces. We found that the initial diffusion of maize into the Southwest about 4,000 years ago is likely to have occurred along a highland route, followed by gene flow from a lowland coastal maize beginning at least 2,000 years ago. Our population genetic analysis also enabled us to differentiate selection during domestication for adaptation to the climatic and cultural environment of the Southwest, identifying adaptation loci relevant to drought tolerance and sugar content.

130 citations

Journal ArticleDOI
TL;DR: These findings provide the first direct evidence for the ethnic origins of enslaved Africans, at a time for which historical records are scarce, and demonstrate that genomic data provide another type of record that can shed new light on long-standing historical questions.
Abstract: Between 1500 and 1850, more than 12 million enslaved Africans were transported to the New World. The vast majority were shipped from West and West-Central Africa, but their precise origins are largely unknown. We used genome-wide ancient DNA analyses to investigate the genetic origins of three enslaved Africans whose remains were recovered on the Caribbean island of Saint Martin. We trace their origins to distinct subcontinental source populations within Africa, including Bantu-speaking groups from northern Cameroon and non-Bantu speakers living in present-day Nigeria and Ghana. To our knowledge, these findings provide the first direct evidence for the ethnic origins of enslaved Africans, at a time for which historical records are scarce, and demonstrate that genomic data provide another type of record that can shed new light on long-standing historical questions.

107 citations


Cited by
More filters
Journal ArticleDOI
TL;DR: The approach to utilizing available RNA-Seq and other data types in the authors' manual curation process for vertebrate, plant, and other species is summarized, and a new direction for prokaryotic genomes and protein name management is described.
Abstract: The RefSeq project at the National Center for Biotechnology Information (NCBI) maintains and curates a publicly available database of annotated genomic, transcript, and protein sequence records (http://www.ncbi.nlm.nih.gov/refseq/). The RefSeq project leverages the data submitted to the International Nucleotide Sequence Database Collaboration (INSDC) against a combination of computation, manual curation, and collaboration to produce a standard set of stable, non-redundant reference sequences. The RefSeq project augments these reference sequences with current knowledge including publications, functional features and informative nomenclature. The database currently represents sequences from more than 55,000 organisms (>4800 viruses, >40,000 prokaryotes and >10,000 eukaryotes; RefSeq release 71), ranging from a single record to complete genomes. This paper summarizes the current status of the viral, prokaryotic, and eukaryotic branches of the RefSeq project, reports on improvements to data access and details efforts to further expand the taxonomic representation of the collection. We also highlight diverse functional curation initiatives that support multiple uses of RefSeq data including taxonomic validation, genome annotation, comparative genomics, and clinical testing. We summarize our approach to utilizing available RNA-Seq and other data types in our manual curation process for vertebrate, plant, and other species, and describe a new direction for prokaryotic genomes and protein name management.

4,104 citations

Journal ArticleDOI
TL;DR: PartitionFinder 2 is a program for automatically selecting best-fit partitioning schemes and models of evolution for phylogenetic analyses that includes the ability to analyze morphological datasets, new methods to analyze genome-scale datasets, and new output formats to facilitate interoperability with downstream software.
Abstract: PartitionFinder 2 is a program for automatically selecting best-fit partitioning schemes and models of evolution for phylogenetic analyses. PartitionFinder 2 is substantially faster and more efficient than version 1, and incorporates many new methods and features. These include the ability to analyze morphological datasets, new methods to analyze genome-scale datasets, new output formats to facilitate interoperability with downstream software, and many new models of molecular evolution. PartitionFinder 2 is freely available under an open source license and works on Windows, OSX, and Linux operating systems. It can be downloaded from www.robertlanfear.com/partitionfinder. The source code is available at https://github.com/brettc/partitionfinder.

3,445 citations

Book ChapterDOI
31 Jan 1963

2,885 citations

Journal ArticleDOI
TL;DR: This work presents BUSCO v3 with example analyses that highlight the wide‐ranging utility of BUSCO assessments, which extend beyond quality control of genomics data sets to applications in comparative genomics analyses, gene predictor training, metagenomics, and phylogenomics.
Abstract: Genomics promises comprehensive surveying of genomes and metagenomes, but rapidly changing technologies and expanding data volumes make evaluation of completeness a challenging task. Technical sequencing quality metrics can be complemented by quantifying completeness of genomic data sets in terms of the expected gene content of Benchmarking Universal Single-Copy Orthologs (BUSCO, http://busco.ezlab.org). The latest software release implements a complete refactoring of the code to make it more flexible and extendable to facilitate high-throughput assessments. The original six lineage assessment data sets have been updated with improved species sampling, 34 new subsets have been built for vertebrates, arthropods, fungi, and prokaryotes that greatly enhance resolution, and data sets are now also available for nematodes, protists, and plants. Here, we present BUSCO v3 with example analyses that highlight the wide-ranging utility of BUSCO assessments, which extend beyond quality control of genomics data sets to applications in comparative genomics analyses, gene predictor training, metagenomics, and phylogenomics.

1,575 citations

Journal ArticleDOI
TL;DR: The Environment for Tree Exploration v3 is presented, featuring numerous improvements in the underlying library of methods, and providing a novel set of standalone tools to perform common tasks in comparative genomics and phylogenetics.
Abstract: The Environment for Tree Exploration (ETE) is a computational framework that simplifies the reconstruction, analysis, and visualization of phylogenetic trees and multiple sequence alignments. Here, we present ETE v3, featuring numerous improvements in the underlying library of methods, and providing a novel set of standalone tools to perform common tasks in comparative genomics and phylogenetics. The new features include (i) building gene-based and supermatrix-based phylogenies using a single command, (ii) testing and visualizing evolutionary models, (iii) calculating distances between trees of different size or including duplications, and (iv) providing seamless integration with the NCBI taxonomy database. ETE is freely available at http://etetoolkit.org.

1,452 citations