Home
/
Authors
/
Melissa J. Hubisz

Author

Melissa J. Hubisz

Other affiliations: Cold Spring Harbor Laboratory, University of Chicago, Howard Hughes Medical Institute

Bio: Melissa J. Hubisz is an academic researcher from Cornell University. The author has contributed to research in topics: Population & Genome. The author has an hindex of 37, co-authored 48 publications receiving 17392 citations. Previous affiliations of Melissa J. Hubisz include Cold Spring Harbor Laboratory & University of Chicago.

Topics: Population, Genome, Gene, Natural selection, Molecular evolution ...read more

Papers published on a yearly basis

2023
2022
2020
2019
2016
2015
2014
2013
2012
2011
2010
2009
2008
2007
2006
2005
2004

Papers

PDF

Open Access

More filters

Journal Article•DOI•

Inferring weak population structure with the assistance of sample group information.

[...]

Melissa J. Hubisz¹, Daniel Falush², Matthew Stephens, Jonathan K. Pritchard¹•Institutions (2)

Howard Hughes Medical Institute¹, University College Cork²

01 Sep 2009-Molecular Ecology Resources

TL;DR: It is demonstrated that the new models developed for the structure program allow structure to be detected at lower levels of divergence, or with less data, than the original structure models or principal components methods, and that they are not biased towards detecting structure when it is not present.

...read moreread less

Abstract: Genetic clustering algorithms require a certain amount of data to produce informative results. In the common situation that individuals are sampled at several locations, we show how sample group information can be used to achieve better results when the amount of data is limited. New models are developed for the structure program, both for the cases of admixture and no admixture. These models work by modifying the prior distribution for each individual's population assignment. The new prior distributions allow the proportion of individuals assigned to a particular cluster to vary by location. The models are tested on simulated data, and illustrated using microsatellite data from the CEPH Human Genome Diversity Panel. We demonstrate that the new models allow structure to be detected at lower levels of divergence, or with less data, than the original structure models or principal components methods, and that they are not biased towards detecting structure when it is not present. These models are implemented in a new version of structure which is freely available online at http://pritch.bsd.uchicago.edu/structure.html.

...read moreread less

3,105 citations

Journal Article•DOI•

Evolution of genes and genomes on the Drosophila phylogeny.

[...]

Andrew G. Clark¹, Michael B. Eisen², Michael B. Eisen³, Douglas Smith +426 more•Institutions (70)

08 Nov 2007-Nature

TL;DR: These genome sequences augment the formidable genetic tools that have made Drosophila melanogaster a pre-eminent model for animal genetics, and will further catalyse fundamental research on mechanisms of development, cell biology, genetics, disease, neurobiology, behaviour, physiology and evolution.

...read moreread less

Abstract: Comparative analysis of multiple genomes in a phylogenetic framework dramatically improves the precision and sensitivity of evolutionary inference, producing more robust results than single-genome analyses can provide. The genomes of 12 Drosophila species, ten of which are presented here for the first time (sechellia, simulans, yakuba, erecta, ananassae, persimilis, willistoni, mojavensis, virilis and grimshawi), illustrate how rates and patterns of sequence divergence across taxa can illuminate evolutionary processes on a genomic scale. These genome sequences augment the formidable genetic tools that have made Drosophila melanogaster a pre-eminent model for animal genetics, and will further catalyse fundamental research on mechanisms of development, cell biology, genetics, disease, neurobiology, behaviour, physiology and evolution. Despite remarkable similarities among these Drosophila species, we identified many putatively non-neutral changes in protein-coding genes, non-coding RNA genes, and cis-regulatory regions. These may prove to underlie differences in the ecology and behaviour of these diverse species.

...read moreread less

2,057 citations

Journal Article•DOI•

Detection of nonneutral substitution rates on mammalian phylogenies

[...]

Katherine S. Pollard¹, Melissa J. Hubisz, Kate R. Rosenbloom, Adam Siepel•Institutions (1)

University of California, San Francisco¹

01 Jan 2010-Genome Research

TL;DR: By applying phyloP to mammalian multiple alignments from the ENCODE project, it shed light on patterns of conservation/acceleration in known and predicted functional elements, approximate fractions of sites subject to constraint, and differences in clade-specific selection in the primate and glires clades.

...read moreread less

Abstract: Methods for detecting nucleotide substitution rates that are faster or slower than expected under neutral drift are widely used to identify candidate functional elements in genomic sequences. However, most existing methods consider either reductions (conservation) or increases (acceleration) in rate but not both, or assume that selection acts uniformly across the branches of a phylogeny. Here we examine the more general problem of detecting departures from the neutral rate of substitution in either direction, possibly in a clade-specific manner. We consider four statistical, phylogenetic tests for addressing this problem: a likelihood ratio test, a score test, a test based on exact distributions of numbers of substitutions, and the genomic evolutionary rate profiling (GERP) test. All four tests have been implemented in a freely available program called phyloP. Based on extensive simulation experiments, these tests are remarkably similar in statistical power. With 36 mammalian species, they all appear to be capable of fairly good sensitivity with low false-positive rates in detecting strong selection at individual nucleotides, moderate selection in 3-bp elements, and weaker or clade-specific selection in longer elements. By applying phyloP to mammalian multiple alignments from the ENCODE project, we shed light on patterns of conservation/acceleration in known and predicted functional elements, approximate fractions of sites subject to constraint, and differences in clade-specific selection in the primate and glires clades. We also describe new "Conservation" tracks in the UCSC Genome Browser that display both phyloP and phastCons scores for genome-wide alignments of 44 vertebrate species.

...read moreread less

1,895 citations

Journal Article•DOI•

Evolutionary and biomedical insights from the rhesus macaque genome

[...]

Richard A. Gibbs¹, Jeffrey Rogers², Michael G. Katze³, Roger E. Bumgarner³ +174 more•Institutions (28)

13 Apr 2007-Science

TL;DR: The genome sequence of an Indian-origin Macaca mulatta female is determined and compared with chimpanzees and humans to reveal the structure of ancestral primate genomes and to identify evidence for positive selection and lineage-specific expansions and contractions of gene families.

...read moreread less

Abstract: The rhesus macaque (Macaca mulatta) is an abundant primate species that diverged from the ancestors of Homo sapiens about 25 million years ago. Because they are genetically and physiologically similar to humans, rhesus monkeys are the most widely used nonhuman primate in basic and applied biomedical research. We determined the genome sequence of an Indian-origin Macaca mulatta female and compared the data with chimpanzees and humans to reveal the structure of ancestral primate genomes and to identify evidence for positive selection and lineage-specific expansions and contractions of gene families. A comparison of sequences from individual animals was used to investigate their underlying genetic diversity. The complete description of the macaque genome blueprint enhances the utility of this animal model for biomedical research and improves our understanding of the basic biology of the species.

...read moreread less

1,297 citations

Journal Article•DOI•

A high-resolution map of human evolutionary constraint using 29 mammals.

[...]

Kerstin Lindblad-Toh¹, Manuel Garber¹, Or Zuk¹, Michael F. Lin¹, Michael F. Lin², Brian J. Parker³, Stefan Washietl², Pouya Kheradpour¹, Pouya Kheradpour², Jason Ernst², Jason Ernst¹, Gregory E. Jordan⁴, Evan Mauceli¹, Lucas D. Ward¹, Lucas D. Ward², Craig B. Lowe⁵, Craig B. Lowe⁶, Craig B. Lowe⁷, Alisha K. Holloway⁸, Michele Clamp¹, Sante Gnerre¹, Jessica Alföldi¹, Kathryn Beal⁴, Jean Chang¹, Hiram Clawson⁶, James Cuff⁹, Federica Di Palma¹, Stephen Fitzgerald⁴, Paul Flicek⁴, Mitchell Guttman¹, Melissa J. Hubisz¹⁰, David B. Jaffe¹, Irwin Jungreis², W. James Kent⁸, Dennis Kostka⁸, Marcia Lara¹, André L. Martins¹⁰, Tim Massingham⁴, Ida Moltke³, Brian J. Raney⁶, Matthew D. Rasmussen², James Robinson¹, Alexander Stark¹¹, Albert J. Vilella⁴, Jiayu Wen³, Xiaohui Xie¹, Michael C. Zody¹, Kim C. Worley¹², Christie Kovar¹², Donna M. Muzny¹², Richard A. Gibbs¹², Wesley C. Warren¹³, Elaine R. Mardis¹³, George M. Weinstock¹³, George M. Weinstock¹², Richard K. Wilson¹³, Ewan Birney⁴, Elliott H. Margulies¹⁴, Javier Herrero⁴, Eric D. Green¹⁴, David Haussler⁶, David Haussler⁷, Adam Siepel¹⁰, Nick Goldman⁴, Katherine S. Pollard⁸, Jakob Skou Pedersen¹⁵, Jakob Skou Pedersen³, Eric S. Lander¹, Manolis Kellis², Manolis Kellis¹ - Show less +66 more•Institutions (15)

Massachusetts Institute of Technology¹, Vassar College², University of Copenhagen³, Wellcome Trust⁴, Stanford University⁵, University of California, Santa Cruz⁶, Howard Hughes Medical Institute⁷, University of California, San Francisco⁸, Harvard University⁹, Cornell University¹⁰, Research Institute of Molecular Pathology¹¹, Human Genome Sequencing Center¹², Washington University in St. Louis¹³, National Institutes of Health¹⁴, Aarhus University Hospital¹⁵

27 Oct 2011-Nature

TL;DR: The comparison of related genomes has emerged as a powerful lens for genome interpretation and sequencing and comparative analysis of 29 eutherian genomes confirm that at least 5.5% of the human genome has undergone purifying selection, and locate constrained elements covering ∼4.2%" of the genome.

...read moreread less

Abstract: The comparison of related genomes has emerged as a powerful lens for genome interpretation. Here we report the sequencing and comparative analysis of 29 eutherian genomes. We confirm that at least 5.5% of the human genome has undergone purifying selection, and locate constrained elements covering ∼4.2% of the genome. We use evolutionary signatures and comparisons with experimental data sets to suggest candidate functions for ∼60% of constrained bases. These elements reveal a small number of new coding exons, candidate stop codon readthrough events and over 10,000 regions of overlapping synonymous constraint within protein-coding exons. We find 220 candidate RNA structural families, and nearly a million elements overlapping potential promoter, enhancer and insulator regions. We report specific amino acid residues that have undergone positive selection, 280,000 non-coding elements exapted from mobile elements and more than 1,000 primate- and human-accelerated elements. Overlap with disease-associated variants indicates that our findings will be relevant for studies of human biology, health and disease.

...read moreread less

1,023 citations

1
2
3
4
…
5
6
7
8
9
10
11

Collapse

Cited by

PDF

Open Access

More filters

疟原虫var基因转换速率变化导致抗原变异[英]／Paul H, Robert P, Christodoulou Z, et al//Proc Natl Acad Sci U S A

[...]

宁北芳, 朱淮民

28 Jul 2005

TL;DR: PfPMP1）与感染红细胞、树突状组胞以及胎盘的单个或多个受体作用，在黏附及免疫逃避中起关键的作�ly.

...read moreread less

Abstract: 抗原变异可使得多种致病微生物易于逃避宿主免疫应答。表达在感染红细胞表面的恶性疟原虫红细胞表面蛋白1（PfPMP1）与感染红细胞、内皮细胞、树突状细胞以及胎盘的单个或多个受体作用，在黏附及免疫逃避中起关键的作用。每个单倍体基因组var基因家族编码约60种成员，通过启动转录不同的var基因变异体为抗原变异提供了分子基础。

...read moreread less

18,940 citations

Journal Article•DOI•

An integrated encyclopedia of DNA elements in the human genome

[...]

Principal investigators¹, Nhgri groups², Data production leads³, Lead analysts³•Institutions (3)

Wellcome Trust¹, University of Washington², Pennsylvania State University³

06 Sep 2012-Nature

TL;DR: The Encyclopedia of DNA Elements project provides new insights into the organization and regulation of the authors' genes and genome, and is an expansive resource of functional annotations for biomedical research.

...read moreread less

Abstract: The human genome encodes the blueprint of life, but the function of the vast majority of its nearly three billion bases is unknown. The Encyclopedia of DNA Elements (ENCODE) project has systematically mapped regions of transcription, transcription factor association, chromatin structure and histone modification. These data enabled us to assign biochemical functions for 80% of the genome, in particular outside of the well-studied protein-coding regions. Many discovered candidate regulatory elements are physically associated with one another and with expressed genes, providing new insights into the mechanisms of gene regulation. The newly identified elements also show a statistical correspondence to sequence variants linked to human disease, and can thereby guide interpretation of this variation. Overall, the project provides new insights into the organization and regulation of our genes and genome, and is an expansive resource of functional annotations for biomedical research.

...read moreread less

13,548 citations

Journal Article•

Statistical method for testing the neutral mutation hypothesis by DNA polymorphism.

[...]

Fumio Tajima¹•Institutions (1)

Kyushu University¹

30 Oct 1989-Genomics

TL;DR: It is suggested that the natural selection against large insertion/deletion is so weak that a large amount of variation is maintained in a population.

...read moreread less

11,521 citations

Journal Article•DOI•

PAML 4: Phylogenetic Analysis by Maximum Likelihood

[...]

Ziheng Yang¹•Institutions (1)

University College London¹

01 Aug 2007-Molecular Biology and Evolution

TL;DR: PAML, currently in version 4, is a package of programs for phylogenetic analyses of DNA and protein sequences using maximum likelihood (ML), which can be used to estimate parameters in models of sequence evolution and to test interesting biological hypotheses.

...read moreread less

Abstract: PAML, currently in version 4, is a package of programs for phylogenetic analyses of DNA and protein sequences using maximum likelihood (ML). The programs may be used to compare and test phylogenetic trees, but their main strengths lie in the rich repertoire of evolutionary models implemented, which can be used to estimate parameters in models of sequence evolution and to test interesting biological hypotheses. Uses of the programs include estimation of synonymous and nonsynonymous rates (d(N) and d(S)) between two protein-coding DNA sequences, inference of positive Darwinian selection through phylogenetic comparison of protein-coding genes, reconstruction of ancestral genes and proteins for molecular restoration studies of extinct life forms, combined analysis of heterogeneous data sets from multiple gene loci, and estimation of species divergence times incorporating uncertainties in fossil calibrations. This note discusses some of the major applications of the package, which includes example data sets to demonstrate their use. The package is written in ANSI C, and runs under Windows, Mac OSX, and UNIX systems. It is available at -- (http://abacus.gene.ucl.ac.uk/software/paml.html).

...read moreread less

10,773 citations

Journal Article•DOI•

ANNOVAR: functional annotation of genetic variants from high-throughput sequencing data

[...]

Kai Wang¹, Mingyao Li¹, Hakon Hakonarson¹•Institutions (1)

Children's Hospital of Philadelphia¹

01 Sep 2010-Nucleic Acids Research

TL;DR: The ANNOVAR tool to annotate single nucleotide variants and insertions/deletions, such as examining their functional consequence on genes, inferring cytogenetic bands, reporting functional importance scores, finding variants in conserved regions, or identifying variants reported in the 1000 Genomes Project and dbSNP is developed.

...read moreread less

Abstract: High-throughput sequencing platforms are generating massive amounts of genetic variation data for diverse genomes, but it remains a challenge to pinpoint a small subset of functionally important variants. To fill these unmet needs, we developed the ANNOVAR tool to annotate single nucleotide variants (SNVs) and insertions/deletions, such as examining their functional consequence on genes, inferring cytogenetic bands, reporting functional importance scores, finding variants in conserved regions, or identifying variants reported in the 1000 Genomes Project and dbSNP. ANNOVAR can utilize annotation databases from the UCSC Genome Browser or any annotation data set conforming to Generic Feature Format version 3 (GFF3). We also illustrate a 'variants reduction' protocol on 4.7 million SNVs and indels from a human genome, including two causal mutations for Miller syndrome, a rare recessive disease. Through a stepwise procedure, we excluded variants that are unlikely to be causal, and identified 20 candidate genes including the causal gene. Using a desktop computer, ANNOVAR requires ∼4 min to perform gene-based annotation and ∼15 min to perform variants reduction on 4.7 million variants, making it practical to handle hundreds of human genomes in a day. ANNOVAR is freely available at http://www.openbioinformatics.org/annovar/.

...read moreread less

10,461 citations

1
2
3
4
…
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
61
62
63
64
65
66
67
68
69
70
71
72
73
74
75
76
77
78
79
80
81
82
83
84
85
86
87
88
89
90
91
92
93
94
95
96
97
98
99
100
101
102
103
104
105
106
107
108
109
110
111
112
113
114
115
116
117
118
119
120
121
122
123
124
125
126
127
128
129
130
131
132
133
134
135
136
137
138
139
140
141
142
143
144
145
146
147
148
149
150
151
152
153
154
155
156
157
158
159
160
161
162
163
164
165
166
167
168
169
170
171
172
173
174
175
176
177
178
179
180
181
182
183
184
185
186
187
188
189
190
191
192
193
194
195
196
197
198
199
200

Collapse