Home
/
Authors
/
Nadia Chuzhanova

Author

Nadia Chuzhanova

Other affiliations: Russian Academy of Sciences, University of Central Lancashire, University of Wales ...read more

Bio: Nadia Chuzhanova is an academic researcher from Nottingham Trent University. The author has contributed to research in topics: Gene & Mutation. The author has an hindex of 40, co-authored 101 publications receiving 6808 citations. Previous affiliations of Nadia Chuzhanova include Russian Academy of Sciences & University of Central Lancashire.

Topics: Gene, Mutation, Gene mutation, Human genome, Gene conversion ...read more

Papers published on a yearly basis

2022
2020
2019
2018
2017
2016
2015
2014
2013
2012
2011
2010
2009
2008
2007
2006
2005
2004
2003
2002
2001
2000
1999
1998
1990

Papers

PDF

Open Access

More filters

Journal Article•DOI•

At least 1 in 20 16S rRNA sequence records currently held in public repositories is estimated to contain substantial anomalies

[...]

Kevin E. Ashelford¹, Nadia Chuzhanova¹, John C. Fry¹, Antonia J. Jones¹, Andrew J. Weightman¹ - Show less +1 more•Institutions (1)

Cardiff University¹

01 Dec 2005-Applied and Environmental Microbiology

TL;DR: The method is implemented as a program with a simple-to-use graphic user interface that is capable of running on a range of computer platforms and concludes that, as a conservative estimate, 1 in every 20 public database records is likely to be corrupt.

...read moreread less

Abstract: A new method for detecting chimeras and other anomalies within 16S rRNA sequence records is presented. Using this method, we screened 1,399 sequences from 19 phyla, as defined by the Ribosomal Database Project, release 9, update 22, and found 5.0% to harbor substantial errors. Of these, 64.3% were obvious chimeras, 14.3% were unidentified sequencing errors, and 21.4% were highly degenerate. In all, 11 phyla contained obvious chimeras, accounting for 0.8 to 11% of the records for these phyla. Many chimeras (43.1%) were formed from parental sequences belonging to different phyla. While most comprised two fragments, 13.7% were composed of at least three fragments, often from three different sources. A separate analysis of the Bacteroidetes phylum (2,739 sequences) also revealed 5.8% records to be anomalous, of which 65.4% were apparently chimeric. Overall, we conclude that, as a conservative estimate, 1 in every 20 public database records is likely to be corrupt. Our results support concerns recently expressed over the quality of the public repositories. With 16S rRNA sequence data increasingly playing a dominant role in bacterial systematics and environmental biodiversity studies, it is vital that steps be taken to improve screening of sequences prior to submission. To this end, we have implemented our method as a program with a simple-to-use graphic user interface that is capable of running on a range of computer platforms. The program is called Pintail, is released under the terms of the GNU General Public License open source license, and is freely available from our website at http://www.cardiff.ac.uk/biosi/research/biosoft/.

...read moreread less

802 citations

Journal Article•DOI•

New screening software shows that most recent large 16S rRNA gene clone libraries contain chimeras.

[...]

Kevin E. Ashelford¹, Nadia Chuzhanova¹, John C. Fry¹, Antonia J. Jones¹, Andrew J. Weightman¹ - Show less +1 more•Institutions (1)

Cardiff University¹

01 Sep 2006-Applied and Environmental Microbiology

TL;DR: A new computer program, called Mallard, is presented for screening entire 16S rRNA gene libraries of up to 1,000 sequences for chimeras and other artifacts, which far exceed previous estimates of artifacts within public repositories and highlight the urgent need for all researchers to adequately screen their libraries prior to submission.

...read moreread less

Abstract: A new computer program, called Mallard, is presented for screening entire 16S rRNA gene libraries of up to 1,000 sequences for chimeras and other artifacts. Written in the Java computer language and capable of running on all major operating systems, the program provides a novel graphical approach for visualizing phylogenetic relationships among 16S rRNA gene sequences. To illustrate its use, we analyzed most of the large libraries of cloned bacterial 16S rRNA gene sequences submitted to the public repository during 2005. Defining a large library as one containing 100 or more sequences of 1,200 bases or greater, we screened 25 of the 28 libraries and found that all but three contained substantial anomalies. Overall, 543 anomalous sequences were found. The average anomaly content per clone library was 9.0%, 4% higher than that previously estimated for the public repository overall. In addition, 90.8% of anomalies had characteristic chimeric patterns, a rise of 25.4% over that found previously. One library alone was found to contain 54 chimeras, representing 45.8% of its content. These figures far exceed previous estimates of artifacts within public repositories and further highlight the urgent need for all researchers to adequately screen their libraries prior to submission. Mallard is freely available from our website at http://www.cardiff.ac.uk/biosi/research/biosoft/.

...read moreread less

711 citations

Journal Article•DOI•

Gene conversion: mechanisms, evolution and human disease

[...]

Jian-Min Chen¹, David Neil Cooper², Nadia Chuzhanova³, Claude Férec, George P. Patrinos⁴ - Show less +1 more•Institutions (4)

French Institute of Health and Medical Research¹, Cardiff University², University of Central Lancashire³, Erasmus University Medical Center⁴

01 Oct 2007-Nature Reviews Genetics

TL;DR: Current thinking about how gene conversion occurs is assessed, the key part it has played in fashioning extant human genes is explored, and a meta-analysis of gene-conversion events that are known to have caused human genetic disease is carried out.

...read moreread less

Abstract: Gene conversion, one of the two mechanisms of homologous recombination, involves the unidirectional transfer of genetic material from a 'donor' sequence to a highly homologous 'acceptor'. Considerable progress has been made in understanding the molecular mechanisms that underlie gene conversion, its formative role in human genome evolution and its implications for human inherited disease. Here we assess current thinking about how gene conversion occurs, explore the key part it has played in fashioning extant human genes, and carry out a meta-analysis of gene-conversion events that are known to have caused human genetic disease.

...read moreread less

609 citations

Journal Article•DOI•

A meta‐analysis of nonsense mutations causing human genetic disease

[...]

Matthew Mort¹, Dobril Ivanov¹, David Neil Cooper¹, Nadia Chuzhanova²•Institutions (2)

Cardiff University¹, University of Central Lancashire²

01 Aug 2008-Human Mutation

TL;DR: The proportion of disease‐causing nonsense mutations predicted to elicit nonsense‐mediated mRNA decay (NMD) is significantly higher than among nonobserved (potential) nonsense mutations, implying that nonsense mutations that elicit NMD are more likely to come to clinical attention.

...read moreread less

Abstract: Nonsense mutations account for ∼11% of all described gene lesions causing human inherited disease and ∼20% of disease-associated single-basepair substitutions affecting gene coding regions. Pathological nonsense mutations resulting in TGA (38.5%), TAG (40.4%), and TAA (21.1%) occur in different proportions to naturally occurring stop codons. Of the 23 different nucleotide substitutions giving rise to nonsense mutations, the most frequent are CGA → TGA (21%; resulting from methylation-mediated deamination) and CAG → TAG (19%). The differing nonsense mutation frequencies are largely explicable in terms of variable nucleotide substitution rates such that it is unnecessary to invoke differential translational termination efficiency or differential codon usage. Some genes are characterized by numerous nonsense mutations but relatively few if any missense mutations (e.g., CHM) whereas other genes exhibit many missense mutations but few if any nonsense mutations (e.g., PSEN1). Genes in the latter category have a tendency to encode proteins characterized by multimer formation. Consistent with the operation of a clinical selection bias, genes exhibiting an excess of nonsense mutations are also likely to display an excess of frameshift mutations. Tumor suppressor (TS) genes exhibit a disproportionate number of nonsense mutations while most mutations in oncogenes are missense. A total of 12% of somatic nonsense mutations in TS genes were found to occur recurrently in the hypermutable CpG dinucleotide. In a comparison of somatic and germline mutational spectra for 17 TS genes, ∼43% of somatic nonsense mutations had counterparts in the germline (rising to 98% for CpG mutations). Finally, the proportion of disease-causing nonsense mutations predicted to elicit nonsense-mediated mRNA decay (NMD) is significantly higher (P=1.56 × 10−9) than among nonobserved (potential) nonsense mutations, implying that nonsense mutations that elicit NMD are more likely to come to clinical attention.

...read moreread less

331 citations

Journal Article•DOI•

An absence of cutaneous neurofibromas associated with a 3-bp inframe deletion in Exon 17 of the NF1 gene (c.2970-2972 delAAT): evidence of a clinically significant NF1 genotype-phenotype correlation

[...]

Meena Upadhyaya¹, Susan M Huson², M. Davies¹, Nicholas Stuart Tudor Thomas¹, Nadia Chuzhanova¹, S. Giovannini², D G R Evans², E. Howard², Bronwyn Kerr², Sian Wyn Griffiths¹, Claudia Consoli¹, Lucy Side, Darius J. Adams³, Mary Ella M Pierpont, Rachel K. Hachen⁴, A. Barnicoat⁵, Hua Li⁶, P. Wallace⁶, J. P. Van Biervliet, David A. Stevenson⁷, Dave Viskochil⁷, Diana Baralle, Eric Haan⁸, Vincent M. Riccardi, Peter D. Turnpenny⁹, Conxi Lázaro, Ludwine Messiaen¹⁰ - Show less +23 more•Institutions (10)

Cardiff University¹, St Mary's Hospital², Albany Medical College³, University of Pennsylvania⁴, University College London⁵, University of Florida⁶, University of Utah⁷, University of Adelaide⁸, Royal Devon and Exeter Hospital⁹, University of Alabama at Birmingham¹⁰

01 Jan 2007-American Journal of Human Genetics

TL;DR: These data represent results from the first study to correlate a specific small mutation of the NF1 gene to the expression of a particular clinical phenotype, and the biological mechanism that relates this specific mutation to the suppression of cutaneous neurofibroma development is unknown.

...read moreread less

Abstract: Neurofibromatosis type 1 (NF1) is characterized by cafe-au-lait spots, skinfold freckling, and cutaneous neurofibromas. No obvious relationships between small mutations (<20 bp) of the NF1 gene and a specific phenotype have previously been demonstrated, which suggests that interaction with either unlinked modifying genes and/or the normal NF1 allele may be involved in the development of the particular clinical features associated with NF1. We identified 21 unrelated probands with NF1 (14 familial and 7 sporadic cases) who were all found to have the same c.2970-2972 delAAT (p.990delM) mutation but no cutaneous neurofibromas or clinically obvious plexiform neurofibromas. Molecular analysis identified the same 3-bp inframe deletion (c.2970-2972 delAAT) in exon 17 of the NF1 gene in all affected subjects. The ΔAAT mutation is predicted to result in the loss of one of two adjacent methionines (codon 991 or 992) (ΔMet991), in conjunction with silent ACA→ACG change of codon 990. These two methionine residues are located in a highly conserved region of neurofibromin and are expected, therefore, to have a functional role in the protein. Our data represent results from the first study to correlate a specific small mutation of the NF1 gene to the expression of a particular clinical phenotype. The biological mechanism that relates this specific mutation to the suppression of cutaneous neurofibroma development is unknown.

...read moreread less

308 citations

1
2
3
4
…
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21

Collapse

Cited by

PDF

Open Access

More filters

Journal Article•DOI•

UCHIME improves sensitivity and speed of chimera detection

[...]

Robert C. Edgar, Brian J. Haas¹, Jose C. Clemente¹, Christopher Quince¹, Rob Knight¹ - Show less +1 more•Institutions (1)

University of Colorado Boulder¹

01 Aug 2011-Bioinformatics

TL;DR: UCHIME has better sensitivity than ChimeraSlayer (previously the most sensitive database method), especially with short, noisy sequences, and in testing on artificial bacterial communities with known composition, UCHIME de novo sensitivity is shown to be comparable to Perseus.

...read moreread less

Abstract: Motivation: Chimeric DNA sequences often form during polymerase chain reaction amplification, especially when sequencing single regions (e.g. 16S rRNA or fungal Internal Transcribed Spacer) to assess diversity or compare populations. Undetected chimeras may be misinterpreted as novel species, causing inflated estimates of diversity and spurious inferences of differences between populations. Detection and removal of chimeras is therefore of critical importance in such experiments. Results: We describe UCHIME, a new program that detects chimeric sequences with two or more segments. UCHIME either uses a database of chimera-free sequences or detects chimeras de novo by exploiting abundance data. UCHIME has better sensitivity than ChimeraSlayer (previously the most sensitive database method), especially with short, noisy sequences. In testing on artificial bacterial communities with known composition, UCHIME de novo sensitivity is shown to be comparable to Perseus. UCHIME is >100× faster than Perseus and >1000× faster than ChimeraSlayer. Contact: [email protected] Availability: Source, binaries and data: http://drive5.com/uchime. Supplementary information:Supplementary data are available at Bioinformatics online.

...read moreread less

11,904 citations

Pattern Recognition and Machine Learning

[...]

Christopher M. Bishop¹•Institutions (1)

Microsoft¹

01 Jan 2006

TL;DR: Probability distributions of linear models for regression and classification are given in this article, along with a discussion of combining models and combining models in the context of machine learning and classification.

...read moreread less

Abstract: Probability Distributions.- Linear Models for Regression.- Linear Models for Classification.- Neural Networks.- Kernel Methods.- Sparse Kernel Machines.- Graphical Models.- Mixture Models and EM.- Approximate Inference.- Sampling Methods.- Continuous Latent Variables.- Sequential Data.- Combining Models.

...read moreread less

10,141 citations

Journal Article•DOI•

Greengenes, a Chimera-Checked 16S rRNA Gene Database and Workbench Compatible with ARB

[...]

Todd Z. DeSantis¹, Philip Hugenholtz², Neils Larsen, Mark Rojas³, Eoin L. Brodie¹, Keith Keller⁴, Thomas Huber⁵, Daniel Dalevi⁶, Ping Hu¹, Gary L. Andersen¹ - Show less +6 more•Institutions (6)

Lawrence Berkeley National Laboratory¹, Joint Genome Institute², Baylor University³, University of California, Berkeley⁴, University of Queensland⁵, Chalmers University of Technology⁶

01 Jul 2006-Applied and Environmental Microbiology

TL;DR: A 16S rRNA gene database (http://greengenes.lbl.gov) was used to provide chimera screening, standard alignment, and taxonomic classification using multiple published taxonomies as mentioned in this paper.

...read moreread less

Abstract: A 16S rRNA gene database (http://greengenes.lbl.gov) addresses limitations of public repositories by providing chimera screening, standard alignment, and taxonomic classification using multiple published taxonomies. It was found that there is incongruent taxonomic nomenclature among curators even at the phylum level. Putative chimeras were identified in 3% of environmental sequences and in 0.2% of records derived from isolates. Environmental sequences were classified into 100 phylum-level lineages in the Archaea and Bacteria.

...read moreread less

9,593 citations

Journal Article•DOI•

SILVA: a comprehensive online resource for quality checked and aligned ribosomal RNA sequence data compatible with ARB

[...]

Elmar Pruesse¹, Christian Quast², Katrin Knittel², Bernhard M. Fuchs², Wolfgang Ludwig², Jörg Peplies², Frank Oliver Glöckner² - Show less +3 more•Institutions (2)

Max Planck Society¹, Technische Universität München²

01 Dec 2007-Nucleic Acids Research

TL;DR: SILVA (from Latin silva, forest), was implemented to provide a central comprehensive web resource for up to date, quality controlled databases of aligned rRNA sequences from the Bacteria, Archaea and Eukarya domains.

...read moreread less

Abstract: Sequencing ribosomal RNA (rRNA) genes is currently the method of choice for phylogenetic reconstruction, nucleic acid based detection and quantification of microbial diversity. The ARB software suite with its corresponding rRNA datasets has been accepted by researchers worldwide as a standard tool for large scale rRNA analysis. However, the rapid increase of publicly available rRNA sequence data has recently hampered the maintenance of comprehensive and curated rRNA knowledge databases. A new system, SILVA (from Latin silva, forest), was implemented to provide a central comprehensive web resource for up to date, quality controlled databases of aligned rRNA sequences from the Bacteria, Archaea and Eukarya domains. All sequences are checked for anomalies, carry a rich set of sequence associated contextual information, have multiple taxonomic classifications, and the latest validly described nomenclature. Furthermore, two precompiled sequence datasets compatible with ARB are offered for download on the SILVA website: (i) the reference (Ref) datasets, comprising only high quality, nearly full length sequences suitable for in-depth phylogenetic analysis and probe design and (ii) the comprehensive Parc datasets with all publicly available rRNA sequences longer than 300 nucleotides suitable for biodiversity analyses. The latest publicly available database release 91 (August 2007) hosts 547 521 sequences split into 461 823 small subunit and 85 689 large subunit rRNAs.

...read moreread less

5,733 citations

Journal Article•DOI•

A review of feature selection techniques in bioinformatics

[...]

Yvan Saeys¹, Iñaki Inza¹, Pedro Larrañaga¹•Institutions (1)

University of the Basque Country¹

10 Sep 2007-Bioinformatics

TL;DR: A basic taxonomy of feature selection techniques is provided, providing their use, variety and potential in a number of both common as well as upcoming bioinformatics applications.

...read moreread less

Abstract: Feature selection techniques have become an apparent need in many bioinformatics applications. In addition to the large pool of techniques that have already been developed in the machine learning and data mining fields, specific applications in bioinformatics have led to a wealth of newly proposed techniques. In this article, we make the interested reader aware of the possibilities of feature selection, providing a basic taxonomy of feature selection techniques, and discussing their use, variety and potential in a number of both common as well as upcoming bioinformatics applications. Contact: yvan.saeys@psb.ugent.be Supplementary information: http://bioinformatics.psb.ugent.be/supplementary_data/yvsae/fsreview

...read moreread less

4,706 citations

1
2
3
4
…
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
61
62
63
64
65
66
67
68
69
70
71
72
73
74
75
76
77
78
79
80
81
82
83
84
85
86
87
88
89
90
91
92
93
94
95
96
97
98
99
100
101
102
103
104
105
106
107
108
109
110
111
112
113
114
115
116
117
118
119
120
121
122
123
124
125
126
127
128
129
130
131
132
133
134
135
136
137
138
139
140
141
142
143
144
145
146
147
148
149
150
151
152
153
154
155
156
157
158
159
160
161
162
163
164
165
166
167
168
169
170
171
172
173
174
175
176
177
178
179
180
181
182
183
184
185
186
187
188
189
190
191
192
193
194
195
196
197
198
199
200

Collapse