scispace - formally typeset
Search or ask a question

Showing papers in "Human Genomics in 2016"


Journal ArticleDOI
TL;DR: The human and mouse Ly6/uPAR family gene and protein structure and genomic organization, expression, functions, and evolution are reviewed, and new names for novel family members are introduced.
Abstract: Members of the lymphocyte antigen-6 (Ly6)/urokinase-type plasminogen activator receptor (uPAR) superfamily of proteins are cysteine-rich proteins characterized by a distinct disulfide bridge pattern that creates the three-finger Ly6/uPAR (LU) domain. Although the Ly6/uPAR family proteins share a common structure, their expression patterns and functions vary. To date, 35 human and 61 mouse Ly6/uPAR family members have been identified. Based on their subcellular localization, these proteins are further classified as GPI-anchored on the cell membrane, or secreted. The genes encoding Ly6/uPAR family proteins are conserved across different species and are clustered in syntenic regions on human chromosomes 8, 19, 6 and 11, and mouse Chromosomes 15, 7, 17, and 9, respectively. Here, we review the human and mouse Ly6/uPAR family gene and protein structure and genomic organization, expression, functions, and evolution, and introduce new names for novel family members.

145 citations


Journal ArticleDOI
TL;DR: A data loader and mapper that connects sequence information from oncology patients to data stored in an in-house database, the JAX Clinical Knowledgebase, which can be queried readily to access comprehensive data for clinical reporting via customized reporting queries, and the capability to rapidly analyze the landscape of targeted therapies in clinical trials, enabling a unique view into current therapeutic availability in oncologists.
Abstract: Precision medicine in oncology relies on rapid associations between patient-specific variations and targeted therapeutic efficacy. Due to the advancement of genomic analysis, a vast literature characterizing cancer-associated molecular aberrations and relative therapeutic relevance has been published. However, data are not uniformly reported or readily available, and accessing relevant information in a clinically acceptable time-frame is a daunting proposition, hampering connections between patients and appropriate therapeutic options. One important therapeutic avenue for oncology patients is through clinical trials. Accordingly, a global view into the availability of targeted clinical trials would provide insight into strengths and weaknesses and potentially enable research focus. However, data regarding the landscape of clinical trials in oncology is not readily available, and as a result, a comprehensive understanding of clinical trial availability is difficult. To support clinical decision-making, we have developed a data loader and mapper that connects sequence information from oncology patients to data stored in an in-house database, the JAX Clinical Knowledgebase (JAX-CKB), which can be queried readily to access comprehensive data for clinical reporting via customized reporting queries. JAX-CKB functions as a repository to house expertly curated clinically relevant data surrounding our 358-gene panel, the JAX Cancer Treatment Profile (JAX CTP), and supports annotation of functional significance of molecular variants. Through queries of data housed in JAX-CKB, we have analyzed the landscape of clinical trials relevant to our 358-gene targeted sequencing panel to evaluate strengths and weaknesses in current molecular targeting in oncology. Through this analysis, we have identified patient indications, molecular aberrations, and targeted therapy classes that have strong or weak representation in clinical trials. Here, we describe the development and disseminate system methods for associating patient genomic sequence data with clinically relevant information, facilitating interpretation and providing a mechanism for informing therapeutic decision-making. Additionally, through customized queries, we have the capability to rapidly analyze the landscape of targeted therapies in clinical trials, enabling a unique view into current therapeutic availability in oncology.

117 citations


Journal ArticleDOI
TL;DR: In AKI, some miRNAs appear to act pathogenically by promoting inflammation, apoptosis, and fibrosis, while others may act protectively by exerting anti-inflammatory,anti-apoptotic, anti-fibrotic, and pro-angiogenic effects.
Abstract: Acute kidney injury (AKI) is an important clinical issue that is associated with significant morbidity and mortality. Despite research advances over the past decades, the complex pathophysiology of AKI is not fully understood. The regulatory mechanisms underlying post-AKI repair and fibrosis have not been clarified either. Furthermore, there is no definitively effective treatment for AKI. MicroRNAs (miRNAs) are endogenous single-stranded noncoding RNAs of 19~23 nucleotides that have been shown to be crucial to the post-transcriptional regulation of various cellular biological functions, including proliferation, differentiation, metabolism, and apoptosis. In addition to being fundamental to normal development and physiology, miRNAs also play important roles in various human diseases. In AKI, some miRNAs appear to act pathogenically by promoting inflammation, apoptosis, and fibrosis, while others may act protectively by exerting anti-inflammatory, anti-apoptotic, anti-fibrotic, and pro-angiogenic effects. Thus, miRNAs have not only emerged as novel biomarkers for AKI; they also hold promise to be potential therapeutic targets.

93 citations


Journal ArticleDOI
TL;DR: The genetic architecture of AMD is reviewed, considering the contribution of both common and rare alleles to susceptibility and the possible mechanistic links between photoreceptor degeneration and the alternative complement pathway, a cascade that has emerged as the most potent genetic driver of this disorder.
Abstract: Age-related macular degeneration (AMD) is an ocular neurodegenerative disorder and is the leading cause of legal blindness in Western societies, with a prevalence of up to 8 % over the age of 60, which continues to increase with age. AMD is characterized by the progressive breakdown of the macula (the central region of the retina), resulting in the loss of central vision including visual acuity. While its molecular etiology remains unclear, advances in genetics and genomics have illuminated the genetic architecture of the disease and have generated attractive pathomechanistic hypotheses. Here, we review the genetic architecture of AMD, considering the contribution of both common and rare alleles to susceptibility, and we explore the possible mechanistic links between photoreceptor degeneration and the alternative complement pathway, a cascade that has emerged as the most potent genetic driver of this disorder.

71 citations


Journal ArticleDOI
TL;DR: How the authors' gene families are curated is reviewed and how to view, search and download the gene family data is explained.
Abstract: The HUGO Gene Nomenclature Committee (HGNC) approves unique gene symbols and names for human loci. As well as naming genomic loci, we manually curate genes into family sets based on shared characteristics such as function, homology or phenotype. Each HGNC gene family has its own dedicated gene family report on our website, www.genenames.org. We have recently redesigned these reports to support the visualisation and browsing of complex relationships between families and to provide extra curated information such as family descriptions, protein domain graphics and gene family aliases. Here, we review how our gene families are curated and explain how to view, search and download the gene family data.

62 citations


Journal ArticleDOI
Kenneth K. Kidd1
TL;DR: The proposed nomenclature involves a simple root consisting of “mh” followed by the two-digit chromosome number and unique characters established by the authors in the initial publication of microhaplotypes.
Abstract: Microhaplotypes are a new type of genetic marker in forensics and population genetics. A standardized nomenclature is desirable. A simple approach that does not require a central authority for approval is proposed. The nomenclature proposed follows the recommendation of the HUGO Gene Nomenclature Committee ( http://www.genenames.org ): “We strongly encourage naming families and groups of genes related by sequence and/or function using a “root” symbol. This is an efficient and informative way to name related genes, and already works well for a number of established gene families…” The proposal involves a simple root consisting of “mh” followed by the two-digit chromosome number and unique characters established by the authors in the initial publication. We suggest the unique symbol be an indication of the laboratory followed by characters unique to the chromosome and laboratory. For instance, the microhaplotype symbol mh01KK-001 refers to a locus on chromosome 1 published by the Kidd Lab (KK-) as their #001. Publication defines mh01KK-001 as comprised of four single nucleotide polymorphisms (SNPs), rs4648344, rs6663840, rs58111155, and rs6688969.

43 citations


Journal ArticleDOI
TL;DR: High-throughput RNA sequencing provides novel insights into the pathogenesis mechanism of periodontitis in terms of gene expression and alternative splicing and reveals unique transcription variants inperiodontitis tissues.
Abstract: Periodontitis is the most common chronic inflammatory disease caused by complex interaction between the microbial biofilm and host immune responses. In the present study, high-throughput RNA sequencing was utilized to systemically and precisely identify gene expression profiles and alternative splicing. The pooled RNAs of 10 gingival tissues from both healthy and periodontitis patients were analyzed by deep sequencing followed by computational annotation and quantification of mRNA structures. The differential expression analysis designated 400 up-regulated genes in periodontitis tissues especially in the pathways of defense/immunity protein, receptor, protease, and signaling molecules. The top 10 most up-regulated genes were CSF3, MAFA, CR2, GLDC, SAA1, LBP, MME, MMP3, MME-AS1, and SAA4. The 62 down-regulated genes in periodontitis were mainly cytoskeletal and structural proteins. The top 10 most down-regulated genes were SERPINA12, MT4, H19, KRT2, DSC1, PSORS1C2, KRT27, LCE3C, AQ5, and LCE6A. The differential alternative splicing analysis revealed unique transcription variants in periodontitis tissues. The EDB exon was predominantly included in FN1, while exon 2 was mostly skipped in BCL2A1. These findings using RNA sequencing provide novel insights into the pathogenesis mechanism of periodontitis in terms of gene expression and alternative splicing.

39 citations


Journal ArticleDOI
TL;DR: In this paper, the authors evaluated a neurological gene panel as a first-line diagnostic test for limb-girdle muscular dystrophy (LGMD) and other myopathies.
Abstract: Fifty random genetically unstudied families (limb-girdle muscular dystrophy (LGMD)/myopathy) were screened with a gene panel incorporating 759 OMIM genes associated with neurological disorders. Average coverage of the CDS and 10 bp flanking regions of genes was 99 %. All families were referred to the Neurosciences Clinic of King Faisal Specialist Hospital and Research Centre, Saudi Arabia. Patients presented with muscle weakness affecting the pelvic and shoulder girdle. Muscle biopsy in all cases showed dystrophic or myopathic changes. Our main objective was to evaluate a neurological gene panel as a first-line diagnostic test for LGMD/myopathies. Our panel identified the mutation in 76 % of families (38/50; 11 novel). Thirty-four families had mutations in LGMD-related genes with four others having variants not typically associated with LGMD. The majority of cases had recessive inheritance with homoallelic pathogenic variants (97.4 %, 37/38), as expected considering the high rate of consanguinity in the study population. In one case, we detected a heterozygous mutation in DNAJB responsible for LGMD-1E. Our cohort included seven different subtypes of LGMD2. Mutations of DYSF were the most commonly identified cause of disease followed by that in CAPN3 and FKRP. Non-LGMD myopathies were due to mutations in genes associated with congenital disorder of glycosylation (ALG2), rigid spine muscular dystrophy 1 (SEPN1), inclusion body myopathy2/Nonaka myopathy (GNE), and neuropathy (WNK1). Whole exome sequencing (WES) of patients who remained undiagnosed with the neurological panel did not improve our diagnostic yield. Our neurological panel achieved a high clinical sensitivity (76 %) and is an effective first-line laboratory test in patients with LGMD and other myopathies. This sensitive, cost-effective, and rapid assay significantly assists clinical practice especially in these phenotypically and genetically heterogeneous disorders. Moreover, the application of the American College of Medical Genetics (ACMG) and Association for Molecular Pathology (AMP) guidelines applied in the classification of variant pathogenecity provides a clear interpretation for physicians on the relevance of such findings.

35 citations


Journal ArticleDOI
TL;DR: The data showed a lower number of collagen OI pathogenic variants in Vietnamese patients compared to reported rates for Asian populations, indicating the OI mutational profile of the Vietnamese population is unique and related to the presence of a high number of recessive mutations in non-collagenous OI genes.
Abstract: The genetics of osteogenesis imperfecta (OI) have not been studied in a Vietnamese population before. We performed mutational analysis of the COL1A1 and COL1A2 genes in 91 unrelated OI patients of Vietnamese origin. We then systematically characterized the mutation profiles of these two genes which are most commonly related to OI. Genomic DNA was extracted from EDTA-preserved blood according to standard high-salt extraction methods. Sequence analysis and pathogenic variant identification was performed with Mutation Surveyor DNA variant analysis software. Prediction of the pathogenicity of mutations was conducted using Alamut Visual software. The presence of variants was checked against Dalgleish’s osteogenesis imperfecta mutation database. The sample consisted of 91 unrelated osteogenesis imperfecta patients. We identified 54 patients with COL1A1/2 pathogenic variants; 33 with COL1A1 and 21 with COL1A2. Two patients had multiple pathogenic variants. Seventeen novel COL1A1 and 10 novel COL1A2 variants were identified. The majority of identified COL1A1/2 pathogenic variants occurred in a glycine substitution (36/56, 64.3 %), usually serine (23/36, 63.9 %). We found two pathogenic variants of the COL1A1 gene c.2461G > A (p.Gly821Ser) in four unrelated patients and one, c.2005G > A (p.Ala669Thr), in two unrelated patients. Our data showed a lower number of collagen OI pathogenic variants in Vietnamese patients compared to reported rates for Asian populations. The OI mutational profile of the Vietnamese population is unique and related to the presence of a high number of recessive mutations in non-collagenous OI genes. Further analysis of OI patients negative for collagen mutations, is required.

32 citations


Journal ArticleDOI
TL;DR: Hypomethylation appears to play an active, targeted, and complementary role in cancer progression, and it interplays with hypermethylation in a coordinated fashion in the cancer process.
Abstract: Methylation changes are frequent in cancers, but understanding how hyper- and hypomethylated region changes coordinate, associate with genomic features, and affect gene expression is needed to better understand their biological significance. The functional significance of hypermethylation is well studied, but that of hypomethylation remains limited. Here, with paired expression and methylation samples gathered from a patient/control cohort, we attempt to better characterize the gene expression and methylation changes that take place in cancer from B cell chronic lymphocyte leukemia (B-CLL) samples. Across the dataset, we found that consistent differentially hypomethylated regions (C-DMRs) across samples were relatively few compared to the many poorly consistent hypo- and highly conserved hyper-DMRs. However, genes in the hypo-C-DMRs tended to be associated with functions antagonistic to those in the hyper-C-DMRs, like differentiation, cell-cycle regulation and proliferation, suggesting coordinated regulation of methylation changes. Hypo-C-DMRs in B-CLL were found enriched in key signaling pathways like B cell receptor and p53 pathways and genes/motifs essential for B lymphopoiesis. Hypo-C-DMRs tended to be proximal to genes with elevated expression in contrast to the transcription silencing-mechanism imposed by hypermethylation. Hypo-C-DMRs tended to be enriched in the regions of activating H4K4me1/2/3, H3K79me2, and H3K27ac histone modifications. In comparison, the polycomb repressive complex 2 (PRC2) signature, marked by EZH2, SUZ12, CTCF binding-sites, repressive H3K27me3 marks, and “repressed/poised promoter” states were associated with hyper-C-DMRs. Most hypo-C-DMRs were found in introns (36 %), 3′ untranslated regions (29 %), and intergenic regions (24 %). Many of these genic regions also overlapped with enhancers. The methylation of CpGs from 3′UTR exons was found to have weak but positive correlation with gene expression. In contrast, methylation in the 5′UTR was negatively correlated with expression. To better characterize the overlap between methylation and expression changes, we identified correlation modules that associate with “apoptosis” and “leukocyte activation”. Despite clinical heterogeneity in disease presentation, a number of methylation changes, both hypo and hyper, appear to be common in B-CLL. Hypomethylation appears to play an active, targeted, and complementary role in cancer progression, and it interplays with hypermethylation in a coordinated fashion in the cancer process.

32 citations


Journal ArticleDOI
TL;DR: The authors' integrative analysis through both genetic and functional approaches identified two candidate genes (TACC2 and MYO1E) that augment cigarette smoke (CS)-induced cytotoxicity and, potentially, COPD susceptibility.
Abstract: Chronic obstructive pulmonary disease (COPD) is characterized by an irreversible airflow limitation in response to inhalation of noxious stimuli, such as cigarette smoke. However, only 15–20 % smokers manifest COPD, suggesting a role for genetic predisposition. Although genome-wide association studies have identified common genetic variants that are associated with susceptibility to COPD, effect sizes of the identified variants are modest, as is the total heritability accounted for by these variants. In this study, an extreme phenotype exome sequencing study was combined with in vitro modeling to identify COPD candidate genes. We performed whole exome sequencing of 62 highly susceptible smokers and 30 exceptionally resistant smokers to identify rare variants that may contribute to disease risk or resistance to COPD. This was a cross-sectional case-control study without therapeutic intervention or longitudinal follow-up information. We identified candidate genes based on rare variant analyses and evaluated exonic variants to pinpoint individual genes whose function was computationally established to be significantly different between susceptible and resistant smokers. Top scoring candidate genes from these analyses were further filtered by requiring that each gene be expressed in human bronchial epithelial cells (HBECs). A total of 81 candidate genes were thus selected for in vitro functional testing in cigarette smoke extract (CSE)-exposed HBECs. Using small interfering RNA (siRNA)-mediated gene silencing experiments, we showed that silencing of several candidate genes augmented CSE-induced cytotoxicity in vitro. Our integrative analysis through both genetic and functional approaches identified two candidate genes (TACC2 and MYO1E) that augment cigarette smoke (CS)-induced cytotoxicity and, potentially, COPD susceptibility.

Journal ArticleDOI
TL;DR: The analysis of various versions of GENCODE revealed that there was a constant upgradation of transcripts for both protein-coding and long noncoding RNA (lncRNAs) leading to conflicting annotations.
Abstract: Our understanding of the transcriptional potential of the genome and its functional consequences has undergone a significant change in the last decade This has been largely contributed by the improvements in technology which could annotate and in many cases functionally characterize a number of novel gene loci in the human genome Keeping pace with advancements in this dynamic environment and being able to systematically annotate a compendium of genes and transcripts is indeed a formidable task Of the many databases which attempted to systematically annotate the genome, GENCODE has emerged as one of the largest and popular compendium for human genome annotations The analysis of various versions of GENCODE revealed that there was a constant upgradation of transcripts for both protein-coding and long noncoding RNA (lncRNAs) leading to conflicting annotations The GENCODE version 24 accounts for 418 % of the human genome to be transcribed which is an increase of 158 % from its first version Out of 2,51,614 transcripts annotated across GENCODE versions, only 217 % had consistency We also examined GENCODE consortia categorized transcripts into 70 biotypes out of which only 17 remained stable throughout In this report, we try to review the impact on the dynamicity with respect to gene annotations, specifically (lncRNA) annotations in GENCODE over the years Our analysis suggests a significant dynamism in gene annotations, reflective of the evolution and consensus in nomenclature of genes While a progressive change in annotations and timely release of the updates make the resource reliable in the community, the dynamicity with each release poses unique challenges to its users Taking cues from other experiments with bio-curation, we propose potential avenues and methods to mend the gap

Journal ArticleDOI
TL;DR: All the inflammation-related single nucleotide polymorphisms involved in cytosine-phosphate-guanine (CpG) dinucleotides were associated with CHD in the combined or subgroup tests, suggesting an important role of inflammation in the risk of CHD.
Abstract: Chronic inflammation has been widely considered to be the major risk factor of coronary heart disease (CHD). The goal of our study was to explore the possible association with CHD for inflammation-related single nucleotide polymorphisms (SNPs) involved in cytosine-phosphate-guanine (CpG) dinucleotides. A total of 784 CHD patients and 739 non-CHD controls were recruited from Zhejiang Province, China. Using the Sequenom MassARRAY platform, we measured the genotypes of six inflammation-related CpG-SNPs, including IL1B rs16944, IL1R2 rs2071008, PLA2G7 rs9395208, FAM5C rs12732361, CD40 rs1800686, and CD36 rs2065666). Allele and genotype frequencies were compared between CHD and non-CHD individuals using the CLUMP22 software with 10,000 Monte Carlo simulations. Allelic tests showed that PLA2G7 rs9395208 and CD40 rs1800686 were significantly associated with CHD. Moreover, IL1B rs16944, PLA2G7 rs9395208, and CD40 rs1800686 were shown to be associated with CHD under the dominant model. Further gender-based subgroup tests showed that one SNP (CD40 rs1800686) and two SNPs (FAM5C rs12732361 and CD36 rs2065666) were associated with CHD in females and males, respectively. And the age-based subgroup tests indicated that PLA2G7 rs9395208, IL1B rs16944, and CD40 rs1800686 were associated with CHD among individuals younger than 55, younger than 65, and over 65, respectively. In conclusion, all the six inflammation-related CpG-SNPs (rs16944, rs2071008, rs12732361, rs2065666, rs9395208, and rs1800686) were associated with CHD in the combined or subgroup tests, suggesting an important role of inflammation in the risk of CHD.

Journal ArticleDOI
TL;DR: Transcriptogram analysis identifies novel pathways altered in ADPKD, providing new avenues to identify both AD PKD’s mechanisms of pathogenesis and pharmaceutical targets to ameliorate the progression of the disease.
Abstract: Autosomal dominant polycystic kidney disease (ADPKD) causes progressive loss of renal function in adults as a consequence of the accumulation of cysts. ADPKD is the most common genetic cause of end-stage renal disease. Mutations in polycystin-1 occur in 87% of cases of ADPKD and mutations in polycystin-2 are found in 12% of ADPKD patients. The complexity of ADPKD has hampered efforts to identify the mechanisms underlying its pathogenesis. No current FDA (Federal Drug Administration)-approved therapies ameliorate ADPKD progression. We used the de Almeida laboratory’s sensitive new transcriptogram method for whole-genome gene expression data analysis to analyze microarray data from cell lines developed from cell isolates of normal kidney and of both non-cystic nephrons and cysts from the kidney of a patient with ADPKD. We compared results obtained using standard Ingenuity Volcano plot analysis, Gene Set Enrichment Analysis (GSEA) and transcriptogram analysis. Transcriptogram analysis confirmed the findings of Ingenuity, GSEA, and published analysis of ADPKD kidney data and also identified multiple new expression changes in KEGG (Kyoto Encyclopedia of Genes and Genomes) pathways related to cell growth, cell death, genetic information processing, nucleotide metabolism, signal transduction, immune response, response to stimulus, cellular processes, ion homeostasis and transport and cofactors, vitamins, amino acids, energy, carbohydrates, drugs, lipids, and glycans. Transcriptogram analysis also provides significance metrics which allow us to prioritize further study of these pathways. Transcriptogram analysis identifies novel pathways altered in ADPKD, providing new avenues to identify both ADPKD’s mechanisms of pathogenesis and pharmaceutical targets to ameliorate the progression of the disease.

Journal ArticleDOI
Bihai Zhao1, Sai Hu1, Xueyong Li1, Fan Zhang1, Qinglong Tian1, Wen-yin Ni1 
TL;DR: The proposed FP-MPN method was a better predictor when used on Saccharomyces cerevisiae protein data than other function prediction methods previously used.
Abstract: Background Accurate annotation of protein functions is still a big challenge for understanding life in the post-genomic era. Many computational methods based on protein-protein interaction (PPI) networks have been proposed to predict the function of proteins. However, the precision of these predictions still needs to be improved, due to the incompletion and noise in PPI networks. Integrating network topology and biological information could improve the accuracy of protein function prediction and may also lead to the discovery of multiple interaction types between proteins. Current algorithms generate a single network, which is archived using a weighted sum of all types of protein interactions.

Journal ArticleDOI
TL;DR: The findings revealed genes with potential to be associated with risk assessment and tumorigenesis of this rare type of sarcoma and established a liquid biopsy approach for monitoring patient follow-up based on genomic information that can be similarly adopted for patients diagnosed with a rare tumor.
Abstract: Genome-wide profiling of rare tumors is crucial for improvement of diagnosis, treatment, and, consequently, achieving better outcomes. Desmoplastic small round cell tumor (DSRCT) is a rare type of sarcoma arising from mesenchymal cells of abdominal peritoneum that usually develops in male adolescents and young adults. A specific translocation, t(11;22)(p13;q12), resulting in EWS and WT1 gene fusion is the only recurrent molecular hallmark and no other genetic factor has been associated to this aggressive tumor. Here, we present a comprehensive genomic profiling of one DSRCT affecting a 26-year-old male, who achieved an excellent outcome. We investigated somatic and germline variants through whole-exome sequencing using a family based approach and, by array CGH, we explored the occurrence of genomic imbalances. Additionally, we performed mate-paired whole-genome sequencing for defining the specific breakpoint of the EWS-WT1 translocation, allowing us to develop a personalized tumor marker for monitoring the patient by liquid biopsy. We identified genetic variants leading to protein alterations including 12 somatic and 14 germline events (11 germline compound heterozygous mutations and 3 rare homozygous polymorphisms) affecting genes predominantly involved in mesenchymal cell differentiation pathways. Regarding copy number alterations (CNA) few events were detected, mainly restricted to gains in chromosomes 5 and 18 and losses at 11p, 13q, and 22q. The deletions at 11p and 22q indicated the presence of the classic translocation, t(11;22)(p13;q12). In addition, the mapping of the specific genomic breakpoint of the EWS-WT1 gene fusion allowed the design of a personalized biomarker for assessing circulating tumor DNA (ctDNA) in plasma during patient follow-up. This biomarker has been used in four post-treatment blood samples, 3 years after surgery, and no trace of EWS-WT1 gene fusion was detected, in accordance with imaging tests showing no evidence of disease and with the good general health status of the patient. Overall, our findings revealed genes with potential to be associated with risk assessment and tumorigenesis of this rare type of sarcoma. Additionally, we established a liquid biopsy approach for monitoring patient follow-up based on genomic information that can be similarly adopted for patients diagnosed with a rare tumor.

Journal ArticleDOI
TL;DR: The multiphasic nature of changes in global methylation levels with age was delineated, and investigation into the factors underlying this profile will be essential to a proper understanding of the aging process.
Abstract: The change in epigenetic signatures, in particular DNA methylation, has been proposed as risk markers for various age-related diseases. However, the course of variation in methylation levels with age, the difference in methylation between genders, and methylation-disease association at the whole genome level is unclear. In the present study, genome-wide methylation levels in DNA extracted from peripheral blood for 2116 healthy Chinese in the 2–97 age range and 280 autistic trios were examined using the fluorescence polarization-based genome-wide DNA methylation quantification method developed by us. Genome-wide or global DNA methylation levels proceeded through multiple phases of variation with age, consisting of a steady increase from age 2 to 25 (r = 0.382) and another rise from age 41 to 55 to reach a peak level of ~80 % (r = 0.265), followed by a sharp decrease to ~40 % in the mid-1970s (age 56 to 75; r = −0.395) and leveling off thereafter. Significant gender effect in methylation levels was observed only for the 41–55 age group in which methylation in females was significantly higher than in males (p = 0.010). In addition, global methylation level was significantly higher in autistic children than in age-matched healthy children (p < 0.001). The multiphasic nature of changes in global methylation levels with age was delineated, and investigation into the factors underlying this profile will be essential to a proper understanding of the aging process. Furthermore, this first report of global hypermethylation in autistic children also illustrates the importance of age-matched controls in characterization of disease-associated variations in DNA methylation.

Journal ArticleDOI
TL;DR: The results are presented in order to enlighten the readership that the ~100% Kras mutant frequency and the high Her2 amplification rate reported by Jayson et al may be too exaggerated to be applicable into all populations.
Abstract: Jayson GC et al. remarked in Lancet that nearly 100% of mucinous ovarian cancer cases have Kras mutation as well as a high frequency of Her2 amplification. Using the Abbott PathVysion Her2 DNA Probe Kit and Kras mutant-enriched PCR Kits (FemtoPath®), 21 samples of primary ovarian mucinous cystadenocarcinomas from Taiwanese patients were examined to determine the status of Her2 amplification and Kras mutations. Our results showed the Her2 amplification rates were 33.33%, while the Kras mutation rates were 61.90%. We present here our results in order to enlighten the readership that the ~100% Kras mutant frequency and the high Her2 amplification rate reported by Jayson et al. may be too exaggerated to be applicable into all populations. Additionally, we report another 2 novel Kras mutations (A11V, V14I).

Journal ArticleDOI
TL;DR: This multidisciplinary task force was instrumental to formally approve the reimbursement of HTS for molecular diagnosis of Mendelian disorders in Switzerland and to continuously improve the diagnostic use of H TS.
Abstract: In order to optimally integrate the use of high-throughput sequencing (HTS) as a tool in clinical diagnostics of likely monogenic disorders, we have created a multidisciplinary “Genome Clinic Task Force” at the University Hospitals of Geneva, which is composed of clinical and molecular geneticists, bioinformaticians, technicians, bioethicists, and a coordinator. We have implemented whole exome sequencing (WES) with subsequent targeted bioinformatics analysis of gene lists for specific disorders. Clinical cases of heterogeneous Mendelian disorders that could potentially benefit from HTS are presented and discussed during the sessions of the task force. Debate concerning the interpretation of identified variants and the content of the final report constitutes a major part of the task force’s work. Furthermore, issues related to bioethics, genetic counseling, quality control, and reimbursement are also addressed. This multidisciplinary task force has enabled us to create a platform for regular exchanges between all involved experts in order to deal with the multiple complex issues related to HTS in clinical practice and to continuously improve the diagnostic use of HTS. In addition, this task force was instrumental to formally approve the reimbursement of HTS for molecular diagnosis of Mendelian disorders in Switzerland.

Journal ArticleDOI
TL;DR: This study demonstrates that various factors such as coverage depth, read length, and genome size may influence performance of individual k-spectrum-based error correction methods and recommends Musket as the top choice because of its consistently superior performance across all six testing datasets.
Abstract: Innumerable opportunities for new genomic research have been stimulated by advancement in high-throughput next-generation sequencing (NGS). However, the pitfall of NGS data abundance is the complication of distinction between true biological variants and sequence error alterations during downstream analysis. Many error correction methods have been developed to correct erroneous NGS reads before further analysis, but independent evaluation of the impact of such dataset features as read length, genome size, and coverage depth on their performance is lacking. This comparative study aims to investigate the strength and weakness as well as limitations of some newest k-spectrum-based methods and to provide recommendations for users in selecting suitable methods with respect to specific NGS datasets. Six k-spectrum-based methods, i.e., Reptile, Musket, Bless, Bloocoo, Lighter, and Trowel, were compared using six simulated sets of paired-end Illumina sequencing data. These NGS datasets varied in coverage depth (10× to 120×), read length (36 to 100 bp), and genome size (4.6 to 143 MB). Error Correction Evaluation Toolkit (ECET) was employed to derive a suite of metrics (i.e., true positives, false positive, false negative, recall, precision, gain, and F-score) for assessing the correction quality of each method. Results from computational experiments indicate that Musket had the best overall performance across the spectra of examined variants reflected in the six datasets. The lowest accuracy of Musket (F-score = 0.81) occurred to a dataset with a medium read length (56 bp), a medium coverage (50×), and a small-sized genome (5.4 MB). The other five methods underperformed (F-score < 0.80) and/or failed to process one or more datasets. This study demonstrates that various factors such as coverage depth, read length, and genome size may influence performance of individual k-spectrum-based error correction methods. Thus, efforts have to be paid in choosing appropriate methods for error correction of specific NGS datasets. Based on our comparative study, we recommend Musket as the top choice because of its consistently superior performance across all six testing datasets. Further extensive studies are warranted to assess these methods using experimental datasets generated by NGS platforms (e.g., 454, SOLiD, and Ion Torrent) under more diversified parameter settings (k-mer values and edit distances) and to compare them against other non-k-spectrum-based classes of error correction methods.

Journal ArticleDOI
TL;DR: In this article, the authors developed a new, comprehensive and efficient single assay strategy for complete molecular diagnosis of CADASIL mutations through the use of a custom next-generation sequencing (NGS) panel for improved routine clinical molecular diagnostic testing.
Abstract: Background Cerebral autosomal dominant arteriopathy with subcortical infarcts and leukoencephalopathy (CADASIL) is a monogenic, hereditary, small vessel disease of the brain causing stroke and vascular dementia in adults. CADASIL has previously been shown to be caused by varying mutations in the NOTCH3 gene. The disorder is often misdiagnosed due to its significant clinical heterogeneic manifestation with familial hemiplegic migraine and several ataxia disorders as well as the location of the currently identified causative mutations. The aim of this study was to develop a new, comprehensive and efficient single assay strategy for complete molecular diagnosis of NOTCH3 mutations through the use of a custom next-generation sequencing (NGS) panel for improved routine clinical molecular diagnostic testing. Results Our custom NGS panel identified nine genetic variants in NOTCH3 (p.D139V, p.C183R, p.R332C, p.Y465C, p.C597W, p.R607H, p.E813E, p.C977G and p.Y1106C). Six mutations were stereotypical CADASIL mutations leading to an odd number of cysteine residues in one of the 34 NOTCH3 gene epidermal growth factor (EGF)-like repeats, including three new typical cysteine mutations identified in exon 11 (p.C597W; c.1791C>G); exon 18 (p.C977G; c.2929T>G) and exon 20 (p.Y1106C; c.3317A>G). Interestingly, a novel missense mutation in the CACNA1A gene was also identified in one CADASIL patient. All variants identified (novel and known) were further investigated using in silico bioinformatic analyses and confirmed through Sanger sequencing. Conclusions NGS provides an improved and effective methodology for the diagnosis of CADASIL. The NGS approach reduced time and cost for comprehensive genetic diagnosis, placing genetic diagnostic testing within reach of more patients.

Journal ArticleDOI
TL;DR: A short guide based on previously documented cases of disease-associated CNVs is presented to provide a general view on the impact of repeated elements on the stability of the genomic sequence and consequently in the origin of the human pathogenic variome.
Abstract: Copy number variants (CNVs) are important contributors to the human pathogenic genetic diversity as demonstrated by a number of cases reported in the literature. The high homology between repetitive elements may guide genomic stability which will give rise to CNVs either by non-allelic homologous recombination (NAHR) or non-homologous end joining (NHEJ). Here, we present a short guide based on previously documented cases of disease-associated CNVs in order to provide a general view on the impact of repeated elements on the stability of the genomic sequence and consequently in the origin of the human pathogenic variome.

Journal ArticleDOI
TL;DR: The results indicated that the increase of snail protein expression may play an important role in the carcinogenesis, progression, and metastasis of gastric cancer.
Abstract: Background Snail is a typical transcription factor that could induce epithelial-mesenchymal transition (EMT) and cancer progression. There are some related reports about the clinical significance of snail protein expression in gastric cancer. However, the published results were not completely consistent. This study was aimed to investigate snail expression and clinical significance in gastric cancer.

Journal ArticleDOI
TL;DR: The rs5882 polymorphism (CETP) showed a significant association with CAD and therefore could be a promising marker for CAD risk estimation while the rs708272 polymorphism had a protective effect from CAD.
Abstract: Coronary artery disease (CAD) is a leading cause of morbidity and mortality worldwide. Many genetic and environmental risk factors including atherogenic dyslipidemia contribute towards the development of CAD. Functionally relevant mutations in the dyslipidemia-related genes and enzymes involved in the reverse cholesterol transport system are associated with CAD and contribute to increased susceptibility of myocardial infarction (MI). Blood samples from 990 angiographically confirmed Saudi CAD patients with at least one event of myocardial infarction were collected between 2012 and 2014. A total of 618 Saudi controls with no history or family history of CAD participated in the study. Four polymorphisms, rs2230806, rs2066715 (ABCA1), rs5882, and rs708272 (CETP), were genotyped using TaqMan Assay. CETP rs5882 (OR = 1.45, P < 0.005) and ABCA1 rs2230806 (OR = 1.42, P = 0.017) polymorphisms were associated with increased risk of CAD. However, rs708272 polymorphism showed protective effect (B1 vs. B2: OR = 0.80, P = 0.003 and B2B2 vs. B1B1: OR = 0.68, P = 0.012) while the ABCA1 variant rs2066715 was not associated. This study is the first to report the association of these polymorphisms with CAD in the population of the Eastern Province of Saudi Arabia. The rs5882 polymorphism (CETP) showed a significant association and therefore could be a promising marker for CAD risk estimation while the rs708272 polymorphism had a protective effect from CAD.

Journal ArticleDOI
TL;DR: Exome sequencing provided initial evidence linking KALRN to monogenic intellectual disability in man, and it is proposed that KAL RN is the causative gene for the autosomal recessive phenotype in this family.
Abstract: The recent availability of whole-exome sequencing has opened new possibilities for the evaluation of individuals with genetically undiagnosed intellectual disability. We report two affected siblings, offspring of first-cousin parents, with intellectual disability, hypotonia, short stature, growth hormone deficiency, and delayed bone age. All members of the nuclear family were genotyped, and exome sequencing was performed in one of the affected individuals. We used an in-house algorithm (CATCH v1.1) that combines homozygosity mapping with exome sequencing results and provides a list of candidate variants. One identified novel homozygous missense variant in KALRN (NM_003947.4:c.3644C>A: p.(Thr1215Lys)) was predicted to be pathogenic by all pathogenicity prediction software used (SIFT, PolyPhen, Mutation Taster). KALRN encodes the protein kalirin, which is a GTP-exchange factor protein with a reported role in cytoskeletal remodeling and dendritic spine formation in neurons. It is known that mice with ablation of Kalrn exhibit age-dependent functional deficits and behavioral phenotypes. Exome sequencing provided initial evidence linking KALRN to monogenic intellectual disability in man, and we propose that KALRN is the causative gene for the autosomal recessive phenotype in this family.

Journal ArticleDOI
TL;DR: Although the actions of EGCG involve multiple targets/pathways, further analysis by mining the existing genomic datasets revealed that the upregulations of Myb and Peg3 are likely the key anti-cancer events of E GCG in vivo.
Abstract: Green tea polyphenol epigallocatechin-3-gallate (EGCG) has been demonstrated to inhibit cancer in experimental studies through its antioxidant activity and modulations on cellular functions by binding specific proteins. By means of computational analysis and functional genomic approaches, we previously identified a set of protein coding genes and microRNAs whose expressions were significantly modulated in response to the EGCG treatment in tobacco carcinogen-induced lung adenocarcinoma in A/J mice. However, to what degree these genes are involved in the cancer inhibition of EGCG remains unclear. In this study, we further employed statistical methods and literature research to analyze these data in combination with The Cancer Genome Atlas (TCGA) lung adenocarcinoma datasets for additional data mining. Under the assumption that, if a gene mediates EGCG’s cancer inhibition, its expression level change caused by EGCG should be opposite to what occurred in the carcinogenesis, we identified Myb and Peg3 as the primary putative genes involved in the cancer inhibitory activity. Further analysis suggested that the regulation of Myb could be mediated through an EGCG-upregulated microRNA, miR-449c-5p. Although the actions of EGCG involve multiple targets/pathways, further analysis by mining the existing genomic datasets revealed that the upregulations of Myb and Peg3 are likely the key anti-cancer events of EGCG in vivo.

Journal ArticleDOI
TL;DR: This study performed a systematic analysis of atherosclerosis-related genes using text mining and found that the most highly enriched pathway is the Toll-like receptor signaling pathway.
Abstract: Atherosclerosis is one of the common health threats all over the world. It is a complex heritable disease that affects arterial blood vessels. Chronic inflammatory response plays an important role in atherogenesis. There has been little success in fully identifying functionally important genes in the pathogenesis of atherosclerosis. In the present study, we performed a systematic analysis of atherosclerosis-related genes using text mining. We identified a total of 1312 genes. Gene ontology (GO) analysis revealed that a total of 35 terms exhibited significance (p < 0.05) as overrepresented terms, indicating that atherosclerosis invokes many genes with a wide range of different functions. Pathway analysis demonstrated that the most highly enriched pathway is the Toll-like receptor signaling pathway. Finally, through gene network analysis, we prioritized 48 genes using the hub gene method. Our study provides a valuable resource for the in-depth understanding of the mechanism underlying atherosclerosis.

Journal ArticleDOI
TL;DR: A novel algorithm named MINE (identifying protein complexes based on Multi-relationship protein Interaction NEtwork) is designed to predict protein complexes with high cohesion and low coupling from MPIN.
Abstract: Background Protein complexes play an important role in biological processes. Recent developments in experiments have resulted in the publication of many high-quality, large-scale protein-protein interaction (PPI) datasets, which provide abundant data for computational approaches to the prediction of protein complexes. However, the precision of protein complex prediction still needs to be improved due to the incompletion and noise in PPI networks.

Journal ArticleDOI
TL;DR: The presence of SCN1A mutations and absence of mutations in ATP1A2 or CACNA1A suggest that the Polish patients represent FHM type 3, and the presence of KCNK18 mutation indicated another FHM subtype.
Abstract: Migraine is the most common neurological disorder, affecting approximately 12 % of the adult population worldwide, caused by both environmental and genetic factors. Three causative genes have been identified in familial hemiplegic migraine (FHM) families: CACNA1A, ATP1A2, and SCNA1A. Recently, several mutations in KCNK18 have also been found as causative factors in migraine development. The aim of our study was to identify the genetic background of migraine in the Polish population. Sixty patients with migraine without aura (MO) or with different types of migraine with aura (MA), including sporadic hemiplegic, familial hemiplegic, and probable familial hemiplegic, were screened for mutations in the four genes previously linked with different types of migraine (ATP1A2, CACNA1A, SCN1A, and KCNK18). Two missense mutations were found. One novel mutation in SCN1A, encoding α subunit of sodium channel, causing amino acid change M1500V localized to a region encoding inactivation loop between transmembrane domains III and IV of the channel, was detected in a female FHM patient. The M1500V mutation was absent in a group of 62 controls, as well as in the ExAC database. The second, already known missense mutation S231P in KCNK18 was found in a female MA patient. Additionally, a novel intronic polymorphism possibly affecting alternative splicing of SCN1A, at chr2:16685249, g.77659T>C, and c.4581+32A>G, located between exons 24 and 25, in a region encoding the inactivation loop of the sodium channel was found in a female MO patient. No mutations in ATP1A2 or CACNA1A were found in the study group. The presence of SCN1A mutations and absence of mutations in ATP1A2 or CACNA1A suggest that the Polish patients represent FHM type 3. On the other hand, the presence of KCNK18 mutation indicated another FHM subtype. It could be speculated that contrary to other European populations, the genetic basis of migraine in the Polish population involves mutations in genes not included in the study. Next-generation sequencing methods should be implemented to identify other migraine-associated variants.

Journal ArticleDOI
TL;DR: In this article, the authors adopted a next-generation sequencing approach in a celiac disease trio of Greek descent to identify all genomic variants with the potential of Celiac disease predisposition, which revealed six genomic variants of prime interest: SLC9A4 c.1919G>A, KIAA1109 c.2933T>C and c.4268_4269delCCinsTA, HoxB6 c.745_746delAAinsG.
Abstract: Celiac disease is a complex chronic immune-mediated disorder of the small intestine. Today, the pathobiology of the disease is unclear, perplexing differential diagnosis, patient stratification, and decision-making in the clinic. Herein, we adopted a next-generation sequencing approach in a celiac disease trio of Greek descent to identify all genomic variants with the potential of celiac disease predisposition. Analysis revealed six genomic variants of prime interest: SLC9A4 c.1919G>A, KIAA1109 c.2933T>C and c.4268_4269delCCinsTA, HoxB6 c.668C>A, HoxD12 c.418G>A, and NCK2 c.745_746delAAinsG, from which NCK2 c.745_746delAAinsG is novel. Data validation in pediatric celiac disease patients of Greek (n = 109) and Serbian (n = 73) descent and their healthy counterparts (n = 111 and n = 32, respectively) indicated that HoxD12 c.418G>A is more prevalent in celiac disease patients in the Serbian population (P A and KIAA1109 c.2933T>C and c.4268_4269delCCinsTA were more abundant in patients; nevertheless, they failed to show statistical significance. The next-generation sequencing-based family genomics approach described herein may serve as a paradigm towards the identification of novel functional variants with the aim of understanding complex disease pathobiology.