scispace - formally typeset
Open AccessJournal ArticleDOI

Identification of Diagnostic Markers for Breast Cancer Based on Differential Gene Expression and Pathway Network

Reads0
Chats0
TLDR
It is shown that the difference of gene expression level is important for the diagnosis of breast cancer, and 23 breast cancer diagnostic markers are identified, which provides valuable information for clinical diagnosis and basic treatment experiments.
Abstract
Background: Breast cancer is the second largest cancer in the world, the incidence of breast cancer continues to rise worldwide, and women’s health is seriously threatened. Therefore, it is very important to explore the characteristic changes of breast cancer from the gene level, including the screening of differentially expressed genes and the identification of diagnostic markers. Methods: The gene expression profiles of breast cancer were obtained from the TCGA database. The edgeR R software package was used to screen the differentially expressed genes between breast cancer patients and normal samples. The function and pathway enrichment analysis of these genes revealed significant enrichment of functions and pathways. Next, download these pathways from KEGG website, extract the gene interaction relations, construct the KEGG pathway gene interaction network. The potential diagnostic markers of breast cancer were obtained by combining the differentially expressed genes with the key genes in the network. Finally, these markers were used to construct the diagnostic prediction model of breast cancer, and the predictive ability of the model and the diagnostic ability of the markers were verified by internal and external data. Results: 1060 differentially expressed genes were identified between breast cancer patients and normal controls. Enrichment analysis revealed 28 significantly enriched pathways (p < 0.05). They were downloaded from KEGG website, and the gene interaction relations were extracted to construct the gene interaction network of KEGG pathway, which contained 1277 nodes and 7345 edges. The key nodes with a degree greater than 30 were extracted from the network, containing 154 genes. These 154 key genes shared 23 genes with differentially expressed genes, which serve as potential diagnostic markers for breast cancer. The 23 genes were used as features to construct the SVM classification model, and the model had good predictive ability in both the training dataset and the validation dataset (AUC = 0.960 and 0.907, respectively). Conclusion: This study showed that the difference of gene expression level is important for the diagnosis of breast cancer, and identified 23 breast cancer diagnostic markers, which provides valuable information for clinical diagnosis and basic treatment experiments.

read more

Citations
More filters
Journal ArticleDOI

Analysis and modeling of myopia-related factors based on questionnaire survey

TL;DR: Wang et al. as discussed by the authors investigated the relationship between four main factors (environment, habits, parental vision, and demographic) and myopia status by analyzing the questionnaire data, and found that the 4 most influential features with XGBoost could achieve a competitive AUC of 0.764.
Journal ArticleDOI

iEnhancer-MRBF: Identifying enhancers and their strength with a multiple Laplacian-regularized radial basis function network.

TL;DR: Li et al. as mentioned in this paper proposed a two-layer model called iEnhancer-MRBF, wherein the first layer is used to identify enhancers, and the identified enhancers are divided into strong enhancers and weak enhancers according to their strength in the second layer.
Journal ArticleDOI

Identification of Novel Diagnostic and Prognostic Gene Signature Biomarkers for Breast Cancer Using Artificial Intelligence and Machine Learning Assisted Transcriptomics Analysis

TL;DR: In this article , the authors applied machine learning (ML) methods to identify the valuable gene signature model based on differentially expressed genes (DEGs) for BC diagnosis and prognosis.
Journal ArticleDOI

A review of multi-omics data integration through deep learning approaches for disease diagnosis, prognosis, and treatment

TL;DR: In this paper , the authors systematically evaluate the recent trends in multi-omics data analysis based on deep learning techniques and their application in disease prediction, highlighting the current challenges in the field and discuss how advances in deep learning methods and their optimization for application is vital in overcoming them.
References
More filters
Journal ArticleDOI

Cytoscape: A Software Environment for Integrated Models of Biomolecular Interaction Networks

TL;DR: Several case studies of Cytoscape plug-ins are surveyed, including a search for interaction pathways correlating with changes in gene expression, a study of protein complexes involved in cellular recovery to DNA damage, inference of a combined physical/functional interaction network for Halobacterium, and an interface to detailed stochastic/kinetic gene regulatory models.
Journal ArticleDOI

Systematic and integrative analysis of large gene lists using DAVID bioinformatics resources.

TL;DR: By following this protocol, investigators are able to gain an in-depth understanding of the biological themes in lists of genes that are enriched in genome-scale studies.
Journal ArticleDOI

edgeR: a Bioconductor package for differential expression analysis of digital gene expression data.

TL;DR: EdgeR as mentioned in this paper is a Bioconductor software package for examining differential expression of replicated count data, which uses an overdispersed Poisson model to account for both biological and technical variability and empirical Bayes methods are used to moderate the degree of overdispersion across transcripts, improving the reliability of inference.
Journal ArticleDOI

Bioinformatics enrichment tools: paths toward the comprehensive functional analysis of large gene lists

TL;DR: The survey will help tool designers/developers and experienced end users understand the underlying algorithms and pertinent details of particular tool categories/tools, enabling them to make the best choices for their particular research interests.
Journal ArticleDOI

The cancer genome atlas pan-cancer analysis project

John N. Weinstein, +379 more
- 01 Oct 2013 - 
TL;DR: The Pan-Cancer initiative compares the first 12 tumor types profiled by TCGA with a major opportunity to develop an integrated picture of commonalities, differences and emergent themes across tumor lineages.
Related Papers (5)