Identification of Diagnostic Markers for Breast Cancer Based on Differential Gene Expression and Pathway Network
Reads0
Chats0
TLDR
It is shown that the difference of gene expression level is important for the diagnosis of breast cancer, and 23 breast cancer diagnostic markers are identified, which provides valuable information for clinical diagnosis and basic treatment experiments.Abstract:
Background: Breast cancer is the second largest cancer in the world, the incidence of breast cancer continues to rise worldwide, and women’s health is seriously threatened. Therefore, it is very important to explore the characteristic changes of breast cancer from the gene level, including the screening of differentially expressed genes and the identification of diagnostic markers. Methods: The gene expression profiles of breast cancer were obtained from the TCGA database. The edgeR R software package was used to screen the differentially expressed genes between breast cancer patients and normal samples. The function and pathway enrichment analysis of these genes revealed significant enrichment of functions and pathways. Next, download these pathways from KEGG website, extract the gene interaction relations, construct the KEGG pathway gene interaction network. The potential diagnostic markers of breast cancer were obtained by combining the differentially expressed genes with the key genes in the network. Finally, these markers were used to construct the diagnostic prediction model of breast cancer, and the predictive ability of the model and the diagnostic ability of the markers were verified by internal and external data. Results: 1060 differentially expressed genes were identified between breast cancer patients and normal controls. Enrichment analysis revealed 28 significantly enriched pathways (p < 0.05). They were downloaded from KEGG website, and the gene interaction relations were extracted to construct the gene interaction network of KEGG pathway, which contained 1277 nodes and 7345 edges. The key nodes with a degree greater than 30 were extracted from the network, containing 154 genes. These 154 key genes shared 23 genes with differentially expressed genes, which serve as potential diagnostic markers for breast cancer. The 23 genes were used as features to construct the SVM classification model, and the model had good predictive ability in both the training dataset and the validation dataset (AUC = 0.960 and 0.907, respectively). Conclusion: This study showed that the difference of gene expression level is important for the diagnosis of breast cancer, and identified 23 breast cancer diagnostic markers, which provides valuable information for clinical diagnosis and basic treatment experiments.read more
Citations
More filters
Journal ArticleDOI
Analysis and modeling of myopia-related factors based on questionnaire survey
Jianqiang Xiao,Mujiexin Liu,Qinlai Huang,Zijie Sun,Lin Ning,Junguo Duan,Siquan Q. Zhu,Jian-Zhong Huang,Hao Lin,Hui Yang +9 more
TL;DR: Wang et al. as discussed by the authors investigated the relationship between four main factors (environment, habits, parental vision, and demographic) and myopia status by analyzing the questionnaire data, and found that the 4 most influential features with XGBoost could achieve a competitive AUC of 0.764.
Journal ArticleDOI
iEnhancer-MRBF: Identifying enhancers and their strength with a multiple Laplacian-regularized radial basis function network.
TL;DR: Li et al. as mentioned in this paper proposed a two-layer model called iEnhancer-MRBF, wherein the first layer is used to identify enhancers, and the identified enhancers are divided into strong enhancers and weak enhancers according to their strength in the second layer.
Journal ArticleDOI
Identification of Novel Diagnostic and Prognostic Gene Signature Biomarkers for Breast Cancer Using Artificial Intelligence and Machine Learning Assisted Transcriptomics Analysis
Zeenat Mirza,Md.Shahid Ansari,Nesar Ahmad,Nofe Alganmi,Haneen Banjar,Mohammed H. Al-Qahtani,Sajjad Karim +6 more
TL;DR: In this article , the authors applied machine learning (ML) methods to identify the valuable gene signature model based on differentially expressed genes (DEGs) for BC diagnosis and prognosis.
Journal ArticleDOI
A review of multi-omics data integration through deep learning approaches for disease diagnosis, prognosis, and treatment
TL;DR: In this paper , the authors systematically evaluate the recent trends in multi-omics data analysis based on deep learning techniques and their application in disease prediction, highlighting the current challenges in the field and discuss how advances in deep learning methods and their optimization for application is vital in overcoming them.
References
More filters
Journal ArticleDOI
Cytoscape: A Software Environment for Integrated Models of Biomolecular Interaction Networks
Paul Shannon,Andrew Markiel,Owen Ozier,Nitin S. Baliga,Jonathan T. Wang,Daniel Ramage,Nada Amin,Benno Schwikowski,Trey Ideker +8 more
TL;DR: Several case studies of Cytoscape plug-ins are surveyed, including a search for interaction pathways correlating with changes in gene expression, a study of protein complexes involved in cellular recovery to DNA damage, inference of a combined physical/functional interaction network for Halobacterium, and an interface to detailed stochastic/kinetic gene regulatory models.
Journal ArticleDOI
Systematic and integrative analysis of large gene lists using DAVID bioinformatics resources.
TL;DR: By following this protocol, investigators are able to gain an in-depth understanding of the biological themes in lists of genes that are enriched in genome-scale studies.
Journal ArticleDOI
edgeR: a Bioconductor package for differential expression analysis of digital gene expression data.
TL;DR: EdgeR as mentioned in this paper is a Bioconductor software package for examining differential expression of replicated count data, which uses an overdispersed Poisson model to account for both biological and technical variability and empirical Bayes methods are used to moderate the degree of overdispersion across transcripts, improving the reliability of inference.
Journal ArticleDOI
Bioinformatics enrichment tools: paths toward the comprehensive functional analysis of large gene lists
TL;DR: The survey will help tool designers/developers and experienced end users understand the underlying algorithms and pertinent details of particular tool categories/tools, enabling them to make the best choices for their particular research interests.
Journal ArticleDOI
The cancer genome atlas pan-cancer analysis project
John N. Weinstein,John N. Weinstein,Eric A. Collisson,Gordon B. Mills,Kenna R. Mills Shaw,Kenna R. Mills Shaw,Brad Ozenberger,Kyle Ellrott,Kyle Ellrott,Chris Sander,Joshua M. Stuart,Joshua M. Stuart,Kyle Chang,Chad J. Creighton,Caleb F. Davis,Lawrence A. Donehower,Jennifer Drummond,David A. Wheeler,Adrian Ally,Miruna Balasundaram,Inanc Birol,Inanc Birol,Inanc Birol,Yaron S.N. Butterfield,Andy Chu,Eric Chuah,Hye Jung E. Chun,Noreen Dhalla,Ranabir Guin,Martin Hirst,Carrie Hirst,Robert A. Holt,Steven J.M. Jones,Darlene Lee,Haiyan I. Li,Marco A. Marra,Michael Mayo,Richard A. Moore,Andrew J. Mungall,A. Gordon Robertson,Jacqueline E. Schein,Payal Sipahimalani,Angela Tam,Nina Thiessen,Richard Varhol,Rameen Beroukhim,Ami S. Bhatt,Angela N. Brooks,Andrew D. Cherniack,Samuel S. Freeman,Stacey Gabriel,Elena Helman,Joonil Jung,Matthew Meyerson,Akinyemi I. Ojesina,Chandra Sekhar Pedamallu,Gordon Saksena,Steven E. Schumacher,Barbara Tabak,Travis I. Zack,Travis I. Zack,Eric S. Lander,Christopher A. Bristow,Angela Hadjipanayis,Psalm Haseley,Raju Kucherlapati,Semin Lee,Eunjung Lee,Lovelace J. Luquette,Harshad S. Mahadeshwar,Angeliki Pantazi,Michael Parfenov,Michael Parfenov,Peter J. Park,Alexei Protopopov,Xiaojia Ren,Netty Santoso,Jonathan G. Seidman,Sahil Seth,Xingzhi Song,Jiabin Tang,Ruibin Xi,Ruibin Xi,Ruibin Xi,Andrew Wei Xu,Lixing Yang,Dong Zeng,J. Todd Auman,Saianand Balu,Elizabeth Buda,Cheng Fan,Katherine A. Hoadley,Corbin D. Jones,Shaowu Meng,Piotr A. Mieczkowski,Joel S. Parker,Charles M. Perou,Jeffrey Roach,Yan Shi,Grace O. Silva,Donghui Tan,Umadevi Veluvolu,Scot Waring,Matthew D. Wilkerson,Junyuan Wu,Wei Zhao,Tom Bodenheimer,D. Neil Hayes,D. Neil Hayes,Alan P. Hoyle,Stuart R. Jeffreys,Lisle E. Mose,Janae V. Simons,Mathew G. Soloway,Stephen B. Baylin,Benjamin P. Berman,Moiz S. Bootwalla,Ludmila Danilova,James G. Herman,Toshinori Hinoue,Peter W. Laird,Suhn K. Rhie,Hui Shen,Timothy J. Triche,Daniel J. Weisenberger,Scott L. Carter,Kristian Cibulskis,Lynda Chin,Jianhua Zhang,Carrie Sougnez,Min Wang,Gad Getz,Gad Getz,Huyen Dinh,Harshavardhan Doddapaneni,Richard A. Gibbs,Preethi Gunaratne,Preethi Gunaratne,Yi Han,Divya Kalra,Christie Kovar,Lora Lewis,Margaret B. Morgan,Donna Morton,Donna Muzny,Jeffrey G. Reid,Liu Xi,Juok Cho,Daniel DiCara,Scott Frazer,Nils Gehlenborg,David I. Heiman,Jaegil Kim,Michael S. Lawrence,Pei Lin,Yingchun Liu,Michael S. Noble,Petar Stojanov,Doug Voet,Hailei Zhang,Lihua Zou,Chip Stewart,Brady Bernard,Ryan Bressler,Andrea Eakin,Lisa Iype,Theo A. Knijnenburg,Roger Kramer,Richard Kreisberg,Kalle Leinonen,Jake Lin,Yuexin Liu,Michael Miller,Sheila M. Reynolds,Hector Rovira,Ilya Shmulevich,Vesteinn Thorsson,Da Yang,Wei Zhang,Samirkumar B. Amin,Chang-Jiun Wu,Chia Chin Wu,Rehan Akbani,Kenneth Aldape,Keith A. Baggerly,Bradley McIntosh Broom,Tod D. Casasent,James Cleland,Deepti Dodda,Mary Elizabeth Edgerton,Leng Han,Shelley M. Herbrich,Zhenlin Ju,Hoon Kim,Hoon Kim,Seth Lerner,Jun Li,Han Liang,Wenbin Liu,Philip L. Lorenzi,Yiling Lu,James M. Melott,Lam Nguyen,Lam Nguyen,Xiaoping Su,Roeland Verhaak,Wenyi Wang,Andrew J. Wong,Andrew J. Wong,Yang Yang,Jun Yao,Rong Yao,Kosuke Yoshihara,Yuan Yuan,Yuan Yuan,W. K. Alfred Yung,Nianxiang Zhang,Siyuan Zheng,Michael B. Ryan,Michael B. Ryan,David W. Kane,David W. Kane,B. Arman Aksoy,Giovanni Ciriello,Gideon Dresdner,Jianjiong Gao,Benjamin Gross,Anders Jacobsen,André Kahles,Marc Ladanyi,William Lee,Kjong-Van Lehmann,Martin L. Miller,Ricardo Ramirez,Gunnar Rätsch,Boris Reva,Nikolaus Schultz,Yasin Senbabaoglu,Ronglai Shen,Rileen Sinha,S. Onur Sumer,Yichao Sun,Barry S. Taylor,Barry S. Taylor,Barry S. Taylor,Nils Weinhold,Suzanne S. Fei,Paul T. Spellman,Christopher C. Benz,Christopher C. Benz,Daniel E. Carlin,Daniel E. Carlin,Melisssa Cline,Melisssa Cline,Brian Craft,Brian Craft,Mary Goldman,David Haussler,David Haussler,David Haussler,Singer Ma,Singer Ma,Sam Ng,Sam Ng,Evan O. Paull,Evan O. Paull,Amie Radenbaugh,Amie Radenbaugh,Sofie R. Salama,Sofie R. Salama,Sofie R. Salama,Artem Sokolov,Artem Sokolov,Teresa Swatloski,Teresa Swatloski,Vladislav Uzunangelov,Vladislav Uzunangelov,Peter Waltman,Peter Waltman,Christina Yau,Jing Zhu,Jing Zhu,Stanley R. Hamilton,Scott Abbott,Rachel Abbott,Nathan D. Dees,Kim D. Delehaunty,Li Ding,David J. Dooling,James M. Eldred,Catrina Fronick,Robert S. Fulton,Lucinda Fulton,Joelle Kalicki-Veizer,Krishna L. Kanchi,Cyriac Kandoth,Daniel C. Koboldt,David E. Larson,Timothy J. Ley,Ling Lin,Charles Lu,Vincent Magrini,Elaine R. Mardis,Michael D. McLellan,Joshua F. McMichael,Christopher A. Miller,Michelle O'Laughlin,Craig Pohl,Heather Schmidt,Scott M. Smith,Jason Walker,John W. Wallis,Michael C. Wendl,Michael C. Wendl,Richard K. Wilson,Todd Wylie,Qunyuan Zhang,Robert A. Burton,Mark A. Jensen,Ari B. Kahn,Todd Pihl,David A. Pot,Yunhu Wan,Douglas A. Levine,Aaron D. Black,Jay Bowen,Jessica Frick,Julie M. Gastier-Foster,Julie M. Gastier-Foster,Hollie A. Harper,Carmen Helsel,Kristen M. Leraas,Tara M. Lichtenberg,Cynthia McAllister,Nilsa C. Ramirez,Nilsa C. Ramirez,Samantha Sharpe,Lisa Wise,Erik Zmuda,Stephen J. Chanock,Tanja Davidsen,John A. Demchok,Greg Eley,Ina Felau,Margi Sheth,Heidi J. Sofia,Louis M. Staudt,Roy Tarnuzzer,Zhining Wang,Liming Yang,Jiashan Zhang,Larsson Omberg,Adam Margolin,Benjamin J. Raphael,Fabio Vandin,Hsin-Ta Wu,Mark D.M. Leiserson,Stephen C. Benz,Charles J. Vaske,Houtan Noushmehr,Houtan Noushmehr,Denise M. Wolf,Laura van 't Veer,Dimitris Anastassiou,Tai Hsien Ou Yang,Nuria Lopez-Bigas,Abel Gonzalez-Perez,David Tamborero,Zheng Xia,Wei Li,Dong Yeon Cho,Teresa M. Przytycka,Mark P. Hamilton,Sean E. McGuire,Sven Nelander,Sven Nelander,Patrik Johansson,Rebecka Jörnsten,Rebecka Jörnsten,Teresia Kling +379 more
TL;DR: The Pan-Cancer initiative compares the first 12 tumor types profiled by TCGA with a major opportunity to develop an integrated picture of commonalities, differences and emergent themes across tumor lineages.
Related Papers (5)
Trimming of mammalian transcriptional networks using network component analysis
Differential gene expression analysis in glioblastoma cells and normal human brain cells based on GEO database.
Anping Wang,Guibin Zhang +1 more