scispace - formally typeset
Search or ask a question
Author

Kazim Topuz

Bio: Kazim Topuz is an academic researcher from University of Tulsa. The author has contributed to research in topics: Bayesian network & Computer science. The author has an hindex of 6, co-authored 12 publications receiving 245 citations. Previous affiliations of Kazim Topuz include Wichita State University & Oklahoma State University–Stillwater.

Papers
More filters
Journal ArticleDOI
01 Feb 2018
TL;DR: This study offers a novel methodological solution to this prediction problem by analyzing the retrospective database including > 31,000 U.S. patients and introducing a comprehensive feature selection framework that accounts for medical literature, data analytics methods and elastic net (EN) regression.
Abstract: Predicting the graft survival for kidney transplantation is a high stakes undertaking considering the shortage of available organs and the utilization of healthcare resources. The strength of any predictive model depends on the selection of proper predictors. However, despite improvements in acute rejection management and short-term graft survival, the accurate prediction of kidney transplant outcomes remains suboptimal. Among other approaches, machine-learning techniques have the potential to offer solutions to this prediction problem in kidney transplantation. This study offers a novel methodological solution to this prediction problem by: (a) analyzing the retrospective database including > 31,000 U.S. patients; (b) introducing a comprehensive feature selection framework that accounts for medical literature, data analytics methods and elastic net (EN) regression (c) using sensitivity analyses and information fusion to evaluate and combine features from several machine learning approaches (i.e., support vector machines (SVM), artificial neural networks (ANN), and Bootstrap Forest (BF)); (d) constructing several different scenarios by merging different sets of features that are optioned through these fused data mining models and statistical models in addition to expert knowledge; and (e) using best performing sets in Bayesian belief network (BBN) algorithm to identify non-linear relationships and the interactions between explanatory factors and risk levels for kidney graft survival. The results showed that the predictor set obtained through fused data mining model and literature review outperformed the all other alternative predictors sets with the scores of 0.602, 0.684, 0.495 for F-Measure, Average Accuracy, and G-Mean, respectively. Overall, our findings provide novel insights about risk prediction that could potentially help in improving the outcome of kidney transplants. This methodology can also be applied to other similar transplant data sets.

82 citations

Journal ArticleDOI
TL;DR: In this article, the authors used a large and feature-rich crash dataset along with a number of predictive analytics algorithms to model the complex relationships between varying levels of injury severity and the crash related risk factors.
Abstract: Investigation of the risk factors that contribute to the injury severity in motor vehicle crashes has proved to be a thought-provoking and challenging problem. The results of such investigation can help better understand and potentially mitigate the severe injury risks involved in automobile crashes and thereby advance the well-being of people involved in these traffic accidents. Many factors were found to have an impact on the severity of injury sustained by occupants in the event of an automobile accident. In this analytics study we used a large and feature-rich crash dataset along with a number of predictive analytics algorithms to model the complex relationships between varying levels of injury severity and the crash related risk factors. Applying a systematic series of information fusion-based sensitivity analysis on the trained predictive models we identified the relative importance of the crash related risk factors. The results provided invaluable insights for the use of predictive analytics in this domain and exposed the relative importance of crash related risk factors with the changing levels of injury severity.

81 citations

Journal ArticleDOI
01 Jun 2016
TL;DR: The results show that the proposed BBN method provides similar predictive performance to the best approaches in the literature and provides novel information on the interactions among the predictors and the conditional probability of survival for a given set of relevant donor-recipient characteristics.
Abstract: Recent research has shown that data mining models can accurately predict the outcome of a heart transplant based on predictors that include patient and donor's health/demographics. These models have not been adopted in practice, however, since they did not: a) consider the interactions between the explanatory variables; b) provide a patient's specific risk of survival (reported results have been primarily deterministic); and c) offer an automated decision tool that can provide some data-driven insights to practitioners. In this study, we attempt to overcome these three limitations through the use of Bayesian Belief Networks (BBN). The proposed BBN framework is comprised of four phases. In the first two phases, the data is preprocessed, and a candidate set of predictors is generated based on employing several variable selection methods. The third phase involves the addition of medically relevant variables to the list. In phase four, the BBN model is applied. The results show that the proposed BBN method provides similar predictive performance to the best approaches in the literature. More importantly, our method provides novel information on the interactions among the predictors and the conditional probability of survival for a given set of relevant donor-recipient characteristics. We offer U.S. practitioners a decision support tool that presents an individualized survival score based on our BBN model (and the UNOS dataset). A model that provides competitive predictors for 9-year heart transplantation outcomes.The BBN model presents data-driven insights into how the predictors interact.The result of the methodology is a novel individualized risk score for heart transplant outcomes.A decision support tool is provided to assist practitioners in characterizing a heart transplant's risk.

61 citations

Journal ArticleDOI
TL;DR: It is shown that this EN-based BBN framework is a comparable prediction methodology regarding the best approaches found in the literature and provides novel information on the interrelations of predictors and the conditional probability of predicting “no-shows.”
Abstract: No-shows are becoming a major problem in primary care facilities, creating additional costs for the facility while adversely affecting the quality of patient care. Accurately predicting no-shows plays an important role in the overbooking strategy. In this study, a hybrid probabilistic prediction framework based on the elastic net (EN) variable-selection methodology integrated with probabilistic Bayesian Belief Network (BBN) is proposed. The study predicts the “no-show probability of the patient(s)” using demographics, socioeconomic status, current appointment information, and appointment attendance history of the patient and the family. The proposed framework is validated using ten years of local pediatric clinic data. It is shown that this EN-based BBN framework is a comparable prediction methodology regarding the best approaches found in the literature. More importantly, this methodology provides novel information on the interrelations of predictors and the conditional probability of predicting “no-shows.” The output of the model can be applied to the appointment scheduling system for a robust overbooking strategy.

48 citations

Journal ArticleDOI
TL;DR: The results show that the imbalanced dataset produces similar predictive results in detecting the at-risk students and the feature selection results in simpler, more understandable, interpretable, and actionable results without compromising on the accuracy of the prediction task.

36 citations


Cited by
More filters
01 Jan 2020
TL;DR: Prolonged viral shedding provides the rationale for a strategy of isolation of infected patients and optimal antiviral interventions in the future.
Abstract: Summary Background Since December, 2019, Wuhan, China, has experienced an outbreak of coronavirus disease 2019 (COVID-19), caused by the severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2). Epidemiological and clinical characteristics of patients with COVID-19 have been reported but risk factors for mortality and a detailed clinical course of illness, including viral shedding, have not been well described. Methods In this retrospective, multicentre cohort study, we included all adult inpatients (≥18 years old) with laboratory-confirmed COVID-19 from Jinyintan Hospital and Wuhan Pulmonary Hospital (Wuhan, China) who had been discharged or had died by Jan 31, 2020. Demographic, clinical, treatment, and laboratory data, including serial samples for viral RNA detection, were extracted from electronic medical records and compared between survivors and non-survivors. We used univariable and multivariable logistic regression methods to explore the risk factors associated with in-hospital death. Findings 191 patients (135 from Jinyintan Hospital and 56 from Wuhan Pulmonary Hospital) were included in this study, of whom 137 were discharged and 54 died in hospital. 91 (48%) patients had a comorbidity, with hypertension being the most common (58 [30%] patients), followed by diabetes (36 [19%] patients) and coronary heart disease (15 [8%] patients). Multivariable regression showed increasing odds of in-hospital death associated with older age (odds ratio 1·10, 95% CI 1·03–1·17, per year increase; p=0·0043), higher Sequential Organ Failure Assessment (SOFA) score (5·65, 2·61–12·23; p Interpretation The potential risk factors of older age, high SOFA score, and d-dimer greater than 1 μg/mL could help clinicians to identify patients with poor prognosis at an early stage. Prolonged viral shedding provides the rationale for a strategy of isolation of infected patients and optimal antiviral interventions in the future. Funding Chinese Academy of Medical Sciences Innovation Fund for Medical Sciences; National Science Grant for Distinguished Young Scholars; National Key Research and Development Program of China; The Beijing Science and Technology Project; and Major Projects of National Science and Technology on New Drug Creation and Development.

4,408 citations

Journal ArticleDOI
TL;DR: A mapping review of the literature concerning the ethics of artificial intelligence (AI) in health care finds that ethical issues can be epistemic, normative or traceability-related and at the relevant level of abstraction.

170 citations

01 Jan 2012
TL;DR: In this paper, the authors combine data mining and statistical regression methods to identify the main contributing factors associated with the levels of pedestrian injury severity outcomes and provide recommendations for policy makers, traffic engineers, and law enforcement to reduce the severity of pedestrian-vehicle collisions.
Abstract: Understanding the underlying relationship between pedestrian injury severity outcomes and factors leading to more severe injuries is very important in dealing with the problem of pedestrian safety. To investigate injury severity outcomes, many previous works relied on statistical regression models. There has also been some interest for data mining techniques, in particular for clustering techniques which segment the data into more homogeneous subsets. This research combines these two approaches (data mining and statistical regression methods) to identify the main contributing factors associated with the levels of pedestrian injury severity outcomes. This work relies on the analysis of two unique pedestrian injury severity datasets from the City of New York, US (2002-2006) and the City of Montreal, Canada (2003-2006). General injury severity models were estimated for the whole datasets and for sub-populations obtained through clustering analysis. This paper shows how the segmentation of the accident datasets help to better understand the complex relationship between the injury severity outcomes and the contributing geometric, built environment and socio-demographic factors. While using the same methodology for the two datasets, different techniques were tested. For instance, for New York, latent class with ordered probit method provides the best results. However, for Montreal, the K-means with multinomial logit model is identified as the most appropriate technique. The results show the power of using clustering with regression to provide a complementary and more detailed analysis. Among other results, it was found that pedestrian age, location at intersection, actions prior to accident, driver age, vehicle type, vehicle movement, driver alcohol involvement and lighting conditions have an influence on the likelihood of a fatal crash. Moreover, several features within the built environment are shown to have an effect. Finally, the research provides recommendations for policy makers, traffic engineers, and law enforcement to reduce the severity of pedestrian-vehicle collisions.

161 citations

Journal ArticleDOI
TL;DR: The results show that the LASSO model performed well for both physical fatigue detection and modeling, and is not participant and/or workload regime specific and thus can be adopted for other applications.

157 citations