Combining Random Forest and Neural Networks Algorithms to Diagnose Heart Disease

Sima Dehnavi, Madjid Emamipour, Amin Golabpour

Abstract

Introduction: Heart disease is one of the leading causes of death in today's society. No definitive method for predicting it has been found so far, and several factors contribute to its development. The aim of this study was therefore to provide a data mining model for predicting heart disease.

Material and Methods: This study used the standard heart disease data from the UCI repository, which comprise four databases: Cleveland, Hungarian, Switzerland and Long Beach VA. The data include 13 independent variables and one dependent variable. Because the data contain missing values, the EM algorithm was used to handle them. Finally, a proposed algorithm that combines the random forest and artificial neural network algorithms was applied to the data.
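The abstract does not state how the random forest and the neural network are combined. As a minimal sketch of one common scheme (soft voting, an assumption here and not necessarily the authors' method), the disease probabilities produced by the two models can be blended and thresholded. The function name `soft_vote`, the `weight` and `threshold` parameters, and the probability values are all hypothetical.

```python
def soft_vote(p_forest, p_network, weight=0.5, threshold=0.5):
    """Blend two models' predicted disease probabilities (soft voting).

    p_forest, p_network: per-patient probabilities from each model.
    weight: share given to the random forest (assumed, not from the paper).
    threshold: blended probability at or above which disease is predicted.
    """
    if len(p_forest) != len(p_network):
        raise ValueError("both models must score the same patients")
    blended = [weight * a + (1.0 - weight) * b
               for a, b in zip(p_forest, p_network)]
    labels = [1 if p >= threshold else 0 for p in blended]
    return blended, labels


# Hypothetical probabilities for four patients (illustrative values only).
p_rf = [0.9, 0.2, 0.6, 0.4]   # random forest output
p_nn = [0.7, 0.1, 0.3, 0.8]   # neural network output

blended, labels = soft_vote(p_rf, p_nn)
print(labels)
```

With equal weights, the two models simply average their opinions; other combination schemes, such as feeding one model's output into the other, are equally consistent with the description above.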

Results: The data were divided into training and test sets using the 10-fold method. Three indicators, sensitivity, specificity and accuracy, were used to evaluate the algorithms. The accuracy of the proposed algorithm on the four data sets (Cleveland, Hungarian, Switzerland and Long Beach VA) reached 87.65%, 94.37%, 93.45% and 85%, respectively. The proposed algorithm was then compared with similar articles in this field and was found to be more accurate than similar methods.
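The evaluation described above can be sketched as follows, assuming binary labels (1 = disease, 0 = no disease). The helper names `k_fold_indices` and `evaluate` and the example labels are hypothetical and only illustrate how a 10-fold split and the three indicators are usually computed; they do not reproduce the reported figures.

```python
def k_fold_indices(n_samples, k=10):
    """Yield (train_idx, test_idx) pairs; each sample is tested exactly once."""
    indices = list(range(n_samples))
    for fold in range(k):
        test_idx = indices[fold::k]          # every k-th sample, offset by fold
        held_out = set(test_idx)
        train_idx = [i for i in indices if i not in held_out]
        yield train_idx, test_idx


def evaluate(y_true, y_pred):
    """Compute sensitivity, specificity and accuracy for binary labels."""
    pairs = list(zip(y_true, y_pred))
    tp = sum(1 for t, p in pairs if t == 1 and p == 1)
    tn = sum(1 for t, p in pairs if t == 0 and p == 0)
    fp = sum(1 for t, p in pairs if t == 0 and p == 1)
    fn = sum(1 for t, p in pairs if t == 1 and p == 0)
    return {
        # share of diseased patients correctly detected
        "sensitivity": tp / (tp + fn) if (tp + fn) else float("nan"),
        # share of healthy patients correctly cleared
        "specificity": tn / (tn + fp) if (tn + fp) else float("nan"),
        # share of all patients classified correctly
        "accuracy": (tp + tn) / len(pairs) if pairs else float("nan"),
    }


# Hypothetical labels for eight patients (illustrative values only).
y_true = [1, 1, 1, 0, 0, 0, 0, 1]
y_pred = [1, 1, 0, 0, 0, 1, 0, 1]
scores = evaluate(y_true, y_pred)

folds = list(k_fold_indices(25, k=10))
```

In a full experiment, a model would be trained on each `train_idx` and scored on the matching `test_idx`, and the indicators would be averaged over the ten folds.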

Conclusion: The results of this study showed that combining the random forest and artificial neural network algorithms can provide a suitable model for predicting heart attacks.


Keywords

Heart Disease; Random Forest; Artificial Neural Network


DOI: https://doi.org/10.30699/fhi.v9i1.214
