Introduction: One of the challenges facing medical science is the time and correct diagnosis of diseases. Particularly with regard to certain diseases such as the types of cancer, which are the leading causes of death worldwide, their early diagnosis has a significant impact on the control and treatment of this disease. The use of intelligent decision support systems with high precision can be a good way to reduce human error due to fatigue and lack of experience. Therefore, the present study tries to predict the disease by using data mining techniques and taking into account the variables that influence the prediction of laryngeal cancer.
Material and methods: This study is an analytical study. The data from the 249 cases referred to Shafa Hospital in Kerman in 2017 have been obtained. This study is based on the Crisp methodology and in the MATLAB software environment. First, in order to understand the laryngeal cancer, a review of related studies was conducted and interviewed by specialist physicians. Then, according to expert opinion, 24 variables were identified as effective factors in predicting laryngeal cancer. After clearing and preparing data, an artificial neural network model was used to predict the risk of laryngeal cancer. In the following, another model of the combination of the genetic algorithm and the neural network was created. Using genetic algorithm, 9 functional features of prediction of laryngeal cancer were determined from among the 24 selected variables, and artificial neural network was used to predict the risk of laryngeal cancer. Finally, the criteria for accuracy, specificity, and sensitivity were used to evaluate the two models.
Results: The genetic algorithm reduced the complexity of the model by reducing the number of features from 24 to 9, but improved the average precision from 80% to 84%. Also, the model made with the characteristics selected by the genetic algorithm, increased the specificity and accuracy criteria by 13% and 8%, respectively.Conclusion: Combining the genetic algorithm with the neural network, in addition to improving the accuracy of prediction of laryngeal cancer, accelerates the diagnosis process, especially at the data collection stage, by reducing the number of characteristics required. Therefore, using this model as a smart decision system is suggested.
Kuo R-J, Huang M-H, Cheng W-C, Lin C-C, Wu Y-H. Application of a two-stage fuzzy neural network to a prostate cancer prognosis system. Artif Intell Med. 2015; 63(2): 119-33.
Bhardwaj A, Tiwari A. Breast cancer diagnosis using genetically optimized neural network model. Expert Systems with Applications. 2015; 42(10): 4611-20.
Bhuvaneswari P, Therese AB. Detection of cancer in lung with k-nn classification using genetic algorithm. Procedia Materials Science. 2015; 10: 433-40.
Weng C-H, Huang TC-K, Han R-P. Disease prediction with different types of neural network classifiers. Telematics and Informatics. 2016; 33(2): 277-92.
Ada, Kaur R. A study of detection of lung cancer using data mining classification techniques. International Journal of Advanced Research in Computer Science and Software Engineering. 2013; 3(3): 131-4.
Schumacher M, Roßner R, Vach W. Neural networks and logistic regression: Part I. Computational Statistics & Data Analysis. 1996; 21(6): 661-82.
Wang J, Tsapakis I, Zhong C. A space–time delay neural network model for travel time prediction. Engineering Applications of Artificial Intelligence. 2016; 52: 145-60.
Schiavo ML, Prinari B, Gronski JA, Serio AV. An artificial neural network approach for modeling the ward atmosphere in a medical unit. Mathematics and Computers in Simulation. 2015; 116: 44-58.
Riedmiller M. Advanced supervised learning in multi-layer perceptrons-from backpropagation to adaptive learning algorithms. Computer Standards and Interfaces. 1994; 16(3): 265-78.
Bhattacharyya S, Bhattacharjee S, Mondal NK. A quantum backpropagation multilayer perceptron (QBMLP) for predicting iron adsorption capacity of calcareous soil from aqueous solution. Applied Soft Computing. 2015; 27: 299-312.
Goldberg DE. Genetic algorithms in search, optimization and machine learning. Reading, Addison-Wesley Longman Publishing Co.; Boston, USA. 1989.
Coley DA. An introduction to genetic algorithms for scientists and engineers. World Scientific Publishing Company; Singapore. 1999.
Van Stralen KJ, Stel VS, Reitsma JB, Dekker FW, Zoccali C, Jager KJ. Diagnostic methods I: sensitivity, specificity, and other measures of accuracy. Kidney Int. 2009; 75(12): 1257-63.
Guazzo M. Retrieval performance and information theory. Information Processing & Management. 1977; 13(3): 155-65.
Levenberg K. A method for the solution of certain non-linear problems in least squares. Quart Appl Math. 1944; 2(2): 164-8.
Marquardt DW. An algorithm for least-squares estimation of nonlinear parameters. Journal of the Society for Industrial and Applied Mathematics. 1963; 11(2): 431-41.
Cong Y, Wang S, Fan B, Yang Y, Yu H. UDSFS: Unsupervised deep sparse feature selection. Neurocomputing. 2016; 196: 150-8.
Zhang X, Mei C, Chen D, Li J. Feature selection in mixed data. Journal of Pattern Recognition. 2016; 56: 1-15.