Predict Survival of Patients with Lung Cancer Using an Ensemble Feature Selection Algorithm and Classification Methods in Data Mining

Authors

  • Mahdis Dezfuly Department of Electrical and Computer Engineering, Kermanshah Branch, Islamic Azad University, Kermanshah, Iran
  • Hedieh Sajedi Department of Computer Science, School of Mathematics, Statistics, and Computer Science, College of Science, University of Tehran, Tehran, Iran

DOI:

https://doi.org/10.18488/journal.104/2015.1.1/104.1.1.11

Abstract

This research proposes an efficient model for predicting the survival rate of patients affected by lung cancer. The researchers collected data from four feature categories (population, recognition, treatment, and result) of cancer patients based on the importance of the survival of patients with lung cancer. Analyses of the predicted survival rates of the patients indicate that, among the classification algorithms, Decision Tree C5.0 results the highest accuracy. The models were created using algorithms based on the level of death risk in five stages: six months, nine months, one year, two years, and five years. In this paper, we proposed a mechanism for feature selection. Our mechanism combines the results of some feature section algorithm. The results illustrate that out mechanism outperform other feature selection algorithms. After applying the proposed mechanism for feature selection, the accuracy of the C5.0 algorithm was equivalent to 97.93%.

Keywords:

Lung cancer, Data mining, Decision tree algorithm, Bayes network, Neural network

Abstract Video

Published

2015-06-15

How to Cite

Dezfuly, M. ., & Sajedi, H. . (2015). Predict Survival of Patients with Lung Cancer Using an Ensemble Feature Selection Algorithm and Classification Methods in Data Mining. Journal of Information, 1(1), 1–11. https://doi.org/10.18488/journal.104/2015.1.1/104.1.1.11

Issue

Section

Articles