Predict Survival of Patients with Lung Cancer Using an Ensemble Feature Selection Algorithm and Classification Methods in Data Mining
DOI:
https://doi.org/10.18488/journal.104/2015.1.1/104.1.1.11Abstract
This research proposes an efficient model for predicting the survival rate of patients affected by lung cancer. The researchers collected data from four feature categories (population, recognition, treatment, and result) of cancer patients based on the importance of the survival of patients with lung cancer. Analyses of the predicted survival rates of the patients indicate that, among the classification algorithms, Decision Tree C5.0 results the highest accuracy. The models were created using algorithms based on the level of death risk in five stages: six months, nine months, one year, two years, and five years. In this paper, we proposed a mechanism for feature selection. Our mechanism combines the results of some feature section algorithm. The results illustrate that out mechanism outperform other feature selection algorithms. After applying the proposed mechanism for feature selection, the accuracy of the C5.0 algorithm was equivalent to 97.93%.