PACIS 2010 Proceedings

Cost-Sensitive Learning for Recurrence Prediction of Breast Cancer

Tsang-Hsiang Cheng, Southern Taiwan UniversityFollow
Ci-Wei Lan, IBM Research CollaboratoryFollow
Chih-Ping Wei, National Taiwan UniversityFollow
Henry Chang, IBM T.J. Watson Research CenterFollow

Abstract

Breast cancer is one of the top cancer-death causes and specifically accounts for 10.4% of all cancer incidences among women. The prediction of breast cancer recurrence has been a challenging research problem for many researchers. Data mining techniques have recently received considerable attention, especially when used for the construction of prognosis models from survival data. However, existing data mining techniques may not be effective to handle censored data. Censored instances are often discarded when applying classification techniques to prognosis. In this paper, we propose a cost-sensitive learning approach to involve the censored data in prognostic assessment with better recurrence prediction capability. The proposed approach employs an outcome inference mechanism to infer the possible probabilistic outcome of each censored instance and adopt the cost-proportionate rejection sampling and a committee machine strategy to take into account these instances with probabilistic outcomes during the classification model learning process. We empirically evaluate the effectiveness of our proposed approach for breast cancer recurrence prediction and include a censored-data-discarding method (i.e., building the recurrence prediction model by only using uncensored data) and the Kaplan-Meier method (a common prognosis method) as performance benchmarks. Overall, our evaluation results suggest that the proposed approach outperforms its benchmark techniques, measured by precision, recall and F1 score.

Recommended Citation

Cheng, Tsang-Hsiang; Lan, Ci-Wei; Wei, Chih-Ping; and Chang, Henry, "Cost-Sensitive Learning for Recurrence Prediction of Breast Cancer" (2010). PACIS 2010 Proceedings. 118.
https://aisel.aisnet.org/pacis2010/118

Download

COinS

PACIS 2010 Proceedings

Cost-Sensitive Learning for Recurrence Prediction of Breast Cancer

Abstract

Recommended Citation

Search

Links

Browse

Author Corner

PACIS 2010 Proceedings

Cost-Sensitive Learning for Recurrence Prediction of Breast Cancer

Authors

Abstract

Recommended Citation

Share

Search

Links

Browse

Author Corner