Abstract

Survival trees are a common machine learning tool designed to handle censored data, where only partial information about failure events is available. Most survival tree models work by recursively dividing the feature space using splits defined by single attributes in internal nodes. However, there is a less common type known as oblique survival trees, which use more attributes to create splits in the form of hyperplanes. In this paper, we depart from the typical top-down approach and focus on globally induced oblique survival trees, aiming to optimize both prediction accuracy and model complexity. We propose using two different loss functions— the integrated Brier score and a likelihood-based loss— in the process of oblique survival tree induction. We then compare the resulting models in terms of their predictive performance and complexity.

Recommended Citation

Kretowska, M. & Kretowski, M. (2024). The Influence of Loss Function on Oblique Survival Tree Induction. In B. Marcinkowski, A. Przybylek, A. Jarzębowicz, N. Iivari, E. Insfran, M. Lang, H. Linger, & C. Schneider (Eds.), Harnessing Opportunities: Reshaping ISD in the post-COVID-19 and Generative AI Era (ISD2024 Proceedings). Gdańsk, Poland: University of Gdańsk. ISBN: 978-83-972632-0-8. https://doi.org/10.62036/ISD.2024.111

Paper Type

Full Paper

DOI

10.62036/ISD.2024.111

Share

COinS
 

The Influence of Loss Function on Oblique Survival Tree Induction

Survival trees are a common machine learning tool designed to handle censored data, where only partial information about failure events is available. Most survival tree models work by recursively dividing the feature space using splits defined by single attributes in internal nodes. However, there is a less common type known as oblique survival trees, which use more attributes to create splits in the form of hyperplanes. In this paper, we depart from the typical top-down approach and focus on globally induced oblique survival trees, aiming to optimize both prediction accuracy and model complexity. We propose using two different loss functions— the integrated Brier score and a likelihood-based loss— in the process of oblique survival tree induction. We then compare the resulting models in terms of their predictive performance and complexity.