Weighted Least Support Vector Machine for Survival Analysis
Keywords:
Survival Weighted Least Squares Support Vector Machine (Surv-WLSSVM), Kaplan-Meier estimator, Particle Swarm Optimization (PSO), c-index, prognostic index
Abstract
Background: The increasing complexity and volume of data across various disciplines have encouraged the use of machine learning methods, including in survival analysis. Given the large percentage of censored data in survival datasets, a methodological technique that can generate more precise survival probability forecasts is required. This study aims to advance survival analysis by applying the Weighted Least Squares Support Vector Machine, using a weighting approach to manage the information imbalance between censored observations and event occurrences. This strategy can yield a prognostic index that is easily categorized into low-risk and high-risk groups.Methods: This study proposes the Survival Weighted Least Squares Support Vector Machine (Surv-WLSSVM) modelthrough the integration of a weighting strategy based on the Kaplan–Meier estimator. Data with events are assigned weights that consider the value of the survival function, while censored data are given constant weights. Surv-WLSSVM was applied to both simulated and real datasets, and the results were compared with the unweighted method, namely Survival Least Squares Support Vector Machine (Surv-LSSVM). The simulation scenarios included the complexity of variable numbers, data distribution, sample size, and censoring percentage. The Real datasets used in this study consist of Breastfeeding, PBC, and Bone-Marrow data. A tuning parameters using Particle Swarm Optimization (PSO) was performed to enhance the performance of both Surv-LSSVM and Surv-WLSSVM models. Model performance was evaluated using the concordance index (c-index), where a higher c-index indicates a better model.Results: In every simulated data setting, the Surv-WLSSVM model continuously showed better performance. Similarly,on real datasets, this model outperformed the alternative and produced more diverse prognostic indices, facilitating the categorization of individuals into low-risk and high-risk groups.Conclusion: The Surv-WLSSVM represents a significant advancement in SVM-based survival modelling. This approachdemonstrates greater reliability and adaptability in handling the complexity of modern survival data.
Published
2025-12-06
How to Cite
Rahmawati, R., Thamrin, S. A., Lawi, A., & Kusuma, J. (2025). Weighted Least Support Vector Machine for Survival Analysis. Statistics, Optimization & Information Computing. https://doi.org/10.19139/soic-2310-5070-3016
Issue
Section
Research Articles
Authors who publish with this journal agree to the following terms:
- Authors retain copyright and grant the journal right of first publication with the work simultaneously licensed under a Creative Commons Attribution License that allows others to share the work with an acknowledgement of the work's authorship and initial publication in this journal.
- Authors are able to enter into separate, additional contractual arrangements for the non-exclusive distribution of the journal's published version of the work (e.g., post it to an institutional repository or publish it in a book), with an acknowledgement of its initial publication in this journal.
- Authors are permitted and encouraged to post their work online (e.g., in institutional repositories or on their website) prior to and during the submission process, as it can lead to productive exchanges, as well as earlier and greater citation of published work (See The Effect of Open Access).