XGBoost-Based vs. AFT Model Imputation: Addressing Interval Censoring in Time-to-Event Data [comunicação oral]
Date
2025-09-20
Embargo
Advisor
Coadvisor
Journal Title
Journal ISSN
Volume Title
Publisher
Société Française d Thermique
Language
English
Alternative Title
Abstract
Interval-censored data pose significant challenges in survival analysis, as the exact timing of events is unknown and only known to fall within observed intervals. This study explores imputation-based strategies for regression modeling under interval censoring, including traditional midpoint and Accelerated Failure Time (AFT) model imputations, as well as a machine learningbased approach using XGBoost. We further introduce the Scaled Linear Redistribution Method, a novel rescaling mechanism that adjusts model-based imputations to respect censoring intervals while preserving their relative variability. Using real clinical data, we illustrate how these methods influence the estimation of survival curves. Since true event times are not observed, direct evaluation of the accuracy of imputed times is not possible. Instead, we assess the resulting survival estimates by comparing them with the Turnbull estimator, a nonparametric method that fully accounts for interval censoring without requiring imputation. The analysis demonstrates that midpoint, AFT, and XGBoost-based imputations yield survival curves that are broadly consistent with the Turnbull curve in this dataset.
Keywords
XGBoost-Based, AFT Model Imputation
Document Type
Conference presentation
Publisher Version
Citation
Soutinho, G., & Meira-Machado, L. (2025). XGBoost-Based vs. AFT Model Imputation: Addressing Interval Censoring in Time-to-Event Data [comunicação oral]. 23rd International Conference of Numerical Analysis and Applied Mathematics (ICNAAM 2025), Crete, Greece, 16-20 September 2025. Repositório Institucional UPT. https://hdl.handle.net/11328/6946
Identifiers
TID
Designation
Access Type
Open Access