XGBoost-Based vs. AFT Model Imputation: Addressing Interval Censoring in Time-to-Event Data [comunicação oral]
Data
2025-09-20
Embargo
Orientador
Coorientador
Título da revista
ISSN da revista
Título do volume
Editora
Société Française d Thermique
Idioma
Inglês
Título Alternativo
Resumo
Interval-censored data pose significant challenges in survival analysis, as the exact timing of events is unknown and only known to fall within observed intervals. This study explores imputation-based strategies for regression modeling under interval censoring, including traditional midpoint and Accelerated Failure Time (AFT) model imputations, as well as a machine learningbased approach using XGBoost. We further introduce the Scaled Linear Redistribution Method, a novel rescaling mechanism that adjusts model-based imputations to respect censoring intervals while preserving their relative variability. Using real clinical data, we illustrate how these methods influence the estimation of survival curves. Since true event times are not observed, direct evaluation of the accuracy of imputed times is not possible. Instead, we assess the resulting survival estimates by comparing them with the Turnbull estimator, a nonparametric method that fully accounts for interval censoring without requiring imputation. The analysis demonstrates that midpoint, AFT, and XGBoost-based imputations yield survival curves that are broadly consistent with the Turnbull curve in this dataset.
Palavras-chave
XGBoost-Based, AFT Model Imputation
Tipo de Documento
Apresentação em Conferência
Versão da Editora
Citação
Soutinho, G., & Meira-Machado, L. (2025). XGBoost-Based vs. AFT Model Imputation: Addressing Interval Censoring in Time-to-Event Data [comunicação oral]. 23rd International Conference of Numerical Analysis and Applied Mathematics (ICNAAM 2025), Crete, Greece, 16-20 September 2025. Repositório Institucional UPT. https://hdl.handle.net/11328/6946
Identificadores
TID
Designação
Tipo de Acesso
Acesso Aberto