Imputation strategies for interval-censored data: From AFT models to machine learning and scaled redistribution

Soutinho, Gustavo; Meira-Machado, Luís

dc.contributor.author	Soutinho, Gustavo
dc.contributor.author	Meira-Machado, Luís
dc.date.accessioned	2026-03-11T16:02:09Z
dc.date.available	2026-03-11T16:02:09Z
dc.date.issued	2026-03-06
dc.description.abstract	Interval-censored data pose challenges in survival analysis because event times are only known to occur within observation intervals. Traditional strategies, such as midpoint imputation, often fail to capture the uncertainty inherent to this censoring. This study compares classical, model-based, and machine learning approaches for imputing interval-censored event times. Specifically, we evaluate (i) standard midpoint imputation, (ii) accelerated failure time (AFT) model-based imputation, (iii) a machine learning method using XGBoost, and (iv) a new scaled linear redistribution method that constrains model-based imputations within censoring bounds while preserving their relative variability. A comprehensive simulation study under varying levels of right censoring was carried out to assess bias, accuracy, and concordance. Three real datasets were then analyzed to illustrate the practical behavior of the imputation methods. The XGBoost-based imputation performs stably across the censoring scenarios considered, yielding survival estimates close to those of the nonparametric Turnbull estimator. The midpoint method performs adequately when intervals are short or censoring is mild, whereas parametric models are more sensitive to distributional assumptions and may yield biased estimates under heavy censoring. Analyses of real data further revealed greater variability among parametric models under high right censoring and a flattening of survival curves when censoring occurs mainly at long event times. The proposed scaled linear redistribution method provides a way to map model-based predictions back to their observed censoring intervals while retaining their relative dispersion. The methods considered display complementary strengths across censoring regimes, with no single approach uniformly dominating.
dc.identifier.citation	Soutinho, G., & Meira-Machado, L. (2026). Imputation strategies for interval-censored data: From AFT models to machine learning and scaled redistribution. AIMS Mathematics, 11(3), 5719-5737. https://doi.org/10.3934/math.2026235. Repositório Institucional UPT. https://hdl.handle.net/11328/7000
dc.identifier.issn	2473-6988
dc.identifier.uri	https://hdl.handle.net/11328/7000
dc.language.iso	eng
dc.publisher	AIMS Press
dc.relation.hasversion	https://doi.org/10.3934/math.2026235
dc.rights	open access
dc.rights.uri	http://creativecommons.org/licenses/by/4.0/
dc.subject	interval-censored data
dc.subject	machine learning
dc.subject	XGBoost
dc.subject	imputation methods
dc.subject.fos	Natural Sciences - Mathematics
dc.subject.ods	09 - industry, innovation and infrastructure
dc.title	Imputation strategies for interval-censored data: From AFT models to machine learning and scaled redistribution
dc.type	journal article
dcterms.references	https://www.aimspress.com/article/doi/10.3934/math.2026235
dspace.entity.type	Publication
oaire.citation.endPage	5737
oaire.citation.issue	3
oaire.citation.startPage	5719
oaire.citation.title	AIMS Mathematics
oaire.citation.volume	11
oaire.version	http://purl.org/coar/version/c_970fb48d4fbd8a85
person.affiliation.name	DCT - Departamento de Ciência e Tecnologia
person.familyName	Soutinho
person.givenName	Gustavo
person.identifier.ciencia-id	0918-604C-2C04
person.identifier.orcid	0000-0002-0559-1327
person.identifier.rid	GSE-1063-2022
person.identifier.scopus-author-id	57195326662
relation.isAuthorOfPublication	6b00013b-9493-4621-b710-79beb48b65a4
relation.isAuthorOfPublication.latestForDiscovery	6b00013b-9493-4621-b710-79beb48b65a4

Files

Main

Name: 10.3934_math.2026235.pdf
Size: 308.97 KB
Format: Adobe Portable Document Format

Collections

REMIT - Artigos em Revistas Internacionais / Papers in International Journals