Exponentiated Weibull Models Applied to Medical Data in Presence of Right-censoring, Cure Fraction and Covariates
Abstract
Cure fraction models have been widely used to analyze survival data in which a proportion of the individuals isnot susceptible to the event of interest. This article considers frequentist and Bayesian methods to estimate the unknown model parameters of the exponentiated Weibull (EW) distribution considering right-censored survival data with a cure fraction and covariates. The EW distribution is as an extension to the Weibull distribution by considering an additional shape parameter to the model. We consider four types of cure fraction models: the mixture cure fraction (MCF), the nonmixture cure fraction (NMCF), the complementary promotion time cure (CPTC), and the cure rate proportional odds (CRPO) models. Bayesian inferences are obtained by using MCMC (Markov Chain Monte Carlo) methods. A simulation study was conducted to examine the performance of the maximum likelihood estimators for different sample sizes. Two real datasets were considered to illustrate the applicability of the proposed model. The EW distribution and its sub-models have the flexibility to accommodate different shapes for the hazard function and should be an attractive choice for survival data analysis when a cure fraction is present.References
J. A. Achcar, E. A. Coelho-Barros, and J. Mazucheli, Cure fraction models using mixture and non-mixture models, Tatra Mountains Mathematical Publications, vol. 51, no. 1, pp. 1–9, 2012.
J. Aitchison, and J. A. Brown, The lognormal distribution, Cambridge, Cambridge University Press, 1957.
H. Akaike, A new look at the statistical model identification, IEEE Transactions on Automatic Control, vol. 19, no. 6, pp. 716–723, 1974.
M. Alizadeh, M. N. Khan, M. Rasekhi, and G. G. Hamedani, A new generalized modified Weibull distribution, Statistics, Optimization & Information Computing, vol. 9, no. 1, pp. 17–34, 2021.
I. Arano, T. Sugimoto, T. Hamasaki, and Y. Ohno, Practical application of cure mixture model for long-term censored survivor data from a withdrawal clinical trial of patients with major depressive disorder, BMC Medical Research Methodology, vol. 10, no. 1, pp. 1–13, 2010.
L. Benkhelifa, The Weibull Birnbaum-Saunders distribution and its applications, Statistics, Optimization & Information Computing, vol. 9, no. 1, pp. 61–81, 2021.
J. W. Boag, Maximum likelihood estimates of the proportion of patients cured by cancer therapy, Journal of the Royal Statistical Society Series B, vol. 11, no. 1, pp. 15–53, 1949.
C. G. Broyden, The convergence of a class of double-rank minimization algorithms 1. General Considerations, IMA Journal of Applied Mathematics, vol. 6, no. 1, pp. 76–90, 1970.
J. M. Carrasco, E. M. Ortega, and G. M. Cordeiro, A generalized modified Weibull distribution for lifetime modeling, Computational Statistics & Data Analysis, vol. 53, no. 2, pp. 450–462, 2008.
J. E. Cavanaugh, and A. A. Neath, The Akaike information criterion: Background, derivation, properties, application, interpretation, and refinements, Wiley Interdisciplinary Reviews: Computational Statistics, vol. 11, no. 3, pp. e1460, 2019.
M. H. Chen, J. G. Ibrahim, and D. Sinha, A new Bayesian model for survival data with a surviving fraction, The Journal of the American Statistical Association, vol. 94, no. 447, pp. 909–919, 1999.
M. H. Chen, and Q. M. Shao, Monte Carlo estimation of Bayesian credible and HPD intervals, Journal of Computational and Graphical Statistics, vol. 8, no. 1, pp. 69–92, 1999.
S. Chib, and E. Greenberg, Understanding the Metropolis-Hastings algorithm, The American Statistician, vol. 49, no. 4, pp. 327– 335, 1995.
R. Christensen, W. Johnson, A. Branscum, T. E. Hanson, Bayesian ideas and data analysis: an introduction for scientists and statisticians, Boca Raton, CRC press, 2011.
J. Cohen, The Earth is round (p < .05), American Psychologist, vol. 49, no. 12, pp. 997–003, 1994.
G. M. Cordeiro, E. M., Ortega, G. O. and Silva, The Kumaraswamy modified Weibull distribution: theory and applications, Journal of Statistical Computation and Simulation, vol. 84, no. 7, pp. 1387–1411, 2014.
D. R. Cox, and E. J. Snell, A general definition of residuals, Journal of the Royal Statistical Society Series B, vol. 30, no. 2, pp. 248–275, 1968.
K. Davies, S. Pal, and J. A. Siddiqua, Stochastic EM algorithm for generalized exponential cure rate model and an empirical study, Journal of Applied Statistics, vol. 48, no. 12, p. 2112–2135, 2021.
S. V. Deo, V. Deo, and V. Sundaram, Survival analysis - part 2: Cox proportional hazards model, Indian Journal of Thoracic and Cardiovascular Surgery, vol. 37, pp. 229–233, 2021.
D. K. Dey, M. H. Chen, and H. Chang, Bayesian approach for nonlinear random effects models, Biometrics, vol. 53, no. 4, pp. 1239–1252, 1957.
V. T. Farewell, The use of mixture models for the analysis of survival data with long-term survivors, Biometrics, vol. 38, no. 4, pp. 1041–1046, 1982.
T. R. Fleming, and D. Y. Lin, Survival analysis in clinical trials: past developments and future directions, Biometrics, vol. 56, no. 4, pp. 971–983, 2000.
M. L. Garg, B. R. Rao, and C. K. Redmont, Maximum-likelihood estimation of the parameters of the Gompertz survival function, Journal of the Royal Statistical Society. Series C (Applied Statistics), vol. 19, no. 2, pp. 152–159, 1970.
S. Geisser, and W. F. Eddy, A predictive approach to model selection, Journal of the American Statistical Association, vol. 74, no. 365, pp. 153–160, 1979.
A. E. Gelfand, and D. K. Dey, Bayesian model choice: asymptotics and exact calculations, Journal of the Royal Statistical Society. Series B, vol. 56, no. 3, pp. 501C514, 1994.
J. Geweke, Evaluating the accuracy of sampling-based approaches to the calculation of posterior moments, Bayesian Statistics, vol. 4, pp. 641-649, 1992.
Y. Gu, D. Sinha, and S. Banerjee, Analysis of cure rate survival data under proportional odds model, Lifetime Data Analysis, vol. 17, no. 1, pp. 123–134, 2011.
A. K. Gupta, and S. Nadarajah, Handbook of beta distribution and its applications, Boca Raton, CRC press, 2004.
R. C. Gupta, R. D. Gupta, and P. L. Gupta, Modeling failure timedata by Lehman alternatives, Communication in Statistics - Theory and Methods, vol. 27, no. 4, pp. 887–904, 1998.
A. Henningsen, and O. Toomet, maxLik: A package for maximum likelihood estimation in R, Computational Statistics, vol. 26, no. 3, pp. 443–458, 2011.
H. A. Howlader, and A. M. Hossain, Bayesian survival estimation of Pareto distribution of the second kind based on failure-censored data, Computational Statistics & Data Analysis, vol. 38, no. 3, pp. 301–314, 2002.
J. G. Ibrahim, M. H. Chen, and D. Sinha, Bayesian Survival Analysis, Springer-Verlag, New York, 2001.
A. A. Jacome, D. R. Wohnrath, C. Scapulatempo-Neto, E. C. Carneseca, S. V. Serrano, L. S. Viana, J. S. Nunes, E. Z. Martinez, J. S. Santos, Prognostic value of epidermal growth factor receptors in gastric cancer: a survival analysis by Weibull model incorporating long-term survivors, Gastric Cancer, vol. 17, no. 1, pp. 76–86, 2014.
M. Kieser, T. Friede, and M. Gondan, Assessment of statistical significance and clinical relevance, Statistics in Medicine, vol. 32, no. 10, pp. 1707–1719, 2013.
D. H. Kim, W. D. Lee, and S. G. Kang, Bayesian survival estimation of Pareto distribution of the second kind based on type II censored data, Communications for Statistical Applications and Methods, vol. 12, no. 3, pp. 729–742, 2005.
B. Klefsjo, ¨ TTT-plotting - a tool for both theoretical and practical problems, Journal of Statistical Planning and Inference, vol. 29, no. 1–2, pp. 99–110, 1991.
J. P. Klein, and M. L. Moeschberger, Survival analysis: techniques for censored and truncated data, Springer, New York, 2003.
P. C. Lambert, Modeling of the cure fraction in survival studies, The Stata Journal, vol. 7, no. 3, pp. 351–375, 2007.
J. Li, Y. Tang, L. Huang, Q. Yu, G. Hu, and X. Yuan, Genetic variants in the p14ARF/ MDM2/ TP53 pathway are associated with the prognosis of esophageal squamous cell carcinoma patients treated with radical resection, PloS one, vol. 11, pp. e0158613, 2016.
C. S. Li, J. M. Taylor, and J. P. Sy, Identifiability of cure models, Statistics & Probability Letters, vol. 54, no. 4, pp. 389 395, 2001.
M. A. Looha, E. Zarean, F. Masaebi, M. A. Pourhoseingholi, and M. R. Zali, Assessment of prognostic factors in long- term survival of male and female patients with colorectal cancer using non-mixture cure model based on the Weibull distribution, Surgical Oncology, pp. 101562, 2021.
R. A. Maller, and X. Zhou, Survival analysis with long-term survivors, Wiley, New York, 1996.
A. D. Martin, K. M. Quinn, and J. H. Park, MCMCpack: Markov chain Monte Carlo in R, Journal of Statistical Software, vol. 42, no. 9, pp. 1–21, 2011.
E. Z. Martinez, and J. A. Achcar, A new straightforward defective distribution for survival analysis in the presence of a cure fraction, Journal of Statistical Theory and Practice, vol. 12, no. 4, pp. 688–703, 2018.
E. Z. Martinez, J. A. Achcar, A. A. Jacome, and J. S. Santos, ´ Mixture and non-mixture cure fraction models based on the generalized modified Weibull distribution with an application to gastric cancer data, Computer Methods and Programs in Biomedicine, vol. 112, no. 3, pp. 343–355, 2013.
M. Meshkat, A. R. Baghestani, F. Zayeri, M. Khayamzadeh, and M. E. Akbari, Survival probability and prognostic factors of Iranian breast cancer patients using cure rate model, The Breast Journal, vol. 24, no. 6, pp. 1015–1018, 2018.
G. S. Mudholkar, and A. D. Hutson, The exponentiated Weibull family: some properties and a flood data application,Communicationin Statistics - Theory and Methods, vol. 25, no. 12, pp. 3059–3083, 1996.
G. S. Mudholkar, and D. K. Srivastava, Exponentiated Weibull family for analyzing bathtub failure-rate data, IEEE Transactions on Reliability, vol. 42, no. 2, pp. 299–302, 1993.
G. S. Mudholkar, D. K. Srivastava, and M. Freimer, The exponentiated Weibull family: a reanalysis of the bus-motor failure data, Technometrics, vol. 37, no. 4, pp. 436–445, 1995.
M. M. Nassar, and F. H. Eissa, On the exponentiated Weibull distribution, Communications in Statistics - Theory and Methods, vol. 32, no. 7, pp. 1317–1336, 2003.
G. W. Oehlert, A note on the delta method, The American Statistician, vol. 46, no. 1, pp. 27–29, 1992.
R. P. Oliveira, M. V. Oliveira-Peres, M. R. Santos, E. Z. Martinez, and J. A. Achcar, A Bayesian inference approach for bivariate Weibull distributions derived from Roy and Morgenstern methods, Statistics, Optimization & Information Computing, vol. 9, no. 3, pp. 529–554, 2021.
M. E. Omer, M. A. Bakar, M. Adam, and M. Mustafa, Utilization of a mixture cure rate model based on the generalized modified Weibull distribution for the analysis of leukemia patients, Asian Pacific Journal of Cancer Prevention, vol. 22, no. 4, pp. 1045–1053, 2021.
S. Pasari, and O. Dikshit, O. Stochastic earthquake interevent time modeling from exponentiated Weibull distributions, Natural Hazards, vol. 90, no. 2, pp. 823–842, 2018.
Y. Peng, and J. M. Taylor, Residual-based model diagnosis methods for mixture cure models, Biometrics, vol. 73, no. 2, pp. 495–505, 2017.
Y. Peng, and B. Yu, Cure Models: Methods, Applications, and Implementation, CRC Press, New York, 2021.
J. E. Pinder III, J. G. Wiener, and M. H. Smith, The Weibull distribution: a new method of summarizing survivorship data, Ecology, vol. 59, no. 1, pp. 175–179, 1978
M. Plummer, N. Best, K. Cowles, and K. Vines, CODA: convergence diagnosis and output analysis for MCMC, R News, vol. 6, no. 1, pp. 7–11, 2006
A. Ramakrishnan, J. Zreloff, M. A. Moore, S. H. Bergquist, M. Cellai, J. Higdon, J. B. OKeefe, D. Roberts, and H. M. Wu, Prolonged symptoms after COVID-19 infection in outpatients, Open Forum Infectious Diseases, vol. 8, no. 3, pp. ofab060, 2021.
P. Ramos, D. Guzman, A. Mota, F. Rodrigues, and F. Louzada, Sampling with censored data: a practical guide, arXiv preprint, vol. arXiv:2011.08417, 2020.
P. L. Ramos, D. C. Nascimento, C. Cocolo, M. J. Nicola, C. Alonso, L. G. Ribeiro, A, Ennes, and F. Louzada, Reliability-centered maintenance: analyzing failure in harvest sugarcane machine using some generalizations of the Weibull distribution, Modelling and Simulation in Engineering, vol. 2018, pp. 1241856, 2018.
E. Ramos, P. L. Ramos, and F. Louzada, Posterior properties of the Weibull distribution for censored data, Statistics & Probability Letters, vol. 166, pp. 108873, 2020.
C. Ricci, S. Partelli, L. Landoni, M. Rinzivillo, C. Ingaldi, V. Andreasi, C. Nessi, F. Muffatti, M. Fontana, D. Tamburrino, G. Deiro, L. Alberici, D. Campana, F. Panzuto, C. Bassi, M. Falconi, and R. Casadei, Sporadic non-functioning pancreatic neuroendocrine tumours: multicentre analysis, British Journal of Surgery, vol. 108, no. 7, pp. 811C-816, 2021.
H. Rinne, The Weibull distribution: a handbook, CRC Press, New York, 2008.
R. Rocha, S. Nadarajah, V. Tomazella, and F. Louzada, A new class of defective models based on the MarshallCOlkin family of distributions for cure rate modeling, Computational Statistics & Data Analysis, vol. 107, pp. 48–63, 2017.
J. Scudilio, V. F. Calsavara, R. Rocha, F. Louzada, V. Tomazella, and A. S. Rodrigues, Defective models induced by gamma frailty term for survival data with cured fraction, Journal of Applied Statistics, vol. 46, no. 3, pp. 484–507, 2019.
W. Shah, T. Hillman, E. D. Playford, and L. Hishmeh, Managing the long term effects of covid-19: summary of NICE, SIGN, and RCGP rapid guideline, BMJ, vol. 372, pp. n136, 2021.
N. R. Smoll, K. Schaller, and O. P. Gautschi, The cure fraction of glioblastoma multiforme, Neuroepidemiology, vol. 39, no. 1, pp. 63–69, 2012.
G. O. Silva, E. M. Ortega, and G. M. Cordeiro, The beta modified Weibull distribution, Lifetime Data Analysis, vol. 16, no. 3, pp. 409–430, 2010.
R. Singh, and K. Mukhopadhyay, Survival analysis in clinical trials: basics and must know areas, Perspectives in Clinical Research, vol. 2, no. 4, pp. 145–148, 2011.
J. G. Surles, and W. J. Padgett, Inference for reliability and stress-strength for a scaled Burr type X distribution, Lifetime Data Analysis, vol. 7, no. 2, pp. 187–200, 2001.
M. Teimouri, S. M. Hoseini, S. Nadarajah, Comparison of estimation methods for the Weibull distribution, Statistics, vol. 47, no. 1, pp. 93–109, 2013.
D. R. Thoman, L. J. Bain, and C. E. Antle, Inferences on the parameters of the Weibull distribution, Technometrics, vol. 11, no. 3, pp. 445–460, 1969.
S. Thomas, D. Patel, B. Bittel, K. Wolski, Q. Wang, A. Kumar, Z. J. IlGiovine, R. Mehra, C. McWilliams, S. E. Nissen, M. Y.
Desai, Effect of high-dose zinc and ascorbic acid supplementation vs usual care on symptom length and reduction among ambulatory patients with SARS-CoV-2 infection: the COVID A to Z Randomized Clinical Trial, JAMA Network Open, vol. 4, no. 2, pp. e210369, 2021.
A. D. Tsodikov, J. G. Ibrahim, and A. Y. Yakovlev, Estimating cure rates from survival data: an alternative to two-component mixture models, Journal of the American Statistical Association, vol. 98, no. 464, pp. 1063–1078, 2003.
M. H. van Rijn, A. Bech, J. Bouyer, and J. A. van den Brand, Statistical significance versus clinical relevance, Nephrology Dialysis Transplantation, vol. 32, pp. ii6-ii12, 2017.
M. V. P. Vigas, M. B. Fatoretto, G. S. Slanzon, E. M. M. Ortega, C. G. B. Demetrio, and C. M. M. Bittar, Red propolis effect analysis of dairy calves health based on Weibull regression model with long-term survivors, Research in Veterinary Science, vol. 136, pp. 464–471, 2021.
A. Y. Yakovlev, and A. D. Tsodikov, Stochastic models of tumor latency and their biostatistical applications, World Scientific, New Jersey, 1996.
B. Yiqi, V. G. Cancho, D. K. Dey, N. Balakrishnan, and A. K. Suzuki, Power series cure rate model for spatially correlated intervalcensored data based on generalized extreme value distribution, Journal of Computational and Applied Mathematics, vol. 364, pp. 112362, 2020.
P. Zhai, Y. Ding, X. Wu, J. Long, Y. Zhong, and Y. Li, The epidemiology, diagnosis and treatment of COVID-19, International Journal of Antimicrobial Agents, vol. 55, no. 5, pp. 105955, 2020.
- Authors retain copyright and grant the journal right of first publication with the work simultaneously licensed under a Creative Commons Attribution License that allows others to share the work with an acknowledgement of the work's authorship and initial publication in this journal.
- Authors are able to enter into separate, additional contractual arrangements for the non-exclusive distribution of the journal's published version of the work (e.g., post it to an institutional repository or publish it in a book), with an acknowledgement of its initial publication in this journal.
- Authors are permitted and encouraged to post their work online (e.g., in institutional repositories or on their website) prior to and during the submission process, as it can lead to productive exchanges, as well as earlier and greater citation of published work (See The Effect of Open Access).