MAXIMUM LIKELIHOOD ESTIMATION OF CABG PATIENTS BY PARAMETRIC MODELS BASED ON INCOMPLETE AND COMPLETE POPULATION

SALEEM M.1*, ZAFAR MAHMUD2*, KHAN K.H.3*
1Centre for Advanced Studies in Pure and Applied Mathematics Bahauddin Zakariya University Multan, Pakistan.
2Department of Statistics Bahauddin Zakariya University Multan, Pakistan.
3Centre for Advanced Studies in Pure and Applied Mathematics Bahauddin Zakariya University Multan, Pakistan.
* Corresponding Author : drkhizar@gmail.com

Received : 10-02-2012     Accepted : 22-03-2012     Published : 09-04-2012
Volume : 3     Issue : 1       Pages : 69 - 73
J Stat Math 3.1 (2012):69-73

Cite - MLA : SALEEM M., et al "MAXIMUM LIKELIHOOD ESTIMATION OF CABG PATIENTS BY PARAMETRIC MODELS BASED ON INCOMPLETE AND COMPLETE POPULATION." Journal of Statistics and Mathematics 3.1 (2012):69-73.

Cite - APA : SALEEM M., ZAFAR MAHMUD, KHAN K.H. (2012). MAXIMUM LIKELIHOOD ESTIMATION OF CABG PATIENTS BY PARAMETRIC MODELS BASED ON INCOMPLETE AND COMPLETE POPULATION. Journal of Statistics and Mathematics, 3 (1), 69-73.

Cite - Chicago : SALEEM M., ZAFAR MAHMUD, and KHAN K.H. "MAXIMUM LIKELIHOOD ESTIMATION OF CABG PATIENTS BY PARAMETRIC MODELS BASED ON INCOMPLETE AND COMPLETE POPULATION." Journal of Statistics and Mathematics 3, no. 1 (2012):69-73.

Copyright : © 2012, SALEEM M., et al, Published by Bioinfo Publications. This is an open-access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution and reproduction in any medium, provided the original author and source are credited.

Abstract

In this paper, a new approach is used for estimation of survival proportions of Coronary Artery Bypass Graft Surgery (CABG) patients by complete population, from its incomplete population. The availability of a complete population may lead to better estimates of the survivor’s proportions. Maximum likelihood method, in-conjunction with Davidon-Fletcher-Powell optimization method and Cubic Interpolation method is used in estimation of survivor’s proportions of some parametric distributions.

Keywords

CABG Patients, Complete and Incomplete populations, Parametric models (Weibull, Exponential and etc), Maximum likelihood method, Davidon-Fletcher-Powell optimization method and Survivor’s proportions.

Introduction

The coronary Artery Disease (CAD) is a chronic disease, which progresses with age at different rates. CAD is a result of built-up of fats on the inner walls of the coronary arteries. Thus, the size of coronary arteries become narrow and the blood flow to the heart muscles is reduced / blocked. Therefore, the heart muscles do not receive required oxygenated blood, which leads to the heart attack. CAD is a leading cause of death (see William, Stephen, Van -Thomas and Robert [37] , John [18] , Hansson [13] , Axel, Yiwen, Dalit, Veena, Elaine, Catia, Matthew, Jonathan, Edward & Len [3] and Sun and Hoong [33] ). The medical scientists; William, Ellis, Josef, Ralph and Robert [37] , Heymann [15] , Goldstein, Adams, Alberts, Appel, Brass, Bushnell, Culebras, DeGraba, Gorelick & Guyton [11] and Jennifer [17] are of the opinion that CABG is an effective treatment for CAD patients. The medical research organizations like Heart and stroke foundation Canada [14] , American Heart Association [2] and Virtual Health Care Team Columbia [35] pointed out that hypertension; diabetes and smoking are main contributing factors for CAD’s survivors. Xing, Xiao and Marcello [38] estimated survival after AIDS diagnosis from surveillance data. Collett [7] modelled survival data in medical research The importance of parametric models for analysis of lifetime date has been mentioned by Mann, Schefer and Singpurwala [30] , Nelson [31] , Cyrus [9] , Lawless [26] , Klein & Moeschberger [23] and Sridhar and Mun Choon Chan [32] . The exponential distribution model has been used by Lee, Kim and Jung [28] in medical research for survival data of patients.The Weibull distribution model has been used for survival analysis by Cohen [6] , Gross and Clark [12] , Bunday [5] , Crow [8] , Klein & Moeschberger [23] , Lawrencce [27] , Abrenthy [1] , Hisada & Arizino [16] , Lawless [26] , David & Mitchel [10] and Lang [24] . In particular, the survival study of chronic diseases, such as AIDS and Cancer, has been carried out by Bain and Englehardt [4] , Khan & Mahmud [20 & 21] , Klein & Moeschberger [23] , Lawless [26] and Swaminathan and Brenner [34] using exponential and Weibull distributions. Lanju & William [25] used Weibull distribution to human survival data of patients with plasma cell and in response-adaptive randomization for survival trials respectively. Lee, Kim and Jung [28] used exponential in medical research for survival data of the patients.
In this study parametric models (Exponential, Weibull, Log-logistic, Gompertz and Logistic distributions) are used to estimate survival proportions for CABG patients. William, Ellis, Josef, Ralph and Robert [37] in 1995 carried out the survival study on incomplete population (progressive censoring of type 1) of CABG patients comprising 2011 patients using Kaplan Meier method [19] . This study also comprises the same data set of 2011 patients. The patients were grouped as male, female, age (non modifiable factors) and hypertension & diabetes (modifiable factors). The patients were undergone through a first re-operation at Emory University hospitals from 1975 to 1993. In this paper, 12 years survival data of same CABG patients, age group under 50 years is considered. Khan, Saleem and Mahmud [22] proposed a procedure, to make an incomplete population (IP) a complete population (CP). In this paper, the survivor proportions are obtained for the CABG patients of parametric models using Maximum likelihood method, in- conjunction with Davidon-Fletcher-Powell optimization method and Cubic Interpolation method. A subroutine for maximizing log-likelihood function of each model is developed in FORTRAN program to obtain the estimates of the parameters of the model. The survival proportions of CP and IP for age group under 50 years of CABG patients by the models are presented and their differences are observed.
The method proposed by Kaplan Meier [21] and latter discussed by William [36] in 1995 and Lawless [26] is:
, where and are the number of items (individuals / patients) failed (died individuals) and number of individuals at risk at time , that is, the number of individuals survived and uncensored at time . This method does not take into account the censored individuals completely and thus the analysis is performed on incomplete population As Khan, Saleem and Mahmud [22] proposed that the censored individuals could be taken into account. The inclusion of splitted censored individuals, proportionally into known survived, and died individual’s respectively makes the population complete. Thus the survival analysis may be performed on the complete population . Klein & Moeschberger [23] and Lawless [26] proposed likelihood function for a survival model, in the presence of censored data. The maximum likelihood method works by developing a likelihood function based on the available data and finding the estimates of parameters of a probability distribution that maximizes the likelihood function. This may be achieved by using iterative method: see Bunday & Al-Mutwali [5] and Khan & Mahmud [20] . The likelihood function for all observed died and censored individuals is of the form:
, where and are the number of died & censored individuals in interval i each of length t, is pdf in a parametric model with survivor function, & hazard function, and is vector of parameters say of the model . To obtain maximum likelihood estimates of parameters of a parametric model using DFP optimization method, we take negative log on both the sides of above equation and therefore by setting , we get:

or



(AS ) Where, the first sum is for failure and the second sum is for all censored individuals.
Setting , where represents total no of individuals at time we get:



In this study time is partitioned into intervals, which are of unit length t starting from zero. Moreover, failures and censoring of the patients occur in each interval i of equal length of time t,



For complete population the term for censored observations is dropped from the likelihood function.

Application

We present detail application of above methodology for one of the parametric models considered (Weibull distribution). Same procedure is followed for Exponential, Log-logistic, Gompertz and Logistic models. The probability density function (pdf) of Weibull distribution is: where is vector of parameters a and b ; a is a scale parameter and b is a shape parameter; a ,b and t > 0.
The survival and hazard functions of Weibull distribution are:



For incomplete population replacing values of the survival and hazard functions of Weibull distribution in equation (1), we get

or



where, is the total number of failures in a given time.
Differentiating (2) with respect to and simplifying we get





By using (2), (3) and (4) in the DFP optimization method, we find the parameters estimates for which value of the likelihood function is maximum. For complete population we drop the term for censored observations from likelihood function. Same procedure is followed for Exponential, Log-logistic, Gompertz and Logistic models. FORTRAN program for the parameters estimation of each model is developed. The optimal estimates of the scale and shape / location parameters ( respectively) of exponential, Weibull, log-logistic, gompertz and logistic distributions distribution using CP and IP of age group under 50 years are obtained by maximizing the log-likelihood function t-ratios of the parameters are given in parenthesis. The values of parameters estimates, t-tarios, log-likelihood function and variance-covariance matrix are given in table see [Table-1] , [Table-2] and [Table-3] .
The estimated value of is greater than 1 (using Weibull distribution) which indicates increasing failure rate with the increase in age. The t-ratios in the parenthesis, indicates that estimates of the parameters are significant at 5% level of significance. The positive or negative values of co-variances indicates that the movements of are in the same or opposite direction respectively.
Survival Proportions by the parametric models (Exponential, Weibull, Log-logistic, Gompertz and Logistic distributions) of CP and IP respectively of age group less than 50 years (Male and Female CABG Patients) and respective graphs are presented as under [Table-4] .

Graphs of the Survival Proportions by models (Exponential, Weibull, Log-logistic, Gompertz and Logistic distributions) of CP and IP respectively of Age Group under 50 years.

Discussion

The graph of survival proportions (obtained by using exponential distribution) of CP and IP indicates that for the age group under 50 years of CABG patients, the difference between survival proportions of CP and IP is almost ignorable. The graph of survival proportions (obtained by using Weibull, log-logistic, gompertz and logistic distributions) of CP and IP indicates that for the age group under 50 years of CABG patients, the difference between survival proportions of CP and IP is small at the start, the difference is continuously but slowly increasing, whereas the survival proportions of IP of the CABG patients using Weibull, log-logistic, gompertz and logistic distributions are slightly lower then those of CP. The differences between the means of survival proportions (obtained by using Weibull, log-logistic, gompertz and logistic distributions) of CP and IP respectively of age group under 50 years of CABG patients, are tested using t-statistic under the null hypothesis , against an alternative hypothesis , by using one sided upper tailed test (see Lind, Marchall and Mason [29] ). The values of t-statistic calculated are 0.058, -0.475, -0.390 and -1.000 respectively, as shown in the respective graphs, when compared with , suggest that is accepted which means that the differences between the means of CP and IP of age group under 50 years are statistically insignificant at 5% level of significance.

Conclusion

The differences between the means of survival proportions obtained by Exponential, Weibull, Log-logistic, Gompertz and Logistic distributions for CP and IP, of age group under 50 years of CABG patients are statistically insignificant at 5% level of significance. However, the data is best modelled by Exponential and Weibull survival models.

References

[1] Abernathy R.B. (1998) The New Weibull Handbook. 3rd ed.  
» CrossRef   » Google Scholar   » PubMed   » DOAJ   » CAS   » Scopus  

[2] American Heart Association Dallas, Texas (2007) Heart Disease and Stroke Statistics.  
» CrossRef   » Google Scholar   » PubMed   » DOAJ   » CAS   » Scopus  

[3] Axel Vise, Yiwen Zhu, Dalit May, Veena Afza, Elaine Gong, Catia Attanasio, Matthew J. Blow, Jonathan. C. Cohen, Edward M.R. bin & Len A. Pennacchio (2010) Targeted deletion of the 9p21 non-coding coronary artery disease risk interval in mice.  
» CrossRef   » Google Scholar   » PubMed   » DOAJ   » CAS   » Scopus  

[4] Bain L.J. and Englehardt M. (1991) Statistical Analysis of Reliability and Life-Testing Models: Theory and Methods, 2nd ed.  
» CrossRef   » Google Scholar   » PubMed   » DOAJ   » CAS   » Scopus  

[5] Bunday B.D. and Al-Mutwali I.A (1981) IEE Trans, Reliability, 30(4), 367-339.  
» CrossRef   » Google Scholar   » PubMed   » DOAJ   » CAS   » Scopus  

[6] Cohen A.C. (1965) Technometrics., 7(4), 579-588.  
» CrossRef   » Google Scholar   » PubMed   » DOAJ   » CAS   » Scopus  

[7] Collett D. (1995) Modelling Survival Data in Medical Research.  
» CrossRef   » Google Scholar   » PubMed   » DOAJ   » CAS   » Scopus  

[8] Crow L.H. (1982) Technometrics., 24(1), 67-72.  
» CrossRef   » Google Scholar   » PubMed   » DOAJ   » CAS   » Scopus  

[9] Cyrus R. Mehta (1981) Biometrika 68(3), 669-675.  
» CrossRef   » Google Scholar   » PubMed   » DOAJ   » CAS   » Scopus  

[10] David G. Kleinbaum, Mitchel Klein (2005) Survival analysis: a self-learning text.  
» CrossRef   » Google Scholar   » PubMed   » DOAJ   » CAS   » Scopus  

[11] Goldstein L., Adams R., Alberts M.L. Appel L. Brass, Bushnell C., Culebras A., DeGraba T., Gorelick P., Guyton J. (2006) American Journal of Ophthalmology, 142(4), 716-716.  
» CrossRef   » Google Scholar   » PubMed   » DOAJ   » CAS   » Scopus  

[12] Gross A.J. and Clark V.A. (1975) Survival Distribution: Reliability Applications in the Biomedical Sciences Wiley.  
» CrossRef   » Google Scholar   » PubMed   » DOAJ   » CAS   » Scopus  

[13] Hansson Göran K.M.D. (2005) Inflammation, Atherosclerosis, and Coronary Artery Disease. 352(16), 1685-1695.  
» CrossRef   » Google Scholar   » PubMed   » DOAJ   » CAS   » Scopus  

[14] Heart and Stroke Foundation Canada (1997) Heart Disease and Stroke Statistics.  
» CrossRef   » Google Scholar   » PubMed   » DOAJ   » CAS   » Scopus  

[15] Heymann C., Von Heymann (2002) Journal of Cardiothoracic and Vascular Anesthesia, 16(5), 615-616.  
» CrossRef   » Google Scholar   » PubMed   » DOAJ   » CAS   » Scopus  

[16] Hisada & Arizino (2002) IEEE Transactions, 51(3), 331-336.  
» CrossRef   » Google Scholar   » PubMed   » DOAJ   » CAS   » Scopus  

[17] Jennifer Heisler R.N. (2008) After Coronary Artery Bypass Graft Surgery-Recovering From Open Heart Surgery.  
» CrossRef   » Google Scholar   » PubMed   » DOAJ   » CAS   » Scopus  

[18] John H. Lemmer (2003) Hand Book of Patient Care in Cardiology Surgery; Lippincott Williams & Wilkins.  
» CrossRef   » Google Scholar   » PubMed   » DOAJ   » CAS   » Scopus  

[19] Kaplan E.L. Meier P. (1958) J. American. Assoc., 53, 457-481.  
» CrossRef   » Google Scholar   » PubMed   » DOAJ   » CAS   » Scopus  

[20] Khan K.H. and Mahmud Z. (1999) Weibull Distribution Model for The Breast Cancer Survival Data Using Maximum Likelihood Method.  
» CrossRef   » Google Scholar   » PubMed   » DOAJ   » CAS   » Scopus  

[21] Khizar H., Khan M. Saleem and Zafar Mahmud (2011) Survival Proportions of CABG Patients: A New Approch. 3(3).  
» CrossRef   » Google Scholar   » PubMed   » DOAJ   » CAS   » Scopus  

[22] Klein P.J. and Moeschberger L.M. (1997, 2003) Survival Analysis Techniques for Censored and Truncated Data.  
» CrossRef   » Google Scholar   » PubMed   » DOAJ   » CAS   » Scopus  

[23] Lang Wu (2010) Mixed effects models for complex data.  
» CrossRef   » Google Scholar   » PubMed   » DOAJ   » CAS   » Scopus  

[24] Lanju Zhang and William F. Rosenberger (2007) Journal of the Royal Statistical Society: Series C, 56(2), 153-165.  
» CrossRef   » Google Scholar   » PubMed   » DOAJ   » CAS   » Scopus  

[25] Lawless Jerald F. Lawless (1982, 2003) Statistical Models and Methods for Lifetime Data, John Wiley and Sons.  
» CrossRef   » Google Scholar   » PubMed   » DOAJ   » CAS   » Scopus  

[26] Lawrencce M. Leemis (1995) Reliability Probabilistic Modela and Statistical Methods.  
» CrossRef   » Google Scholar   » PubMed   » DOAJ   » CAS   » Scopus  

[27] Lee Jaeyong, Kim Jinseog and Jung Sin-Ho (2006) Lifetime Data Analysis, 13(1).  
» CrossRef   » Google Scholar   » PubMed   » DOAJ   » CAS   » Scopus  

[28] Lind A. Douglas, Marchall G. William and Mason D. Robert (2002)Statistical techniques in Business and Economics.  
» CrossRef   » Google Scholar   » PubMed   » DOAJ   » CAS   » Scopus  

[29] Maan N.R., Schafer R.E. and Singpurawalla N.D. (1974) Method for Statistical Analysis of reliability and Lifetime Data.  
» CrossRef   » Google Scholar   » PubMed   » DOAJ   » CAS   » Scopus  

[30] Nelson W. (1982) Applied Life Data Analysis.  
» CrossRef   » Google Scholar   » PubMed   » DOAJ   » CAS   » Scopus  

[31] Sridhar K.N., Mun Choon Chan (2009) Modeling link lifetime data with parametric regression models in MANETs. IEEE. 13(12).  
» CrossRef   » Google Scholar   » PubMed   » DOAJ   » CAS   » Scopus  

[32] Sun Zhonghua and Hong Ng. Kwan (2011) World J. Cardiol. 3(9), 303-310.  
» CrossRef   » Google Scholar   » PubMed   » DOAJ   » CAS   » Scopus  

[33] Swaminathan R. and Brenner H. (1998,2011) Statistical methods for Cancer Survival Analysis., 1&2.  
» CrossRef   » Google Scholar   » PubMed   » DOAJ   » CAS   » Scopus  

[34] William S., Weintraub M.D., Ellis Jonees L., Josef M. Craver, Ralph Grossedwald, Robert A. Guyton (1995,1997) American Heart Association.  
» CrossRef   » Google Scholar   » PubMed   » DOAJ   » CAS   » Scopus  

[35] William S., Weintraub M.D. Stephen D. Clements M.D., Van-Thomas Crisco L., Robert A. Guyton (2003) American Heart Association.  
» CrossRef   » Google Scholar   » PubMed   » DOAJ   » CAS   » Scopus  

[36] Xing Ming Tu, Xiao-Li Meng, Marcello Pagano (1993) Journal of the American Statistical Association.  
» CrossRef   » Google Scholar   » PubMed   » DOAJ   » CAS   » Scopus  

Images
Fig. 1-
Table 1-
Table 2-
Table 3-
Table 4-