|
Generalized Quasi-likelihood (GQL) Inference*
by Brajendra C. Sutradhar
Memorial University
Email address: bsutradh@mun.ca
QL Estimation for Independent Data. For let denote the response variable for the th individual, and be the associated dimensional covariate vector. Also, let be the dimensional vector of regression effects of on Further suppose that the responses are collected from independent individuals. It is understandable that if the probability distribution of is not known, then one can not use the well known likelihood approach to estimate the underlying regression parameter Next suppose that only two moments of the data, that is, the mean and the variance functions of the response variable for all are known, and for a known functional form , these moments are given by
where for a link function and and are the first and second order derivatives of respectively, with respect to For the estimation of the regression parameter vector under this independence set up, Wedderburn (1974) (see also McCullagh (1983)) proposed to solve the so-called quasi-likelihood (QL) estimating equation given by
![$\displaystyle \sum^K_{i=1}[\frac{\partial a'(\theta_{i})}{\partial \beta}\frac{(y_{i}-a'(\theta_{i}))}{a''(\theta_i)}]=0.$ $\displaystyle \sum^K_{i=1}[\frac{\partial a'(\theta_{i})}{\partial \beta}\frac{(y_{i}-a'(\theta_{i}))}{a''(\theta_i)}]=0.$](http://statprob.com/cache/objects/248/l2h/img26.png) |
(2) |
Let be the QL estimator of obtained from (2). It is known that this estimator is consistent and highly efficient. In fact, for Poisson and binary data, for example, is equivalent to the maximum likelihood (ML) estimator and hence it turns out to be an optimal estimator.
Illustration for the Poisson case: For the Poisson data, one uses
 |
(3) |
with identity link function that is, This gives the mean and the variance functions as
var  (say)
yielding by (2), the QL estimating equation for as
 |
(4) |
Note that as the Poisson density is given by with it follows that the log likelihood function of has the form log yielding the likelihood equation for as
![$\displaystyle \frac{\partial \mbox{log} L}{\partial \beta}=\sum^K_{i=1}[y_{i}-a... ...a_{i})]\frac{\partial \theta_{i}}{\partial \beta}=\sum^K_{i=1}x_i(y_i-\mu_i)=0,$ $\displaystyle \frac{\partial \mbox{log} L}{\partial \beta}=\sum^K_{i=1}[y_{i}-a... ...a_{i})]\frac{\partial \theta_{i}}{\partial \beta}=\sum^K_{i=1}x_i(y_i-\mu_i)=0,$](http://statprob.com/cache/objects/248/l2h/img42.png) |
(5) |
which is the same as the QL estimating equation (4). Thus, if the likelihood function were known, then the ML estimate of would be the same as the QL estimate 
Illustration for the binary case: For the binary data, one uses
and
|
(6) |
with The QL estimating equation (2) for the binary data, however, provides the same formula (4) as in the Poisson case, except that now for the binary case whereas for the Poisson case
As far as the ML estimation for the binary case is concerned, one first writes the binary density given by Next by writing the log likelihood function as log one obtains the same likelihood estimating equation as in (5), except that here under the binary model. Since the QL estimating equation (4) is the same as the ML estimating equation (5), it then follows that the ML and QL estimates for would also be the same for the binary data.
GQL Estimation: A Generalization of the QL Estimation to the Correlated Data.
As opposed to the independence set up, we now consider as a vector of repeated binary or count responses, collected from the th individual, for all Let where represents the response recorded at time for the th individual. Also, let be the dimensional covariate vector corresponding to the scalar and be the dimensional regression effects of on for all and all Suppose that and be the mean and the variance of that is and var Note that both and are functions of But, when the variance is a function of mean, it is sufficient to estimate involved in the mean function only, by treating involved in the variance function to be known. Further note that since the repeated responses of an individual are likely to correlated, the estimate of to be obtained by ignoring the correlations, that is, the solution of the independence assumption based QL estimating equation
![$\displaystyle \sum^K_{i=1}\sum^T_{t=1}[\frac{\partial \mu_{it}}{\partial \beta}\frac{(y_{i}-\mu_{it})}{\sigma_{itt}}]=0,$ $\displaystyle \sum^K_{i=1}\sum^T_{t=1}[\frac{\partial \mu_{it}}{\partial \beta}\frac{(y_{i}-\mu_{it})}{\sigma_{itt}}]=0,$](http://statprob.com/cache/objects/248/l2h/img83.png) |
(7) |
for will be consistent but inefficient. As a remedy to this inefficient estimation problem, Sutradhar (2003) has proposed a generalization of the QL estimation approach, where is now obtained by solving the GQL estimating equation given by
 |
(8) |
where is the mean vector of and is the covariance matrix of that can be expressed as , with diag and as the correlation matrix of being a correlation index parameter.
Note that the use of the GQL estimating equation (8) requires the structure of the correlation matrix to be known, which is, however, unknown in practice. To overcome this difficulty, Sutradhar (2003) has suggested a general stationary auto-correlation structure given by
![$\displaystyle C_i(\rho)=\left[ \begin{array}{ccccc} 1 & \rho_1 & \rho_2 & \cdot... ...s \\ \rho_{T-1} & \rho_{T-2} & \rho_{T-3} & \cdots & 1 \\ \end{array} \right] ,$ $\displaystyle C_i(\rho)=\left[ \begin{array}{ccccc} 1 & \rho_1 & \rho_2 & \cdot... ...s \\ \rho_{T-1} & \rho_{T-2} & \rho_{T-3} & \cdots & 1 \\ \end{array} \right] ,$](http://statprob.com/cache/objects/248/l2h/img98.png) |
(9) |
(see also Sutradhar and Das (1999, Section 3)), for all where for represents the lag auto-correlation. As far as the estimation of the lag correlations is concerned, they may be consistently estimated by using the well known method of moments. For , , , the moment estimator for the autocorrelation of lag , , has the formula
 |
(10) |
(Sutradhar and Kovacevic (2000, eqn. (2.18), Sutradhar (2003)), where is the standardized residual, defined as .
The GQL estimating equation (8) for and the moment estimate of by (10) are solved iteratively until convergence. The final estimate of obtained from this iterative process is referred to as the GQL estimate of and may be denoted by This estimator is consistent for and also highly efficient, the ML estimator being fully efficient which is however impossible or extremely complex to obtain in the correlated data set up.
With regard to the generality of the stationary auto-correlation matrix in (9), one may show that this matrix, in fact, represents the correlations of many stationary dynamic such as stationary auto-regressive order 1 (AR(1)), stationary moving average order 1 (MA(1)), and stationary equi-correlations (EQC) models. For example, consider the stationary AR(1) model given by
 |
(11) |
(McKenzie (1988), Sutradhar (2003)) where it is assumed that for given , denotes the so-called binomial thinning operation (McKenzie, 1988). That is,
 |
(12) |
with and . Furthermore, it is assumed in (11) that follows the Poisson distribution with mean parameter that is, where with stationary covariate vector such that for all Further, in (11), and is independent of This model in (11) yields the mean, variance and auto-correlations of the data as shown in Table 1. The Table 1 also contains the MA(1) and EQC models and their basic properties including the correlation structures.
It is clear from Table 1 that the correlation structures for all three processes can be represented by in (9). By following Qaqish (2003), one may write similar but different dynamic models for the repeated binary data, with their correlation structures represented by Thus, if the count or binary data follow this type of auto-correlations model, one may then certainly estimate the regression vector consistently and efficiently by solving the general auto-correlations matrix based GQL estimating equation (8), where the lag correlations are estimated by (10) consistently.
[* Reprinted with permission from Lovric, Miodrag (2011), International Encyclopedia of Statistical Science. Heidelberg: Springer Science & Business Media, LLC]
- McCullagh, P. (1983). Quasilikelihood functions. Ann. Statist. 11, 59-67.
- McKenzie, E. (1988). Some ARMA models for dependent sequences of Poisson counts. Advances in Applied Probability 20, 822-35.
- Qaqish, B. F. (2003). A family of multivariate binary distributions for simulating correlated binary variables with specified marginal means and correlations. Biometrika 90, 455-463.
- Sutradhar, B. C. (2003). An overview on regression models for discrete longitudinal responses. Statistical Science 18, 377-93.
- Sutradhar, B. C. & Das, K. (1999). On the efficiency of regression estimators in for longitudinal data. Biometrika 86, 459-65.
- Sutradhar, B. C. & Kovacevic, M. (2000). Analyzing ordinal longitudinal survey data: Generalized estimating equations approach. Biometrika 87, 837-848.
- Wedderburn, R. W. M. (1974). Quasi-likelihood functions, generalised linear models, and the Gauss-Newton method. Biometrika 61, 439-447.
|