|
Multivariate Reduced-Rank Regression
Alan J. Izenman
Department of Statistics
Temple University
Philadelphia, PA 19122
alan@temple.edu
Key Words: Canonical variate analysis; Dimensionality reduction; Effective dimensionality; Graphical methods; Least-squares estimation; Linear discriminant analysis; Multivariate analysis; Principal component analysis; Regression coefficient matrix.
Multivariate reduced-rank regression is a way of constraining the multivariate linear regression model so that the rank of the regression coefficient matrix has less than full rank. Without the constraint, multivariate linear regression has no true multivariate content.
To see this, suppose we have a random -vector of predictor (or input) variables with mean vector and covariance matrix , and a random -vector of response (or output) variables with mean vector and covariance matrix . Suppose that the -vector has a joint distribution with mean vector and covariance matrix,

|
(1) |
respectively, where we assume that and are both nonsingular. Now, consider the classical multivariate linear regression model,
 |
(2) |
where Y depends linearly on X, is the overall mean vector, is the multivariate regression coefficient matrix, and is the error term. In this model, and are unknown and are to be estimated. The least-squares estimator of  is given by
where
In (3), the expectation is taken over the joint distribution of . The minimum achieved is . The -matrix is called the (full-rank) regression coefficient matrix. This solution is identical to that obtained by performing a sequence of ordinary least-squares multiple regressions. For the th such multiple regression, is regressed on the -vector X, where . Suppose the minimizing regression coefficient vectors are the -vectors , . Arranging the coefficient vectors as the columns,  , of an -matrix, and then transposing the result, it follows from (4) that
Thus, multivariate linear regression is equivalent to just carrying out a sequence of multiple regressions. This is why multivariate regression is often confused with multiple regression.
Now, rewrite the multivariate linear model as
 |
(6) |
where the rank constraint is
 |
(7) |
Equations (6) and (7) form the multivariate reduced-rank regression model. When the rank condition (7) holds, there exist two (nonunique) full-rank matrices A and B, where A is an -matrix and B is a -matrix, such that
 |
(8) |
The multivariate reduced-rank regression model can now be written as
 |
(9) |
The rank condition has been embedded into the regression model. The goal is to estimate , A, and B (and, hence, C).
Let be a positive-definite symmetric -matrix of weights. The weighted least-squares estimates of  are
where
and is an -matrix, where the th column, , is the eigenvector corresponding to the th largest eigenvalue, , of the symmetric matrix,
 |
(14) |
The multivariate reduced-rank regression coefficient matrix C with rank is, therefore, given by
 |
(15) |
The minimum achieved is .
The main reason that multivariate reduced-rank regression is so important is that it contains as special cases the classical statistical techniques of principal component analysis, canonical variate and correlation analysis, linear discriminant analysis, exploratory factor analysis, multiple correspondence analysis, and other linear methods of analyzing multivariate data. It is also closely related to artificial neural network models and to cointegration in the econometric literature.
For example, the special cases of principal component analysis, canonical variate and correlation analysis, and linear discriminant analysis are given by the following choices: For principal component analysis, set and ; for canonical variate and correlation analysis, set ; for linear discriminant analysis, use the canonical-variate analysis choice of and set Y to be a vector of binary variables whose component values (0 or 1) indicate the group or class to which an observation belongs. Details of these and other special cases can be found in Izenman (2008). If the elements of in (1) are unknown, as will happen in most practical problems, they have to be estimated using sample data on Z.
The relationships between multivariate reduced-rank regression and the classical linear dimensionality reduction techniques become more interesting when the metaparameter is unknown and has to be estimated. The value of is called the effective dimensionality of the multivariate regression (Izenman, 1980). Estimating is equivalent to the classical problems of determining the number of principal components to retain, the number of canonical variate to retain, or the number of linear discriminant functions necessary for classification purposes. Graphical methods for estimating include the scree plot, the rank trace plot, and heatmap plots. Formal hypothesis tests have also been developed for estimating .
When the number of variables is greater than the number of observations, some adjustments to the results have to be made to ensure that and can be inverted. One simple way of doing this is to replace by and to replace by as appropriate, where and . Other methods, including regularization, banding, tapering, and thresholding, have been studied for estimating large covariance matrices and can be used here as appropriate.
The multivariate reduced-rank regression model can also be developed for the case of nonstochastic (or fixed) predictor variables.
The multivariate reduced-rank regression model has its origins in Anderson (1951), Rao (1964, 1965), and Brillinger (1969), and its name was coined by Izenman (1972, 1975). For the asymptotic distribution of the estimated reduced-rank regression coefficient matrix, see Anderson (1999), who gives results for both the random-X and fixed-X cases. Additional references are the monographs by van der Leeden (1990) and Reinsel and Velu (1998).
Reprinted with permission from Lovric, Miodrag (2011), International Encyclopedia of Statistical Science. Heidelberg: Springer Science +Business Media, LLC
References
Anderson, T.W. (1951). Estimating linear restrictions on regression coefficients for multivariate normal distributions, The Annals of Mathematical Statistics, 22, 327-351.
Anderson, T.W. (1999). Asymptotic distribution of the reduced-rank regression estimator under general conditions, The Annals of Statistics, 27, 1141-1154.
Brillinger, D.R. (1969). The canonical analysis of stationary time series, in Multivariate Analysis II (ed. P.R. Krishaiah), New York: Academic Press, pp. 331-350.
Izenman, A.J. (1972). Reduced-Rank Regression for the Multivariate Linear Model, Its Relationship to Certain Multivariate Techniques, and Its Application to the Analysis of Multivariate Data, Ph.D. dissertation, University of California, Berkeley.
Izenman, A.J. (1975). Reduced-rank regression for the multivariate linear model, Journal of Multivariate Analysis, 5, 248-264.
Izenman, A.J. (1980). Assessing dimensionality in multivariate regression, in Krishnaiah (ed.), Handbook of Statistics, 1, Amsterdam: North-Holland, pp. 571-591.
Izenman, A.J. (2008). Modern Multivariate Statistical Techniques: Regression, Classification, and Manifold Learning, New York: Springer.
Rao, C.R. (1964). The use and interpretation of principal components in applied research, Sankhya (A), 26, 329-358.
Rao, C.R. (1965). Linear Statistical Inference and Its Applications, New York: Wiley.
Reinsel, G.C. and Velu, R.P. (1998). Multivariate Reduced-Rank Regression, Lecture Notes in Statistics, 136, New York: Springer.
Van der Leeden, R. (1990). Reduced-Rank Regression With Structured Residuals, Leiden: DSWO Press.
|