Login
In Cooperation with:

American Society for Quality Statistics Division

American Statistical Association

Bernoulli Society for Mathematical Statistics and Probability

Institute of Mathematical Statistics

International Biometric Society

International Chinese Statistical Association

International Society for Bayesian Analysis

International Statistical Institute

Royal Statistical Society

Statistical Society of Canada / Société statistique du Canada
Nonparametric regression using kernel and spline methods
cite
Nonparametric regression using kernel and spline methodsJean D. Opsomer1 F. Jay Breidt 2
|
![]() |
A class of kernel-based estimators that generalizes the Nadaraya-Watson estimator in (1) is referred to as local polynomial regression estimators. At each location
, the estimator
is obtained as the estimated intercept,
, in the weighted least squares fit of a polynomial of degree
,

where
![$\displaystyle {\mbox{\boldmath$X$}}_x = \left[ \begin{array}{cccc} 1 & x_1-x & ... ...ts & \vdots & & \vdots \ 1 & x_n-x & \cdots & (x_n-x)^p \end{array} \right]. $ $\displaystyle {\mbox{\boldmath$X$}}_x = \left[ \begin{array}{cccc} 1 & x_1-x & ... ...ts & \vdots & & \vdots \ 1 & x_n-x & \cdots & (x_n-x)^p \end{array} \right]. $](http://statprob.com/cache/objects/239/l2h/img49.png)
An extensive literature on kernel regression and local polynomial regression exists, and their theoretical properties are well understood. Both kernel regression and local polynomial regression estimators are biased but consistent estimators of the unknown mean function, when that function is continuous and sufficiently smooth. For further information on these methods, we refer to reader to the monographs by wan95 and fan96.
Spline methods
In the previous section, the unknown mean function was assumed to be locally well approximated by a polynomial, which led to local polynomial regression. An alternative approach is to represent the fit as a piecewise polynomial, with the pieces connecting at points called knots. Once the knots are selected, such an estimator can be computed globally in a manner similar to that for a parametrically specified mean function, as will be explained below. A fitted mean function represented by a piecewise continuous curve only rarely provides a satisfactory fit, however, so that usually the function and at least its first derivative are constrained to be continuous everywhere, with only the second or higher derivatives allowed to be discontinuous at the knots. For historical reasons, these constrained piecewise polynomials are referred to as splines, leading to the name spline regression or spline smoothing for this type of nonparametric regression.
Consider the following simple type of polynomial spline of degree
:
where
For fixed knots, a regression spline is linear in the unknown parameters
and can be fitted parametrically using least squares techniques. Under the homoskedastic model described in Section 1, the regression spline estimator for
is obtained by solving
and setting
The smoothing spline estimator is an important extension of the regression spline estimator. The smoothing spline estimator for
for a set of data generated by the statistical model described in Section 1 is defined as the minimizer of
over the set of all functions
Traditional regression spline fitting as in (4) is usually done using a relatively small number of knots. By construction, smoothing splines use a large number of knots (typically,
knots), but the smoothness of the function is controlled by a penalty term and the smoothing parameter
. The penalized spline estimator represents a compromise between these two approaches. It uses a moderate number of knots and puts a penalty on the coefficients of the basis functions. Specifically, a simple type of penalized spline estimator for
is obtained by solving
and setting
Spline-based regression methods are extensively described in the statistical literature. While the theoretical properties of (unpenalized) regression splines and smoothing splines are well established, results for penalized regression splines have only recently become available. The monographs by wah90, eub99 and rup03 are good sources of information on spline-based methods.
Acknowledgements
Based on an article from Lovric, Miodrag (2011), International Encyclopedia of Statistical Science. Heidelberg: Springer Science +Business Media, LLC.
Bibliography
- EubankEubank1999
- Eubank, R. L. (1999).
Nonparametric Regression and Spline Smoothing (2nd ed.).
New York: Marcel Dekker. - Fan and GijbelsFan and Gijbels1996
- Fan, J. and I. Gijbels (1996).
Local Polynomial Modelling and its Applications.
London: Chapman & Hall. - Ruppert, Wand, and CarrollRuppert et al.2003
- Ruppert, D., M. P. Wand, and R. J. Carroll (2003).
Semiparametric Regression.
Cambridge, UK: Cambridge University Press. - WahbaWahba1990
- Wahba, G. (1990).
Spline models for observational data.
SIAM [Society for Industrial and Applied Mathematics]. - Wand and JonesWand and Jones1995
- Wand, M. P. and M. C. Jones (1995).
Kernel Smoothing.
London: Chapman and Hall.
Footnotes
- ... Opsomer1
- Department of Statistics, Colorado State University, Fort Collins, CO, USA. Email: jopsomer@stat.colostate.edu.
- ... Breidt2
- Department of Statistics, Colorado State University, Fort Collins, CO, USA. Email: jbreidt@stat.colostate.edu




![\includegraphics[scale=0.5]{bank_swallow.eps} \includegraphics[scale=0.5]{bank_swallow.eps}](http://statprob.com/cache/objects/239/l2h/img40.png)



