SUMMARY

We propose a new method for estimation in linear models. The ‘lasso’ minimizes the residual sum of squares subject to the sum of the absolute value of the coefficients being less than a constant. Because of the nature of this constraint it tends to produce some coefficients that are exactly 0 and hence gives interpretable models. Our simulation studies suggest that the lasso enjoys some of the favourable properties of both subset selection and ridge regression. It produces interpretable models like subset selection and exhibits the stability of ridge regression. There is also an interesting relationship with recent work in adaptive function estimation by Donoho and Johnstone. The lasso idea is quite general and can be applied in a variety of statistical models: extensions to generalized regression models and tree-based models are briefly described.

REFERENCES

1

Breiman
,
L.
(
1993
)
Better subset selection using the non-negative garotte
.
Technical Report.
University of California
,
Berkeley
.

2

Breiman
,
L.
,
Friedman
,
J.
,
Olshen
,
R.
and
Stone
,
C.
(
1984
)
Classification and Regression Trees.
Belmont
:
Wadsworth
.

3

Breiman
,
L.
and
Spector
,
P.
(
1992
)
Submodel selection and evaluation in regression: the x-random case
.
Int. Statist. Rev.
,
60
,
291
319
.

4

Chen
,
S.
and
Donoho
,
D.
(
1994
)
Basis pursuit
. In
28th Asilomar Conf. Signals, Systems Computers, Asilomar.

5

Donoho
,
D.
and
Johnstone
,
I.
(
1994
)
Ideal spatial adaptation by wavelet shrinkage
.
Biometrika
,
81
,
425
455
.

6

Donoho
,
D. L.
,
Johnstone
,
I. M.
,
Hoch
,
J. C.
and
Stern
,
A. S.
(
1992
)
Maximum entropy and the nearly black object (with discussion)
.
J. R. Statist. Soc. B
,
54
,
41
81
.

7

Donoho
,
D. L.
,
Johnstone
,
I. M.
,
Kerkyacharian
,
G.
and
Picard
,
D.
(
1995
)
Wavelet shrinkage; asymptopia?
J. R. Statist. Soc. B
,
57
,
301
337
.

8

Efron
,
B.
and
Tibshirani
,
R.
(
1993
)
An Introduction to the Bootstrap.
London
:
Chapman and Hall
.

9

Frank
,
I.
and
Friedman
,
J.
(
1993
)
A statistical view of some chemometrics regression tools (with discussion)
.
Technometrics
,
35
,
109
148
.

10

Friedman
,
J.
(
1991
)
Multivariate adaptive regression splines (with discussion)
.
Ann. Statist.
,
19
,
1
141
.

11

George
,
E.
and
McCulloch
,
R.
(
1993
)
Variable selection via gibbs sampling
.
J. Am. Statist. Ass.
,
88
,
884
889
.

12

Hastie
,
T.
and
Tibshirani
,
R.
(
1990
)
Generalized Additive Models.
New York
:
Chapman and Hall
.

13

Lawson
,
C.
and
Hansen
,
R.
(
1974
)
Solving Least Squares Problems.
Englewood Cliffs
:
Prentice Hall
.

14

LeBlanc
,
M.
and
Tibshirani
,
R.
(
1994
)
Monotone shrinkage of trees
.
Technical Report.
University of Toronto
,
Toronto
.

15

Murray
,
W.
,
Gill
,
P.
and
Wright
,
M.
(
1981
)
Practical Optimization.
New York
:
Academic Press
.

16

Shao
,
J.
(
1992
)
Linear model selection by cross-validation
.
J. Am. Statist. Ass.
,
88
,
486
494
.

17

Stamey
,
T.
,
Kabalin
,
J.
,
McNeal
,
J.
,
Johnstone
,
I.
,
Freiha
,
F.
,
Redwine
,
E.
and
Yang
,
N.
(
1989
)
Prostate specific antigen in the diagnosis and treatment of adenocarcinoma of the prostate, ii: Radical prostatectomy treated patients
.
J. Urol.
,
16
,
1076
1083
.

18

Stein
,
C.
(
1981
)
Estimation of the mean of a multivariate normal distribution
.
Ann. Statist.
,
9
,
1135
1151
.

19

Tibshirani
,
R.
(
1994
)
A proposal for variable selection in the cox model
.
Technical Report.
University of Toronto
,
Toronto
.

20

Zhang
,
P.
(
1993
)
Model selection via multifold cv
.
Ann. Statist.
,
21
,
299
311
.

This content is only available as a PDF.
This article is published and distributed under the terms of the Oxford University Press, Standard Journals Publication Model (https://academic.oup.com/journals/pages/open_access/funder_policies/chorus/standard_publication_model)