eScholarship Repository eScholarship Repository California Digital Library
eScholarship > CBMB > Paper L1Cox

CBMB Papers

CBMB Website

Policies

Search CBMB

Submit a Paper

Notify me of new papers

institute_logo

Center for Bioinformatics & Molecular Biostatistics
University of California, San Francisco

CBMB Papers  •  CBMB Website  •  Policies  •  Search CBMB  •  Submit a Paper

Penalized Cox Regression Analysis in the High-Dimensional and Low-sample Size Settings, with Applications to Mi-croarray Gene Expression Data
Jiang Gui, University of California, Davis
Hongzhe Li, University of California, Davis

Download the Paper (263 K, PDF file) - March 1, 2004 Tell a colleague about it.
Printing Tips: Select 'print as image' in the Acrobat print dialog if you have trouble printing.

ABSTRACT:
An important application of microarray technology is to relate gene expression profiles to various clinical phenotypes of patients. Success has been demonstrated in molecular classification of cancer in which the gene expression data serve as predictors and different types of cancer serve as a categorical outcome variable. However, there has been less research in linking gene expression profiles to the censored survival data such as patients' overall survival time or time to cancer relapse. Due to large variability in time to certain clinical event among patients, studying possibly censored survival phenotypes can be more informative than treating the phenotypes as categorical variables. We propose to use the L1 penalized estimation for the Cox model to select genes that are relevant to patients' survival and to build a predictive model for future prediction. The computational difficulty associated with the estimation in the high-dimensional and low-sample size settings can be efficiently solved by using the latest developed least angle regression method. Results from our simulation studies and application to real data set on predicting survival after chemotherapy for patients with diffuse large B-cell lymphoma demonstrate that the proposed procedure, which we call the LARS-Lasso procedure, can be used for identifying important genes that are related to time to death due to cancer and for building a parsimonious model for predicting the survival of future patients. The LARS-Lasso regression gives much better predictive performance than the L2 penalized regression or dimension-reduction based methods such as the partial Cox regression method.

SUGGESTED CITATION:
Jiang Gui and Hongzhe Li, "Penalized Cox Regression Analysis in the High-Dimensional and Low-sample Size Settings, with Applications to Mi-croarray Gene Expression Data" (March 1, 2004). Center for Bioinformatics & Molecular Biostatistics. Paper L1Cox.
http://repositories.cdlib.org/cbmb/L1Cox

 
bar
Open Archives Initiative eScholarship is a service of the California Digital Library bepress