eScholarship Repository eScholarship Repository California Digital Library
eScholarship > CBMB > Paper regression

CBMB Papers

CBMB Website

Policies

Search CBMB

Submit a Paper

Notify me of new papers

institute_logo

Center for Bioinformatics & Molecular Biostatistics
University of California, San Francisco

CBMB Papers  •  CBMB Website  •  Policies  •  Search CBMB  •  Submit a Paper

Regression Approaches for Microarray Data Analysis
Mark R. Segal, University of California, San Francisco
Kam D. Dahlquist, Gladstone Institute of Cardiovascular Disease and Cardiovascular Research Institute, UCSF
Bruce R. Conklin, Gladstone Institute of Cardiovascular Disease and Cardiovascular Research Institute, UCSF

To appear in Journal of Computational Biology

Download the Paper (297 K, PDF file) - August 1, 2002 Tell a colleague about it.
Printing Tips: Select 'print as image' in the Acrobat print dialog if you have trouble printing.

ABSTRACT:
A variety of new procedures have been devised to handle the two sample comparison (e.g., tumor versus normal tissue) of gene expression values as measured with microarrays. Such new methods are required in part because of some defining characteristics of microarray-based studies: (i) the very large number of genes contributing expression measures which far exceeds the number of samples (observations) available, and (ii) the fact that by virtue of pathway/network relationships, the gene expression measures tend to be highly correlated. These concerns are exacerbated in the regression setting, where the objective is to relate gene expression, simultaneously for multiple genes, to some external outcome or phenotype. Correspondingly, several methods have been recently proposed for addressing these issues. We briefly critique some of these methods prior to a detailed evaluation of gene harvesting. This reveals that gene harvesting, without additional constraints, can yield artifactual solutions. Results obtained employing such constraints motivate the use of regularized regression procedures such as the lasso, least angle regression, and support vector machines. Model selection and solution multiplicity issues are also discussed. The methods are evaluated using a microarraybased study of cardiomyopathy in transgenic mice.

SUGGESTED CITATION:
Mark R. Segal, Kam D. Dahlquist, and Bruce R. Conklin, "Regression Approaches for Microarray Data Analysis" (August 1, 2002). Center for Bioinformatics & Molecular Biostatistics. Paper regression.
http://repositories.cdlib.org/cbmb/regression

 
bar
Open Archives Initiative eScholarship is a service of the California Digital Library bepress