eScholarship Repository eScholarship Repository California Digital Library
eScholarship > UCLASTAT > PAPERS > Paper 2004072501

Statistics Papers

Statistics Website

Policies

Search Statistics

Submit a Paper

Notify me of new papers

institute_logo

Department of Statistics, UCLA
University of California, Los Angeles

Statistics Papers  •  Statistics Website  •  Policies  •  Search Statistics  •  Submit a Paper

An Introduction to Ensemble Methods for Data Analysis (Revised July, 2004)
Richard Berk, UCLA Department of Statistics

Download the Paper (330 K, PDF file) - July 25, 2004 Tell a colleague about it.
Printing Tips: Select 'print as image' in the Acrobat print dialog if you have trouble printing.

ABSTRACT:
This paper provides an introduction to ensemble statistical procedures as a special case of algorithmic methods. The discussion beings with classification and regression trees (CART) as a didactic device to introduce many of the key issues. Following the material on CART is a consideration of cross-validation, bagging, random forests and boosting. Major points are illustrated with analyses of real data.

SUGGESTED CITATION:
Richard Berk, "An Introduction to Ensemble Methods for Data Analysis (Revised July, 2004)" (July 25, 2004). Department of Statistics, UCLA. Department of Statistics Papers. Paper 2004072501.
http://repositories.cdlib.org/uclastat/papers/2004072501

 
bar
Open Archives Initiative eScholarship is a service of the California Digital Library bepress