eScholarship Repository eScholarship Repository California Digital Library
eScholarship > UCLASTAT > PAPERS > Paper 2005061401

Statistics Papers

Statistics Website

Policies

Search Statistics

Submit a Paper

Notify me of new papers

institute_logo

Department of Statistics, UCLA
University of California, Los Angeles

Statistics Papers  •  Statistics Website  •  Policies  •  Search Statistics  •  Submit a Paper

Statistical Learning Procedures for Monitoring Regulatory Compliance: An Application to Fisheries Data
Cleridy E. Lennert-Cody, Inter-American Tropical Tuna Commission
Richard Berk, Department of Statistics, UCLA

Download the Paper (308 K, PDF file) - June 14, 2005 Tell a colleague about it.
Printing Tips: Select 'print as image' in the Acrobat print dialog if you have trouble printing.

ABSTRACT:
As a special case of statistical learning, ensemble methods are well suited for the analysis of opportunistically collected data that involve many weak and sometimes specialized predictors, especially when subject-matter knowledge favors inductive approaches. In this paper, we analyze data on the incidental mortality of dolphins in the purse-seine fishery for tunas in the eastern Pacific Ocean. The goal is to identify those rare purse-seine sets for which incidental mortality would be expected but none was reported. The ensemble method random forests is used to classify sets according to whether mortality was (response = 1) or was not (response = 0) reported. To identify questionable reporting practice, we construct “residuals” as the difference between the categorical response (0, 1) and the proportion of trees in the forest that correctly assify a given set. Two uses of these residuals to identify suspicious data are illustrated. This approach shows promise as a means to identify suspect data gathered for environmental monitoring.

SUGGESTED CITATION:
Cleridy E. Lennert-Cody and Richard Berk, "Statistical Learning Procedures for Monitoring Regulatory Compliance: An Application to Fisheries Data" (June 14, 2005). Department of Statistics, UCLA. Department of Statistics Papers. Paper 2005061401.
http://repositories.cdlib.org/uclastat/papers/2005061401

 
bar
Open Archives Initiative eScholarship is a service of the California Digital Library bepress