eScholarship Repository eScholarship Repository California Digital Library
eScholarship > UCLASTAT > CTS > Paper 2003121401

CTS Papers

CTS Website

Search CTS

Notify me of new papers

institute_logo

Department of Statistics, UCLA
Center for the Teaching of Statistics
University of California, Los Angeles

CTS Papers  •  CTS Website  •  Search

Internet Data Analysis for the Undergraduate Statistics Curriculum. Journal of Statistics Education (forthcoming)
Juana Sanchez, UCLA Department of Statistics
Yan He, UCLA Department of Statistics

Download the Paper (577 K, PDF file) - December 14, 2003 Tell a colleague about it.
Printing Tips: Select 'print as image' in the Acrobat print dialog if you have trouble printing.

ABSTRACT:
Statistics textbooks for undergraduates have not caught up with the enormous amount of analysis of Internet data that is taking place these days. Case studies that use Web server log data, Internet survey data or Internet network traffic data are rare in undergraduate Statistics education. This paper summarizes the results of research in three areas of Internet data analysis: users' web browing behavior, user demographics, and network performance. We present some of the main questions analyzed in the literature, some unsolved problems, and some typical data analysis methods used. We illustrate the questions and the methods with large data sets. The data sets were obtained from the publicly available pool of data. Those data sets had to be processed and transformed to make them available for classroom exercises. The processed data sets as well as more material for classes, are available at a web site with address that can be obtained from the main author.

SUGGESTED CITATION:
Juana Sanchez and Yan He, "Internet Data Analysis for the Undergraduate Statistics Curriculum. Journal of Statistics Education (forthcoming)" (December 14, 2003). Center for the Teaching of Statistics. Paper 2003121401.
http://repositories.cdlib.org/uclastat/cts/2003121401

 
bar
Open Archives Initiative eScholarship is a service of the California Digital Library bepress