eScholarship Repository eScholarship Repository California Digital Library
eScholarship > ISCHOOL > Paper 2008-021

ISchool Papers

ISchool Website

Policies

Search ISchool

Submit a Paper

Notify me of new papers

institute_logo

ISchool Papers  •  ISchool Website  •  Policies  •  Search ISchool  •  Submit a Paper

Automatically Assessing the Quality of Wikipedia Articles
Joshua E. Blumenstock, School of Information, UC Berkeley

Download the Paper (344 K, PDF file) - April 1, 2008 Tell a colleague about it.
Printing Tips: Select 'print as image' in the Acrobat print dialog if you have trouble printing.

ABSTRACT:
Since its inception in 2001, Wikipedia has fast become one of the Internet's most dominant sources of information. Dubbed "the free encyclopedia", Wikipedia contains millions of articles that are written, edited, and maintained by volunteers. Due in part to the open, collaborative process by which content is generated, many have questioned the reliability of these articles. The high variance in quality between articles is a potential source of confusion that likely leaves many visitors unable to distinguish between good articles and bad. In this work, we describe how a very simple metric – word count – can be used to as a proxy for article quality, and discuss the implications of this result for Wikipedia in particular, and quality assessment in general.

SUGGESTED CITATION:
Joshua E. Blumenstock, "Automatically Assessing the Quality of Wikipedia Articles" (April 1, 2008). School of Information. Paper 2008-021.
http://repositories.cdlib.org/ischool/2008-021

 
bar
Open Archives Initiative eScholarship is a service of the California Digital Library bepress