The definition of vandalism
at Wikipedia includes "any addition, removal, or change of content made in a deliberate attempt to compromise the integrity of Wikipedia." Hence, Wikipedia vandalism detection comprises the following classification task:
Given a set of edits on Wikipedia articles, the task is to separate the ill-intentioned edits from the well-intentioned edits.
Your vandalism detector will be evaluated based on a corpus that contains manually annotated Wikipedia vandalism cases. The evaluation divides into a training phase and a test phase. For the training phase, we offer a fully annotated training corpus that may be used to adjust your detector. In the testing phase, you have to detect vandalism that is hidden in the test corpus and submit your detection results to us.
You can submit your vandalism detector by clicking here.
- S.-C. Chin, W. N. Street, P. Srinivasan, and D. Eichmann. Detecting Wikipedia vandalism with active learning and statistical ... Fourth Workshop on Information Credibility on the Web (WICOW 2010), Raleigh, NC, April 2010.
- B. Thomas Adler, Luca de Alfaro, Santiago M. Mola-Velasco, Paolo Rosso, and Andrew G. West. Wikipedia Vandalism Detection: Combining Natural Language, Metadata.... In Proceedings of the 12th International Conference on Computational Linguistics and Intelligent Text Processing (CICLing'11), Tokyo, Japan, 2011.