Memorias de investigación
Communications at congresses:
Temporal characterization of the requests to Wikipedia
Year:2011

Research Areas
  • Information technology and adata processing

Information
Abstract
This paper presents an empirical study about the temporal patterns characterizing the requests submitted by users to Wikipedia. The study is based on the analysis of the log lines registered by the Wikimedia Foundation Squid servers after having sent the appropriate content in response to users' requests. The analysis has been conducted regarding the ten most visited editions of Wikipedia and has involved more than 14,000 million log lines corresponding to the traffic of the entire year 2009. The conducted methodology has mainly consisted in the parsing and filtering of users' requests according to the study directives. As a result, relevant information fields have been finally stored in a database for persistence and further characterization. In this way, we, first, assessed, whether the traffic to Wikipedia could serve as a reliable estimator of the overall traffic to all the Wikimedia Foundation projects. Our subsequent analysis of the temporal evolutions corresponding to the different types of requests to Wikipedia revealed interesting differences and similarities among them that can be related to the users' attention to the Encyclopedia. In addition, we have performed separated characterizations of each Wikipedia edition to compare their respective evolutions over time.
International
Si
Congress
5th International Workshop on New Challenges in Distributed Information Filtering and Retrieval
960
Place
Reviewers
Si
ISBN/ISSN
1613-0073
Start Date
17/09/2011
End Date
17/09/2011
From page
1
To page
10
Proceedings of the 5th International Workshop on New Challenges in Distributed Information Filtering and Retrieval
Participants
  • Autor: Israel Herraiz Tabernero UPM

Research Group, Departaments and Institutes related
  • Creador: Grupo de Investigación: Matemática e Informática Aplicadas a la Ingeniería civil