
Output details

11 - Computer Science and Informatics

University of Edinburgh

Output title

Repeatable and reliable search system evaluation using crowdsourcing

Type
E - Conference contribution
Name of conference/published proceedings
Proceedings of the 34th international ACM SIGIR conference on Research and development in Information Retrieval
Volume number
-
Issue number
-
First page of article
923
ISSN of proceedings
-
Year of publication
2011
Number of additional authors
6
Additional information

Originality: First-ever demonstration of the repeatability and reliability of crowdsourced evaluation of semantic-web search engines, using multiple trials and comparison with expert judgements.

Significance: Describes an ongoing evaluation campaign measuring the performance of semantic-web search systems, with submissions from academic and commercial participants. Repeatability enables comparison of the performance of deployed search engines across time. Supports work towards a fully automated develop-test-evaluate loop by showing high agreement between distinct judge pools separated in time. Subsequent competitions have reused this methodology (http://www.websemanticsjournal.org/index.php/ps/article/view/336).

Rigour: Demonstrates reliability using multiple evaluation and relevance metrics. Introduces objective criteria for rejecting unreliable crowdsourced judges; a rough illustrative sketch of such checks is given below. The acceptance rate for this SIGIR conference was 19.9%.
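
The agreement and judge-rejection checks mentioned above are summarised only at a high level in this record. As a rough illustration (not the paper's actual procedure), the sketch below computes Cohen's kappa between two judge pools and flags crowd judges whose accuracy on a small set of expert-labelled gold items falls below a threshold; all names, labels, and the 0.7 threshold are illustrative assumptions.

# Illustrative sketch only: pool agreement via Cohen's kappa and a simple
# gold-label accuracy filter for unreliable judges. Not the paper's method.
from collections import Counter

def cohens_kappa(labels_a, labels_b):
    """Cohen's kappa for two equal-length lists of categorical labels."""
    n = len(labels_a)
    observed = sum(a == b for a, b in zip(labels_a, labels_b)) / n
    freq_a, freq_b = Counter(labels_a), Counter(labels_b)
    expected = sum(freq_a[c] * freq_b[c] for c in freq_a) / (n * n)
    return 1.0 if expected == 1 else (observed - expected) / (1 - expected)

def unreliable_judges(judgements, gold, min_accuracy=0.7):
    """Flag judges whose accuracy on gold-labelled items is below min_accuracy.

    judgements: {judge_id: {item_id: label}}; gold: {item_id: label}.
    """
    flagged = []
    for judge, answers in judgements.items():
        scored = [(item, label) for item, label in answers.items() if item in gold]
        if not scored:
            continue  # no overlap with gold items, cannot assess this judge here
        accuracy = sum(gold[item] == label for item, label in scored) / len(scored)
        if accuracy < min_accuracy:
            flagged.append(judge)
    return flagged

if __name__ == "__main__":
    # Two hypothetical judge pools labelling the same six query-result pairs.
    pool_1 = ["rel", "rel", "irr", "rel", "irr", "irr"]
    pool_2 = ["rel", "irr", "irr", "rel", "irr", "irr"]
    print("kappa:", round(cohens_kappa(pool_1, pool_2), 3))  # 0.667

    gold = {"q1": "rel", "q2": "irr", "q3": "rel"}
    judges = {"w1": {"q1": "rel", "q2": "irr", "q3": "rel"},
              "w2": {"q1": "irr", "q2": "rel", "q3": "irr"}}
    print("flagged:", unreliable_judges(judges, gold))  # ['w2']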

Interdisciplinary
-
Cross-referral requested
-
Research group
D - Institute for Language, Cognition & Computation
Citation count
10
Proposed double-weighted
No
Double-weighted statement
-
Reserve for a double-weighted output
No
Non-English
No
English abstract
-