Output details

11 - Computer Science and Informatics

University of St Andrews

Output title

The imagination of crowds: Conversational AAC language modeling using crowdsourcing and large data sources

Type

E - Conference contribution

DOI

Name of conference/published proceedings

Proceedings of the 2011 Conference on Empirical Methods in Natural Language Processing (EMNLP 2011)

Volume number

Issue number

First page of article

700

ISSN of proceedings

Year of publication

2011

URL

http://aclweb.org/anthology-new/D/D11/D11-1065.pdf

Number of additional authors

Additional information

EMNLP is a top NLP conference (acceptance rate: 23%). The paper introduces a novel methodology for collecting data for statistical language models. A long-standing problem in augmentative and alternative communication (AAC) is the lack of representative data. The paper shows that such data can be created through crowdsourcing, and that the resulting surrogate data predicts AAC-like text better than previously used texts. We expanded the surrogate data using cross-entropy difference selection on social media and show 5-11% keystroke savings, nearly an order of magnitude better than recent approaches. The paper was featured in New Scientist (February 26, 2012; pp. 24-25; http://www.newscientist.com/article/mg21328536.600-crowdsourcing-improves-predictivetexting.html).

Interdisciplinary

Cross-referral requested

Research group

B - Human-computer Interaction

Citation count

Proposed double-weighted

Double-weighted statement

Reserve for a double-weighted output

Non-English

English abstract