Output details
11 - Computer Science and Informatics
University of Brighton
An investigation into the validity of some metrics for automatically evaluating natural language generation systems
This paper presents an empirical investigation into the validity of corpus-based evaluation metrics, such as BLEU, for evaluating Natural Language Generation (NLG) systems. It has helped shape the NLG community’s perspective on the use of corpus-based evaluation metrics. Its experimental design for human-ratings-based evaluation of NLG systems has since been adapted and used by other NLG researchers, for example in the Generation Challenges series of NLG system competitions. Computational Linguistics is a leading journal in the field and ranks highly on international journal rankings, e.g. A* on the Australian ERA/CORE list.