You are in : Home » Results & submissions » Select UOA » 11 - Computer Science and Informatics » View submission: University of Sheffield » Outputs » Detail

Output details

11 - Computer Science and Informatics

University of Sheffield

Return to search Previous output Next output

Output 13 of 109 in the submission

Article title

Adapting SVM for data sparseness and imbalance: a case study in information extraction

Type

D - Journal article

DOI

10.1017/S1351324908004968

Title of journal

Natural Language Engineering

Article number

Volume number

Issue number

First page of article

241

ISSN of journal

14698110

Year of publication

2008

URL

http://dx.doi.org/10.1017/S1351324908004968

Number of additional authors

Additional information

<22> Supervised learning approaches are seriously hampered by unbalanced training data. This paper is the first to show how to apply the uneven margins SVM model to address this problem within NLP, where it is pervasive. The algorithm achieved the best reported results on two benchmark datasets for evaluation of ML algorithms for information extraction. The paper appears in a leading NLP journal and, together with a preliminary conference version (CONLL), has 75 citations in Google Scholar. An open source implementation is being used by South London and Maudsley NHS Trust (Robert Stewart <robert.stewart@kcl.ac.uk>) to extract information from clinical records.

Interdisciplinary

Cross-referral requested

Research group

None

Citation count

Proposed double-weighted

Double-weighted statement

Reserve for a double-weighted output

Non-English

English abstract