For the current REF see the REF 2021 website REF 2021 logo

Output details

11 - Computer Science and Informatics

University of Sheffield

Return to search Previous output Next output
Output 13 of 109 in the submission
Article title

Adapting SVM for data sparseness and imbalance: a case study in information extraction

Type
D - Journal article
Title of journal
Natural Language Engineering
Article number
-
Volume number
15
Issue number
02
First page of article
241
ISSN of journal
14698110
Year of publication
2008
Number of additional authors
2
Additional information

<22> Supervised learning approaches are seriously hampered by unbalanced training data. This paper is the first to show how to apply the uneven margins SVM model to address this problem within NLP, where it is pervasive. The algorithm achieved the best reported results on two benchmark datasets for evaluation of ML algorithms for information extraction. The paper appears in a leading NLP journal and, together with a preliminary conference version (CONLL), has 75 citations in Google Scholar. An open source implementation is being used by South London and Maudsley NHS Trust (Robert Stewart <robert.stewart@kcl.ac.uk>) to extract information from clinical records.

Interdisciplinary
-
Cross-referral requested
-
Research group
None
Citation count
14
Proposed double-weighted
No
Double-weighted statement
-
Reserve for a double-weighted output
No
Non-English
No
English abstract
-