Output details

11 - Computer Science and Informatics

University of Sheffield

Output 47 of 109 in the submission
Article title

GATECloud.net: a platform for large-scale, open-source text processing on the cloud

Type
D - Journal article
Title of journal
Philosophical Transactions of the Royal Society A: Mathematical, Physical and Engineering Sciences
Article number
20120071
Volume number
371
Issue number
1983
First page of article
20120071
ISSN of journal
1471-2962
Year of publication
2012
URL
-
Number of additional authors
3
Additional information

<15>GATE Cloud is the first user-extendable cloud-based text mining platform, employing parallel and distributed computation for Big Data text processing. This paper summarises results from a JISC/EPSRC project (EP/I034092/1) which won the best paper award at the UK eScience All Hands’ Meeting in 2011. Here we: discuss infrastructural facilities (load balancing, efficient data upload and storage, deployment to virtual machines, security, fault tolerance); quantify the scalability profile of the distributed computation; and evaluate the system in use at Public Health England (Amanda Semper <Amanda.Semper@phe.gov.uk>). In 2011-12 Cunningham was ANR Chaire d’Excellence at the Internet Memory Foundation, applying this work to multi-terabyte web crawls.

Interdisciplinary
-
Cross-referral requested
-
Research group
None
Citation count
1
Proposed double-weighted
No
Double-weighted statement
-
Reserve for a double-weighted output
No
Non-English
No
English abstract
-