For the current REF see the REF 2021 website REF 2021 logo

Output details

11 - Computer Science and Informatics

University of Oxford

Return to search Previous output Next output
Output 0 of 0 in the submission
Article title

OXPath: A language for scalable data extraction, automation, and crawling on the deep web

Type
D - Journal article
Title of journal
VLDB Journal
Article number
-
Volume number
22
Issue number
1
First page of article
47
ISSN of journal
1066-8888
Year of publication
2013
Number of additional authors
4
Additional information

<16>

This paper introduces OXPath, the first wrapper extraction language with guaranteed constant memory for bounded-depth extractions. It gives the first hard memory guarantee for a wrapper language independent of the number of pages wrapped. OXPath also outperforms existing wrapper systems, including commercial ones, by often several magnitudes. This has been acknowledged by selection for the best paper issue of VLDB 2011, the top-level database. OXPath has also received the silver price in the Open Source Software World Challenge 2011 and there have been several tutorials at industry events. It has seen considerable uptake in academia and open-source projects.

Interdisciplinary
-
Cross-referral requested
-
Research group
None
Citation count
4
Proposed double-weighted
No
Double-weighted statement
-
Reserve for a double-weighted output
No
Non-English
No
English abstract
-