For the current REF see the REF 2021 website REF 2021 logo

Output details

11 - Computer Science and Informatics

University of Huddersfield

Return to search Previous output Next output
Output 0 of 0 in the submission
Article title

Improving the Performance of Focused Web Crawlers

Type
D - Journal article
Title of journal
Data and Knowledge Engineering
Article number
-
Volume number
68
Issue number
10
First page of article
1001
ISSN of journal
0169-023X
Year of publication
2009
Number of additional authors
2
Additional information

<16>This paper addresses the issue of building topical web crawlers with high performance. Several methods based on both web page contents and link structure and search heuristics were tasted and evaluated over different topics. Results indicated that combining web link information with both page content and link anchor text achieved the best overall results. Using anchor text to differentiate priorities of different links into the same page also increased the overall performance significaly. This work has significantly influenced the work of other research groups, e.g. evidenced by the number of citations on Google Scholar (47 as of 26/9/13).

Interdisciplinary
-
Cross-referral requested
-
Research group
None
Citation count
22
Proposed double-weighted
No
Double-weighted statement
-
Reserve for a double-weighted output
No
Non-English
No
English abstract
-