Output details
11 - Computer Science and Informatics
University of Edinburgh
XML tree structure compression using RePair
<15> Originality: This paper introduces the most efficient grammar compressor for trees and shows it is an order of magnitude faster than previous compressors. This provides the smallest pointer-based in-memory representation for repetitive trees such as XML and provides a better time/space trade-off than other tree data structures.
Significance: The results have many applications: queries can be executed directly on the compressed structure thus giving speedup, the compressed structures can be used as XML synopses, and the compressor discovers repeating tree patterns which is useful for data mining.
Rigour: A new algorithm is presented together with an extensive experimental evaluation.