Semantics based buffer reduction for queries over XML data streams
conference contribution
posted on 2008-01-01, 00:00 authored by Chi Yang, Chengfei Liu, Jianxin Li, Jeffrey Xu Yu, Junhu Wang
Current methods for query evaluation over XML data streams cannot avoid some form of buffering. In many circumstances, the buffer size may grow exponentially, which can cause a memory bottleneck. Some optimization techniques have been proposed to address the problem. However, the limit of these techniques has been characterized by a concurrency lower bound, which has been theoretically proved. In this paper, we show through an empirical study that this lower bound can be broken by taking semantic information into account for buffer reduction. To demonstrate this, we build a SAX-based XML stream query evaluation system and design an algorithm whose buffer consumption is in line with the concurrency lower bound. After further analysis of the lower bound, we design several semantic rules for breaking it and incorporate these rules into the lower bound algorithm. Experiments show that the algorithms deploying semantic rules, individually and collectively, all significantly outperform the lower bound algorithm, which does not consider semantic information.
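To make the buffering problem in the abstract concrete, the following is a minimal sketch and not the authors' system or algorithm. It evaluates a hypothetical query, //a[b]/c, over SAX events using Python's standard xml.sax module. A c element that arrives before any b must be buffered, because the predicate [b] is still undecided. The flag b_precedes_c stands in for an assumed piece of semantic information (a schema guarantee that any b child comes before all c children); it is an illustrative rule chosen for this sketch, not one of the paper's rules. The handler also assumes that a elements are not nested and that c contains only text.

```python
import xml.sax


class PredicateBufferHandler(xml.sax.ContentHandler):
    """Evaluates the illustrative query //a[b]/c over a SAX event stream.

    Assumptions (not from the paper): `a` elements are not nested,
    `c` elements contain only text, and `b_precedes_c` models a
    hypothetical schema guarantee that `b` precedes every `c` in an `a`.
    """

    def __init__(self, b_precedes_c=False):
        super().__init__()
        self.b_precedes_c = b_precedes_c
        self.results = []      # text of c elements that satisfy the query
        self.peak_buffer = 0   # largest number of c values held at once
        self._in_a = False
        self._seen_b = False
        self._buffer = []      # candidate c values awaiting the predicate
        self._text = None      # text chunks of the current c, if inside one

    def startElement(self, name, attrs):
        if name == "a":
            self._in_a, self._seen_b, self._buffer = True, False, []
        elif self._in_a and name == "b":
            self._seen_b = True
            # Predicate is now known to hold: flush buffered candidates.
            self.results.extend(self._buffer)
            self._buffer = []
        elif self._in_a and name == "c":
            self._text = []

    def characters(self, content):
        if self._text is not None:
            self._text.append(content)

    def endElement(self, name):
        if name == "c" and self._text is not None:
            value = "".join(self._text)
            self._text = None
            if self._seen_b:
                self.results.append(value)   # predicate already true
            elif self.b_precedes_c:
                pass                         # no b can follow: discard now
            else:
                self._buffer.append(value)   # outcome unknown: must buffer
                self.peak_buffer = max(self.peak_buffer, len(self._buffer))
        elif name == "a":
            # Predicate never held for this a: drop remaining candidates.
            self._in_a, self._buffer = False, []


def run(xml_text, b_precedes_c=False):
    """Parse xml_text and return (results, peak_buffer)."""
    handler = PredicateBufferHandler(b_precedes_c)
    xml.sax.parseString(xml_text.encode("utf-8"), handler)
    return handler.results, handler.peak_buffer
```

Without the assumed guarantee, every c seen before a b has to be held until the enclosing a resolves; with it, each c can be emitted or discarded as soon as it closes, so nothing is buffered. The flag is only safe to set when the input really obeys the ordering constraint.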
History
Volume
75
Pagination
145-153
Location
Wollongong, N.S.W.
Start date
2007-12-03
End date
2007-12-04
ISBN-13
978-1-920682-56-9
Language
eng
Publication classification
E1.1 Full written paper - refereed
Copyright notice
2008, Australian Computer Society, Inc.
Editor/Contributor(s)
Fekete A, Lin X
Title of proceedings
ADC 2008 : Proceedings of the Nineteenth Australasian Database Conference 2008
Event
Australian Computer Society. Conference (19th : 2008 : Wollongong, N.S.W.)