SemanticTwig: a semantic approach to optimize XML query processing
conference contribution
posted on 2023-05-23, 09:01authored byBao, Z, Ling, TW, Lu, J, B Chen
Twig pattern matching (TPM) is the core operation of XML query processing. Existing approaches rely on either efficient data structures or novel labeling/indexing schemes to reduce the intermediate result size, but none of them takes into account the rich semantic information resided in XML document and the query issued. Moreover, in order to fulfill the semantics of the XPath/XQuery query, most of them require costly post processing to eliminate redundant matches and group matching results. In this paper, we propose an innovative semanticsaware query optimization approach to overcome these limitations. In particular, we exploit the functional dependency derived from the given semantic information to stop query processing early; we distinguish the output and predicate nodes of a query, then propose a query breakup technique and build a query plan, such that for each distinct query output, we avoid finding the redundant matches having the same results as the first match in most cases. Both I/O and structural join cost are saved, and much less intermediate results are produced. Experiments show the effectiveness of our optimization.
History
Publication title
Database Systems for Advanced Applications
Editors
JR Haritsa, R Kotagiri, V Pudi
Pagination
282-298
ISBN
978-3-540-78567-5
Department/School
School of Information and Communication Technology
Publisher
Springer-Verlag
Place of publication
Berlin, Germany
Event title
Proceedings of 13 International Conference on Database Systems for Advanced Applications
Event Venue
New Delhi, India
Date of Event (Start Date)
2008-03-19
Date of Event (End Date)
2008-03-21
Rights statement
Copyright 2008 Springer
Repository Status
Restricted
Socio-economic Objectives
Information systems, technologies and services not elsewhere classified