Querying XML with locator semantics
Peter Fankhauser
GMD-IPSI
German National Research Center for Information Technology
Integrated Publication and Information Systems Institute
Dolivostraße 15, D-64293 Darmstadt, Germany
Abstract
The prevailing data model for semi-structured data is XML. XQL is a
query language for XML that supports flexible navigation and filtering
on possibly irregular document trees by means of simple path expressions.
I present an approach that extends XQL with support for joins and for more
expressive extraction. To this end, locators are introduced, which
identify document nodes by their paths.
On this basis, a locator algebra (or L'Algebra for short) can be defined,
which supports matching, combination, and restructuring of XML documents
without excessive generation of new nodes. L'Algebra preserves document
order and, by virtue of the locator semantics, also preserves regular
or irregular structure and navigation history.
I define the semantics of the main operators of L'Algebra, introduce its
axioms, and describe a mapping from XQL expressions to L'Algebra.
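
The idea of identifying nodes by paths can be illustrated with a minimal
sketch. It is not the definition used in L'Algebra: it assumes, purely for
illustration, that a locator is the tuple of child positions leading from
the root to a node. The function names locators and resolve and the sample
document are hypothetical.

```python
import xml.etree.ElementTree as ET


def locators(root, tag):
    """Collect index paths of all elements named `tag`, in document order.

    Assumption for this sketch: a locator is a tuple of child positions
    from the root, e.g. (1, 0) is the first child of the root's second child.
    """
    found = []

    def walk(node, path):
        if node.tag == tag:
            found.append(path)
        # Pre-order traversal visits nodes in document order.
        for i, child in enumerate(node):
            walk(child, path + (i,))

    walk(root, ())
    return found


def resolve(root, path):
    """Follow an index path from the root back to the node it identifies."""
    node = root
    for i in path:
        node = node[i]
    return node


# Hypothetical, slightly irregular sample document.
doc = ET.fromstring(
    "<lib><book><title>A</title></book>"
    "<book><note/><title>B</title></book></lib>"
)
title_paths = locators(doc, "title")
```

Under this assumption no new nodes are created: a query result is a list of
paths into the existing document, and comparing the tuples lexicographically
reproduces document order.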
Biography
Peter Fankhauser studied computer science at the Technical University
of Vienna. His PhD was on knowledge-based methods for the integration of
heterogeneous database schemas. Since 1989 he has been affiliated with the
Institute for Integrated Publication and Information Systems of
the GMD (German National Research Center for Information Technology)
in Darmstadt, Germany. He currently works on interoperability of
information systems.