I will describe the Strudel system, which is the first system to apply concepts from database management systems to the problem of Web site management. In Strudel we separate three distinct tasks in building Web sites that are usually interdependent in current Web site management tools: (1) managing the data underlying the site, (2) the management of the structure of the site (i.e., specifying the data contained within each page and the links between pages), and (3) designing the graphical presentation of pages.
The key idea in Strudel is that the structure and the content of the Web site are specified declaratively in a high-level query language, StruQL. As a result, it is possible to easily restructure and modify Web sites, and to create multiple versions of a Web site from the same underlying data. The underlying declarative representation of the Web site also provides a platform for specifying and enforcing integrity constraints on sites, and for designing data warehouses for their support.
After describing Strudel and some experiences we had in using the system, I will argue the following: (1) Web-site management is an important application of semistructured data (2) StruQL, as a language for querying semistructured data has several advantages over related languages, and, most importantly, (3) Web-site management is an important field for database research.
The Strudel system was developed jointly with Mary Fernandez (AT&T Labs), Daniela Florescu (INRIA, France), Jaewoo Kang (Severa Inc.) and Dan Suciu (AT&T Labs).