Isn't this the same as ...?
|
At first glance, the notions of a WSMS and query optimization over web services might seem similar to what a lot of existing work in the database literature has already addressed. Here, we briefly explain what makes the web-services problem unique, and the differences from the major pieces of related work. For details, see our paper.
The problem of query optimization over web services could be considered a special case of the more general problems of parallel and distributed query optimization, each of which have been addressed extensively in the database literature. However, the key difference is that in a parallel / distributed database setting, one can get every node to process any desired data in any desired way, while in the web services setting, the functionality of a web service is already preset; we can merely choose what data to send it for processing. This limitation results in a considerably smaller search space for query plans in the web services scenario as compared to full-blown distributed or parallel query processing. Consequently, we are able to guarantee optimal plans while most of parallel / distributed query optimization is limited to heuristics. We are not aware of any work in the context of parallel or distributed query optimization that, when applied to the web services scenario, produces the optimal results. Each web service could be considered as an expensive predicate and a query over web services as a classical relational database query with expensive predicates (which can be optimized with known techniques). However, the execution model is substantially different: in a classical relational database, each predicate would be executed on the same machine, while in the web services context, each web service executes independently using its own resources. This leads to a fundamentally different cost model, and consequently an entirely different set of optimization techniques. WSQ/DSQ is a much earlier system, also from Stanford, that is based on an observation similar to what our optimization techniques are based on: that queries over web sources (in their case search engines) can be considerably sped up by making several calls in parallel. However WSQ/DSQ does not perform any cost-based optimization. |
Stanford InfoLab |
This page was last updated on 09/26/2005 by Utkarsh Srivastava |