Source Integration in the Information Manifold

Alon Levy



The Information Manifold (IM) system provides uniform access to multiple structured information sources on the World-Wide Web (e.g., entertainment sources, product information, name servers). As such, the system can answer complex queries that require combining information from multiple sources. The key aspect distinguishing IM from other information integration systems is the way we describe the contents and capabilities of information sources. In particular, the contents of information sources are described as views over a set of virtual world-view relations.
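
To make the idea of describing sources as views concrete, here is a minimal sketch in Python. It is not IM's actual representation or algorithm. The world-view relations (movie, playing), the source names, and the helper relevant_sources are all hypothetical, chosen only for illustration. Each source is described by the attributes it exports and by a body of atoms over the virtual world-view relations. The helper performs only a simplified relevance check (which sources mention the relation of each query subgoal), which would be one early step before any real rewriting of the query using views.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Atom:
    """One subgoal over a (hypothetical) world-view relation."""
    relation: str
    args: tuple


@dataclass(frozen=True)
class SourceDescription:
    """A source described as a view: exported attributes plus a body
    of atoms over the virtual world-view relations."""
    name: str
    head: tuple
    body: tuple


# Hypothetical source descriptions, invented for this example.
SOURCES = [
    SourceDescription(
        "src_movies", ("title", "year"),
        (Atom("movie", ("title", "director", "year")),),
    ),
    SourceDescription(
        "src_showtimes", ("title", "theater"),
        (Atom("playing", ("title", "theater")),),
    ),
    SourceDescription(
        "src_now_showing", ("title", "director", "theater"),
        (Atom("movie", ("title", "director", "year")),
         Atom("playing", ("title", "theater"))),
    ),
]


def relevant_sources(query_body, sources):
    """For each query subgoal, list the sources whose description
    mentions the same world-view relation. This is only a relevance
    check by relation name, not a full rewriting algorithm."""
    result = {}
    for subgoal in query_body:
        result[subgoal.relation] = [
            s.name for s in sources
            if any(a.relation == subgoal.relation for a in s.body)
        ]
    return result


# Query over the world view: which movies are playing, and where?
QUERY = (
    Atom("movie", ("t", "d", "y")),
    Atom("playing", ("t", "th")),
)

if __name__ == "__main__":
    for relation, names in relevant_sources(QUERY, SOURCES).items():
        print(relation, "->", names)
```

The user's query mentions only world-view relations. Deciding which sources to contact, and how to combine their answers, is left to the system, which is why adding a new source in this style only requires writing one more description.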

I will describe the architecture of IM and its advantages, and the algorithms we use to answer queries. Answering queries in IM requires developing practical algorithms for rewriting queries using views. I will also describe how IM represents and exploits capabilities of information sources and the issues that arise when some sources contain incomplete information.

Finally, I will outline several topics for future research in the area of information integration.