A data warehouse is a repository storing integrated information for efficient querying and analysis. Information is extracted from heterogeneous sources as it is generated or updated. The information is then translated into a common data model and integrated with exisiting data at the warehouse. When a user query is submitted to the warehouse, the needed information is already there, with inconsistencies and differences already resolved. This makes it much easier and more efficient to run queries over data that originally came from different sources. Key advantages of data warehousing include:
The goal of WHIPS (WareHouse Information Prototype at Stanford) is to develop algorithms and tools for the creation and maintenance of data warehouses. The WHIPS project is funded by Rome Laboratories and by the Advanced Research and Development Committee of the Community Management Staff as part of the Massive Digital Data Systems (MDDS) Program.