VITA
Jennifer Widom


Current Position

Departments of Computer Science and Electrical Engineering, Stanford University
Professor, 2004-present
Associate Professor, 1996-2004
Assistant Professor, 1993-1996

Recent areas of research: Data uncertainty and lineage; query processing on data streams; data caching and replication; combining databases and the Web; database systems for semistructured data and XML; data transformations and warehousing; temporal database systems

Recent areas of teaching: Introduction to databases; database system implementation


Education

  • Ph.D. in Computer Science; Cornell University, 1987
  • M.S. in Computer Science; Cornell University, 1985
  • B.S. in Music with minors in Mathematics and Computer Science; Indiana University

Previous Positions

  • Research Staff Member, Computer Science Department, IBM Almaden Research Center; 1988-1993
  • Visiting Assistant Professor, Computer Science Department, Cornell University; 1987-1988
  • Summer Research Intern, Xerox Palo Alto Research Center; 1984, 1985

Honors and Fellowships

  • Member, National Academy of Engineering, class of 2005
  • ACM Fellow, conferred 2005
  • Stanford University School of Engineering 3Com Corporation Faculty Development Scholar, 2003-2006
  • Guggenheim Foundation Fellow, 2000-2001
  • Stanford University Terman Fellow, 1996-1998
  • Stanford University School of Engineering Anderson Faculty Scholar, 1993-1994

Awards

  • ACM SIGMOD Edgar F. Codd Innovations Award, 2007

  • "Test of Time" Paper Award, 2005 ACM SIGMOD International Conference on Management of Data (for View Maintenance in a Warehousing Environment, with Y. Zhuge, H. Garcia-Molina, and J. Hammer)

  • Best Paper Award, Twelfth International World Wide Web Conference, 2003 (for Scaling Personalized Web Search, with G. Jeh)

  • 10-Year Paper Award, Twenty-Sixth International Conference on Very Large Data Bases, 2000 (for Deriving Production Rules for Constraint Maintenance, with S. Ceri)

  • "Test of Time" Paper Award, 2000 ACM SIGMOD International Conference on Management of Data (for Set-Oriented Production Rules in Relational Database Systems, with S. Finkelstein)

  • IBM Research Division Award for Extensible Database Technology, 1992

  • Best Paper Award, Seventeenth International Conference on Very Large Data Bases, 1991 (for Deriving Production Rules for Incremental View Maintenance, with S. Ceri)

Professional Activities

Board of Trustees
  • VLDB Endowment, 1998-2003 (executive board 2000-2003)

Advisory Board Member

  • Abrevity Inc., 2006-present
  • Ingrian Networks Inc., 2004-present
  • Celequest Inc., 2003-2007
  • Kaltix Inc., 2003
  • Business Signatures Inc., 2002-2006
  • CrossGain Inc., 2000-2001
  • WhizBang! Labs Inc., 1999-2002
  • Angara Inc., 1997-2001
  • Brookhaven National Laboratory Protein Data Bank, 1997-1999

Editor

  • ACM Journal of Data and Information Quality; advisory board, 2006-present
  • Springer Encyclopedia of Database Systems; advisory board, 2006-present
  • ACM Transactions on Database Systems; associate editor 2003-2006
  • Springer book series on Data-Centric Systems and Applications; editorial board 2002-present
  • ACM SIGMOD Digital Review; editorial board 1999-2000
  • Kluwer Journal on Distributed and Parallel Databases; editorial board 1998-present
  • Springer-Verlag VLDB Journal; editorial board 1995-2001
  • ACM SIGMOD Record; editor-in-chief 1995-1997
  • Kluwer Journal on Intelligent Information Systems; guest editor 1995
  • IEEE Transactions on Knowledge and Data Engineering; editor 1994-1999
  • IEEE Data Engineering Bulletin; associate editor 1994-1995

Program Committee Chair

  • 2005 ACM SIGMOD International Conference on Management of Data
  • Twenty-Fourth International Conference on Very Large Data Bases (VLDB '98)
  • Fourth IEEE International Workshop on Research Issues in Data Engineering (1994)

Program Committee Member

  • Third Biennial Conference on Innovative Data Systems Research (CIDR '07)
  • Twenty-Third ACM Symposium on Principles of Database Systems (PODS '04)
  • 2003 Workshop on Management and Processing of Data Streams
  • 2003 ACM SIGMOD International Conference on Management of Data
  • Twenty-Eighth International Conference on Very Large Data Bases (VLDB '02)
  • 2001 Workshop on Internet and Databases
  • Twentieth ACM Symposium on Principles of Database Systems (PODS '01)
  • Twenty-Sixth International Conference on Very Large Data Bases (VLDB '00)
  • Second International Workshop on the Web and Databases (WebDB '99)
  • Eighth International World Wide Web Conference (1999)
  • 1999 Workshop on Query Processing for Semistructured Data and Non-Standard Data Formats
  • 1998 ACM SIGMOD International Conference on Management of Data
  • 1996 International Workshop on Logic in Databases
  • 1996 Workshop on Materialized Views: Techniques and Applications
  • 1996 International Conference on Extending Database Technology (EDBT '96)
  • Second International Workshop on Rules in Database Systems (1995)
  • Twenty-First International Conference on Very Large Data Bases (VLDB '95)
  • Fourteenth ACM Symposium on Principles of Database Systems (PODS '95)
  • Eleventh IEEE International Conference on Data Engineering (1995)
  • Third International Conference on Parallel and Distributed Information Systems (PDIS '94)
  • Twentieth International Conference on Very Large Data Bases (VLDB '94)
  • Second Workshop on Principles and Practice of Constraint Programming (1994)
  • 1994 International Conference on Extending Database Technology (EDBT '94)
  • 1993 ACM SIGMOD International Conference on Management of Data
  • International Working Conference on Cooperating Knowledge Base Systems (1990)

Conference Organizer

  • 2009 ACM SIGMOD International Conference on Management of Data; panel selection chair
  • 2006 ACM SIGMOD International Conference on Management of Data; tutorial selection co-chair
  • Biennial Conferences on Innovative Data Systems Research (CIDR '03 and '05); organizing committee
  • 1996 ACM SIGMOD International Conference on Management of Data; industrial session chair
  • 1994 ACM SIGMOD International Conference on Management of Data; panel selection chair

Invited Keynote Speaker

  • 2007 ACM SIGMOD International Conference on Management of Data
  • Second Biennial Conference on Innovative Data Systems Research (CIDR '05)
  • Ninth International Workshop on Database Programming Languages (2003)
  • Twenty-Sixth International Conference on Very Large Data Bases (VLDB '00)
  • Third IFCIS Conference on Cooperative Information Systems (1998)
  • 1996 International Conference on Extending Database Technology (EDBT '96)
  • Fourth International Conference on Information and Knowledge Management (1995)
  • Fifth Intl. Workshop on the Deductive Approach to Information Systems and Databases (1994)

Tutorialist

  • Tenth IEEE International Conference on Data Engineering (1994)
  • 1993 Lausanne Course on Advanced Database Systems
  • 1992 ACM SIGMOD International Conference on Management of Data
  • 1992 International Conference on Extending Database Technology (EDBT '92)
  • 1992 IBM La Hulpe Database Technology Symposium
  • 1991 EDBT Summer School on Advances in Database Technology

Students

Current Ph.D. students at Stanford University
  • Parag Agrawal
  • Anish Das Sarma
  • Raghotham Murthy

Graduated Ph.D. students

  • Utkarsh Srivastava, Stanford University, 2006
  • Arvind Arasu, Stanford University, 2006
  • Shivnath Babu, Stanford University, 2005
  • Chris Olston, Stanford University, 2003
  • Yingwei Cui, Stanford University, 2001
  • Jun Yang, Stanford University, 2001
  • Roy Goldman, Stanford University, 2000
       (winner of Stanford Computer Science Dept. Arthur Samuel Thesis Award)
  • Jason McHugh, Stanford University, 2000
  • Dallan Quass, Stanford University, 1997
  • Ashish Gupta, Stanford University, 1994
  • Elena Baralis, Politecnico di Torino (Italy), 1994

Member of Ph.D. thesis committee

  • Computer Science, Civil Engineering, Electrical Engineering, and Medical Informatics students at Stanford University

  • Computer Science students at University of California at Berkeley, Columbia University, University of Maryland, Oregon Graduate Institute, University of Twente (The Netherlands), University of Waterloo

Primary Research Grants

  • Uncertain Information Integration. U.S. Office of Research and Development, 2007, total funding approx. $150,000. Principal Investigator.

  • Information Management Research. Hewlett-Packard Corporation, 2006-2009, total funding approx. $600,000. Principal Investigator.

  • Next-Generation Issues in Data Stream Management Systems. National Science Foundation, 2006-2009, total funding approx. $960,000. Principal Investigator.

  • Intelligent Information Integration and Aggregation. Boeing Corporation, 2005-2009, total funding approx. $860,000. Co-Principal Investigator (with H. Garcia-Molina).

  • DataMotion - Dealing with Fast-Moving Data. National Science Foundation Information Technology Research (ITR), 2003-2008, total funding approx. $2,050,000. Co-Principal Investigator (with H. Garcia-Molina and R. Motwani).

  • Management and Processing of Data Streams. National Science Foundation, 2001-2004, total funding approx. $445,000. Principal Investigator.

  • From the Web to the Global InfoBase. National Science Foundation Information Technology Research (ITR), 2000-2003, total funding approx. $3,250,000. Co-Principal Investigator (with H. Garcia-Molina, C. Manning, and J.D. Ullman).

  • Managing Semistructured Data. National Science Foundation, 1998-2001, total funding approx. $235,000. Principal Investigator.

  • A Warehousing System for Information Integration and Change Management. Department of the Air Force, 1997-1999, total funding approx. $500,000. Principal Investigator.

  • Data Management for Wireless Networks. National Science Foundation, 1996-1998, total funding approx. $360,000. Principal Investigator.

  • Changes, Consistency and Configurations in Heterogeneous, Distributed Systems. Defense Advanced Research Projects Agency (DARPA), 1995-1998, total funding approx. $825,000. Principal Investigator.

  • A Warehousing Approach to Data and Knowledge Integration. CIA Office of Research and Development, 1995-1998, total funding approx. $1,000,000. Co-Principal Investigator (with H. Garcia-Molina and J.D. Ullman).

  • Efficient Management of Active Databases. Army Research Office, 1995-1998, total funding approx. $225,000. Co-Principal Investigator (with J.D. Ullman).

  • Data Management for Wireless Networks. Stanford Center for Telecommunications and Center for Integrated Systems, 1995-1996, total funding approx. $150,000. Principal Investigator.

  • An Integrated Information Management System. Defense Advanced Research Projects Agency (DARPA), 1994-1997, total funding approx. $2,000,000. Co-Principal Investigator (with H. Garcia-Molina and J.D. Ullman).

  • A Warehousing Approach to Data and Knowledge Integration. Department of the Air Force, 1994-1996, total funding approx. $200,000. Principal Investigator.

Publications

Books

  1. A First Course in Database Systems. Prentice Hall, Upper Saddle River, New Jersey, first edition 1997, second edition 2002, third edition 2007 (with J.D. Ullman). Translations: Chinese, Hungarian, Italian, Korean, Polish, Spanish

  2. Database Systems - The Complete Book. Prentice Hall, Upper Saddle River, New Jersey, 2002 (with H. Garcia-Molina and J.D. Ullman). Translations: Polish, Russian

  3. Database System Implementation. Prentice Hall, Upper Saddle River, New Jersey, 2000 (with H. Garcia-Molina and J.D. Ullman). Translations: Chinese

  4. Active Database Systems: Triggers and Rules for Advanced Database Processing. Morgan Kaufmann, San Francisco, California, 1996 (with S. Ceri).

Book Chapters

  1. STREAM: The Stanford Data Stream Management System. In M. Garofalakis, J. Gehrke, and R. Rastogi, editors, Data Stream Management: Processing High-Speed Data Streams, Springer, 2006 (with A. Arasu, B. Babcock, S. Babu, J. Cieslewicz, M. Datar, K. Ito, R. Motwani, and U. Srivastava).

  2. Rule Processing in Active Database Systems. In L. Delcambre and F. Petry, editors, Advances in Databases and Artificial Intelligence, JAI Press, 1995 (with E.N. Hanson).

  3. Active Database Systems. In W. Kim, editor, Modern Database Systems: The Object Model, Interoperability, and Beyond, Addison-Wesley, Reading, Massachusetts, 1994 (with U. Dayal and E.N. Hanson).

Refereed Journal Articles

  1. Swoosh: A Generic Approach to Entity Resolution. To appear in Springer VLDB Journal, 2008 (with O. Benjelloun, H. Garcia-Molina, D. Menestrina, Q. Su, and S.E. Whang).

  2. Databases with Uncertainty and Lineage. To appear in Springer VLDB Journal, 2008 (with O. Benjelloun, A. Das Sarma, M. Theobald, and A. Halevy).

  3. The CQL Continuous Query Language: Semantic Foundations and Query Execution. Springer VLDB Journal, 15(2):121-142, June 2006 (with A. Arasu and S. Babu).

  4. Exploiting k-Constraints to Reduce Memory Overhead in Continuous Queries over Data Streams. ACM Transactions on Database Systems, 29(3):545-580, September 2004 (with S. Babu and U. Srivastava).

  5. Characterizing Memory Requirements for Queries over Continuous Data Streams. ACM Transactions on Database Systems, 29(1):162-194, March 2004 (with A. Arasu, B. Babcock, S. Babu, and J. McAlister).

  6. Incremental Computation and Maintenance of Temporal Aggregates. Springer-Verlag VLDB Journal, 12(3):262-283, October 2003 (with J. Yang).

  7. Lineage Tracing for General Data Warehouse Transformations. Springer-Verlag VLDB Journal, 12(1):41-58, May 2003 (with Y. Cui).

  8. Computing the Median with Uncertainty. SIAM Journal on Computing, 32(2):538-547, March 2003 (with T. Feder, R. Motwani, R. Panigrahy, and C. Olston).

  9. Exploiting Hierarchical Domain Structure to Compute Similarity. ACM Transactions on Information Systems, 21(1):64-93, January 2003 (with P. Ganesan and H. Garcia-Molina).

  10. Better Static Rule Analysis for Active Database Systems. ACM Transactions on Database Systems, 25(3):269-332, September 2000 (with E. Baralis).

  11. Tracing the Lineage of View Data in a Warehousing Environment. ACM Transactions on Database Systems, 25(2):179-227, June 2000 (with Y. Cui and J.L. Wiener).

  12. From Semistructured Data to XML: Migrating the Lore Data Model and Query Language. Markup Languages: Theory & Practice, 2(2), 2000 (with R. Goldman and J. McHugh).

  13. Managing Historical Semistructured Data. Theory and Practice of Object Systems, 5(3):143-162, 1999 (with S. Chawathe and S. Abiteboul).

  14. A Location Management Technique to Support Lifelong Numbering in Personal Communications Services. ACM Mobile Computing and Communications Review, 2(1):27-35, January 1998 (with D. Lam, Y. Cui, and D.C. Cox).

  15. Protocols for Integrity Constraint Checking in Federated Databases. International Journal of Distributed and Parallel Databases, 5(4):327-355, October 1997 (with P. Grefen).

  16. Efficient and Flexible Location Management Techniques for Wireless Communication Systems. ACM/Baltzer Journal of Wireless Networks, 3(5):361-374, October 1997 (with J. Jannink, D. Lam, N. Shivakumar, and D.C. Cox).

  17. Querying Semistructured Heterogeneous Information. Journal of Systems Integration, 7(3/4):381-407, September 1997 (with D. Quass, A. Rajaraman, J.D. Ullman, and Y. Sagiv).

  18. Per-User Profile Replication in Mobile Environments: Algorithms, Analysis, and Simulation Results. ACM/Baltzer Journal of Mobile Networks and Applications, 2(2):129-140, September 1997 (with N. Shivakumar and J. Jannink).

  19. The Lorel Query Language for Semistructured Data. International Journal on Digital Libraries, 1(1):68-88, April 1997 (with S. Abiteboul, D. Quass, J. McHugh, and J.L. Wiener).

  20. The TSIMMIS Approach to Mediation: Data Models and Languages. Journal of Intelligent Information Systems, 8(2):117-132, March 1997 (with H. Garcia-Molina, Y. Papakonstantinou, D. Quass, A. Rajaraman, Y. Sagiv, and J.D. Ullman).

  21. Teletraffic Modeling for Personal Communications Services. IEEE Communications, 35(2):79-87, February 1997 (with D. Lam and D.C. Cox).

  22. The Starburst Active Database Rule System. IEEE Transactions on Knowledge and Data Engineering, 8(4):583-595, August 1996.

  23. Static Analysis Techniques for Predicting the Behavior of Active Database Rules. ACM Transactions on Database Systems, 20(1):3-41, March 1995 (with A. Aiken and J.M. Hellerstein).

  24. Deriving Incremental Production Rules for Deductive Data. Information Systems, 19(6):467-490, 1994 (with S. Ceri).

  25. An Overview of Production Rules in Database Systems. The Knowledge Engineering Review, 8(2):121-143, June 1993 (with E.N. Hanson).

  26. Rule Processing in Active Database Systems. International Journal of Expert Systems, 6(1):83-119, 1993 (with E.N. Hanson).

  27. Trace-Based Network Proof Systems: Expressiveness and Completeness. ACM Transactions on Programming Languages and Systems, 14(3):396-416, July 1992 (with D. Gries and F.B. Schneider).

  28. Whiteboards: A Graphical Database Tool. ACM Transactions on Office Information Systems, 4(1):24-41, January 1986 (with J. Donahue).

Invited or Unrefereed Journal Articles

  1. Generic Entity Resolution in the SERF Project. IEEE Data Engineering Bulletin, Special Issue on Data Quality, 29(2):13-20, June 2006 (with O. Benjelloun, H. Garcia-Molina, H. Kawai, T.E. Larson, D. Menestrina, Q. Su, and S. Thavisomboon).

  2. An Introduction to ULDBs and the Trio System. IEEE Data Engineering Bulletin, Special Issue on Probabilistic Databases, 29(1):5-16, March 2006 (with O. Benjelloun, A. Das Sarma, and C. Hayworth).

  3. Monitoring and Querying of Distributed, Dynamic Data via Approximate Replication. IEEE Data Engineering Bulletin, Special Issue on In-Network Query Processing, 28(1):11-18, March 2005 (with C. Olston).

  4. A Denotational Semantics for Continuous Queries over Streams and Relations. ACM SIGMOD Record, 33(3):6-12, September 2004 (with A. Arasu).

  5. STREAM: The Stanford Stream Data Manager. IEEE Data Engineering Bulletin, Special Issue on Data Stream Processing, 26(1):19-26, March 2003 (with A. Arasu, B. Babcock, S. Babu, J. Cieslewicz, M. Datar, K. Ito, R. Motwani, and U. Srivastava).

  6. Continuous Queries over Data Streams. ACM SIGMOD Record, 30(3):109-120, September 2001 (with S. Babu).

  7. Lore: A Database Management System for XML. Dr. Dobb's Journal, 25(4):76-80, April 2000 (with J. McHugh and R. Goldman).

  8. Data Management for XML: Research Directions. IEEE Data Engineering Bulletin, Special Issue on XML, 22(3):44-52, September 1999.

  9. Integrating Dynamically-Fetched External Information into a DBMS for Semistructured Data. ACM SIGMOD Record, 26(4):24-31, December 1997 (with J. McHugh).

  10. Lore: A Database Management System for Semistructured Data. ACM SIGMOD Record, 26(3):54-66, September 1997 (with J. McHugh, S. Abiteboul, R. Goldman, and D. Quass).

  11. Integrating Heterogeneous Databases: Lazy or Eager? ACM Computing Surveys, 28A(4), December 1996.

  12. The Stanford Data Warehousing Project. IEEE Data Engineering Bulletin, Special Issue on Materialized Views and Data Warehousing, 18(2):41-48, June 1995 (with J. Hammer, H. Garcia-Molina, W.J. Labio, and Y. Zhuge).

  13. Flexible Constraint Management for Autonomous Distributed Databases. IEEE Data Engineering Bulletin, Special Issue on Database Constraint Management, 17(2):23-27, June 1994 (with S. Chawathe and H. Garcia-Molina).

  14. The Starburst Rule System: Language Design, Implementation, and Applications. IEEE Data Engineering Bulletin, Special Issue on Active Databases, 15(4):15-18, December 1992.

  15. A Denotational Semantics for the Starburst Production Rule Language. ACM SIGMOD Record, 21(3):4-9, September 1992.

  16. A Syntax and Semantics for Set-Oriented Production Rules in Relational Database Systems (Extended Abstract). ACM SIGMOD Record, Special Issue on Rule Management and Processing in Expert Database Systems, 18(3):36-45, September 1989 (with S.J. Finkelstein).

Refereed Conferences and Workshops

  1. Exploiting Lineage for Confidence Computation in Uncertain and Probabilistic Databases. To appear in Proceedings of the Twenty-Fourth International Conference on Data Engineering, Cancun, Mexico, April 2008 (with A. Das Sarma and M. Theobald).

  2. Making Aggregation Work in Uncertain and Probabilistic Databases. Proceedings of the Workshop on Management of Uncertain Data, pages 76-90, Vienna, Austria, September 2007 (with R. Murthy).

  3. Optimization of Continuous Queries with Shared Expensive Filters. Proceedings of the Twenty-Sixth ACM SIGMOD-SIGACT-SIGART Symposium on Principles of Database Systems, Beijing, China, June 2007 (with K. Munagala and U. Srivastava).

  4. ULDBs: Databases with Uncertainty and Lineage. Proceedings of the Thirty-Second International Conference on Very Large Data Bases, pages 953-964, Seoul, Korea, September 2006 (with O. Benjelloun, A. Das Sarma, and A. Halevy).

  5. Query Optimization over Web Services. Proceedings of the Thirty-Second International Conference on Very Large Data Bases, pages 355-366, Seoul, Korea, September 2006 (with U. Srivastava, K. Munagala, and R. Motwani).

  6. Estimating Data Stream Quality for Object-Detection Applications. Proceedings of the Third International ACM SIGMOD Workshop on Information Quality in Information Systems, Chicago, Illinois, June 2006 (with A. Das Sarma, S.R. Jeffery, and M.J. Franklin).

  7. Declarative Support for Sensor Data Cleaning. Proceedings of the Fourth International Conference on Pervasive Computing, Lecture Notes in Computer Science 3968, pages 83-100, Springer, Berlin, May 2006 (with S.R. Jeffery, G. Alonso, M.J. Franklin, and W. Hong).

  8. Working Models for Uncertain Data. Proceedings of the Twenty-Second International Conference on Data Engineering, Atlanta, Georgia, April 2006 (with A. Das Sarma, O. Benjelloun, and A. Halevy).

  9. A Pipelined Framework for Online Cleaning of Sensor Data Streams. Proceedings of the Twenty-Second International Conference on Data Engineering (short paper), Atlanta, Georgia, April 2006 (with S.R. Jeffery, G. Alonso, M.J. Franklin, and W. Hong).

  10. Content-Based Routing: Different Plans for Different Data. Proceedings of the Thirty-First International Conference on Very Large Data Bases, Trondheim, Norway, pages 757-768, September 2005 (with P. Bizarro, S. Babu, and D. DeWitt).

  11. Indexing Relational Database Content Offline for Efficient Keyword-Based Search. Proceedings of the Ninth International Database Engineering and Applications Symposium, pages 297-306, Montreal, Canada, July 2005 (with Q. Su).

  12. Operator Placement for In-Network Stream Query Processing. Proceedings of the Twenty-Fourth ACM SIGMOD-SIGACT-SIGART Symposium on Principles of Database Systems, pages 250-258, Baltimore, Maryland, June 2005 (with U. Srivastava and K. Munagala).

  13. Adaptive Caching for Continuous Queries. Proceedings of the Twenty-First International Conference on Data Engineering, pages 118-129, Tokyo, Japan, April 2005 (with S. Babu, K. Munagala, and R. Motwani).

  14. Trio: A System for Integrated Management of Data, Accuracy, and Lineage. Proceedings of the Second Biennial Conference on Innovative Data Systems Research (CIDR '05), Pacific Grove, California, January 2005.

  15. The Pipelined Set Cover Problem. Proceedings of the Tenth International Conference on Database Theory, Lecture Notes in Computer Science 3363, pages 83-98, Springer, Berlin, January 2005 (with K. Munagala, S. Babu, and R. Motwani).

  16. Mining the Space of Graph Properties. Proceedings of the Tenth ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, pages 187-196, Seattle, Washington, August 2004 (with G. Jeh).

  17. Memory-Limited Execution of Windowed Stream Joins. Proceedings of the Thirtieth International Conference on Very Large Data Bases, pages 324-335, Toronto, Canada, August 2004 (with U. Srivastava).

  18. Resource Sharing in Continuous Sliding-Window Aggregates. Proceedings of the Thirtieth International Conference on Very Large Data Bases, pages 336-347, Toronto, Canada, August 2004 (with A. Arasu).

  19. Enabling Privacy for the Paranoids. Proceedings of the Thirtieth International Conference on Very Large Data Bases, pages 708-719, Toronto, Canada, August 2004 (with G. Aggarwal, M. Bawa, P. Ganesan, H. Garcia-Molina, K. Kenthapadi, N. Mishra, R. Motwani, U. Srivastava, and D. Thomas).

  20. Flexible Time Management in Data Stream Systems. Proceedings of the Twenty-Third ACM SIGMOD-SIGACT-SIGART Symposium on Principles of Database Systems, pages 263-274, Paris, France, June 2004 (with U. Srivastava).

  21. Adaptive Ordering of Pipelined Stream Filters. Proceedings of the ACM SIGMOD International Conference on Management of Data, pages 407-418, Paris, France, June 2004 (with S. Babu, R. Motwani, K. Munagala, and I. Nishizawa).

  22. Monitoring Stream Properties for Continuous Query Processing. Proceedings of the Workshop on Management and Processing of Data Streams, San Diego, California, June 2003 (with U. Srivastava and S. Babu).

  23. Adaptive Filters for Continuous Queries over Distributed Data Streams. Proceedings of the ACM SIGMOD International Conference on Management of Data, pages 563-574, San Diego, California, June 2003 (with C. Olston and J. Jiang).

  24. Scaling Personalized Web Search. Proceedings of the Twelfth International World Wide Web Conference (WWW 2003), pages 271-279, Budapest, Hungary, May 2003 (with G. Jeh).

  25. Query Processing, Resource Management, and Approximation in a Data Stream Management System. Proceedings of the First Biennial Conference on Innovative Data Systems Research (CIDR '03), pages 245-256, Pacific Grove, California, January 2003 (with R. Motwani, A. Arasu, B. Babcock, S. Babu, M. Datar, G. Manku, C. Olston, J. Rosenstein, and R. Varma).

  26. SimRank: A Measure of Structural-Context Similarity. Proceedings of the Eighth ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, pages 538-543, Edmonton, Canada, July 2002 (with G. Jeh).

  27. Characterizing Memory Requirements for Queries over Continuous Data Streams. Proceedings of the Twenty-First ACM SIGMOD-SIGACT-SIGART Symposium on Principles of Database Systems, pages 221-232, Madison, Wisconsin, June 2002 (with A. Arasu, B. Babcock, S. Babu, and J. McAlister).

  28. Best-Effort Cache Synchronization with Source Cooperation. Proceedings of the ACM SIGMOD International Conference on Management of Data, pages 73-84, Madison, Wisconsin, June 2002 (with C. Olston).

  29. Lineage Tracing for General Data Warehouse Transformations. Proceedings of the Twenty-Seventh International Conference on Very Large Data Bases, pages 471-480, Rome, Italy, September 2001 (with Y. Cui).

  30. A Data Stream Management System for Network Traffic Management. Proceedings of the Workshop on Network-Related Data Management, Santa Barbara, California, May 2001 (with S. Babu and L. Subramanian).

  31. Adaptive Precision Setting for Cached Approximate Values. Proceedings of the ACM SIGMOD International Conference on Management of Data, Santa Barbara, California, pages 355-366, May 2001 (with C. Olston and B.T. Loo).

  32. Incremental Computation and Maintenance of Temporal Aggregates. Proceedings of the Seventeenth International Conference on Data Engineering, pages 51-60, Heidelberg, Germany, April 2001 (with J. Yang).

  33. Offering a Precision-Performance Tradeoff for Aggregation Queries over Replicated Data. Proceedings of the Twenty-Sixth International Conference on Very Large Data Bases, pages 144-155, Cairo, Egypt, September 2000 (with C. Olston).

  34. Performance Issues in Incremental Warehouse Maintenance. Proceedings of the Twenty-Sixth International Conference on Very Large Data Bases, pages 461-472, Cairo, Egypt, September 2000 (with W.J. Labio, J. Yang, Y. Cui, and H. Garcia-Molina).

  35. Storing Auxiliary Data for Efficient Maintenance and Lineage Tracing of Complex Views. Proceedings of the Second International Workshop on Design and Management of Data Warehouses (DMDW 2000), Stockholm, Sweden, June 2000 (with Y. Cui).

  36. WSQ/DSQ: A Practical Approach for Combined Querying of Databases and the Web. Proceedings of the ACM SIGMOD International Conference on Management of Data, pages 285-296, Dallas, Texas, May 2000 (with R. Goldman).

  37. Computing the Median with Uncertainty. Proceedings of the Thirty-Second Annual ACM Symposium on Theory of Computing, pages 602-607, Portland, Oregon, May 2000 (with T. Feder, R. Motwani, R. Panigrahy, and C. Olston).

  38. Temporal View Self-Maintenance in a Warehousing Environment. Proceedings of the Seventh International Conference on Extending Database Technology (EDBT 2000), pages 395-412, Konstanz, Germany, March 2000 (with J. Yang).

  39. Practical Lineage Tracing in Data Warehouses. Proceedings of the Sixteenth International Conference on Data Engineering, pages 367-378, San Diego, California, February 2000 (with Y. Cui).

  40. Ozone: Integrating Structured and Semistructured Data. Proceedings of the Seventh International Workshop on Database Programming Languages, Kinloch Rannoch, Scotland, September 1999 (with T. Lahiri and S. Abiteboul).

  41. Query Optimization for XML. Proceedings of the Twenty-Fifth International Conference on Very Large Data Bases, Edinburgh, Scotland, pages 315-326, September 1999 (with J. McHugh).

  42. From Semistructured Data to XML: Migrating the Lore Data Model and Query Language. Proceedings of the 2nd International Workshop on the Web and Databases (WebDB '99), pages 25-30, Philadelphia, Pennsylvania, June 1999 (with R. Goldman and J. McHugh).

  43. Approximate DataGuides. Proceedings of the Workshop on Query Processing for Semistructured Data and Non-Standard Data Formats, Jerusalem, Israel, January 1999 (with R. Goldman).

  44. Compile-Time Path Expansion in Lore. Proceedings of the Workshop on Query Processing for Semistructured Data and Non-Standard Data Formats, Jerusalem, Israel, January 1999 (with J. McHugh).

  45. Interactive Query and Search in Semistructured Databases. Proceedings of the First International Workshop on the Web and Databases (WebDB '98), Lecture Notes in Computer Science 1590, pages 52-62, Springer-Verlag, Berlin, March 1998 (with R. Goldman).

  46. Maintaining Temporal Views Over Non-Historical Information Sources for Data Warehousing. Proceedings of the Sixth International Conference on Extending Database Technology (EDBT '98), pages 389-403, Valencia, Spain, March 1998 (with J. Yang).

  47. Efficient PCS Call Setup Protocols. Proceedings of the Seventeenth Annual IEEE Joint Conference on Computer Communications (Infocom '98), pages 728-736, San Francisco, California, March 1998 (with Y. Cui, D. Lam, and D.C. Cox).

  48. Representing and Querying Changes in Semistructured Data. Proceedings of the Fourteenth International Conference on Data Engineering, pages 4-13, Orlando, Florida, February 1998 (with S. Chawathe and S. Abiteboul).

  49. A Location Management Technique to Support Lifelong Numbering in Personal Communications Services. Proceedings of the 1997 IEEE Global Telecommunications Conference (Globecom '97), pages 704-710, Phoenix, Arizona, November 1997 (with D. Lam, Y. Cui, and D.C. Cox).

  50. DataGuides: Enabling Query Formulation and Optimization in Semistructured Databases. Proceedings of the Twenty-Third International Conference on Very Large Data Bases, pages 436-445, Athens, Greece, August 1997 (with R. Goldman).

  51. Integrating Dynamically-Fetched External Information into a DBMS for Semistructured Data. Proceedings of the Workshop on Management of Semistructured Data, pages 75-82, Tucson, Arizona, May 1997 (with J. McHugh).

  52. On-Line Warehouse View Maintenance. Proceedings of the ACM SIGMOD International Conference on Management of Data, pages 393-404, Tucson, Arizona, May 1997 (with D. Quass).

  53. The STRIP Rule System for Efficiently Maintaining Derived Data. Proceedings of the ACM SIGMOD International Conference on Management of Data, pages 147-158, Tucson, Arizona, May 1997 (with B. Adelberg and H. Garcia-Molina).

  54. Clustering Association Rules. Proceedings of the Thirteenth International Conference on Data Engineering, pages 220-231, Birmingham, UK, April 1997 (with B. Lent and A. Swami).

  55. Making Views Self-Maintainable for Data Warehousing. Proceedings of the Fourth International Conference on Parallel and Distributed Information Systems (PDIS '96), pages 158-169, Miami Beach, Florida, December 1996 (with D. Quass, A. Gupta, and I.S. Mumick).

  56. Efficient and Flexible Location Management Techniques for Wireless Communication Systems. Proceedings of the Second ACM International Conference on Mobile Computing and Networking (MobiCom '96), pages 38-49, White Plains, New York, November 1996 (with J. Jannink, D. Lam, N. Shivakumar, and D.C. Cox).

  57. Modeling Location Management in Personal Communication Services. Proceedings of the 1996 IEEE International Conference on Universal Personal Communications, volume 2, pages 596-601, Cambridge, Massachusetts, September 1996 (with D. Lam, J. Jannink, and D.C. Cox).

  58. A System Prototype for Warehouse View Maintenance. Proceedings of the 1996 Workshop on Materialized Views: Techniques and Applications, pages 26-33, Montreal, Canada, June 1996 (with J.L. Wiener, H. Gupta, W.J. Labio, Y. Zhuge, and H. Garcia-Molina).

  59. Integrity Constraint Checking in Federated Databases. Proceedings of the First IFCIS International Conference on Cooperative Information Systems, pages 38-47, Brussels, Belgium, June 1996 (with P. Grefen).

  60. Change Detection in Hierarchically Structured Information. Proceedings of the ACM SIGMOD International Conference on Management of Data, pages 493-504, Montreal, Canada, June 1996 (with S. Chawathe, A. Rajaraman, and H. Garcia-Molina).

  61. A Toolkit for Constraint Management in Heterogeneous Information Systems. Proceedings of the Twelfth International Conference on Data Engineering, pages 56-65, New Orleans, Louisiana, February 1996 (with S. Chawathe and H. Garcia-Molina).

  62. Querying Semistructured Heterogeneous Information. Proceedings of the Fourth International Conference on Deductive and Object-Oriented Databases, pages 319-344, Singapore, December 1995 (with D. Quass, A. Rajaraman, Y. Sagiv, and J.D. Ullman).

  63. User Profile Replication for Faster Location Lookup in Mobile Environments. Proceedings of the First ACM International Conference on Mobile Computing and Networking (MobiCom '95), pages 161-169, Berkeley, California, November 1995 (with N. Shivakumar).

  64. Using Delta Relations to Optimize Condition Evaluation in Active Databases. Proceedings of the Second International Workshop on Rules in Database Systems, Lecture Notes in Computer Science 985, pages 292-308, Springer-Verlag, Berlin, September 1995 (with E. Baralis).

  65. The TSIMMIS Approach to Mediation: Data Models and Languages. Proceedings of the Second International Workshop on Next Generation Information Technologies and Systems, pages 185-193, Naharia, Israel, June 1995 (with H. Garcia-Molina, Y. Papakonstantinou, D. Quass, A. Rajaraman, Y. Sagiv, and J.D. Ullman).

  66. View Maintenance in a Warehousing Environment. Proceedings of the ACM SIGMOD International Conference on Management of Data, pages 316-327, San Jose, California, May 1995 (with Y. Zhuge, H. Garcia-Molina, and J. Hammer).

  67. Object Exchange Across Heterogeneous Information Sources. Proceedings of the Eleventh International Conference on Data Engineering, pages 251-260, Taipei, Taiwan, March 1995 (with Y. Papakonstantinou and H. Garcia-Molina).

  68. An Algebraic Approach to Rule Analysis in Expert Database Systems. Proceedings of the Twentieth International Conference on Very Large Data Bases, pages 606-617, Santiago, Chile, September 1994 (with E. Baralis).

  69. Constraint Checking with Partial Information. Proceedings of the Thirteenth ACM SIGMOD-SIGACT-SIGART Symposium on Principles of Database Systems, pages 45-55, Minneapolis, Minnesota, May 1994 (with A. Gupta, Y. Sagiv, and J.D. Ullman).

  70. Efficient and Complete Tests for Database Integrity Constraint Checking. Proceedings of the Second Workshop on Principles and Practice of Constraint Programming, pages 146-151, Orcas Island, Washington, May 1994 (with A. Gupta, Y. Sagiv, and J.D. Ullman).

  71. Managing Semantic Heterogeneity with Production Rules and Persistent Queues. Proceedings of the Nineteenth International Conference on Very Large Data Bases, pages 108-119, Dublin, Ireland, August 1993 (with S. Ceri).

  72. Better Termination Analysis for Active Databases. Proceedings of the First International Workshop on Rules in Database Systems, pages 163-179, Edinburgh, Scotland, August 1993 (with E. Baralis and S. Ceri).

  73. Deductive and Active Databases: Two Paradigms or Ends of a Spectrum? Proceedings of the First International Workshop on Rules in Database Systems, pages 306-315, Edinburgh, Scotland, August 1993.

  74. Local Verification of Global Integrity Constraints in Distributed Databases. Proceedings of the ACM SIGMOD International Conference on Management of Data, pages 49-58, Washington, D.C., May 1993 (with A. Gupta).

  75. Production Rules in Parallel and Distributed Database Environments. Proceedings of the Eighteenth International Conference on Very Large Data Bases, pages 339-351, Vancouver, British Columbia, August 1992 (with S. Ceri).

  76. Behavior of Database Production Rules: Termination, Confluence, and Observable Determinism. Proceedings of the ACM SIGMOD International Conference on Management of Data, pages 59-68, San Diego, California, June 1992 (with A. Aiken and J.M. Hellerstein).

  77. Deriving Production Rules for Incremental View Maintenance. Proceedings of the Seventeenth International Conference on Very Large Data Bases, pages 577-589, Barcelona, Spain, September 1991 (with S. Ceri).

  78. Implementing Set-Oriented Production Rules as an Extension to Starburst. Proceedings of the Seventeenth International Conference on Very Large Data Bases, pages 275-285, Barcelona, Spain, September 1991 (with R.J. Cochrane and B. Lindsay).

  79. Deriving Production Rules for Constraint Maintenance. Proceedings of the Sixteenth International Conference on Very Large Data Bases, pages 566-577, Brisbane, Australia, August 1990 (with S. Ceri).

  80. Set-Oriented Production Rules in Relational Database Systems. Proceedings of the ACM SIGMOD International Conference on Management of Data, pages 259-270, Atlantic City, New Jersey, May 1990 (with S.J. Finkelstein).

  81. A Temporal-Logic Based Compositional Proof System for Real-Time Message Passing. PARLE '89: Proceedings of Parallel Architectures and Languages Europe, Volume II, Lecture Notes in Computer Science 366, pages 424-441, Springer-Verlag, Berlin, June 1989 (with J. Hooman).

  82. Expressiveness Bounds for Completeness in Trace-Based Network Proof Systems. CAAP '88: Proceedings of the Thirteenth Colloquium on Trees in Algebra and Programming, Lecture Notes in Computer Science 299, pages 200-214, Springer-Verlag, Berlin, March 1988 (with P. Panangaden).

  83. Completeness and Incompleteness of Trace-Based Network Proof Systems. Proceedings of the Fourteenth Annual ACM Symposium on Principles of Programming Languages, pages 27-38, Munich, West Germany, January 1987 (with D. Gries and F.B. Schneider).

Invited Conference and Workshop Articles

  84. CQL: A Language for Continuous Queries over Streams and Relations. Proceedings of the Ninth International Conference on Data Base Programming Languages, pages 1-19, Potsdam, Germany, September 2003 (with A. Arasu and S. Babu).

  85. Models and Issues in Data Stream Systems. Proceedings of the Twenty-First ACM SIGMOD-SIGACT-SIGART Symposium on Principles of Database Systems, pages 1-16, Madison, Wisconsin, June 2002 (with B. Babcock, S. Babu, M. Datar, and R. Motwani).

  86. Practical Applications of Triggers and Constraints: Successes and Lingering Issues. Proceedings of the Twenty-Sixth International Conference on Very Large Data Bases, pages 254-262, Cairo, Egypt, September 2000 (with S. Ceri and R.J. Cochrane).

  87. Research Problems in Data Warehousing. Proceedings of the Fourth International Conference on Information and Knowledge Management (CIKM '95), pages 25-30, Baltimore, Maryland, November 1995.

  88. Integrating and Accessing Heterogeneous Information Sources in TSIMMIS. Proceedings of the AAAI Spring Symposium on Information Gathering, pages 61-64, Stanford, California, February 1995 (with J. Hammer, H. Garcia-Molina, K. Ireland, Y. Papakonstantinou, and J.D. Ullman).

  89. The TSIMMIS Project: Integration of Heterogeneous Information Sources. Proceedings of the 100th Anniversary Meeting of the Information Processing Society of Japan, pages 7-18, Tokyo, Japan, October 1994 (with S. Chawathe, H. Garcia-Molina, J. Hammer, K. Ireland, Y. Papakonstantinou, and J.D. Ullman).

  90. Validating Constraints with Partial Information: Research Overview. Proceedings of the Fifth International Workshop on the Deductive Approach to Information Systems and Databases, pages 375-385, Costa Brava, Spain, September 1994 (with A. Gupta, Y. Sagiv, and J.D. Ullman).

  91. Intelligence and Cooperation through Database Production Rules. Proceedings of the Second International Workshop on Intelligent and Cooperative Information Systems, pages 62-67, Como, Italy, October 1991.

Submitted for Publication

  1. Functional Dependencies for Uncertain Relations. Submitted for conference publication, November 2007 (with A. Das Sarma and J.D. Ullman).

  2. Confidence-Aware Joins in Large Uncertain Databases. Submitted for conference publication, November 2007 (with P. Agrawal).

Technical Reports

  1. Representing Uncertain Data: Uniqueness, Equivalence, Minimization, and Approximation. Technical Report, Stanford University InfoLab, December 2005 (with A. Das Sarma and S. Nabar).

  2. Run-Time Translation of View Tuple Deletions Using Data Lineage. Technical Report, Stanford University InfoLab, June 2001 (with Y. Cui).

  3. Implementing Parameterized Range Types in an Extensible DBMS. Technical Report, Stanford University InfoLab, November 2000 (with J. Yang and P. Brown).

  4. Summarizing and Searching Sequential Semistructured Sources. Technical Report, Stanford University InfoLab, March 2000 (with R. Goldman).

  5. Optimizing Branching Path Expressions. Technical Report, Stanford University InfoLab, June 1999 (with J. McHugh).

  6. Indexing Semistructured Data. Technical Report, Stanford University InfoLab, February 1998 (with J. McHugh, S. Abiteboul, Q. Luo, and A. Rajaraman).

  7. Starburst Rule System User's Guide. Internal Technical Report, IBM Almaden Research Center, San Jose, California, July 1992.

  8. Trace-Based Network Proof Systems: Expressiveness and Completeness (Ph.D. thesis). Technical Report 87-833, Computer Science Department, Cornell University, May 1987.

Last updated by Jennifer Widom, February 2008