Anish Das Sarma's Publications


2012

Efficient Spatial Sampling of Large Geographical Tables. Anish Das Sarma, Hongrae Lee, Hector Gonzalez, Jayant Madhavan, Alon Halevy. In Proceedings of the ACM SIGMOD International Conference on Management of Data (SIGMOD), Arizona, USA, May 2012.
REX: Explaining Relationships between Entity Pairs. Lujun Fang, Anish Das Sarma, Cong Yu, Philip Bohannon. In Proceedings of the conference on Very Large Data Bases (VLDB), Istanbul, Turkey, August 2012.
Understanding Cyclic Trends in Social Choices. Anish Das Sarma, Sreenivas Gollapudi, Rina Panigrahy and Li Zhang. In Proceedings of the conference on Web Search and Data Mining (WSDM), Seattle, USA, 2012.
Fuzzy Joins Using MapReduce. Foto Afrati, Anish Das Sarma, David Menestrina, Aditya Parameswaran, Jeffrey Ullman. In Proceedings of the International Conference on Data Engineering (ICDE), Washington, USA, April 2012.

2011

CoScan: Cooperative Scan Sharing in the Cloud. Xiaodan Wang, Anish Das Sarma, Christopher Olston, Randal Burns. In Proceedings of the Symposium on Cloud Computing (SoCC), Portugal, 2011.
Human-Assisted Graph Search: It's Okay to Ask Questions. Aditya Parameswaran, Anish Das Sarma, Hector Garcia-Molina, Alkis Polyzotis, Jennifer Widom. In Proceedings of the conference on Very Large Data Bases (VLDB), Seattle, USA, 2011.
Dynamic Relationship and Event Discovery. Anish Das Sarma, Alpa Jain, Cong Yu. Proceedings of the Web-Search and Data Mining Conference (WSDM), Hong Kong, 2011.
Data Integration with Dependent Sources. Anish Das Sarma, Luna Dong, Alon Halevy. Proceedings of the International Conference on Extending Database Technology (EDBT), 2011.
Building a Generic Debugger for Information Extraction Pipelines. Anish Das Sarma, Alpa Jain, Philip Bohannon. Poster paper, CIKM, October 2011. (Full version)
Ibis: A Provenance Manager for Multi-Layer Systems. Christopher Olston, Anish Das Sarma. In Proceedings of the 5th Biennial Conference on Innovative Data Systems Research (CIDR), Pacific Grove, California, January 2011.

2010

Uncertainty in Data Integration and Dataspace Support Platforms. Anish Das Sarma, Luna Dong, Alon Halevy. Book chapter, In Schema Matching and Mapping, ISBN: 978-3-642-16517-7, 2010.
Foundations of Uncertain-Data Integration. Parag Agrawal, Anish Das Sarma, Jeffrey Ullman, Jennifer Widom. Proceedings of the 36th International Conference on Very Large Data Bases (VLDB), Singapore, September 2010.
Succinct Coverage Oracles. Ioannis Antonellis, Anish Das Sarma, Shaddin Dughmi. Technical Report, 2010.
LIVE: A Lineage-Supported Versioned DBMS. Anish Das Sarma, Martin Theobald, Jennifer Widom. In Proceedings of the 22nd International Conference on Scientific and Statistical Database Management (SSDBM), Heidelberg, Germany, June 2010.
I4E: Interactive Investigation of Iterative Information Extraction. Anish Das Sarma, Alpa Jain, Divesh Srivastava. In Proceedings of the ACM SIGMOD International Conference on Management of Data (SIGMOD), Indianapolis, Indiana, USA, June 2010.
Synthesizing View Definitions from Data. Anish Das Sarma, Aditya Parameswaran, Hector Garcia-Molina, Jennifer Widom. In Proceedings of the International Conference on Database Theory (ICDT), Lausanne, Switzerland, March 2010.
Ranking Mechanisms in Twitter-Like Forums. Anish Das Sarma, Atish Das Sarma, Sreenivas Gollapudi, Rina Panigrahy. In Proceedings of the International Conference on Web Search and Data Mining (WSDM), New York City, USA, February 2010.

2009

Managing Uncertain Data. Anish Das Sarma. Ph.D. Thesis, Stanford University, November 2009.

Functional Dependency Generation and Applications in Pay-As-You-Go Data Integration Systems. Daisy Zhe Wang, Luna Dong, Anish Das Sarma, Michael J. Franklin, Alon Halevy. In Proceedings of WebDB, Providence, Rhode Island, June 2009.
Representing Uncertain Data: Models, Properties, and Algorithms. Anish Das Sarma, Omar Benjelloun, Alon Halevy, Shubha Nabar, Jennifer Widom. In VLDB Journal, 18(5), 989-1019, October 2009. (Special issue on uncertain and probabilistic databases.)

Data Modeling in Dataspace Support Platforms. Anish Das Sarma, Luna Dong, Alon Halevy. In Conceptual Modeling: Foundations and Applications, Essays in Honor of John Mylopoulos, Springer Festschrift, LNCS 5600, 2009.

Schema Design for Uncertain Databases. Anish Das Sarma, Jeffrey Ullman, Jennifer Widom. Proceedings of the 3rd Alberto Mendelzon Workshop on Foundations of Data Management, Peru, May 2009.

Sailing the Information Ocean with Awareness of Currents: Discovery and Application of Source Dependence. Laure Berti-Equille, Anish Das Sarma, Xin Luna Dong, Amelie Marian, Divesh Srivastava. Proceedings of the 4th Biennial Conference on Innovative Data Systems Research (CIDR), Pacific Grove, California, January 2009.
Uncertainty In Data Integration. Anish Das Sarma, Luna Dong, Alon Halevy. In C. Aggarwal, editor, Managing and Mining Uncertain Data, Springer, 2009.

2008

Towards Special-Purpose Indexes and Statistics for Uncertain Data. Anish Das Sarma, Parag Agrawal, Shubha Nabar, Jennifer Widom. Proceedings of the Workshop on Management of Uncertain Data (MUD), Auckland, New Zealand, August 2008.

Bootstrapping Pay-As-You-Go Data Integration Systems. Anish Das Sarma, Luna Dong, Alon Halevy. Proceedings of the ACM SIGMOD International Conference on Management of Data (SIGMOD), Vancouver, Canada, June 2008.

Exploiting Lineage for Confidence Computation in Uncertain and Probabilistic Databases. Anish Das Sarma, Martin Theobald, Jennifer Widom. Proceedings of the 24th International Conference on Data Engineering (ICDE), Cancun, Mexico, April 2008. (DBClip)

Databases with Uncertainty and Lineage. Omar Benjelloun, Anish Das Sarma, Alon Halevy, Martin Theobald, Jennifer Widom. VLDB Journal, 17(2), 243-264, March 2008. (Special issue on Best papers of VLDB '06.)

2007

Leveraging Aggregate Constraints for Deduplication. Surajit Chaudhuri, Anish Das Sarma, Venkatesh Ganti, Raghav Kaushik. Proceedings of the ACM SIGMOD International Conference on Management of Data (SIGMOD), Beijing, China, June 2007.

Detecting Near-Duplicates for Web-Crawling. Gurmeet Singh Manku, Arvind Jain, Anish Das Sarma. Proceedings of the 16th International World Wide Web (WWW) Conference, Banff, Canada, May 2007.

Trio-One: Layering Uncertainty and Lineage on a Conventional DBMS. Michi Mutsuzaki, Martin Theobald, Ander de Keijzer, Jennifer Widom, Parag Agrawal, Omar Benjelloun, Anish Das Sarma, Raghotham Murthy, Tomoe Sugihara. Proceedings of the 3rd Biennial Conference on Innovative Data Systems Research (CIDR), Pacific Grove, California, January 2007. Demonstration description.

2006

ULDBs: Databases with Uncertainty and Lineage. Omar Benjelloun, Anish Das Sarma, Alon Halevy, Jennifer Widom. Proceedings of the 32nd International Conference on Very Large Data Bases (VLDB), Seoul, Korea, September 2006.

Trio: A System for Data, Uncertainty, and Lineage. Parag Agrawal, Omar Benjelloun, Anish Das Sarma, Chris Hayworth, Shubha Nabar, Tomoe Sugihara, Jennifer Widom. Proceedings of the 32nd International Conference on Very Large Data Bases (VLDB), Seoul, Korea, September 2006. Demonstration description.

Estimating Data Stream Quality for Object-Detection Applications. Anish Das Sarma, Shawn R. Jeffery, Michael J. Franklin, Jennifer Widom. Proceedings of the Third International ACM SIGMOD Workshop on Information Quality in Information Systems, Chicago, Illinois, June 2006.

Working Models for Uncertain Data. Anish Das Sarma, Omar Benjelloun, Alon Halevy, Jennifer Widom. Proceedings of the Twenty-Second International Conference on Data Engineering (ICDE), Atlanta, Georgia, April 2006.

An Introduction to ULDBs and the Trio System. Omar Benjelloun, Anish Das Sarma, Chris Hayworth, Jennifer Widom. IEEE Data Engineering Bulletin, Special Issue on Probabilistic Databases, 29(1), March 2006.

2005

Representing Uncertainty: Uniqueness, Equivalence, Minimization and Approximation. Anish Das Sarma, Shubha U. Nabar, Jennifer Widom. Technical Report, Stanford University, December 2005.

2004

A Decomposition Based Approach for Design of Supply Aggregation and Demand Aggregation Exchanges. Shantanu Biswas, Y. Narahari, Anish Das Sarma. The International Workshop on Theory Building and Formal Methods in Electronic/Mobile Commerce (TheFormEMC) collocated with FORTE, 2004. Published in LNCS, pp. 58-71, Volume 3236, 2004.

Generic Text Summarization Using WordNet. Kedar Bellare, Anish Das Sarma, Atish Das Sarma, Navneet Loiwal, Vaibhav Mehta, Ganesh Ramakrishnan, Pushpak Bhattacharya. International Conference on Language Resources and Evaluation (LREC), 2004.


Anish Das Sarma < Email: anish@cs.stanford.edu >