[Papers ] [Articles ]
[Books ]
[Book Chapters ] [Theses ] [Miscellaneous ]
bar

Conference & Workshop Papers (Refereed)

2008

``SpotSigs: Robust and Efficient Near Duplicate Detection in Large Web Collections.'' [slides]
Martin Theobald, Jonathan Siddharth, and Andreas Paepcke.
Proceedings of the 31st Annual International ACM SIGIR Conference (SIGIR 2008), Singapore, 2008.

``PhotoSpread: A Spreadsheet for Managing Photos.''
Sean Kandel, Eric Abelson, Hector Garcia-Molina, Andreas Paepcke, and Martin Theobald.
Proceedings of the 2008 Conference on Human Factors in Computing Systems (CHI 2008), Florence, Italy, 2008.

``Exploiting Lineage for Confidence Computation in Uncertain and Probabilistic Databases.''
Anish Das Sarma, Martin Theobald, and Jennifer Widom.
Proceedings of the 24th International Conference on Data Engineering (ICDE 2008), Cancun, Mexico, 2008.

2007

``Efficient Text Proximity Search.''
Ralf Schenkel, Andreas Broschart, Seungwon Hwang, Martin Theobald, and Gerhard Weikum.
4th String Processing and Information Retrieval Symposium (SPIRE 2007), Santiago, Chile, 2007.

``The TopX DB&IR Engine.''
Martin Theobald, Ralf Schenkel, and Gerhard Weikum.
Proceedings of the 27th ACM SIGMOD International Conference on Management of Data (SIGMOD 2007), Beijing, China, 2007.
Demonstration Description.

``TopX - Efficient and Versatile Top-k Query Processing for Text, Structured, and Semistructured Data.'' [slides]
Martin Theobald, Ralf Schenkel, and Gerhard Weikum.
12. GI-Fachtagung für Datenbanksysteme in Business, Technologie und Web (BTW 2007), Aachen, Germany, 2007.
Dissertation Award Invited Paper.

``Trio-One: Layering Uncertainty and Lineage on a Conventional DBMS.''
Michi Mutsuzaki, Martin Theobald, Ander de Keijzer, Jennifer Widom, Parag Agrawal, Omar Benjelloun, Anish Das Sarma, Raghotham Murthy, and Tomoe Sugihara.
3rd Biennial Conference on Innovative Data Systems Research (CIDR 2007), Pacific Grove, California, 2007.
Demonstration Description.

2006

``IO-Top-k: Index-Access Optimized Top-k Query Processing.''
Holger Bast, Debapriyo Majumdar, Ralf Schenkel, Martin Theobald, and Gerhard Weikum.
32nd International Conference on Very Large Data Bases (VLDB 2006), Seoul, Korea, 2006.

``Structural Feedback for Keyword-Based XML Retrieval.''
Ralf Schenkel and Martin Theobald.
28th European Conference on Information Retrieval Research (ECIR 2006), London, UK, 2006.

``Feedback-Driven Structural Query Expansion for Ranked Retrieval of XML Data.''
Ralf Schenkel and Martin Theobald.
10th International Conference on Extending Database Technologies (EDBT 2006), Munich, Germany, 2006.

2005

``Word Sense Disambiguation for Exploiting Hierarchical Thesauri in Text Classification.''
Dimitrios Mavroeidis, George Tsatsaronis, Michalis Vazirgiannis, Martin Theobald, and Gerhard Weikum.
9th European Conference on Principles and Practice of Knowledge Discovery in Databases (PKDD 2005), Porto, Portugal, 2005.

``An Efficient and Versatile Query Engine for TopX Search.'' [slides]
Martin Theobald, Ralf Schenkel, and Gerhard Weikum.
31st International Conference on Very Large Databases (VLDB 2005), Trondheim, Norway, 2005.

``Learning Word-to-Concept Mappings for Automatic Text Classification.''
Georgiana Ifrim, Martin Theobald, and Gerhard Weikum.
Learning in Web Search Workshop - 22nd International Conference on Machine Learning (ICML 2005), Bonn, Germany, 2005.

``Efficient and Self-Tuning Incremental Query Expansion for Top-k Query Processing.'' [slides]
Martin Theobald, Ralf Schenkel, and Gerhard Weikum.
28th Annual International ACM SIGIR Conference on Research and Development in Information Retrieval (SIGIR 2005), Salvador, Brasil, 2005.

2004

``Towards a Statistically Semantic Web.''
Gerhard Weikum, Jens Graupmann, Ralf Schenkel, and Martin Theobald.
23rd International Conference on Conceptual Modeling (ER 2004), Shanghai, China, 2004. Invited Paper.

``Top-k Query Processing with Probabilistic Guarantees.'' [slides]
Martin Theobald, Ralf Schenkel, and Gerhard Weikum.
30th International Conference on Very Large Data Bases (VLDB 2004), Toronto, Canada, 2004.

``COMPASS: A Concept-based Web Search Engine for HTML, XML, and Deep Web Data.''
Jens Graupmann, Michael Biwer, Patrick Zimmer, Christian Zimmer, Matthias Bender, Martin Theobald, and Gerhard Weikum.
30th International Conference on Very Large Data Bases (VLDB 2004), Toronto, Canada, 2004.
Demonstration Description.

``BINGO! and Daffodil: Personalized Exploration of Digital Libraries and Web Sources.'' [slides]
Martin Theobald and Claus-Peter Klas.
7th International Conference on Computer-Assisted Information Retrieval (RIAO 2004), Avignon, France, 2004.

2003

``From Focused Crawling to Expert Information: An Application Framework for Web Exploration and Portal Generation.''
Jens Graupmann, Sergej Sizov, and Martin Theobald.
29th International Conference on Very Large Data Bases (VLDB 2003), Berlin, Germany, 2003.
Demonstration Description.

``Exploiting Structure, Annotation, and Ontological Knowledge for Automatic Classification of XML Data.'' [slides]
Martin Theobald, Ralf Schenkel, and Gerhard Weikum.
6th International Workshop on the Web and Databases (WebDB 2003), San Diego, California, 2003.

``The BINGO! System for Information Portal Generation and Expert Web Search.''
Sergej Sizov, Michael Biwer, Jens Graupmann, Stefan Siersdorfer, Martin Theobald, Gerhard Weikum, and Patrick Zimmer.
1st Biennial Conference on Innovative Data Systems Research (CIDR 2003), Pacific Grove, California, 2003.

2002

``BINGO! - Bookmark-Induced Gathering of Information.''
Sergej Sizov, Stefan Siersdorfer, Martin Theobald, and Gerhard Weikum.
3rd International Conference on Web Information Systems Engineering (WISE 2002), Singapore, 2002.

``BINGO! - Ein thematisch fokussierender Crawler zur Generierung personalisierter Ontologien.'' [slides]
Martin Theobald, Stefan Siersdorfer, and Sergej Sizov.
Web Information Retrieval Workshop; 32. Jahrestagung der Gesellschaft für Informatik, Dortmund, Germany, 2002.

``The BINGO! Focused Crawler: From Bookmarks to Archetypes.''
Sergej Sizov, Stefan Siersdorfer, Martin Theobald, and Gerhard Weikum.
18th International Conference on Data Engineering (ICDE 2002), San Jose, CA, 2002.

bar

Journal Articles (Refereed)

2008

``Databases with Uncertainty and Lineage.''
Omar Benjelloun, Anish Das Sarma, Alon Halevy, Martin Theobald, and Jennifer Widom.
International Journal on Very Large Databases (VLDB-J), VLDB 2006 Selected Papers Special Issue, 17(2), 2008.

``Efficient and Versatile Top-k Query Processing for Semistructured Data.''
Martin Theobald, Holger Bast, Debapriyo Majumdar, Ralf Schenkel, and Gerhard Weikum.
International Journal on Very Large Databases (VLDB-J), Special Issue on Integration of Databases and Information Retrieval, 17(1), 2008.

bar

Books (Monographs)

2007

``Efficient and Versatile Top-k Query Processing for Text, Structured, and Semistructured Data.''
Martin Theobald, ISBN: 978-3-8364-4582-5, VDM Verlag Dr. Müller, 2007.
Available from Amazon.

bar

Book Chapters

2008

``Integrated DB&IR Semi-Structured Text Retrieval.''
Ralf Schenkel and Martin Theobald.
To appear in the Encyclopedia of Database Systems, Ling Liu and M. Tamer Özsu (Eds.), LNCS, Springer, 2008.

``XML Top-k Query Processing.''
Amélie Marian and Martin Theobald.
To appear in the Encyclopedia of Database Systems, Ling Liu and M. Tamer Özsu (Eds.), LNCS, Springer, 2008.

2003

``Classification and Focused Crawling for Semistructured Data.''
Martin Theobald, Ralf Schenkel, and Gerhard Weikum.
Intelligent Search on XML Data, Henk Blanken, Torsten Grabs, Hans-Jörg Schek, Ralf Schenkel, and Gerhard Weikum (Eds.), LNCS 2818, Springer, 2003.

bar

Theses

2006

``TopX - Efficient and Versatile Top-k Query Processing for Text, Structured, and Semistructured Data.'' [slides]
Ph.D. Thesis, Martin Theobald, Saarland University, May 2006.

2002

``BINGO! - Bookmark-Induced Gathering of Information with Adaptive Classification into Personalized Ontologies.''
Diploma Thesis, Martin Theobald, Saarland University, March 2002.

bar

Miscellaneous (Non-Refereed)

2007

``TopX @ INEX 2007.''
Andreas Broschart, Ralf Schenkel, Martin Theobald, and Gerhard Weikum.
Advances in XML Information Retrieval and Evaluation - 6th International Workshop of the Initiative for the Evaluation of XML Retrieval (INEX 2007), Schloss Dagstuhl, Germany, 2007.
Revised Selected Papers.

``The PhotoSpread Query Language.''
Sean Kandel, Andreas Paepcke, Martin Theobald, and Hector Garcia-Molina.
Stanford Infolab, Technical Report, 2007.

2006

``TopX @ INEX 2006: Ad-Hoc and Feedback Tasks.''
Ralf Schenkel and Martin Theobald.
Advances in XML Information Retrieval and Evaluation - 5th International Workshop of the Initiative for the Evaluation of XML Retrieval (INEX 2006), Schloss Dagstuhl, Germany, 2006.
Revised Selected Papers.

``IO-Top-k at the TREC Terabyte Track 2006.''
Holger Bast, Debapriyo Majumdar, Ralf Schenkel, Martin Theobald, and Gerhard Weikum.
Proceedings of the 15th Text Retrieval Conference (TREC 2006), NIST, Gaithersburg, Maryland, 2006.

``IO-Top-k: Index-Access Optimized Top-k Query Processing.''
Holger Bast, Debapriyo Majumdar, Ralf Schenkel, Martin Theobald, and Gerhard Weikum.
Technical Report (extended version), MPI-I-2006-5-002, 2006.

2005

``TopX & XXL @ INEX 2005.''
Ralf Schenkel and Martin Theobald.
Advances in XML Information Retrieval and Evaluation - 4th International Workshop of the Initiative for the Evaluation of XML Retrieval (INEX 2005), Schloss Dagstuhl, Germany, 2005.
Revised Selected Papers.

bar