[Papers] | [Articles] | [Books] | [Book Chapters] | [Theses] | [Miscellaneous]

``SpotSigs: Robust and Efficient Near Duplicate Detection in
Large Web Collections.'' [slides]
Martin Theobald, Jonathan Siddharth, and Andreas Paepcke.
Proceedings of the 31st Annual International ACM SIGIR Conference (SIGIR 2008), Singapore, 2008.
``PhotoSpread: A Spreadsheet for Managing Photos.''
Sean Kandel, Eric Abelson, Hector Garcia-Molina, Andreas Paepcke, and
Martin Theobald.
Proceedings of the 2008 Conference on Human Factors in Computing
Systems (CHI 2008), Florence, Italy, 2008.
``Exploiting Lineage for Confidence Computation in Uncertain and
Probabilistic Databases.''
Anish Das Sarma, Martin Theobald, and Jennifer Widom.
Proceedings of the 24th International Conference on Data
Engineering (ICDE 2008), Cancun, Mexico, 2008.
``Efficient Text Proximity Search.''
Ralf Schenkel, Andreas Broschart, Seungwon Hwang, Martin Theobald, and
Gerhard Weikum.
14th String Processing and Information Retrieval Symposium (SPIRE 2007),
Santiago, Chile, 2007.
``The TopX DB&IR Engine.''
Martin Theobald, Ralf Schenkel, and Gerhard Weikum.
Proceedings of the 27th ACM SIGMOD International Conference on
Management of Data (SIGMOD 2007), Beijing, China, 2007.
Demonstration Description.
``TopX - Efficient and Versatile Top-k Query Processing for
Text, Structured, and Semistructured Data.'' [slides]
Martin Theobald, Ralf Schenkel, and Gerhard Weikum.
12. GI-Fachtagung für Datenbanksysteme in Business, Technologie und Web
(BTW 2007), Aachen, Germany, 2007.
Dissertation Award Invited Paper.
``Trio-One: Layering Uncertainty and Lineage on a Conventional
DBMS.''
Michi Mutsuzaki, Martin Theobald, Ander de Keijzer, Jennifer Widom,
Parag Agrawal, Omar Benjelloun, Anish Das Sarma, Raghotham Murthy, and
Tomoe Sugihara.
3rd Biennial Conference on Innovative Data Systems Research (CIDR 2007),
Pacific Grove, California, 2007.
Demonstration Description.
``IO-Top-k: Index-Access Optimized Top-k Query Processing.''
Holger Bast, Debapriyo Majumdar, Ralf Schenkel, Martin Theobald, and
Gerhard Weikum.
32nd International Conference on Very Large Data
Bases (VLDB 2006), Seoul, Korea, 2006.
``Structural Feedback for Keyword-Based XML Retrieval.''
Ralf Schenkel and Martin Theobald.
28th European Conference on Information Retrieval Research (ECIR 2006),
London, UK, 2006.
``Feedback-Driven Structural Query Expansion for Ranked Retrieval
of XML Data.''
Ralf Schenkel and Martin Theobald.
10th International Conference on Extending Database Technology
(EDBT 2006), Munich, Germany, 2006.
``Word Sense Disambiguation for Exploiting Hierarchical Thesauri
in Text Classification.''
Dimitrios Mavroeidis, George Tsatsaronis, Michalis Vazirgiannis,
Martin Theobald, and Gerhard Weikum.
9th European Conference on Principles and Practice of Knowledge
Discovery in Databases (PKDD 2005), Porto, Portugal, 2005.
``An Efficient and Versatile Query Engine for TopX Search.'' [slides]
Martin Theobald, Ralf Schenkel, and Gerhard Weikum.
31st International Conference on Very Large Data Bases (VLDB 2005),
Trondheim, Norway, 2005.
``Learning Word-to-Concept Mappings for Automatic Text
Classification.''
Georgiana Ifrim, Martin Theobald, and Gerhard Weikum.
Learning in Web Search Workshop - 22nd International Conference on
Machine Learning (ICML 2005), Bonn, Germany, 2005.
``Efficient and Self-Tuning Incremental Query Expansion for Top-k
Query Processing.'' [slides]
Martin Theobald, Ralf Schenkel, and Gerhard Weikum.
28th Annual International ACM SIGIR Conference on Research and
Development in Information Retrieval (SIGIR 2005), Salvador, Brazil,
2005.
``Towards a Statistically Semantic Web.''
Gerhard Weikum, Jens Graupmann, Ralf Schenkel, and Martin Theobald.
23rd International Conference on Conceptual Modeling (ER 2004),
Shanghai, China, 2004.
Invited Paper.
``Top-k Query Processing with Probabilistic Guarantees.'' [slides]
Martin Theobald, Ralf Schenkel, and Gerhard Weikum.
30th International Conference on Very Large Data Bases (VLDB 2004),
Toronto, Canada, 2004.
``COMPASS: A Concept-based Web Search Engine for HTML, XML, and Deep Web
Data.''
Jens Graupmann, Michael Biwer, Patrick Zimmer, Christian Zimmer,
Matthias Bender, Martin Theobald, and Gerhard Weikum.
30th International Conference on Very Large Data Bases (VLDB 2004),
Toronto, Canada, 2004.
Demonstration Description.
``BINGO! and Daffodil: Personalized Exploration of Digital
Libraries and Web Sources.'' [slides]
Martin Theobald and Claus-Peter Klas.
7th International Conference on Computer-Assisted Information
Retrieval (RIAO 2004), Avignon, France, 2004.
``From Focused Crawling to Expert Information: An Application
Framework for Web Exploration and Portal Generation.''
Jens Graupmann, Sergej Sizov, and Martin Theobald.
29th International Conference on Very Large Data Bases (VLDB 2003),
Berlin, Germany, 2003.
Demonstration Description.
``Exploiting Structure, Annotation, and Ontological Knowledge for
Automatic Classification of XML Data.'' [slides]
Martin Theobald, Ralf Schenkel, and Gerhard Weikum.
6th International Workshop on the Web and Databases (WebDB 2003),
San Diego, California, 2003.
``The BINGO! System for Information Portal Generation and Expert
Web Search.''
Sergej Sizov, Michael Biwer, Jens Graupmann, Stefan
Siersdorfer, Martin Theobald, Gerhard Weikum, and Patrick Zimmer.
1st Biennial Conference on Innovative Data Systems Research (CIDR 2003),
Pacific Grove, California, 2003.
``BINGO! - Bookmark-Induced Gathering of Information.''
Sergej Sizov, Stefan Siersdorfer, Martin Theobald, and Gerhard Weikum.
3rd International Conference on Web Information Systems Engineering
(WISE 2002), Singapore, 2002.
``BINGO! - Ein thematisch fokussierender Crawler zur Generierung
personalisierter Ontologien.'' [slides]
Martin Theobald, Stefan Siersdorfer, and Sergej Sizov.
Web Information Retrieval Workshop; 32. Jahrestagung der Gesellschaft
für Informatik, Dortmund, Germany, 2002.
``The BINGO! Focused Crawler: From Bookmarks to Archetypes.''
Sergej Sizov, Stefan Siersdorfer, Martin Theobald, and Gerhard Weikum.
18th International Conference on Data Engineering (ICDE 2002), San
Jose, CA, 2002.

``Databases with Uncertainty and Lineage.''
Omar Benjelloun, Anish Das Sarma, Alon Halevy, Martin Theobald, and
Jennifer Widom.
International Journal on Very Large Data Bases (VLDB-J), VLDB
2006 Selected Papers Special Issue, 17(2), 2008.
``Efficient and Versatile Top-k Query Processing for
Semistructured Data.''
Martin Theobald, Holger Bast, Debapriyo Majumdar, Ralf Schenkel, and
Gerhard Weikum.
International Journal on Very Large Data Bases (VLDB-J),
Special Issue on Integration of Databases and Information Retrieval,
17(1), 2008.

``Efficient and Versatile Top-k Query Processing for Text,
Structured, and Semistructured Data.''
Martin Theobald, ISBN: 978-3-8364-4582-5, VDM Verlag Dr. Müller, 2007.
Available from Amazon.

``Integrated DB&IR Semi-Structured Text Retrieval.''
Ralf Schenkel and Martin Theobald.
To appear in the Encyclopedia of Database Systems, Ling Liu and
M. Tamer Özsu (Eds.), LNCS, Springer, 2008.
``XML Top-k Query Processing.''
Amélie Marian and Martin Theobald.
To appear in the Encyclopedia of Database Systems, Ling Liu and
M. Tamer Özsu (Eds.), LNCS, Springer, 2008.
``Classification and Focused Crawling for Semistructured Data.''
Martin Theobald, Ralf Schenkel, and Gerhard Weikum.
Intelligent Search on XML Data, Henk Blanken, Torsten Grabs,
Hans-Jörg Schek, Ralf Schenkel, and Gerhard Weikum (Eds.),
LNCS 2818, Springer, 2003.

``TopX - Efficient and Versatile Top-k Query Processing for Text, Structured, and Semistructured Data.'' [slides]
Ph.D. Thesis, Martin Theobald, Saarland University, May 2006.
``BINGO! - Bookmark-Induced Gathering of Information with Adaptive Classification into Personalized Ontologies.''
Diploma Thesis, Martin Theobald, Saarland University, March 2002.

``TopX @ INEX 2007.''
Andreas Broschart, Ralf Schenkel, Martin Theobald, and Gerhard Weikum.
Advances in XML Information Retrieval and
Evaluation - 6th International Workshop of the Initiative for the
Evaluation of XML Retrieval (INEX 2007), Schloss Dagstuhl, Germany, 2007.
Revised Selected Papers.
``The PhotoSpread Query Language.''
Sean Kandel, Andreas Paepcke, Martin Theobald, and Hector Garcia-Molina.
Stanford Infolab, Technical Report, 2007.
``TopX @ INEX 2006: Ad-Hoc and Feedback Tasks.''
Ralf Schenkel and Martin Theobald.
Advances in XML Information Retrieval and
Evaluation - 5th International Workshop of the Initiative for the
Evaluation of XML Retrieval (INEX 2006), Schloss Dagstuhl, Germany, 2006.
Revised Selected Papers.
``IO-Top-k at the TREC Terabyte Track 2006.''
Holger Bast, Debapriyo Majumdar, Ralf Schenkel, Martin Theobald, and Gerhard Weikum.
Proceedings of the 15th Text Retrieval Conference (TREC 2006), NIST,
Gaithersburg, Maryland, 2006.
``IO-Top-k: Index-Access Optimized Top-k Query Processing.''
Holger Bast, Debapriyo Majumdar, Ralf Schenkel, Martin Theobald, and
Gerhard Weikum.
Technical Report (extended version), MPI-I-2006-5-002, 2006.
``TopX & XXL @ INEX 2005.''
Ralf Schenkel and Martin Theobald.
Advances in XML Information Retrieval and
Evaluation - 4th International Workshop of the Initiative for the
Evaluation of XML Retrieval (INEX 2005), Schloss Dagstuhl, Germany, 2005.
Revised Selected Papers.
