Homework Assignment 3:
Large-Scale DAML Content
1. We have generated a DAML/RDF Version of Wordnet. The RDF source of Wordnet and a schema describing the ontology used to describe the content of Wordnet is available at: http://www.semanticweb.org/library
2. We have also generated 10.820 statements (according to DAML viewer) so far by crawling the URLs: https://www.daml.org/homework/1/summary.htmll and http://www.ontoknowledge.org/oil/case-studies/ as entry points. The statements including are "ontological" and "instance oriented" as well, consisting of pages like the CIA factbook , the KA2 Ontology and the DAML pages of Homework Assignment 1 .
The result page of this crawl can be found at http://www.aifb.uni-karlsruhe.de/WBS/daml/homework/crawlstatements.rdf
3. Lessons Learned. Techniques and Tools
For 1: Datasets of this size need database support - otherwise no use is possible.
For 2: We used the RDF/DAML-Crawler which writes a repository of RDF facts obtained during crawling and verify this repository with the DARPA DAML-Viewer . We use the DAML-Viewer also for counting the amount of statements in the repository file (file->properties).
During the crawling some questions arised:
By Sergey Melnik, Siegfried Handschuh, Stefan Decker,