References
Web pages
W3C Extensible Markup Language (XML),
contains many XML related links
XML FAQ
The SGML/XML Web Page
by Robin Cover contains a list of books and articles about XML.
Microsoft XML page,
with a tutorial and some XML demos
IBM XML page
Oracle XML page
MText Mining, Web Mining, Information Retrieval and Extraction from the WWW References,
a list of related links collected by Weiguo Fan
Articles
XML, Java, and the future of the Web
,
Jon Bosak, Sun Microsystems
Research projects
NoDoSE, the Northwestern Document Structure Extractor
, Northwestern University
LORE, A database management system for XML
, Stanford University
Strudel Web-Site Management System
, University of Washington and AT&T Lab
MIDAS, Mining data at Stanford
, Stanford University
Research Papers
NoDoSE: A Tool for Semi-Automatically Extracting Structured and Semistructured Data from Text Documents
, Technical Report.
Extracting Schema from Semistructured Data
, Svetlozar Nestorov, Serge Abiteboul, Rajeev Motwani,
Proceedings of 1998 ACM International Conference On Management of Data (
SIGMOD'98
), Seattle, Washington, June 1998.
Books
XML Applications, by Frank Boumphrey et.al., Wrox Press Ltd (Examples can be found at
http://webdev.wrox.co.uk/books/1525
.)
Tools
Tidy: converting HTML documents to XML documents
Lark
: An XML parser by Tim Bray
XP
: An XML parser by James Clark (notice that this is an FTP link)
Other references
The HL7 document Patient Record Architecture
, Kona Editorial Group