Filters
dynamically loaded drivers for handling document format protocols
variety of functions, mostly recognizing doc format (based on content, extension, etc.) and extracting text/metadata
filter(s) named in collection configuration and loaded by other filters on demand
formats include plain text, PDF, Word docs, etc….