Categorizer
Uses three categorization techniques
- Title-term spotting
- Identifies terms that are of the desired category type
- Information extraction
- Extracts information based on predefined text patterns
- Keyword pruning
- Selects keywords that are of the desired category type
Uses categorization criteria to determine
- What categories to create
- Which documents to assign to those categories