DESIRE Toolkit Components
2.11 Matcher
Description
The Matcher tool implements a subject classification process based on a
subject-specific thesaurus whose terms have been intellectually mapped to
categories or subject classes. The classification process consists of
five steps. First, the document to be classified is fetched. Second, text
is extracted from the document. Third, every thesaurus term is matched
against the extracted text. Fourth, heuristic processing rules are applied
to the results of the matching step. Finally, the outcome is formatted
either for presentation or for storage in a database.
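
The steps described above can be sketched in Python as follows. This is a
minimal illustration under stated assumptions, not the Matcher tool's actual
code: the thesaurus contents, the weights, the score cutoff, and every
function name (fetch, extract_text, match_terms, apply_heuristics,
format_result, classify) are hypothetical, and the single heuristic shown
(sum weighted hits per class, then drop low-scoring classes) stands in for
whatever rules the real tool applies.

```python
import html
import json
import re
from collections import defaultdict
from urllib.request import urlopen

# Hypothetical thesaurus: term -> (subject class, weight). Illustration only.
THESAURUS = {
    "heat transfer": ("Thermodynamics", 3),
    "entropy": ("Thermodynamics", 2),
    "finite element": ("Structural analysis", 3),
    "beam": ("Structural analysis", 1),
}


def fetch(url):
    """Step 1: fetch the document to be classified (needs network access)."""
    with urlopen(url) as response:
        return response.read().decode("utf-8", errors="replace")


def extract_text(markup):
    """Step 2: strip tags, decode entities, normalise whitespace and case."""
    text = re.sub(r"<[^>]+>", " ", markup)
    text = html.unescape(text)
    return re.sub(r"\s+", " ", text).strip().lower()


def match_terms(text, thesaurus):
    """Step 3: count whole-word occurrences of each thesaurus term."""
    hits = {}
    for term in thesaurus:
        count = len(re.findall(r"\b" + re.escape(term) + r"\b", text))
        if count:
            hits[term] = count
    return hits


def apply_heuristics(hits, thesaurus, min_score=3):
    """Step 4 (assumed rule): sum weighted hits per class, apply a cutoff."""
    scores = defaultdict(int)
    for term, count in hits.items():
        subject, weight = thesaurus[term]
        scores[subject] += weight * count
    kept = [(s, v) for s, v in scores.items() if v >= min_score]
    # Highest score first; ties broken alphabetically by class name.
    return sorted(kept, key=lambda item: (-item[1], item[0]))


def format_result(ranked, as_json=False):
    """Step 5: format for presentation (text) or for storage (JSON)."""
    if as_json:
        return json.dumps([{"class": s, "score": v} for s, v in ranked])
    return "\n".join(f"{s}: {v}" for s, v in ranked)


def classify(markup, thesaurus=THESAURUS):
    """Run steps 2 to 4 on markup that has already been fetched."""
    text = extract_text(markup)
    hits = match_terms(text, thesaurus)
    return apply_heuristics(hits, thesaurus)
```

To classify a live document, the result of fetch(url) would be passed to
classify(); keeping the fetch step separate lets the remaining steps run on
locally stored text as well.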