DESIRE Toolkit Components

2.11 Matcher

Description

The Matcher tool implements a subject classification process using a subject-specific thesaurus by which terms are intellectually mapped to categories or subject classes. The classification process is made up of several steps. First, the document to be classified is fetched. Text is extracted from this document, and all thesaurus terms are matched to it. Some heuristic processing rules are applied to the results from the matching process. Finally, the outcome is formatted either for presentation or for storing in a database.


demonstrator:
http://www.lub.lu.se/desire/demonstration.html


installation:
http://cvs.desire.org/cgi-bin/cvsweb.cgi/desire-toolkit/perl/autoclass/


related documentation:

Creation and automatic classification of a robot-generated subject index
Anders Ardö, Traugott Koch NetLab. Summary of poster for ECDL99 Conference
http://www.lub.lu.se/desire/poster.html

The construction of a robot-generated subject index
Anders Ardö, Traugott Koch and Lars Noodén, NetLab, Lund Univ.
http://www.lub.lu.se/desire/DESIRE36a-WP1.html

Automatic Classification and Content Navigation Support for Web Services - DESIRE II Cooperates with OCLC
Traugott Koch, NetLab, Lund University, Diane Vizine-Goetz, Consulting Research Scientist, OCLC Office of Research
http://www.oclc.org/oclc/research/publications/review98/koch_vizine-goetz/automatic.htm