WWW Indexing and Harvesting Software
The Combine harvesting robot (DESIRE 1)
- distributed architecture
- different components talk using client server technology.
- a modular design, easy to modify and extend.
DESIRE 2 will improve and extend:
- Metdata indexing - to include RDF
- Range of document types
- New summarizers and summarizer mechanisms.
- Range of protocols harvested , to include NNTP and possibly FTP