Content Source Description
Introduction
The Content Source Description (CSD) is a digital libarary service that supports the execution of content-based queries against a number of content sources (such as collections) that are associated with DILIGENT indices.
Implementation Overview
Among the many possible ways of implementing a content source description service, the provided reference CSD service is based on the representation of text sources as term histograms. A histogram basically contains the most representative words and phrases of a content source (i.e. a content collection) together with term statistics. To obtain these statistics, the reference CSD service interacts with index services in order to derive statistical information from full-text DILIGENT indices of internal sources and to subscribe for notifications should these indices change (notifications will be available in the beta-release of the project).
Dependencies
- Java JDK 1.5
- WS-Core
- DiligentProvider
- KXML (version 2.3.0)
- Contentmanagement
- DIRCommons library
- Indexservice Generatorservice
- Indexservice Lookupservice
- DISHL client
- DISIP