Difference between revisions of "Data Assessment, Harmonisation, and Certification Facilities"
m (→Key Features) |
m (→Key Features) |
||
(15 intermediate revisions by 2 users not shown) | |||
Line 1: | Line 1: | ||
+ | [[Category:gCube Features]] | ||
== Overview == | == Overview == | ||
Line 11: | Line 12: | ||
The components part of the subsystem provide the following main key features: | The components part of the subsystem provide the following main key features: | ||
− | ; | + | ;workflow-oriented tabular data manipulation |
− | + | :user-defined definition and execution of workflows of data manipulation steps | |
− | + | :rich array of data manipulation facilities offered 'as-a-Service' | |
− | : | + | :rich array of data mining facilities offered 'as-a-Service' |
− | + | :rich array of data visualisation facilities offered 'as-a-Service' | |
− | + | ||
− | : | + | ;reference-data management support |
− | + | :uniform model for reference-data representation including versioning and provenance | |
− | + | ||
− | : | + | |
− | + | ||
− | + | ||
− | : | + | |
− | + | ||
− | ; | + | |
− | : | + | |
− | ; | + | ;data curation and enrichment support |
− | : | + | :species occurrence data enrichment with environmental data dynamically acquired by data providers |
+ | :data provenance recording | ||
− | ; | + | ;standard-based data presentation |
− | : | + | :[http://www.opengeospatial.org/ OGC standard]-based Geospatial data presentation |
== Main Components == | == Main Components == | ||
Line 38: | Line 32: | ||
; Tabular Data | ; Tabular Data | ||
:this family of components provides: | :this family of components provides: | ||
− | :* [[Tabular Data Flow Manager]]: a service providing tabular data flow management. | + | <!--:* [[Tabular Data Flow Manager]]: a service providing tabular data flow management. |
− | :* [[Tabular Data Manager]]: a set of libraries for tabular data visualization and management. | + | :* [[Tabular Data Manager]]: a set of libraries for tabular data visualization and management.--> |
+ | :* [[Tabular Data Service]]: a service supporting tabular data flow management; | ||
; Time Series | ; Time Series | ||
:this family of components provides: | :this family of components provides: | ||
− | :* [[Time Series | + | :* [[TimeSeries|Time Series]]: a service for performing assessment and harmonization on time series. |
:* [[Codelist Manager]]: a library for performing import, harmonization and curation on code lists. | :* [[Codelist Manager]]: a library for performing import, harmonization and curation on code lists. | ||
; Biodiversity Data | ; Biodiversity Data |
Latest revision as of 20:05, 16 December 2013
Overview
gCube is a software suite equipped with a rich array of services capable to interface with data sources having different characteristics both in terms of data types these sources offers (e.g. from document data, to statistical, biodiversity, and semantic data - see Data Access and Storage Facilities) and the heterogeneity of data belonging to the same type.
The goal of the Data Assessment, Harmonisation, and Certification Facilities is to deal with the above heterogeneity and provide unified views over diverse data items through a number of dedicated services. To meet this goal a number of components have been designed.
This page outlines the design rationale and high-level architecture of such components.
Key Features
The components part of the subsystem provide the following main key features:
- workflow-oriented tabular data manipulation
- user-defined definition and execution of workflows of data manipulation steps
- rich array of data manipulation facilities offered 'as-a-Service'
- rich array of data mining facilities offered 'as-a-Service'
- rich array of data visualisation facilities offered 'as-a-Service'
- reference-data management support
- uniform model for reference-data representation including versioning and provenance
- data curation and enrichment support
- species occurrence data enrichment with environmental data dynamically acquired by data providers
- data provenance recording
- standard-based data presentation
- OGC standard-based Geospatial data presentation
Main Components
- Tabular Data
- this family of components provides:
- Tabular Data Service: a service supporting tabular data flow management;
- Time Series
- this family of components provides:
- Time Series: a service for performing assessment and harmonization on time series.
- Codelist Manager: a library for performing import, harmonization and curation on code lists.
- Biodiversity Data
- this family of components provides:
- Occurrence Data Reconciliation: a service for performing assessment and harmonization on occurrence points of species.
- Occurrence Data Enrichment Service: a service for performing enrichment of information associated to occurrence points of species.
- Taxon Names Reconciliation Service: a service for performing assessment and harmonization on taxa.