Difference between revisions of "Data Transfer 2"

From Gcube Wiki
Jump to: navigation, search
m
Line 2: Line 2:
 
== Overview ==
 
== Overview ==
  
This class of components manages transfer capabilities between gCube infrastructure nodes ...
+
The implementation of a reliable data transfer mechanisms between the nodes of a gCube-based Hybrid Data Infrastructure is one of the main objectives when dealing with large set of multi-type datasets distributed across different repositories.
 +
 
 +
To promote an efficient and optimized consumption of these data resources, a number of components have been designed to meet the data transfer requirements.
  
 
This document outlines the  design rationale, key features, and high-level architecture, the options for their deployment and as well some use cases.
 
This document outlines the  design rationale, key features, and high-level architecture, the options for their deployment and as well some use cases.
Line 9: Line 11:
  
 
The components belonging to this class are responsible for:
 
The components belonging to this class are responsible for:
 +
 +
;Point to Point transfer
 +
:direct transfer invocation to a gCube Node
  
 
;reliable data transfer between Infrastructure Data Sources and Data Storages
 
;reliable data transfer between Infrastructure Data Sources and Data Storages
 
:by exploiting the uniform access interfaces provided by gCube and standard transfer protocols
 
:by exploiting the uniform access interfaces provided by gCube and standard transfer protocols
  
;structured and unstructured Data Transfer
+
;automatic transfer optimization
:it guarantees both Tree based and File based transfer to cover all possible iMarine use-cases
+
:by exploiting best available transfer options between invoker and target nodes
  
;transfers to local nodes for data staging
+
;advanced and extensible post transfer processing
:data staging for particular use cases can be enabled on each node of the infrastructure
+
:plugin - oriented implementation to serve advanced use case
  
;advanced transfer scheduling and transfer optimization
+
== Design ==
:a dedicated gCube service responsible for data transfer scheduling combined to transfer optimization at the level of protocols and Access interfaces.
+
  
;transfer statistics availability
 
:transfers are logged by the system and make available to interested consumers.
 
  
;transfer shares per scopes and users
 
:a management interface is used to configure transfer shares per scopes and users at the level of Data Sources and Storages.
 
  
== Design ==
 
  
 
=== Philosophy===
 
=== Philosophy===

Revision as of 14:59, 9 September 2016

Overview

The implementation of a reliable data transfer mechanisms between the nodes of a gCube-based Hybrid Data Infrastructure is one of the main objectives when dealing with large set of multi-type datasets distributed across different repositories.

To promote an efficient and optimized consumption of these data resources, a number of components have been designed to meet the data transfer requirements.

This document outlines the design rationale, key features, and high-level architecture, the options for their deployment and as well some use cases.

Key features

The components belonging to this class are responsible for:

Point to Point transfer
direct transfer invocation to a gCube Node
reliable data transfer between Infrastructure Data Sources and Data Storages
by exploiting the uniform access interfaces provided by gCube and standard transfer protocols
automatic transfer optimization
by exploiting best available transfer options between invoker and target nodes
advanced and extensible post transfer processing
plugin - oriented implementation to serve advanced use case

Design

Philosophy

Architecture

Deployment

Large Deployment

Small Deployment

Use Cases

Well suited use cases

Less well suited use cases