Data Processing Pipeline

From Gcube Wiki
Jump to: navigation, search

The gCube Data Processing Pipeline is the processing engine of gCube which is responsible for carrying out requests for processing data hosted, managed or accessible by the gCube system.

Typical parts of the gCube Data Processing Pipeline are:

  • The gCube Search Service which creates the graph of the services that will carry out the processing of data and optimizes its execution plan.
  • The gRS mechanism that is responsible for the creation of the pipeline by allowing paged and staged transfers and performing as flow control among the participants of flow
  • The Process Execution Service which manages the execution by performing the required matching invocation and monitoring.
  • The worker services which are typically the Search Operators but are not limited to these and can be any kind of appropriately crafted and described gCube service (Custom Operators).