Data Mining Facilities
Data Mining facilities include a set of features, services and methods for performing data processing and mining on biological information sets. These features face several aspects of biological data processing ranging from ecological modeling issues to niche modeling experiments. The set of libraries and services used by the D4Science e-infrastructure manage data mining problems even from a computational complexity point of view. Algorithms are executed in parallel and possibly distributed fashion, using the same GHNs as working nodes. Furthermore services performing Data Mining operations are distributed in order to implement a load balancing for those procedures requiring local resources.
By means of the above methods, Data Mining in i-Marine has the aim to manage problems like (i) the prediction of the impact of climate changes on biodiversity, (ii) the prevention of the spread of invasive species, (iii) the identification of geographical and ecological aspects of disease transmission, (iv) the conservation planning, (v) the prediction of suitable habitat for marine species. By using the computational facilities of the D4Science e-Infrastructure, algorithms can be run in a cost-effective way, letting scientists perform more experiments and combine different techniques.