Abstract
|
|
---|---|
One of the most common forms of addressing reproducibility in scientific workflow-based computational science in the past decades has consisted in tracking the provenance of the produced and published results. Such provenance allows inspecting intermediate and final results, improves understanding, and permits replaying a workflow execution. Nevertheless, this approach does not provide any means for capturing and sharing the very valuable knowledge about the experimental equipment of a computational experiment, i.e., the execution environment in which the experiments are conducted. In this work, we propose a novel approach for describing the execution environment of scientific workflows, so as to conserve them, using semantic vocabularies. We define a process for documenting the workflow application and its related management system, as well as their dependencies. Then we apply this approach over three dierent real workflow applications on three distinguished scenarios, using public, private, and local Cloud platforms. In particular, we study one astronomy workflow and two life science workflows for genomic information analysis. Experimental results show that our approach can reproduce an equivalent execution environment of a predefined virtual machine image on all evaluated computing platforms. | |
International
|
Si |
JCR
|
Si |
Title
|
Future Generation Computer Systems-the International Journal of Escience |
ISBN
|
0167-739X |
Impact factor JCR
|
2,43 |
Impact info
|
|
Volume
|
|
|
10.1016/j.future.2015.12.017 |
Journal number
|
|
From page
|
1 |
To page
|
13 |
Month
|
SIN MES |
Ranking
|