e-Science 2008 4th IEEE International Conference on e-Science

Main Conference Sessions

User-friendly Management of Workflow Results: From Provenance Information to Grid Logical File Names

Authors

  • Tristan Glatard, University of Amsterdam
  • Silvia Olabarriaga, Academic Medical Centre Amsterdam

Abstract

Grid workflows can produce thousands of results that should be properly organized to enable further analysis. Typically, results are stored in locations hard-coded in the workflow or in its components, which limits reusability. In this paper we present an approach to reorganize the output files generated by a grid workflow in a distributed storage environment. We propose to perform a post-mortem mapping of workflow results into a directory structure. This mapping is based on data provenance information and exploits grid catalog features, namely logical file names, to avoid data replication. By defining different mappings, users can generate their own semantic view of the results generated during a workflow execution, which fosters user-friendliness while preserving workflow reusability. An implementation on the VBrowser framework is detailed and evaluated on neuroimaging workflows. Results show that the complex directory structure of an image analysis application can be properly generated by our system. An initial performance evaluation of the mapping resolution and directory structure creation indicates that this approach provides a practical, simple, yet powerful solution to an important roadblock for the adoption of workflows to implement complex image analysis pipelines.
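
The idea of mapping provenance information to logical file names can be illustrated with a minimal sketch. It is not the VBrowser implementation described in the paper: the record fields (`physical`, `subject`, `step`), the storage URLs, the path templates, and the `build_view` helper are all hypothetical, and a plain dictionary stands in for the grid file catalog. Each view assigns new logical names to the same physical files, so no data is copied.

```python
# Hypothetical provenance records: one entry per workflow output, holding the
# physical location of the file and the inputs that produced it.
RECORDS = [
    {"physical": "srm://se.example.org/run42/f001", "subject": "s01", "step": "registration"},
    {"physical": "srm://se.example.org/run42/f002", "subject": "s01", "step": "segmentation"},
    {"physical": "srm://se.example.org/run42/f003", "subject": "s02", "step": "registration"},
]


def build_view(records, template):
    """Map each result to a logical file name derived from its provenance.

    The returned dictionary plays the role of a catalog: logical name ->
    physical location. Only names are created; the files are not copied.
    """
    view = {}
    for record in records:
        # Every provenance field except the physical location may be used
        # as a placeholder in the path template.
        fields = {key: value for key, value in record.items() if key != "physical"}
        logical_name = template.format(**fields)
        if logical_name in view:
            # The mapping must give each result a distinct logical name.
            raise ValueError(f"two results map to {logical_name!r}")
        view[logical_name] = record["physical"]
    return view


# Two user-defined views of the same results.
by_subject = build_view(RECORDS, "/results/{subject}/{step}.nii")
by_step = build_view(RECORDS, "/results/{step}/{subject}.nii")

for name, location in sorted(by_subject.items()):
    print(name, "->", location)
```

Both views reference the same three physical locations; only the directory layout seen by the user differs. A template that does not distinguish all results, such as `/results/{subject}.nii` here, is rejected rather than silently overwriting an entry.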

Date and Time

Friday, December 12, 3:30 p.m. to 4:00 p.m.

Room Number

206
