From gridified scripts to workflows: the FSL Feat case Tristan - - PowerPoint PPT Presentation

from gridified scripts to workflows the fsl feat case
SMART_READER_LITE
LIVE PREVIEW

From gridified scripts to workflows: the FSL Feat case Tristan - - PowerPoint PPT Presentation

From gridified scripts to workflows: the FSL Feat case Tristan Glatard and Slvia D. Olabarriaga Academic Medical Center Informatics Institute University of Amsterdam MICCAI-G workshop September 6 th 2008 T.Glatard - S.D. Olabarriaga -


slide-1
SLIDE 1

T.Glatard - S.D. Olabarriaga - MICCAI-G'08 - 1

From gridified scripts to workflows: the FSL Feat case

Tristan Glatard and Sílvia D. Olabarriaga Academic Medical Center – Informatics Institute University of Amsterdam MICCAI-G workshop – September 6th 2008

slide-2
SLIDE 2

T.Glatard - S.D. Olabarriaga - MICCAI-G'08 - Sept. 6th 2/16

Workflows in neuroimaging

  • Coming up in the community
  • See e.g. [Rex et al 03, Porro et al 06, Fissel 08, Soleman et al

08, Krefting et al 08, Pernod et al 08]

  • Transparency of analysis methods
  • Eases application tweaking
  • Improves reusability & maintenance (components)
  • Improves error detection
  • Facilitated access to grids
  • Transparent parallelization
  • Performance improvement (↓CPU time, ↓results size)
slide-3
SLIDE 3

T.Glatard - S.D. Olabarriaga - MICCAI-G'08 - Sept. 6th 3/16

Many use-cases / one feat

fMRI scan Stimulus Activation map

Pre-processing Registration

(intra-patient)

GLM computation

Anatomical scan

active rest time

Registration

(standard brain)

Template brain

[Smith et al 04]

slide-4
SLIDE 4

T.Glatard - S.D. Olabarriaga - MICCAI-G'08 - Sept. 6th 4/16

Workflow drawbacks

  • Performance issues
  • ↑number of jobs (↑grid load, ↑fault probability)
  • ↑data transfers
  • ↑sensitivity to latency
  • Usability issues
  • Tiresome description of the application
  • Management of distributed results
slide-5
SLIDE 5

T.Glatard - S.D. Olabarriaga - MICCAI-G'08 - Sept. 6th 5/16

Outline

  • Introduction
  • Workflow implementation description
  • Performance comparison
  • Output organization

Is it worth moving from to ?

Feat

Feat

slide-6
SLIDE 6

T.Glatard - S.D. Olabarriaga - MICCAI-G'08 - Sept. 6th 6/16

Feat FSL workflow

Normalization Pre-processing Model computation

  • Largest workflow on (in June 2008)
  • To be iterated hundreds to thousands of times
  • Used Scufl language with dot-product from [Montagnat et al'06]
  • Expected parallelism exploitation
slide-7
SLIDE 7

T.Glatard - S.D. Olabarriaga - MICCAI-G'08 - Sept. 6th 7/16

Implementation evaluation

✔ Reproduced use-case of [Soleman et al 08]

  • Assessed on limited dataset
  • Executed on vlemed EGEE VO using MOTEUR

✗ Not implemented Feat options

  • B0 unwarping, contrast masking, denoising,

perfusion subtraction

✗ Dynamic patterns hardly manageable

 e.g., fixed number of EVs and contrasts

✗ May not generalize to other use-cases

  • Assumed, e.g., 1 anatomical scan per EPI scan
slide-8
SLIDE 8

T.Glatard - S.D. Olabarriaga - MICCAI-G'08 - Sept. 6th 8/16

Performance study

  • Use-cases
  • Job farming (nF files)
  • Sweep on model parameter

(nP parameters)

  • Simulation of workflow scheduling
  • List-scheduling algorithm (nR =10 resources)
  • Data transfers measured on vlemed VO
  • CPU time measured on local PC
  • With/without latency: time to access free resource

Normalization Pre-processing Model

Job farming: nF files Param. sweep: nP params

slide-9
SLIDE 9

T.Glatard - S.D. Olabarriaga - MICCAI-G'08 - Sept. 6th 9/16

Results: job farming

  • No data transfers – no latency
  • Workflow outperforms monolithic
  • Reaches linear speed-up

feat CPU time

slide-10
SLIDE 10

T.Glatard - S.D. Olabarriaga - MICCAI-G'08 - Sept. 6th 10/16

Results: job farming (#2)

  • With data transfers – no latency
  • Workflow similar to monolithic up to nF = 3.nR

CPU time = 3. data transfers

slide-11
SLIDE 11

T.Glatard - S.D. Olabarriaga - MICCAI-G'08 - Sept. 6th 11/16

Results: job farming (#3)

  • With data transfers and latency
  • Workflow more sensitive to latency

Latency increases

slide-12
SLIDE 12

T.Glatard - S.D. Olabarriaga - MICCAI-G'08 - Sept. 6th 12/16

Results: parameter sweep

  • With data transfers and latency
  • Workflow outperforms monolithic for realistic

latency values

Latency increases

slide-13
SLIDE 13

T.Glatard - S.D. Olabarriaga - MICCAI-G'08 - Sept. 6th 13/16

Output organization: problem

  • Regular feat output
  • Workflow output (as in MOTEUR)
  • scan_name-param.feat/

reg/ ... stats/ zstat1.nii.gz report.html design.gif ... ...

  • Directory structure

matches experiment logic

  • Easy file retrieval
  • Automatically

generated file names

  • Provenance info

available

slide-14
SLIDE 14

T.Glatard - S.D. Olabarriaga - MICCAI-G'08 - Sept. 6th 14/16

Output organization: constraints

  • Meaningfulness
  • Easily retrieve a particular file
  • Associate it to the input parameters
  • Reusability
  • Components among workflows
  • Workflows among users
  • Grid-awareness
  • Distributed storage
  • File replication, move
  • LFN change

LFN1 SURL1 ... GUID ... LFNn SURLm

slide-15
SLIDE 15

T.Glatard - S.D. Olabarriaga - MICCAI-G'08 - Sept. 6th 15/16

Output organization: existing approaches

Components produce GUIDs Provenance GUID browsing Result LFNs function of inputs LFN annotation with metadata Meaningful Reusable Grid-aware

       

   

slide-16
SLIDE 16

T.Glatard - S.D. Olabarriaga - MICCAI-G'08 - Sept. 6th 16/16

Conclusions

  • Description of feat workflow feasible
  • For a specific use-case (e.g. fixed number of EVs)
  • Requires a tiresome analysis
  • Workflow performance evaluation
  • Execution time reduction for parameter sweep
  • Data transfers and latency prevail for job farming
  • Output organization
  • Should be grid-aware, reusable and meaningful
  • Components-, workflow- and execution-independent
  • Sharing complex workflows is still difficult
  • Use-case specific implementation
slide-17
SLIDE 17

T.Glatard - S.D. Olabarriaga - MICCAI-G'08 - Sept. 6th 17/16

http://www.vl-e.nl/

Thanks for your attention!

Downloads, demos and videos available from https://pc-vlab18.science.uva.nl:8080/vbrowser/ (and on my laptop...) Acknowledgement:

  • AMC, University of Amsterdam
  • S.Olabarriaga, K. Boulebiar, A. van Kampen
  • A. Nederveen, M. Caan, S. Gevers, R. Soleman, D. Veltman
  • Informatics, University of Amsterdam
  • P. de Boer, A. Belloum
  • R. Belleman, R. Bakker
  • S. Marshall, M. Roos
  • Prof. Dr. L.O.Hertzberger
  • SARA Supercomputing Services
  • M. Bouwhuis, J. Engelberts, Ron Trompert, grid-support@sara.nl
  • National Institute for Nuclear Physics and High Energy Physics (NIKHEF)
  • J.J. Keijser, D. van Dok, J. Templon, grid-support@nikhef.nl