The Functional Neuroimaging Data Pipeline. Activation patterns obtained from functional neuroimaging studies reflect interactions among the components of a complicated data pipeline of experimental design parameters, and a series of methodological choices including data acquisition, post-acquisition processing and data-analysis model selection.
Why Optimize The Data Pipeline? As the field of functional neuroimaging matures and becomes more widely used, standardization and optimization of data-analytic approaches, and automated quality control procedures will become increasingly important. For many researchers the generation of a "plausible result" that can be linked to the neuroscientific literature, perhaps through a hypothesis, is often taken as justification of the pipeline choices made providing a systematic bias in the field towards prevailing neuroscientific expectations ( Strother et al., 1995; Skudlarski et al., 1999). We do not advocate ignoring the existing neuroscientific knowledge base, but both its implicit and explicit use needs to be balanced against a concerted effort to independently define and test the validity of the rapidly increasing range of experimental and methodological techniques used in functional neuroimaging. The NPAIRS approach is guided by the rapidly developing field of predictive learning in statistics (Friedman, 1994; Larsen and Hansen, 1997; Ripley, 1998).
NPAIRS Performance Metrics. NPAIRS defines the "validity or quality" of functional neuroimaging results, and the experimental and pipeline choices made in obtaining them, by quantitatively measuring and optimizing their predictive performance in a crossvalidation resampling framework. This is defined as the ability of the pipeline to produce data-analytic model(s) parameters from a training dataset that can accurately predict the values of experimental design parameters (e.g., brain-state labels, performance measures) in an independent test dataset, and also reproduce the activation image parameters between the training and test datasets.
|