| Literature DB >> 24769182 |
B Sengupta1, K J Friston2, W D Penny3.
Abstract
Data assimilation is a fundamental issue that arises across many scales in neuroscience - ranging from the study of single neurons using single electrode recordings to the interaction of thousands of neurons using fMRI. Data assimilation involves inverting a generative model that can not only explain observed data but also generate predictions. Typically, the model is inverted or fitted using conventional tools of (convex) optimization that invariably extremise some functional - norms, minimum descriptive length, variational free energy, etc. Generally, optimisation rests on evaluating the local gradients of the functional to be optimized. In this paper, we compare three different gradient estimation techniques that could be used for extremising any functional in time - (i) finite differences, (ii) forward sensitivities and a method based on (iii) the adjoint of the dynamical system. We demonstrate that the first-order gradients of a dynamical system, linear or non-linear, can be computed most efficiently using the adjoint method. This is particularly true for systems where the number of parameters is greater than the number of states. For such systems, integrating several sensitivity equations - as required with forward sensitivities - proves to be most expensive, while finite-difference approximations have an intermediate efficiency. In the context of neuroimaging, adjoint based inversion of dynamical causal models (DCMs) can, in principle, enable the study of models with large numbers of nodes and parameters.Entities:
Keywords: Adjoint methods; Augmented Lagrangian; Dynamic causal modelling; Dynamical systems; Model fitting
Mesh:
Year: 2014 PMID: 24769182 PMCID: PMC4120812 DOI: 10.1016/j.neuroimage.2014.04.040
Source DB: PubMed Journal: Neuroimage ISSN: 1053-8119 Impact factor: 6.556
Fig. 1Forward Sensitivity The solid path indicates a trajectory of points x, with n = 1…5, for a dynamical system with parameters p. The dotted path indicates the trajectory for the same dynamical system but with parameters . The dotted path can be reached from the solid path via the total derivative . The Forward Sensitivity approach provides a method for computing this derivative.
Comparison of the different gradient computation methods. The flow eqn. is either linear or non-linear, with P parameters and N state variables.
| Finite differences | Forward sensitivities | Adjoint | |
|---|---|---|---|
| Suitability | Arbitrary | N ≫ P | P ≫ N |
| Cost | (1 + | 1 linear adjoint eqn. + 1 flow eqn. | |
| Key steps | Integrate flow eqn. Parametrically perturb flow P times | Integrate the coupled flow and sensitivity eqns. | Integrate flow eqn. Integrate adjoint eqn. |
Fig. 2Linear System (A) The 5-dimensional state-space model and (B) the linear evolution of its eigenstates.
Fig. 3Computational efficiency for linear systems (A) Comparison of the parametric gradient obtained by the three methods. (B) Scaling of run-time as a function of the number of nodes. The absolute and relative tolerances of FD and FS methods were set to 10− 7 while the tolerances for the AM method were fixed to 10− 3. Simulation time was fixed at 400 ms.
Fig. 4Computational efficiency for non-linear systems (A) Comparison of the gradient obtained by the three methods. Here, the last five parameters quantify the intrinsic oscillator frequencies, and the first 40 parameters the sine and cosine interaction terms. (B) Scaling of run-time as a function of the number of nodes. The absolute and relative tolerances of FD and FS methods were set to 10− 7 while the tolerances for the AM method were fixed to 10− 3. Simulation time was fixed at 100 ms.