Literature DB >> 32632961

A data-driven approach to optimising the encoding for multi-shell diffusion MRI with application to neonatal imaging.

Jacques-Donald Tournier^1,2, Daan Christiaens^1,2, Jana Hutter^1,2, Anthony N Price^1,2, Lucilio Cordero-Grande^1,2, Emer Hughes^1,2, Matteo Bastiani^3,4, Stamatios N Sotiropoulos^3,4, Stephen M Smith³, Daniel Rueckert⁵, Serena J Counsell², A David Edwards², Joseph V Hajnal^1,2.

Abstract

Diffusion MRI has the potential to provide important information about the connectivity and microstructure of the human brain during normal and abnormal development, noninvasively and in vivo. Recent developments in MRI hardware and reconstruction methods now permit the acquisition of large amounts of data within relatively short scan times. This makes it possible to acquire more informative multi-shell data, with diffusion sensitisation applied along many directions over multiple b-value shells. Such schemes are characterised by the number of shells acquired, and the specific b-value and number of directions sampled for each shell. However, there is currently no clear consensus as to how to optimise these parameters. In this work, we propose a means of optimising multi-shell acquisition schemes by estimating the information content of the diffusion MRI signal, and optimising the acquisition parameters for sensitivity to the observed effects, in a manner agnostic to any particular diffusion analysis method that might subsequently be applied to the data. This method was used to design the acquisition scheme for the neonatal diffusion MRI sequence used in the developing Human Connectome Project (dHCP), which aims to acquire high quality data and make it freely available to the research community. The final protocol selected by the algorithm, and currently in use within the dHCP, consists of 20 b=0 images and diffusion-weighted images at b = 400, 1000 and 2600 s/mm2 with 64, 88 and 128 directions per shell, respectively.

Entities: Chemical

Keywords: HARDI; diffusion MRI; multi-shell; neonatal imaging

Mesh：

Substances：
Contrast Media

Year: 2020 PMID： 32632961 PMCID： PMC7116416 DOI： 10.1002/nbm.4348

Source DB: PubMed Journal: NMR Biomed ISSN： 0952-3480 Impact factor: 4.478

Introduction

Diffusion MRI (dMRI) has been the focus of intense research over the last 20 years, holding great promise for investigation of tissue microstructure due to the technique's unique sensitivity to the micron-scale diffusion of water. In recent years, there has been increasing interest in the use of so-called multi-shell dMRI, driven to a large extent by improvements in image acquisition and reconstruction methods, which can now provide the amount of data required within clinically feasible scan times. A number of promising methods have been proposed to model data of this nature (eg, Jensen et al[1]; Jeurissen et al[2]; and Kaden et al, among others[3-6]). Many of these approaches are based on the so-called standard model of diffusion,[7] consisting of intra- and extra-axonal compartments convolved with a corresponding fibre orientation function, optionally with an additional isotropic cerebro-spinal fluid compartment.[6,8-16] More recent work has also investigated the use of additional contrast mechanisms, such as spherical and planar tensor encoding,[17-20] more complex Q-space trajectories,[21] variable echo times[22] and variable inversion times,[23,24] among others. Many of these entail a considerable increase in scan time, and the use of custom sequences, limiting their widespread use in the immediate future. Furthermore, these methods are still under active research, and a consensus has yet to be reached regarding which of these should be deployed. We therefore decided to focus on optimisation of the multi-shell pulsed gradient spin echo (PGSE) sequence, as it is likely to remain the most widely used acquisition strategy for the foreseeable future, especially for large longitudinal and cross-sectional studies. A common requirement for researchers when setting up a multi-shell dMRI protocol is the determination of optimal imaging parameters, particularly the number of b-value shells and the number of diffusion-weighting directions and the b-value per shell. However, there are many different reconstruction algorithms available, and no clear consensus as to which is best. The lack of ground truth is also problematic, motivating the use of hardware phantoms with known fibre configurations.[25] This makes it very difficult to identify optimal imaging parameters, since any optimisation will typically aim to reduce errors in the estimated parameters of a given model; this obviously does not provide any guarantees that the data would be suitable for a different (possibly yet to be devised) reconstruction algorithm. To illustrate the difficulties with such an approach, Figure shows the median absolute deviation in DTI metrics computed using different subsets of an extended data acquisition to seek protocols that produce low errors. The approach is to calculate mean diffusivity (MD) and fractional anisotropy (FA) using a target analysis method of choice for all the available data (six shells, 50 directions on each, five subjects) to determine a “reference value” and then to do the same using only three of the shells to find feasible duration protocols that are reliable in the sense of producing the lowest median error (see the Appendix for full details). This was performed using three different commonly used tensor estimation strategies, namely, ordinary least-squares (OLS), weighted least-squares (WLS) and iteratively reweighted least-squares (IWLS),[26-29] all implemented within MRtrix3. Depending on the analysis method used, different “optimal” protocols are identified (see the Figure 1 caption for details). It is striking that this is the case even although the same model of diffusion (the diffusion tensor) is assumed for all cases.

Figure 1

Optimising sampling schemes for measuring fractional anisotropy (FA) and mean diffusivity (MD) using minimal discrepancy relative to a superset of all available data for three different DTI analysis methods (ordinary least squares [OLS], weighted least squares [WLS] and iteratively reweighted least squares [IWLS]). The optimal protocols, which are different in each case, are highlighted in red. The bar graphs show the median absolute difference in DTI metrics obtained using all the data compared with those obtained using different subsets of three of the available b-values. The results for MD are shown in units of μm2/s (FA is dimensionless). Differences were computed voxel-wise, and the median absolute difference was calculated over the whole brain for each of the five subjects included in the study, then averaged across subjects. Results are shown for FA (left) and MD (centre), computed using three commonly used estimators: OLS (top row), WLS (middle row) and IWLS (bottom row). The x-axis corresponds to the numbered schemes shown in the table on the top right, which specify the included shells. The schemes with the lowest discrepancy are highlighted in red. The bottom right panel shows the equivalent median absolute difference between the metrics obtained with the different fitting approaches using all the available data (ie, the difference between the references used for the other plots)

There is therefore a need for a means to determine acquisition parameters that provide the most eloquent, information-rich data in a manner agnostic to any particular reconstruction algorithm. This is particularly important for large-scale data collection projects that will provide a shared resource to be used by diverse academic groups for purposes that the users, rather than the data collectors, define. This study was motivated by the need to optimise the dMRI acquisition scheme for the developing Human Connectome Project (dHCP), which aims to acquire structural, functional and diffusion MRI data from more than 1000 neonates and fetuses, among other clinical, behavioural and genetic measures (http://www.developingconnectome.org/). Since these data are to be made freely available to the wider neuroimaging community, it was important to acquire data suitable for the widest possible range of analysis methods. Moreover, the diffusion-weighted (DW) signal in the perinatal age range differs markedly from that in adults, in that it exhibits much higher apparent diffusion (suggesting increased extracellular and reduced intracellular content), and much lower anisotropy. Furthermore, these characteristics vary strongly as a function of both location and age, as the various structures in the brain mature at different rates. Much analysis work to date has used algorithms designed for adult studies, but there is clearly scope for optimising all aspects of the analysis/modelling pipeline and this could result in methods that are different from existing approaches. Taken together, these factors provide strong motivation to develop a data-driven strategy that can be used to design data acquisitions for a desired target subject group, optimised for information content rather than to support a predefined processing method. To address this challenge, we propose to use an information-theoretic approach, similar in spirit to our previous work on optimising single-shell dMRI parameters.[30]

Methods

In this paper, we propose a framework to determine the optimal number of shells, along with the corresponding b-value and the number of directions sampled. To make this problem tractable, we separate the b-value dependence from the orientation dependence and treat each problem separately. For the b-value dependence, the central concept is then to identify a linear basis to represent the b-value dependence of the data (voxel values) as measured empirically, and to determine the number of coefficients of that basis that can realistically be measured in practice. Given this number, the task is then to identify a set of parameters (i.e., the number of shells, b-value per shell and the number of DW volumes per shell) that provide optimal sensitivity to these coefficients. The angular dependence was itself the focus of previous work, applied to the adult case[30]; here, we simply deploy the same methodology to determine the per-shell sampling density required to capture the detectable number of spherical harmonics. Briefly, the approach expresses the dMRI signal for a single b-value shell using the spherical harmonics basis, and identifies the highest angular frequency term that can realistically be detected in the data. This analysis is performed only in voxels deemed to contain a single-fibre population, since these contain the highest angular frequency content, and also because this allows averaging across voxels (after realignment of the fibre direction to a common axis). For further details, the reader is referred to Tournier et al.[30] This analysis provides an estimate of the angular frequency content at each b-value, which translates directly into the corresponding minimum number of directions required for nonlossy sampling (i.e., without aliasing in the angular domain). Once the optimal b-values have been identified, this analysis can be used to ensure that the sampling density within each shell is adequate. To determine the sampling requirements across b-values, we focus on the b-value dependence of the orientationally averaged raw DW signal as a function of b-value. Assuming sufficiently dense and uniform sampling, the mean DW signal over a single b-value shell is rotationally invariant, and independent of the fibre arrangements.[3,31,32] Near-uniform distribution of gradient directions can be obtained using electrostatic repulsion approaches,[33,34] as used in this study, and sufficient sampling density can be verified using the approach described above.[30] As an overview, this study consists of the following steps: Acquire data that are sufficiently densely sampled in the b-value and angular domains to capture all the expected features of the DW signal. Identify a suitable data-driven linear basis to represent the signal. Estimate the effect size observed in the signal for each coefficient of that basis. Identify the set of acquisition parameters that provide optimal sensitivity to these coefficients. The optimisation of the acquisition parameters (the last step in the list above) was performed by: For a given set of b-values and corresponding number of directions, define a measure of sensitivity to the coefficients of interest. For a given set of b-values, optimise this sensitivity measure with respect to the number of directions per b-value shell. Given a fixed number of b-value shells, optimise the sensitivity measure with respect to the value of these b-values (with the corresponding number of directions optimised as per the previous step). Each of these steps is described in more detail in the following sections.

Data acquisition and preprocessing

Data were acquired from five neonates scanned at term-equivalent age (see Table 1 for details) on a Philips Achieva 3 T magnet at the Evelina Newborn Imaging Center at St. Thomas’ Hospital (London, UK). The study was approved by the National Research Ethics Committee and written informed parental consent was obtained prior to scanning. The system was equipped with a standard 32-channel head coil, using a DW pulsed gradient spin-echo EPI sequence (TE/TR = 70/2260 ms, 2 × 2 mm voxel size, 11 slices with 2 mm thickness with a 1 mm slice gap, 112 × 112 matrix, SENSE factor 2, single anterior–posterior phase-encode direction). Data were collected along 50 noncollinear directions optimised using electrostatic repulsion[33] at b-values of 500, 1000, 2000, 3000 and 4000 s/mm2, along with five b=0 s/mm2 volumes. These directions were optimised using the MRtrix3 “dirgen” command, and the same set of 50 directions was used for each shell. We note that in adults, a spherical harmonic order of 8 (45 parameters) was shown to be sufficient to capture all realistically measurable components of the signal, even up to b = 5000 s/mm2.[30] Therefore, the 50 directions used in the acquisition here are sufficient to satisfy this criterion, especially considering that neonatal data will exhibit lower anisotropy than observed in adults.[35-37]

Table 1

Details of the subjects included in this study

GA (weeks)	PMA (weeks)	Clinical features	Imaging findings
39 + 2	45 + 4	Smith-Lemli-Opitz syndrome	Cerebellar hypoplasia, small hypothalamic hamartoma
38 + 0	40 + 1	Fetal alcohol syndrome	Small but otherwise normal imaging appearance
40 + 5	42 + 5	Seizures, antenatal ventriculomegaly	Perisylvian polymicrogyria
33 + 3	36 + 1	Preterm	small germinal matrix haemorrhage
39 + 6	40 + 5	Meconium aspiration, mild hypoxia	Normal imaging appearance

For each subject, the data were first corrected for motion- and eddy current-induced distortions using the FSL EDDY (version 5.0.8).[38,39] The mean DW signal for each shell was then computed within a brain mask. Typical images of the resulting mean signal are shown in Figure 2.

Figure 2

Images of the mean raw dMRI signal for all five subjects (columns), for each b-value (rows). These constitute the input for the optimisation algorithm. Note that the images are windowed independently for each b-value to allow visualisation of the contrast in the high b-value images

Identifying a suitable linear basis

In contrast to the angular domain, where spherical harmonics form a natural basis,[30] there is no such natural basis to represent the b-value dependence of the signal. While some bases have been proposed,[40-47] there is an inherent scale dependence that makes the analysis dependent on user-defined parameters (D).[48] For this reason, we adopted a fully data-driven, model-free basis, derived using matrix decomposition approaches. We use the compact singular value decomposition (SVD), which decomposes a general m × n matrix A (with m ≤ n) as USV, where U is a m × m orthonormal matrix, S is a m × m diagonal matrix of singular values, and V is a m × n orthonormal matrix. To make use of this decomposition, we form the compact SVD of the transpose of D: More specifically, we consider a multi-shell protocol consisting of N b-values, with n uniformly distributed DW directions for the shell index s. We form the N × N matrix D consisting of the mean DW signal for each brain voxel at each b-value, where N is the number of voxels in the subject-specific brain mask. We wish to decompose D into a N × N matrix W of per-voxel weights, and a N × N orthonormal basis matrix H, such that: The weight and basis matrices can then be obtained as: Furthermore, thanks to the properties of the SVD, the singular values (i.e., the diagonal of S) can be used to obtain an estimate of the typical component-wise effect sizes ϵ (this evaluates to the root-mean-square value of the weights across the rows of W):

Measure of sensitivity

The decomposition described above provides a linear breakdown of the effects observed in the data, and of their relative contributions. This can be used to predict the sensitivity to these various effects for a candidate acquisition with the same b-value shells, but different numbers of volumes per shell. Assuming independent and identically distributed Gaussian noise, the expected variance in the measured mean DW signal for each shell s is simply , with n the number of volumes in shell s, and σ 2 the variance per measurement in the data. This rests on the observation that provided the sampling density is uniform and sufficiently dense (as per Tournier et al[30]), the variance in the mean DW signal for a given shell is inversely proportional to the number of DW volumes acquired at that b-value; the mean DW signal is orientationally invariant, and each volume contributes equally to its estimation. The predicted variance in the mean per-shell signal can be used to predict the variance in the estimated basis coefficients using the law of propagation of errors.[49] Based on Equation 1, if d = Hw is the vector of mean per-shell signals for a single voxel (where w is the corresponding vector of basis coefficients), the variance–covariance matrix Σ for the coefficients w is related to the corresponding variance–covariance matrix Σ for the mean per-shell signals d according to: Since H is orthogonal by construction, H −1 = H, yielding: With repeated measurements per shell, (see above). Hence: Finally, we compute the coefficient of variation (CV) for coefficient c as the ratio of its standard deviation (given by the square root of its variance Σ(c,c)) to its effect size ϵ (as provided in Equation 4): Note that CV is equivalent to the inverse of the contrast-to-noise ratio (CNR) of the coefficient of interest.

Optimising the number of directions per shell

The optimal acquisition should strive to minimise the combined CV of the N estimated coefficients. We therefore use the sum of squared coefficients of variation (SSCV) as a measure of optimality: Note that if the effect size were the same for all coefficients, this is proportional to the sum of the variances, which is appropriate for independent measurements. However, when effect sizes differ, this measure is most strongly influenced by the coefficients with the highest CV, pushing the optimisation towards parameters that maximise the CNR of the noisiest components, without overly penalising high CNR components. We now need to derive the number of directions per shell = {n} that maximise the sensitivity, and hence minimise the SSCV, within the constraint that the number of DW volumes, N total, is fixed by scan time limitations: This can be solved using Lagrange multipliers: The derivative with respect to the number of directions in shell s is: Setting the derivative to zero yields: This therefore provides the optimal number of DW volumes for shell s relative to the other shells. Since we assume that the total number of volumes is fixed by scan time constraints, the absolute numbers can trivially be obtained by normalising to N total: where is the value obtained by setting λ in Equation 13 to an arbitrary constant. It is interesting to note that with this derivation, the fractions of measurements on each shell are independent of N total or σ: they depend purely on the intrinsic effects in the DW signal.

Optimising the b-values

So far, the derivation above holds for data acquired over a fixed number of shells with predetermined b-values. If a sufficiently large number of different combinations of shells and b-values were available, this could be used to identify the optimal set of parameters, by searching for the subset of b-values with minimum SSCV (Equation 9). It is, however, impractical to acquire data over a sufficient number of b-values to ensure the search is exhaustive, particularly in a neonatal cohort where scan times must be kept short. We therefore used interpolation methods to resample the acquired data, allowing the generation of predicted data for any combination of b-values; this approximation is justified on the grounds that the diffusion signal varies smoothly as a function of b-value, so that interpolation errors are likely to be minimal. For this purpose, we use the Piecewise cubic Hermite Interpolating Polynomial (PCHIP) algorithm[50] available in Matlab (MathWorks, Natick, MA, USA), which has the desirable property of explicitly preserving monotonic behaviour between sample points. Given a fixed desired number of shells, the optimal b-values for each shell are determined using a simple nonlinear optimisation approach (the Nelder–Mead simplex algorithm,[51] as implemented in Matlab’s fminsearch routine). For each iteration, the mean DW signals per voxel at the current b-values are obtained by PCHIP interpolation from the measured data. The SSCV of the parameters is then optimised with respect to the number of directions per shell as described earlier in Optimising the number of directions per shell (section 2.4). Finally, the optimised SSCV value is used as the cost function for the search over b-values.

Accounting for T 2 decay

So far, the derivation has ignored the effect of T 2 decay on the sensitivity of the acquisition. In practice, the shell with the highest b-value will dictate the echo time of the entire acquisition (assuming a constant echo time is desired, as is typically the case), and this will have a direct impact on the SNR of the data. To account for these effects, we include a b max-dependent T 2 decay term, assuming a standard PGSE sequence, similar to Alexander and Barker,[52] with square diffusion-sensitising gradients and the following parameters: A 5 ms delay between the 90° excitation pulse and the onset of the first DW gradient pulse. A 5 ms pulse duration for the 180° refocusing pulse (including slice selection gradients and crushers, if any). A 15 ms delay between the end of the second DW gradient pulse and echo time. 80 mT/m gradient strength. The attenuation factor is then computed as exp(−TE/T 2), and is used to scale the effect sizes in Equation 4, inherently penalising parameters sets that include a high b-value shell. Results were generated assuming T 2 = 100, 150 and 200 ms, which are typical for neonatal brain,[53] as well as ignoring T 2 relaxation.

Analysis within and across subjects

As described, the analysis requires input data in the form of a N × N matrix corresponding to the mean DW signal for each of the N = 6 b-value shells across all N brain voxels. The analysis was applied for each subject independently within a mask of the brain voxels (with N ranging from 14 894 to 23 481 voxels). In addition, the analysis was also performed on a single dataset formed by concatenating the data for all subjects along the slice axis, for a combined total of N = 99 306 voxels. Unless otherwise stated, the results shown were produced with this combined analysis.

Results

The angular domain analysis shows that the angular frequency content is much lower in neonates (Figure 3) than has been shown previously in adults,[30] as expected. In our data, spherical harmonic terms of order 8 and above are indistinguishable from noise for any realistic SNR value (Figure S1). Note that this does not imply that these terms are strictly zero, but that they can, for all intents and purposes, be neglected for any reasonable voxel-wise analysis. Order 6 terms were also small but could be observed with b ≥ 2000 s/mm2, implying that a minimum of 28 directions are required at these b-values. At lower b-values, only spherical harmonic terms of order 2 or 4 could be detected, implying a minimum of 6 and 15 directions, respectively.

Figure 3

Angular frequency content of dMRI signal in neonates, as a function of b-value (cf.[30]). Top: polar plot of the signal (fibre orientation is along the vertical axis), showing the rapid decrease in signal with increasing b-value, along with the expected increase in angular contrast. Bottom: a plot of the corresponding spherical harmonic coefficient amplitudes. Angular frequencies above l = 4 are already within the noise floor, although there is a suggestion that l = 6 might be detectable for b ≥ 2000 s/mm2. This implies that a minimum of 15 and 28 DW directions are required for b < 2000 s/mm2 and b ≥ 2000 s/mm2, respectively

The basis functions H and weights maps W (cf. Equation 3) derived using SVD are shown in Figures 4 and 5, respectively, and are remarkably consistent across subjects. The first component corresponds to the mean DW signal over all voxels, with the higher-order components showing increasingly rapid oscillations with b-value. Clear structure can be observed in the corresponding weights maps for at least the first four components (although the fourth component may possibly contain some residual misregistration due to, for example, motion or eddy currents, as well as any actual biology-related structure). In our data, the fifth order lacks consistency across subjects, suggesting that the remaining spatial structure may be strongly related to artefacts in the data. The sixth and highest order that could be derived from the test data given the number of shells, lacks both consistency across subjects and anatomical coherence, suggesting that noise and artefacts have become completely dominant. The effect sizes for these components decrease rapidly for the higher-order coefficients, as shown in Table 2.

Figure 4

The basis functions observed in the data, for all five subjects included in the study. The basis functions estimated from each subject when analysed independently are shown by the solid coloured lines, and show a high degree of consistency across subjects. The dotted black line shows the basis functions estimated from the combined data across all subjects. The basis functions can be seen to represent increasingly rapid oscillations in the b-value dependence, with the first component corresponding to the mean signal across the data. See Table 2 for the corresponding effect sizes

Figure 5

Weights maps for each component in Figure 4, for all five subjects included in the study

Table 2

Effect sizes for each observed component in the decomposition (as shown in Figure 4, across all subjects (arbitrary units)

component	1	2	3	4	5	6
effect size	1.000	0.134	0.038	0.018	0.009	0.004

The optimal b-values selected by the algorithm are shown in Figure 6 for the case of 2-, 3- and 4-shells (not including the b=0 shell), assuming a T 2 value of 150 ms in neonatal brain tissue.[53] Figure 7 shows the effect of different T 2 values for the 3-shell case. In all cases, a low b-value in the region of b = 400 s/mm2 is included, and can be seen to provide good separation between the first component and the other smaller components. In general, the algorithm seems to distribute b-values in a way that provides optimal separation between the different components, as desired. When more shells are included, the highest b-value selected increases, particularly when T 2 relaxation is ignored. When relaxation effects are included, the maximum b-value never exceeds b ≈ 2600 s/mm2.

Figure 6

Optimal b-values selected by the algorithm, for 2-shell (left), 3-shell (middle) and 4-shell (right) cases, overlaid on the corresponding basis functions. The vertical dotted lines indicate the optimal b-values identified. Results obtained assuming a T 2 value of 150 ms

Figure 7

Effect of T 2 on optimal b-values selected for the 3-shell case. The vertical dotted lines indicate the optimal b-values identified the T 2 value stated for each plot. While T 2 does have an influence, it is relatively minor, with lower T 2 leading primarily to a slightly reduced maximum b-value, with little to no impact on the other selected b-values

The b-values selected as optimal are stable across subjects, apart from the maximum b-value, as shown in Figure 8 for the 3-shell case. The maximum b-value varies in this case from b ≈ 2300 to 3500 s/mm2; this is likely related to the signal levelling out as a function of b-value in this range, leading to a broad optimum.

Figure 8

Optimal b-values selected by the algorithm when applied to each subject individually for the 3-shell case and assuming T 2 = 150 ms. The vertical dotted lines indicate the optimal b-values identified in each case. While there is some variation in the maximum b-value, the other b-values remain very stable

Table 3 shows the optimal results for 2-, 3- and 4-shells assuming T 2 = 150 ms (suitable for neonates at 3T[53]). Along with the optimal b-values, the optimal number of directions per shell are also listed as a percentage of the total number of volumes acquired. For reference, the CNR for the coefficients themselves is shown in Table 4, assuming SNR = 30 and N = 100 imaging volumes. As expected, the CNR drops rapidly for the smaller coefficients, and is typically improved by using the smallest number of shells needed to characterise that coefficient (eg, b=0 + 2 shells to characterise the third coefficient). These CNR calculations can be generalised to arbitrary conditions by adjusting them based on the actual number of volumes and SNR values: As expected, larger numbers of volumes are needed for the higher b-value shells. This is expected given the lower SNR available at these b-values, and also helps to ensure that the minimum orientation sampling density requirements (identified earlier using the angular frequency analysis) are satisfied.[30]

Table 3

Optimal b-values (in s/mm2) for each shell and corresponding number of directions (N), expressed as a percentage of the total number of volumes acquired. Results are shown for the 2-shell, 3-shell and 4-shell cases, assuming a T 2 value of 150 ms


2 shells	b-value	0	395	1781
	N_total	12%	29%	59%

3 shells	b-value	0	404	990	2574
	N_total	6%	19%	28%	47%

4 shells	b-value	0	405	990	1901	3784
	N_total	3%	9%	16%	29%	43%

Table 4

Contrast-to-noise ratios (CNR) for each component, as would be estimated using the corresponding parameters in Table 3. These values assume a T 2 value of 150 ms, a total of N = 100, and an SNR in the b=0 images of 30

component	1	2	3	4	5
2 shells	152	18	6.0
3 shells	125	16	4.5	2.7
4 shells	88	13	4.0	2.1	1.3

Based on this optimisation, a 3-shell protocol was adopted for the dHCP.[54] The acquisition includes a total of 300 samples distributed in the proportions listed in Table 3. Figure 9 shows example images and shell average images acquired as part of the dHCP study (PGSE EPI acquisition with multiband factor 4, 64 slices, TE/TR = 90/3800 ms, 1.5 × 1.5 × 3 mm resolution, four phase-encoding directions, SENSE 1.2, Partial Fourier 0.855), after correction for EPI distortion, subject motion and outliers.[38,39] While the individual images in the highest shell display the expected low SNR, the shell average reveals a highly coherent anatomical structure confirming its information richness, which can be well accessed by virtue of the large number of samples collected. All shells have angular sampling in excess of the requirements determined from the pilot data.

Figure 9

Single diffusion-weighted images (top row) and mean diffusion-weighted image (bottom row) for each shell for a typical subject acquired using the final dHCP protocol

Discussion

This study provides a data-driven framework to identify the optimal parameters for multi-shell HARDI acquisitions, based on an information-theoretic approach. It is independent of any particular reconstruction/diffusion analysis algorithm, which is a significant advantage given that many of these approaches are under active development, have a number of different tuneable parameters, and are designed to provide different types of information. As illustrated in the Appendix, a protocol optimised for any one of these would therefore not be guaranteed optimal for any of the other methods (or indeed, any other parameters or outputs of the same method). The proposed design method allows protocols to be implemented now that will remain future-proof as novel analysis approaches are developed. While a number of decompositions could be used for the b-value dependence, singular value decomposition is arguably the most appropriate for this study. The SVD provides a unique and complete signal representation with a clear interpretation as components of decreasing effect size. In contrast to other factorisations that deploy biophysical constraints, for example, to enforce nonnegativity in both weights and basis functions,[55-57] the SVD deliberately avoids any such constraints. As such, we can ensure that the optimised multi-shell HARDI scheme is driven by the data itself, and not by implicit assumptions of a particular tissue model. The optimisation suggests that the optimal number of directions per shell can be expressed as a fraction of the total number of volumes to be acquired. Furthermore, the CNR of the estimated coefficients will scale with the square root of the total number of measurements. These results can therefore easily be tailored to the particular circumstances under which the protocol is to be used, using the recommendations outlined below: Identify the desired imaging parameters (eg, resolution, spatial coverage) that will determine the SNR and repetition time of the sequence. Divide the scan time allocated to this sequence by theTR to obtain an estimate of the total number of volumes to be acquired. Based on the SNR and total number of volumes, compute the CNR for the various coefficients for the 2-, 3- and 4-shell scenarios, using Equation 15. Decide on the number of shells that will be used by selecting that which predicts usable CNR for all estimated coefficients (ie, the CNR is above the noise floor for all coefficients). Using our previous orientation-domain approach,[30] verify that the number of directions for each b-value is sufficient to avoid aliasing, given the noise level in the images. If the number of directions per shell is not sufficient to avoid aliasing in the angular domain, the data acquisition will not provide optimal characterisation of the dMRI signal. There are several ways to address this issue: Increase the number of volumes in the acquisition, and hence the scan time, to meet the minimum sampling requirements. Select a lower number of shells, as this increases the number of directions for the remaining shells. This decision will depend on whether the error introduced by inadequate sampling of the b-value domain (i.e., having too few shells) is greater than that introduced by inadequate sampling of the angular domain (i.e., having too few directions). Redistribute directions between shells to ensure the minimum sampling requirements are met, striving to maintain the number of directions per shell as close to those identified as optimal. While this implies a departure from the optimal parameters for overall CNR, the actual impact is likely to remain acceptable for small adjustments, and can be evaluated using the approach outlined in this study. It is interesting to note that the b-values identified as optimal are very robust across subjects (Figure 8): for the 3-shell case, b = 400 and 1000 s/mm2 are identified in all subjects with high reproducibility (the range is b = 397-411 and 988-1001 s/mm2, respectively). There is more variation for the highest b-value (ranging from b = 2360 to 3700 s/mm2), as can be seen from Figure 8. Moreover, these values are remarkably consistent when different numbers of shells are selected (b = 400 s/mm2 is present in all schemes, b = 1000 s/mm2 is present for the 3- and 4-shell schemes) (Figure 6). However, given the smoothness of the principal components, minor variations in the exact b-values used are unlikely to have a large impact on the optimality of the acquisition. Perhaps surprisingly, these values are also remarkably insensitive to changes in T 2 (Figure 7). This can be explained by considering that to provide sensitivity to all N components, acquisition schemes will need to include a range of b-values, so that the highest b-value will necessarily be high relative to the others. This places a constraint on the minimum achievable echo time: any reduction in the echo time (and hence the maximum b-value) entails a greater reduction in sensitivity than the increase in SNR. Furthermore, this relative insensitivity is also likely to be a consequence of the long T 2 values observed in neonatal MRI at 3T, which typically range from more than 100 to 300 ms (see, for example, table 4 in Williams et al[53]). It is likely that the same analysis performed in adult data, where T 2 values are considerably shorter, would show a stronger effect of T 2. It is also interesting to look at the robustness of the acquisition to deviations in the parameters from the values identified as optimal. Figure 10 shows the effect of varying the b-value of each b≠0 shell in turn, in terms of its impact on the CNR of the individual components, and on the overall SSCV metric (the measure of sensitivity minimised in the present study). It is clear that higher sensitivity to individual components can be achieved, but this comes at the expense of sensitivity to other components. For example, lowering the first shell from b = 400 to b = 300 s/mm2 would increase sensitivity to the first component by ~10%, but would also reduce sensitivity to components 3 and 4 by ~10%. The SSCV measure shows a broad minimum, particularly for the highest shell, indicating that the acquisition would still perform well if the b-values differed somewhat from those identified.

Figure 10

The impact of deviations from the optimal solution on the relative contrast-to-noise ratio (CNR) of each component (coloured lines) and on the sum of squared coefficients of variation (SSCV) measure (black line). Each plot shows the effect of changing the b-value of the corresponding shell away from the value identified as optimal (denoted by the dashed vertical line). The CNR for each component is shown as a fraction of its value at the overall optimum to allow all plots to coexist on the same scale. This suggests that the optimum is fairly broad, and performance is likely to be relatively tolerant of deviations in the b-values, particularly for the highest shell. In general, the acquisition is more sensitive to the largest component (component 1) with lower b-values, at the expense of sensitivity to the weakest components. This was produced assuming a scheme consisting of four b-value shells to estimate four components, and T 2 = 150 ms

Being entirely data-driven, the approach inherently provides parameters optimal for the particular cohort under investigation. In the present work, we focus on the neonatal age range, and the parameters identified should therefore only be considered optimal for this age range specifically. This approach could also be used to optimise protocols for ex vivo scanning, for instance. However, we note that the b-value optimisation procedure used here did not explicitly take the orientation information into account (beyond ensuring the minimum sampling density requirements are met); this approach may yield sub-optimal results if applied to the adult case where the angular dependence is much stronger. For instance, angular features in the signal have recently been shown to be critical to resolving degeneracies in the standard model[7]; since the approach taken here ignores these angular features, it cannot provide any guarantee of optimality for applications of this type. By contrast, the neonatal brain exhibits much lower anisotropy, and many of the most interesting features in this age range are likely to be better characterised by focusing on the b-value dependence of the mean DW signal (eg, Kaden et al[3] and Reisert et al[32]), motivating in part the current approach. Joint optimisation of both angular and b-value dependences requires extended orthonormal q-space decompositions (D),[48] and is the subject of ongoing work. Due to its focus on the features of the raw signal, the optimisation performed here provides a solution that is generally suitable for any type of multi-shell dMRI analysis. However, it is likely that in certain applications, very specific aspects of the signal may be of interest, in which case a specially designed acquisition scheme may provide improved sensitivity for this particular purpose. We do, however, believe that such an acquisition would necessarily entail some loss of generality, with reduced sensitivity when used for other purposes. As shown in Table 3, the lowest b-value selected by the algorithm, at b ≈ 400 s/mm2, is lower than has typically been used in previous multi-shell acquisitions (notably the Human Connectome Project), although it is in line with recommendations for diffusion kurtosis imaging (DKI)[1,58] and neurite orientation density and dispersion imaging (NODDI).[6] Furthermore, lower b-values are expected considering that mean diffusivity is higher in neonates than in the adult brain. On the other hand, the highest b-value selected is similar to those identified as optimal for fibre orientation estimation in adults[30,52]; the acquisition scheme should therefore be well suited to tractography applications. Finally, the number of DW directions per shell increases with higher b-value, most likely due to the stronger signal attenuation (coincidentally, increasing sampling density is also required to characterise the increasingly complex angular dependence of the signal at higher b-values). A question that naturally arises from this work is whether and how the sampling directions should be optimised across shells. It has been suggested in previous work that the sampling directions should be chosen to provide uniform angular coverage across all shells.[59] By contrast, in this study the directions were optimised for uniformity across each shell independently, with no attempt to introduce any interaction between shells. We motivate our approach using sampling theory, based on similar arguments to our previous work,[30] as follows. We assume the diffusion signal is band-limited in the angular domain; that is, the diffusion signal varies smoothly as a function of orientation, and its angular frequency spectrum contains low harmonic terms only (depending on the b-value). By construction, our sampling scheme is sufficiently dense that all angular frequency components have been fully captured. When these conditions are met, there is no additional information to be gained by introducing inter-dependencies across shells, since the signal is already fully sampled. Furthermore, any attempt at introducing cross-shell dependencies can only push the sampling scheme away from per-shell uniformity, and hence compromise its overall quality. We therefore feel justified to optimise the directions on a purely per-shell basis (we do however note that optimisation across shells may be beneficial in the undersampled case, where each shell is not sufficiently sampled in isolation to capture all features of the signal; in this case, a suitable model of the signal may be able to make use of the complementary orientational information provided by the different shells). Identifying the optimal number of shells to be included in the final protocol necessarily requires additional factors to be considered, most notably N, which is in turn dependent on details of the acquisition and the total feasible acquisition time. The minimum angular sampling requirement imposes an upper limit on the number of shells that can be acquired for a chosen N since increasing the number of shells necessarily reduces the number of directions per shell. However, these requirements are easily met for large N, as is the case for the dHCP. It is notable that adding higher shells can strongly suppress the number of samples available for lower shells as, at least in the neonatal case, almost 50% of the samples must be obtained using the maximum b-value. We used the per-voxel maps of weights for each component to help inform this balance, based on an expectation that meaningful components should exhibit anatomically plausible spatial patterns that are consistent across subjects. As shown in Figure 5, the maps for the fourth component do exhibit some anatomically plausible contrast (notably in the posterior limb of the internal capsule), whereas the fifth component is much less consistent across the test subjects, suggesting it was strongly impacted by noise and artefacts, probably related to residual misregistration. For this reason, and given the nonnegligible drop in CNR for all components when increasing the number of shells (Table 4), we decided to use a 3-shell (+ b=0) scheme for the final dHCP protocol, with 300 volumes acquired within 20 minutes using multiband factor 4.[54]

Conclusion

The approach presented in this study provides a means to design an optimal acquisition multi-shell HARDI protocol, based on maximising the information content of the data, independent of any particular reconstruction method. It is based on empirically measured data, with no reliance on any assumed model of microstructure. It focuses primarily on the mean DW signal per shell, with separately derived safeguards to ensure adequate angular sampling; this is appropriate for neonatal data given the low angular contrast in this age range. The analysis performed here suggests that a b=0 + 3 shells acquisition is suitable for neonatal imaging, consisting of b = 0, 400, 1000 and 2600 s/mm2 with numbers of DW directions per shell in proportions of 6:19:28:47; these were employed in the final diffusion MRI protocol for the dHCP project.

59 in total

1. Minimal gradient encoding for robust estimation of diffusion anisotropy.

Authors: N G Papadakis; C D Murrills; L D Hall; C L Huang; T Adrian Carpenter
Journal: Magn Reson Imaging Date: 2000-07 Impact factor: 2.546

2. Learning Compact q -Space Representations for Multi-Shell Diffusion-Weighted MRI.

Authors: Daan Christiaens; Lucilio Cordero-Grande; Jana Hutter; Anthony N Price; Maria Deprez; Joseph V Hajnal
Journal: IEEE Trans Med Imaging Date: 2018-10-04 Impact factor: 10.048

3. Weighted linear least squares estimation of diffusion MRI parameters: strengths, limitations, and pitfalls.

Authors: Jelle Veraart; Jan Sijbers; Stefan Sunaert; Alexander Leemans; Ben Jeurissen
Journal: Neuroimage Date: 2013-05-16 Impact factor: 6.556

4. MAPL: Tissue microstructure estimation using Laplacian-regularized MAP-MRI and its application to HCP data.

Authors: Rutger H J Fick; Demian Wassermann; Emmanuel Caruyer; Rachid Deriche
Journal: Neuroimage Date: 2016-04-01 Impact factor: 6.556

5. Multiple q-shell diffusion propagator imaging.

Authors: Maxime Descoteaux; Rachid Deriche; Denis Le Bihan; Jean-François Mangin; Cyril Poupon
Journal: Med Image Anal Date: 2010-07-14 Impact factor: 8.545

Review 6. The UNC/UMN Baby Connectome Project (BCP): An overview of the study design and protocol development.

Authors: Brittany R Howell; Martin A Styner; Wei Gao; Pew-Thian Yap; Li Wang; Kristine Baluyot; Essa Yacoub; Geng Chen; Taylor Potts; Andrew Salzwedel; Gang Li; John H Gilmore; Joseph Piven; J Keith Smith; Dinggang Shen; Kamil Ugurbil; Hongtu Zhu; Weili Lin; Jed T Elison
Journal: Neuroimage Date: 2018-03-22 Impact factor: 6.556

7. Design of multishell sampling schemes with uniform coverage in diffusion MRI.

Authors: Emmanuel Caruyer; Christophe Lenglet; Guillermo Sapiro; Rachid Deriche
Journal: Magn Reson Med Date: 2013-04-26 Impact factor: 4.668

8. Rotationally-invariant mapping of scalar and orientational metrics of neuronal microstructure with diffusion MRI.

Authors: Dmitry S Novikov; Jelle Veraart; Ileana O Jelescu; Els Fieremans
Journal: Neuroimage Date: 2018-03-12 Impact factor: 6.556

9. Multi-tissue constrained spherical deconvolution for improved analysis of multi-shell diffusion MRI data.

Authors: Ben Jeurissen; Jacques-Donald Tournier; Thijs Dhollander; Alan Connelly; Jan Sijbers
Journal: Neuroimage Date: 2014-08-07 Impact factor: 6.556

10. Time-efficient and flexible design of optimized multishell HARDI diffusion.

Authors: Jana Hutter; J Donald Tournier; Anthony N Price; Lucilio Cordero-Grande; Emer J Hughes; Shaihan Malik; Johannes Steinweg; Matteo Bastiani; Stamatios N Sotiropoulos; Saad Jbabdi; Jesper Andersson; A David Edwards; Joseph V Hajnal
Journal: Magn Reson Med Date: 2017-05-30 Impact factor: 4.668

4 in total

1. The Developing Human Connectome Project Neonatal Data Release.

Authors: A David Edwards; Daniel Rueckert; Stephen M Smith; Samy Abo Seada; Amir Alansary; Jennifer Almalbis; Joanna Allsop; Jesper Andersson; Tomoki Arichi; Sophie Arulkumaran; Matteo Bastiani; Dafnis Batalle; Luke Baxter; Jelena Bozek; Eleanor Braithwaite; Jacqueline Brandon; Olivia Carney; Andrew Chew; Daan Christiaens; Raymond Chung; Kathleen Colford; Lucilio Cordero-Grande; Serena J Counsell; Harriet Cullen; John Cupitt; Charles Curtis; Alice Davidson; Maria Deprez; Louise Dillon; Konstantina Dimitrakopoulou; Ralica Dimitrova; Eugene Duff; Shona Falconer; Seyedeh-Rezvan Farahibozorg; Sean P Fitzgibbon; Jianliang Gao; Andreia Gaspar; Nicholas Harper; Sam J Harrison; Emer J Hughes; Jana Hutter; Mark Jenkinson; Saad Jbabdi; Emily Jones; Vyacheslav Karolis; Vanessa Kyriakopoulou; Gregor Lenz; Antonios Makropoulos; Shaihan Malik; Luke Mason; Filippo Mortari; Chiara Nosarti; Rita G Nunes; Camilla O'Keeffe; Jonathan O'Muircheartaigh; Hamel Patel; Jonathan Passerat-Palmbach; Maximillian Pietsch; Anthony N Price; Emma C Robinson; Mary A Rutherford; Andreas Schuh; Stamatios Sotiropoulos; Johannes Steinweg; Rui Pedro Azeredo Gomes Teixeira; Tencho Tenev; Jacques-Donald Tournier; Nora Tusor; Alena Uus; Katy Vecchiato; Logan Z J Williams; Robert Wright; Julia Wurie; Joseph V Hajnal
Journal: Front Neurosci Date: 2022-05-23 Impact factor: 5.152

2. Multi-Channel 4D Parametrized Atlas of Macro- and Microstructural Neonatal Brain Development.

Authors: Alena Uus; Irina Grigorescu; Maximilian Pietsch; Dafnis Batalle; Daan Christiaens; Emer Hughes; Jana Hutter; Lucilio Cordero Grande; Anthony N Price; Jacques-Donald Tournier; Mary A Rutherford; Serena J Counsell; Joseph V Hajnal; A David Edwards; Maria Deprez
Journal: Front Neurosci Date: 2021-06-16 Impact factor: 4.677

3. dStripe: Slice artefact correction in diffusion MRI via constrained neural network.

Authors: Maximilian Pietsch; Daan Christiaens; Joseph V Hajnal; J-Donald Tournier
Journal: Med Image Anal Date: 2021-09-25 Impact factor: 8.545

4. Maternal depressive symptoms, neonatal white matter, and toddler social-emotional development.

Authors: Alexandra Lautarescu; Alexandra F Bonthrone; Maximilian Pietsch; Dafnis Batalle; Lucilio Cordero-Grande; J-Donald Tournier; Daan Christiaens; Joseph V Hajnal; Andrew Chew; Shona Falconer; Chiara Nosarti; Suresh Victor; Michael C Craig; A David Edwards; Serena J Counsell
Journal: Transl Psychiatry Date: 2022-08-09 Impact factor: 7.989

4 in total