Literature DB >> 26539075

A comparison of FreeSurfer-generated data with and without manual intervention.

Christopher S McCarthy1, Avinash Ramprashad1, Carlie Thompson1, Jo-Anna Botti1, Ioana L Coman1, Wendy R Kates1.   

Abstract

This paper examined whether FreeSurfer-generated data differed between a fully-automated, unedited pipeline and an edited pipeline that included the application of control points to correct errors in white matter segmentation. In a sample of 30 individuals, we compared the summary statistics of surface area, white matter volumes, and cortical thickness derived from edited and unedited datasets for the 34 regions of interest (ROIs) that FreeSurfer (FS) generates. To determine whether applying control points would alter the detection of significant differences between patient and typical groups, effect sizes between edited and unedited conditions in individuals with the genetic disorder, 22q11.2 deletion syndrome (22q11DS) were compared to neurotypical controls. Analyses were conducted with data that were generated from both a 1.5 tesla and a 3 tesla scanner. For 1.5 tesla data, mean area, volume, and thickness measures did not differ significantly between edited and unedited regions, with the exception of rostral anterior cingulate thickness, lateral orbitofrontal white matter, superior parietal white matter, and precentral gyral thickness. Results were similar for surface area and white matter volumes generated from the 3 tesla scanner. For cortical thickness measures however, seven edited ROI measures, primarily in frontal and temporal regions, differed significantly from their unedited counterparts, and three additional ROI measures approached significance. Mean effect sizes for edited ROIs did not differ from most unedited ROIs for either 1.5 or 3 tesla data. Taken together, these results suggest that although the application of control points may increase the validity of intensity normalization and, ultimately, segmentation, it may not affect the final, extracted metrics that FS generates. Potential exceptions to and limitations of these conclusions are discussed.

Entities:  

Keywords:  FreeSurfer; control points; intensity normalization; interactive; semi-automatic; white matter edits

Year:  2015        PMID: 26539075      PMCID: PMC4612506          DOI: 10.3389/fnins.2015.00379

Source DB:  PubMed          Journal:  Front Neurosci        ISSN: 1662-453X            Impact factor:   4.677


Introduction

FreeSurfer (FS) is a freely available fully automated brain image morphometric software package that allows for the measurement of neuroanatomic volume, cortical thickness, surface area, and cortical gyrification of regions of interest (ROIs) throughout the brain. FS was designed around an automated workflow that encompasses several standard image processing steps necessary to achieve a final brain parcellation within the subject's space; however, manual image editing is allowed after each stage to ensure quality control. The first stage performs skull stripping and motion artifact correction, the second performs gray-white matter segmentation (Fischl et al., 2002), and the third segments 34 ROIs based on anatomic landmarks (Desikan et al., 2006). Another critical function that FS provides is the ability to construct surface-based representations of the cortex, from which cortical thickness, neuroanatomic volume, and surface area can be derived. Manual measurement of the volumes of specific ROIs is an arduous, labor-intensive task, and is subject to inter-rater variability. FS offers consistency in its fully automated processing, which is ideal for either single- or multi-site studies with large sample sizes. In general, validation studies have demonstrated that FS can produce measurements that are comparable to those derived from manual tracing of brain regions (Fischl et al., 2002; Tae et al., 2008; Bhojraj et al., 2011). FS has also been shown to be a highly reliable method for automated cortical thickness measurements across scanner strength and pulse sequence in all regions of the brain, with minor variability being attributed to cytoarchitectural differences of certain ROIs and difficulties with surface reconstructions in temporal lobe regions (Han et al., 2006; Fjell et al., 2009). However, strictly implementing the automated procedures in FS can result in variability in the accuracy of segmentation for some ROIs. For example, Cherbuin et al. (2009) showed that absolute hippocampal volumes measured with FS were significantly larger than those of manual tracings, with reported 23 and 29% overestimation of left and right hippocampal volumes, respectively. Closer inspection revealed that this was due to inclusions of surrounding high intensity voxel structures as well as misidentification of pockets of cerebrospinal fluid as hippocampal tissue (Cherbuin et al., 2009). Other studies suggest that the temporal lobe and nearby regions are troublesome areas of the brain for FS to measure accurately (Desikan et al., 2006; Oguz et al., 2008). The presence of either excess dura matter, closely adjacent temporal bone or cerebellum can potentially lead to inclusions which may affect volume and ROI segmentation (Desikan et al., 2010). Moreover, some neuropathological conditions, which lead to enlarged ventricles like normal pressure hydrocephalus or Alzheimer's disease may affect white matter segmentation steps and thus may lead to greater necessity of editing the FS images of patients with similar conditions (Moore et al., 2012). Magnetic Resonance (MR) imaging acquisition artifacts can also lead to over-inclusion of white matter. Given the propensity of FS to include areas of the brain extraneous to the ROI, investigators have the option of interrupting the automated process and its output. This can be done via skull stripping the brain, via the addition of control points to correct intensity normalization, via direct manual edits of white matter boundaries, or via a combination of these manual editing methods. These manual edits alter the white matter surface so that it more fully includes white matter structures and does not mistakenly segment gray matter or non-brain tissue as white matter. Manually editing the skull strip can ensure that it is more precise than the automatically completed procedure implemented by FS, and not affected by altered local anatomy in pathological states (Fennema-Notestine et al., 2006). This may improve the segmentation of white matter and lead to less control point placement in the next stage of quality control human intervention. We reviewed 82 previous studies published primarily between 2006 and 2013 (see Table 1) that utilized FS, discovering a great deal of variability in the extent to which investigators utilized skull stripping, control point or white matter editing options (see Table 1 for review criteria). Two of the studies obtained their samples from previously established databases. Of those 82 studies, 36 utilized 3 tesla (T) or higher MRI scanners, with 8 of those electing the fully automated procedure (31%). The remaining 18 chose to manually edit their 3T data using different combinations of skull stripping, control points, and white matter editing options (69%). The remaining studies utilized 1.5T MRI scanners with 26 choosing the fully automated procedure (46%). Thirty-one 1.5T studies implemented some combination of manual intervention (54%). Scanner strength did not robustly affect whether or not a study decided to edit their data. Fujimoto et al. (2014) compared 3T and 7T data, and reported only editing 7T data for residual hyperintensities in the temporal lobe while leaving the 3T unedited. Pfefferbaum et al. (2012) compared 3T data to 1.5T data, and chose to edit the 3T images more extensively. The heterogeneity in the papers we reviewed underlines the lack of a standard protocol for deciding whether to interrupt the FS segmentation process and manually edit.
Table 1

Methodological variations in articles utilizing FreeSurfer, published between 2006 and 2013.

Study (Year)Patient NPathologyControl NTotal NScanner type/StrengthManual edits
Alner et al., 201226Insertional mutation10361.5T GE SignaS, C
Anticevic et al., 201220HC0203T Siemens Allegra
Barnes et al., 201078HC0781.5T GE SignaS, C
Batty et al., 201025ADHD (age 9–15)24491.5T Phillips AchievaS, C, W
Benedict et al., 200950MS20701.5T GE SignaS
Bhojraj et al., 201156SCZ36921.5T GES, C, W
Bomboi et al., 201124MS24481.5T, 3T GE SignaC, W
Bray et al., 201159Fragile X831421.5T GE SignaC
Cerasa et al., 2011109MAO-A alleles01093T Siemens AllegraS, C, W
Cherbuin et al., 2009403HC04031.5T Phillips
Chiang et al., 2011149HC (elderly)01491.5T
Clarkson et al., 2011106AD and FTD441501.5T GE Signa
Dalaker et al., 201143PD41841.5T Philips InteraS, C
Desikan et al., 2010162MCI01621.5T GE, Siemens, or Phillips
Dickerson et al., 200715HC0151.5T Siemens Avanto
Du et al., 200741AD and FTD23641.5T Siemens Vision
Durand-Dubief et al., 20129MS091.5T Siemens Sonata + Intera
Dykstra et al., 20125TLE05T1 Weighted MRIS, C
Eggert et al., 201238Simulated0383T Siemens TRIO
Ehrlich et al., 2012131SCZ1382691.5T, 3T Siemens Sonata + TRIOC
Eyler et al., 2011202HC (twins)04041.5T SiemensC
Feczko et al., 200929AD761051.5T SiemensS, C, W
Fennema-Notestine et al., 2007133HC (elderly)01331.5T GE Signa + SiemensC
Fennema-Notestine et al., 2009259AD and MCI139398ADNI WEBSITEC
Fennema-Notestine et al., 200616Various diagnoses0161.5T GE
Fjell et al., 2009883HC (18-93)08831.5TS, C
Francis et al., 201170Children of SCZ patients731431.5T GES, C, W
Fujimoto et al., 20146HC067T SiemensC
Furst and Lal, 201113AD11241.5T Siemens VisionC
Goghari et al., 2007a19Relatives of SCZ patients22413T GE SignaS, C
Goghari et al., 2007b19Relatives of SCZ patients22413T GE SignaS, C, W
Goldman et al., 2009307SCZ and unaffected siblings1965031.5T GE
Gronenschild et al., 201210Various diagnoses20303T Siemens Allegra
Gutierrez-Galve et al., 200938APO-E allele23611.5T GE SignaS, C, W
Han et al., 200615HC0151.5T Siemens Sonata + 3T Siemens
Hinds et al., 200910HC0103T TIM TRIOC
Iglesias et al., 201192HC0921.5T Siemens
Keller et al., 201210JME62723T Philips Intera
Khan et al., 200850Various diagnoses0501.5T GE + Siemens
Klein and Tourville, 2012101HC01013T, 7TC
Kremen et al., 2010474HC (twins)04741.5T SiemensC
Lee et al., 200630Cases from ICBM030DatabaseC
Lehmann et al., 201051AD and FTD25761.5T GES, C
Levinski et al., 20091.5S, C, W
Mahone et al., 201142ADHD44861.5T Phillips
Makris et al., 200724ADHD (adults)18421.5T Siemens Sonata
Moore et al., 201215NPH and AD15303T GE SignaC
Morey et al., 200920HC0203T GE Excite
Mueller et al., 201038AD and MCI53914T BrukerMedSpecS
Murakami et al., 201121SCZ21423T GE SignaS, C
Nesvåg et al., 200953SCZ/SCA0531.5T GEC
Noble et al., 201260HC0601.5T Siemens SonataS, C
O'Donnell et al., 200535HC (8–20)0351.5T GE HorizonS, C
Oertel-Knöchel et al., 201354SCZ and healthy relatives37913T Siemens AllegraS, C, W
Oguz et al., 20089HC091.5T
Ostby et al., 2009171HC01711.5T Siemens AvantoS, C
Panizzon et al., 2009474Publicly available images04741.5T SiemensS, C
Park et al., 20041Tumor121.5TS, C
Park et al., 200933Blind from birth or after35683T PhilipsS, C
Pellicano et al., 201024MS24483T GE SignaC
Pengas et al., 200911SD0111.5T GE Signa
Pfefferbaum et al., 2012114HC01141.5T, 3T GE SignaC
Poulin et al., 2011367AD from ADNI base03671.5T GE, Siemens, or Phillips
Putcha et al., 201116MCI18343T Siemens TrioS
Raj et al., 201027Drug resistant TLE30574T BrukerMedSpecC
Ramasamy et al., 200988MS381261.5T GE SignaC, W
Rimol et al., 2010312SCZ and BD2075191.5T Siemens Sonata
Rohrer et al., 200976SD and aphasia291051.5T GE SignaS, C, W
Romero-Garcia et al., 201230HC elderly0301.5T Phillips InteraC
Safford et al., 201013HC0133T Siemens Allegra
Schultz et al., 2010b59SCZ591181.5T Siemens Vision
Schultz et al., 2010a54SCZ541081.5T Siemens Vision
Shattuck et al., 200940HC0401.5T GE
Strangman et al., 201050TBI0501.5T Siemens Avanto
Tae et al., 200821MDD20411.5T Philips
Tomasevic et al., 201320RRMS0201.5T PhilipsS, C
Tosun et al., 2010171AD and MCI772481.5T
Travis et al., 201417HC (infants)0171.5T GEC
Weier et al., 201215MS15451.5T Siemens Avanto
Winkler et al., 2010486HC04863T Siemens Trio
Wonderlick et al., 200911HC0113T Siemens TrioS
Woodward et al., 200950PTSD47971.5T GE Signa

The literature review was conducted during the summer of 2013 via PubMed search of the terms “freesurfer” and “freesurfer editing,” which yielded hundreds articles largely between the years 2006 and 2013. Studies were excluded (1) if they were based on dissertations, posters, or abstracts; or (2) if they compared FreeSurfer performance between computer systems, as most did not report utilizing any manual editing protocol. The full length studies remaining had their methods section analyzed for the precise methods of editing used. Studies which definitively stated that no editing was done were included, as it showed awareness of the possibility to edit. If they failed to mention manual editing in their methods section, they were excluded on the basis of the wide variability in possible minor manual edits that could have been made and gone unreported. Additionally, if the study did mention minor manual intervention, but failed to specify precisely what was done at each stage, the study was excluded. We also looked for studies that applied homogenous editing protocols to all study scans: those in which only a subset of scans were edited were excluded due to variability in selection of the images to be edited.

AD, Alzheimer's; PTSD, post-traumatic stress disorder; ADHD, attention-deficit hyperactivity disorder; RRMS, relapsing-remitting MS; BD, bipolar disorder; SCA, schizoaffective disorder; FTD, frontotemporal dementia; SCZ, schizophrenia; HC, healthy controls; SD, semantic dementia; JME, juvenile myoclonic epilepsy; TBI, traumatic brain injury; MDD, major depressive disorder; TLE, temporal lobe epilepsy; MS, multiple sclerosis; S, data was manually skull stripped; MCI, mild cognitive impairment; C, control points were manually applied; NPH, normal pressure hydrocephalus; W, white matter edits were utilized; PD, Parkinson's disease.

Methodological variations in articles utilizing FreeSurfer, published between 2006 and 2013. The literature review was conducted during the summer of 2013 via PubMed search of the terms “freesurfer” and “freesurfer editing,” which yielded hundreds articles largely between the years 2006 and 2013. Studies were excluded (1) if they were based on dissertations, posters, or abstracts; or (2) if they compared FreeSurfer performance between computer systems, as most did not report utilizing any manual editing protocol. The full length studies remaining had their methods section analyzed for the precise methods of editing used. Studies which definitively stated that no editing was done were included, as it showed awareness of the possibility to edit. If they failed to mention manual editing in their methods section, they were excluded on the basis of the wide variability in possible minor manual edits that could have been made and gone unreported. Additionally, if the study did mention minor manual intervention, but failed to specify precisely what was done at each stage, the study was excluded. We also looked for studies that applied homogenous editing protocols to all study scans: those in which only a subset of scans were edited were excluded due to variability in selection of the images to be edited. AD, Alzheimer's; PTSD, post-traumatic stress disorder; ADHD, attention-deficit hyperactivity disorder; RRMS, relapsing-remitting MS; BD, bipolar disorder; SCA, schizoaffective disorder; FTD, frontotemporal dementia; SCZ, schizophrenia; HC, healthy controls; SD, semantic dementia; JME, juvenile myoclonic epilepsy; TBI, traumatic brain injury; MDD, major depressive disorder; TLE, temporal lobe epilepsy; MS, multiple sclerosis; S, data was manually skull stripped; MCI, mild cognitive impairment; C, control points were manually applied; NPH, normal pressure hydrocephalus; W, white matter edits were utilized; PD, Parkinson's disease. Given that there is no standard protocol for the decision to interrupt the fully automated FS pipeline to manually edit the images, this paper seeks to establish the extent to which editing affects the final measurements that FS provides. Conceivably, time consuming manual interventions may only marginally affect the edited data sets, leading one to believe that the editing of this data may only be necessary for specific ROIs. To that end, our study is constructed around the following question: To what extent do the FreeSurfer-generated data for each region of interest differ significantly between the edited and unedited (i.e., fully automated) methods of measurement? Accordingly, we compare the means and variances of surface area, white matter volumes, and cortical thickness derived from edited and unedited datasets for each of the 34 ROIs. Note that surface area was chosen instead of gray matter volume, since surface area has been shown to be genetically and phenotypically independent of cortical thickness (Panizzon et al., 2009; Winkler et al., 2010) and, therefore, more informative than gray matter volume. Moreover, we compare effect sizes between edited and unedited conditions in a small sample of individuals with 22q11.2 deletion syndrome (22q11DS) and neurotypical controls, in order to determine whether or not editing FS output would alter the sample size necessary to detect significant differences in surface area, white matter, or cortical thickness. We hypothesize that the values generated by the edited method will differ from those of the unedited method, and that the edited method will produce larger effect sizes.

Materials and methods

Participants

Data used in this study were selected from an ongoing longitudinal study focusing on biomarkers for psychosis in 22q11.2 deletion syndrome (Kates et al., 2011a). The procedures of the longitudinal study were approved by the Institutional Review Board at SUNY Upstate Medical University. Participants were recruited through the SUNY Upstate International Center for the Evaluation, Treatment and Study of Velo-Cardio-Facial Syndrome and from the community, and all participants provided informed consent. Imaging data and neuropsychiatric testing data were acquired at four visits, about 3 years apart. For the first three time points, images were acquired on a 1.5T scanner; for the fourth time point, images were acquired on a 3T scanner. The subsample with imaging data from the 1.5T MR scanner was drawn from a larger sample of 116 participants who returned for the third time point of the longitudinal study. The subsample consisted of the first 30 participants (stratified by study group) whose Time 3 imaging data were processed, roughly corresponding to the order in which the participants returned for Time 3. They consisted of 20 with 22Q11.2 deletion syndrome (22q11DS) (8 male; mean age 17.54, SD 1.9) and 10 community controls (4 male; mean age 17.18, SD 1.21). The subsample of participants whose imaging data was from the 3T MR scanner consisted of 21 subjects who returned for the fourth time point and had been included in the subsample with 1.5T MR dataset. Nine additional subjects were matched by age, gender, and diagnosis to the remaining participants from the 1.5T MR subsample. The mean age of the 22q11DS group was 20.74, SD 2.1, and the mean age of the control group was 20.42, SD 1.06. This study was approved by the Institutional Review Board of SUNY Upstate Medical University, and all participants provided signed, informed consent in accordance with the Declaration of Helsinki. The individuals who implemented the FS processing pipeline were blind to the diagnostic status of study participants.

Imaging study

The 1.5T imaging data were acquired in the axial plane on a 1.5T Philips Interra scanner (Philips Medical Systems, Best, The Netherlands) utilizing the following T1-weighted inversion recovery, turbo gradient echo (TFE) 3-D pulse sequence: echo time = 4.6 ms; repetition time = 20 ms; 2 repetitions; matrix size 256 × 154; field of view = 24 cm; multishot = 32; TFE pre-inversion recovery = 394 ms, 1.5 mm slice thickness (Kates et al., 2011b). The 3T imaging data were acquired in the sagittal plane on a 3T Siemens Magnetom Trio Tim scanner (syngo MR B17, Siemens Medical Solutions, Erlangen, Germany) utilizing an ultrafast gradient echo 3D sequence (MPRAGE) with PAT k-space-based algorithm GRAPPA and the following parameters: echo time = 3.31 ms; repetition time = 2530 ms; matrix size 256 × 256; field of view = 256 mm, slice thickness = 1 mm.

Image analysis

Imaging data preprocessing

Preprocessing of 1.5T imaging data consisted of generating an isotropic brain image with non-brain tissue removed, and aligning that image along the anterior-posterior commissure. This was accomplished by importing the raw 1.5T MRI images into the imaging software program, BrainImage (available from the Center for Interdisciplinary Brain Sciences Research, Stanford University), where we performed an initial intensity correction, an automatic brain mask creation, followed by a manual editing step of the brainmask (Subramaniam et al., 1997). After the final manual editing, the skull was removed from the image and the brain image was saved in Analyze file format for import into the imaging software package, 3DSlicer (www.slicer.org; Fedorov et al., 2012). In 3DSlicer, the skull-stripped brains were aligned along the anterior and posterior commissure axis, and then re-sampled into isotropic voxels (0.9375 mm3) using a cubic spline interpolation transformation. Preproccessing of 3T images also consisted of generating an isotropic brain image with non-brain tissue removed. However, instead of using BrainImage to remove non-brain tissue, we used the initial, preprocessing step in the FS pipeline. The resulting brain mask was imported into 3DSlicer, and manually edited using the same steps included in the protocol cited above. Afterwards, the skull was removed from the image and the brain image was aligned along the anterior and posterior commissure axis using a cubic spline transformation and kept at the same resolution as the initial data, isotropic voxels (1 mm3). At that point, both 1.5T and 3T edited and aligned brain masks were subject to the FreeSurfer segmentation process, described below.

FS segmentation process

The preprocessed images were imported into the automated brain segmentation software FreeSurfer (FS) installed on a Dell Optiplex machine using the Ubuntu 12.04 operating system. In addition to resampling of the image into 0.9375 mm3 using a cubic spline transformation during preprocessing as described above, the FS segmentation process resampled the images into 1 mm3 as part of its motion correction step. Cortical reconstruction and volumetric segmentation was performed with the Freesurfer image analysis suite, which is documented and freely available for download online (http://surfer.nmr.mgh.harvard.edu/). The technical details of these procedures are described in prior publications (Dale and Sereno, 1993; Dale et al., 1999; Fischl et al., 1999a,b, 2001, 2002, 2004a,b; Fischl and Dale, 2000; Ségonne et al., 2004; Han et al., 2006; Jovicich et al., 2006). Briefly, the FS segmentation process included: the segmentation of the subcortical white matter and deep gray matter volumetric structures (including hippocampus, amygdala, caudate, putamen, ventricles) (Fischl et al., 2002, 2004a); intensity normalization (Sled et al., 1998); tessellation of the gray matter white matter boundary; automated topology correction (Fischl et al., 2001; Ségonne et al., 2007); and surface deformation following intensity gradients to optimally place the gray/white and gray/cerebrospinal fluid borders at the location where the greatest shift in intensity defines the transition to the other tissue class (Dale and Sereno, 1993; Dale et al., 1999; Fischl and Dale, 2000). Once the cortical models were complete, a number of deformable procedures were performed including surface inflation (Fischl et al., 1999a), registration to a spherical atlas which utilizes individual cortical folding patterns to match cortical geometry across subjects (Fischl et al., 1999b), parcellation of the cerebral cortex into units based on gyral and sulcal structure (Fischl et al., 2004b; Desikan et al., 2006), and creation of a variety of surface based data including maps of curvature and sulcal depth. Details of the methods involved have been described extensively elsewhere (Fischl and Dale, 2000; Salat et al., 2004).

Final steps of fully automated (unedited) pipeline

Following the successful completion of the FS reconstruction process, the FS directories were duplicated, and one copy immediately underwent the final reconstruction stream without manual intervention. Cortical thickness, surface area and white matter volume measurements were extracted for selected Region of Interest (ROIs) and the directories were backed up to a remote and secure location. Cortical thickness measurements were computed by looking at the average distance, calculated using a spatial lookup table, between the white matter and pial surfaces generated by FS (Fischl and Dale, 2000). This group of FS data without any manual intervention will be referred to as “unedited.”

Final steps of manual intervention (edited) method

The second copy of the data were manually inspected for defects that could affect the accuracy of the final cortical measurements. The full protocols for processing and editing both 1.5T and 3T data are provided in Supplementary Material; however a brief description of the process follows. In the coronal view, starting posteriorly, with the opposite hemisphere of the brain obstructed in order to minimize human error, each slice was inspected for errors in the surfaces created by FS. An error can be described as an instance where one of the surfaces drawn by FS includes or excludes voxels incorrectly. These errors are most often caused by motion artifacts in the more posterior sections of the brain, and by hyperintensities around the temporal and orbitofrontal lobes. Control Points, manually inserted targets that adjust a voxel's intensity value to 110, were inserted within adjacent white matter regions in order to correct surface errors as described on the FS website. Where appropriate, hyperintensities, and extraneous tissue were removed from the brain volume as well, as described in the White Matter Edits tutorial on the FS website. Once completed, the process was repeated for the opposite hemisphere. After all errors were corrected, the brain was re-run through the second reconstruction stream beginning at the module where control point adjusted voxels are taken into account. This process was repeated up to four times to ensure all errors in FS surfaces were corrected. Following successful correction of the FS surfaces, the final reconstruction step was run and cortical thickness and volume measurements were extracted for all ROIs. Manually-corrected data, hereafter referred to as “edited,” were then compared with the unedited data.

Statistical analyses

Analyses comparing the unedited and edited volumes and cortical thickness values for each ROI were run separately in SPSS (v22) for the 1.5T and 3T data. Accordingly, for both the 1.5T and the 3T data, the variance was calculated for each ROI, based on the total sample of 30 individuals, and the Levene's test was used to compare the variance of each edited ROI to that of each unedited ROI. Intraclass correlation coefficients between edited and unedited ROIs were calculated based on the total sample as well, and paired t-tests were conducted in order to determine if the means differed significantly between edited and unedited ROIs. The Bonferroni correction was applied to the 34 paired t-tests that we performed for each set of measures (i.e., surface area, white matter volume, thickness) at each field strength. As noted above, we also generated effect sizes for the mean surface areas/white matter volumes/cortical thickness values between the 20 individuals with 22q11DS and the 10 controls, in order to determine the differences in effect sizes that the edited vs. unedited methods yielded. This would allow one to determine the sample sizes for edited vs. unedited methods that would be necessary to detect significant differences in volume/cortical thickness between individuals with 22q11DS and controls. To determine whether effect sizes for the edited method differed significantly from effect sizes for the unedited method, we calculated paired t-tests across all ROIs. Bonferroni corrections were applied to paired t-tests as described above. In addition, we calculated the arithmetic difference in effect size for each edited vs. unedited ROI (by subtracting the unedited value from the edited value).

Results

Figure 1 compares MR images with and without manual intervention with control points. Means and standard deviations for surface area, white matter volume, and cortical thickness for each ROI, separated by scanner field strength, are provided in Table 2. The differences between edited and unedited measures are represented by Bland—Altman plots in Figure 2. Variances and intraclass correlation coefficients for all ROIs, separated by scanner field strength, are provided in Table 3. Effect sizes are provided in Table 4 and box plots representing effect sizes are provided in Figure 3.
Figure 1

Comparison of MR images before and after manual intervention. (A) In comparison with the unedited 1.5T image (left), the manually edited brain image (right) shows a more accurate portrayal of the parahippocampal gyrus, the hippocampus and the white matter boundary. (B) However, in the 3T brain images, there is little difference between the unedited (left) and the manually edited (right) images. The manual intervention implemented in the 3T brain was intended to include white matter and gray matter incorrectly being excluded from the lateral orbitofrontal gyrus area. Control points on this slice in addition to edits on anterior and posterior brain slices had no significant effect on the exclusion. This shows that although control points can have an effect on white matter and pial surface, as well as cortical parcellation, it is inconsistent.

Table 2

Means and standard deviations of scanner-specific surface area, volume and cortical thickness values for FreeSurfer regions of interest.

RegionSurface area mm3White Matter volume mm3Cortical thickness mm
1.5T3T1.5T3T1.5T3T
Banks of STSaE1819.23±303.372045.43±243.575182.13±953.375260.23±796.002.57±0.192.71±0.14
UE1820.07±316.802070.33±247.345147.77±1028.185342.93±793.982.56±0.192.73±0.12
Caudal anterior cingulateE1160.30±181.701337.60±205.745169.43±834.405311.00±714.432.54±0.302.56±0.16
UE1157.63±193.511338.00±207.515163.97±844.285349.87±722.072.54±0.302.59±0.17
Caudal middle frontalE4272.73±545.804538.07±585.5012,247.07±1415.6512,490.77±1817.582.67±0.182.72±0.11
UE4244.93±561.784533.43±589.2712,252.60±1486.9712,542.23±1887.662.70±0.192.74±0.11
CuneusE2368.60±367.262652.97±391.104050.53±848.293713.60±709.581.96±0.142.05±0.11
UE2384.13±392.152631.97±383.924131.67±885.203689.90±661.601.95±0.152.06±0.10
EntorhinalE708.10±177.26733.10±178.681323.03±409.721179.90±316.513.33±0.273.58±0.32
UE709.27±148.46728.30±144.381379.70±411.881243.57±280.043.33±0.303.50±0.33
FusiformE5591.43±794.386236.23±695.5912,265.73±1951.8612,191.40±1838.862.61±0.152.85±0.13
UE5707.93±738.886301.90±725.6812,583.10±1979.5012,327.93±1879.312.62±0.152.82±0.13
Inferior parietalE9276.30±1158.4510,223.63±1139.9920,702.47±2720.7720,689.87±2833.432.49±0.142.65±0.12
UE9328.00±1167.9510,202.30±1116.6220,812.97±2839.4020,612.40±2767.542.48±0.142.65±0.11
Inferior temporalE5457.40±895.736223.77±787.1910,858.73±2046.6710,668.07±1800.122.87±0.142.93±0.16
UE5624.20±898.596286.13±789.9811,183.50±2222.9210,712.70±1844.552.83±0.142.87±0.15
Isthmus cingulateE1648.90±235.371946.03±295.786375.73±873.637105.00±1092.522.63±0.152.47±0.14
UE1686.40±244.981926.90±273.276448.93±925.177045.73±1060.072.64±0.162.45±0.14
Lateral occipitalE8214.50±922.158606.73±946.6017,090.40±2420.7815,477.23±2254.652.32±0.182.31±0.10
UE8242.03±922.878666.70±933.3817,231.17±2384.3515,630.27±2267.582.32±0.182.31±0.09
Lateral orbitofrontalE4502.70±496.505485.20±444.8912,252.67±1479.2412,805.97±1367.902.94±0.152.67±0.11
UE4595.07±473.115601.20±506.8812,516.47±1492.9013,066.60±1494.302.93±0.162.63±0.12
LingualE5138.27±675.925541.47±723.1510,252.07±1197.559206.50±1279.542.11±0.122.16±0.12
UE5189.47±677.795572.20±742.8910,386.10±1291.089219.00±1220.672.13±0.162.16±0.10
Medial orbitofrontalE3333.27±562.313632.47±411.207189.00±1221.985881.97±1038.602.87±0.252.56±0.10
UE3414.27±460.993604.97±363.917247.13±1051.965949.70±1001.702.84±0.232.49±0.08
Middle temporalE5841.03±757.976544.97±770.4810,884.57±1837.6310,453.53±1737.592.99±0.143.06±0.14
UE5929.13±824.996584.70±759.8211,103.37±1964.1510,506.30±1719.802.97±0.153.06±0.14
ParahippocampalE1334.03±221.011456.33±159.733032.67±548.633146.27±461.452.50±0.382.73±0.26
UE1369.73±204.171457.90±151.423246.57±492.533170.23±523.922.54±0.372.72±0.25
ParacentralE2587.87±348.162779.23±345.327662.50±1225.017949.00±1139.902.29±0.172.54±0.11
UE2582.53±359.132781.33±350.277655.17±1248.137996.13±1173.092.30±0.162.54±0.11
Pars opercularisE2831.67±287.143104.43±317.996544.37±826.256801.97±985.122.69±0.182.77±0.14
UE2857.20±268.483127.27±334.226604.57±799.936873.30±1007.832.71±0.192.78±0.14
Pars orbitalisE1281.47±138.831420.77±158.522149.93±360.751950.20±365.862.97±0.212.78±0.15
UE1278.23±159.801436.83±146.092138.30±411.711984.50±341.422.93±0.262.78±0.17
Pars triangularisE2452.63±332.562649.00±327.665821.23±862.965727.13±720.662.73±0.152.63±0.12
UE2477.87±355.902648.60±311.205898.23±954.735734.17±732.892.73±0.142.63±0.12
PericalcarineE2360.67±368.732665.93±438.505781.17±1016.055372.37±1010.501.70±0.171.85±0.12
UE2336.37±369.702691.53±435.635844.13±1026.375389.70±958.031.71±0.181.86±0.11
Post centralE7541.93±990.567960.33±988.9014,304.80±2303.6413,430.83±2173.772.12±0.152.26±0.11
UE7583.07±1017.597980.97±996.2314,398.90±2356.2013,462.90±2147.872.13±0.142.26±0.12
Posterior cingulateE2063.97±256.272418.10±236.917950.73±928.778365.77±933.222.54±0.182.57±0.12
UE2081.57±254.152408.67±233.948015.83±979.658318.60±909.872.55±0.182.58±0.12
PrecentralE9286.70±774.2810,031.37±761.9125,719.97±2699.7226,046.97±2767.752.56±0.152.65±0.10
UE9302.53±797.9610,062.20±788.5725,752.80±2766.4826,155.40±2854.262.59±0.142.66±0.11
PrecuneusE6504.47±734.217222.87±783.6616,107.43±2516.5916,115.53±2422.522.33±0.132.57±0.11
UE6560.60±746.797229.23±767.9216,194.27±2559.9616,224.70±2379.182.36±0.132.57±.11
Rostral anterior cingulateE1149.37±202.391484.23±173.504480.97±603.324851.10±657.922.90±0.242.88±0.16
UE1169.20±213.351489.37±189.744527.80±628.024780.20±712.052.96±0.252.88±0.17
Rostral middle frontalE10,131.67±1280.8611,517.80±1413.6623,941.70±3363.8623,406.83±3120.462.56±0.182.47±0.10
UE10,227.37±1335.9011,500.07±1399.7724,199.93±3441.3623,510.23±3120.112.56±0.182.49±0.10
Superior frontalE13,023.13±1513.5414,437.67±1523.7333,892.20±4741.9634,307.17±4872.372.93±0.162.86±0.11
UE13,041.17±1500.0114,463.00±1575.1034,054.40±4744.8834,434.57±5075.832.95±0.182.88±0.12
Superior parietalE9379.53±853.699976.73±884.9722,198.90±2452.7021,167.63±2423.512.16±0.132.38±0.11
UE9431.37±875.099999.43±880.6122,453.13±2550.0721,320.03±2462.002.15±0.142.39±0.10
Superior temporalE6814.20±487.937278.53±644.3514,132.53±1432.5113,095.77±1897.512.69±0.222.89±0.11
UE6784.20±577.457302.23±622.0414,226.50±1468.1413,127.07±1871.212.70±0.222.90±0.11
SupramarginalE6940.83±806.347530.60±836.4116,515.77±2261.2216,121.80±2337.312.65±0.162.77±0.12
UE6970.70±872.177550.27±830.4316,545.70±2380.5416,158.20±2393.482.66±0.152.78±0.12
Frontal poleE509.47±70.64456.40±58.98707.27±114.40480.23±96.733.05±0.342.73±0.23
UE502.73±77.85471.47±61.81709.10±130.69496.57±92.223.03±0.312.72±0.24
Temporal poleE850.30±93.61858.40±97.081375.10±170.90115.87±199.393.76±0.273.71±0.31
UE864.30±96.36849.87±106.561409.97±179.531130.40±186.803.74±0.243.69±0.29
Transverse temporalE720.70±108.68738.47±119.251395.90±190.3513,210.47±227.562.34±0.222.52±0.12
UE715.53±105.73749.53±122.301422.40±211.111349.20±204.542.33±0.212.52±0.13
InsulaE4215.53±363.264890.70±533.4216,456.60±1570.4217,657.50±1600.453.22±0.193.14±0.14
UE4169.50±394.894797.03±444.9016,291.13±1705.3017,705.93±1694.213.24±0.173.15±0.14

Banks of the superior temporal sulcus.

Figure 2

Bland Altman plots, representing the differences between edited and unedited measures of surface area, white matter volume and cortical thickness for each field strength. The difference between the edited and unedited measure of each region of interest is plotted against the average of the two measures. Mean, and 95% limits, of agreement are provided in each plot. These plots indicate that, for the most part, the two methods are producing somewhat similar results, although all plots show a fairly wide range of values. Outliers, beyond the 95% agreement limit, indicating poor agreement, include: for surface area (1.5T): inferior temporal gyrus; surface area (3T): lateral orbitofrontal gyrus and insula; white matter volume (1.5T): insula, fusiform gyrus, inferior temporal gyrus; white matter volumes (3T): lateral orbitofrontal gyrus; thickness (1.5T): rostral anterior cingulate, pars orbitalis, and parahippocampal gyrus; thickness (3T): entorhinal cortex, inferior temporal gyrus, and medial orbitofrontal gyrus.

Table 3

Variances and intraclass correlation coefficients (ICC), based on comparisons of “edited” and “unedited” processing pipelines measuring surface area, white matter volume and cortical thickness.

RegionSurface areaWhite matter volumeCortical thickness
VarianceICCVarianceICCVarianceICC
1.5T3T1.5T3T1.5T3T1.5T3T1.5T3T1.5T3T
Banks of STSa0.1150.0080.9260.9590.1370.0120.9670.9700.0180.0640.9580.967
Caudal anterior cingulate0.0730.0010.9830.9880.0020.0810.9830.9710.0060.0010.9330.922
Caudal middle frontal0.0000.0260.9710.9870.1590.1130.9570.9870.0030.0130.9530.938
Cuneus0.1050.0080.9820.9840.0180.0030.9790.9820.0280.0300.8900.962
Entorhinal0.4421.5220.5210.8560.1530.6340.5970.8790.5940.0910.8130.917
Fusiform0.5490.0680.9300.9860.0020.0230.9500.9890.0220.1900.8750.942
Inferior parietal0.0090.0170.9840.9960.0290.0050.9790.9950.0340.0000.9240.974
Inferior temporal0.0000.0020.9240.9880.1150.0040.9460.9940.0550.0020.7580.892
Isthmus cingulate0.2960.0400.8210.9600.0540.0220.9230.9800.0370.0200.8930.910
Lateral occipital0.0020.0320.9750.9880.0700.0050.9720.9870.0050.2940.9850.970
Lateral orbitofrontal0.0410.5160.9200.8940.0200.0840.9480.9350.3610.5530.8790.858
Lingual0.0020.0010.9580.9840.2440.0540.9150.9730.6270.4160.8980.920
Medial orbitofrontal1.1060.0490.9020.8680.4960.0460.9010.9150.0300.6680.8920.646
Middle temporal0.2190.1020.9510.9890.1520.0350.9630.9890.0180.1080.7980.982
Parahippocampal0.0020.1440.2080.8880.0120.3380.3430.9360.0430.0360.9520.980
Paracentral0.0150.0070.9920.9930.0250.0270.9850.9850.1350.0140.9140.955
Pars opercularis0.2050.1470.8990.9670.1450.0170.9220.9550.1000.0520.9460.972
Pars orbitalis0.1380.5500.8800.9690.1300.1130.9330.9810.3030.3550.8440.980
Pars triangularis0.0860.2080.9400.9670.0090.0050.9300.9680.2790.0390.9100.959
Pericalcarine0.0000.0020.9720.9790.0140.0000.9730.9830.0910.0080.9680.941
Post central0.0210.0030.9930.9960.0180.0180.9890.9920.0990.0090.9820.986
Posterior cingulate0.0390.0000.9840.9880.2020.1670.9650.9690.0370.0130.8570.974
Precentral0.0010.0940.9880.9940.0060.0680.9840.9940.0530.0020.9450.972
Precuneus0.0050.0020.9860.9970.0110.0110.9910.9950.0040.0100.9420.974
Rostral anterior cingulate0.0320.6120.9490.9030.0000.3950.9510.9380.0290.0480.9270.893
Rostral middle frontal0.0860.0130.9900.9930.0450.0140.9890.9900.2200.0520.9660.938
Superior Frontal0.0090.0210.9930.9960.0150.0110.9900.9970.2650.0520.9410.936
Superior parietal0.0000.0000.9880.9930.0660.0110.9830.9920.2680.0000.9610.979
Superior temporal0.1720.0740.8980.9810.0070.0110.9560.9860.0200.0000.9760.984
Supramarginal0.2450.0030.9830.9940.0860.0270.9880.9930.1870.0280.9370.982
Frontal pole0.1610.0290.8860.8830.2300.1410.8590.9210.1310.0000.9030.873
Temporal pole0.0710.0040.8220.8620.1300.8650.8530.8290.9850.1480.7940.919
Transverse temporal0.0780.0010.9350.9730.4120.1720.8550.8980.2840.6790.9400.930
Insula0.2591.5820.8330.7990.0870.0190.8790.9231.1980.0330.9250.810

Data from both 1.5 tesla (1.5 T) and 3 tesla (3T) scanners are provided.

Banks of the superior temporal sulcus.

Table 4

Effect sizes (Cohen's d) based on comparisons of means of surface area, white matter volumes and cortical thickness, between individuals with 22q11.2 deletion syndrome (.

RegionSurface areaWhite matter volumeCortical thickness
1.5T3T1.5T3T1.5T3T
Banks of STSaE0.3710.0470.1820.0160.7850.025
UE0.2790.2460.2350.1500.8130.167
caudal anterior cingulateE0.7260.9420.7200.5850.5510.206
UE0.7651.0700.7440.7030.1170.142
Caudal middle frontalE0.1250.0660.2000.2900.1650.596
UE0.2730.0300.0240.2600.2980.780
CuneusE1.4041.6601.2291.0210.4490.740
UE1.2841.6771.3030.9990.3570.823
EntorhinalE0.0950.3980.4630.5460.1900.692
UE0.3580.1910.1890.4920.1410.623
FusiformE0.8890.5331.0070.3910.3830.061
UE0.6950.4971.0760.4020.3150.025
Inferior parietalE0.0180.3420.1070.5390.0470.271
UE0.0690.4100.0140.6130.2740.308
Inferior temporalE0.1990.2760.4730.1230.3950.574
UE0.2100.2770.4150.1140.0040.521
Isthmus cingulateE0.1190.4880.4370.7910.1600.714
UE0.2490.3620.3580.8050.0890.558
Lateral occipitalE0.9530.6290.9320.3410.1350.201
UE0.9780.5420.8160.2770.1240.305
Lateral orbitofrontalE0.2020.0170.0570.3610.2120.621
UE0.0210.1850.1570.4560.0500.395
LingualE1.4031.3550.9610.7870.3140.444
UE1.1801.3331.1010.6810.4560.884
Medial orbitofrontalE0.1130.0080.2030.1520.2501.596
UE0.1070.0660.1950.2780.1741.402
Middle temporalE0.2420.0830.3210.0530.1770.486
UE0.1970.0390.3690.0640.3180.532
ParahippocampalE0.0050.0540.5470.3300.7680.578
UE0.0020.1530.5270.5150.6460.532
ParacentralE0.7910.6750.5710.1510.2181.004
UE0.8150.5780.6120.0570.2240.957
Pars opercularisE0.2570.8370.5971.2130.4861.011
UE0.4370.7520.4031.0990.4981.028
Pars orbitalisE0.0200.3310.1550.4740.1960.762
UE0.2850.2030.2650.3990.2410.701
Pars triangularisE0.8880.8670.7130.0870.1220.802
UE1.1010.8340.4690.0080.1470.799
PericalcarineE0.8411.3950.5830.7180.1621.493
UE0.8761.6870.5200.7390.2161.371
Post centralE0.8470.8480.7640.2160.2340.967
UE0.8720.8200.7010.1530.3000.956
Posterior cingulateE0.5480.5420.2840.0260.3980.069
UE0.5900.5490.1360.1080.5970.159
PrecentralE0.0060.1850.1720.4590.5860.887
UE0.0210.1860.1860.3850.7310.819
PrecuneusE1.4181.2771.2520.5980.1830.650
UE1.3861.2501.2440.5640.1270.699
Rostral anterior cingulateE0.0160.3370.3960.5860.6360.507
UE0.0480.0750.2850.6480.6030.263
Rostral middle frontalE1.4461.7361.3000.9190.0700.702
UE1.4791.6911.3740.8550.1460.720
Superior frontalE0.5350.1480.4850.0220.6370.384
UE0.4810.1710.5670.0220.4290.471
Superior parietalE1.2310.8221.0380.2700.1250.487
UE1.2820.7701.0630.2410.0160.451
Superior temporalE0.6300.1720.5760.1310.7270.051
UE0.5490.1050.6830.1350.5260.044
SupramarginalE0.4250.0420.4840.0850.5801.036
UE0.4250.0540.5040.1230.6931.206
Frontal poleE0.0640.3220.2150.4880.1150.540
UE0.0810.1370.4290.1240.1310.545
Temporal poleE0.1380.3180.2970.7720.4330.558
UE0.3250.0310.0730.5530.1320.546
Transverse temporalE1.3390.6631.0490.4330.2500.023
UE1.2300.6640.8570.6430.0130.086
InsulaE0.2480.6320.4480.5850.8010.768
UE0.0330.4370.4170.3360.9471.245

Effect sizes of comparisons are reported for edited (E) and unedited (UE) measures, derived from 1.5 tesla (1.5T) and 3 tesla (3T) scanners.

Banks of the superior temporal sulcus.

Figure 3

Box plots representing means and standard deviations of effect sizes for each measurement type/field strength. Note that the only outliers were in the cortical thickness plots for the 3T data. The outlying regions of interest were pericalcarine thickness (1.49) and medial orbitofrontal thickness (1.60).

Comparison of MR images before and after manual intervention. (A) In comparison with the unedited 1.5T image (left), the manually edited brain image (right) shows a more accurate portrayal of the parahippocampal gyrus, the hippocampus and the white matter boundary. (B) However, in the 3T brain images, there is little difference between the unedited (left) and the manually edited (right) images. The manual intervention implemented in the 3T brain was intended to include white matter and gray matter incorrectly being excluded from the lateral orbitofrontal gyrus area. Control points on this slice in addition to edits on anterior and posterior brain slices had no significant effect on the exclusion. This shows that although control points can have an effect on white matter and pial surface, as well as cortical parcellation, it is inconsistent. Means and standard deviations of scanner-specific surface area, volume and cortical thickness values for FreeSurfer regions of interest. Banks of the superior temporal sulcus. Bland Altman plots, representing the differences between edited and unedited measures of surface area, white matter volume and cortical thickness for each field strength. The difference between the edited and unedited measure of each region of interest is plotted against the average of the two measures. Mean, and 95% limits, of agreement are provided in each plot. These plots indicate that, for the most part, the two methods are producing somewhat similar results, although all plots show a fairly wide range of values. Outliers, beyond the 95% agreement limit, indicating poor agreement, include: for surface area (1.5T): inferior temporal gyrus; surface area (3T): lateral orbitofrontal gyrus and insula; white matter volume (1.5T): insula, fusiform gyrus, inferior temporal gyrus; white matter volumes (3T): lateral orbitofrontal gyrus; thickness (1.5T): rostral anterior cingulate, pars orbitalis, and parahippocampal gyrus; thickness (3T): entorhinal cortex, inferior temporal gyrus, and medial orbitofrontal gyrus. Variances and intraclass correlation coefficients (ICC), based on comparisons of “edited” and “unedited” processing pipelines measuring surface area, white matter volume and cortical thickness. Data from both 1.5 tesla (1.5 T) and 3 tesla (3T) scanners are provided. Banks of the superior temporal sulcus. Effect sizes (Cohen's d) based on comparisons of means of surface area, white matter volumes and cortical thickness, between individuals with 22q11.2 deletion syndrome (. Effect sizes of comparisons are reported for edited (E) and unedited (UE) measures, derived from 1.5 tesla (1.5T) and 3 tesla (3T) scanners. Banks of the superior temporal sulcus. Box plots representing means and standard deviations of effect sizes for each measurement type/field strength. Note that the only outliers were in the cortical thickness plots for the 3T data. The outlying regions of interest were pericalcarine thickness (1.49) and medial orbitofrontal thickness (1.60).

Philips 1.5T data

Surface area measures

Levene's test indicated that the variance of each edited region of interest did not differ significantly from its unedited counterpart. Intraclass correlation analyses between unedited and edited surface areas yielded coefficients ranging from 0.82 to 0.99 for 32 out of the 34 ROIs. The only exceptions were entorhinal cortex areas (0.52) and parahippocampal gyrus areas (0.21). After Bonferroni correction, paired t-tests indicated that mean areas did not differ significantly between any unedited and edited ROIs. Paired t-tests indicated that the mean effect size for surface areas did not differ significantly from the mean effect size for unedited areas. Moreover, the mean arithmetic difference in effect size between all edited and unedited surface area ROIs was −0.011 (SD 0.12). The regions for which the difference in effect size between edited and unedited methods exceeded either 0.20 or −0.20 (indicating small effect sizes) for was the entorhinal cortex (−0.26), lingual area (0.22), pars orbitalis (−0.27), and pars triangularis (−0.21).

White matter volumes

No significant differences were observed in variances of white matter volumes between edited and unedited ROIs. Intraclass correlation analyses between unedited and edited white matter volumes yielded coefficients ranging from 0.85 to 0.99 for 32 out of 34 ROIs. Similar to surface areas, the exceptions were entorhinal cortex (0.60) and parahippocampal gyrus (0.34) volumes. Mean volumes did not differ significantly between 32 of the 34 pairs of unedited and edited regions. Exceptions were the lateral orbitofrontal (p < 0.001) cortex and the superior parietal lobule (p < 0.001). The mean effect size for edited measures of white matter volumes did not differ significantly from the mean effect size for unedited measures. The mean arithmetic difference in effect size between all edited and unedited white matter ROIs was −0.018 (SD 0.11). The regions with the largest differences in effect sizes between edited and unedited methods for measuring white matter volumes were the entorhinal cortex (0.27), the pars triangularis (0.24), the frontal pole (−0.21) and the temporal pole (0.22).

Cortical thickness

No significant differences were observed in variances of cortical thickness between edited and unedited ROIs. Intraclass correlation analyses between unedited and edited measures of cortical thickness yielded coefficients ranging from 0.84 to 0.985 for 31 out of 34 ROIs. Exceptions included entorhinal cortex (0.81), inferior temporal gyrus (0.76) and the temporal pole (0.79). Mean cortical thickness did not differ significantly between 32 of the 34 pairs of unedited and edited regions. Exceptions were the precentral gyrus (p < 0.001) and the rostral anterior cingulate (p < 0.001). The mean effect size for edited measures of cortical thickness did not differ significantly from the mean effect size for unedited measures. The mean arithmetic difference in effect size between all edited and unedited measures of cortical thickness was −0.03 (SD 0.16). The regions with the largest differences in effect size between edited and unedited methods were the caudal anterior cingulate (0.43), fusiform gyrus (−0.23), inferiorparietal lobule (0.39), rostral anterior cingulate (0.21), superior frontal gyrus (0.20), supramarginal gyrus (0.30) and temporal pole (0.24). Note that the majority of these values were positive, indicating that the effect sizes for the edited method tended to be larger than those for the unedited method used to measure cortical thickness.

Siemens 3T data

For the 3T data, Levene's test similarly indicated that the variance of each edited region of interest did not differ significantly from its unedited counterpart. Intraclass correlation analyses between unedited and edited surface areas yielded coefficients ranging from 0.86 to 0.99 for 33 out of 34 ROIs. Exceptions included the insula (0.799). Paired t-tests indicated that mean surface areas did not differ significantly between any pairs of unedited and edited regions. However, several regions tended to differ, including the fusiform gyrus (p = 0.002), the lateral orbitofrontal area (p = 0.003), and the inferior temporal lobe (p = 0.004). For the 3T data, the mean effect sizes for edited and unedited measures of surface area did not differ. The mean arithmetic difference in effect size between edited and unedited surface area ROIs was −0.028 (SD 0.12). The regions with the largest differences in effect sizes between the edited and unedited methods were the entorhinal cortex (0.21), pericalcarine cortex (−0.29), the rostral anterior cingulate (0.26), and the temporal pole (0.287). No significant differences were observed in the variances of white matter volumes between edited and unedited ROIs. Intraclass correlation analyses between unedited and edited white matter volumes yielded coefficients ranging from 0.90 to 1.00 for all ROIs. After Bonferonni correction, the mean white matter volumes did not differ significantly between any pairs of unedited and edited regions, however the fusiform gyrus (p < 0.005) and the pars orbitalis (p < 0.005) approached significance. The mean effect size for edited measures of white matter volume did not differ significantly from the mean effect size for unedited measures. The mean arithmetic difference in effect size between edited and unedited white matter ROIs was −0.013 (SD 0.11). The regions with the largest differences in effect size between the unedited and edited methods were the frontal pole (0.369), temporal pole (0.22), transverse temporal cortex (0.21) and insula (0.25). No significant differences in the 3T data were observed in variances of cortical thickness between edited and unedited ROIs. Intraclass correlation analyses between unedited and edited measures of cortical thickness yielded coefficients ranging from 0.86 to 0.986 for 32 out of 34 ROIs. Exceptions included medial orbitofrontal cortex (0.65) and the insula (0.81). In contrast to 1.5T data, mean cortical thickness differed significantly between 7 of the 34 pairs of unedited and edited regions, including the banks of the superior temporal sulcus, entorhinal cortex, fusiform gyrus, inferior temporal gyrus, lateral orbitofrontal cortex, medial orbitofrontal cortex and rostral middle frontal cortex (all p < 0.001). Moreover, an additional 3 ROIs approached significance, including the superior frontal gyrus (p < 0.003), precentralgyrus (p < 0.004) and the caudal middle frontal gyrus (p < 0.004). The mean effect size for edited measures of cortical thickness did not differ significantly from the mean effect size for unedited measures. The mean arithmetic difference in effect size between edited and unedited measures of cortical thickness was 0.07 (SD 0.15). The regions with the largest differences in effect sizes were the lateral orbitofrontal cortex (0.226), the lingual gyrus (−0.439), the rostral anterior cingulate (0.244) and the insula (−0.47).

Discussion

In the last 5 years, FreeSurfer (FS) has become the standard for obtaining cortical metrics from MRI images due to its ease of configuration, accurate results, and high reproducibility (Fischl et al., 2002; Tae et al., 2008; Bhojraj et al., 2011). However, there has been a lack of consensus around whether or not additional manual editing is required in order to increase the ability to detect effects between groups. This is the first study, to the best of our knowledge, to directly compare FS's fully automated method to that of FS's semi-automated manual intervention method that utilizes control points to alter gray-white matter boundaries. Overall we found very few differences between methodological approaches, although we do note specific exceptions below.

1.5T data

We found few differences between methodological approaches when using the FS segmentation process to obtain surface areas from 1.5T images. The absence of differences in variance, and the high level of intraclass correlation coefficients between the regions in edited and unedited brains support previous studies that have established the consistency and reproducibility of the fully automated FS segmentation process (Fischl et al., 2002). As found in previous studies, the regions where differences were observed, i.e., the entorhinal cortex and parahippocampal gyrus, are common locations for imaging artifacts (Oguz et al., 2008; Desikan et al., 2010). These results support previous research into FS's difficulty obtaining measurements in similar scenarios, rather than suggesting a difference between the two methods (Desikan et al., 2010). This is supported by an absence of significant differences in the mean volumes and mean effect sizes between the two methods for measuring surface areas. Although some differences were observed in white matter volume variance, the absence of consistently larger effect sizes for either method further indicates that the differences should not be viewed as a higher level of accuracy in volume segmentation for either method. One exception may be the lateral orbitofrontal cortex, for which we observed significant differences in mean volume. Due to motion which causes commonly-occurring imaging artifacts, the lateral orbitofrontal cortex is a region where raters make numerous corrections (i.e., using control points) during the FS pipeline. Although in our data, the difference in effect size between our patient and control samples was negligible for this region, that may not be the case for other populations and therefore automated white matter volumes derived for this region in general, when using a 1.5T scanner, should be viewed with caution. As described in the methods section, cortical thickness is derived from the distance between the white matter surface, which follows the border between white and gray matter, and the pial surface, which follows the border between gray matter and cerebrospinal fluid. Since manually inserting control points affects where those surfaces are positioned, the differences between the methods should be most pronounced in cortical thickness measurements. Although there was an absence of difference in the variance, ICC's, and mean cortical thickness for most regions, the difference in effect sizes was surprising. The caudal anterior cingulate, superior frontal gyrus, supramarginal gyrus, and temporal pole all had effect sizes which favored the edited method, but do not typically require many control points. On the other hand, the region that favored the unedited method, the fusiform gyrus, usually needs heavier manual correction to exclude hyper intensities. Although further exploration is needed in order to determine what specifically caused the unexpected results, it is possible that errors in the automated segmentation are more pronounced in 22q11DS due to enlarged ventricles, and that fusiform gyrus matter was incorrectly excluded in the unedited brains, giving the appearance of a larger effect then was actually present. Nonetheless, the lack of consistently significant differences in variances and mean cortical thickness volumes between the edited methods further supports the notion that manual intervention for 1.5T images in FS's automated process does not provide an increase in ability to detect an effect size between groups commensurate with the human hours required.

3T data

The results for surface area and white matter volume in 3T data are similar to what was observed for the 1.5T data, and suggest that consistency in method is most likely more important than the choice between the fully automated and the manual-edit procedures. This is corroborated by similar effect sizes observed for both the manual and automated process, with the exception of temporal and occipital lobe structures affected by the issues described above. Although no significant differences were observed in cortical thickness variance between the two groups, a notable difference in the results between the 1.5T and 3T data were 7 regions with differences in mean cortical thickness. The relatively large number of regions in the 3T for which we observed differences, and the fact that the same differences weren't present in 1.5T data warrant further explanation. In particular, the superior temporal sulcus, and the lateral and medial orbitofrontal cortices typically require manual editing in both the 1.5T and 3T data. It is possible that due to the higher contrast in 3T scans, the control points had greater success in correcting misplaced surfaces than in the 1.5 scans, potentially resulting in more accurate surfaces and cortical thickness measurements. This would have been supported by larger effect sizes in those regions for the brains which had been edited. However, such an effect was only observed for the lateral orbitofrontal cortex, and overall the differences between effect sizes for any region were evenly split between the edited and unedited methods. Therefore, it is evident that although there were differences between the two methods, editing the brain images didn't translate into our ability to detect group differences more readily with one method or the other.

Limitations

Artifacts due to intensity inhomogeneity, head motion, reduced signal to noise ratio, and partial volume effects can all lead to reduced image quality, alterations in intensity values and, ultimately, errors in image segmentation. These issues may be magnified in higher field-strength data secondary to increases in B1 field inhomogeneity (Marques et al., 2010), potentially necessitating more manual editing of higher field-strength images. Acquiring and averaging multiple acquisitions, which improves signal-to-noise and contrast-to-noise ratios, and reduces motion artifacts, can address these issues (Kochunov et al., 2006; Winkler et al., 2010). The present analyses were based on a single sequence acquisition, which therefore constitutes a limitation to our study. Multiple sequence acquisition carries trade-offs in both scanning cost and time, which can deter researchers. In the present study, the sample consisted, in part, of school-aged children with intellectual disability and, in many cases, attention deficit hyperactivity disorder. Accordingly, we had to strike a balance between optimizing the quality of our images while maintaining a timeframe that our sample would tolerate. This may have necessitated more manual intervention to correct errors in segmentation. Although we observed similarities in the metrics we extracted from the different regions of the brain, we did not conduct an overlap analysis to determine whether the ROIs had a high level of spatial overlap. It is possible that the regions appear to be similar numerically, but have different boundaries with one methodological approach more accurately denoting the region it represents. Another limitation is that both the 1.5T and 3T data used were manually skull stripped prior to implementing the FS pipeline: if brains were run fully automated, they would be subject to the automated skull stripping module included within FS. However, we do not believe that had a significant effect on our results, and previous research supports this notion (Fennema-Notestine et al., 2006). Our processing pipeline may have also been limited by the fact that we did not assess the quality of the images (e.g., signal to noise ratio) prior to processing the data, which may have affected the extent to which manual interventions were needed.

Conclusions

This study is significant in that it shows that the additional time and cost necessary to manually correct the FS segmentation process does not necessarily increase one's ability to detect differences in cortical measurements between groups. Future studies should be conducted with larger and more diverse samples in order to provide additional insight into the differences between methods. In addition, since the temporal and frontal lobe contain numerous regions affected by disorders like Alzheimer disease and schizophrenia, and many of the differences we observed were within those lobes, additional research should focus on methods which can increase the segmentation accuracy specifically in those regions.

Authors contributions

WK, IC, and CM designed the study. CM, CT, and JB completed all image processing for the study. IC and WK completed all statistical analyses of the imaging data. AR, CM, and WK wrote the manuscript. All authors revised the manuscript for accuracy and intellectual content, and all authors approved the final manuscript.

Conflict of interest statement

The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.
  101 in total

1.  Method for combining information from white matter fiber tracking and gray matter parcellation.

Authors:  Hae-Jeong Park; Marek Kubicki; Carl-Fredrik Westin; Ion-Florin Talos; Anders Brun; Steve Peiper; Ron Kikinis; Ference A Jolesz; Robert W McCarley; Martha E Shenton
Journal:  AJNR Am J Neuroradiol       Date:  2004-09       Impact factor: 3.825

2.  Psychopathological correlates of the entorhinal cortical shape in schizophrenia.

Authors:  C Christoph Schultz; Kathrin Koch; Gerd Wagner; Martin Roebel; Claudia Schachtzabel; Igor Nenadic; Carsten Albrecht; Jürgen R Reichenbach; Heinrich Sauer; Ralf G M Schlösser
Journal:  Eur Arch Psychiatry Clin Neurosci       Date:  2009-11-07       Impact factor: 5.270

3.  Reduced cortical thickness in the posterior cingulate gyrus is characteristic of both typical and atypical Alzheimer's disease.

Authors:  Manja Lehmann; Jonathan D Rohrer; Matthew J Clarkson; Gerard R Ridgway; Rachael I Scahill; Marc Modat; Jason D Warren; Sebastien Ourselin; Josephine Barnes; Martin N Rossor; Nick C Fox
Journal:  J Alzheimers Dis       Date:  2010       Impact factor: 4.472

4.  Identifying cognitively healthy elderly individuals with subsequent memory decline by using automated MR temporoparietal volumes.

Authors:  Gloria C Chiang; Philip S Insel; Duygu Tosun; Norbert Schuff; Diana Truran-Sacrey; Sky Raptentsetsang; Clifford R Jack; Michael W Weiner
Journal:  Radiology       Date:  2011-04-05       Impact factor: 11.105

5.  MAO A VNTR polymorphism and amygdala volume in healthy subjects.

Authors:  Antonio Cerasa; Aldo Quattrone; Maria C Gioia; Angela Magariello; Maria Muglia; Francesca Assogna; Sergio Bernardini; Carlo Caltagirone; Paola Bossù; Gianfranco Spalletta
Journal:  Psychiatry Res       Date:  2011-01-13       Impact factor: 3.222

6.  Reliability of longitudinal brain volume loss measurements between 2 sites in patients with multiple sclerosis: comparison of 7 quantification techniques.

Authors:  F Durand-Dubief; B Belaroussi; J P Armspach; M Dufour; S Roggerone; S Vukusic; S Hannoun; D Sappey-Marinier; C Confavreux; F Cotton
Journal:  AJNR Am J Neuroradiol       Date:  2012-07-12       Impact factor: 3.825

7.  Different regional patterns of cortical thinning in Alzheimer's disease and frontotemporal dementia.

Authors:  An-Tao Du; Norbert Schuff; Joel H Kramer; Howard J Rosen; Maria Luisa Gorno-Tempini; Katherine Rankin; Bruce L Miller; Michael W Weiner
Journal:  Brain       Date:  2007-03-12       Impact factor: 13.501

8.  Mapping cortical morphology in youth with velocardiofacial (22q11.2 deletion) syndrome.

Authors:  Wendy R Kates; Ravi Bansal; Wanda Fremont; Kevin M Antshel; Xuejun Hao; Anne Marie Higgins; Jun Liu; Robert J Shprintzen; Bradley S Peterson
Journal:  J Am Acad Child Adolesc Psychiatry       Date:  2011-01-14       Impact factor: 8.829

9.  Extent of cerebellum, subcortical and cortical atrophy in patients with MS: a case-control study.

Authors:  Deepa Preeti Ramasamy; Ralph H B Benedict; Jennifer L Cox; David Fritz; Nadir Abdelrahman; Sara Hussein; Alireza Minagar; Michael G Dwyer; Robert Zivadinov
Journal:  J Neurol Sci       Date:  2009-02-06       Impact factor: 3.181

10.  The effects of FreeSurfer version, workstation type, and Macintosh operating system version on anatomical volume and cortical thickness measurements.

Authors:  Ed H B M Gronenschild; Petra Habets; Heidi I L Jacobs; Ron Mengelers; Nico Rozendaal; Jim van Os; Machteld Marcelis
Journal:  PLoS One       Date:  2012-06-01       Impact factor: 3.240

View more
  49 in total

1.  Independent Multiple Factor Association Analysis for Multiblock Data in Imaging Genetics.

Authors:  Natalia Vilor-Tejedor; Mohammad Arfan Ikram; Gennady V Roshchupkin; Alejandro Cáceres; Silvia Alemany; Meike W Vernooij; Wiro J Niessen; Cornelia M van Duijn; Jordi Sunyer; Hieab H Adams; Juan R González
Journal:  Neuroinformatics       Date:  2019-10

2.  Optimization of preprocessing strategies in Positron Emission Tomography (PET) neuroimaging: A [11C]DASB PET study.

Authors:  Martin Nørgaard; Melanie Ganz; Claus Svarer; Vibe G Frokjaer; Douglas N Greve; Stephen C Strother; Gitte M Knudsen
Journal:  Neuroimage       Date:  2019-06-01       Impact factor: 6.556

3.  Structural Brain Alterations in Youth With Psychosis and Bipolar Spectrum Symptoms.

Authors:  Maria Jalbrzikowski; David Freedman; Catherine E Hegarty; Eva Mennigen; Katherine H Karlsgodt; Loes M Olde Loohuis; Roel A Ophoff; Raquel E Gur; Carrie E Bearden
Journal:  J Am Acad Child Adolesc Psychiatry       Date:  2019-01-18       Impact factor: 8.829

4.  Diminution of context association memory structure in subjects with subjective cognitive decline.

Authors:  Ling-Yun Fan; Ya-Mei Lai; Ta-Fu Chen; Yung-Chin Hsu; Pin-Yu Chen; Kuo-Zhou Huang; Ting-Wen Cheng; Wen-Yi Isaac Tseng; Mau-Sun Hua; Ya-Fang Chen; Ming-Jang Chiu
Journal:  Hum Brain Mapp       Date:  2018-03-08       Impact factor: 5.038

5.  Assessment of White Matter Integrity after Pediatric Traumatic Brain Injury.

Authors:  Kacie L Wright; Ramona O Hopkins; Frank E Robertson; Erin D Bigler; H Gerry Taylor; Kenneth H Rubin; Kathryn Vannatta; Terry Stancin; Keith Owen Yeates
Journal:  J Neurotrauma       Date:  2020-05-15       Impact factor: 5.269

6.  Progressive symptom-associated prefrontal volume loss occurs in first-episode schizophrenia but not in affective psychosis.

Authors:  Toshiyuki Ohtani; Elisabetta Del Re; James J Levitt; Margaret Niznikiewicz; Jun Konishi; Takeshi Asami; Toshiro Kawashima; Tomohide Roppongi; Paul G Nestor; Martha E Shenton; Dean F Salisbury; Robert W McCarley
Journal:  Brain Struct Funct       Date:  2018-04-18       Impact factor: 3.270

7.  Intrinsic Connectivity Network-Based Classification and Detection of Psychotic Symptoms in Youth With 22q11.2 Deletions.

Authors:  Matthew Schreiner; Jennifer K Forsyth; Katherine H Karlsgodt; Ariana E Anderson; Nurit Hirsh; Leila Kushan; Lucina Q Uddin; Leah Mattiacio; Ioana L Coman; Wendy R Kates; Carrie E Bearden
Journal:  Cereb Cortex       Date:  2017-06-01       Impact factor: 5.357

8.  Cortical-amygdala volumetric ratios predict onset of symptoms of psychosis in 22q11.2 deletion syndrome.

Authors:  David Berhanu; Leah M Mattiaccio; Kevin M Antshel; Wanda Fremont; Frank A Middleton; Wendy R Kates
Journal:  Psychiatry Res Neuroimaging       Date:  2016-11-25       Impact factor: 2.376

9.  Automatic Segmentation of the Dorsal Claustrum in Humans Using in vivo High-Resolution MRI.

Authors:  Shai Berman; Roey Schurr; Gal Atlan; Ami Citri; Aviv A Mezer
Journal:  Cereb Cortex Commun       Date:  2020-09-01

10.  Reliability and Utility of Manual and Automated Estimates of Total Intracranial Volume.

Authors:  Samuel J Crowley; Jared J Tanner; Daniel Ramon; Nadine A Schwab; Loren P Hizel; Catherine C Price
Journal:  J Int Neuropsychol Soc       Date:  2017-10-05       Impact factor: 2.892

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.