| Literature DB >> 29911679 |
Michael A Powell1, Javier O Garcia2,3, Fang-Cheng Yeh4,5, Jean M Vettel2,3,6, Timothy Verstynen7.
Abstract
The unique architecture of the human connectome is defined initially by genetics and subsequently sculpted over time with experience. Thus, similarities in predisposition and experience that lead to similarities in social, biological, and cognitive attributes should also be reflected in the local architecture of white matter fascicles. Here we employ a method known as local connectome fingerprinting that uses diffusion MRI to measure the fiber-wise characteristics of macroscopic white matter pathways throughout the brain. This fingerprinting approach was applied to a large sample (N = 841) of subjects from the Human Connectome Project, revealing a reliable degree of between-subject correlation in the local connectome fingerprints, with a relatively complex, low-dimensional substructure. Using a cross-validated, high-dimensional regression analysis approach, we derived local connectome phenotype (LCP) maps that could reliably predict a subset of subject attributes measured, including demographic, health, and cognitive measures. These LCP maps were highly specific to the attribute being predicted but also sensitive to correlations between attributes. Collectively, these results indicate that the local architecture of white matter fascicles reflects a meaningful portion of the variability shared between subjects along several dimensions.Entities:
Keywords: Behavior prediction; Individual differences; Local connectome; Structural connectivity; White matter
Year: 2018 PMID: 29911679 PMCID: PMC5989992 DOI: 10.1162/NETN_a_00031
Source DB: PubMed Journal: Netw Neurosci ISSN: 2472-1751
Data analysis pipeline. dMRI from the HCP dataset were preprocessed consistent with previous research investigating the local connectome fingerprint (top panel) and included registration via QSDR and estimation of SDF using an axonal directional atlas derived from the HCP dataset. Once fingerprints were estimated for each individual, the pipeline for analysis of the continuous response variables consisted of four major steps: (1) a PCA-based dimensionality reduction, (2) a LASSO model based on the lower-dimensional components of the local connectome fingerprint, (3) local connectome phenotype estimation from projection of the contributing components of the LASSO model, and (4) prediction on the held-out dataset. A similar pipeline was used for categorical response variables with the exception that a logistic LASSO model was used in the LASSO-PCR step and prediction accuracy was assessed as percentage correct rather than as a predicted versus observed correlation.
Summary statistics for 28 continuous HCP attributes used in the modeling analysis
| Age (in years) | 841 | 28.76 | 29.00 | −0.08 | 0.00 | 0.00 | 28.51 | 29.01 |
| Handedness | 841 | 65.36 | 80.00 | −2.18 | 0.10 | 0.07 | 62.33 | 68.40 |
| Total household income (binned; 5 ∼ $40,000–49,999) | 836 | 5.01 | 5.00 | −0.28 | 0.00 | 0.00 | 4.87 | 5.16 |
| Years of education completed | 840 | 14.92 | 16.00 | −0.74 | 0.00 | 0.00 | 14.80 | 15.04 |
| Body mass index | 840 | 26.51 | 25.42 | 0.95 | 0.03 | 0.00 | 26.15 | 26.86 |
| Mean hematocrit sample | 740 | 43.39 | 43.50 | −0.68 | 0.02 | 0.00 | 43.05 | 43.73 |
| Diastolic blood pressure | 830 | 76.77 | 76.00 | 0.33 | 0.02 | 0.00 | 76.06 | 77.49 |
| Systolic blood pressure | 830 | 123.76 | 123.00 | 0.51 | 0.01 | 0.00 | 122.80 | 124.71 |
| Systolic-diastolic blood pressure ratio | 830 | 1.63 | 1.61 | 0.97 | 0.03 | 0.00 | 1.61 | 1.64 |
| Hemoglobin A1C | 566 | 5.26 | 5.30 | 0.12 | 0.05 | 0.01 | 5.22 | 5.29 |
| Pittsburgh Sleep Quality Index | 841 | 5.18 | 5.00 | 0.91 | 0.01 | 0.00 | 4.98 | 5.39 |
| NIH Picture Sequence Memory Test | 840 | 111.83 | 110.70 | 0.11 | 0.00 | 0.00 | 110.92 | 112.73 |
| NIH Dimensional Change Card Sort Test | 839 | 115.28 | 115.07 | 0.18 | 0.02 | 0.00 | 114.59 | 115.97 |
| NIH Flanker Inhibitory Control and Attention Test | 841 | 112.52 | 112.21 | 0.25 | 0.01 | 0.00 | 111.84 | 113.20 |
| Penn Progressive Matrices: Number of correct responses | 838 | 16.76 | 18.00 | −0.55 | 0.00 | 0.00 | 16.44 | 17.09 |
| Penn Progressive Matrices: Total skipped items | 838 | 3.12 | 1.00 | 1.01 | 0.00 | 0.00 | 2.86 | 3.39 |
| Penn Progressive Matrices: Median reaction time for correct responses (sec) | 838 | 15.61 | 14.65 | 0.91 | 0.01 | 0.00 | 14.99 | 16.23 |
| NIH Oral Reading Recognition Test | 841 | 116.96 | 117.59 | −0.14 | 0.01 | 0.00 | 116.24 | 117.67 |
| NIH Picture Vocabulary Test | 841 | 116.76 | 117.03 | 0.09 | 0.01 | 0.00 | 116.12 | 117.40 |
| NIH Toolbox Pattern Comparison Processing Speed Test | 841 | 114.15 | 113.16 | 0.22 | 0.03 | 0.00 | 113.14 | 115.16 |
| Delay Discounting: Area under the curve for discounting of $200 | 838 | 0.25 | 0.20 | 1.39 | 0.05 | 0.00 | 0.24 | 0.27 |
| Delay Discounting: Area under the curve for discounting of $40,000 | 838 | 0.50 | 0.49 | 0.05 | 0.00 | 0.00 | 0.48 | 0.52 |
| Variable Short Penn Line Orientation: >Total number correct | 838 | 14.80 | 15.00 | −0.23 | 0.00 | 0.00 | 14.51 | 15.10 |
| Variable Short Penn Line Orientation: Median reaction time divided by expected number of clicks for correct (sec) | 838 | 1.15 | 1.10 | 1.31 | 0.03 | 0.00 | 1.13 | 1.17 |
| Variable Short Penn Line Orientation: Total positions off for all trials | 838 | 24.34 | 21.00 | 3.16 | 0.05 | 0.02 | 23.33 | 25.35 |
| Penn Word Memory Test: Total number of correct responses | 838 | 35.64 | 36.00 | −0.82 | 0.01 | 0.00 | 35.44 | 35.84 |
| Penn Word Memory Test: Median reaction time for correct responses (sec) | 838 | 1.56 | 1.51 | 1.85 | 0.03 | 0.01 | 1.54 | 1.58 |
| NIH List Sorting Working Memory Test | 841 | 111.21 | 108.06 | 0.16 | 0.02 | 0.00 | 110.45 | 111.97 |
1Using the interquartile range (IQR: 75th percentile minus 25th percentile), we define a mild outlier to be any point greater than the 75th percentile or less than the 25th percentile by an amount at least 1.5 times the IQR.
2Using the interquartile range (IQR: 75th percentile minus 25th percentile), we define an extreme outlier to be any point greater than the 75th percentile or less than the 25th percentile by an amount at least 3 times the IQR.
3Handedness is a bimodal distribution with a strong preference for right-handedness in the HCP cohort, thus labeling as extreme outliers a large number of individuals with strong left-hand dominance.
Lower dimensional structure of the local connectome fingerprints. (A) Three individual local connectome fingerprints, from three separate subjects, show coarse commonalities and unique patterns of variability. (B) Cumulative summation of variance explained from each component, sorted by the amount of variance explained by each component. Dotted lines indicate the number of components (697) needed to explain 90% of the variability in the fingerprint dataset. (C) Mean fingerprint across participants (blue, left) and linear summation of principal components that explain 90% of the variance (red, right).
Linear LASSO-PCR results for 28 continuous HCP attributes
| Age (in years) | 841 | Yes | 0.1430 | 0.0311 | −0.0378 | 0.1007 | 0.1776 |
| Handedness | 841 | No | 0.5581 | −0.0594 | −0.1208 | 0.0017 | 0.9475 |
| Total household income | 836 | Yes | 0.1604 | −0.0029 | −0.0753 | 0.0632 | 0.5181 |
| 840 | No | 0.4377 | 0.0729 | 0.0127 | 0.1343 | <10E-4 | |
| 840 | No | 0.4976 | 0.2736 | 0.2067 | 0.3421 | <10E-4 | |
| 740 | Yes | 0.4348 | 0.1324 | 0.0654 | 0.1939 | <10E-4 | |
| Diastolic blood pressure | 830 | No | 0.2058 | 0.0615 | −0.0154 | 0.1378 | 0.0331 |
| 830 | Yes | 0.3596 | 0.1396 | 0.0745 | 0.2076 | <10E-4 | |
| Systolic-diastolic blood pressure ratio | 830 | Yes | NA | −0.0240 | −0.0926 | 0.0474 | 0.7457 |
| Hemoglobin A1C | 566 | No | 0.2130 | 0.0098 | −0.0794 | 0.1071 | 0.4165 |
| Pittsburgh Sleep Quality Index | 841 | No | NA | −0.0314 | −0.0966 | 0.0415 | 0.8277 |
| 840 | No | 0.5964 | 0.0977 | 0.0290 | 0.1618 | <10E-4 | |
| NIH Dimensional Change Card Sort Test | 839 | No | 0.2381 | −0.0299 | −0.0945 | 0.0379 | 0.8071 |
| NIH Flanker Inhibitory Control and Attention Test | 841 | Yes | 0.1285 | −0.0001 | −0.0706 | 0.0651 | 0.5161 |
| 838 | Yes | 0.2027 | 0.0849 | 0.0187 | 0.1502 | <10E-4 | |
| 838 | Yes | 0.2090 | 0.0733 | 0.0120 | 0.1383 | <10E-4 | |
| Penn Progressive Matrices: Median reaction time for correct responses | 838 | Yes | 0.1078 | 0.0086 | −0.0619 | 0.0754 | 0.4075 |
| NIH Oral Reading Recognition Test | 841 | Yes | 0.1665 | 0.0008 | −0.0702 | 0.0660 | 0.4748 |
| NIH Picture Vocabulary Test | 841 | Yes | 0.5206 | 0.0481 | −0.0187 | 0.1142 | 0.0781 |
| NIH Toolbox Pattern Comparison Processing Speed Test | 841 | No | 0.1814 | −0.0569 | −0.1260 | 0.0061 | 0.9390 |
| Delay Discounting: Area under the curve for discounting of $200 | 838 | Yes | 0.3010 | 0.0275 | −0.0311 | 0.0891 | 0.2202 |
| 838 | No | 0.2056 | 0.0802 | 0.0132 | 0.1527 | <10E-4 | |
| 838 | Yes | 0.4490 | 0.0951 | 0.0279 | 0.1589 | <10E-4 | |
| Variable Short Penn Line Orientation: Median reaction time divided by expected number of clicks for correct | 838 | Yes | 0.4449 | −0.0572 | −0.1302 | 0.0141 | 0.9520 |
| Variable Short Penn Line Orientation: Total positions off for all trials | 838 | Yes | 0.4695 | 0.0014 | −0.0621 | 0.0735 | 0.4741 |
| Penn Word Memory Test: Total number of correct responses | 838 | No | 0.2382 | 0.0474 | −0.0228 | 0.1189 | 0.0764 |
| Penn Word Memory Test: Median reaction time for correct responses | 838 | No | 0.2354 | −0.0391 | −0.0965 | 0.0191 | 0.9034 |
| 841 | Yes | 0.4140 | 0.0793 | 0.0097 | 0.1540 | <10E-4 | |
The observed-predicted correlation was statistically significant after applying the FDR correction for multiple comparisons.
Training correlation could not be computed when the full HCP training set yielded no nonzero LASSO coefficients for ICV or LCF PCs.
Correlations between fingerprints. The matrix of between-subject correlations in local connectome fingerprints, sorted by participant index, is shown on the right. The distribution (inset) is the histogram of the upper triangle of the correlation matrix and the best fit kernel density estimate (red line).
Logistic LASSO-PCR results for four categorical HCP attributes
| 840 | Yes | 0.9405 | 0.8071 | 0.8691 | 0.8452 | 0.8905 | 0 | |
| 760 | Yes | 0.9632 | 0.8276 | 0.9053 | 0.8842 | 0.9263 | 0 | |
| Ethnicity | 833 | No | 0.9136 | 0.9136 | 0.9136 | 0.8944 | 0.9316 | 1.0000 |
| Relationship status | 840 | No | 0.6679 | 0.5571 | 0.5571 | 0.5226 | 0.5917 | 0.7620 |
* The prediction accuracy was statistically significant after applying the false discovery rate (FDR) correction for multiple comparisons.
1The female-male split in the 840 subjects was 56%–44%, respectively.
2The white and black subpopulations made up 82% and 18%, respectively, of the 760 subjects reported here.
3The Not Hispanic/Latino and Hispanic split in the 833 subjects was 91.4%–8.6%, respectively.
4Relationship status included 44.3% of the population in a “married or live-in relationship” and 55.7% not in such a relationship.
Local connectome phenotypes. Matrix inset is a correlation matrix displaying the similarity between phenotypes of the local connectome to the continuous response variables. Example phenotype maps are shown around the correlation matrix, and the color scale for each has been adjusted to reveal the areas of the local connectome that are most predictive of the labeled response variable.