| Literature DB >> 30542912 |
Zhengguo Gu1, Katrijn Van Deun2.
Abstract
This article introduces a package developed for R (R Core Team, 2017) for performing an integrated analysis of multiple data blocks (i.e., linked data) coming from different sources. The methods in this package combine simultaneous component analysis (SCA) with structured selection of variables. The key feature of this package is that it allows to (1) identify joint variation that is shared across all the data sources and specific variation that is associated with one or a few of the data sources and (2) flexibly estimate component matrices with predefined structures. Linked data occur in many disciplines (e.g., biomedical research, bioinformatics, chemometrics, finance, genomics, psychology, and sociology) and especially in multidisciplinary research. Hence, we expect our package to be useful in various fields.Entities:
Keywords: Common/distinctive components; Group Lasso; Lasso; Linked data analysis; Multiblock analysis; Simultaneous component analysis
Mesh:
Year: 2019 PMID: 30542912 PMCID: PMC6797642 DOI: 10.3758/s13428-018-1163-z
Source DB: PubMed Journal: Behav Res Methods ISSN: 1554-351X
Fig. 1An example of common/distinctive components in a concatenated component loading matrix. The columns represent components, and the rows represent variables. The first six rows contain loadings from the first data block, and the remaining five rows contain loadings from the second data block. “×” indicates a non-zero loading, and “0” denotes a zero loading
Fig. 2A flow chart for model selection. Note that CS stands for component structure, and C/D stands for common and distinctive components
Fig. 3A screenshot of the result of the DISCO-SCA method
Fig. 4The cross-validation curve
Fig. 5A screenshot of the output of the estimated loadings (Model 2). Note that the final re-estimated non-shrinkage component loading matrix automatically includes row names if the raw data contains variable names
Fig. 6A heatmap of the component loading matrix (Model 2). The first ten rows represent the loadings from the “Herring_Sensory” data (hence the second block), and the remaining ten rows represent the loadings from the “Herring_ChemPhy” data (hence the first block)
Fig. 7A screenshot of the output of the estimated loadings (Model 4). Note that the final re-estimated non-shrinkage component loading matrix automatically includes row names if the raw data contains variable names
Fig. 8A heatmap of the estimated component loading matrix (Model 4). The first ten rows represent the loadings from the “Herring_Sensory” data (hence the second block), and the remaining ten rows represent the loadings from the “Herring_ChemPhy” data (hence the first block)
Fig. 9The user–computer interaction procedure for the PCA-GCA method
Fig. 10The scree plots generated by the pca_gca function
Fig. 11A screenshot of the output of the estimated loadings (Model 5). Note that the final re-estimated non-shrinkage component loading matrix automatically includes row names if the raw data contains variable names
Descriptive statistics of the 195 family data
| Questionnaire title | Mean | SD |
|---|---|---|
| Mother | ||
| Relationship with partners (the higher the score, the more satisfied) | 3.58 | .79 |
| Argue with partners (the higher the score, the less violent) | 3.65 | .42 |
| Child’s bright future (the higher the score, the stronger the feeling of bright future) | 4.49 | .52 |
| Activities with the child (the higher the score, the more activities) | 2.40 | .39 |
| Feelings about parenting (the higher the score, the more positive about parenting) | 3.33 | .68 |
| Communication with the child (the higher the score, the more communication) | 4.16 | .50 |
| Argue (aggressively) with the child (the higher the score, the less aggressive) | 3.08 | .45 |
| Confidence about oneself (the higher the score, the more confident) | 2.71 | .43 |
| Father | ||
| Relationship with partners (the higher the score, the more satisfied) | 3.67 | .70 |
| Argue with partners (the higher the score, the less violent) | 3.77 | .42 |
| Child’s bright future (the higher the score, the stronger the feeling of bright future) | 4.48 | .51 |
| Activities with the child (the higher the score, the more activities) | 2.30 | .38 |
| Feelings about parenting (the higher the score, the more positive about parenting) | 3.40 | .64 |
| Communication with the child (the higher the score, the more communication) | 3.97 | .60 |
| Argue (aggressively) with the child (the higher the score, the less aggressive) | 3.18 | .42 |
| Confidence about oneself (the higher the score, the more confident) | 2.78 | .47 |
| Child | ||
| Self confidence/esteem (the higher the score, the more confident) | 2.08 | .46 |
| Academic performance (the higher the score, the better the performance) | 6.87 | 1.32 |
| Social life and extracurricular activities (the higher the score, the more social life) | 2.22 | .38 |
| Importance of friendship (the higher the score, the more important friendship is) | 3.94 | .61 |
| Self image (the higher the score, the more positive self image is) | 2.56 | .52 |
| Happiness (the higher the score, the happier) | 2.29 | .44 |
| Confidence about the future (the higher the score, the more confident about the future) | 3.94 | .47 |
The estimated component loading matrix of the 195 family data
| Component 1 | Component 2 | Component 3 | Component 4 | Component 5 | |
|---|---|---|---|---|---|
| Mother | |||||
| Relationship with partners | 0 | 0 | 11.92 | 0 | 0 |
| Argue with partners | − 5.53 | 0 | 5.88 | 0 | 0 |
| Child’s bright future | − 8.83 | 0 | 0 | 0 | 0 |
| Activities with children | − 4.65 | − 9.02 | 0 | 0 | 0 |
| Feeling about parenting | − 9.02 | 0 | 0 | 0 | 0 |
| Communication with children | − 9.20 | 0 | 0 | 0 | 0 |
| Argue with children | − 8.78 | 0 | 0 | 0 | 0 |
| Confidence about oneself | − 6.66 | 0 | 7.26 | 0 | 0 |
| Father | |||||
| Relationship with partners | 0 | 0 | 11.80 | 0 | 0 |
| Argue with partners | 0 | 0 | 5.26 | 0 | − 9.17 |
| Child’s bright future | − 3.39 | 0 | 0 | 0 | − 5.76 |
| Activities with children | 0 | − 11.56 | 0 | 0 | 0 |
| Feeling about parenting | − 4.04 | 0 | 0 | 0 | − 6.94 |
| Communication with children | 0 | − 8.17 | 0 | 0 | 0 |
| Argue with children | − 4.98 | 0 | 0 | 0 | − 9.88 |
| Confidence about oneself | 0 | 0 | 5.60 | 0 | − 8.19 |
| Child | |||||
| Self confidence/esteem | − 5.82 | 0 | 0 | 8.66 | 0 |
| Academic performance | 0 | 0 | 0 | 7.08 | 0 |
| Social life and extracurricular | 0 | 0 | 0 | 4.10 | 0 |
| Importance of friendship | 0 | 0 | 0 | 9.60 | 0 |
| Self Image | 0 | 0 | 0 | 10.36 | 0 |
| Happiness | 0 | 0 | 0 | 9.55 | 0 |
| Confidence about the future | 0 | 0 | 0 | 7.48 | 0 |
Note. To interpret the loadings, we compare the signs of the loadings of each block within a component. Take Component 1 for example, all the non-zero loadings are of the same sign (in this case, ‘-’ sign), meaning that the variables corresponding to those loadings are positively associated with each other; that is, the higher a mother scores on, for example, “Argue with partners”, the higher she scores on the remaining variables (excluding “Relationship with partners”), and also the higher her partner (i.e., the father) scores on “Child’s bright future”, “Feeling about parenting”, and “Argue with children”, and also the higher the child scores on “Self confidence/esteem” and “Self image”