| Literature DB >> 31801472 |
Alexander M Sevy1,2,3, Cinque Soto3,4, Robin G Bombardi3, Jens Meiler1,2,5, James E Crowe6,7,8,9.
Abstract
BACKGROUND: Advances in next-generation sequencing (NGS) of antibody repertoires have led to an explosion in B cell receptor sequence data from donors with many different disease states. These data have the potential to detect patterns of immune response across populations. However, to this point it has been difficult to interpret such patterns of immune response between disease states in the absence of functional data. There is a need for a robust method that can be used to distinguish general patterns of immune responses at the antibody repertoire level.Entities:
Keywords: Antibody sequencing; Immune repertoire analysis; Principal component analysis; Repertoire dissimilarity index
Mesh:
Substances:
Year: 2019 PMID: 31801472 PMCID: PMC6894320 DOI: 10.1186/s12859-019-3281-8
Source DB: PubMed Journal: BMC Bioinformatics ISSN: 1471-2105 Impact factor: 3.169
Fig. 1Workflow of repertoire fingerprinting by principal component analysis. To perform repertoire fingerprinting we first sequenced antibody genes of human donors and tabulated the IGHV-IGHJ gene pair usages. We then processed the data by subsampling to uniform depth over 10 replicates per donor and normalized counts by Z score transformation. We used PCA to project the input features onto 2 dimensions and analyze gene pairs that contribute to differences between repertoires
Fig. 2Principal component analysis can be used to distinguish antibody repertoires. a. PCA was applied to features of V-J gene use frequency for three healthy donors (HIP1–3). Artificial replicates were generated by subsampling each repertoire to a common depth, repeated 10 times per repertoire. X and Y axes show principal components 1 and 2, and the percent variation explained by each component is shown in parenthesis. b. PCA was applied to gene use frequencies from three healthy donors (HIP1–3) and five HIV-infected donors after influenza vaccination (HIV/Flu1–5). Black circles show results of K-means clustering with k = 2 clusters
HIV-infected subjects studied on day 7 after influenza vaccination
| HIV/Flu | Race | Ethnicity | Age range (years) | Site of collection |
|---|---|---|---|---|
| 1 | Caucasian | Non-Hispanic | 50–59 | Nashville, TN |
| 2 | Caucasian | Non-Hispanic | 50–59 | |
| 3 | Caucasian | Hispanic | 30–39 | |
| 4 | African-American | Non-Hispanic | 50–59 | |
| 5 | Caucasian | Non-Hispanic | 40–49 |
Fig. 3PCA provides better discrimination between donor cohorts than an alternate method. a. The Repertoire Dissimilarity Index from Bolen, et al. [17] was calculated for all pairs of donors within both the healthy and HIV/Flu cohorts and for inter-cohort pairs. b. The Euclidean distance between principal components (PC) 1 + 2 was calculated for the same intra- and inter-cohort pairs. Boxes show the interquartile range of data, with the median shown in blue, and whiskers show the full range of data. Significance was calculated using a two-sided Mann-Whitney rank test
Fig. 4PCA reveals differences between healthy adult and cord blood repertoires. PCA was applied to V-J gene frequency for three healthy donors (HIP1–3) and three cord blood donors (CORD1–3). X and Y axes show principal components 1 and 2, and the percent variation explained by each component is shown in parenthesis. Black circles show results of K-means clustering with k = 2 clusters. Each dot represents a synthetic replicate generated by subsampling each repertoire to a common depth. This subsampling was repeated 10 times per repertoire
Fig. 5Repertoire fingerprinting by PCA can identify perturbations in repertoire after influenza vaccination. PCA was applied to the sequenced repertoires of three individuals (FV, GMC, IB) at 10 time points before and after seasonal influenza vaccination. Dataset is from Laserson, et al. [26]. Shown are three principal components and the percent variation explained by each in parenthesis. Each dot represents a synthetic replicate generated by subsampling each repertoire to a common depth. This subsampling was repeated 10 times per repertoire