| Literature DB >> 28315836 |
Kenneth B Hoehn1,2, Gerton Lunter2, Oliver G Pybus1.
Abstract
Phylogenetic methods have shown promise in understanding the development of broadly neutralizing antibody lineages (bNAbs). However, the mutational process that generates these lineages, somatic hypermutation, is biased by hotspot motifs which violates important assumptions in most phylogenetic substitution models. Here, we develop a modified GY94-type substitution model that partially accounts for this context dependency while preserving independence of sites during calculation. This model shows a substantially better fit to three well-characterized bNAb lineages than the standard GY94 model. We also demonstrate how our model can be used to test hypotheses concerning the roles of different hotspot and coldspot motifs in the evolution of B-cell lineages. Further, we explore the consequences of the idea that the number of hotspot motifs, and perhaps the mutation rate in general, is expected to decay over time in individual bNAb lineages.Entities:
Keywords: B-cell receptor; Genetics of Immunity; antibody; evolution; lineage; phylogenetic tree
Mesh:
Substances:
Year: 2017 PMID: 28315836 PMCID: PMC5419485 DOI: 10.1534/genetics.116.196303
Source DB: PubMed Journal: Genetics ISSN: 0016-6731 Impact factor: 4.562
Figure 1Proposed evolutionary model for antibody lineages. All sequences descend from given germline node G, which has sequence g. Arrows indicate the direction of evolutionary change. Note that this known ancestor G is not necessarily the most recent common ancestor of the lineage, which is node R and sequence x. See similarities to tree in Boussau and Gouy (2006).
ML estimates of h and likelihood ratio tests for symmetric WRC/GYW model
| Lineage | Log likelihood | 2×LR | |||
|---|---|---|---|---|---|
| CH103 | 1.86 (1.4, 2.4) | −14600.4 | −14702 | 203.2 | <1 × 10−15 |
| VRC26 | 1.81 (1.5, 2.1) | −37238.5 | −37516.4 | 555.8 | <1 × 10−15 |
| VRC01 | 2.03 (1.7, 2.4) | −43647.1 | −43945.3 | 596.4 | <1 × 10−15 |
Significance was determined using the likelihood ratio test under a chi-squared distribution with one degree of freedom. The 90% confidence intervals for are shown in parentheses in the second column. MLE, ML estimate; LR, likelihood ratio.
Hotspot model selection
| Model name | Constraint/optimization of each | |||||||||
|---|---|---|---|---|---|---|---|---|---|---|
| CH103 | VRC26 | VRC01 | ||||||||
| Symmetric WR | ML | 0 | 0 | 0 | 0 | 4.6 × 10−15 | 6.3 × 10−05 | 5.2 × 10−03 | ||
| Asymmetric WR | ML | ML | 0 | 0 | 0 | 0 | ||||
| Symmetric W | 0 | 0 | ML | 0 | 0 | <1 × 10−15 | <1 × 10−15 | <1 × 10−15 | ||
| Asymmetric W | 0 | 0 | ML | ML | 0 | 0 | ||||
| Symmetric SY | 0 | 0 | 0 | 0 | ML | 0.53 | 2.0 × 10−13 | 4.2 × 10−03 | ||
| Asymmetric SY | 0 | 0 | 0 | 0 | ML | ML | ||||
| Uniform hotspots | ML | 0 | 0 | 4.2 × 10−15 | <1 × 10−15 | <1 × 10−15 | ||||
| Hierarchical hotspots | ML | ML | 0 | 0 | ||||||
| SCAH | ML | ML | ML | ML | ML | 0.65 | 1.2 × 10−06 | 9.1 × 10−04 | ||
| FCH | ML | ML | ML | ML | ML | ML | ||||
Models of hotspot hierarchy (degree of mutability) and symmetry, specified by placing constraints on how different values of h are optimized. Columns 2–7 show how the parameter h is obtained for a particular model. A value of 0 indicates that h is fixed at zero, ML indicates that a parameter is optimized by ML, and h indicates that h parameter is equal to another value of h. For instance, in Symmetric WR, hYW is equal to its reverse complement hWR, which is ML optimized. However, in Asymmetric WR, both are ML optimized. Rows 8–10 show P-values obtained from likelihood-ratio tests of each of these nested hotspot models for the bNAb lineage specified in each column. Parameters, log likelihood, and AIC of each fit are shown in Figure S3 in File S1. LR, likelihood ratio; SCAH, symmetric coldspots, asymmetric hotspots; FCH, free coldspots and hotspots.
Each of these models is nested with the model immediately below it by one free parameter, allowing hypothesis testing using a likelihood-ratio test.
Figure 2Proportional error in parameter estimation compared to true values for the VRC01 B-cell lineage, fully context-dependent simulations. Values of ω, κ, tree length, and ratio of internal to external branch lengths are shown in (A), (B), (C), and (D), respectively. Estimates obtained under the GY94 are in orange (h = 0) and estimates obtained under the HLP17 model are in blue (h estimated using ML). The edges and centers of box plots show the first, second, and third quartiles, while the whiskers show range. Similar results for B-cell lineages CH103 and VRC26 are shown in Figure S5 in File S1.
HLP17 performance under fully context-dependent simulations for symmetric WRC/GYW hotspots
| Set | Mean | Bias | Variability | Type-1 error | Type-2 error | |
|---|---|---|---|---|---|---|
| CH103 | 0.00 | −0.012 | −0.012 | 0.008 | — | 0.10 |
| 1.00 | 1.091 | 0.091 | 0.018 | 0.00 | 0.05 | |
| 2.00 | 2.095 | 0.095 | 0.028 | 0.00 | 0.05 | |
| 4.00 | 4.146 | 0.146 | 0.093 | 0.00 | 0.05 | |
| VRC26 | 0.00 | 0.004 | 0.004 | 0.003 | — | 0.05 |
| 1.00 | 0.965 | −0.035 | 0.005 | 0.00 | 0.00 | |
| 2.00 | 1.841 | −0.159 | 0.013 | 0.00 | 0.25 | |
| 4.00 | 3.501 | −0.499 | 0.026 | 0.00 | 0.85 | |
| VRC01 | 0.00 | −0.002 | −0.002 | 0.002 | — | 0.05 |
| 1.00 | 0.965 | −0.035 | 0.008 | 0.00 | 0.05 | |
| 2.00 | 1.796 | −0.204 | 0.022 | 0.00 | 0.50 | |
| 4.00 | 3.291 | −0.709 | 0.049 | 0.00 | 0.95 |
Type-1 error rate shows the proportion of data sets that incorrectly failed to reject the null hypothesis of h = 0. Type-2 error rate shows the proportion of data sets that rejected the true value of h shown in the first column. Both hypothesis tests for type-1 and type-2 errors used an α value of 0.05. Importantly, data in these analyses were not simulations under HLP17, but a fully context-dependent variation of it.