| Literature DB >> 30333820 |
Alexander Dimitri Yermanos1,2, Andreas Kevin Dounas3, Tanja Stadler1, Annette Oxenius2, Sai T Reddy1.
Abstract
Antibody evolution studies have been traditionally limited to either tracing a single clonal lineage (B cells derived from a single V-(D)-J recombination) over time or examining bulk functionality changes (e.g., tracing serum polyclonal antibody proteins). Studying a single B cell disregards the majority of the humoral immune response, whereas bulk functional studies lack the necessary resolution to analyze the co-existing clonal diversity. Recent advances in high-throughput sequencing (HTS) technologies and bioinformatics have made it possible to examine multiple co-evolving antibody monoclonal lineages within the context of a single repertoire. A plethora of accompanying methods and tools have been introduced in hopes of better understanding how pathogen presence dictates the global evolution of the antibody repertoire. Here, we provide a comprehensive summary of the tremendous progress of this newly emerging field of systems phylogeny of antibody responses. We present an overview encompassing the historical developments of repertoire phylogenetics, state-of-the-art tools, and an outlook on the future directions of this fast-advancing and promising field.Entities:
Keywords: B cell evolution; Ig-Seq; antibody lineage; phylogenetics; systems immunology
Mesh:
Substances:
Year: 2018 PMID: 30333820 PMCID: PMC6176079 DOI: 10.3389/fimmu.2018.02149
Source DB: PubMed Journal: Front Immunol ISSN: 1664-3224 Impact factor: 7.561
Figure 1Evolutionary dynamics of the Germinal center reaction. (A) Naïve and memory B cells are recruited into germinal center reactions where they undergo subsequent rounds of somatic hypermutation in the dark zone and selection via follicular dendritic cells in the light zone. This leads to successive rounds of division and mutations (shown by colored antibody receptors) or apoptosis (shown by gray cells). Different selection pressures can lead to either balanced selection, in which multiple independent clones expand and undergo SHM, or imbalanced selection where a few clones dominate the GC reaction and undergo many rounds of SHM. (B) Ig-Seq can capture the sequence diversity within populations of B cells. Systems phylogeny aims to assign the recovered sequences into clonal families, followed by the inference of evolutionary histories. The resulting phylogenetic trees can then be compared both within one host and between hosts.
Comparison of tools and methods used for clonal lineage assignment and phylogenetic inference.
| Alignment based |
Potentially fast run time (depends on the tool) Can often supply own germline genes |
Often arbitrary thresholds for clonal relatedness (e.g., 80% CDR3 similarity) |
| Partis |
Human, mouse and macaque germline built in Germline inference possible Docker image available Good documentation |
Large datasets may require subsampling due to computational demands |
| Clonify |
Antibody specific edit distance Explicit incorporation of shared mutational histories |
Limited to unseeded alignment |
| SONAR |
Multiple seeded lineage assignment algorithms Easy export to other phylogenetic software Docker image available |
Limited to Human germlines |
| Distance based |
Computational speed Multiple distance metrics possible |
Difficult to calculate distances for sequences with large divergence and alignment gaps Less sophisticated than probabilistic methods |
| Maximum parsimony |
Intuitive algorithm Clonal frequency incorporation (GCTree) Polytomies and internal nodes (IgTree) |
Ignores antibody specific properties (hotspots, transversions, transitions) Long-branch attraction problem |
| Maximum likelihood |
Complex substitution models Hotspot specific codon models (IgPhyML) |
Computationally demanding Sensitive to model misspecification |
| Bayesian |
Complex substitution models Can produce rooted trees without explicit outgroup Possible to incorporate biological knowledge with priors Mutation rate returned in calendar time (BEAST) |
Sensitive to model misspecification Highest computational demands due to Markov chain Monte Carlo algorithm |
Figure 2Tree topologies for B cells. (A) The inclusion of polytomies in the phylogenetic tree allows a B cell to produce more than two distinct offspring at a given internal node. (B) Experimentally recovered sequences can be inferred as either internal nodes or tips in the phylogenetic tree. (C) Persisting ancestral sequences can be sampled at multiple time points while also producing distinct offspring. (D) Clonal frequencies have often been illustrated by the size of the nodes. Therefore, information regarding clonal expansion can be incorporated into the resulting topologies.