| Literature DB >> 26802217 |
Kenneth B Hoehn1, Anna Fowler2, Gerton Lunter2, Oliver G Pybus3.
Abstract
B-cell receptors (BCRs) are membrane-bound immunoglobulins that recognize and bind foreign proteins (antigens). BCRs are formed through random somatic changes of germline DNA, creating a vast repertoire of unique sequences that enable individuals to recognize a diverse range of antigens. After encountering antigen for the first time, BCRs undergo a process of affinity maturation, whereby cycles of rapid somatic mutation and selection lead to improved antigen binding. This constitutes an accelerated evolutionary process that takes place over days or weeks. Next-generation sequencing of the gene regions that determine BCR binding has begun to reveal the diversity and dynamics of BCR repertoires in unprecedented detail. Although this new type of sequence data has the potential to revolutionize our understanding of infection dynamics, quantitative analysis is complicated by the unique biology and high diversity of BCR sequences. Models and concepts from molecular evolution and phylogenetics that have been applied successfully to rapidly evolving pathogen populations are increasingly being adopted to study BCR diversity and divergence within individuals. However, BCR dynamics may violate key assumptions of many standard evolutionary methods, as they do not descend from a single ancestor, and experience biased mutation. Here, we review the application of evolutionary models to BCR repertoires and discuss the issues we believe need be addressed for this interdisciplinary field to flourish.Entities:
Keywords: B-cell receptor; diversity; immunoglobulin; infection.; molecular evolution
Mesh:
Substances:
Year: 2016 PMID: 26802217 PMCID: PMC4839220 DOI: 10.1093/molbev/msw015
Source DB: PubMed Journal: Mol Biol Evol ISSN: 0737-4038 Impact factor: 16.240
. Applications of BCR Repertoire Sequencing.
| Application | Diseases/Infections | Significance | Challenges |
|---|---|---|---|
| Identification of broadly neutralizing antibodies (BNAbs) | Infection with rapidly evolving pathogens such as HIV, Hepatitis C virus, influenza viruses |
Potential for use as vaccine targets ( Provide model system for understanding affinity maturation, coevolution, and immune development (e.g., |
Lineages are often large and diverse, often with high levels of hypermutation, long CDR3s, and poly reactivity ( |
| Study of vaccine responses | Any disease for which vaccines are used or being developed, e.g., Influenza, typhoid, or Ebola |
A model system for immune response with a known stimulus ( Identifying correlates of immune protection following vaccination |
Distinguishing vaccine-specific changes to the BCR repertoire from healthy repertoire diversity Variable immune responses among individuals to the same stimulus ( |
| Tracking B-cell migration and development within the body | Autoimmune diseases such as multiple sclerosis; cancers |
Identifies migration of B cells between tissue compartments ( Can identify the sites at which B cells mature ( |
Accurate B-cell lineage assignment Modeling potentially complex migration patterns Differential sampling between tissues |
| Disease diagnosis | Autoimmune diseases such as multiple sclerosis and rheumatoid arthritis; cancers, in particular B-cell lymphoma |
A direct and potentially cheap diagnosis tool Improved understanding of disease Provide clinical markers of disease progression ( |
Complex and multiple disease epitopes induce complex responses Low level presence of B cells associated with disease |
FChord diagrams showing the pairing of V and J segments within (a) productive and (b) nonproductive IgM sequences from a single healthy individual. Chord widths represent the proportion of sequences with a given V (colored) and J (gray) segment pairing. The five most common V segments in productive rearrangements (and all J segments) are labelled. Note that IGHV3-23/IGHJ4 was significantly more common in productive versus nonproductive rearrangements, which may indicate functional bias of that pairing. The figure was generated from data in Elhanati et al. (2015), which was aligned to the IMGT reference (Lefranc et al. 2009) using IgBLAST (Ye et al. 2013). Productive rearrangements were subsampled to the same read depth as nonproductive rearrangements (∼2 × 105 reads); the values displayed in (a) are means of 100 subsampling repetitions.
FNetwork diagrams that visualize the diversity and clonal structure of BCR sequences. BCR sequences were obtained from (a) two healthy people, (b) two individuals infected with HIV-1, sampled during early infection, and (c) two patients with chronic lymphocytic leukemia, a B-cell cancer. Each point/circle represents a unique BCR sequence, the size of which is proportional to how common that sequence is. Edges are drawn between pairs of sequences that differ by exactly one nucleotide change. Note that samples do differ by read depth (approximately 3.4 × 104 and 3.6 × 104 for part [a], 9.2 × 104 and 3.6 × 105 for part [b], 5.1 × 104 and 2.6 × 104 for part [c]). Parts (a) and (c) are reproduced with permission from Bashford-Rogers et al. (2013) and part (b) from Hoehn et al. (2015).
FMaximum-likelihood phylogenetic tree of the VRC01 BNAb lineage sampled at ten time points over 15 years of diversification within a single individual infected with HIV-1. Each tip represents a BCR heavy chain sequence; terminal branches are colored by time point of sampling (see key). The red circle at the root represents the germline sequence (IGHV1-2*02 and IGHJ1*01, D region left unassigned). Note the general, but not complete, trend of increasing genetic divergence from the root with sampling time. Late-sampled sequences near the root indicate very high rate heterogeneity among lineages; these sequences might represent inactive memory B cells. BNAb sequences were obtained through cell sorting (Wu et al. 2010) followed by high-throughput sequencing data to identify related BCRs (Zhou et al. 2013). See Wu et al. (2015) for full experimental details. Sequences for this tree were obtained from GenBank (Wu et al. 2015) and aligned using MUSCLE (Edgar 2004). A maximum-likelihood phylogeny was estimated using the GTRGAMMA substitution model in RAxML (Stamatakis 2014), and rerooted to position the germline sequence at the root with a divergence of zero. Scale bar represents genetic distance (expected changes per nucleotide site).
FObserved mutation (sequence difference from germline) frequency among productive heavy chain immunoglobulin sequences across the V-gene sequence (horizontal axis, IMGT unique numbering). The distribution of mutations across the region is strongly nonuniform, with mutations more likely to occur at certain positions. The CDR2 region (middle shaded box) has a high rate of observed mutations and is thought to be more important in antigen binding than the surrounding framework regions (FWR2 and FWR3). This figure was generated from the same data set as figure 1.