| Literature DB >> 35626377 |
Hannah Kockelbergh1, Shelley Evans2, Tong Deng2, Ella Clyne2, Anna Kyriakidou3, Andreas Economou3, Kim Ngan Luu Hoang2, Stephen Woodmansey2,4, Andrew Foers5, Anna Fowler1, Elizabeth J Soilleux2.
Abstract
Measuring immunity to severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2), the causative agent of coronavirus disease 19 (COVID-19), can rely on antibodies, reactive T cells and other factors, with T-cell-mediated responses appearing to have greater sensitivity and longevity. Because each T cell carries an essentially unique nucleic acid sequence for its T-cell receptor (TCR), we can interrogate sequence data derived from DNA or RNA to assess aspects of the immune response. This review deals with the utility of bulk, rather than single-cell, sequencing of TCR repertoires, considering the importance of study design, in terms of cohort selection, laboratory methods and analysis. The advances in understanding SARS-CoV-2 immunity that have resulted from bulk TCR repertoire sequencing are also be discussed. The complexity of sequencing data obtained by bulk repertoire sequencing makes analysis challenging, but simple descriptive analyses, clonal analysis, searches for specific sequences associated with immune responses to SARS-CoV-2, motif-based analyses, and machine learning approaches have all been applied. TCR repertoire sequencing has demonstrated early expansion followed by contraction of SARS-CoV-2-specific clonotypes, during active infection. Maintenance of TCR repertoire diversity, including the maintenance of diversity of anti-SARS-CoV-2 response, predicts a favourable outcome. TCR repertoire narrowing in severe COVID-19 is most likely a consequence of COVID-19-associated lymphopenia. It has been possible to follow clonotypic sequences longitudinally, which has been particularly valuable for clonotypes known to be associated with SARS-CoV-2 peptide/MHC tetramer binding or with SARS-CoV-2 peptide-induced cytokine responses. Closely related clonotypes to these previously identified sequences have been shown to respond with similar kinetics during infection. A possible superantigen-like effect of the SARS-CoV-2 spike protein has been identified, by means of observing V-segment skewing in patients with severe COVID-19, together with structural modelling. Such a superantigen-like activity, which is apparently absent from other coronaviruses, may be the basis of multisystem inflammatory syndrome and cytokine storms in COVID-19. Bulk TCR repertoire sequencing has proven to be a useful and cost-effective approach to understanding interactions between SARS-CoV-2 and the human host, with the potential to inform the design of therapeutics and vaccines, as well as to provide invaluable pathogenetic and epidemiological insights.Entities:
Keywords: COVID-19; SARS-CoV-2; T-cell receptor repertoire; antibody; coronavirus; diversity; immune response; immunological memory; immunoreceptor; machine learning
Year: 2022 PMID: 35626377 PMCID: PMC9140453 DOI: 10.3390/diagnostics12051222
Source DB: PubMed Journal: Diagnostics (Basel) ISSN: 2075-4418
Figure 1V(D)J recombination determines T-cell receptor specificity. The TCR specificity of αβ T cells is determined by the unique V(D)J recombination events that occur during the development of each T cell. During this process, V, D and J gene segments are randomly selected and are spliced together on the β chain, while the α-chain rearrangement of the V–J gene segments occurs in a similar process. During this process, the random addition or deletion of nucleotides can occur at segment junctions. The complementarity-determining region 3 (CDR3) encoded by sequences located in the V(D)J junction has the greatest diversity and is what determines the antigen specificity of each TCR. TCRβ: T-cell receptor beta; CDR3β: the gene sequence encoding the complementarity-determining region 3 of the TCR beta chain.
Principles of laboratory methods for TCR repertoire sequencing.
| Method | DNA/RNA | Principles | Advantages | Disadvantages | Examples of |
|---|---|---|---|---|---|
| 5′RACE | RNA |
Extra 5′ dCTP nucleotides are added to the sequence during cDNA synthesis using a modified Moloney Murine Leukaemia Virus (MMLV) reverse transcriptase allowing 3′ dGTPs and 5′ adaptor incorporation [ The cDNA undergoes two rounds of PCR using one 5′ adaptor sequence and on 3′ TCR constant region primer [ NGS adaptors are added by ligation or are incorporated into primers during the second round of PCR [ |
Using only one set of oligonucleotides per PCR reaction greatly reduces PCR bias, while the 3′ TCR constant region primer significantly increases specificity [ The addition of molecular barcoding (unique molecular indices or UMIs), together with the adaptor, helps to correct amplification bias at the analysis stage [ |
Relatively complex workflow. | iRepertoire, |
| Multiplex PCR | DNA, RNA |
DNA:NGS adaptors can be incorporated into the original multiplex primers to make it a one-step process. RNA: An initial cDNA synthesis step is performed using either non-specific primers [ The multiplex PCR is then performed using multiple primers for each known V and either J (genomic DNA (gDNA)) or C (cDNA) region [ Some methods a second round of PCR is used to add NGS adaptors |
Simple pipeline for DNA; slightly more complex for RNA. A one-step PCR process can generate a library if adaptors (+/− UMIs) are included in the primers [ Multiplex PCR amplification and Illumina-based NGS are also offered by some companies, such as Adaptive Biotechnologies and Invivoscribe Inc., as a service [ |
The J region rather than the C region is used for the reverse primer for gDNA. Due to the large intron found between these regions [ UMI incorporation is also more difficult than for 5′RACE. | Adaptive Biotechnologies |
| Hybridisation capture | DNA, RNA |
Genomic DNA or cDNA derived from mRNA require an initial fragmentation/ sonication step [ End repair/A tailing is followed by ligation to adaptors with or without UMIs [ A set of specialised biotinylated complementary oligonucleotide baits, specific for the locus of interest, is hybridised to the DNA library, permitting capture of the target sequence [ A final round of PCR is done to release the library from the baits [ |
Easy addition of UMIs, makes it a very powerful process to analyse BCR/TCR repertoire and has even shown promise in allowing more than one locus to be targeted in one reaction [ Likely to avoid PCR bias. |
Relatively complex workflow. Risk of capturing many unrearranged sequences. | Bespoke approaches |
Diversity-based methods of TCR repertoire analysis.
| Analytical Approach | Principles/Interpretation |
|---|---|
| CDR3 length profiles [ | Assumed to be Gaussian distributed Clonal expansions/depletions skew the distribution |
| VDJ usage | Over/under representation of specific gene segments an indication of immune response |
| Clonal abundance | Number of times a clonotype appears in a sample (ideally excluding PCR duplicates) Identify highly abundant sequences as clonal |
| Clonal frequency | Percentage of CDR3 sequences represented by a specific clonotype (excluding PCR duplicates) Identify sequences that comprise a large percentage of all the repertoire as clonal |
| Richness [ | Total number of unique clonotypes High or low numbers of unique clonotypes indicative of immune irregularities |
| D50 diversity [ | Minimum percentage of unique clones amounting to 50% of the total sequences Low percentage indicative of low diversity and clonality |
| Simpson diversity [ | Estimates the probability of any two randomly sampled TCRs having different clonotypes Clonal populations have high values |
| Shannon diversity [ | Assesses the richness and unevenness of a TCR repertoire, the number of clonotypes and differences in their frequencies Higher values denote a more diverse clonotype distribution |
| Hill’s diversity (Hill’s evenness) [ | Describes the effective number of clonotypes within a sample |
| Pielou’s evenness index [ | Shannon diversity index divided by maximum possible Shannon diversity index Indicates the degree to which different clonotypes are equally represented in the sample |
| Parametric methods [ | Assume underlying distribution of TCR clones, commonly Poisson or Zipfian Broad properties of the repertoire inferred from fitted model parameters |
TCR clustering methods. indicates that the TCR clustering method uses the feature to define a cluster.
| Method | Features | |||||||
|---|---|---|---|---|---|---|---|---|
| V(D)J | CDR3s | Short Motifs | Physio-Chemical Properties | Amino Acids | Nucleotides | Frequency | Enrichment | |
| GIANA [ | ||||||||
| ALICE [ | ||||||||
| clusTCR [ | ||||||||
| GLIPH2 [ | ||||||||
| iSMART [ | ||||||||
| TCRdist [ | ||||||||
| TCRNET [ | ||||||||
| ImmunoMap [ | ||||||||
| MiXCR [ | ||||||||
Figure 2Overview of machine learning approaches. (A) Training data and training labels are used to train the model by obtaining its optimal parameters. The model with initial parameters makes predictions for the training data. These predictions are compared to the training labels and the error between them is calculated. The model parameters are updated to correct for the error. These steps continue until the error cannot be made smaller and the model is trained. (B) A trained model should be tested with a separate set of data and labels called the testing data. Positive or negative predictions are made for the testing data using the trained model with its optimal parameters. Each prediction is compared to the positive or negative testing label and categorised as a true positive (TP), false positive (FP), true negative (TN) or false negative (FN). The model’s sensitivity can be estimated as TP/(TP + FN), specificity as TN/(TN + FP) and accuracy as (TP + TN)/(TP + FP + TN + FN).
Figure 3Schematic overview of insights into TCR repertoire, observed over time after infection, obtained by bulk TCR repertoire sequencing. MIS: Multisystem Inflammatory Syndrome.
Biological insights into COVID-19 from bulk TCR repertoire sequencing.
| First Author | Number of | Cells | DNA/RNA | Loci | Key Points |
|---|---|---|---|---|---|
| Chang [ | 3 COVID-19 patients with mild disease 6 with pneumonia | PBMCs | RNA | TRB | SARS-CoV-2-associated TCR clusters exhibited significantly higher TCR generation probabilities and most were public compared to those from pneumonia Different patterns of CDR3 sequence motifs in SARS-CoV-2-associated TCR clonotypic clusters. |
| Cheng [ | 38 patients with mild to moderate COVID-19 disease 8 patients with severe disease (drawn from the Schultheiss cohort [ | PBMCs | DNA | TRB | Four TRB gene segments were overrepresented in severe COVID-19 patients. Computational models demonstrated that the spike protein of SARS-CoV-2, exhibits a high-affinity motif for binding TCRs and may form a ternary complex with MHC-II, permitting it to behave similar to a superantigen, such as staphylococcal enterotoxin B. |
| Minervina [ | 2 patients post mild COVID-19 5 time points | CD4+ and CD8+ T cells | RNA | TRB | CD4+ and CD8+ T-cell clonotypes undergo transient clonal expansion after infection, with similar kinetics, the majority acquiring memory phenotypes, with clonal contraction after day 15. By day 30 post-infection, most pre-infection central memory clones were detected in the effector memory subpopulation. |
| Niu [ | 10 patients, early stage to recovery COVID-19 4 time points 15 healthy controls | PBMCs | RNA | TRB | Low number of TCR sequencing reads in early disease with gradual increase in clinical improvement, especially during convalescence, when some dominant clones remained. Number of TRB sequencing reads increased to the same level as healthy controls after recovery. |
| Rajeh [ | 6 patients post-COVID; 2 time points 4 pre-pandemic controls | PBMCs | RNA | TRB | 100% classification accuracy achieved in predicting previous SARS-CoV-2 infection and thus likely immunity. |
| Schultheiss [ | 19 patients recovered from mild disease 20 patients with active infection, severe disease 39 age-matched healthy controls | PBMCs | DNA | TRB | 150 clonotypic clusters in COVID-19 patients were identified that are likely of pathophysiological relevance. The longitudinal monitoring of one patient during active disease and recovery identified clonotypes that expanded during the patient’s successful immune response towards SARS-CoV-2. These clonotypes encompassed amino acid motifs that were also shared by other patients at recovery. T-cell repertoires of patients with a mild clinical course who recovered from COVID-19 were highly diverse. |
| Shomuradova [ | 34 recovering patients 2 time points 14 healthy donors 20 pre-pandemic controls | CD4+ and CD8+ T cells | RNA | TRB | Healthy donors during the pandemic had increased numbers of SARS-CoV-2-specific T cells, but no antibody response, likely indicating prior asymptomatic infection or activation of pre-existing immunity. Some convalescent patients had anti-SARS-CoV-2 TCRs, but no detectable antibody response. In convalescent patients, there was a public and diverse, high Pgen T-cell response to SARS-CoV-2 epitopes. CD4+ and CD8+ T-cell responses to the spike protein were mediated by groups of homologous TCRs, some of them shared across multiple donors. Hundreds of TCR motifs/ clonotypic clusters were identified, 25 of which were shared across multiple donors. For 19 of these, a potential cognate epitope or restricting HLA allele could be predicted. |
| Shoukat [ | 19 patients recovered from mild disease 39 age-matched healthy controls Obtained from Schultheiss [ | PBMCs | DNA | TRB | Accurate sample classification possible on the basis of TCR repertoires (training accuracy 96.4%; validation accuracy 92.9%), but not BCR repertoires (training accuracy 74.5%; testing accuracy 47.3%). |
| Sidhom [ | 179 patients with mild disease 106 patients with severe disease Drawn from the ImmunoCode database | PBMCs | DNA | TRB | Total T cells, total nucleic acid template and total numbers of rearrangements were lower in severe versus mild infection, during the peak of infection. The 25 most predictive sequences for severe infection contained amino acids most predictive of disease severity in the central part of the CDR3 sequences. Using MIRA, specific SARS-CoV-2 antigen specificity was predicted and shown to differ (a) between CD4 and CD8 T cells and (b) between individuals with mild and severe disease. Able to construct an epitope-specific classifier to predict whether patients had mild or severe disease. |
| Swanson [ | 233 vaccinated patients; samples collected pre- and post-vaccination | PBMCs | DNA | TRB | A significant increase in the fraction of total T cells and fraction of unique TCRs that were spike protein-specific 28 days post second dose. Breadth and depth increases were comparable to COVID-19 convalescent patients. CD4 responses mapped to a broad range of parts of the spike protein, but CD8 responses were more restricted, most likely due to HLA restriction. |
| Wang [ | 9 patients of 2 weeks convalescence 20 patients of 6 months convalescence 5 healthy controls | CD4+ and CD8+ T cells | RNA | TRB | TRBV6-5-TRBD2-TRBJ2-7 is the most enriched V(D)J gene-segment combination in both CD4+ and CD8+ T cells among patients. Identified identical CDR3 motifs of the TRA and TRB from CD8+ T cells that are significantly enriched in convalescent patients. |
| Hu [ | 5 recovered volunteers at least 14 days post recovery 5 unexposed healthy donors | PBMCs | RNA | TRB | Identified 6 prevalent CDR3 amino acid patterns among sorted CD8+ T cells in recovered patients. No cross-reactive memory T cells identified in unexposed healthy donors. |
| Simnica [ | Blood samples: 140 from unrelated COVID-19 patients 140 pre-pandemic age-matched controls Brain tissues: 5 deceased patients with COVID-19, 40 brain tissue sections | PBMCs | DNA | TRB | Previously reported public TCRs in COVID-19 patients shown to have only slightly higher frequencies in COVID-19 patients than that of unexposed controls. 68 different clonotypes identified from brain-derived T cells of COVID-19 patients, which have a public nature and have potential for diagnosis. |
| Shimizu [ | 19 unexposed healthy donors with cross-reactive CD8+ T cells | CD8+ T cells | DNA | TRB | Identified the immunodominant S-protein epitopes of SARS-CoV-2 that is responsible for the activation of cross-reactive CD8+ T cells in HLA-A24 people who have not been exposed to SARS-CoV-2. |
| Li [ | 54 COVID-19 patients in different phases (asymptomatic, symptomatic, convalescent, and re-detectable positive cases) 16 healthy donors | PBMCs | RNA | TRB | Identified unique V–J-gene usage in asymptomatic and re-detectable positive cases. No HLA haplotype was found to be significantly correlated with disease stages. |