Literature DB >> 24441473

Beyond model antigens: high-dimensional methods for the analysis of antigen-specific T cells.

Evan W Newell1, Mark M Davis2.   

Abstract

Adaptive immune responses often begin with the formation of a molecular complex between a T-cell receptor (TCR) and a peptide antigen bound to a major histocompatibility complex (MHC) molecule. These complexes are highly variable, however, due to the polymorphism of MHC genes, the random, inexact recombination of TCR gene segments, and the vast array of possible self and pathogen peptide antigens. As a result, it has been very difficult to comprehensively study the TCR repertoire or identify and track more than a few antigen-specific T cells in mice or humans. For mouse studies, this had led to a reliance on model antigens and TCR transgenes. The study of limited human clinical samples, in contrast, requires techniques that can simultaneously survey TCR phenotype and function, and TCR reactivity to many T-cell epitopes. Thanks to recent advances in single-cell and cytometry methodologies, as well as high-throughput sequencing of the TCR repertoire, we now have or will soon have the tools needed to comprehensively analyze T-cell responses in health and disease.

Entities:  

Mesh:

Substances:

Year:  2014        PMID: 24441473      PMCID: PMC4001742          DOI: 10.1038/nbt.2783

Source DB:  PubMed          Journal:  Nat Biotechnol        ISSN: 1087-0156            Impact factor:   54.908


From the advent of clonal selection theory[1,2] to the present day, it has become increasingly clear that the adaptive immune response has, as its central unit, the expression of a single rearranged immunoglobulin or TCR on each B or T cell. And that in general, single cells are the operational units or ‘quanta’ of immunity. With respect to T lymphocytes, this means that understanding their role in immune responses requires comprehensive methods of interrogating the phenotypic and functional characteristics of individual T cells. In this regard, the use of flow cytometry for high-throughput analysis of individual T cells has been the gold standard for many years[3]. Gradual improvements in flow cytometry allowing simultaneous assessment of expression of surface and intracellular markers[4] and the precise temporal patterns of cytokine expression by T cells[5-7] have enabled studies on the relationships between T-cell phenotype/function and clinical status in a range of diseases[8-14]. The study of antigen-specificity, however, is complicated by enormous variability and unpredictability in terms of the epitopes targeted by T cells in any given T-cell response, especially considering the highly polymorphic nature of the MHC, and the fact that intact pathogens typically encode a wide variety of potential T cell epitopes[15]. Furthermore, as the breadth or number of epitopes targeted by the T cell response can be important, especially in rapidly evolving viral infections[16-18], and the phenotypes of T cells targeting different epitopes from the same pathogen can vary significantly[19,20], it is important to be able to monitor recognition of numerous epitopes in the response to each pathogen. As a result, the number of parameters analyzed in any given experiment continues to grow beyond the number of colors (12–15) available for fluorescence-based flow cytometry, making the latter type of analysis increasingly arduous or even impossible. Recent developments in methods for analyzing antigen-specific T cells that extend these limits exploit multiplexing and single-cell mass spectrometry-based ‘mass cytometry’[20-24]. Other emerging technologies that promise to dramatically increase both the speed and depth of information that one can obtain about T-cell responses include techniques allowing the analysis of single-cell mRNA transcripts[25,26]. In addition, unlike most mouse models of immunological diseases, wherein the identity of the antigenic epitopes that drive disease initiation and/or progression are known, the instances of human immunological diseases wherein the precise specificities of T cells involved are known remain relatively rare. Therefore, until precise antigenic epitope specificities can be determined, study of these human T cell responses requires alternative approaches; none appear to be more powerful than high-throughput sequencing of TCR repertoires. Data generated by this approach are providing insights into T-cell selection and the nature of repertoire diversity in various T-cell subsets in normal and pathological circumstances[27,28]. TCR sequencing approaches also allow the identification and tracking of TCR clonotypes or motifs involved in immune responses and various pathologies[29-31]. Moreover, high-throughput yeast-display approaches represent a way to identify pMHC ligands that bind to these TCR clonotypes or motifs[32,33]. These approaches hold promise for identifying relevant antigens for immune responses for which relevant antigens are currently completely unknown. For instance, identification of antigens targeted by T cells in patients with auto-inflammatory diseases could facilitate the development of novel treatment options. In this Review we discuss the advantages, disadvantages and complementarity of these high-dimensional approaches for the study of antigen-specific T cells. Common to each approach is the goal of understanding and/or exploiting the specificity of the T-cell mediated immune response to manipulate or predict outcomes of immunological diseases or vaccine responses. These recent technological advances seem poised to finally make possible the comprehensive analyses of T-cell responses.

Analyzing T-cell phenotypic and functional diversity

Each individual αβ T cell expresses one of as many >10[14] different TCR heterodimers[34] and each of these TCRs is specific for a very small fraction of possible self or foreign antigens presented in the context of an individual's MHC molecules (Fig. 1a). Thus, in terms of diversity of antigen-specificity alone, T cells are one of the most diverse cell subsets in the body. Several approaches for analyzing this diversity exist, and each has advantages and disadvantages (Table 1).
Figure 1

Antigen recognition by T cell receptor and probing antigen specificity with peptide-MHC multimers. (a) Antigen-specific T cell responses are initiated through the interaction of TCR, expressed on T cells, and the corresponding petide-MHC protein complex expressed by antigen-presenting cells. TCR engagement initiates a complex cell signaling cascade the results in T cell activation. (b) Binding of TCR to its specific peptide-MHC ligand is very low (∼1-100 μM) and has very fast dissociation kinetics (t1/2 usually much less than a minute). Thus, monomeric staining reagents are insufficiently stable for the detection of antigen-specific T cells. In contrast, by taking advantage of cooperative binding, multimeric complexes of peptide-MHC allow for remarkably sensitive and accurate detection of antigen-specific T cells[57,58]. (c) Mass cytometry and dimensionality reduction methods allow integrated analysis of T cell phenotype and function. In the example shown here, to visualize diversity of the peripheral blood CD8+ T cells, 25 parameters were measured for each cell including 16 phenotypic markers and 9 functional markers. Principal component analysis (PCA) was applied to generate three aggregate parameters describing ∼60% of total variance. A representative donor's 3D-PCA plot with cells annotated based on previously defined stringent criteria for naïve, central-memory (Tcm), effector-memory (Tem) and short-lived effector cells (Tslec). Adapted from ref 22. (d) To illustrate the phenotypic and functional meaning of a non-naïve cell progression along the PC2 axis of the PCA plot, average expression of (left) phenotypic and (right) functional parameters were normalized and plotted as a function of normalized PC2 values. This unsupervised analysis provides a hypothetical framework for graded T cell differentiation involving progressive gains and/or losses of surface marker expression and functional capacities. Adapted from ref [22]. (e) To illustrate the power of non-linear dimensionality approaches, a linear PCA analysis of bone-marrow derived cells colored by a number of user-defined cell subsets is compared to two different non-linear approaches, Isomap and viSNE. Adapted from ref [38].

Table 1

Comparison of approaches for analysis of individual T lymphocytes.

Flow cytometryMass cytometrySingle-cell transcriptomics
Number of parameters per cell∼10-20∼40 more possibleUp to full genome (>10,000)
Cellular throughput10's of thousands per secondHundreds per secondVery low, Maximum ∼100 cells
Sorting possibleYesNoRequired for most approaches
Good antibodies requiredYesYesNo
Sensitive and quantitative+++++Difficult especially for low abundance transcripts
Ability to evaluate antigen-specificityUp to ∼30 specificities with combinatorial tetramer staining>100 specificities with combinatorial tetramer stainingNo. Possible with TCR sequence.
T cell receptor sequence information possibleYes with sortingNoYes

Flow cytometry

As mentioned above, flow cytometry has been the leading method for measuring this diversity over the past three decades. The latest instruments are remarkably powerful, capable of analyzing and/or sorting cells based on up to ∼18 cellular parameters at >10,000 cells per second[4]. Such high-dimensional and high-throughput analysis revealed relationships between T-cell phenotype/function and clinical status for a range of diseases. For instance, in several infectious disease settings, a correlation between pathogen control and poly-functionality (that is, the capacity to perform more than one effector function, such as producing more than one cytokine) of the corresponding antigen-specific T cells have been identified[8-13]. As less simplistic relationships between disease status and antigen-specific T cell phenotypic/functional status must also exist, a great deal of effort is being dedicated to developing computer algorithms that can analyze and extract useful information from this otherwise impossible-to-understand high-dimensional flow cytometry data[35-38]. Human T cells are often segregated into four major categories based on surface markers that indicate their proliferative potential, cytotoxicity capacity, and their ability to produce cytokines. They are the following: naïve (CCR7+CD45RA+), central memory (CCR7+CD45RA–), effector memory (CCR7–CD45RA–), and terminal effector (CCR7–CD45RA+)[39]. Although simple, useful and well accepted, it is clear that much more heterogeneity exists than is captured by these subdivisions. For instance, expression of all possible combinations of CD27, CD28, CD62L and CCR7 are also observed, yet the significance of all of these distinctions is not clear[40]. Also, even cells fitting the most strict definition of naïve (CD45RA+CD45RO–CD27+CCR7+CD62L+CD28+), yet expressing higher levels of CD95, interleukin 2 receptor subunit-β (IL-2Rβ), CXC-motif chemokine receptor 3 (CXCR3) and leukocyte function-associated antigen-1 (LFA-1), have been designated stem cell-like memory cells with high proliferative capacity[41]. T cells can also be classified by the their functional capacities (e.g., abilities to produce various cytokines and effector molecules), transcription factor expression profiles[42], as well as markers indicative of their tissue trafficking potential[43]. Combined with antigen-specificity, as discussed below, the number of parameters that can be clearly distinguished by even the latest fluorescent flow cytometers is insufficient for an integrated and comprehensive view of T-cell diversity.

Mass cytometry

Thus, the arrival of mass cytometry has allowed a quantum leap in phenotypic and functional characterization of single T cells. The mass spectrometry-based flow cytometry method (CyTOF) uses isotopically purified heavy metal atoms, instead of fluorophores, as tags [44]. Whereas the number of parameters available to fluorescent flow cytometry is limited by broad spectral overlap leading to crosstalk between fluorescent channels, in CyTOF many more parameters (now ∼40 and dozens more should be possible in the near future) can be distinguished. In addition, thanks to reduced crosstalk between channels due to accurate resolution of tags differing by as little as a single atomic mass unit, the analysis and interpretation of CyTOF data is greatly simplified[45]. In terms of sample throughput, however, mass cytometry is significantly slower (capable of analyzing ∼500 cells per second) than fluorescence flow cytometry (up to tens of thousands of cells per second). Although reduced sample throughput poses a challenge to studies requiring large sample sizes or rare cells, using pre-enrichment strategies such as T cell purification methods or tetramer enrichment approaches as we have applied[20,22] make it possible to analyze even extremely rare antigen-specific T cells. Mass-tag barcoding[46] can also alleviate the throughput problem by allowing a large number (up to 96 so far) uniquely tagged (barcoded) samples to be analyzed simultaneously. After data acquisition, simple and effective software is available for deconvolution of each barcoded sample. Although mass cytometry is far from a comprehensive ‘proteomics’ method, extending cellular analysis into 40 dimensions means that each cell can be parsed into one of 2[40] (∼1 trillion) possible bins, allowing cells to be classified in unprecedented detail. It also means that a wide variety of T-cell markers can be can be assessed simultaneously providing a view of the overall diversity of a sample of cells. We have used mass cytometry to broadly probe the relationship between CD8+ T cell phenotype, function and specificity by simultaneously assessing several markers of each. For example, after using some of the then-available parameters to isolate single CD8+ T cells, we probed six different antigen-specificities using peptide–MHC tetramers (discussed below, see Fig. 1b), and used the remaining 25 parameters to evaluate expression of 16 surface and nine functional markers[22]. In that study, we chose the parameters and the timing of assessment to maximize the amount of information obtained form each cell and to reveal variation. Nonetheless, a caveat of this approach is that only a snapshot of information from a single time-point taken from a single blood sample is available for each cell. Accurately tracking the kinetics of cellular responses requires an ability to continuously monitor single cells (as described below for the kinetics of cytokine secretion and in references[5-7]). As there is a great deal of interest in T-cell poly-functionality as an index of the strength of an immune response, in the same study we took advantage of the large number cellular parameters available to mass cytometry by assessing nine different functional capacities (eight cytokines and a marker of granular release, CD107), in populations of activated human CD8+ T cells. This gave us 512 (2[9] ) possible combinations and we were able to observe at least 242 of these, indicating that a vast number of possible functional combinations can be expressed. Furthermore, by using MHC tetramers bearing influenza, cytomegalovirus (CMV) and Epstein-Barr virus (EBV) epitopes together with these nine functional markers, we showed that T cells specific for each of the different viruses expressed partially overlapping but distinct combinations of cytokines, ranging from 50 to 100 different combinations of cytokines observed. Although only a few antigen-specificities have been analyzed in this manner so far, these results indicate that T-cell responses are far more complex than previously thought, at least for CD8+ T cells, and that they seem tailored to the particular pathogen[22]. To integrate both phenotypic and functional capacities of the cells, we also combined all the measured information about each cell and performed dimensionality reduction using principal component analysis (PCA). PCA uses weighted combinations of each of the measurements to create composite parameters that maximally represent the data with a minimal number of new parameters. This approach provides an unbiased composite representation of the cellular diversity that incorporates both phenotypic and functional information. It also provides insights about how these markers, and different cell subsets, are related to each other (Fig. 1c, d)[22]. However, the represented pattern depends heavily on the cellular markers chosen for the analysis and on the composition of the cells being analyzed. We anticipate that this and other challenges of analyzing high-dimensional cellular data will improve as new computational methods are developed and their utilities compared for various applications[35]. For example, more sophisticated analogs of principal component analysis that combine the measured parameters in a non-linear fashion are capable explaining more variation in fewer dimensions and should enable better resolution of T-cell diversity[38] (Fig. 1e). Advantages of both fluorescence and mass cytometry include their ability to accurately and sensitively quantify protein levels at the individual cell level in rapid succession (hundreds to tens of thousands per second). However, their dependence on the availability of reliable antibodies specific for each protein severely limits their utility for discovery of novel proteins of interest. Furthermore, though the number of channels continues to increase, even for mass cytometry the number of isotopes in the periodic table represents a hard cap on the number probes possible by this approach.

Single T cell phenotyping and temporal analysis using microwells

Another important new technique was developed by Love and colleagues, who spread T cells onto slides engineered to contain 100,000 microwells, such that many wells are occupied by only a single cell[5-7]. These cells can then be stimulated and many potential cytokines analyzed using antibody capture. This system revealed something unique, which is that the individual T cells typically make multiple cytokines, but not necessarily at the same time and over a several day timespan[14]. This temporal complexity is not captured in other systems and takes advantage of the unique ability of this system to analyze the output of the same cells over time, and in a very high-throughput manner.

Single cell transcription profiling

Going forward, many of the limitations of flow cytometry and mass cytometry will be overcome through the use of single cell gene transcription profiling. However, because copy numbers of mRNA can be as low as just a few copies per cell, single cell gene expression analysis remains a challenge and the data need to be interpreted carefully[48]. Nonetheless, after careful validation, higher-throughput (up to 96 cells at a time) single-cell quantitative reverse transcriptase PCR (RT-PCR) systems are already allowing up to 96 mRNA transcripts to be analyzed in each cell in a quantitative fashion[25,37,49] (Fig. 2). Highlighting diversity in cellular mRNA transcript levels, this approach proved to be useful in distinguishing gene expression profiles of tetramer-stained and sorted antigen-specific T cells responding to the same antigen in the context of different gene-based vaccination[49]. Though it is a dramatic improvement over bulk analysis without single-cell resolution (Fig. 2), the analysis of only 96 cells at a time severely limits the current applicability of this approach. Here we anticipate that implementation of barcoding-based technologies (as discussed below) will gradually increase throughput and broaden the applicability of single-cell transcriptional profiling.
Figure 2

Single cell analysis can reveal heterogeneity in gene expression among T cells. The data shown here were generated using high-throughput BioMark™ microfluidics for multiplex quantitative RT-PCR analysis[25]. (a) After some careful optimization relative single cell gene expression can be validated by plotting compared to relative expression levels obtained from ‘bulk’ analysis of 100 cells. Good correspondence indicates reliable single cell gene expression measurements. (b) The value of single cell analysis is illustrated by comparing expression of CXCR5 and CCL5 mRNA in single T cells and in ‘bulk’ populations of 100 T cells. Averaging expression of genes across 100 cells masks diversity revealed at the single-cell level. (c) Further illustrating the power of the single-cell approach, expression levels of 24 genes (columns) in 163 cells (rows) is shown. In this example, half of the cells were stimulated prior to analysis (noted as ‘Activated’); the other half were unstimulated (‘Resting’). Unsupervised clustering discriminated all but six cells (noted with X on y-axis) into activated vs. resting categories based on gene expression. Adapted from ref [25].

Also exciting is the emergence of single-cell RNA sequencing methods that allow genome wide mRNA transcript quantification[26,48,50]. The use of barcoded primers used to multiplex the analysis of a large number cells in a single DNA sequencing reaction will surely broaden the utility of the single-cell transcriptomics approach[48]. Although it will always be important to evaluate cellular protein levels instead of relying strictly on mRNA levels, accurate genome-wide transcriptomic information from single cells, if of sufficient throughput and sensitivity, will add insight to that gained using the proteomic methods discussed above; it might also enable selection of markers to analyze using those methods. That said, cytometry methods able to directly evaluate T cell antigen-specificity by using peptide-MHC multimers, as discussed in the next section, will remain essential.

Characterizing T-cell antigen specificity

Studies of human immunology are limited by the availability of cellular material, which is usually restricted to small volumes of blood. Furthermore, to varying extents, depending on the immune response of interest, responsive T cells often represent a very small minority of total blood lymphocytes. Thus, to achieve sufficient signal above the inherent noise of complex blood samples, it is imperative to restrict analysis to the relevant cells—ideally through identification of T cells specific for the antigen of interest. However, as mentioned above, the precise antigen-specificities of T cells are only known for a handful of immune responses. Without knowledge of precise antigenic epitopes, only indirect methods relying on a cellular response (e.g., proliferation, cytokine production, surface marker expression changes[8,51-56]) to stimulation with whole-antigen can be used. However, in these approaches, the analysis is ‘single-plex’ in that each sample of cells can only be interrogated with a single antigen (or single mixture of antigens). Thus, epitope mapping is either very low resolution or requires large volumes of sample. Detecting rare antigen-specific cells is also problematic especially if any other cells in the sample are activated and contribute to background signal. Lastly, depending on the method of detection (unless high-dimensional approaches are used, see previous section), the phenotypic and/or functional profile of the antigen-specific cells can be limited and influenced in unpredictable ways by the stimulation used to identity them. Improvements to methods enabling direct detection antigen-specific cells are overcoming each of these limitations. As a result of their cooperative binding and ability to specifically label antigen-specific T cells (Fig. 1), peptide–MHC tetramers (or multimers) are now widely used for directly identifying and characterizing antigen-specific T cells[57,58]. Several recent technical advances have made this approach especially practical, including higher-throughput production of hundreds or thousands of tetramers from one batch of prepared MHC class I protein-loaded ultraviolet (UV)-cleavable and exchangeable peptides[59,60] or cleavable and exchangeable class II-associated Ii peptide (CLIP)-loaded MHC class II proteins[61]; detection of rare antigen-specific T-cell populations using magnetic bead-based enrichment of the peptide-MHC tetramer labelled antigen-specific cells allowing the characterization of very rare T cells, even those in the naïve T cell repertoire[61,62]; and an ability to probe larger numbers of T-cell antigen specificities in a single sample using combinatorial staining approaches[23,24,63]. Most recently, we demonstrated an approach that incorporates each of these improvements and exploits mass cytometry[20]. In this approach >100 antigen specificities and >20 phenotypic or functional markers can be probed in a single blood or tissue sample (Fig. 3a-c). We used this approach to simultaneously screen for, identify and phenotypically characterize rotavirus-specific cells in blood and intestinal lymphocyte samples from healthy human donors. Parallel analysis of T cells specific for a range of previously identified viral (and other) antigens provided ‘landmarks’ for classifying and defining the phenotypic profiles the identified cells. Through this parallel analysis, and consistent with what had previously observed for EBV[19], we identified significant differences in the phenotypic properties of T cells targeting different epitopes derived from the same virus[20] (Fig. 3d, e). Assuming that it will be possible to increase the number T-cell specificities that can be probed simultaneously (reconstruction experiments show that even a thousand different specificities may be possible[20]) using this approach, and apply it to study CD4+ T cells and MHC-class II restricted antigens (McGuire, Davis, unpublished results), we anticipate that this method will enable rapid identification of antigens related to a wide range of pathologies currently characterized by poorly defined T cell responses.
Figure 3

Highly mulitplexed analysis of T–cell antigen-specificity using mass cytometry based combinatorial peptide–MHC tetramer staining. (a-b) By coding each T cell antigen specificity (each pMHC tetramer) by a unique combination of three out ten possible metal tags, up to 120 different antigen specificities can be simultaneously measured in a single sample. Cells staining positive with three and only three of the metals comprising each code can be identified as specific for the pMHC epitope corresponding to that code. Adapted from NBT 31:609. (c) This example illustrates the identification of EBV-BRLF-1-specific T cells (colored green) by virtue of their staining positive for 161Dy, 169Tm, and 175Lu after incubation with a pMHC tetramer barcoded with these metals. This panel shows just one of the 120 possible 3D dot-plots for each sample. Adapted from ref 20. (d) If T cells are also stained with antibodies specific for phenotypic and functional markers, PCA dimensionality reduction can then be used to further summarize the phenotype of T cells specific for each epitope. In this example, each dot represents T cells specific for the indicated epitope from one of the 17 different donors analyzed. T cells displaying previously defined phenotypes and specific for well-characterized antigens can be used as ‘landmarks’ on these plots to help classify the composite phenotype of T cells specific for previously uncharacterized antigens. Adapted from ref 20. (e) This example illustrates the power of this approach to accurately define the status of antigen-specific T cells. Here the phenotypic status of EBV-specific cells targeting lytic- (BRLF1, BLMF1) versus latency-associated (LMP1, LMP2) antigens are delineated suggesting that T cells specific for lytic-cycle antigens have encountered antigen more recently[20]. Adapted from ref 20.

As described above, it is currently impossible to predict a priori which of the multitude of possible T-cell epitopes will be targeted by a T cell-mediated immune response[15,64]. An ability to probe >100 or possibly >1000 candidate epitopes is helpful but alone will not solve this problem, especially for diseases where the range of possible antigens is too large. As we and others have demonstrated for rotavirus and several other viral pathogens (e.g., influenza virus, Dengue virus and human immunodeficiency virus)[20,65-68] with relatively small genomes, epitope or MHC-binding prediction algorithms[69,70] provide a means of narrowing the possibilities and have been applied with some success. However, even for these small-genome pathogens, the number of candidate epitopes can be quite large when prediction stringencies are relaxed in effort to prevent missing epitopes or when considering viruses with high mutation rates and significant epitope variation that also need to be covered. Furthermore, binding prediction algorithms perform especially poorly for MHC alleles for which there is much less peptide binding data with which to train the algorithms[71]. Sometimes it can help to broadly map epitopes through cellular stimulation-based approaches or in the case of MHC class II-restricted cells, tetramer guided epitope mapping[72] before narrowing down on the precise epitopes using peptide-MHC multimers. Another solution is to use careful transcript sequence analysis as has been done for the identification of tumor-specific mutations encoding candidate tumor antigens based on whole-exome sequencing[73]. For example, in one recent study, >400 candidate antigens were screened using combinatorial tetramer staining of tumor infiltrating lymphocytes in a lymph-node metastases to identify a single dominant epitope from a melanoma patient[74]. Lastly, direct identification of MHC-bound peptides through peptide elution[75,76] and ever-improving mass spectrometry/proteomics-based peptide sequencing should provide more accurate means of identifying candidate epitopes for a range of applications[77,78]. Although aided by the development of more sensitive and sophisticated peptide-identifying mass spectrometers[79], this approach is still currently limited by the large number antigen-presenting cells required and the associated bias against low abundance and low affinity peptide ligands. Thus, despite the ongoing improvements being made to T cell epitope discovery methods, identification of antigens for a number of diseases, such as autoimmune and auto-inflammatory diseases remain elusive. In these cases, TCR sequencing-based approaches hold promise as a reliable means of tracking and understanding antigen-specific T cell responses, even when the identities of the antigens are unknown.

TCR repertoire sequencing

The advent of inexpensive and massively parallel DNA sequencing has started to impact T-cell analysis, starting with the pioneering work of Robins and colleagues[80] to understand the scope of the human TCRβ repertoire. From these studies, it was found that natural biases in VDJ recombination lead to an overrepresentation of common rearrangement events and a higher than expected frequency of such sequences common between different donors[81,82]. However, the vast majority of unique TCR sequences are present at low frequencies, as would be expected from early estimates of possible TCR diversity[34]. Considering these data, estimates of TCRβ repertoire diversity in an individual are difficult to calculate , yet lower bounds for total TCRβ diversity in a healthy adult[27,80] are ∼3–4 × 10[6] approximately consistent with previous estimates[83]. Single TCR chain deep sequencing studies have also shown that the clonal diversity of memory cells is greater than might be expected, with only a minority of memory T cell clones clearly expanded[84]. TCR sequencing has also been applied to accurately measure the TCR repertoire diversity of responding T-cell populations in cancer[31], during human herpes virus infection[30], during CMV or polyomavirus BK (BKV) reactivation or during acute allograft rejection[85]. This technique also holds promise for investigating the extent of changes in TCR repertoire during aging[86]. This approach has also proven to be a very sensitive means of accurately detecting the presence of rare malignant T cells, such as in minimal residual disease in acute T lymphoblastic leukemia[87]. Extending this approach, several higher-throughput means of obtaining endogenous pairs of TCRα and β sequences from individual cells are emerging (Fig. 4a). These include single-cell mRNA capture and emulsion linkage RT-PCR, as recently described for simultaneous sequencing of heavy and light chains of immunoglobulin in single B cells[88], single cell barcoding as has been used for single cell mRNA sequencing[28,48] and direct cellular emulsion linkage PCR[89]. Each of these methods allows the sequencing of α and β or γ and δ TCR sequences from individual cells much more efficiently than current practice.
Figure 4

Strategies for high-throughput single-cell analysis of TCR sequences and identification of TCR ligands. (a) As demonstrated for B cell immunologlobulin genes[88], sequencing of V regions of endogenously paired TCRa and TCRb genes can be performed in high-throughput by mRNA capture and emulsion linkage RT-PCR, or by direct cellular emulsion linkage PCR[89]. Cell-specific barcoded tags can also be introduced at the single-cell stage as a general means of increasing the throughput of this type of approach[48]. (b) Libraries of yeast clones displaying 10[5]-10[8] unique peptides tethered to MHC molecules enables high-throughput screening for peptides capable of binding MHC (because these peptides result in proper folding and surface expression of pMHC complexes) and for peptides capable of binding to a TCR of interest (here the TCR is used as a tetrameric staining reagent). After multiple rounds of selection, hits are sorted, cloned, and sequenced; the peptide sequences can then be analyzed[32,33].

In terms of applications, an especially promising approach would be to use single-cell methodologies and barcoding to analyze antigen-specific T-cell responses in the many cases where the major epitopes are unknown. A glimpse of the potential insights that can be gained with this approach is found in our recently published study of T-cell responses in patients with celiac disease[29]. In this disease, patients have a strong autoimmune response to gluten, which inflames the gut and causes serious discomfort and damage. In most cases, eating a gluten-free diet eliminates the symptoms. We monitored the peripheral blood lymphocytes of celiac patients who had ingested gluten for any unusual responses. What was expected based on the work of Sollid and colleagues[90] was that gliadin (a subset of gluten proteins) specific CD4+ T cells would be activated and then enter the circulatory system approximately six days following the first gluten exposure. However, we found that populations of CD8+ T cells and γδ T cells appeared (and disappeared) with the same kinetics, all expressing gut homing receptors. We then employed a single-cell TCR amplification method using a large panel of primers to show that both the CD8+ and γδ T cell populations were highly enriched for particular sequences and that complementarity-determining region 3 (CDR3) encoded by these sequences were shared between many of the patients. This implies that all three of these different types of T cells are recognizing specific ligands in this disease and somehow coordinating their responses. It also shows that it is possible to analyze T-cell responses directly after particular immunological events (albeit with a time lag) from a blood sample and assess the clonality of the response. In this study and in an analogous study that identifyied oligoclonal skin resident T-cell populations responding to herpes virus infection[30], one chain from each T cell population (TCRα, β, γ or δ) was analyzed. However, it is also possible to amplify both chains from single T cells as described above (which we have done, Han et al. unpublished data). Adapting this methodology to a high-throughput system of single cell analysis, as has been done by Georgiou[88] and Robinson (personal communication) for plasmablasts, one could envisage a very powerful way to analyze T-cell responses to disease or vaccination that is entirely independent of other methods and requires no knowledge of the major antigens, but can lead to them.

Antigen specificity from TCR sequence?

As introduced above, it would be ideal if high-throughput sequencing of enough TCR sequences could be used to computationally predict T-cell specificity from a given TCR heterodimer sequence. This would solve many of the problems and challenges discussed in this Review. However, unfortunately, we do not anticipate this to be possible anytime soon. The reasons for this are several-fold but stem from the highly variable nature of each of the components of the TCR–peptide–MHC complex. Unlike the binding of short linear peptides to MHC, which is mediated by relatively simple motifs and mostly predictable anchor residues[91], the energetics of TCR binding to peptide–MHC rely on highly unpredictable and numerous contacts between the TCR and both the MHC and the antigenic peptide[92], although peptide residues generally contribute more than the MHC residues[93]. The challenges associated with such a problem are highlighted by difficulties in the field of predicting protein-protein interactions in general, even when there are high-resolution structures for each[94,95]. Furthermore, it is clear that such a problem has far from a unique solution for a given TCR. This is because TCRs generally have a great deal of flexibility in one or both of the CDR3 loops which are the principal determinants of peptide specificity[96]; this flexibility allows them to recognize multiple different peptides which may not share any sequence homology. For example, Garcia and colleagues solved the structures of one TCR binding to four structurally distinct peptide-MHC complexes; in each case the CDR3s of the TCR adopted a different conformation[32]. Considering these observations together with the relatively ubiquitous cross-reactivity of TCRs to homologous peptides [97,98], one can conclude that most TCRs can likely recognize many different peptides, and with very low affinities (∼10,000 lower than most affinity matured antibodies, for example). This means the “fit” between a TCR and a ligand is much less stringent and therefore significantly harder to model than the fit between an antibody and its cognate antigen. Thus, we are not optimistic about the possibility of being able to computationally predict TCR antigen-specificity solely from sequence information, at least by some form of structural prediction. An alternative may be to identify specific sequence motifs that can be associated with particular antigen responses (as found for antibody sequences in Dengue fever virus infection[99]). But for now we will have to rely on experimental approaches for identifying T-cell antigens based on cloned TCRs. Fortunately, some such approaches are already proving to be useful. One general strategy for identifying antigens bound by a particular cloned TCR is to create libraries of cells (insect/baculovirus or yeast), each expressing a single peptide–MHC antigen that can be selected by staining with tetramerized soluble TCR proteins. For this approach, one needs to know the restricting element (e.g., the appropriate MHC allele) before addressing the even larger challenge of identifying the corresponding peptide epitope. The power of this approach can be illustrated by the challenge of identifying the pMHC recognized by the TCR expressed on the murine diabetogenic T-cell clone, BDC-2.5. Insect cell display was used to identify several mimotopes (presumably irrelevant peptides also capable of stimulating these cells) [100,101] that eventually aided in the identification of the true endogenous antigen. This case was made especially difficult due to the abnormal MHC binding register of the pathogenic epitope, which was not found in any of the insect cell libraries[101]. A possibly faster approach is to generate yeast display libraries restricted to sequences derived from the pathogen of interest, as was demonstrated for influenza virus–specific human leukocyte antigen DR (HLA-DR)-restricted CD4+ T cells[102]. Extending this to large random libraries[89, 90] one can construct yeast display libraries in which a large number (10[6]–10[7]) different peptides can be expressed bound to a give MHC molecule and that these libraries can then be screened with a particular TCR to identify ligands[32,33] (Fig. 4b). This system enabled identification of a number of interesting H-2Ld-restricted ligands of well-described mouse T cell receptor clones. These included an epitope with a sequence totally unrelated to the sequence to the original antigen; although having a high affinity for the TCR, apparently due to an abnormal docking orientation this epitope failed to trigger T-cell activation[32]. Hits like this and other mimotopes may be difficult to distinguish from immunologically relevant epitopes by this approach. However, as demonstrated[101], identification of a consensus sequence from the hits searched against sequence databases containing candidate antigens should help tremendously to narrow down the possibilities especially in cases where the source of antigen is totally unknown. From this, shortlisted candidates can be used as input for a tetramer panel (as described above) or stimulation-based studies of polyclonal T cell populations from which the TCRs of interest had been identified. Thus, as large-scale yeast-display screening approaches continue to improve in throughput and depth, we are optimistic that many more elusive disease-associated T-cell antigens will be identified, possibly leading to novel therapeutics and/or diagnostics.

Concluding remarks

The complexity of T-cell responses to pathogens in outbred populations has, in most cases, severely limited their assessment. This has forced a reliance on highly constrained animal models, which cannot give us a complete picture of how the different types of T cells function during an actual disease or vaccine challenge. It has also greatly inhibited attempts to find T-cell correlates of protection in vaccine or epidemiological work. Fortunately, the technologies discussed here represent a quantum leap in our ability to capture that complexity, and thus we may finally have the tools we need to broadly assess T-cell responses in most situations and to understand their contributions to immunity in much more depth. We anticipate that these approaches will give us much better ways of evaluating new vaccines and immunotherapies in the very near future.
  100 in total

Review 1.  Immunodominance: a pivotal principle in host response to viral infections.

Authors:  Ali Akram; Robert D Inman
Journal:  Clin Immunol       Date:  2012-02-10       Impact factor: 3.969

2.  Cytometry by time-of-flight shows combinatorial cytokine expression and virus-specific cell niches within a continuum of CD8+ T cell phenotypes.

Authors:  Evan W Newell; Natalia Sigal; Sean C Bendall; Garry P Nolan; Mark M Davis
Journal:  Immunity       Date:  2012-01-27       Impact factor: 31.745

3.  Overlap and effective size of the human CD8+ T cell receptor repertoire.

Authors:  Harlan S Robins; Santosh K Srivastava; Paulo V Campregher; Cameron J Turtle; Jessica Andriesen; Stanley R Riddell; Christopher S Carlson; Edus H Warren
Journal:  Sci Transl Med       Date:  2010-09-01       Impact factor: 17.956

4.  Multidimensional analysis of the frequencies and rates of cytokine secretion from single cells by quantitative microengraving.

Authors:  Qing Han; Elizabeth M Bradshaw; Björn Nilsson; David A Hafler; J Christopher Love
Journal:  Lab Chip       Date:  2010-04-08       Impact factor: 6.799

5.  Chromogranin A is an autoantigen in type 1 diabetes.

Authors:  Brian D Stadinski; Thomas Delong; Nichole Reisdorph; Richard Reisdorph; Roger L Powell; Michael Armstrong; Jon D Piganelli; Gene Barbour; Brenda Bradley; Frances Crawford; Philippa Marrack; Sushil K Mahata; John W Kappler; Kathryn Haskins
Journal:  Nat Immunol       Date:  2010-02-07       Impact factor: 25.606

6.  Comparison of peptides bound to spleen and thymus class II.

Authors:  P Marrack; L Ignatowicz; J W Kappler; J Boymel; J H Freed
Journal:  J Exp Med       Date:  1993-12-01       Impact factor: 14.307

7.  Simultaneous detection of many T-cell specificities using combinatorial tetramer staining.

Authors:  Evan W Newell; Lawrence O Klein; Wong Yu; Mark M Davis
Journal:  Nat Methods       Date:  2009-06-21       Impact factor: 28.547

8.  The MHC class I peptide repertoire is molded by the transcriptome.

Authors:  Marie-Hélène Fortier; Etienne Caron; Marie-Pierre Hardy; Grégory Voisin; Sébastien Lemieux; Claude Perreault; Pierre Thibault
Journal:  J Exp Med       Date:  2008-02-25       Impact factor: 14.307

9.  NetMHC-3.0: accurate web accessible predictions of human, mouse and monkey MHC class I affinities for peptides of length 8-11.

Authors:  Claus Lundegaard; Kasper Lamberth; Mikkel Harndahl; Søren Buus; Ole Lund; Morten Nielsen
Journal:  Nucleic Acids Res       Date:  2008-05-07       Impact factor: 16.971

10.  Critical assessment of automated flow cytometry data analysis techniques.

Authors:  Nima Aghaeepour; Greg Finak; Holger Hoos; Tim R Mosmann; Ryan Brinkman; Raphael Gottardo; Richard H Scheuermann
Journal:  Nat Methods       Date:  2013-02-10       Impact factor: 28.547

View more
  68 in total

1.  Teaching 'big data' analysis to young immunologists.

Authors:  Joachim L Schultze
Journal:  Nat Immunol       Date:  2015-09       Impact factor: 25.606

2.  Linking the T cell receptor to the single cell transcriptome in antigen-specific human T cells.

Authors:  Auda A Eltahla; Simone Rizzetto; Mehdi R Pirozyan; Brigid D Betz-Stablein; Vanessa Venturi; Katherine Kedzierska; Andrew R Lloyd; Rowena A Bull; Fabio Luciani
Journal:  Immunol Cell Biol       Date:  2016-02-10       Impact factor: 5.126

Review 3.  Modeling the T cell immune response: a fascinating challenge.

Authors:  Penelope A Morel; James R Faeder; William F Hawse; Natasa Miskov-Zivanov
Journal:  J Pharmacokinet Pharmacodyn       Date:  2014-08-26       Impact factor: 2.745

4.  Enrich and Expand Rare Antigen-specific T Cells with Magnetic Nanoparticles.

Authors:  John W Hickey; Jonathan P Schneck
Journal:  J Vis Exp       Date:  2018-11-17       Impact factor: 1.355

5.  National Institutes of Health Consensus Development Project on Criteria for Clinical Trials in Chronic Graft-versus-Host Disease: III. The 2014 Biomarker Working Group Report.

Authors:  Sophie Paczesny; Frances T Hakim; Joseph Pidala; Kenneth R Cooke; Julia Lathrop; Linda M Griffith; John Hansen; Madan Jagasia; David Miklos; Steven Pavletic; Robertson Parkman; Estelle Russek-Cohen; Mary E D Flowers; Stephanie Lee; Paul Martin; Georgia Vogelsang; Marc Walton; Kirk R Schultz
Journal:  Biol Blood Marrow Transplant       Date:  2015-01-30       Impact factor: 5.742

6.  Hepatitis B virus-specific T cells associate with viral control upon nucleos(t)ide-analogue therapy discontinuation.

Authors:  Laura Rivino; Nina Le Bert; Upkar S Gill; Kamini Kunasegaran; Yang Cheng; Damien Zm Tan; Etienne Becht; Navjyot K Hansi; Graham R Foster; Tung-Hung Su; Tai-Chung Tseng; Seng Gee Lim; Jia-Horng Kao; Evan W Newell; Patrick Tf Kennedy; Antonio Bertoletti
Journal:  J Clin Invest       Date:  2018-01-08       Impact factor: 14.808

7.  Linking T-cell receptor sequence to functional phenotype at the single-cell level.

Authors:  Arnold Han; Jacob Glanville; Leo Hansmann; Mark M Davis
Journal:  Nat Biotechnol       Date:  2014-06-22       Impact factor: 54.908

8.  In-depth determination and analysis of the human paired heavy- and light-chain antibody repertoire.

Authors:  Brandon J DeKosky; Takaaki Kojima; Alexa Rodin; Wissam Charab; Gregory C Ippolito; Andrew D Ellington; George Georgiou
Journal:  Nat Med       Date:  2014-12-15       Impact factor: 53.440

9.  TCR contact residue hydrophobicity is a hallmark of immunogenic CD8+ T cell epitopes.

Authors:  Diego Chowell; Sri Krishna; Pablo D Becker; Clément Cocita; Jack Shu; Xuefang Tan; Philip D Greenberg; Linda S Klavinskis; Joseph N Blattman; Karen S Anderson
Journal:  Proc Natl Acad Sci U S A       Date:  2015-03-23       Impact factor: 11.205

Review 10.  Enhancing vaccine effectiveness with delivery technology.

Authors:  Marie Beitelshees; Yi Li; Blaine A Pfeifer
Journal:  Curr Opin Biotechnol       Date:  2016-03-06       Impact factor: 9.740

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.