Literature DB >> 34868047

Using the T Cell Receptor as a Biomarker in Type 1 Diabetes.

Maki Nakayama1,2,3, Aaron W Michels1,2,3,4.   

Abstract

T cell receptors (TCRs) are unique markers that define antigen specificity for a given T cell. With the evolution of sequencing and computational analysis technologies, TCRs are now prime candidates for the development of next-generation non-cell based T cell biomarkers, which provide a surrogate measure to assess the presence of antigen-specific T cells. Type 1 diabetes (T1D), the immune-mediated form of diabetes, is a prototypical organ specific autoimmune disease in which T cells play a pivotal role in targeting pancreatic insulin-producing beta cells. While the disease is now predictable by measuring autoantibodies in the peripheral blood directed to beta cell proteins, there is an urgent need to develop T cell markers that recapitulate T cell activity in the pancreas and can be a measure of disease activity. This review focuses on the potential and challenges of developing TCR biomarkers for T1D. We summarize current knowledge about TCR repertoires and clonotypes specific for T1D and discuss challenges that are unique for autoimmune diabetes. Ultimately, the integration of large TCR datasets produced from individuals with and without T1D along with computational 'big data' analysis will facilitate the development of TCRs as potentially powerful biomarkers in the development of T1D.
Copyright © 2021 Nakayama and Michels.

Entities:  

Keywords:  HLA; MHC; T cells; TCR sequencing; autoimmunity; type 1 diabetes

Mesh:

Substances:

Year:  2021        PMID: 34868047      PMCID: PMC8635517          DOI: 10.3389/fimmu.2021.777788

Source DB:  PubMed          Journal:  Front Immunol        ISSN: 1664-3224            Impact factor:   7.561


Introduction

A T cell receptor (TCR) determines antigen specificity of T cells by interacting with a peptide-major histocompatibility complex (peptide-MHC), and signals received through the TCR along with the CD3 complex are the primary components that regulate function and fate of T cells. Individual T cells express unique TCRs, and therefore TCR sequences can be used as an identifier of T cells that are specific to particular antigens and involved in immune responses. In this review, we will focus on the potential use of TCR sequences as non-cell based T cell biomarkers for type 1 diabetes (T1D), a tissue-specific autoimmune disease targeting insulin-secreting pancreatic beta cells (1–3). Several features of self-reactive T cells make it challenging to develop T cell biomarkers in diabetes (4). First, the frequency of autoreactive T cells is extremely low in the peripheral blood, estimated to be 1/105 – 1/106. Second, response to peptide-MHC by autoreactive T cells tends to be minimal compared to anti-cancer or anti-pathogen T cell responses (5, 6). Third, healthy individuals with T1D-risk MHC molecules can have autoreactive T cells that are quantitatively and functionally similar to those in T1D patients (7). TCR sequencing allows for the analysis of TCR clonotypes from tens of millions of T cells using nucleotide samples rather than living cell biospecimens and may overcome many of these challenges when appropriately utilized. Advantages provided by TCR biomarkers include (1) living T cells are not required for assays; (2) intra- and inter-assay variations due to cell conditions and operator performance are minimized; and (3) extremely infrequent and low-responding T cells are detectable by recently emerging high-throughput sequencing technologies. Here, we will review current knowledge about TCR repertoires and clonotypes specific for T1D and address the knowledge gaps to develop TCR biomarkers that can stratify individuals throughout the stages of T1D development.

Tri-Molecular Complex Consisting of TCR, Peptide, and MHC Molecules

TCRs expressed by classical T cells are composed of alpha and beta chains, both of which are formed by somatic recombination of the variable (V) and joining (J) segment genes (and diversity [D] for beta chains). In humans, 45 TRAV and 52 TRAJ genes have been identified as functional V and J segment genes for alpha chains (8, 9). Likewise, there are 49 TRBV, 2 TRBD, and 13 TRBJ functional V, D, and J segment genes in the beta chain locus (10, 11). During maturation in the thymus, individual T cells undergo rearrangement of segment genes, resulting in one V, one D (for beta), and one J segment genes assembled adjacent to each other. Since additional nucleotides are often inserted or deleted between the segments, billions of junction sequences with hundreds of different V, D, J combinations are possibly assembled (12–14). Experimentally, each adult person is expected to have over 100 million TCR clonotypes uniquely expressed by hundreds of billions of individual T cells in the body (15–19). There are three regions, called complementarity determining regions (CDR), that directly interact with peptide-MHC complexes, thereby crucial to determine antigen specificity (20–22). Two CDR regions, CDR1 and CDR2, are included in the V segment, and the CDR3 region is formed at the junction between V, D (for beta), and J segments. Amino acid residues in the CDR3 regions closely interact with peptide, and thus are considered to be important to determine antigen specificity and are often used as a property of each TCR clonotype for TCR repertoire analysis.

MHC Molecules in T1D

The major genetic determinant in susceptibility to most autoimmune diseases reside in the human MHC that contains the human leukocyte antigen (HLA) region. MHC molecules are heterodimers formed between alpha and beta chains that function to present peptides to TCRs on T cells. Class I molecules are on all nucleated cells and present antigens to CD8 T cells, while class II molecules are expressed by antigen presenting cells (e.g. B cells, dendritic cells, and macrophages) and present peptides to CD4 T cells. In T1D, specific HLA class I and II alleles are associated with increased risk (23, 24). Several HLA class I and II alleles confer risk for T1D and are associated with other autoimmune disorders ( ) (25, 26). DR is in close linkage disequilibrium with DQ such that the DR4-DQ8 and DR3-DQ2 haplotypes confer the greatest risk for T1D development. Both the alpha and beta chains of DQ molecules are polymorphic, and have the ability to form mixed molecules in cis and trans. As an example, the alpha chain of DQ2 can pair with the beta chain of DQ8 to form DQ8-trans (DQA1*05:01-DQB1*03:02) when both DQ2 and DQ8 are in the genotype. DQ8-trans has an odds ratio of disease development for T1D at 35 (35 times more likely to develop diabetes compared to those without these alleles), compared to odds ratios of ~11 and ~4 for DQ8 and DQ2, respectively (27, 28). Interestingly, HLA-DQ6 (DQA1*01:02-DQB1*06:02) provides dominant protection for T1D development with an odds ratio of only 0.03 (29, 30). The stark dichotomy of risk between DQ molecules highlights the important role of antigen presentation to TCRs in T1D.
Table 1

Common HLA alleles associated with type 1 diabetes risk.

NameAlleleAssociated Autoimmune Diseases
HLA Class II
DQ8 DQA1*03:01-DQB1*03:02 Celiac disease, Addison’s disease
DQ2 DQA1*05:01-DQB1*02:01 Celiac disease, Addison’s disease
DQA1*02:01-DQB1*02:02 Celiac disease, Addison’s disease
DQ8-trans DQA1*05:01-DQB1*03:02 Celiac disease
DR4 DRB1*04:01 Rheumatoid Arthritis, Thyroid disease, Addison’s disease, Alopecia Areata
DRB1*04:02 Rheumatoid Arthritis (protective), Thyroid disease, Addison’s disease
DRB1*04:04 Rheumatoid Arthritis, Thyroid disease, Addison’s disease
DRB1*04:05 Rheumatoid Arthritis, Thyroid disease, Addison’s disease
DR3 DRB1*03:01 Systemic Lupus Erythematous (SLE), Neuromyelitis Optica (NMO), Myasthenia Gravis, Thyroid disease, Addison’s disease
HLA Class I
A2 A*02:01 Vitiligo
A24 A*24:02 unknown
B39 B*39:06 unknown
B18 B*18:01 unknown
B7 B*07:05 unknown
Common HLA alleles associated with type 1 diabetes risk.

Diversity of TCR Repertoires

Adults have approximately 108-1010 unique TCR clonotypes (15, 17, 18, 31). With an assumption that the TCR repertoire size may represent a capacity for responding to diverse antigens, the TCR repertoire diversity in the blood has been examined to determine whether it is associated with immune conditions. For example, having diverse TCR repertoires is associated with desirable responses to immune therapies in cancer (32–34). In T1D, it has been reported that TCR repertoires in peripheral blood of T1D patients are less diverse compared with those without T1D (35). Thus, there may be trends of TCR repertoire sizes that are preferred by a certain immune condition. However, it should be noted that the diversity of TCR repertoires cannot specify a certain disease.

Use of TCR Clonotypes as Surrogates to Quantify Antigen-Specific T Cells

TCR clonotypes determine antigen specificity, and therefore they can be utilized as a surrogate marker to evaluate the presence and prevalence of antigen-specific T cells in the blood. Frequencies of these antigen-specific TCR clonotypes can be quantified by high-throughput sequencing, which is expected to be more specific to individual diseases compared to surveying the broad TCR repertoire. Furthermore, once a panel of antigen-specific TCR clonotypes are determined, a single TCR sequencing assay allows for evaluating specificity to many antigens rather than needing to test specificity to each individual antigen. TCR sequencing has been done from different tissues in many disease states (36), including autoimmune disorders (37) and cancer (38–40). Remarkably, TCR sequencing has been shown to differentiate early-stage cancer patients from healthy individuals (41, 42). This strategy requires a list of TCR clonotypes beforehand that can be searched in blood samples, and such TCR clonotypes used as surrogate biomarkers need to satisfy three factors: (1) publicity (i.e. commonality and shared between individuals), (2) abundancy, and (3) disease specificity. Namely, T cells expressing the same or similar TCR clonotypes need to be commonly present in a number of people; frequency of such T cells in the blood of each person needs to be high enough for quantification; and presence or absence of such T cells needs to be associated with a disease state. In addition, with larger numbers of TCR clonotypes in a given panel, the more specific and sensitive an assay will become. Thus, identifying disease-specific TCR candidates is essential to establish a robust TCR sequencing assay that can discriminate a subset of individuals having a specific stage or feature of T1D such as those who have potential to respond to an interventional therapy. There are several strategies to identify disease-specific TCR clonotypes. Since a significant portion of disease-specific TCRs are likely to recognize islet antigens, TCR clonotypes expressed by islet antigen-specific T cells are reasonable candidates for TCR biomarkers. Such T cell sources include peripheral blood T cells responding to islet antigen stimulation or enriched by staining with fluorescence-conjugated multimers consisting of an islet-derived peptide and a particular HLA molecule (43–45). Alternatively, TCR clonotypes identified in the target organ (i.e. pancreas or pancreatic islets) or draining lymph nodes may be also disease-specific. In any of these T cell sources, specificity (i.e. potential contamination of non-disease associated T cells) as well as sensitivity (i.e. missing a portion of antigen-specific T cells) needs to be carefully considered. For example, T cell samples enriched by antigen stimulation may contain only a few clonotypes that readily proliferate in response to the stimulation or could be non-specific T cells that proliferate due to “bystander effect.” Likewise, T cells in the pancreas and pancreatic lymph nodes may not necessarily be islet-reactive or disease-specific (46). On the other hand, T cell populations enriched by multimer staining may contain only those having high affinity to bind peptide-MHC complexes, and TCRs weakly binding to peptide-MHC may be missed. This possibility is likely important for autoreactive TCRs since T cell responsiveness to self-antigens tends to be low compared to pathogen T cell responses. Nevertheless, identifying TCR clonotypes from samples enriched with antigen-specific T cells is indispensable to identify disease-specific TCR candidates. These TCR clonotypes should then be assessed for frequency in peripheral blood of individuals with different stages of T1D to determine the ultimate association with disease status. The next subsections will summarize features of TCR clonotypes specific to islet-specific autoantigens as well as those potentially associated with T1D pathogenesis.

Lessons From Islet-Specific TCRs in T1D Animal Models

Non-diabetic (NOD) mice spontaneously develop autoimmune diabetes and represent many features of human T1D including a T1D-susceptible MHC allele (I-Ag7), homologous to HLA-DQ8, the development of insulin autoantibodies prior to diabetes onset, and insulitis. A number of T cell clones reacting with islet tissues have been isolated from pancreatic islets and spleens of NOD mice in the past few decades and further characterized for antigen specificity as well as TCR clonotypes (47). In the 1990’s, Santamaria and colleagues discovered that a large portion of CD8 T cells infiltrating NOD islets share an identical Valpha segment (i.e. TRAV16) along with a specific junction motif (i.e. MRD or MRE) (48), and subsequently identified a peptide derived from islet-specific glucose-6-phosphatase catalytic subunit-related protein (IGRP) as an epitope targeted by these CD8 T cells (49). Likewise, CD4 T cell clones as well as T-hybridoma cells that are reactive to insulin B-chain peptides have been established from NOD islets by a number of investigators using different methods (50–56). The majority of these T cells expresses TCRs containing specific Valpha and Jalpha segment motifs, TRAV5D-4 or TRAV10 along with TRAJ53 or TRAJ42. When mice are forced to have only T cells expressing TCRs containing TRAV5D-4, approximately one percent of CD4 T cells becomes specific to an insulin B chain 9-23 peptide (57), and the mice are susceptible to develop anti-insulin autoimmunity (58, 59). Alanine scanning and crystal structure analyses identified several amino acid residues in the TRAV5D-4 and TRAV10 CDR1 and CDR2 regions that are crucial to interact with the insulin peptide-MHC complex (59, 60). Also, among insulin B chain-specific CD4 T cells, those particularly recognize an insulin B chain 12-20 peptide prefer to express TCR beta chains containing a negatively charged amino acid (i.e. aspartic acid [D] or glutamic acid [E]) in the junction region (56). This observation is consistent with a notion that the I-Ag7 T1D-susceptible MHC class II molecule, which has a positively charged patch in the surface area near the p9 pocket due to the lack of a negatively charged amino acid residue at the beta 57 position, engage TCRs having a negatively charged residue when p9 of peptides is not negatively charged (the position 20 of insulin B chain is glycine). Thus, these studies provide a molecular elucidation of how TCR motif selection occurs by interaction with a particular peptide-MHC complex. T1D-specific TCR repertoires in rat models have been extensively studied by the group of Mordes and Blankenhorn (61). Of note, diabetes-susceptible rat strains have a T1D risk MHC haplotype (RT1B/Du), which lacks a negatively charged amino acid residue at the beta chain 57 position and is homologous to HLA-DQ8 (61). In addition to the HLA gene locus, Iddm14, which contains the TCR beta chain genes, is a T1D-susceptible locus (62). The group identified a TCR Vbeta allele, Tcrb-V13S1A1, that is shared among T1D-susceptible rat strains but not with T1D-resistant ones (63), and demonstrated that genetic elimination of this allele or depletion of T cells expressing TCRs containing Vbeta13a (product of the Tcrb-V13S1A1 gene) abrogates diabetes development in T1D-susceptible rats (64–66). A series of these studies elegantly linked the genetic risk with a functional mechanism in which a particular TCR motif facilitates T1D development with a specific MHC molecule. In sum, these animal studies demonstrate the presence of preferred TCR motifs in both germline-encoded and rearranged regions to recognize particular epitope sequences, which can be reasonably explained by molecular interaction between the TCR – peptide – MHC molecule. From a view of TCR biomarker development, TCR motifs shared by antigen-specific or disease-susceptible T cells can be utilized to enrich and classify TCR clonotypes that are distinctive of T1D.

TCR Repertoires in the Pancreas of Humans

Emerging sequencing technologies and increasing availability of human samples, in particular pancreas and peripheral immune tissues isolated from organ donors having T1D, facilitate identification of islet antigen-specific or T1D-associated T cells and TCR clonotypes (7, 67–74). In the 1990’s, two groups in Spain and Japan separately analyzed TCR repertoires in the pancreas and demonstrated clonal expansion of T cells with particular Vgene segment usage in individual patients (75, 76). Importantly, the same group in Spain demonstrated that a clonally expanding TCR in islet and pancreas samples was detected in the blood of the same individual, indicating that islet-residing TCR clonotypes are detectable in peripheral blood samples (77). More recently, Brusko and colleagues further corroborated this concept by studying a larger number of individuals using a next generation sequencing technology that allows to analyze much higher numbers of T cells (72, 78). This high resolution analysis discovered that CD8 TCR clonotypes in the pancreas and draining lymph nodes are detected in peripheral blood more frequently than those expressed by CD4 T cells and provided important insights about the depth of TCR sequencing to achieve quantitative measurement. Another important concept is to consider commonality of TCR repertoires in the pancreas across patients. We recently determined thousands of TCR clonotypes expressed by T cells in the islets of organ donors with and without T1D (73, 74). Our analysis indicated clonal expansion in the pancreas of individual donors regardless of the disease, but also found that the frequency of TCR clonotypes shared between donors is limited. This low frequency of shared TCR clonotypes may be due to diverse HLA restrictions present in different individuals. Another reason could be the fact that T cells in the islets may not be necessarily islet-specific. Indeed, multiple studies analyzing islet T cell specificity found that over half of T cell clones and lines derived from the islets did not respond to preproinsulin and other known islet epitopes (46, 70, 71, 73, 74). However, it should be noted that collecting TCR clonotypes from a larger number of donors significantly increases the number of shared clonotypes and such large TCR repertoire information allows for identifying common motifs even when not sharing entire TCR sequences, which will be essential to precisely cluster TCRs recognizing the same epitope (see below regarding TCR clustering). Thus, continuing efforts to accumulate TCR sequence information from the target organ along with epitope identification is crucial to establish a sufficient list of TCR clonotypes that can be used for disease-associated TCR biomarkers.

Islet Antigen-Specific TCR Clonotypes in Humans

TCRs expressed by islet-reactive T cells may be another optimal source that can be used as clonotypes for T1D biomarkers, especially if they circulate in the peripheral blood. Such clonotypes could come from T cell clones, T cell lines, hybridomas, and transductant cells that have been confirmed to respond to islet antigens, cell subsets enriched by multimer staining, and those activated or proliferated by antigen stimulation. TCR clonotypes for which reactivity to epitopes has been confirmed at a single cell level would be the most reliable source. Here we summarize islet antigen-specific TCR clonotypes that were isolated from individuals having T1D ( ). To date, over a hundred TCR alpha and beta paired sequences specific to common islet epitopes have been reported by a number of investigators, and it is notable that the majority of these TCRs were identified in the past several years (7, 45, 70, 73, 74, 79–99). However, hundreds of disease-associated TCR clonotypes are far too small to cover T1D patients having heterogeneous antigen specificity. With rapidly evolving sequencing technologies, future efforts to identify islet epitope-specific TCR clonotypes is essential to develop TCR biomarkers for T1D. In addition to TCR clonotypes listed in , Bonifacio and colleagues reported hundreds of TCR sequences expressed by T cells that were stained with multimer composed of islet epitopes or those proliferated in response to islet antigens (44, 45). While it is necessary to carefully validate true reactivity to antigens, this type of analysis is an excellent resource to gain T1D-associated TCR clonotypes. Computational tools to decrease the “noise” (i.e. eliminating non-specific binding TCR clonotypes) may help to enrich truly antigen-specific clonotypes (100, 101). Further, these candidate TCR clonotypes could be validated for disease specificity using larger cohorts analyzed with whole blood TCR sequencing, and then clonotypes that were detected only in individuals having various stages of T1D could be assessed for functional reactivity ( ). Retro/lentiviral transduction systems, especially in a moderate to high throughput multiplex assay, will facilitate verifying reactivity to antigens (82, 98, 102, 103).
Table 2

T cell receptors specific to islet epitopes.

Clone/Sequence IDEpitopeEpitope sequenceHLA# TRAVTRAJCDR3aTRBVTRBJCDR3bSource of T cellsMethod to confirm reactivityReference
BRI4.13GAD65:555-567NFFRMVISNPAATDR4TRAV19TRAJ44CALSENRGGTASKLTFTRBV5-1TRBJ1-1CASSLVGGPSSEAFFPBMC CD4Clone/Transgenic cells(7981)
BRI164GAD65:555-567NFFRMVISNPAATDR4TRAV19TRAJ56CALSEEGGGANSKLTFTRBV5-1TRBJ1-6CASSLAGGANSPLHFPBMC CD4Clone/Transgenic cells(80, 82)
T1D2-1&2IGRP:305-324QLYHFLQIPTHEEHLFYVLSDR4TRAV29TRAJ40CAATRTSGTYKYIFTRBV6-6TRBJ2-3CASSPWGAGGTDTQYFPBMC CD4Clone/TCR transductant(81)
T1D4-3&4IGRP:305-324KWCANPDWIHIDTTPFAGLVDR4TRAV2TRAJ15CAVEDLNQAGTALIFTRBV5-1TRBJ2-1CASSLALGQGNQQFFPBMC CD4Clone/TCR transductant(81)
23.G8PPI:36-50VEALYLVCGERGFFYDR4TRAV39TRAJ56CAWRTGANSKLTFTRBV24-1TRBJ2-2CATGLAANTGELFFIslet CD4TCR transductant(83)
SD52.c1PPI:72-90PGAGSLQPLALEGSLQKRGDR4TRAV4TRAJ27CLVGDSLNTNAGKSTFTRBV27TRBJ1-5CASSWSSIGNQPQHFPBMC CD4Clone(82, 84)
95.A9-1PPI:87-101QKRGIVEQCCTSICSDR4.4TRAV9-2TRAJ18CALRTDRGSTLGRLYFTRBV11-2TRBJ1-6CASSLQSSYNSPLHFIslet CD4TCR transductant(83)
Mi.1Insulin A:1-15 (PPI: 90-104)GIVEQCCTSICSLYQDR4TRAV8-3TRAJ44CAVGALAGTASKLTFTRBV29-1TRBJ2-3CSVEATRADTQYFPLN CD4Clone(85)
Ba.14Insulin A:1-15 (PPI: 90-104)GIVEQCCTSICSLYQDR4TRAV39TRAJ33CAVVNMDSNYQLIWTRBV5-1TRBJ2-3CASSLATSGGGSDTQYFPLN CD4Clone(85)
Ba.11Insulin A:1-15 (PPI: 90-104)GIVEQCCTSICSLYQDR4TRAV22TRAV26-2 ## TRAJ52TRAJ47CADAGGTSYKLFCIPGSEEYGNKLVFTRBV5-1TRBJ2-3CASSLATSGGGSDTQYFPLN CD4Clone(85)
6.H11PPI:94-108QCCTSICSLYQLENYDR4.2TRAV26-1TRAJ13CIVRVYSGGYQKVTFTRBV30TRBJ2-3CAWSARLAGGPRTQYFIslet CD4TCR transductant(83)
SD32.5PPI:94-110QCCTSICSLYQLENYCNDR4TRAV26-1TRAJ23CIVRVSSAYYNQGGKLIFTRBV27TRBJ2-3CASSPRANTDTQYFPBMC CD4Clone(82, 84)
B3.3Proinsulin:52-62 (PPI:76-86)SLQPLALEGSLDR4TRAV17TRAJ54CATGPIQGAQKLVFTRBV6-5TRBJ1-1CASSYAWGRATEAFFPBMC CD4Clone(86)
K4.4/K6.4Proinsulin:54-63 (PPI:78-87)QPLALEGSLQDR4TRAV10TRAJ17CVVSAKAAGNKLTFTRBV7-8TRBJ2-7CASSLAGTDHYEQYFPBMC CD4Clone(86)
23.F7PPI:24-38AFVNQHLCGSHLVEADR1TRAV8-2TRAJ29CAVIASGNTPLVFTRBV19TRBJ2-3CASKGPGTVIRADTQYFIslet CD4TCR transductant(83)
55.B3PPI:37-51EALYLVCGERGFFYTDR9TRAV21TRAJ29CAVLPPTPLVFTRBV18TRBJ1-1CASSYPGTGGARTEAFFIslet CD4TCR transductant(83)
55.C10PPI:58-72AEDLQVGQVELGGGPDR53TRAV26-1TRAJ26CIVRSHGQNFVFTRBV20-1TRBJ2-7CSARPGTRNYEQYFIslet CD4TCR transductant(83)
Clone 5Insulin B:9-23 (PPI: 33-47)SHLVEALYLVCGERGDQ8TRAV21TRAJ6CAVKRTGGSYIPTFTRBV11-2TRBJ2-2CASSSFWGSDTGELFFPBMC CD4Clone/TCR transductant(82, 87, 88)
GSE.6H9Insulin B:9-23 (PPI: 33-47)SHLVEALYLVCGERGDQ8, DQ8-transTRAV26-1TRAJ40CIVRVDSGTYKYIFTRBV7-2TRBJ2-1CASSLTAGLASTYNEQFFIslet CD4TCR transductant(73, 83)
GSE.20D11Insulin B:9-23 (PPI: 33-47)SHLVEALYLVCGERGDQ8TRAV12-3TRAJ4CAILSGGYNKLIFTRBV2TRBJ2-5CASSAETQYFIslet CD4TCR transductant(73, 83)
T1D#3 C8Insulin B:11-23 (PPI: 35-47)R22E LVEALYLVCGEEGDQ8TRAV17TRAJ23CATDAGYNQGGKLIFTRBV5-1TBBJ1-3CASSAGNTIYFPBMC CD4Clone(82, 89)
T1D#3 C10Insulin B:11-23 (PPI: 35-47)R22E LVEALYLVCGEEGDQ8TRAV12-3TRAJ26CATAYGQNFVFTRBV4-1TRBJ2-2CASSRGGGNTGELFFPBMC CD4Clone(82, 89)
19.A4PPI:55-69RREAEDLQVGQVELGDQ8TRAV8-6TRAJ32CAVRETGATNKLIFTRBV20-1TRBJ2-7CSARPQGFSSYEQYFIslet CD4TCR transductant(83)
GSE.8E3PPI:72-87hEL:C-peptide (HIP11)PGAGSLQPLALEGSLQSLQPLALEAEDLQVDQ8, DQ8-transTRAV2TRAJ37CAVDGSGNTGKLIFTRBV4-1TRBJ2-7CASSQDLAGVREQYFIslet CD4TCR transductant(73, 83)
6.G4PPI:86-100LQKRGIVEQCCTSICDQ8, DQ8-transTRAV26-1TRAJ8CIVRVRNTGFQKLVFTRBV27TRBJ1-1CASSPGPGNTEAFFIslet CD4TCR transductant(83)
56.B1PPI:40-54YLVCGERGFFYTPKTDQ2TRAV13-1TRAJ40CAVLSPSGTYKYIFTRBV7-9TRBJ1-4CASSLMGNPHEKLFFIslet CD4TCR transductant(83)
53.A4-1PPI:72-87PGAGSLQPLALEGSLQDQ2TRAV39TRAJ33CAVDPMDSNYQLIWTRBV29-1TRBJ2-6CSVGTDPSGANVLTFIslet CD4TCR transductant(83)
23.G6PPI:29-43HLCGSHLVEALYLVCDP4TRAV9-2TRAJ6CALSISGGSYIPTFTRBV5-1TRBJ2-5CASSFRQGEQETQYFIslet CD4TCR transductant(83, 90)
A4.13Proinsulin:41-51 (PPI:65-75)QVELGGGPGAGDQ8TRAV6TRAJ36CALKYGANNLFFTRBV18TRBJ1-1CASSPTTGGDEAFFIslet CD4Clone(70)
A1.1Proinsulin:50-59 (PPI:74-83)AGSLQPLALEDQ8TRAV25TRAJ16CAGGFSDGQKLLFTRBV20-1TRBJ2-7CSARTEAYEQYFIslet CD4Clone(70)
A1.2Proinsulin:50-58 (PPI:74-82)AGSLQPLALDQ8TRAV20TRAJ58CAVIETSGSRLTFTRBV20-1TRBJ2-3CSARDQQRVDTQYFIslet CD4Clone(70)
A2.4Proinsulin:52-62 (PPI:76-86)SLQPLALEGSLDQ8-transTRAV19TRAJ49CALSRAGTGNQFYFTRBV5-1TRBJ2-4CASSLGLRGENIQYFIslet CD4Clone(70)
B3.1Proinsulin:48-59 (PPI:72-83)PGAGSLQPLALEDQ8TRAV12-1TRAJ9CVVKSTGGFKTIFTRBV20-1TRBJ2-5CSAGGLAGASQETQYFPBMC CD4Clone(86)
K3.2/K9.5Proinsulin:54-62 (PPI:78-86)QPLALEGSLDQ2TRAV3TRAJ31CAVRGDNNARLMFTRBV7-2TRBJ2-2CASSPIIWGTGELFFPBMC CD4Clone(86)
K6.2Proinsulin:49-58 (PPI:73-82)GAGSLQPLALDQ8-transTRAV8-2/8-4TRAJ11CAVTPKSGYSTLTFTRBV20-1TRBJ2-3CSARDLAIPDTQYFPBMC CD4Clone(86)
K9.6Proinsulin:41-51 (PPI:65-75)QVELGGGPGAGDQ8TRAV26-1TRAJ54CIVRVEIQGAQKLVFTRBV3-2TRBJ2-1CASSSPGTEYNEQFFPBMC CD4Clone(86)
D1.1/D1.4Proinsulin:34-43 (PPI:58-67)AEDLQVGQVEDQ8TRAV13-1TRAJ38CAARNAGNNRKLIWTRBV4-2TRBJ2-3CASSFRGLGGGTDTQYFPBMC CD4Clone(86)
T6.1Proinsulin:52-63 (PPI:76-87)SLQPLALEGSLQDQ2, DQ2-transFunctional alpha not detectedTRBV9TRBJ2-1CASSVDPGVYNEQFFPBMC CD4Clone(86)
T6.6Proinsulin:56-62 (PPI:80-86)LALEGSLDQ2TRAV35TRAJ28CAAALSGAGSYQLTFTRBV19TRBJ2-3CASRLDPSTDTQYFPBMC CD4Clone(86)
T17.1Proinsulin:56-62 (PPI:80-86)LALEGSLDQ2, DQ2-transTRAV35TRAJ28CAAALSGAGSYQLTFTRBV19TRBJ2-3CASRLDPSTDTQYFPBMC CD4Clone(86)
H3.3/H6.4Proinsulin:52-61 (PPI:76-85)SLQPLALEGSDQ8-transTRAV19TRAJ57CALSGRGSEKLVFTRBV5-1TRBJ2-7CASSTRTGQGGNEQYFPBMC CD4Clone(86)
H3.7/H7.4/H8.5Proinsulin:50-58 (PPI:74-82)AGSLQPLALDQ8TRAV12-1TRAJ20CVVNPTDDYKLSFTRBV20-1TRBJ2-3CSARSLASGGPDTQYFPBMC CD4Clone(86)
H11.5Proinsulin:42-51 (PPI:66-75)VELGGGPGAGDQ8TRAV26-1TRAJ36CIVRVVTGANNLFFTRBV5-1TRBJ2-5CASSLERETQYFPBMC CD4Clone(86)
E2.3Proinsulin:54-62 (PPI:78-86)QPLALEGSLDQ2TRAV30TRAJ37CGTEKPGSGNTGKLIFTRBV20-1TRBJ1-4CSARDGARGEKLFFPBMC CD4Clone(86)
E2.5Proinsulin:35-46 (PPI:59-70)EDLQVGQVELGGDQ8TRAV12-3TRAJ5CVISPPGRRALTFTRBV5-4TRBJ2-2CASSSGTSAGTGELFFPBMC CD4Clone(86)
A3.10hEGGG : IAPP2 (HIP6)GQVELGGGNAVEVLKDQ8TRAV38-1TRAJ54CAFFGQGAGKLVFTRBV5-1TRBJ2-3CASSLSASGGATDTQYFIslet CD4Clone(70, 91, 92)
A1.9Proinsulin:42-50 (PPI:66-74)hEGGG : IAPP2 (HIP6)VELGGGPGAGQVELGGGNAVEVLKDQ8TRAV20TRAJ7CAVQAGGNNRLAFTRBV5-1TRBJ1-2CASSLERDGYTFIslet CD4Clone(70, 92)
A6.15/A5.8Proinsulin:42-50 (PPI:66-74)hEGGG : IAPP2 (HIP6)VELGGGPGAGQVELGGGNAVEVLKDQ8TRAV26-1TRAJ21CIAIYNFNKFYFTRBV5-1TRBJ1-6CASSLEASSYNSPLHFIslet CD4Clone(70, 92)
A2.13Proinsulin:42-50 (PPI:66-74)hEGGG : IAPP2 (HIP6)VELGGGPGAGQVELGGGNAVEVLKDQ8TRAV26-1TRAJ39CIVSHNAGNMLTFTRBV5-1TRBJ2-5CASSLERETQYFIslet CD4Clone(70, 92)
A5.5Proinsulin:42-50 (PPI:66-74)hEGGG : IAPP2 (HIP6)VELGGGPGAGQVELGGGNAVEVLKDQ8TRAV26-1TRAJ54CIVRVEIQGAQKLVFTRBV5-1TRBJ2-5CASSLGPGQRETQYFIslet CD4Clone(70, 92)
A2.11hEGGG : IAPP2 (HIP6)GQVELGGGNAVEVLKNot reportedTRAV38-1TRAJ54CAFMGAGAQKLVFTRBV4-3TRBJ2-3CASSQILRGGPPDTQYFIslet CD4Clone(70, 91)
HIP14-G10/D3hEL : IAPP2 (HIP14)SLQPLALNAVEVLKDRTRAV16TRAV5 ## TRAJ37TRAJ40CARSHGSGNTGKLIFCAESIASGTYKYIFTRBV27TRBJ2-5CASSSGYGGETQYFPBMC CD4Clone(93)
E2bhEL:C-peptide (HIP11)SLQPLALEAEDLQVDQ2TRAV8-4TRAJ43CAVGATNNNDMRFTRBV5-4TRBJ2-1CASSPIGASGGNEQFFPBMC CD4Clone(94)
ET650-2Proinsulin:42-50 (PPI:66-74)hEGGG : IAPP2 (HIP6)HIPL11CVELGGGPGAGQVELGGGNAVEVLKGQVELGGGNAVEVCKDQ8TRAV26-1TRAJ39CIVRVGYNAGNMLTFTRBV20-1TRBJ1-5CSAIAGPNQPQHFPBMC CD4Clone(92)
ET650-4Proinsulin:42-50 (PPI:66-74)hEGGG : IAPP2 (HIP6)HIPL11CVELGGGPGAGQVELGGGNAVEVLKGQVELGGGNAVEVCKDQ8TRAV26-1TRAJ42CIVRVAIEGSQGNLIFTRBV5-1TRBJ1-3CASSLRRGDTIYFPBMC CD4Clone(92)
ET650-5hEGGG : IAPP2 (HIP6)HIPL11CGQVELGGGNAVEVLKGQVELGGGNAVEVCKDQ8TRAV26-1TRAJ9CIVRLQSGGFKTIFTRBV20-1TRBJ1-2CSAYSPGDRDFSNYGYTFPBMC CD4Clone(92)
ET672-1Proinsulin:42-50 (PPI:66-74)hEGGG : IAPP2 (HIP6)HIPL11CVELGGGPGAGQVELGGGNAVEVLKGQVELGGGNAVEVCKDQ8TRAV12-2TRAJ48CAVNHGNEKLTFTRBV18TRBJ1-1CASSPWEGRMDTEAFFPBMC CD4Clone(92)
1E6PPI:15-24ALWGPDPAAAA*02:01TRAV12-3TRAJ12CAMRGDSSYKLIFTRBV12-4TRBJ2-4CASSLWEKLAKNIQYFPBMC CD8Clone(82, 95)
1D5/1D10/2B3/4C6/3E7PPI:3-11LWMRLLPLLA*24:02TRAV5TRAJ37CAEPSGNTGKLIFTRBV7–9TRBJ2-7CASSLHHEQYFPBMC CD8Clone(96)
Clone 7IGRP:265-273VLFGLGFAIA*02:01TRAV41TRAJ48CAVTSNFGNEKLTFTRBV6-2/6-3TRBJ2-7CASSSRFVGEGLFRYGYEQYFPBMC CD8Clone/Transgenic cells(97, 98)
Clone 32IGRP:265-273VLFGLGFAIA*02:01TRAV12-1TRAJ48CVVNILSNFGNEKLTFTRBV20-1TRBJ2-1CSASRQGWVNEQFFPBMC CD8Clone/Transgenic cells(97, 98)
Clone 16/17IGRP:265-273VLFGLGFAIA*02:01TRAV25TRAJ53CAGLGDSGGSNYKLTFTRBV3-1TRBJ2-4CASSQDRWDVMSKNIQYFPBMC CD8Clone(45)
Clone 22/27IGRP:265-273VLFGLGFAIA*02:01TRAV29/DV5TRAJ53CAASGGSNYKLTFTRBV10-3TRBJ1-2CAISDRFMREGMTYGYTFPBMC CD8Clone(45)
Clone#1INS-DRiP:1-9MLYQHLLPLA*02:01TRAV12-2TRAJ34CAVNKTDKLIFTRBV6-1TRBJ1-2CASSVTGNGYTFPBMC CD8Clone(99)
Clone#2INS-DRiP:1-9MLYQHLLPLA*02:01TRAV10TRAJ8CVVNMNTGFQKLVFTRBV12-3/12-4TRBJ2-2CASSPPQGGNTGELFFPBMC CD8Clone(99)
1.C1INS-DRiP:1-9MLYQHLLPLA*02:01TRAV12-1TRAJ39CGENNAGNMLTFTRBV27TRBJ2-5CASSLQPPGTSTETQYFIslet CD8TCR transductant(74)
96.A9INS-DRiP:1-9MLYQHLLPLB*08:01TRAV12-2TRAJ39CAVNVYNAGNMLTFTRBV30TRBJ1-1CAWSVRGGSYMNTEAFFIslet CD8TCR transductant(74)
D222D Clones 2ZNT8:186-194VAANIVLTVA*02:01TRAV17TRAJ36CAVTGANNLFFTRBV19TRBJ2-2CASSIEGPTGELFPBMC CD8Clone(7)
D010R clone 1E2ZNT8:186-194VAANIVLTVA*02:01TRAV35TRAJ36CAGTRNNLFFTRBV19TRBJ2-7CASGGSSYEQYFPBMC CD8Clone(7)
D010R clone 1D3ZNT8:186-194VAANIVLTVA*02:01TRAV25TRAJ20CAGGSNDYKLSFTRBV6-1TRBJ2-3CASSSVGVDTQYFPBMC CD8Clone(7)
D267T 33B8ZNT8:186-194VAANIVLTVA*02:01TRAV19TRAJ23CALSEATYNQGGKLIFTRBV19TRBJ1-3CASSIFPNPGNTIYFPBMC CD8Clone(7)
D349D 178B9ZNT8:186-194VAANIVLTVA*02:01TRAV14/DV4TRAJ9CAMREGLTGGFKTIFTRBV11-2TRBJ1-1CASSPFLTGSNTEAFFPBMC CD8Clone(7)
D351D 188D3ZNT8:186-194VAANIVLTVA*02:01TRAV19TRAJ20CALSPAETSDYKLSFTRBV19TRBJ1-1CASTLTGFAEAFFPBMC CD8Clone(7)
23.F9PPI:1-11MALWMRLLPLLC*03:04TRAV12-3TRAJ48CAMSALGNFGNEKLTFTRBV19TRBJ2-1CASSIAGGNEQFFIslet CD8TCR transductant(74)
19.A1PPI:1-11MALWMRLLPLLC*03:04TRAV8-4TRAJ11CAVSDQGSGYSTLTFTRBV28TRBJ1-5CASSWTANQPQHFIslet CD8TCR transductant(74)
20.E5PPI:1-11MALWMRLLPLLC*03:04TRAV14/DV4TRAJ52CAMSNAGGTSYGKLTFTRBV28TRBJ1-4CASSLARYNEKLFFIslet CD8TCR transductant(74)
20.F1PPI:1-11MALWMRLLPLLC*03:04TRAV14/DV4TRAJ43CAMRLHNNNDMRFTRBV28TRBJ1-5CASIASRYNQPQHFIslet CD8TCR transductant(74)
22.A10PPI:1-11MALWMRLLPLLC*03:04TRAV8-1TRAJ13CAVNAAGGYQKVTFTRBV28TRBJ2-1CASIPDRYNEQFFIslet CD8TCR transductant(74)
1.C8PPI:1-11/2-12/2-10MALWMRLLPLLALWMRLLPLLAALWMRLLPLA*02:01TRAV24TRAJ58CAFKRETSGSRLTFTRBV13TRBJ2-1CASSTRLAGDEQFFIslet CD8TCR transductant(74)
1.F3PPI:2-12ALWMRLLPLLAA*02:01TRAV39TRAJ39CAVENAGNMLTFTRBV10-2TRBJ2-1CASWTVSYNEQFFIslet CD8TCR transductant(74)
96.F5PPI:3-11LWMRLLPLLA*02:01TRAV8-6TRAJ48CAVSDISNFGNEKLTFTRBV9TRBJ2-3CASSVVGLGTDTQYFIslet CD8TCR transductant(74)
23.H5PPI:3-13LWMRLLPLLALA*02:01TRAV38-2/DV8TRAJ22CAYRSPARQLTFTRBV6-1TRBJ2-7CASSEGWGVPSYEQYFIslet CD8TCR transductant(74)
1.B10-1PPI:15-23ALWGPDPAAA*02:01TRAV8-3TRAJ33CAVVADSNYQLIWTRBV4-2/4-3TRBJ2-2CASSQTKGTGELFFIslet CD8TCR transductant(74)
1.F1PPI:15-24ALWGPDPAAAA*02:01TRAV39TRAJ41CAVSNSGYALNFTRBV29-1TRBJ2-5CSVFHRGETQYFIslet CD8TCR transductant(74)
23.C12PPI:15-24/15-25ALWGPDPAAAALWGPDPAAAFA*02:01TRAV41TRAJ42CAVSGGSQGNLIFTRBV28TRBJ1-2CASSPPTGWGGYTFIslet CD8TCR transductant(74)
93.D1PPI:15-25ALWGPDPAAAFA*02:01TRAV5TRAJ8CAVTKDTGFQKLVFTRBV20-1TRBJ2-1CSARDHFGGSGYEQFFIslet CD8TCR transductant(74)
10.C6-1PPI:23-32AAFVNQHLCGC*12:03TRAV19TRAJ39CALSGALNNAGNMLTFTRBV27TRBJ2-5CASSLFGYRQETQYFIslet CD8TCR transductant(74)
28.D3PPI:31-41/34-41CGSHLVEALYLHLVEALYLA*02:01TRAV26-2TRAJ26CILTDNYGQNFVFTRBV27TRBJ1-1CASSLIGLNTEAFFIslet CD8TCR transductant(74)
28.E6PPI:46-54/47-54RGFFYTPKTGFFYTPKTA*29:02TRAV19TRAJ28CALSEAGAGSYQLTFTRBV2TRBJ2-5CASSPSGTSSQETQYFIslet CD8TCR transductant(74)
20.G1PPI:69-77/69-79GGGPGAGSLGGGPGAGSLQPC*03:04TRAV1-2TRAJ8CAVRMNTGFQKLVFTRBV9TRBJ2-1CASSVGMDPGLGYNEQFFIslet CD8TCR transductant(74)
96.B4PPI:91-99IVEQCCTSIC*05:01TRAV12-2TRAJ31CAVNNARLMFTRBV6-5TRBJ2-1CASRPTSGGYNEQFFIslet CD8TCR transductant(74)
86.C1PPI:91-100/92-100/92-102IVEQCCTSICVEQCCTSICVEQCCTSICSLB*41:02TRAV19TRAJ16CALSEAGFSDGQKLLFTRBV19TRBJ2-1CASSIQFSYNEQFFIslet CD8TCR transductant(74)
84.D9PPI:91-100/92-100/92-102IVEQCCTSICVEQCCTSICVEQCCTSICSLB*41:02TRAV29/DV5TRAJ43CAASNSNDMRFTRBV7-9TRBJ2-1CASSLAQREQFFIslet CD8TCR transductant(74)
28.E11PPI:91-100IVEQCCTSICB*18:01TRAV12-2TRAJ49CAVSMNTGNQFYFTRBV29-1TRBJ2-1CSVQVYNEQFFIslet CD8TCR transductant(74)
1.E9-1PPI:92-99VEQCCTSIB*50:01TRAV12-2TRAJ34CAVNIRYNTDKLIFTRBV6-2/6-3TRBJ1-5CASSSIQGSGSGQPQHFIslet CD8TCR transductant(74)
86.G3-2PPI:94-102QCCTSICSLB*35:01TRAV8-6TRAJ33CAVSDGYQLIWTRBV6-1TRBJ2-7CASSGREAPYEQYFIslet CD8TCR transductant(74)
54.F1PPI:96-103CTSICSLYA*01:01TRAV3TRAJ26CAVPDNYGQNFVFTRBV7-2TRBJ2-2CASSLVVELFFIslet CD8TCR transductant(74)

#HLA class II alleles: DR4 (DRB1*04:01); DR4.4 (DRB1*04:04); DR4.2 (DRB1*04:02); DR1 (DRB1*01:01); DR9 (DRB1*09:01); DR53 (DRB4*01:01); DQ8 (DQA1*03:01-DQB1*03:02); DQ8-trans (DQA1*05:01-DQB1*03:02); DQ2 (DQA1*05:01-DQB1*02:01/02:02), DQ2-trans (DQA1*03:01-DQB1*02:01); DP4 (DPA1*01:03-DPB1*04:01).

##Two in-frame alpha chains detected. Functional alpha not determined.

Figure 1

Strategy to determine disease-specific TCR clonotypes. Red and gray circles represent true and false disease-specific TCR clonotypes, respectively. Green circles are true disease-specific clonotypes determined by clustering with known disease-specific TCR clonotypes. (A) TCRs detected in the islets, pancreata, and pancreatic lymph nodes, in particular those for which antigen specificity has been determined as well as those that are clustered with known disease-specific TCRs, can be the initial source for disease-specific TCR candidates. (B) TCRs detected from peripheral blood T cells enriched by antigen stimulation or peptide-MHC-conjugated multimers are also an initial source. Antigen-specific algorithms can enrich TCR clonotypes that are truly specific to antigens. (C) Candidate TCR clonotypes may be assessed for specificity to islet tissues, proteins, and peptides. (D) Using classifying algorithms, candidate TCR clonotypes are assessed for frequency in the blood of individuals with and without T1D to determine disease specificity. Simultaneously, clustering algorithms can select additional clonotypes that are clustered with known disease-specific TCR clonotypes. (E) TCR clonotypes selected by classifying and clustering algorithms are used for machine learning of antigen-specific algorithms to further determine true disease-specificity.

T cell receptors specific to islet epitopes. #HLA class II alleles: DR4 (DRB1*04:01); DR4.4 (DRB1*04:04); DR4.2 (DRB1*04:02); DR1 (DRB1*01:01); DR9 (DRB1*09:01); DR53 (DRB4*01:01); DQ8 (DQA1*03:01-DQB1*03:02); DQ8-trans (DQA1*05:01-DQB1*03:02); DQ2 (DQA1*05:01-DQB1*02:01/02:02), DQ2-trans (DQA1*03:01-DQB1*02:01); DP4 (DPA1*01:03-DPB1*04:01). ##Two in-frame alpha chains detected. Functional alpha not determined. Strategy to determine disease-specific TCR clonotypes. Red and gray circles represent true and false disease-specific TCR clonotypes, respectively. Green circles are true disease-specific clonotypes determined by clustering with known disease-specific TCR clonotypes. (A) TCRs detected in the islets, pancreata, and pancreatic lymph nodes, in particular those for which antigen specificity has been determined as well as those that are clustered with known disease-specific TCRs, can be the initial source for disease-specific TCR candidates. (B) TCRs detected from peripheral blood T cells enriched by antigen stimulation or peptide-MHC-conjugated multimers are also an initial source. Antigen-specific algorithms can enrich TCR clonotypes that are truly specific to antigens. (C) Candidate TCR clonotypes may be assessed for specificity to islet tissues, proteins, and peptides. (D) Using classifying algorithms, candidate TCR clonotypes are assessed for frequency in the blood of individuals with and without T1D to determine disease specificity. Simultaneously, clustering algorithms can select additional clonotypes that are clustered with known disease-specific TCR clonotypes. (E) TCR clonotypes selected by classifying and clustering algorithms are used for machine learning of antigen-specific algorithms to further determine true disease-specificity.

Identification of Disease-Specific TCR Clonotypes Using Big Data

Big data analysis, which seeks to classify TCR repertoires in a specific condition using a large number of TCR samples, is an emerging strategy to identify disease-associated TCR clonotypes. A major advantage of this approach is the capability to identify disease-associated TCR clonotypes without knowing antigen specificity, thereby allowing one to include TCRs that are potentially disease-associated but not islet-specific and also those having low affinity to antigens. Indeed, specificities of large proportions of T cells in the islets are unknown (46, 70–74). Virus infections such as enterovirus and coxsackie B virus (CVB) are suggested to be involved in T1D development (104–106), and TCRs specific to these viruses could be identified by big data analysis by comparing TCR repertoires of individuals having or not having different stages of T1D. Although it has been demonstrated in infectious diseases that big data analyses can identify pathogen-specific TCR clonotypes, it has not yet been successful at identifying T1D-associated TCR clonotypes using PBMC samples from individuals with or without different stages of T1D. This could be explained by several possibilities: (1) the frequency of T1D-associated T cells may be lower than that of pathogen-specific T cells; (2) antigens involved in T1D pathogenesis, especially those at different stages of T1D, may be more heterogeneous than those in infectious diseases; (3) autoreactive TCRs could be more private (i.e. not common between patients) than those of conventional T cells; and (4) sample sizes studied to date have not been large enough. However, having large TCR data sets produced by next generation sequencing will enable machine learning algorithms to cluster and classify TCR clonotypes. Using these newly developed techniques, even infrequent disease-specific TCRs having less publicity (i.e. commonality) between people may be identified from relatively small numbers of samples. Indeed, some computational TCR classifying methods are now capable of identifying cancer patients responding to immune checkpoint inhibitors (40), and also early stages of cancer can be differentiated from healthy individuals using this type of technique (107, 108). In the next section, we will discuss how to take advantages of the latest TCR clustering/classifying techniques for T1D TCR biomarkers.

Clustering and Classification of TCR Clonotypes

TCR clonotypes recognizing the same peptide-MHC complex often share similar motifs and features. For example, influenza-specific TCRs prefer to use TRAV38-1/TRAJ52/TRBV19/TRBJ1-2 (109–111), and melanoma (MART-1)-specific TCRs often contain an alpha chain with TRAV12-2 (112). Likewise, several features common for islet antigen-specific TCRs have been reported. We discovered that insulin B-chain-specific TCRs tend to use TRAV38-1/38-2 and other Valpha segments having similar motifs in the CDR1 and CDR2 regions (113). Also, it has been shown that a specific motif “SGGSNYKLTF” is contained in the CDR3 region of alpha chains specific to an IGRP peptide (45). More recently, crystal structure analysis of TCRs specific to a hybrid insulin peptide composed of proinsulin and islet amyloid polypeptide (IAPP) demonstrated that motifs in the TRBV5-1 segment commonly interact with amino acid residues in IAPP (92). Our work also indicates that T cell responses to hybrid insulin peptides precede clinical T1D onset (114), making these TCR clonotypes excellent candidates for biomarkers. Thus, autoreactive TCRs share commonalities and similarities, which provide clues to cluster TCRs and stratify those specific to a certain condition. A number of algorithms to cluster or classify TCR clonotypes have been developed. Each algorithm has advantages and disadvantages as reviewed by others (115, 116), but in respect to TCR biomarker development for T1D, the algorithms can be divided to two groups. First, those that clusters TCRs by assessing similarities of TCR sequences with each other in datasets. Second, those that seek to classify TCRs by identifying similar to known antigen-specific or disease-specific TCR clonotypes. The former algorithms such as TCRdist (111), GLIPH/GLIPH2 (117, 118), ClusTCR (119), and GIANA (108) do not need information about T1D-specific epitopes and TCR sequences beforehand, and thus can be used to predict disease-specific TCR clonotypes that are specifically detected in T1D patients but not in non-diabetic subjects. On the other hand, machine learning-based algorithms that assess similarities to known antigen-specific TCR datasets to predict epitopes, such as DeepTCR (101), DeepCAT (107), TCRmatch (120), and TCRAI (100) need prior information about disease-specific TCR sequences. These algorithms show excellent performance when classifying TCRs specific to the same epitopes that were used to develop the machine learning algorithm but not for those having different specificities. Therefore, large sets of disease-specific TCR sequence information for machine training are necessary to achieve high specificity and sensitivity. Typically these types of algorithms show better performance to detect antigen-specific TCR clonotypes than the clustering-based algorithms, thereby being useful to validate TCR clonotypes once epitopes or disease-specificity are determined. Alternatively, they can be also used to ‘clean up’ (i.e. eliminate non-specific TCR clonotypes) TCR datasets that are obtained from multimer-stained T cells or those activated by antigen stimulation ( ). In any case, it is essential to prepare TCR datasets from a large number of individuals with and without T1D at multiple time points to elicit the best performance by machine learning and clustering algorithms. Typically, diverse datasets rather than large data but from a limited number of samples improve learning efficiency (100). In addition, it is also important to prepare accurate TCR clonotype information to differentiate T1D patients from healthy subjects. There are now several TCR databases available, which accumulate and curate information about TCR sequences along with target peptide-MHC complexes, such as VDJbase (121, 122), IEDB (123), VDJdb (124), iReceptor (125), and McPAS-TCR (126). While these are incredibly useful resources, a proportion of islet-specific clonotypes is still very small, accounting for only ~100 out of tens of thousands of clonotypes, the majority of which are specific to viruses and tumor antigens. Assuming that self-reactive TCR clonotypes are more heterogeneous and rarer compared to pathogen-specific ones, there is a need for higher numbers of clonotypes specific to T1D. Thus, identifying a large set of accurate disease-specific TCR clonotypes will be a key component to achieve successful big data analysis, which will ultimately lead us to establish TCR biomarkers in T1D ( ).

Perspective

It is still controversial whether T1D patients have distinct islet antigen-specific T cell subsets in the blood compared to healthy individuals. Even in the pancreas, non-diabetic organ donors have preproinsulin-specific T cells in the exocrine compartment, but such antigen-specific T cells accumulate into the islets over the course of T1D progression (127). In the islets, we recently demonstrated that only T1D donors have CD8 T cells highly reactive to preproinsulin (74). Mallone and colleagues also reported that pancreata of T1D donors have a higher number of zinc transporter-8-specific T cells than non-diabetic controls (7). Thus, multiple studies demonstrate that islets of T1D individuals have distinct T cell repertoires from those without diabetes. However, a number of studies indicate that healthy individuals have islet-antigen specific T cells in the blood (7, 113, 128–131), and depending on cell subsets examined, some studies including those looking into pathogenic T cells show that T1D patients have higher numbers of islet-specific T cells, whereas others do not detect differential islet-specific T cells in T1D patients. This controversy could be explained by either (1) detectable numbers of pathogenic T cells in the islets do not leak into the peripheral blood ( ); or (2) pathogenic T cells in the islets do indeed circulate, but because there are already a number of islet-specific (but not harmful) T cells in the circulation, the total numbers of islet-specific T cells (i.e. pathogenic T cells leaked from the islets plus non-pathogenic T cells) are not differentiated enough in the blood of T1D patients from healthy individuals ( ). Given evidence that a portion of T cell repertoires are shared between pancreas, pancreatic lymph nodes, and peripheral blood cells (72), and that TCR repertoires in the islets of T1D organ donors are clonally distinct from those of non-diabetic donors (74), if the latter hypothesis ( ) is correct, islet-derived TCR sequences will be a powerful marker to discriminate pathogenic from physiological T cells, thereby capable of stratifying individuals with active insulitis prior to clinical T1D onset.
Figure 2

Proposed models of islet-specific T cell detection in the blood. While pancreatic islets contain a certain amount of T cells regardless of disease status (gray bars), only islets of individuals with T1D contain T cells that are highly reactive to islet antigens (red bar). Model (A) Leak of T cells from the islets to peripheral blood is limited. T cell repertoires specific to islet antigens are not different between individuals with and without T1D. Model (B) There are substantial amounts of islet-specific but not disease-specific T cells in the blood regardless of disease status (gray bars). T cells in the islets do circulate in the blood (red bar), but the total numbers of islet-specific T cells are not different between individuals with and without T1D. Enumerating only T cells derived from the islets can identify individuals having T1D. TCR clonotypes are a distinct property to identify islet-derived T cells. *** significantly different. ns, not significant.

Proposed models of islet-specific T cell detection in the blood. While pancreatic islets contain a certain amount of T cells regardless of disease status (gray bars), only islets of individuals with T1D contain T cells that are highly reactive to islet antigens (red bar). Model (A) Leak of T cells from the islets to peripheral blood is limited. T cell repertoires specific to islet antigens are not different between individuals with and without T1D. Model (B) There are substantial amounts of islet-specific but not disease-specific T cells in the blood regardless of disease status (gray bars). T cells in the islets do circulate in the blood (red bar), but the total numbers of islet-specific T cells are not different between individuals with and without T1D. Enumerating only T cells derived from the islets can identify individuals having T1D. TCR clonotypes are a distinct property to identify islet-derived T cells. *** significantly different. ns, not significant. To develop practical TCR biomarkers in T1D, a number of obstacles need to be overcome, some of which may be unique to autoimmune diseases. These challenges can be considered from the view of (1) publicity, (2) abundancy, and (3) disease-specificity. Publicity It will be important to understand the frequencies of public vs private TCR clonotypes that are specific to the T1D disease state, and these likely fluctuate over time during T1D development. Given the genetic risk associated with HLA class II genes, heterogeneity provided by HLA diversity could be smaller than other diseases for TCR clonotypes expressed by CD4 T cells. However, autoreactive T cells, which often bind to peptide-MHC complexes with low affinity, may have a larger TCR repertoire than conventional anti-pathogen T cells, resulting in less commonality. Therefore, frequency of public T1D-specific TCR clonotypes may be low. Strategies that compare TCR repertoires in each individual such as pre and post treatment (40) do not need to consider publicity of clonotypes, and therefore may be more easily applicable to T1D immune intervention studies. Abundancy Theoretically, 1015-1016 diverse TCR clonotypes can be assembled (12–14); however, a practical TCR repertoire size is estimated to be about 108-1010 per person (15, 17, 18). This indicates that the frequency of target clonotypes is extremely low. However, there is evidence that identical clonotypes are persistently detected from the same individuals over time (44, 81, 93, 132). We believe quantitative resolution of TCRs will need to be increased. This could be achieved by enriching samples before sequencing (e.g. beads enrichment by antigen-specific multimers). Another very attractive approach is to target sequencing to TCRs containing a preferred Vgene segment of interest, thus greatly enhancing the depth of sequencing by analyzing clonotypes that can be obtained for a specific V allele. Blood sample volume needed to quantitatively evaluate frequency of disease-associated TCR clonotypes is another important consideration, which will need to be addressed given that the T1D disease process does begin in young children. Disease-Specificity Identification of disease-specific TCR clonotypes is an essential component to develop robust T1D TCR biomarkers. A larger number of TCR clonotypes with higher specificity to the disease that are in place will allow for more sensitive and specific assays. Therefore, the key is how to select such truly disease-specific TCR clonotypes. As illustrated in , both accumulation of actual TCR datasets produced from individuals with and without T1D and computational big data analysis will facilitate the development of biomarkers. While the majority of TCR big data analysis currently uses only CDR3-beta sequences, it has been demonstrated that inclusion of entire sequence information such as V and J segments, in particular CDR1 and CDR2 sequences, increases accuracy of classifying TCR clonotypes (100, 120). While the number of T1D-specific clonotypes that have been determined so far is low, evolutions in both TCR sequencing technologies and computational analysis strategies will dramatically impact this effort. In conclusion, the antigen receptor on disease specific T cells holds promise for a non-cell based biomarker of not only the presence of T1D but disease activity as well. Efforts to define the TCR repertoire within the human pancreas of T1D and non-T1D organ donors is underway with a need to define the antigen specificity and HLA restriction of these identified clonotypes. Those clonotypes that are shared between individuals with T1D, frequent, and circulate from the pancreas and pancreatic lymph nodes to the peripheral blood are prime candidates for deep sequencing and clustering of TCRs using developed computational analyses.

Author Contributions

MN and AM wrote and edited the manuscript. All authors contributed to the article and approved the submitted version.

Funding

This work was supported by the National Institutes of Diabetes and Digestive and Kidney Diseases (R01DK099317, R01DK032083, R01DK108868, DP3DK110845, P30DK116073, Juvenile Diabetes Research Foundation (2018-480-S-B, 2020-911-A-N, 2018-557-Q-R), the Leona M. & Harry B. Helmsley Charitable Trust (2103-05093), and the Culshaw Family Junior Investigator Award.

Conflict of Interest

The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.

Publisher’s Note

All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article, or claim that may be made by its manufacturer, is not guaranteed or endorsed by the publisher.
  132 in total

1.  Proinsulin-specific, HLA-DQ8, and HLA-DQ8-transdimer-restricted CD4+ T cells infiltrate islets in type 1 diabetes.

Authors:  Vimukthi Pathiraja; Janine P Kuehlich; Peter D Campbell; Balasubramanian Krishnamurthy; Thomas Loudovaris; P Toby H Coates; Thomas C Brodnicki; Philip J O'Connell; Katherine Kedzierska; Christine Rodda; Philip Bergman; Erin Hill; Anthony W Purcell; Nadine L Dudek; Helen E Thomas; Thomas W H Kay; Stuart I Mannering
Journal:  Diabetes       Date:  2014-08-25       Impact factor: 9.461

Review 2.  T-cell antigen receptor genes and T-cell recognition.

Authors:  M M Davis; P J Bjorkman
Journal:  Nature       Date:  1988-08-04       Impact factor: 49.962

3.  Islet-Derived CD4 T Cells Targeting Proinsulin in Human Autoimmune Diabetes.

Authors:  Aaron W Michels; Laurie G Landry; Kristen A McDaniel; Liping Yu; Martha Campbell-Thompson; William W Kwok; Kenneth L Jones; Peter A Gottlieb; John W Kappler; Qizhi Tang; Bart O Roep; Mark A Atkinson; Clayton E Mathews; Maki Nakayama
Journal:  Diabetes       Date:  2016-12-05       Impact factor: 9.461

4.  A polymorphism of the rat T-cell receptor beta-chain variable gene 13 (BV13S1) correlates with the frequency of BV13S1-positive CD4 cells.

Authors:  M Stienekemeier; K Hofmann; R Gold; T Herrmann
Journal:  Immunogenetics       Date:  2000-04       Impact factor: 2.846

5.  ClusTCR: a Python interface for rapid clustering of large sets of CDR3 sequences with unknown antigen specificity.

Authors:  Sebastiaan Valkiers; Max Van Houcke; Kris Laukens; Pieter Meysman
Journal:  Bioinformatics       Date:  2021-06-16       Impact factor: 6.937

Review 6.  Application of T cell receptor (TCR) repertoire analysis for the advancement of cancer immunotherapy.

Authors:  Kroopa Joshi; Martina Milighetti; Benjamin M Chain
Journal:  Curr Opin Immunol       Date:  2021-08-25       Impact factor: 7.486

7.  Hybrid Insulin Peptides Are Autoantigens in Type 1 Diabetes.

Authors:  Rocky L Baker; Marynette Rihanek; Anita C Hohenstein; Maki Nakayama; Aaron Michels; Peter A Gottlieb; Kathryn Haskins; Thomas Delong
Journal:  Diabetes       Date:  2019-06-07       Impact factor: 9.461

8.  Additive and interaction effects at three amino acid positions in HLA-DQ and HLA-DR molecules drive type 1 diabetes risk.

Authors:  Xinli Hu; Aaron J Deutsch; Tobias L Lenz; Suna Onengut-Gumuscu; Buhm Han; Wei-Min Chen; Joanna M M Howson; John A Todd; Paul I W de Bakker; Stephen S Rich; Soumya Raychaudhuri
Journal:  Nat Genet       Date:  2015-07-13       Impact factor: 38.330

9.  Identification of the beta cell antigen targeted by a prevalent population of pathogenic CD8+ T cells in autoimmune diabetes.

Authors:  Scott M Lieberman; Anne M Evans; Bingye Han; Toshiyuki Takaki; Yuliya Vinnitskaya; Jennifer A Caldwell; David V Serreze; Jeffrey Shabanowitz; Donald F Hunt; Stanley G Nathenson; Pere Santamaria; Teresa P DiLorenzo
Journal:  Proc Natl Acad Sci U S A       Date:  2003-06-18       Impact factor: 12.779

Review 10.  High-throughput and single-cell T cell receptor sequencing technologies.

Authors:  Joy A Pai; Ansuman T Satpathy
Journal:  Nat Methods       Date:  2021-07-19       Impact factor: 47.990

View more
  2 in total

1.  Enhanced T Cell Glucose Uptake Is Associated With Progression of Beta-Cell Function in Type 1 Diabetes.

Authors:  Rong Tang; Ting Zhong; Li Fan; Yuting Xie; Juan Li; Xia Li
Journal:  Front Immunol       Date:  2022-05-23       Impact factor: 8.786

2.  Immunological balance between Treg and Th17 lymphocytes as a key element of type 1 diabetes progression in children.

Authors:  Aleksandra Starosz; Milena Jamiołkowska-Sztabkowska; Barbara Głowińska-Olszewska; Marcin Moniuszko; Artur Bossowski; Kamil Grubczak
Journal:  Front Immunol       Date:  2022-08-24       Impact factor: 8.786

  2 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.