| Literature DB >> 29163767 |
Yingyun Fu1,2, Yazhen Li1, Lan Xu1, Shengguo Liu1, Minlian Wang2, Lu Xiao1, Song Liu3, Yong Dai3.
Abstract
Sarcoidosis is a systemic granulomatous disorder highly related with immune response. The diversity and stability of the immune system could be measured by hypervariable complementarity-determining region 3 (CDR3) segments of the T cell receptor (TCR). Here we used a combination of multiplex PCR and next-generation sequencing to conduct a good quality analysis of the T-cell receptor BV complementarity-determining region 3 (TCR BV CDR3) gene in peripheral blood mononuclear cells (PBMCs) from 7 sarcoidosis patients and lung sarcoidosis tissue from 6 patients. The length distribution of CDR3 sequences identified a significant difference among CD4+, CD8+ and tissue samples. The analysis of Gini coefficient, Shannon entropy and HEC number showed that they all presents in sarcoidosis tissue group clones in a more skewed manner than that of in PMBCs groups. 2 nucleotide sequences and 2 amino acid sequences were shared by all samples. The comparison of TRBV, TRBJ usage and VJ combination frequency identified 2 TRBV genes, 2 TRBJ genes differentially expressed among different groups and different higher usage and lower usage of V-J combinations between each group.Entities:
Keywords: CDR3; Immune response; Immunity; Immunology and Microbiology Section; PMBCs; TCR BV CDR3; immune system; sarcoidosis
Year: 2017 PMID: 29163767 PMCID: PMC5685688 DOI: 10.18632/oncotarget.20085
Source DB: PubMed Journal: Oncotarget ISSN: 1949-2553
Sequencing quality data of all samples
| Sample ID | Total reads | immune sequences number | Unknown sequences numebr | productive sequences number | Non_productive sequences number | In-frame sequences number | Out-of_frame sequences number | Total CDR3 sequences number | Unique cdr3 nt sequences number | Unique cdr3 aa sequences number |
|---|---|---|---|---|---|---|---|---|---|---|
| 1-CD4 | 982482 | 977713 | 4769 | 780923 | 196790 | 812791 | 162802 | 768159 | 44863 | 36275 |
| 1-CD8 | 1019665 | 1010032 | 9633 | 651284 | 358748 | 681950 | 323796 | 635130 | 30211 | 24347 |
| 1-Tissue | 826801 | 814689 | 12112 | 564771 | 249918 | 579149 | 232756 | 549839 | 11864 | 8861 |
| 2-CD4 | 875416 | 871213 | 4203 | 720687 | 150526 | 744888 | 124171 | 709860 | 25395 | 19812 |
| 2-CD8 | 804580 | 799020 | 5560 | 478189 | 320831 | 513158 | 283181 | 454400 | 21587 | 18177 |
| 2-Tissue | 264682 | 257618 | 7064 | 131241 | 126377 | 136759 | 117761 | 126659 | 5849 | 4668 |
| 3-CD4 | 629771 | 626450 | 3321 | 493196 | 133254 | 514152 | 110960 | 477407 | 34832 | 28588 |
| 3-CD8 | 673637 | 667934 | 5703 | 507676 | 160258 | 534196 | 131185 | 490002 | 25857 | 20706 |
| 3-Tissue | 173346 | 166956 | 6390 | 85734 | 81222 | 88235 | 77378 | 36817 | 1138 | 890 |
| 4-CD4 | 636166 | 613326 | 22840 | 334755 | 278571 | 351373 | 254623 | 294852 | 10897 | 8461 |
| 4-CD8 | 504161 | 500700 | 3461 | 377054 | 123646 | 387921 | 110862 | 371353 | 10538 | 8835 |
| 4-Tissue | 875950 | 870778 | 5172 | 612724 | 258054 | 636158 | 231659 | 603077 | 8563 | 6150 |
| 5-CD4 | 529970 | 525726 | 4244 | 377036 | 148690 | 389611 | 133652 | 370826 | 35632 | 33259 |
| 5-CD8 | 684006 | 680876 | 3130 | 530370 | 150506 | 544488 | 134645 | 525921 | 34633 | 31673 |
| 5-Tissue | 276040 | 268304 | 7736 | 170723 | 97581 | 175980 | 89388 | 160569 | 8113 | 6464 |
| 6-CD4 | 457009 | 454886 | 2123 | 359367 | 95519 | 375971 | 77831 | 353271 | 44057 | 38882 |
| 6-CD8 | 411422 | 407853 | 3569 | 325237 | 82616 | 336866 | 68811 | 316420 | 9609 | 7364 |
| 6-Tissue | 423421 | 409533 | 13888 | 247874 | 161659 | 254589 | 150408 | 227670 | 3990 | 3273 |
| 7-CD4 | 805096 | 801055 | 4041 | 527367 | 273688 | 548346 | 250976 | 510035 | 22370 | 17668 |
| 7-CD8 | 781561 | 765334 | 16227 | 376143 | 389191 | 573166 | 186529 | 343384 | 7860 | 6166 |
| Average | 631,759.10 | 624,499.80 | 7,259.30 | 432,617.55 | 191,882.25 | 458,987.35 | 162,668.70 | 416,282.55 | 19,892.90 | 16525.95 |
Figure 1Length distribution of CDR3 sequences (aa) in the CD4+ group, CD8+ group and tissue group
A. The different percentage (%) of each amino acid length in 3 groups. B. The different percentage (%) of each amino acid length compared between CD4+ and CD8+ groups (p < 0.0061). C. The different percentage (%) of each amino acid length compared between CD4+ and tissue groups (p < 0.003, p < 0.047, p < 0.0011, p < 0.0012, p < 0.013, p < 0.034, p < 0.033). D. The different percentage (%) of each amino acid length compared between CD8+ and tissue groups (p < 0.0022, p < 0.018).
Figure 2Comparison of Gini coefficient, Shannon entropy, HEC number, HEC ratio in CD4+, CD8+ and tissue groups
A. The comparison of Gini coefficient in 3 groups shows significant different. B. The comparison of Shannon entropy in 3 groups. C., D. The comparison of HEC number and HEC ration in 3 groups.
Public nucleotide sequences in CD4+, CD8+, Tissue groups
| Group | Public Sequences (NT) | Public Sequences (AA) |
|---|---|---|
| Tissue | GCCAGCAGCGAGCAGGGAAGAGATACGCAGTAT | AISSDSYNEQF |
| AGCCGACGT | SVDGGRQETQY | |
| GCCAGTAGTAGAAGAGATGGCTACACC | ATSREGAGETQY | |
| GCCACCAGCAGGACAGGGTTTAATGAAAAACTGTTT | ASSGTGRWETQY | |
| GCCAGCATCCCCTCGCCTGTTTTCCCCGGGGAGCTGTTT | ATSGIAGETQY | |
| GCCAGCAGTTTTGAGACAGGGGCGGAAGCTTTC | ATSGIAGEELF | |
| GCCAGCAGTGACTCACTAGCGGGAGGGCGGTGG | ASSTAGSVGYGYT | |
| GCCACCAGCCGTCGACAGGGAGTGAATTCACCCCTCCAC | ASRGTGAGKLF | |
| GCCAGCAGCCAAGGCGTCCAAGAGACCCAGTAC | AKLLQTGLTGELF | |
| GCCACCAGCAGAGAGGGGGCGGGTGAGACCCAGTAC | AIKGTGSETQY | |
| GCCAGCAGTGGACAACTAGCGGGGTTTCGCACAGATACGCAGTAT | ATSDFDSGLSGTQY | |
| GCCACCAGCGGCATAGCGGGGGAGGAGCTGTTT | ASSPRDSHGNTIY | |
| GCCACCAGCAAAGACAGGGGGCAAGAGACCCAGTAC | ATSYGNVQPQH | |
| GCCAGCAGCCCCACCCGGACTCAGGGCTACACC | SRR | |
| GCCAGCAGTCCCTGGGGAGGCGGTAATGAAAAACTGTTT | AIKNTGGITEAF | |
| GCCACCAGCTACGGGAACGTTCAGCCCCAGCAT | ASSTWTEADVLT | |
| GCCATCAGTGATTCGGACAGGGCTTACTACGAGCAGTAC | ASVLQGAGSGYT | |
| GCCATCAAAGGGACAGGCTCCGAGACCCAGTAC | ASSQGVQETQY | |
| GCCAGTAGCCCAGGACGCCTTAGCTCTGGAAACACCATATAT | ASIPSPVFPGELF | |
| GCCAGTAGCCCAACAGGGGTGGTGGCAGAGACTCAGTAC | ATSGGSYGYT | |
| GCCACCAGTGAT | ASSPWGGGNEKLF | |
| GCCATCAGCAGTGACTCCTACAATGAGCAGTTC | ASSELRTGNTGELF | |
| GCCAAACTCCTTCAGACAGGCCTAACCGGGGAGCTGTTT | ASKDARNNSPLH | |
| GCCAGCAGTGAATTGCGGACAGGGAACACCGGGGAGCTGTTT | ATSREEKLF | |
| GCCAGCAGCCCTCTTCCGGGACAGGGGCGCGAGCAGTAC | ASSDVLPNTEAF | |
| GCCAGTAGTACATGGACGGAGGCCGACGTCCTGACT | ASFFLAGGPEETQY | |
| GCCACCAGTGATTTGGCCAGGCATTATGGCTACACC | ATSRRQGVNSPLH | |
| GCCAGCAGTCGGGGACTAGCGAGGATATACGAGCAGTAC | ASSPGRLSSGNTIY | |
| GCCATCAGTGAGTCAAAAGGGGACACCGGGGAGCTGTTT | ASRGSNQPQH | |
| GCCAGCAGTGGCAACACGGGGGGCCGCTCCTACGAGCAGTAC | ASSDSLAGGRW | |
| GCCAGCAGCCCGTACAGGGGGGACACCGGGGAGCTGTTT | ASSGKLTEAF | |
| GCCAGCGTATTACAGGGGGCTGGTTCTGGCTACACC | ASADLSIYEHY | |
| GCCAGTAGTACAGCAGGGTCAGTAGGCTATGGCTACACC | ASSPTGVVAETQY | |
| GCCAGTAGTGGGAAGTTGACTGAAGCTTTC | ATSHLGHPQETQY | |
| GCCACCAGTGGAGGTTCGTATGGCTACACC | AISESKGDTGELF | |
| GCTAGTGGTTTATCCCGACTCCAAGAGACCCAGTAC | ATSRTGFNEKLF | |
| GCCAGTGCCGACCTTAGCATATACGAGCATTAC | ASSGNTGGRSYEQY | |
| GCCACCAGCAGAGATGACAGGGGACCCGAAAAACTGTTT | ASSSRWLADNEQF | |
| GCCAGCAGTGGGACAGGGCGATGGGAGACCCAGTAC | ATQKGAAYGYT | |
| GCCTGCAAGCCGGAGTCACTCGGGACCCAGTAC | ASSGQLAGFRTDTQY | |
| GCCAGCTTTTTCCTAGCGGGAGGGCCTGAGGAGACCCAGTAC | ASSDDGRGRRSYEQY | |
| GCCAGCAGGGGGACAGGGGCTGGAAAACTGTTT | ACKPESLGTQY | |
| GCCACCAGCGGCATAGCGGGGGAGACCCAGTAC | ASSRDRYGTGVLYTEAF | |
| GCCAGGGGACTAGCGGGGACTAGCACAGATACGCAGTAT | ATAGTGNTDTQY | |
| GCCAGTAGTATTCAAGAGACCCAGTAC | ASSPTRTQGYT | |
| GCCACCAGTGATTTTGATTCGGGACTTTCGGGGACCCAGTAC | ASSPYRGDTGELF | |
| GCCAGCAGCCCTCGGGACAGCCATGGAAACACCATATAT | ASSIQETQY | |
| GCCAGCAGCCGAGGGGACAACCTAGGGAATCAGCCCCAGCAT | ASSQIGGTQETQY | |
| GCCAGCAGTGATGTGTTACCGAACACTGAAGCTTTC | ATSDLARHYGYT | |
| GCCACCGCCGGGACAGGGAACACAGATACGCAGTAT | ATSKDRGQETQY | |
| GCCACCAGCCACTTGGGACACCCGCAAGAGACCCAGTAC | AISDSDRAYYEQY | |
| GCCACCAGCAGAGAGGAAAAACTGTTT | ASSRGLARIYEQY | |
| GCCATCAAGAATACAGGGGGCATCACTGAAGCTTTC | ATSD | |
| GCCAGCAGGGGTAGCAATCAGCCCCAGCAT | ASSEQGRDTQY | |
| GTGCCTGAGTATGGTACAGCTAATGGCCGTCTTCATTTCCATGCGGTGCACTTTATGCGGACACTTCCTACAGGTAGCGTTGACCCTAAT | ASSPLPGQGREQY | |
| GCCACTCAGAAGGGGGCCGCCTATGGCTACACC | ASGLSRLQETQY | |
| GCCAGCAGCTCCCGGTGGCTAGCGGACAATGAGCAGTTC | ASSFETGAEAF | |
| GCCAGCAGTGATGATGGGCGGGGCCGCCGCTCCTACGAGCAGTAC | ASSRGDNLGNQPQH | |
| GCCAGCAAGGACGCCAGGAATAATTCACCCCTCCAC | VPEYGTANGRLHFHAVHFMRTLPTGSVDPN | |
| ATSRDDRGPEKLF | ||
| ARGLAGTSTDTQY | ||
| ASSRRDGYT | ||
| CD8+ | AGCCGACGT | ASS |
| AGCCGACGTTTTGGCGGCGCAACCTGTGACGACAAATCTGCTCAAATTTATGCGCGCTTCGATAAAAATGATTGGCGTATCCAACCTGCAGAGTTTTATCGCTTCCATGACGCAGAAGTTAACACT | SRR | |
| GCCAGCAGC | SRRFGGATCDDKSAQIYARFDKNDWRIQPAEFYRFHDAEVNT | |
| GTGCCTGAGTATGGTACAGCTAATGGCCGTCTTCATTTCCATGCGGTGCACTTTATGCGGACACTTCCTACAGGTAGCGTTGACCCTAAT | VPEYGTANGRLHFHAVHFMRTLPTGSVDPN | |
| CD4+ | AGCCGACGT | ASS |
| AGCCGACGTTTTGGCGGCGCAACCTGTGACGACAAATCTGCTCAAATTTATGCGCGCTTCGATAAAAATGATTGGCGTATCCAACCTGCAGAGTTTTATCGCTTCCATGACGCAGAAGTTAACACT | SRR | |
| GCCAGCAGT | SRRFGGATCDDKSAQIYARFDKNDWRIQPAEFYRFHDAEVNT | |
| GCCAGCAGC | VPEYGTANGRLHFHAVHFMRTLPTGSVDPN | |
| GTGCCTGAGTATGGTACAGCTAATGGCCGTCTTCATTTCCATGCGGTGCACTTTATGCGGACACTTCCTACAGGTAGCGTTGACCCTAAT | ||
Figure 3Comparison of TRBV gene and TRBJ gene usage in CD4+, CD8+, tissue groups
(A1) Comparison of TRBV gene usage between CD4+ and tissue groups. (A2) Comparison of TRBV gene usage between CD8+ and tissue group. (A3) Comparison of TRBV gene usage in between CD4+ and CD8+ groups. (B1) Comparison of TRBJ gene usage between CD4+ and tissue groups. (B2) Comparison of CD8+ and tissue groups. (B3) Comparison of TRBJ gene usage between CD4+ and CD8+ groups.
Figure 4Comparison of V-J combinations among CD4+, CD8+ and tissue groups
A. Comparison of V-J combinations between CD4+ and CD8+ group. B. Comparison of V-J combinations between CD4+ and tissue groups. C. Comparison of V-J combinations between CD8+ and tissue groups.