| Literature DB >> 29928407 |
Song Liu1, Ying Zhu1, Lie-Wen Lin2, Shun-Kai Ding2, Xiao-Cong Lin1, Ke-Li Zhong2, Kai Pan2, Yong Dai1.
Abstract
Gastric cancer (GC) is the fourth most common type of cancer and the second most common cause of cancer-associated mortality worldwide. B cell-associated autoantibodies against tumor-associated antigens are attractive biomarkers for the development of noninvasive serological tests for the early detection of cancer. This is due to their specificity and stability in the sera. In the present study multiplex polymerase chain reaction and Illumina high-throughput sequencing (HTS) was used to study the composition and variation of the B cell receptor (BCR) complimentary-determining region 3 (CDR3) in GC. The peripheral blood, cancer tissues and peri-cancer tissues were included from 7 patients with GC. On average there was a total of 403,959 CDR3 sequences, with 72,367 unique CDR3 nt sequences and 61,709 unique CDR3 aa sequences per sample identified, which are critical for further understanding the BCR repertoire in GC. The details of GC CDR3s may accelerate the screening process for possible new autoantigens and may provide additional information necessary for generating effective B cell targeted diagnosis and therapeutic strategies.Entities:
Keywords: B cell receptor; complimentary-determining region 3; gastric cancer; immune repertorie; next-generation
Year: 2018 PMID: 29928407 PMCID: PMC6006485 DOI: 10.3892/ol.2018.8677
Source DB: PubMed Journal: Oncol Lett ISSN: 1792-1074 Impact factor: 2.967
Human immunoglobulin heavy chain sequence statistics.
| Data analysis | Blood | Cancer | Peri-cancer |
|---|---|---|---|
| Total reads number | 519954 | 544159 | 503171 |
| Immune sequences number | 515136 | 541462 | 500994 |
| Unknown sequences number | 4818 | 2697 | 2177 |
| Productive sequences number | 407885 | 461434 | 413926 |
| Non_productive sequences number | 107251 | 80028 | 87068 |
| In-frame sequences number | 438135 | 494654 | 443395 |
| Out-of_frame sequences number | 76828 | 46654 | 57415 |
| Total CDR3 sequences number | 382253 | 437438 | 392186 |
| Unique CDR3 nt sequences number | 57658 | 76166 | 83278 |
| Unique CDR3 aa sequences number | 48547 | 64121 | 72460 |
CDR3, complimentary-determining region 3.
Figure 1.Length distribution of B cell receptor complimentary-determining region 3 sequences (aa) in the blood, cancer and peri-cancer group of gastric cancer.
Blood sample's HEC aa sequences for ratios ≥0.1%.
| HEC aa sequence | HEC ratio (%) |
|---|---|
| ARDLSSWYRDMDV | 0.52 |
| ARDQRYYYYMDV | 0.48 |
| ARDRGYWYFDL | 0.45 |
| ARLLTMIHYYMDV | 0.38 |
| ARTTDSGQHWYFDL | 0.37 |
| ARDVIAIYYYYMDV | 0.36 |
| ARGYNPDYGMDV | 0.35 |
| ARLHRSWNEVYYGMDV | 0.33 |
| ARFGTRSNFQH | 0.31 |
| ARVGCGGGRCSLGMHV | 0.31 |
| ARGMYGDYVYYGMDV | 0.30 |
| ARDRAAPYYYYYMDV | 0.28 |
| ARQQLNLHMDV | 0.28 |
| AREDYSYYYYMDV | 0.27 |
| ARSSYYYGMDV | 0.27 |
| ARIPWDYDWYFDL | 0.27 |
| ARRRSIAAGPLDV | 0.27 |
| ARRDTLGV | 0.25 |
| ARAPYVPMDV | 0.25 |
| ARDSLHSWYDDFQN | 0.25 |
| ARDPLYYGMDV | 0.25 |
| ARSSIGYYYMDV | 0.25 |
| ARHWSDQHLHYYYGMDV | 0.25 |
| ARGRGYVTYGMDV | 0.25 |
| ATGRYYYYGMDV | 0.24 |
| ARLPWLGGMDV | 0.24 |
| ARLPLAYSNYYYYYGMDV | 0.23 |
| ARDSGGYYGMDV | 0.23 |
| ARDGGDYYYYYMDV | 0.22 |
| AMGGGYYYYYGMDV | 0.22 |
| ARHPRPVTDYYYYYYMDV | 0.21 |
| ARDTYGDYASYYYYGMDV | 0.21 |
| ARFWTNSSSWYYYGMDV | 0.21 |
| ARDWSYGLDV | 0.21 |
| ARDHWNYDGGYMDV | 0.21 |
| ARTSSGWYLGSYYYGMDV | 0.21 |
| ARDNDYSDYYGMDV | 0.20 |
| ARDGAAGTQYGMDV | 0.20 |
| ARDTAMVGRGRGTYGMDV | 0.20 |
| ARDPGSSNWYFDL | 0.20 |
| ATGRRDYYYYGMDV | 0.20 |
| ARVAYCGGDCYRNLDV | 0.19 |
| ARQGFDYYYMDV | 0.19 |
| ARSLSSTYYYYYMDV | 0.18 |
| ARAVVPAAIDGWYFDL | 0.18 |
| AKCGPRGARGTMDV | 0.18 |
| AKAGRAAAGTGYFQH | 0.18 |
| ARSQMATIDYYYGMDV | 0.18 |
| ARDFGDYMDV | 0.18 |
| ARGGWSWYFDL | 0.18 |
| ARSPGKHIVVVTVVFDL | 0.18 |
| ARDEATGVAGGMDV | 0.18 |
| ARLSPTQYYYYGMDV | 0.17 |
| ARDGIAGLDY | 0.17 |
| ARDMELGFDYYMDV | 0.17 |
| ARVYSGYTITRWYYMDV | 0.17 |
| AKQRHTRMNYMDV | 0.17 |
| ARFKNYYYYGMDV | 0.17 |
| ARGWNTDSYYFYMDV | 0.17 |
| ARVGHMVRIYYYYGMDV | 0.17 |
| AREFPAVTTPGGMDV | 0.17 |
| ARDAEGMDV | 0.17 |
| ARDRTRLLIDYYYYYMDV | 0.17 |
| ARDTGASHFYYYYGMDV | 0.16 |
| ARDTRYDYYYYMDV | 0.16 |
| ARTDGLGTYYYGMDV | 0.16 |
| GRVDTLIMYGMDV | 0.16 |
| AREDEYYGMDV | 0.16 |
| ARDAF | 0.16 |
| ARVTVAAADREGMDV | 0.15 |
| AKADFRSAGDYYYYMDV | 0.15 |
| ARVRFDGSGSRHYYYYGMDV | 0.15 |
| ATYTSSWYFYGMDV | 0.15 |
| ARDSPWNYYYGMDV | 0.15 |
| ARHTKVYGDYDDSYYYYYMDV | 0.15 |
| ARDDEAYRRSSYYYYGMDV | 0.15 |
| ARGQRTYGRGYYYYGMDV | 0.15 |
| ARGGSSSWYMDV | 0.15 |
| ARHPYGYNWNELGTEPNYYYMDV | 0.15 |
| ARDRGSGYFDL | 0.15 |
| ARDQASIAAQNYGMDV | 0.15 |
| ARLSQGYGDRDDIFQFYWYFDL | 0.15 |
| AKDGMSYSSSWHYWYFDL | 0.15 |
| TRDAEGMDV | 0.15 |
| ARDRGIAAAGTPYYYYGMDV | 0.15 |
| ARDGTHCSGSRCYGYFDL | 0.15 |
| AKAGPRWYYYYYMDV | 0.15 |
| AREGLSGSYYYYYGMDV | 0.15 |
| AREISYYYYMDV | 0.15 |
| AKHGGDIAGLRYFHY | 0.15 |
| ARHSMTTSYYYGMDV | 0.14 |
| ARDRTAVVVAASLLYGMDV | 0.14 |
| ARGEWEPPIGYYYYGMDV | 0.14 |
| ARDIAARPEQSVQH | 0.14 |
| ARDFEQQLLYYYYYMDV | 0.14 |
| ARIAAAGHYYYYGMDV | 0.14 |
| AKSATGPRPYWYFDL | 0.14 |
| ARPPEGSYYAFDI | 0.14 |
| ARGGAANDYYYYYMDV | 0.14 |
| ARDGQTTWLYYYYMDV | 0.14 |
| ARDKVPAANYYYGMDV | 0.14 |
| ARDLITGTTGMDV | 0.14 |
| APKPGRRLVDV | 0.14 |
| ARDLRGGSSYGMDV | 0.14 |
| ARHDDTSGQDLHEH | 0.13 |
| ARDTLSGGYYYYYYMDV | 0.13 |
| ARDFPGGMDV | 0.13 |
| ARDVDP | 0.13 |
| ARDRPYSSGWSWGYGMDV | 0.13 |
| AKSAYCTPKCNALDV | 0.13 |
| AKVVVFPVVVPAASDYMDV | 0.13 |
| ARQHRRVRGVMNYYGMDV | 0.13 |
| ARDRRGGDYYYYYGMDV | 0.13 |
| AKDSWYSRPYYGMDV | 0.13 |
| ARHGWFGELGVYYYYGMDV | 0.13 |
| AKVYRDYYYYGMDV | 0.13 |
| ARRRYSGYRDDAFDI | 0.12 |
| ARDLRSGEMATPNYYYYGMDV | 0.12 |
| ARGGVDTAMGYYYYYMDV | 0.12 |
| ARVRGSSSWYRPRGMDV | 0.12 |
| ARVGVPIADEDYYYYMDV | 0.12 |
| ARDGRDSSGWYSYWYFDL | 0.12 |
| ARHLTGELFGMDV | 0.12 |
| ASVAATTNYYYVLDV | 0.12 |
| ARGNWGTSWYFDL | 0.12 |
| ARHKLRDGSGSYFLYMDV | 0.12 |
| ARDLAVLGGSGLGGL | 0.12 |
| ARDRGPYYYYYGMDV | 0.12 |
| ARDPGGGTTGMTYYYYGMDV | 0.12 |
| ARVVLPFGELEAMDV | 0.12 |
| ARGVTIFGVVTYYYYMDV | 0.12 |
| ARFRVSYGYVDYYYYGMDV | 0.12 |
| ASAYGDYGAAFDI | 0.12 |
| AKDEKGVIYYGMDV | 0.12 |
| AREGLRNYYYYYMDV | 0.12 |
| ARGKLLSPMDV | 0.12 |
| ARDLGTTVTTERSYYYGMDV | 0.12 |
| ARDIRREWEPSFYYGMDV | 0.12 |
| ARDFGHIYDEYDFDF | 0.12 |
| ATIRCSGGSCPYYYYGMDV | 0.12 |
| ARSGYYDRYYYMDV | 0.12 |
| ARDFTDQNTVYYGMDV | 0.12 |
| ARGGPYYYYYYGMDV | 0.12 |
| AREYSSNHYYYYMDV | 0.11 |
| ARGRVPAAKGYYYYYMDV | 0.11 |
| ARDRVADDAFDI | 0.11 |
| ARDLGGYFLYRYYGMDV | 0.11 |
| ARDRNYYGSGSYYSTYMDV | 0.11 |
| AKDDGPYYYGMDV | 0.11 |
| AKDPHGSSWYYYYYYYYMDV | 0.11 |
| ARHSYDILTGYYAYYYGMDV | 0.11 |
| AKGPRSGIRFMDV | 0.11 |
| ARQDYYDSSGYYYDYYYYGMDV | 0.11 |
| ARRVSSSWYGWSDY | 0.11 |
| ARAHGPSSWGGMDV | 0.11 |
| ASPNSSSTAYTFDY | 0.11 |
| AREAMVRGALYYYYYGMDV | 0.11 |
| ARECFSSWYCYYYYGMDV | 0.11 |
| AKDSIAVAGDGMDV | 0.11 |
| ARHVGHYYGSGSYYNTYMDV | 0.11 |
| ARALRRYVVVPAAYYYYYMDV | 0.11 |
| ATASGRYFDYYYYMDV | 0.11 |
| ARPGHVVPAAPAAFDI | 0.11 |
| ARHGLRGCSDRCYTSFYYNGMDD | 0.10 |
| ARMSMTTVPK | 0.10 |
| ARGHMTPDYMDV | 0.10 |
| ARESDYYYGMDV | 0.10 |
| AGPDTAMVSRDYYYYGMDV | 0.10 |
| ARHEAGDLGYDAFDI | 0.10 |
| ARDQGLVVVIKDYYYGMDV | 0.10 |
| ARDKEDPWNALDL | 0.10 |
| ALEEVGY | 0.10 |
| AKVRRAQGYYAMDV | 0.10 |
| ARDRGGIQLWRTRYRYYYYGMDV | 0.10 |
| ARVPTPFDYYYYMDV | 0.10 |
| ARGRQVDSSGWYDYYYYGMDV | 0.10 |
| ARDHYSSDAFDI | 0.10 |
| ATLYYDFFVRPGGMDV | 0.10 |
| GRDCVRAGDYGVDV | 0.10 |
| AREPLYYDYYMDV | 0.10 |
HEC, highly expanded clones.
Cancer sample's HEC aa sequences for ratio >0.1%.
| HEC aa sequence | HEC ratio (%) |
|---|---|
| ARWRAAAGTRRNYSYHYMDV | 1.23 |
| AKDFGAGLGGYYMDV | 1.01 |
| ARLQGGAVFQH | 0.79 |
| ARGSMSIRAGWYFDL | 0.75 |
| AADRGDYGDYEDYYYYIDV | 0.64 |
| AREVHQRQQSEDAFDV | 0.57 |
| ARDLCSDGVCDWYYFMDV | 0.52 |
| ARDGRTWHYESRGFHGWFDA | 0.51 |
| AKEGIPAAGMSEGYYYYMDV | 0.38 |
| ARELRGGSWAGGMDV | 0.29 |
| ARTRITILGDMDV | 0.27 |
| ARDLRGSDDY | 0.27 |
| TIGHYST | 0.26 |
| ARTRITIFGDMDV | 0.23 |
| AGSSGFDPFDC | 0.23 |
| AQEIRPNDC | 0.22 |
| ATDAVNNWNSHY | 0.21 |
| EAGGAGFDC | 0.21 |
| ARDASSSGLRYYGMDV | 0.21 |
| ARRNKGSLGWDFDY | 0.21 |
| AKGGNTGGTNQFLSYYYHYMDV | 0.20 |
| ARDRAGDYAADD | 0.20 |
| ARQRGYYYNMDV | 0.20 |
| AADGFQLEDFRYGMDV | 0.20 |
| ARTQWEYWYFDL | 0.18 |
| ARLWTVTPTDYGMDV | 0.18 |
| TIGHYSS | 0.18 |
| ARTVTWGYMDV | 0.17 |
| AKDDTTYCGGDCYFDL | 0.17 |
| AKGGHMGGPNQFLSYYYHYMDV | 0.17 |
| TRGGSRGDSISWYTGMHSYYGMDV | 0.17 |
| ARVLHGGGGHFHH | 0.17 |
| ARSPYDSSGSKYYGMDV | 0.16 |
| TRDERGRAAQTNYYYYYMDV | 0.16 |
| ARGRQDYFDF | 0.15 |
| AKDARYCTPTSCNTPLSYSSFYYMDA | 0.15 |
| ATEDSNGGSYRRH | 0.15 |
| ARTSGSSGDAFDI | 0.15 |
| AKDRGGPNIPDWYFDL | 0.15 |
| ARPSGAVTTMRMDV | 0.14 |
| ARVLHGGGGNFHH | 0.14 |
| ATDSLDV | 0.14 |
| ARGGSRGDSTSWYTGMHSYYGMDV | 0.14 |
| AKDDTTYCGGDCYFDI | 0.14 |
| ATLDTHPAHYYYGMDV | 0.13 |
| AKGGEGRTTYWYLDL | 0.13 |
| ARGRQDYFDY | 0.13 |
| ARFHLKDGTSSEY | 0.13 |
| ARLSGSYYYYYYMDV | 0.13 |
| AKDQQWLIRLVHDY | 0.12 |
| ARDPYYYGSGSRYYYYMDV | 0.12 |
| ARGPGAAAGTYYYYSMDV | 0.12 |
| ARDYGEN | 0.12 |
| ARGPRQIGARSSSGLEDWHFEL | 0.12 |
| ARDISPAYRGNHAFDI | 0.12 |
| AKARTPGTPYYYHMDF | 0.12 |
| ARSSGRAAGVDS | 0.11 |
| ARETQGFDP | 0.11 |
| VRGTITFDY | 0.11 |
| AKTYGDYGGETAFDM | 0.11 |
| ASEVHLQSDRLGMDV | 0.11 |
| ARGGLPSSYYYYMDV | 0.11 |
| ARTSQLGRYLDL | 0.11 |
| ASEIHLGADRLGMGV | 0.11 |
| ARESGGYRYGHIDHYYNAMDV | 0.11 |
| ARLDCSSTSCYPDLGLGYFDL | 0.11 |
| ARDIGSSSWYEHLEQ | 0.11 |
| AKLNSALGVTGRRGRPVYFED | 0.11 |
| ARSRDRYYSYYMDV | 0.11 |
| ARTTYSDV | 0.11 |
| ARRRYGEGFQH | 0.11 |
| ASGTASWYDYYYGMDV | 0.11 |
| ARDPGGYSFDH | 0.11 |
| VRGEDDDGDYVDYYYGMDV | 0.11 |
| ARIRPVYTIDF | 0.11 |
| AKGGEGRTTYWYFDL | 0.11 |
| VTQGDGSLDF | 0.10 |
| SRARITIFGDMDV | 0.10 |
| ARETEGFDP | 0.10 |
| ARGRGNDYGDYSYYYYMDV | 0.10 |
| GHIGTFDL | 0.10 |
| ARGGSRGDSTSWYTGMHVYYGLDV | 0.10 |
HEC, highly expanded clones.
Peri-cancer sample's HEC aa sequences for ratio >0.1%.
| HEC aa sequence | HEC ratio (%) |
|---|---|
| VRHSSGDYRNWYFDL | 1.16 |
| ARAGYTYGEDMDV | 0.72 |
| AREMDV | 0.59 |
| CAYHDY | 0.52 |
| ARDIGSSSWYEHLEQ | 0.39 |
| ASQRAIFRPMDV | 0.30 |
| AALLQHNGRGTFDF | 0.26 |
| AKDDEYHDSFGLDV | 0.21 |
| ARHEVASHDSYYMDV | 0.19 |
| ARLPEGLDWHLDL | 0.17 |
| ARGRRPDHTYFYMDV | 0.16 |
| GVYV | 0.16 |
| ARGRTPGRMDV | 0.15 |
| AVAVHGTYFWYFDV | 0.15 |
| ATSDNYYMDV | 0.14 |
| ASEIHLGADRLGMGV | 0.14 |
| ARVPWGWFFDY | 0.12 |
| AREGI | 0.12 |
| A | 0.12 |
| AKDVGTYYIYNYMDV | 0.12 |
| ARSSDRAEFGGNYYYSMDV | 0.11 |
| ARPGQQRGGWYFDL | 0.11 |
| VRRGFWSEAAIGKDGNYYYMDV | 0.11 |
| ARSATTAANWYFNL | 0.11 |
| AASSDY | 0.11 |
| AKDRIRWSLYDFCSGFDV | 0.10 |
| AKAGGAFLYMDV | 0.10 |
| ARGRRGYSGYENRPLLDQ | 0.10 |
| ATLQQGHSRGSLTNPSFDYYTMDV | 0.10 |
| ARHRRYYGSGYYMDV | 0.10 |
| ARDGMRTGNMDV | 0.10 |
HEC, highly expanded clones.
Top 20 cancer special CDR3 aa and nt sequence.
| aa sequence | nt sequence | Sequence ratio (%) |
|---|---|---|
| GCGAGAGATCCCGGTGGATACTCCTTTGACCAC | ARDPGGYSFDH | 0.11 |
| GCGAGACAGGGTGTTGAAGCAGCAGCTGATTCCTACTACTACTACGGTATGGACGTC | ARQGVEAAADSYYYYGMDV | 0.08 |
| GCGAGAGTATTACATGGCGGTGGCGGGATCCTCCATCAC | ARVLHGGGGILHH | 0.07 |
| GCGAGAGATGGTTCGGGGAGCCCCATGGACGTC | ARDGSGSPMDV | 0.07 |
| GCGAGAGATCCTTATTACTATGGTTCGGGAAGTTCTTCGTACTACTACTTTGACTCC | ARDPYYYGSGSSSYYYFDS | 0.07 |
| GCGAGAACGGCAATTGCTACGAGGGGCTACTACTACATGGACGTC | ARTAIATRGYYYMDV | 0.07 |
| GCAAGAGGTACTATGGTTCGGGGAGTCCTCAGAGGCTACATGGACGTC | ARGTMVRGVLRGYMDV | 0.06 |
| GCGAGAGATTTGACTGCAGCAGCTGGCACCCCTTTCTACTACTACAAAGGTTTGGACGTC | ARDLTAAAGTPFYYYKGLDV | 0.06 |
| GCGAAATGGGAGGGCACTTACCCTAAGTATTACATGGACATC | AKWEGTYPKYYMDI | 0.06 |
| GCGAGAGCTCGTGCTTTTGATATC | ARARAFDI | 0.06 |
| GCGAGAGGGGGTTTGGTAGCTGGAGCTATGGACGTC | ARGGLVAGAMDV | 0.06 |
| GCGAGAGATCCTTCTTACGGTATGGACGTC | ARDPSYGMDV | 0.05 |
| GCGAGAGGTTCGGGGAGTTCTTACTACTACTACTACGGTATGGACGTC | ARGSGSSYYYYYGMDV | 0.05 |
| GCGAGAGTATTACATGGCGGTGGCGGGATCTTCCACCAC | ARVLHGGGGIFHH | 0.05 |
| GCGAGACTTGCCCGTGACCCTGCTATGGAGACTTTAGCCGGCTGGTACTTCGATCTC | ARLARDPAMETLAGWYFDL | 0.05 |
| GCGAGACCGCGGTATAACTGGAACTACATGGGGGACTACTACTACGGTATGGACGTC | ARPRYNWNYMGDYYYGMDV | 0.05 |
| GCGAGAACAGGCAAAGAAATGATGGACTACATGGTCGTC | ARTGKEMMDYMVV | 0.05 |
| GTGAGAGGGGAGGGCGGCTACTACTACGGTATGGAAGTC | VRGEGGYYYGMEV | 0.05 |
| GCGAGATCGTCTGGAAGGGCAGCGGGCGTTGATTCC | ARSSGRAAGVDS | 0.05 |
| GCGAAATATAGTAGTTCGTCTAAGTACTATTATTATATGGACGTC | AKYSSSSKYYYYMDV | 0.04 |
CDR3, complimentary-determining region 3