| Literature DB >> 35382366 |
Khai Pang Leong1,2, Mei Yun Yong1, Liuh Ling Goh3, Chia Mun Woo1, Chia Wei Lim2, Ee Tzun Koh1.
Abstract
Objectives: This study aims to uncover variants of large effect size and allele frequency below 5% by sequencing all extant genes associated with rheumatoid arthritis (RA) in a homogeneous patient cohort. Patients and methods: This retrospective study was conducted between January 2001 and December 2017. We selected Chinese RA patients positive for anti-citrullinated peptide antibody (ACPA). All the 128 known candidate genes identified through genome-wide association studies were sequenced in 48 RA patients (15 males, 33 females; mean age 53.32±8.98 years; range, 32 to 75 years) and 45 controls (11 males, 34 females; mean age 32.18±9.54; range, 21 to 57 years). The exonic regions of these genes were sequenced. The resultant data were analyzed for association using single variant association and pathway-based association enrichment tests. The genetic burden due to low-frequency variants was assessed with the C-alpha test. The candidate variants that showed significant association were validated in a larger cohort of 500 RA cases (71 males, 429 females; mean age 48.6±12.2 years; range, 24 to 92 years) and 500 controls (66 males, 434 females; mean age 32.3±10.1 years; range, 21 to 73 years).Entities:
Keywords: Genetics; rheumatoid arthritis; sequencing
Year: 2021 PMID: 35382366 PMCID: PMC8957766 DOI: 10.46497/ArchRheumatol.2021.8127
Source DB: PubMed Journal: Arch Rheumatol ISSN: 2148-5046 Impact factor: 1.472
List of genes containing rheumatoid arthritis susceptibility loci that are re-sequenced in this project
| SN | Gene | CDS Length (base pairs) | SN | Gene | CDS Length (base pairs) |
| 1 | ABHD6 | 1014 | 65 | IL6 | 639 |
| 2 | ACOXL | 1743 | 66 | IL6R | 1407 |
| 3 | AFF3 | 3756 | 67 | IL6ST | 2757 |
| 4 | AHNAK2 | 17388 | 68 | ILF3 | 2697 |
| 5 | AIRE | 1638 | 69 | INPP5B | 2982 |
| 6 | ANKRD55 | 1845 | 70 | IRAK1 | 2139 |
| 7 | ANXA3 | 972 | 71 | IRF4 | 1356 |
| 8 | ARAP1 | 4353 | 72 | IRF5 | 1545 |
| 9 | ARID5B | 3567 | 73 | IRF8 | 1281 |
| 10 | ATG5 | 828 | 74 | JAZF1 | 732 |
| 11 | ATM | 9171 | 75 | LBH | 336 |
| 12 | B3GNT2 | 1194 | 76 | LOC100506023 | * |
| 13 | BLK | 1518 | 77 | LOC100506403 | * |
| 14 | C1QBP | 849 | 78 | LOC145837 | * |
| 15 | C5 | 5031 | 79 | LOC339442 | * |
| 16 | C5orf30 | 621 | 80 | LY9 | 1968 |
| 17 | CASP8 | 1617 | 81 | MBP | 915 |
| 18 | CCL19 | 315 | 82 | MED1 | 4746 |
| 19 | CCL21 | 405 | 83 | MMEL1 | 2340 |
| 20 | CCR6 | 1125 | 84 | MTF1 | 2262 |
| 21 | CD2 | 1056 | 85 | NFKBIE | 1503 |
| 22 | CD226 | 1011 | 86 | P2RY10 | 1020 |
| 23 | CD244 | 1113 | 87 | PADI4 | 1992 |
| 24 | CD28 | 705 | 88 | PLCL2 | 3384 |
| 25 | CD40 | 834 | 89 | PLD4 | 1542 |
| 26 | CD5 | 1488 | 90 | PPIL4 | 1479 |
| 27 | CD83 | 618 | 91 | PRKCH | 2052 |
| 28 | CDK2 | 1041 | 92 | PRKCQ | 2121 |
| 29 | CDK4 | 912 | 93 | PTPN11 | 1782 |
| 30 | CDK6 | 981 | 94 | PTPN2 | 1248 |
| 31 | CEP57 | 1503 | 95 | PTPN22 | 2424 |
| 32 | CFLAR | 1443 | 96 | PTPRC | 3921 |
| 33 | CLNK | 1287 | 97 | PVT1 | * |
| 34 | COG6 | 1974 | 98 | PYX | 1737 |
| 35 | CSF2 | 435 | 99 | RAD51B | 1278 |
| 36 | CSF3 | 624 | 100 | RAG1 | 3132 |
| 37 | CTLA4 | 672 | 101 | RAG2 | 1584 |
| 38 | CXCR5 | 1119 | 102 | RASGRP1 | 2547 |
| 39 | DNASE1L3 | 918 | 103 | RCAN1 | 834 |
| 40 | EOMES | 2118 | 104 | REL | 1860 |
| 41 | ETS1 | 1458 | 105 | RTKN2 | 1830 |
| 42 | ETV7 | 1026 | 106 | RUNX1 | 1443 |
| 43 | FADS1 | 1335 | 107 | SFTPD | 1128 |
| 44 | FADS2 | 1338 | 108 | SH2B3 | 1728 |
| 45 | FADS3 | 954 | 109 | SMIM20 | 507 |
| 46 | FCGR2A | 933 | 110 | SPRED2 | 1298 |
| 47 | FCGR2B | 810 | 111 | STAT4 | 2247 |
| 48 | FCGR3B | 2229 | 112 | SYNGR1 | 702 |
| 49 | FCRL3 | 1335 | 113 | TAGAP | 2196 |
| 50 | GATA3 | 1878 | 114 | TEC | 1896 |
| 51 | GRHL2 | 810 | 115 | TNF | 702 |
| 52 | HLA-DRB1 | 1422 | 116 | TNFAIP3 | 2373 |
| 53 | ICOSLG | 1071 | 117 | TNFREF14 | 852 |
| 54 | IFNGR2 | 1530 | 118 | TNFRSF9 | 768 |
| 55 | IKZF3 | 537 | 119 | TPD52 | 744 |
| 56 | IL10 | 810 | 120 | TRAF1 | 1251 |
| 57 | IL1B | 462 | 121 | TRAF6 | 1569 |
| 58 | IL2 | 936 | 122 | TXNDC11 | 2958 |
| 59 | IL20RB | 489 | 123 | TYK2 | 3564 |
| 60 | IL21 | 489 | 124 | UBASH3A | 1986 |
| 61 | IL23R | 1890 | 125 | UBE2L3 | 639 |
| 62 | IL23RA | 819 | 126 | WDFY4 | 9555 |
| 63 | IL2RB | 1656 | 127 | YDJC | 972 |
| 64 | IL3 | 459 | 128 | ZNF438 | 2487 |
| CDS: Coding sequence. | |||||
Thirty-nine potential rheumatoid arthritis-associated variants detected in 21 genes through next-generation sequencing of 128 genes
| Number | Gene | SNV | Chromosome | Reference base | Alternate base | Impact | |
| 1 | C1orf141 | 1:67630314 | 0.02942 | 1 | T | C | Intron variant |
| 2 | FCGR2A | rs1131184 | 0.0394 | 1 | T | A | 3 prime UTR variant |
| 3 | FCGR2A | rs12046367 | 0.002902 | 1 | A | T | 3 prime UTR variant |
| 4 | FCGR2B | 1:161632071 | 0.04887 | 1 | T | C | Intron variant |
| 5 | FCGR2B | rs375055702 | 0.03061 | 1 | G | T | Intron variant |
| 6 | FCGR2B | rs56308545 | 0.03666 | 1 | A | G | Intron variant |
| 7 | IL6R | rs76419864 | 0.0007269 | 1 | T | C | Upstream gene variant |
| 8 | CD28 | rs28541784 | 1.69E-07 | 2 | C | T | Upstream gene variant |
| 9 | CD28 | rs28688913 | 5.65E-07 | 2 | C | T | Upstream gene variant |
| 10 | CD28 | rs28718975 | 5.14E-06 | 2 | T | C | Upstream gene variant |
| 11 | PXK* | 3:58376372 | - | 3 | C | A | Missense variant |
| 12 | PXK* | 3:58382826 | - | 3 | A | G | Missense variant |
| 13 | PXK* | rs199881366 | - | 3 | T | C | Missense variant |
| 14 | EOMES | rs12715125 | 0.004245 | 3 | C | G | Missense variant |
| 15 | TEC | 4:48271895 | 0.04527 | 4 | C | T | Upstream gene variant |
| 16 | TEC | 4:48271899 | 0.03937 | 4 | C | G | Upstream gene variant |
| 17 | IL6ST* | 5:55237103 | - | 5 | T | G | Missense variant |
| 18 | IL6ST* | 5:55260065 | - | 5 | C | A | Missense variant |
| 19 | IL6ST* | rs146973784 | - | 5 | A | T | Missense variant |
| 20 | CCR6 | 6:167552230 | 0.02443 | 6 | T | C | 3 prime UTR variant |
| 21 | CCR6 | 6:167552236 | 0.0319 | 6 | A | G | 3 prime UTR variant |
| 22 | ETV7 | rs369515633 | 0.04182 | 6 | A | T | 5 prime UTR variant |
| 23 | HLA-DRB1 | rs3180268 | 0.001196 | 6 | A | G | 3 prime UTR variant |
| 24 | HLA-DRB1 | rs77637983 | 0.01095 | 6 | G | C | Missense variant |
| 25 | HLA-DRB1 | rs9269688 | 0.006133 | 6 | A | G | 3 prime UTR variant |
| 26 | JAZF1 | 7:28220297 | 0.009422 | 7 | G | A | 5 prime UTR variant |
| 27 | IRF5* | 7:128587373 | - | 7 | C | T | Missense variant |
| 28 | IRF5* | rs113806178 | - | 7 | G | A | Missense variant |
| 29 | IRF5* | rs201243166 | - | 7 | C | T | Missense variant |
| 30 | C5 | 9:123814552 | 0.03006 | 9 | G | T | Upstream gene variant |
| 31 | GATA3 | rs11567882 | 0.04523 | 10 | T | C | Upstream gene variant |
| 32 | GATA3 | rs28395794 | 0.004603 | 10 | A | G | 5 prime UTR variant |
| 33 | CD5 | 11:60868006 | 0.0006823 | 11 | A | C | Upstream gene variant |
| 34 | CD5 | rs375347163 | 0.03061 | 11 | C | G | 3 prime UTR variant |
| 35 | CD5 | rs72912997 | 0.04337 | 11 | A | G | 3 prime UTR variant |
| 36 | AHNAK2 | rs2819428 | 0.04304 | 14 | G | A | Missense variant |
| 37 | IRF8 | rs75590645 | 0.023 | 16 | G | A | 3 prime UTR variant |
| 38 | IFNGR2 | 21:34775413 | 0.0432 | 21 | G | A | 5 prime UTR variant |
| 39 | UBE2L3 | 22:21920281 | 0.03778 | 22 | T | A | Intron variant |
| SNV: Single nucleotide variant; UTR: Untranslated region; Asterisks indicate that variants were identified by C-alpha, while the rest was identified through association analysis. | |||||||
Genotype and allelic analysis of 10 of 22 single nucleotide variants in validation cohort
| Assay | Gene | SNV | Genotype | Controls (n=500) | Cases (n=500) | Allelic frequency OR (95% CI) | |
| Sequenom | ETV7 | rs369515633 | TT | 341 | 341 | 1.04 (0.85-1.28) | 0.6918 |
| TA | 84 | 78 | |||||
| AA | 74 | 81 | |||||
| FCGR2A | rs12046367 | TT | 362 | 340 | 1.22 (0.97-1.55) | 0.0943 | |
| TA | 111 | 139 | |||||
| AA | 19 | 20 | |||||
| GATA3 | rs28395794 | GG | 436 | 449 | 0.84 (0.58-1.23) | 0.3733 | |
| GA | 59 | 48 | |||||
| AA | 2 | 3 | |||||
| IL6R | rs76419864 | CC | 500 | 498 | 2.51 (0.12-52.31) | 0.5534 | |
| CT | 0 | 2 | |||||
| TT | 0 | 0 | |||||
| IL6ST | 5:55237103 | TT | 498 | 493 | 3.52 (0.73-16.97) | 0.0948 | |
| TG | 2 | 7 | |||||
| GG | 0 | 0 | |||||
| IL6ST | 5:55260065 | CC | 499 | 492 | 8.06 (1.00-64.53) | 0.0194 | |
| CA | 1 | 8 | |||||
| AA | 0 | 0 | |||||
| IRF5 | rs201243166 | CC | 496 | 497 | 0.75 (0.17-3.36) | 0.7059 | |
| CT | 4 | 3 | |||||
| TT | 0 | 0 | |||||
| IRF8 | rs75590645 | AA | 452 | 445 | 1.198 (0.81-1.78) | 0.3674 | |
| AG | 48 | 53 | |||||
| GG | 0 | 2 | |||||
| PXK | rs199881366 | TT | 499 | 494 | 6.03 (0.72-50.18) | 0.0965 | |
| TC | 1 | 6 | |||||
| CC | 0 | 0 | |||||
| TaqMan | GATA3 | rs11567882 | CC | 439 | 446 | 0.865 (0.58-1.29) | 0.4535 |
| CT | 57 | 47 | |||||
| TT | 2 | 3 | |||||
| SNV: Single nucleotide variant; OR: Odds ratio; CI: Confidence interval; Of 22 single nucleotide variants chosen for validation, four were unsuccessful because of technical reasons and eight were monomorphic (data not shown). | |||||||