| Literature DB >> 28348872 |
Marta A Zelewska1, Madhuri Pulijala1, Russell Spencer-Smith1, Hiba-Tun-Noor A Mahmood1, Billie Norman1, Colin P Churchward1, Alan Calder1, Lori A S Snyder1.
Abstract
There are many types of repeated DNA sequences in the genomes of the species of the genus Neisseria, from homopolymeric tracts to tandem repeats of hundreds of bases. Some of these have roles in the phase-variable expression of genes. When a repeat mediates phase variation, reversible switching between tract lengths occurs, which in the species of the genus Neisseria most often causes the gene to switch between on and off states through frame shifting of the open reading frame. Changes in repeat tract lengths may also influence the strength of transcription from a promoter. For phenotypes that can be readily observed, such as expression of the surface-expressed Opa proteins or pili, verification that repeats are mediating phase variation is relatively straightforward. For other genes, particularly those where the function has not been identified, gathering evidence of repeat tract changes can be more difficult. Here we present analysis of the repetitive sequences that could mediate phase variation in the Neisseria gonorrhoeae strain NCCP11945 genome sequence and compare these results with other gonococcal genome sequences. Evidence is presented for an updated phase-variable gene repertoire in this species, including a class of phase variation that causes amino acid changes at the C-terminus of the protein, not previously described in N. gonorrhoeae.Entities:
Keywords: C-terminal variation; gonococcus; homopolymeric tract; phase variation; simple sequence repeats
Mesh:
Substances:
Year: 2016 PMID: 28348872 PMCID: PMC5320596 DOI: 10.1099/mgen.0.000078
Source DB: PubMed Journal: Microb Genom ISSN: 2057-5858
Fig. 1.Illustrations of the types of phase variation in N. gonorrhoeae. (a): Transcriptional phase variation, in which changes in a repeat tract alter the facing and spacing of the −10 and −35 promoter elements and the level of transcription of the gene. Phase variation of fetA is used as an example, where it has been shown that differences in spacing of the −10 and −35 elements due to changes in the poly-C repeat tract alter expression levels, represented by the widths of the arrows (Carson ). (b): Translational phase variation, in which changes in a repeat tract towards the 5′ end of the coding sequence alter the reading frame of a coding region and switch expression on and off due to frame-shift. Phase variation of pilC is used as an example, where it has been shown that changes in the poly-G repeat tract generate frame-shifts which switch protein expression on and off (Jonsson ). (c): C-terminal phase variation, in which changes in a repeat tract towards the 3′ end of the coding sequence alter the reading frame of a coding region and switch the encoded C-terminal amino acids between the three reading frames. In the example NGK_1211, two of the reading frames result in different C-terminal ends to the protein, while the third generates a fusion with the downstream coding sequence, NGK_1212. Only some examples of C-terminal phase variation result in this type of fusion (Table 3).
Transcriptional phase variable genes in N. gonorrhoeae
| Gene | FA1090 locus* | Repeat in FA1090* | NCCP11945 locus† | Repeat in NCCP11945† | Repeat in FA19‡ | Repeat in FA6140§ | Repeat in 35/02|| | Repeat in MS11¶ | Reference | |
|---|---|---|---|---|---|---|---|---|---|---|
| NGO_04715 | (T)11C(G)6T | NGK_0906 & NGK_0907** | (T)9C(G)6T | (T)8C(G)8T | (T)9C(G)6T | (T)9C(G)6T | (T)9C(G)6TT | Known | ||
| Lipoprotein | NGO2047 | (A)9 | NGK_2186†† | (A)8 | (A)8 | (A)9 | (A)9 | (A)8 | Yes | |
| NGO2093 | (C)13 | NGK_2557 | (C)14 | (C)10 | (C)11 | (C)11 | (C)11 | Known |
*From the N. gonorrhoeae strain FA1090 genome sequence (NC_002946.2).
†From the N. gonorrhoeae strain NCCP11945 genome sequence (CP00150.1).
‡From the N. gonorrhoeae strain FA19 genome sequence (NZ_CP012026.1).
§From the N. gonorrhoeae strain FA6140 genome sequence (NZ_CP012027.1).
||From the N. gonorrhoeae strain 35/02 genome sequence (NZ_CP012028.1).
¶From the N. gonorrhoeae strain MS11 genome sequence (NC_022240.1).
#Gene phase variation candidacy in N. gonorrhoeae. Known, phase variation has been reported in the literature. Yes, there is evidence of repeat tract variation between strains supporting phase variation.
**This coding sequenc appears to be frame-shifted and annotated as two coding sequences.
††NGK_2186 and NGO2047 annotations are on opposite strands.
C-terminal phase-variable genes in N. gonorrhoeae
| Gene | FA1090 locus* | Repeat at in FA1090* | NCCP11945 locus† | Repeat in NCCP11945† | Amino acids after the repeat in each frame of NCCP11945‡‡‡ | FA19§ | FA6140|| | 35/02 ¶ | MA11# | |||
|---|---|---|---|---|---|---|---|---|---|---|---|---|
| NGO0072 | (G)8 | NGK_0106 | (G)8 | 26 | 50 | 61 | (G)8 | (G)8 | (G)8 | (G)8 | (Yes) | |
| Menbrane protein | NGO0691 | (G)6 | NGK_1211 | (G)7 | 23 | 8 | 186^ | (G)7 | (G)6 | (G)7 | (G)7 | Yes |
| WX61_RS02820 | (C)7 | NGK_1624 | (C)9 | 46 | 2 | 37 | deletion†† | (C)10 | (C)8 | (C)10 | Yes | |
| Pilin cassette | WX61_RS02820 | CCGC | NGK_2161 | (C)8 | 20 | 8 | 91¶ | (C)5GCC | CCGCC | (C)5 | CCT(C)5 | Yes |
*From the N. gonorrhoeae strain FA1090 genome sequence (NC_002946.2).
†From the N. gonorrhoeae strain NCCP11945 genome sequence (CP00150.1).
‡In each column are the number of amino acids encoded 3′ of the repeat before the closest termination codon in each of the three reading frames.
§From the N. gonorrhoeae strain FA19 genome sequence (NZ_CP012026.1).
||From the N. gonorrhoeae strain FA6140 genome sequence (NZ_CP012027.1).
¶From the N. gonorrhoeae strain 35/02 genome sequence (NZ_CP012028.1).
#From the N. gonorrhoeae strain MS11 genome sequence (NC_022240.1).
**Gene phase variation candidacy in N. gonorrhoeae. Yes, there is evidence of repeat tract variation between strains supporting phase variation. (Yes), although there is no variation between strains investigated here, tracts of this length vary in other genes (Chen ; Shafer ; Adamczyk-Poplawska ).
††There is a 400 bp deletion in this strain encompassing the region that would contain this repeat.
Translational phase-variable genes in N. gonorrhoeae
| Gene | FA1090 locus* | Repeat in FA1090* | NCCP11945 locus† | Repeat in NCCP11945† | Repeat in FA19‡ | Repeat in FA6140§ | Repeat in 35/02|| | Repeat in MS11¶ | Reference | |
|---|---|---|---|---|---|---|---|---|---|---|
| NGO0055 | (G)13 | NGK_0074 | (G)9 | (G)10 | (G)13 | (G)9CAGG | (G)12 | Known | ||
| NGO0066a | (CTTCT)13CTTCG | NGK_0096 | (CTTCT)8CTTCC | (CTTCT)4CTTCC | CTT(CTTCT)10CTTCC | (CTTCT)6(CT TCC)2 | (CTTCT)8CTT CC | Known | ||
| NGO0070 | (CTTCT)9CTTCG | NGK_0102 | (CTTCT)7CTTCC | (CTTCT)8CTTCC | (CTTCT)17CTTCC | (CTTCT)7CTT CC | (CTTCT)7CTT CC | Known | ||
| NGO0086 | (C)10 | no repeat | no repeat | no repeat | Known | |||||
| NGO0087 | A(C)7 | A(C)9 | A(C)6 | A(C)6 | Known | |||||
| NGO0207 | (CAAACAC)4 | NGK_0339 | (CAAACAC)8 (CAAATAC)3 | (CAAACAC)6CAAATACCAAACACCAAATAC | (CAAACAC)10CAAATACCAAACACCAAATACCAAACAC(CAAATAC)2 | (CAAACAC)24 CAAATAC | (CAAACAC)15(CAAATAC)3 | Known | ||
| NGO_02155 | (G)7 | NGK_0571 | (G)7 | (G)8 | (G)9 | (G)7 | (G)7 | Known | ||
| Hypothetical | NGO0527 | (C)6A(C)9GC | NGK_1405 | (C)6A(C)8GC | (C)7A(C)4T(C )3GC | (C)7A(C)4T(C)3GC | (C)11A(C)8GC GC | (C)6A(C)14GC C | Yes | |
| NGO0545 | (CCCAA)12 | NGK_1384 | (CCCAA)11 | (CCCAA)12 | (CCCAA)4 | (CCCAA)11 | (CCCAA)7 | Known | ||
| Replication initiation factor | NGO_06135 | (C)8TTATCTAACA(G)7 | NGK_1957 | (C)11TTATCTAACA(G)8 | (C)7TTATCTAACA(G)7 | (C)6TTATCT AACA(G)5 | (C)11TTATCT AACA(G)8 | (C)10TTATCT AACA(G)7 | Yes | |
| NGO0641 | (GCCA)37 | NGK_1272 | (GCCA)18 | (GCCA)24GTCA | (GCCA)24 | (GCCA)19 | (GCCA)24 | Known | ||
| Replication initiation factor | NGO_06695 | (C)8TTATCTAACA(G)7 | NGK_1486 | (C)9TTATCTAACA(G)7 | (C)7TTATCTAACA(G)7 | (C)9TTATCT AACA(G)6 | (C)11TTATCT AACA(G)8 | (C)9TTATCTA ACA(G)7 | Yes | |
| NGO0950a | (CTTCT)16CTTCC | NGK_0847 | (CTTCT)19CTTCC | (CTTCT)13CTTCC | (CTTCT)16CTTCC | (CTTCT)8CTT CC | (CTTCT)4CTT CC | Known | ||
| Hypothetical | NGO0964 | (AAGC)4 | NGK_0831a | (AAGC)8 | (AAGC)15 | (AAGC)7 | (AAGC)9 | (AAGC)7 | Yes | |
| NGO0985 | (AAGC)3 | NGK_0804 | (AAGC)3 | (AAGC)2 | (AAGC)3 | (AAGC)3 | (AAGC)3 | Yes | ||
| NGO1040a | (CTTCT)20CTTCC | NGK_0749 | (CTTCT)20CTTCC | (CTTCT)10CTTCC | (CTTCT)7CTTCC | (CTTCT)12CT TCC | (CTTCT)13CT TCCCTTCT(C TTCC)2 | Known | ||
| NGO1073a | (CTTCT)2CTTCC | NGK_0693 | (CTTCT)10CTTCC | (CTTCT)11CTTCC | (CTTCT)12CTTCC | (CTTCT)18CT TCC | (CTTCT)7CTT CC | Known | ||
| NGO1277a | (CTTCT)11CTTCC | NGK_1495 | (CTTCT)7CTTCC | CTT(CTTCT)11CTTCC | (CTTCT)11CTTCC | (CTTCT)7CTT CC | (CTTCT)8CTT CC | Known | ||
| Adhesion | NGO1445 | (CAAG)20CAAA | NGK_1705 | (CAAG)12CAAA | (CAAG)12CAAA | (CAAG)6CAAA | (CAAG)9CAAA | (CAAG)6CAAA | Yes | |
| NGO1463a | (CTTCT)10CTTCC | NGK_1729 | (CTTCT)7CTTCC | (CTTCT)11CTTCC | (CTTCT)12CTTCC | (CTTCT)12CT TCC | (CTTCT)10CT TCC | Known | ||
| NGO1513 | (CTTCT)12CTTCG | NGK_1799 | (CTTCT)14CTTCC | CTT(CTTCT)10CTTCC | (CTTCT)6CTT CG | (CTTCT)7CTT CG | Known | |||
| NGO1553a | (CTTCT)4CTTCC | NGK_1847 | (CTTCT)9CT TCC | (CTTCT)17CTTCC | (CTTCT)8CTTCC | (CTTCT)8CTT CC | (CTTCT)14CT TCC | Known | ||
| NGO1689 | (AAGC)3 | NGK_2082 | (AAGC)3 | (AAGC)14 | (AAGC)3 | (AAGC)3 | (AAGC)3 | Known | ||
| NGO1765 | (G)11 | NGK_2516 | GGGAGCGGG | (G)19 | (G)19 | GGGAGCGGG | GGGAGCGGG | Known | ||
| Repetitive large surface lipoprotein | NGO_09875 & NGO_09870** | (G)8 | NGK_2422 & NGK_2423** | (G)7 | (G)7 | (G)7 | (G)7 | (G)7 | Yes | |
| NGO1861a | (CTTCT)13CTTCC | NGK_2410 | (CTTCT)11CTTCC | (CTTCT)13CTTCC | (CTTCT)2CTTCC | (CTTCT)13CT TCC | (CTTCT)30CT TCC | Known | ||
| NGO1912 | (G)11 | NGK_2342 | (G)13 | (G)11 | GGGC(G)11 | (G)15 | (G)11 | Known | ||
| Hypothetical | NGO1953 | (C)8 | NGK_2297 | (C)8 | (C)8 | (C)8 | (C)9 | (C)8 | Yes | |
| Pyrimidine 5′- nucleotidase | NGO2055 & NGO2054** | (C)6 | NGK_2176 | CAAACCCC | CAAACCCC | CAAACCCC | (C)9 | (C)10 | Yes | |
| NGO2060a | (CTTCT)10CTTCG | (CTTCT)6CTT CC | (CTTCT)7CTT CC | Known | ||||||
| NGK_2170 | (CTTCT)14CTTCC | Known | ||||||||
| NGO2072 | (C)11 | NGK_2534 & NGK_2533** | (C)12 | (C)10 | (C)10 | (C)10 | (C)10 | Known | ||
| NGO2110 | (G)9 | NGK_2581 | (G)10 | (G)9 | (G)9 | (G)8 | (G)8 | Known | ||
| NGO11610 | (G)11 | NGK_2630 | (G)11 | (G)14A | (G)17A | (G)20A | (G)10A | Known | ||
| NGO2156 | (G)14 | NGK_2632 | (G)13 | (G)13 | (G)10 | (G)16 | (G)8 | Known | ||
| NGO2158 | A(G)14 | NGK_2634 | A(G)16 | (G)13 | A(G)12 | A(G)18 | A(G)13 | Known |
*From the N. gonorrhoeae strain FA1090 genome sequence (NC_002946.2).
†From the N. gonorrhoeae strain NCCP11945 genome sequence (CP00150.1).
‡From the N. gonorrhoeae strain FA19 genome sequence (NZ_CP012026.1).
§From the N. gonorrhoeae strain FA6140 genome sequence (NZ_CP012027.1).
||From the N. gonorrhoeae strain 35/02 genome sequence (NZ_CP012028.1).
¶From the N. gonorrhoeae strain MS11 genome sequence (NC_022240.1).
#Gene phase variation candidacy in N. gonorrhoeae. Known , phase variation has been reported in the literature. Yes, there is evidence of repeat tract variation between strains supporting phase variation.
**This coding sequence appears to be frame-shifted and annotated as two coding sequences.
np, The coding sequence is not present in this strain.
Genes for which there is no evidence of phase variation in N. gonorrhoeae
| Gene | FA1090 locus* | Repeat in FA1090* | NCCP11945 locus† | Repeat in NCCP11945† | Repeat in FA19‡ | Repeat in FA6140§ | Repeat in 35/02|| | Repeat in MS11¶ | |
|---|---|---|---|---|---|---|---|---|---|
| Prolyl endopeptidase | NGO0026 | GGGGCGG | NGK_0034 | GGGGCGG | GGGGCGG | GGGGCGG | GGGGCGG | GGGGCGG | No. Replacement tract |
| NGO0065 | C(G)6 | NGK_0089** | C(G)6 | C(G)6 | C(G)6 | C(G)6 | C(G)6 | No. No variation. | |
| Phosphoesterase | NGO0081 | (C)7 | NGK_0 n 9 | (C)7 | (C)7 | (C)7 | (C)7 | (C)7 | No. No variation. |
| Hypothetical | NGO0121 | (A)6 | NGK_0167 | (A)6 | (A)6 | (A)6 | (A)6 | (A)6 | No. No variation. |
| NGO0123 | (C)4 | NGK_0168 | (C)4 | (C)4 | (C)4 | (C)4 | (C)4 | No. No variation. | |
| NGO0206 | AA(C)5 | NGK_0338 | AA(C)5 | AA(C)5 | AA(C)5 | AA(C)5 | AA(C)5 | No. No variation. | |
| Hypothetical | NGO0532 | AACCGGCAAACA | NGK_1400 | AACCGGCAAACA | AACCGGCAAACA | AACCGGCAAACA | AACCGGCAAACA | AACCGGCAAACA | No. Replacement tract |
| NGO0636 | CCACACCC | NGK_1278 | CCACACCC | CCACACCC | CCACACCC | CCACACCC | CCACACCC | No. Replacement tract | |
| NGO0639 | (G)7 | NGK_1275 | (G)7 | (G)7 | (G)7 | (G)7 | (G)7 | No. No variation. | |
| Methylase NlalV | NGO0676 | (A)9 | NGK_1230 | (A)9 | (A)9 | (A)9 | (A)9 | (A)9 | No. No variation. |
| NGO0743 | (C)7 | NGK_1135 | (C)7 | (C)7 | (C)7 | (C)7 | (C)7 | No. No variation. | |
| NGO0754 | GGAAGG | NGK_1123 | GGAAGG | GGAAGG | GGAAGG | GGAAGG | GGAAGG | No. Replacement tract | |
| NGO1041 | (C)7 | NGK_0745 | (C)7 | (C)7 | (C)7 | (C)7 | No. No variation. | ||
| NGO1371 | (AT)5 | NGK_1607 | (AT)5 | (AT)5 | (AT)5 | (AT)5 | (AT)5 | No. No variation. | |
| Hypothetical | NGO1384 | G(A)7 | NGK_1622 | (A)8 | (A)8 | (A)8 | (A)8 | G(A)7 | No. No variation in length. |
| NGO1470 | CCCTGCTGG | NGK_1735 | CCCTGCTGG | CCCTGCTGG | CCCTGCTGG | CCCTGCTGG | CCCTGCTGG | No. Replacement tract | |
| NGO1501 | TTCGCCC | NGK_1783 | TTCGCCC | TTCGCCC | TTCGCCC | TTCGCCC | TTCGCCC | No. Replacement tract | |
| NGO1540 | TGTGGGGG | NGK_1830 | TGTGGGGG | TGTGGGGG | TGTGGGGG | TGTGGGGG | TGTGGGGG | No. Replacement tract | |
| NGO1583 | (C)7 | NGK_1884 | (C)7 | (C)7 | (C)7 | (C)7 | (C)7 | No. No variation. | |
| NGO1708 | (C)4T CC | NGK_2106 | (C)4TCC | (C)4TCC | (C)4TCC | (C)4TCC | (C)4TCC | No. Replacement tract | |
| NGO1855 | (C)7 | NGK_2416 | (C)7 | (C)7 | (C)7 | (C)7 | (C)7 | No. No variation. | |
| Hypothetical | NGO1970 | (TA)5 | NGK_2274 | (TA)5 | (TA)5 | (TA)5 | (TA)5 | (TA)5 | No. No variation. |
| NGO1972 | (G)5 | NGK_2270 | (G)5 | (G)5 | (G)5 | (G)5 | (G)5 | No. No variation. | |
| NGO1983 | (C)6 | NGK_2258 | (C)6 | (C)6 | (C)6 | (C)6 | (C)6 | No. No variation. | |
| NGO2171 | (TTCC)3 | NGK_2652 | (TTCC)3 | (TTCC)3 | (TTCC)3 | (TTCC)3 | (TTCC)3 | No. No variation. | |
| NGO0260a | NGK_0401 | GGGGGCGG | GGGGGCGG | TGAAACGG | No. Replacement tract |
*From the N. gonorrhoeae strain FA1090 genome sequence (NC_002946.2).
†From the N. gonorrhoeae strain NCCP11945 genome sequence (CP00150.1).
‡From the N. gonorrhoeae strain FA19 genome sequence (NZ_CP012026.1).
§From the N. gonorrhoeae strain FA6140 genome sequence (NZ_CP012027.1).
||From the N. gonorrhoeae strain 35/02 genome sequence (NZ_CP012028.1).
¶From the N. gonorrhoeae strain MS11 genome sequence (NC_022240.1).
#Gene phase variation candidacy in N. gonorrhoeae. No. Replacement tract: due to the replacement of the repeat tract with other nucleotides, this is not phase-variable. No. No variation: due to no observed variation in the repeat tract, this is not phase-variable. No. No variation in length: due to the equal length tract in all strains, this is not phase-variable.
**This coding sequence contains a point mutation, which generates a premature termination codon.
nr, The region of the coding sequence containing the repeat tract does not have homology to the aligned region in these strains.
Genes for which there is sequencing-based evidence of phase variation in N. gonorrhoeae strain NCCP11945
| Gene | FA1090 locus* | Repeat in FA1090* | NCCP11945 locus† | Repeat in NCCP11945† | Repeat in FA19‡ | Repeat in FA6140§ | Repeat in 35/02|| | Repeat in MS11¶ | Ion Torrent* * | Illumina†† | |
|---|---|---|---|---|---|---|---|---|---|---|---|
| NGO0072 | (G)8 | NGK_0106 | (G)8 | (G)8 | (G)8 | (G)8 | (G)8 | (yes) | Repeat varies | Repeat does not vary | |
| NGO0985 | (AAGC)3 | NGK_0804 | (AAGC)3 | (AAGC)2 | (AAGC)3 | (AAGC)3 | (AAGC)3 | Yes | Repeat does not vary | Repeat does not vary | |
| Hypothetical | NGO0964 | (AAGC)4 | NGK_0831a | (AAGC)8 | (AAGC)15 | (AAGC)7 | (AAGC)9 | (AAGC)7 | Yes | Repeat varies | Repeat varies |
| Membrane protein | NGO0691 | (G)6 | NGK_1211 | (G)7 | (G)7 | (G)6 | (G)7 | (G)7 | Yes | Repeat varies | Repeat does not vary |
| Hypothetical | NGO0527 | (C)6A(C)9GC | NGK_1405 | (C)6A(C)8GC | (C)7A(C)4T(C)3GC | (C)7A(C)4T(C)3GC | (C)11A(C)8GCGC | (C)6A(C)14GCC | Yes | Repeat varies | Repeat does not vary |
| Replication initiation factor | NGO_06695 | (C)8TTATCTAACA(G)7 | NGK1486 | (C)9TTATCTAACA(G)7 | (C)7TTATCTAACA(G)7 | (C)9TTATCTAACA(G)6 | (C)11TT ATCTAACA(G)8 | (C)9T T AT CT AACA(G)7 | Yes | Repeat varies | Repeat varies |
| NGO1386 | (C)7 | NGK_1624 | (C)9 | {deletion} | (C)10 | (C)8 | (C)10 | Yes | Repeat varies | Repeat varies | |
| Adhesion | NGO1445 | (CAAG)20CAAA | NGK_1705 | (CAAG)12CAAA | (CAAG)12CAAA | (CAAG)6CAAA | (CAAG)9CAAA | (CAAG)6CAA A | Yes | Repeat varies | Repeat varies |
| Replication initiation factor | NGO_06135 | (C)8TTATCTAACA(G)7 | NGK_1957 | (C)11TTATCTAACA(G)8 | (C)7TTATCTAACA(G)7 | (C)6TTATCTAACA(G)5 | (C)11TTATCTAACA(G)8 | (C)10T TAT C TAACA(G)7 | Yes | Repeat varies | Repeat varies |
| Pilin cassette | NGO_11140 | CCGC | NGK_2161 | (C)8 | (C)5GCC | CCGCC | (C)5 | CCT (C)5 | Yes | Repeat varies | Repeat varies |
| Pyrimidine 5′-nucleotidase | NGO2055 & NGO2054II | (C)6 | NGK_2176 | CAAACCCC | CAAACCCC | CAAACCCC | (C)9 | (C)10 | Yes | No repeat | No repeat |
| Lipoprotein | NGO2047 | (A)9 | NGK_2186§§ | (A)8 | (A)8 | (A)9 | (A)9 | (A)8 | Yes | Repeat varies | Repeat varies |
| Hypothetical | NGO1953 | (C)8 | NGK_2297 | (C)8 | (C)8 | (C)8 | (C)9 | (C)8 | Yes | Repeat varies | Repeat does not vary |
| Repetitive large surface lipoprotein | NGO_09875 & NGO_09870II | (G)8 | NGK_2422 & NGK_2423|| | (G)7 | (G)7 | (G)7 | (G)7 | (G)7 | Yes | Repeat varies | Repeat does not vary |
| NGO2093 | (C)13 | NGK_2557 | (C)14 | (C)10 | (C)11 | (C)11 | (C)11 | Known | Repeat varies | Repeat varies | |
| NGO1912 | (G)11 | NGK_2342 | (G)13 | (G)11 | GGGC(G)11 | (G)15 | (G)11 | Known | Repeat varies | Repeat varies | |
| NGO0950a | (CTTCT)16CTTCC | NGK_0847 | (CTTCT)19CTTCC | (CTTCT)13CTTCC | (CTTCT)16CTTCC | (CTTCT)8CTTCC | (CT T CT )4CT TCC | Known | No reads through repeat | Repeat varies |
*From the N. gonorrhoeae strain FA1090 genome sequence (NC_002946.2).
†From the N. gonorrhoeae strain NCCP11945 genome sequence (CP00150.1).
‡From the N. gonorrhoeae strain FA19 genome sequence (NZ_CP012026.1).
§From the N. gonorrhoeae strain FA6140 genome sequence (NZ_CP012027.1).
||From the N. gonorrhoeae strain 35/02 genome sequence (NZ_CP012028.1).
¶From the N. gonorrhoeae strain MS11 genome sequence (NC_022240.1).
#Gene phase variation candidacy in N. gonorrhoeae. Known, phase variation has been reported in the literature. Yes, there is evidence of repeat tract variation between strains supporting phase variation. (Yes), although there is no variation between strains investigated here, tracts of this length vary on other genes (Chen ; Shafer ; Adamczyk-Poplawska ).
**Based on Ion Torrent sequencing data from cultures grown with passage for 8 weeks and from cultures grown with passage for 20 weeks (accession numbers SRR3547950, SRR3547951, SRR3547952, SRR3547953).
††Based on Illumina sequencing data from cultures grown with passage for 20 weeks (accession numbers SRR3547954, SRR3547955, SRR3547956, SRR3547957).
‡‡There is a 400 bp deletion in this strain encompassing the region that would contain this repeat.
§§NGK_2186 and NGO2047 annotations are on opposite strands.
||||This coding sequence appears to be frame-shifted and annotated as two coding sequences.