| Literature DB >> 27466808 |
Lea Lango-Scholey1, Jack Aidley2, Alexandra Woodacre2, Michael A Jones1, Christopher D Bayliss2.
Abstract
Mutations in simple sequence repeat tracts are a major mechanism of phase variation in several bacterial species including Campylobacter jejuni. Changes in repeat number of tracts located within the reading frame can produce a high frequency of reversible switches in gene expression between ON and OFF states. The genome of C. jejuni strain NCTC11168 contains 29 loci with polyG/polyC tracts of seven or more repeats. This protocol outlines a method-the 28-locus-CJ11168 PV-analysis assay-for rapidly determining ON/OFF states of 28 of these phase-variable loci in a large number of individual colonies from C. jejuni strain NCTC11168. The method combines a series of multiplex PCR assays with a fragment analysis assay and automated extraction of fragment length, repeat number and expression state. This high throughput, multiplex assay has utility for detecting shifts in phase variation states within and between populations over time and for exploring the effects of phase variation on adaptation to differing selective pressures. Application of this method to analysis of the 28 polyG/polyC tracts in 90 C. jejuni colonies detected a 2.5-fold increase in slippage products as tracts lengthened from G8 to G11 but no difference between tracts of similar length indicating that flanking sequence does not influence slippage rates. Comparison of this observed slippage to previously measured mutation rates for G8 and G11 tracts in C. jejuni indicates that PCR amplification of a DNA sample will over-estimate phase variation frequencies by 20-35-fold. An important output of the 28-locus-CJ11168 PV-analysis assay is combinatorial expression states that cannot be determined by other methods. This method can be adapted to analysis of phase variation in other C. jejuni strains and in a diverse range of bacterial species.Entities:
Mesh:
Year: 2016 PMID: 27466808 PMCID: PMC4965091 DOI: 10.1371/journal.pone.0159634
Source DB: PubMed Journal: PLoS One ISSN: 1932-6203 Impact factor: 3.240
Phase-variable genes and repeat tract specific PCR primers.
| Gene | Gene size (bp) | Location of Repeat Tract | In-frame Repeat Number | Primer name | Sequence (5' → 3') |
|---|---|---|---|---|---|
| 3,732 | +2,572 | 9G | cj0031-fwd-FAM | ||
| 723 | +709 | 11G | cj0045-fwd-FAM | ||
| 1,407 | +610 | (11G) | cj0046-fwd-NED | ||
| 744 | +243 | 8G | cj0171-fwd-NED | ||
| 1,251 | +694 | 8G | cj0275-fwd-NED | ||
| 729 | -58 | (10G) | cj0565-fwd | ||
| 609 | +559 | 10G | cj0617-fwd-VIC | ||
| 3,432 | +501 | 11G/10G | CapA-fwd-FAM | ||
| 1,656 | +854 | 10G | cj0676-fwd-NED | ||
| 1,350 | +877 | 9C | cj0685-fwd-FAM | ||
| 909 | +330 | 8G | cj1139-fwd-VIC | ||
| 843 | +294 | 10G | cj1144-fwd-NED | ||
| 1,305 | +143 | 9G | cj1295-fwd-NED | ||
| 792 | +309 | 10G | cj1296-fwd-NED | ||
| 1,215 | +579 | 9G | cj1305-fwd-VIC | ||
| 1,224 | +579 | 9G | cj1306-fwd-NED | ||
| 1,212 | +579 | 9G | cj1310-fwd-FAM | ||
| 1,947 | +167 | 11G | cj1318-fwd-FAM | ||
| 540 | -37 | (10G) | cj1321-fwd-VIC | ||
| 672 | +252 | 9G | cj1326-fwd-FAM | ||
| 1,944 | +168 | 11G | cj1335-fwd-NED | ||
| 1,239 | +560 | 9G | cj1342-fwd-FAM | ||
| 771 | +393 | 9G | cj1420-fwd-FAM | ||
| 1,836 | +87 | 9G | cj1421-22-fwd | ||
| 1,875 | +87 | 9G | cj1421-22-fwd | ||
| 849 | +294 | 10G | cj1426-fwd-FAM | ||
| 924 | +291 | 10G | cj1429-fwd-FAM | ||
| 1,101 | +876 | 9G | cj1437-fwd-VIC |
1Values in brackets indicate the arbitrary number of repeats coded as ON because the repeat tract is intergenic or within a pseudogene.
2G11 would allow the translation to extend by another 15 amino acids, overlapping the start of cj0044c; G9 and G10 (consensus) have little effect.
3capA fragment varies between 11168 strains due to variation in the polyT tract immediately upstream of polyG tract (6T in the lab strain and the hypermotile 11168H strain, 5T in the chicken-adapted 11168ca strain).
Multiplex PCR primer mixes.
| Primer mix | Locus | Dye colour | PCR product size |
|---|---|---|---|
| Blue | 457 | ||
| Blue | 332 | ||
| Yellow | 215 | ||
| Yellow | 282 | ||
| Green | 357 | ||
| Yellow | 353 |
1Dye colour as annotated in output files from PeakScanner™ although yellow is displayed as black in this program.
2PCR product size, as determined from the C. jejuni NCTC11168 genome sequence (7).
PCR amplification and A-tail reactions.
| Step | Reaction mix for GoTaq (Promega) | Reaction mix for KAPA | Reaction in PCR cycler |
|---|---|---|---|
| 5x Buffer—2 μl | 10x Buffer A—1 μl | 94°C—5 min | |
| Transfer to a new 96-well plate: |
Protocol for amplification of 28 PV sites by multiplex PCR in 96-well plates, including the A-tailing step before mixing of samples with Size Standard for GeneScan analysis.
Fig 1Analysis of multiplex fragment analysis data in PeakScanner.
(A) PeakScanner application layout. Labels 1–6 refer to: 1, Sample Name, well location in the 96-well plate; 2, Size Standard, name of size standard as selected from the drop-down menu; 3, Analysis Method, analysis method name, e.g. PP—Primers Present, assumes primers have not been removed from the sample; 4, Size, length of PCR fragment for a specific peak in base pairs; 5, Height, height of each peak indicating fluorescent dye signal strength; 6, Dye Legend, displayed dye colours and relative scale. (B) Enlarged section of Panel A showing how similarly-sized PCR products can be distinguished using different dye colours and the occurrence of flanking peaks around a major peak.
Output files for PSAnalyse.
| Output file name | Format | Information |
|---|---|---|
| Input_name.details.csv | CSV | Detailed output that extracts and displays relevant data from PeakScanner (Sample name, Size, Height, Area, Size of Prev, Area of Prev, Size of Post, Area of Post) and also calculates and displays ‘Ratio to Prev’, ‘Ratio to Post’, ‘Tract Length’ and ‘Score’– 0 or 1 –based on tract length. |
| Input_name.tracts.csv | CSV | Contains tract length data for the 28 PV sites for each sample in a 96-well plate as a single table. ‘N’ is used for missing datapoints and datapoints that are identified as uncertain have the suffix ‘?’. |
| Input_name.scores.csv | CSV | Contains ON/OFF scores (1 or 0 respectively) only for all 28 PV sites for each sample in a 96-well plate as a single table. ‘N’ is used for missing datapoints and datapoints that are identified as uncertain have the suffix ‘?’. |
| Input_name.out.html | Html | Graphical visualisation of Output results for all 28 PV sites in each well (sample); allows easy identification of problematic results as these are coloured in red. |
Fig 2Comparison of two analysis methods for determining the percentage of cells in a population with a gene in an ON state.
The percentage of ON variants in a population was determined by PCR-based fragment analysis of either the relative proportions of peaks obtained using a total DNA extract of the population (x-axis) or from analysis of up to 30 single colonies obtained from serial dilutions of a population (y-axis). The analysis was performed on 16 populations for 28 phase-variable loci. Each circle represents one of the 448 measurements. Black line, line for a 100% correlation between each method. Blue line, linear regression line with 95% confidence interval indicated by shaded area.
Fig 3The effect of repeat number on PCR slippage as detected by fragment analysis of individual colonies with an ON number of repeats.
Fragment length analysis was performed for 11 repeat tracts on a set of single colonies (n = 90). The relative area under each peak (i.e. the flanking peaks and the major peak) was calculated by dividing the area under the peak by the total area under all peaks for that specific locus and sample. These values were multiplied by 100 to obtain percentage values for the relative area under each peak. (A) Dotplot of individual values for the major peak of each locus separated by tract length. The number of loci analysed for each tract length were as follows:- G8, 257; G9, 214; G10, 431; G11, 77; G12, 9. (B) Average values for each locus. Peaks for each locus were separated by the numbers of repeats in the polyG tract of the major peak with -1 and +1 indicating flanking peaks having one less or one more G residue. Horizontal line, average for relative area under the peak; n, number of colonies.