| Literature DB >> 32665571 |
Darya Belikova1, Angelika Jochim1, Jeffrey Power1, Matthew T G Holden2, Simon Heilbronner3,4,5.
Abstract
Gene tandem amplifications are thought to drive bacterial evolution, but they are transient in the absence of selection, making their investigation challenging. Here, we analyze genomic sequences of Staphylococcus aureus USA300 isolates from the same geographical area to identify variations in gene copy number, which we confirm by long-read sequencing. We find several hotspots of variation, including the csa1 cluster encoding lipoproteins known to be immunogenic. We also show that the csa1 locus expands and contracts during bacterial growth in vitro and during systemic infection of mice, and recombination creates rapid heterogeneity in initially clonal cultures. Furthermore, csa1 copy number variants differ in their immunostimulatory capacity, revealing a mechanism by which gene copy number variation can modulate the host immune response.Entities:
Mesh:
Year: 2020 PMID: 32665571 PMCID: PMC7360770 DOI: 10.1038/s41467-020-17277-3
Source DB: PubMed Journal: Nat Commun ISSN: 2041-1723 Impact factor: 14.919
Fig. 1csa1 gene copy number variation within the csa1 locus of S. aureus USA300 isolates.
a Schematic diagram of the three categories of repetitive sequences creating copy number variation. Hatched areas denote regions of homology. b Schematic representation of the deletions and putative amplifications of csa1 in different isolates. Coding sequences are indicated. Red and gray lines represent increased coverage (>2 fold) or deletions within the USFL isolates, respectively. c The upper panel shows the csa1 copy number of the indicated clinical isolates as measured by qPCR. Mean and SD of four replicate qPCRs on a single DNA isolation is shown. The lower panel shows DNA fragments amplified by conventional PCR using primers csa1_Sc.F and csa1_Sc.R indicated in (b). The experiment was performed once using the same batch of DNA used for qPCR. Source data are provided as a Source Data file. d USFL isolates were sequenced using MinION technology. For each sequenced isolate, a single long read that is aligned against the csa1 locus of USA300 FPR3757 is shown. Tandem amplifications manifest as multiple regions within the read with homology to csa1, allowing calculation of gene copy number. Hatched red lines emphasize that a single long read mapping repeatedly to the csa1 genes is analyzed. Genes csa1A, csa1B, csa1C, and csa1D are indicated. Shown are read_ch:51530_110 (USFL037); read_ch:10493_161 (USFL091); read_ch:40122_41 (USFL165). Source data are provided as a Source Data file.
Fig. 2Copy number diversification after in vitro evolution.
a USA300 csa1::tetK was grown over three consecutive days in six parallel cultures in the presence or absence of ciprofloxacin (CIP). Each day, the copy number of up to 16 clones of each culture showing high Tc resistance was screened by qPCR. Upper and lower box limits and the horizontal lines within the boxes represent 25 and 75% percentiles and the medians, respectively. The whiskers of the plots indicate minimum and maximum range. All data points are shown. Data are derived from six independent experiments and represent: Day 1, n = 96 and n = 73 of “no Cip” and “1 µg/ml Cip”, respectively; Day 2, n = 93 and n = 96 of “no Cip” and “1 µg/ml Cip”, respectively; Day 3, n = 93 and n = 96 of “no Cip” and “1 µg/ml Cip”, respectively. Datasets were not normal distributed (D’Agostino & Pearson omnibus test <0.0001) and statistical analysis was performed using two-tailed Mann–Whitney test. Source data are provided as a Source Data file. b Growth curves of USA300 csa1::tetK in TSB containing various concentrations of the antibiotic ciprofloxacin (CIP), which is known to enhance RecA expression. As expected, at increasing inhibitory concentrations of CIP, the growth rate of USA300 csa1::tetK decreased as assessed by measuring OD600 every 2 h. This was accompanied by an increase in RecA protein levels as assessed in mid-exponential cells by Western blot analysis using LI-COR infrared technology. A representative blot is shown wherein the lower and upper bands represent RecA and an unspecific signal, respectively. Mean and SEM of three independent experiments are shown. Source data are provided as a Source Data file. c Frequency of amplification (Tc20-resistant clones showing at least a 2 fold increase in csa1 copy number compared to the parental strain by qPCR) within the total population of living cells. Shown are the results of six independent experiments. x indicates a culture in which amplification was not detected. Statistical analysis was performed using two-tailed Mann–Whitney test. Source data are provided as a Source Data file. d A USA300 csa1::tetK high copy number variant (harboring ~50 copies of csa1) was grown over two consecutive days in three parallel cultures in the presence or absence of CIP. Each day, the copy number of the strains of 17 randomly chosen clones was screened by qPCR (n = 51). Shown is a violin plot, length of the box indicates minimum and maximum range. Width of the bar indicates accumulation of data points. Horizontal lines within the boxes represent 25 and 75% percentiles and the medians, respectively. Statistical analysis was performed using two-tailed Mann–Whitney test. Source data are provided as a Source Data file.
Fig. 3Tandem amplification of csa1::tetK.
a csa1 copy number of four independent, high Tc-resistant strains measured by qPCR. Mean and SD of three replicate qPCRs on a single DNA isolation is shown. Source data are provided as a Source Data file. b MinION sequence analysis of high Tc-resistant isolates. For each sequenced isolate, single reads covering the csa1 array were aligned against the csa1::tetK locus of the parental strain. Tandem amplifications manifest as multiple regions within the read with homology to csa1::tetK. Connecting lines within the read are omitted for reasons of clarity. Genes csa1A, csa1B, tetK, csa1C, and csa1D are indicated. Shown are read_ch:51157_511 (C6), read_ch:1460_143 (E28), read_ch:74034_155 (L38), and read_ch: 64813_197 (III37). Source data are provided as a Source Data file.
Fig. 4Tandem amplification of sdrD::tetK.
a USA300 sdrD::tetK was grown over three consecutive days in six parallel cultures in the presence or absence of ciprofloxacin (CIP). Each day, the copy number of up to 16 clones of each culture showing high Tc resistance was screened by qPCR. Upper and lower box limits and the horizontal lines within the boxes represent 25 and 75% percentiles and the medians, respectively. The whiskers of the plots indicate minimum and maximum range. Data are derived from six independent experiments and represent: Day 1, n = 87 and n = 76 of “no Cip” and “1 µg/ml Cip”, respectively; Day 2, n = 90 and n = 95 of “no Cip” and “1 µg/ml Cip”, respectively; Day 3, n = 78 and n = 96 of “no Cip” and “1 µg/ml Cip”, respectively. All data points are shown. Datasets were not normal distributed (D’Agostino & Pearson omnibus test <0.0001) and statistical analysis was performed using two-tailed Mann–Whitney test. Source data are provided as a Source Data file. b Frequency of amplification (Tc20-resistant clones showing at least a 2 fold increase in csa1 copy number compared to the parental strain by qPCR) within the total population of living cells. Shown are the results of six independent experiments. x indicates a culture in which amplification was not detected. Source data are provided as a Source Data file. c sdrD copy number of a high Tc-resistant strain in comparison to the parental strain measured by qPCR. Mean and SD of three replicate qPCRs on a single DNA isolation is shown. Source data are provided as a Source Data file. d MinION sequence analysis of the high Tc-resistant isolate. Single reads covering the sdrCDE array were aligned against the sdrD::tetK locus of the parental strain. Tandem amplifications manifest as multiple regions within the read with homology to sdrD::tetK. Connecting lines within the read are omitted for reasons of clarity. Genes sdrC, tetK, sdrD and sdrE are indicated. Source data are provided as a Source Data file.
Fig. 5Influence of copy number variation on protein expression levels.
a csa1 copy number variants were grown to stationary phase, cell membrane fractions were isolated and analyzed by SDS-page and Western blot. Csa1 proteins were detected using α-Csa1 mouse serum, followed by goat α-mouse secondary antibody coupled to IRDye800 (LI-COR). Protein amount within bands was quantified using LI-COR infrared technology. The fluorescent signal for the WT strain harboring four copies of csa1 was set to one in each experiment and the intensity of the copy number variants was expressed in relation to this value. Shown is a representative blot as well as the mean and SEM of three independent experiments. Statistical analysis was performed using one-way ANOVA (F = 9,434; DF = 26) followed by Bonferroni’s multiple comparisons test. Source data are provided as a Source Data file. b sdrD copy number variants were grown to stationary phase, cell wall-fractions were isolated and analyzed by SDS-page and Western blot analysis. SdrD was detected using α-SdrD rabbit serum, followed by goat α-rabbit secondary antibody coupled to IRDye800 (LI-COR). Protein amount within bands was quantified using LI-COR infrared technology. The fluorescent signal for the WT strain harboring a single copy of sdrD was set to 1 in each experiment and the intensity of the copy number variants was expressed in relation to this value. Shown is a representative blot as well as the mean and SEM of three independent experiments. Statistical analysis was performed using one-way ANOVA (F = 45,49; DF = 8) followed by Bonferroni’s multiple comparisons test. Source data are provided as a Source Data file.
Fig. 6Effects of csa1 amplification on the immunostimulatory capacity of strains.
a Confluent lawns of HEK-hTLR2 cells were stimulated for 18 h with 0.5% culture filtrates of csa1 copy number variants grown to stationary phase. IL-8 protein levels within the supernatants were quantified by ELISA (R&D Systems). IL-8 amounts are displayed as bars referring to the left Y-axis, data derived from a single stimulation experiment are shown. For each lineage, two parallel bacterial cultures (labeled with A and B) were used for stimulation and the csa1 copy number of each independent culture determined by qPCR (displayed as filled circle referring to the right Y-axis). Source data are provided as a Source Data file. b IL-8 protein levels shown in (a) are expressed in function of the csa1 copy number of the stimulating strain. Sample sizes were the following: 0 - n = 2; 1–30 - n = 7; 91–121 - n = 10; 121–151 - n = 3; 181–210 - n = 2). Mean and SEM is shown, statistical analysis was performed using one-way ANOVA (F = 7,710; DF=23) followed by Bonferroni’s multiple comparison test. Source data are provided as a Source Data file. c 5 × 105 HL60 cells were stimulated for 5 h with 1.5% culture filtrates of csa1 copy number variants (∆lpl1lpl2lpl3 background) grown to stationary phase. IL-8 protein levels within the supernatants were quantified using ELISA (R&D Systems). Data represent three independent supernatants of each copy number variant used in three independent stimulations (n = 9 in each group). Mean and SEM are shown. Statistical analysis was performed using one-way ANOVA (F = 3,632; DF = 38) followed by Bonferroni’s multiple comparison test. Source data are provided as a Source Data file. d Polymorphonuclear cells (PMNs) were isolated from fresh blood of healthy human volunteers. 5 × 105 PMNs were stimulated for 5 h with 1.5% culture filtrates of csa1 copy number variants (∆lplΔlpp3∆lpp4 background) grown to stationary phase. IL-8 protein levels within the supernatants were quantified using by ELISA (R&D Systems). Data represent three independent supernatants of each copy number variant used on PMNs of a single donor. Mean and SEM are shown. Statistical analysis was performed using one-way ANOVA (F = 15,74; DF = 11) followed by Bonferroni’s multiple comparison test. The experiment was repeated thrice with PMNs from different donors with similar results. Source data are provided as a Source Data file.
Fig. 7Copy number diversification in vivo.
a–c Six-week old female C57BL/6 mice were challenged with S. aureus live bacteria that carried either a functional csa1::tetK locus (a) or an inactivated csa1(FS)::tetK locus (b, c). Mice were sacrificed 24 h or 48 h post infection as indicated. High Tc resistant clones arising from the input culture and from mouse kidneys were enumerated and csa1 (a) or csa1(FS) (b, c) gene copy number was determined. Horizontal lines show the median. Up to 19 arising clones were screened. No Tc20 resistant clones were recovered from mouse 4 and mouse 2 in (b) and (c), respectively. Source data are provided as a Source Data file. d Frequency of amplification in mouse organs. qPCR analysis was used to calculate the frequency of amplification within each mouse (number of Tc20-resistant clones that showed at least a 2 fold increase in csa1 or csa1(FS) copy number compared to the parental strain within the total population of living cells recovered from each mouse) (dark-filled dots). x indicates a mouse in which amplification was not detected. Six mice were used in each group. Source data are provided as a Source Data file. e C57BL/6 mice were infected either with a low copy number variant (~4 copies) or a high copy number variant (~100 copies) of csa1 genes. Mice were sacrificed 72 h post infection and CFUs within the kidneys and spleen were enumerated. Horizontal lines show the median. Statistical analysis was performed using two-tailed Mann–Whitney test, but no significant differences were found. Six mice were used in each group. Source data are provided as a Source Data file.
Bacterial strains used in this study.
| Strains | Genotype | Source |
|---|---|---|
| Wild type | [ | |
| This study | ||
| This study | ||
| Clean deletion of the | This study | |
| Transposon insertion mutant derived from the Nebraska mutant library ID NE1289 | ||
| Clean deletion of all 4 | This study | |
| Clean deletion of 3 | This study | |
| USFL037 | Clinical MRSA isolate | [ |
| USFL085 | Clinical MRSA isolate | [ |
| USFL086 | Clinical MRSA isolate | [ |
| USFL091 | Clinical MRSA isolate | [ |
| USFL118 | Clinical MRSA isolate | [ |
| USFL162 | Clinical MRSA isolate | [ |
| USFL165 | Clinical MRSA isolate | [ |
| USFL190 | Clinical MRSA isolate | [ |
| USFL202 | Clinical MRSA isolate | [ |
| USFL225 | Clinical MRSA isolate | [ |
| USFL234 | Clinical MRSA isolate | [ |
| USFL275 | Clinical MRSA isolate | [ |
| USFL308 | Clinical MRSA isolate | [ |
| USFL311 | Clinical MRSA isolate | [ |
Plasmids used in this study.
| Plasmids | Description | Source |
|---|---|---|
| pT181 | Staphylococcal plasmid encoding | [ |
| pIMAY | Thermosensitive vector for allelic exchange | [ |
| pIMAY: | Fragment for insertion of | This study |
| pIMAY: | Fragment for intergration of the inactivated | This study |
| pIMAY:∆ | Fragment for deletion of the | This study |
| pIMAY:∆ | Fragment for deletion of the | This study |
| pIMAY:∆ | Fragment for deletion of the | This study |
| pIMAY:∆ | Fragment for deletion of the | This study |
Oligonucleotides used in this study.
| Name | 5′–3′ sequence | purpose |
|---|---|---|
| csa1_Sc.F | GATATTAAGACGAGTATGAAAATAGTTAG | Screening for length variation in the |
| csa1_Sc.R | ATTTTACAGCAACATATTTGAATTTC | Screening for length variation in the |
| qCsa1_F | TCCAGAGGTGCCGAGTTATT | qPCR of the |
| qCsa1_F | TTTATATCCAACTGATGAGCCTTTT | qPCR of the |
| qOri_F | TCGTGATAACGAAGGTGAAGC | qPCR origin of replication. |
| qOri_R | GGTGGTCGATCACTCGAAAT | qPCR origin of replication. |
| qSdrD_F | GCGACAACTTCAGCAAGTGA | qPCR |
| qSdrD_R | TGGTGAAGCTTGCTCATCTG | qPCR |
| sdr_Sc.F | GAGCAATGTTATTAATTAAAATAAGATG | Screening for length variation in the |
| sdr_Sc.R | GAATAAGGATTCCATTTAACATATACAC | Screening for length variation in the |
| 0293_Sc.F | GGAAATAAGTGTAGAGAATAAATTAATAG | Screening for length variation in the SAUSA300_0293…0296 locus. |
| 0296_Sc.R | TATTATTATTTGATGACAACTTTATGG | Screening for length variation in the SAUSA300_0293…0296 locus. |
| csa1KO_A | AGGGAACAAAAGCTGGGTACCACTATGATAAAAAAGTTGAAG | Construction of |
| csa1KO_B | CATCTTACAACTCTCTTCTTTTTAAAATG | Construction of |
| csa1KO_C | GAAGAGAGTTGTAAGATGTAATCATCCACACACACGATTC | Construction of |
| csa1KO_D | ATAGGGCGAATTGGAGCTCCATAAGCAACTGAATCACAAG | Construction of |
| csa1_ScF | CATTTGAAACGAAAATTAATAATGG | Screening for |
| csa1_ScR | CTTTGGTTCGAATGATATGTACGC | Screening for |
| lplKO_A | CACTAAAGGGAACAAAAGCTGGGTACCCAATATAACTTAATTCATGTTCTAAG | Construction of |
| lplKO_B | CATATAAATAATTAATTATTTTGTATATTTCTC | Construction of |
| lp1KO_C | CAAAATAATTAATTATTTATATGTAGGAAGTATAAAATAGATTTAAAAG | Construction of |
| lplKO_D | ACTCACTATAGGGCGAATTGGAGCTCTAGAACTACCGCATCTCTTCCACCTA | Construction of |
| lpl_ScF | AAGTATGATCTTAAGTTGTCTTTTGTAGC | Screening for |
| lpl_ScR | TAGAAATAGGAGCTGGATTATAAACC | Screening for |
| lpp3KO_A | CACTAAAGGGAACAAAAGCTGGGTACCATATTGATGCTATTTCAATTGCAGG | Construction of |
| lpp3KO_B | CATACATTCCCACCGTTTCTCAAAATAC | Construction of |
| lpp3KO_C | GTATTTTGAGAAACGGTGGGAATGTATGTAATACTTATGCTGTAATTATAGAAAC | Construction of |
| lpp3KO_D | CGACTCACTATAGGGCGAATTGGAGCTCGAAGTTAGTGCACATATTGAAGATTTAAG | Construction of |
| lpp3_ScF | TATGTATTTGTAACGCCTATGTGGAACC | Screening for |
| lpp3_ScR | CGATGGATGCATGACAAATATTGGG | Screening for |
| lpp4KO_A | CACTAAAGGGAACAAAAGCTGGGTACCGGAATTAAAATGTATATTTTTGTACAG | Construction of |
| lpp4KO_B | CATTTCACATCCCCATTTTTATTTTTG | Construction of |
| lpp4KO_C | AAATAAAAATGGGGATGTGAAATGTGAATATCAAATAAAACCTGGTAATA | Construction of |
| lpp4KO_D | CGACTCACTATAGGGCGAATTGGAGCTCTACAAGTCTAATATTACATGAATTTCC | Construction of |
| lpp4_ScF | TAAAATGGTTTACTAAATCTAATAGAAC | Screening for |
| lpp4_ScR | TTATATAAACTCTCTCGTCTCTCTCTA | Screening for |
| Sdr_tetK_A | ACTCGAGCTCAACCAATGAGTACGG | Insertion of |
| Sdr_tetK_B | ATCGTAAAACGGGATCCAACATTTGTGT | Insertion of |
| Sdr_tetK_C | GGATCCCGTTTTACGATAAAGAAAAATAATTAAAGTATTG | Insertion of |
| Sdr_tetK_D | CATTTGTCGACGTTTCATTACCTTGAGA | Insertion of |
| Sdr_tetK_Scr.F | GATAGCGATTCAGATTCAGATGCAG | Screening of |
| Sdr_tetK_Scr.R | CAACTTTATTTCCAGTGGTAGATTGTACAC | Screening of |
| tetK_F | GTCAACGGGGTTTTCAATGGGGAAAGCTTCACAGAA | Amplificaton of |
| tetk_R | CATAACACTAACAAAACATCGCTGTTAAAGCTTTTTTATTAC | Amplificaton of |