| Literature DB >> 23496945 |
Mark N Ragheb1, Christopher B Ford, Michael R Chase, Philana Ling Lin, Joanne L Flynn, Sarah M Fortune.
Abstract
BACKGROUND: Mycobacterial interspersed repetitive units (MIRUs) are minisatellites within the Mycobacterium tuberculosis (Mtb) genome. Copy number variation (CNV) in MIRU loci is used for epidemiological typing, making the rate of variation important for tracking the transmission of Mtb strains. In this study, we developed and assessed a whole-genome sequencing (WGS) approach to detect MIRU CNV in Mtb. We applied this methodology to a panel of Mtb strains isolated from the macaque model of tuberculosis (TB), the animal model that best mimics human disease. From these data, we have estimated the rate of MIRU variation in the host environment, providing a benchmark rate for future epidemiologic work.Entities:
Mesh:
Substances:
Year: 2013 PMID: 23496945 PMCID: PMC3635867 DOI: 10.1186/1471-2164-14-145
Source DB: PubMed Journal: BMC Genomics ISSN: 1471-2164 Impact factor: 3.969
List of CNVs identified via WGS and Sanger resequencing
| 154-A | None | None | 1 |
| 154-B | None | None | 2 |
| 424 | None | None | 2 |
| 577 | None | None | 2 |
| 580 | A-3 | None | 3 |
| 802 | 3; (H-3 = 4) | ||
| 960-A | I-2 | None | 3 |
| 960-B | None | None | 1 |
| 1644 | None | None | 3 |
| 1955-A | None | None | 1 |
| 1955-B | None | None | 1 |
| 2059 | None | None | 1 |
| 2163b | None | None | 3 |
| 2165* | None | None | 3 |
| 2347 | None | None | 1 |
| 2401-A | None | None | 3 |
| 2401-B | None | None | 1 |
| 2461 | None | None | 2 |
| 2531-A | None | None | 1 |
| 2531-B | None | None | 1 |
| 2687-A | None | None | 1 |
| 2687-B | None | None | 1 |
| 2996 | None | C-1 | 5 |
| 3007 | None | None | 3 |
| 3171* | None | None | 2 |
| 3192 | C-1 | 2; (G-3 = 1) | |
| 3690 | None | C-1 | 5 |
| 4052 | None | None | 3 |
| 4156 | None | None | 3 |
| 4348 | None | None | 2 |
Loci containing asterisks (2165 and 3171) were only analyzed using Sanger sequencing. Bolded strains (H-3 at locus 802 and G-3 at locus 3192) were confirmed insertion and deletion events, respectively, via Sanger resequencing. Discrete copy numbers for each strain were determined by Sanger sequencing. Previous strain notation is used [17].
Comparing mean mate pair distance between H37Rv and single copy reference genomes
| A-1 | 165 | 128 | 157 | 146 |
| A-3 | 188 | 137 | 178 | 165 |
| B-1 | 163 | 124 | 153 | 142 |
| C-1 | 156 | 120 | 148 | 138 |
| C-2 | 206 | 159 | 190 | 174 |
| E-1 | 205 | 159 | 188 | 173 |
| F-1 | 174 | 161 | 172 | 169 |
| G-1 | 157 | 119 | 147 | 137 |
| G-3 | 235 | 168 | 203 | 177 |
| H-2 | 201 | 159 | 188 | 174 |
| H-3 | 199 | 155 | 184 | 170 |
| I-1 | 218 | 180 | 208 | 193 |
| I-2 | 230 | 187 | 218 | 200 |
| I-3 | 223 | 183 | 204 | 198 |
| I-4 | 224 | 176 | 212 | 193 |
| I-6 | 204 | 157 | 184 | 171 |
| I-7 | 202 | 151 | 182 | 166 |
Mean mate pair distance values were calculated for each strain at MIRU locus 3192. Values were gathered for the MIRU coordinates as well as the surrounding (+/100 bp) region.
Figure 1Identifying MIRU CNV utilizing RD and mate pair. (a) Mean normalized RD values at MIRU locus 802 (+/− 100 bp). Mean RD values represented by dashed blue line while two standard deviations above and below the mean RD values corresponds to red and green dotted lines, respectively. Strain H-3, containing a MIRU insertion relative to the inoculum strain, is represented by the black line. Shaded area corresponds to the MIRU coordinates. (b) Mean normalized mate pair distance values at MIRU locus 802 (+/− 100 bp). The color scheme is identical to plotted read depth values. Strain shown is again H-3.
Figure 2Correlation between MIRU copy number and RD/mate pair. (a) MIRU copy number vs. mean RD. The mean RD value for each locus was determined by averaging RD across the entire locus. All loci containing the same copy number as determined by Sanger sequencing were subsequently binned and averaged, providing a single RD value for each copy number observed in the data. (b) Graph of MIRU copy number vs. average mate pair distance. Average mate pair distance values were generated as described for RD.
Figure 3Comparison of published MIRU mutation rates. The per locus, per year published estimates of the MIRU mutation rate. Previous estimates shown in shaded grey. Error bars for all estimates shown represent 95% confidence intervals. *Reyes and Tanaka, 2010 provided a range of values but no mean value **Grant, et al. 2008 provided a single estimate and no confidence intervals.
Values of published MIRU mutation rates
| Estimated rate, macaque model | 2.70 × 10-3 | 3.30 × 10-4 | 9.80 × 10-3 |
| Aandahl et al. 2012 | 3.55 × 10-3 | 8.51 × 10-4 | 1.15 × 10-2 |
| Reyes and Tanaka 2010 | None | 7.00 × 10-4 | 1.50 × 10-2 |
| Wirth et al. 2008 | 1.23 × 10-4 | 1.96 × 10-5 | 7.93 × 10-4 |
| Grant et al. 2008* | 1.05 × 10-5 | None | None |
*Grant, et al. 2008 rate was converted from a per generation to a per year mutation rate.