| Literature DB >> 19906940 |
Charles Cunningham1, Derek Gatherer, Birgitta Hilfrich, Katarina Baluchova, Derrick J Dargan, Marian Thomson, Paul D Griffiths, Gavin W G Wilkinson, Thomas F Schulz, Andrew J Davison.
Abstract
We have assessed two approaches to sequencing complete human cytomegalovirus (HCMV) genomes (236 kbp) in DNA extracted from infected cell cultures (strains 3157, HAN13, HAN20 and HAN38) or clinical specimens (strains JP and 3301). The first approach involved amplifying genomes from the DNA samples as overlapping PCR products, sequencing these by the Sanger method, acquiring reads from a capillary instrument and assembling these using the Staden programs. The second approach involved generating sequence data from the DNA samples by using an Illumina Genome Analyzer (IGA), processing the filtered reads by reference-independent (de novo) assembly, utilizing the resulting sequence to direct reference-dependent assembly of the same data and finishing by limited PCR sequencing. Both approaches were successful. In particular, the investigation demonstrated the utility of IGA data for efficiently sequencing genomes from clinical samples containing as little as 3 % HCMV DNA. Analysis of the genome sequences obtained showed that each of the strains grown in cell culture was a mutant. Certain of the mutations were shared among strains from independent clinical sources, thus suggesting that they may have arisen in a common ancestor during natural infection. Moreover, one of the strains (JP) sequenced directly from a clinical specimen was mutated in two genes, one of which encodes a proposed immune-evasion function, viral interleukin-10. These observations imply that HCMV mutants exist in human infections.Entities:
Mesh:
Substances:
Year: 2009 PMID: 19906940 PMCID: PMC2885759 DOI: 10.1099/vir.0.015891-0
Source DB: PubMed Journal: J Gen Virol ISSN: 0022-1317 Impact factor: 3.891
Published HCMV genome sequences
| AD169 varUK | Sanger | Plasmids | BK000394 | |
| AD169 varATCC | Sanger | BAC | AC146999 | |
| FIX | Sanger | BAC | AC146907 | |
| PH | Sanger | BAC | AC146904 | |
| TB40/E | Sanger | BAC | EF999921 | |
| Toledo | Sanger | BAC | AC146905 | |
| Towne varL | Sanger | BAC | GQ121041 | M. McVoy (unpublished data) |
| Towne varS | Sanger | BAC | AC146851 | |
| AY315197 | ||||
| TR | Sanger | BAC | AC146906 | |
| Merlin | Sanger | Virion DNA | AY446894 | |
| AD169 varUC | IGA | Virion DNA | FJ527563 | |
| Towne varL | IGA | Infected cell DNA | FJ616285 |
HCMV strains analysed: sources, sequencing methods and mutations
| 3157 | Urine from a congenitally infected infant | Cardiff, UK | Fibroblast culture virions | 3 | ✓ | RL13, UL40, UL128 | |||
| JP | Post-mortem prostate tissue from an AIDS patient | London, UK | Clinical material | 0 | ✓ | RL5A, UL111A | |||
| AD169 varUC† | Adenoid tissue | Bethesda, MD, USA | Fibroblast culture virions | Many | ✓ | ✓ | Several | ||
| Towne varL† | Urine from a congenitally infected infant | Philadelphia, PA, USA | Fibroblast culture cells | Many | ✓ | ✓ | Several | ||
| HAN13 | Bronchoalveolar lavage | Hannover, Germany | Fibroblast culture cells | 3 | ✓ | ✓ | ✓ | RL5A | |
| HAN20 | Bronchoalveolar lavage | Hannover, Germany | Fibroblast culture cells | 2 | ✓ | US9 | |||
| HAN38 | Bronchoalveolar lavage | Hannover, Germany | Fibroblast culture cells | 2 | ✓ | RL6, US9 | |||
| 3301 | Urine from a congenitally infected infant | Cardiff, UK | Clinical material | 0 | ✓ | ✓ | ✓ | None | |
*DA, Reference-dependent assembly of IGA data; IAC, reference-independent assembly of IGA data using a cognate scaffold at the Gap4 stage; IAG, reference-independent assembly of IGA data using a generic scaffold at the Gap4 stage. ✓ indicates that the method was used.
†DA of IGA data, passage histories and identities of mutated genes have been reported previously for strains AD169 varUC and Towne varL (Bradley ).
Results of assembling Illumina Genome Analyzer (IGA) data
| AD169 varUC† | 6264332 | 5788753 | 92 | 1267 | 10 | 6 | 1 | 100 | 0 | 0 | 1 |
| Towne varL† | 5079235 | 2390159 | 47 | 516 | 96 | 9 | 3 | 99.97 | 2 | 69 | 0 |
| HAN13 | 3157639 | 855653 | 27 | 184 | 137 | 19 | 6 | 99.78 | 5 | 506 | 7 |
| HAN20 | 7035348 | 1040651 | 15 | 224 | 405 | 8 | 2 | 99.97 | 1 | 68 | 2 |
| HAN38 | 7904170 | 591446 | 8 | 127 | 472 | 8 | 3 | 99.85 | 2 | 343 | 1 |
| 3301 | 5967563 | 197665 | 3 | 43 | 560 | 24 | 9 | 99.41 | 8 | 1378 | 1 |
*See Table 2 footnote.
†See Table 2 footnote.
Results of filtering treatments for strain Towne varL
| Unfiltered | 21 | 5079235 | 8 | 25 | 246 | 237 | 36 | 16 | 290 | 42 |
| 25 | 5079235 | 14 | 44 | 246 | 178 | 30 | 13 | 250 | 41 | |
| Singletons removed | 21 | 1682785 | 13 | 32 | 231 | 65 | 27 | 14 | 210 | 82 |
| Overall quality: | ||||||||||
| >1900 | 21 | 2338321 | 3 | 16 | 292 | 765 | 64 | 16 | 110 | 11 |
| >1700 | 21 | 4156964 | 7 | 32 | 264 | 383 | 46 | 13 | 220 | 26 |
| 25 | 4156964 | 11 | 32 | 243 | 150 | 32 | 16 | 190 | 44 | |
| Quality truncation, length >29 | 21 | 2609325 | 7 | 15 | 242 | 233 | 36 | 13 | 65 | 26 |
| Truncated to length: | ||||||||||
| 40 | 21 | 5079235 | 14 | 45 | 251 | 256 | 21 | 12 | 190 | 25 |
| 25 | 5079235 | 18 | 44 | 242 | 114 | 23 | 12 | 160 | 38 | |
| 35 | 21 | 5079235 | 29 | 65 | 246 | 176 | 18 | 8 | 150 | 26 |
| 25 | 5079235 | 31 | 68 | 242 | 99 | 13 | 10 | 110 | 30 | |
| 30 | 21 | 5079235 | 24 | 68 | 240 | 90 | 20 | 11 | 105 | 32 |
| 25 | 5079235 | 54 | 68 | 237 | 61 | 8 | 7 | 60 | 24 | |
*Each dataset was assembled using Velvet-0.7.31 (see Methods). Exp_cov, Optimal expected coverage; cov_cutoff, optimal coverage cut-off.
Filtering treatments and hash lengths (k) used
✓ indicates that data from the treatment method were utilized in assembly.
| Unfiltered | 21 | ✓ | ✓ | ✓ | ✓ | ✓ | ✓ |
| 19 | ✓ | ||||||
| 25 | ✓ | ✓ | ✓ | ✓ | ✓ | ✓ | |
| Singletons removed | 21 | ✓ | ✓ | ✓ | ✓ | ✓ | ✓ |
| 25 | ✓ | ||||||
| 27 | ✓ | ||||||
| Overall quality | |||||||
| >1900 | 21 | ✓ | ✓ | ||||
| >1700 | 21 | ✓ | ✓ | ✓ | |||
| 25 | ✓ | ✓ | |||||
| >1300 | 21 | ✓ | ✓ | ✓ | |||
| 25 | ✓ | ✓ | |||||
| Quality truncation, length | |||||||
| >39 | 21 | ✓ | ✓ | ✓ | |||
| >29 | 21 | ✓ | ✓ | ✓ | ✓ | ||
| Truncated to length | |||||||
| 49 | 21 | ✓ | |||||
| 40 | 21 | ✓ | ✓ | ✓ | ✓ | ✓ | ✓ |
| Truncated to length 40, then singletons removed | 21 | ✓ | |||||
| Truncated to length | |||||||
| 40 | 29 | ✓ | |||||
| 25 | ✓ | ✓ | ✓ | ✓ | ✓ | ||
| 35 | 21 | ✓ | ✓ | ✓ | ✓ | ✓ | ✓ |
| Truncated to length 35, then singletons removed | 21 | ✓ | |||||
| Truncated to length | |||||||
| 35 | 25 | ✓ | ✓ | ✓ | ✓ | ||
| 30 | 21 | ✓ | ✓ | ✓ | ✓ | ✓ | ✓ |
| 25 | ✓ | ✓ | ✓ | ✓ | ✓ | ||
| 29 | ✓ | ||||||