| Literature DB >> 26282858 |
Stefanie Hartmann1, Natascha Hasenkamp2, Jens Mayer3, Johan Michaux4, Serge Morand5,6, Camila J Mazzoni7,8, Alfred L Roca9, Alex D Greenwood10.
Abstract
BACKGROUND: Endogenous murine leukemia retroviruses (MLVs) are high copy number proviral elements difficult to comprehensively characterize using standard low throughput sequencing approaches. However, high throughput approaches generate data that is challenging to process, interpret and present.Entities:
Mesh:
Substances:
Year: 2015 PMID: 26282858 PMCID: PMC4538763 DOI: 10.1186/s12864-015-1766-z
Source DB: PubMed Journal: BMC Genomics ISSN: 1471-2164 Impact factor: 3.969
Fig. 1MLV regions sequenced and summary of sequence cluster information. The structure of the MLV genome is shown between two data tables, with the locations of retroviral regions that were amplified and sequenced indicated by the thick lines. The numbers with which these regions are labeled (1, 2,3, 5, 6) indicate the positions of the regions targeted by PCR, which covered 5 of the 6 recombination sites that created XMRV from PreXMRV-1 and PreXMRV-2 [5]. The target region labeled “2,3” was a single PCR product that included recombination sites 2 and 3. Note that there is no line segment numbered 4, since the PCR targeting the fourth recombination region yielded far fewer reads for all mice tested and was therefore excluded from further analyses. Block arrows point from the analyzed MLV regions to the corresponding table summarizing the clusters identified and analyzed for that genome region. Within the tables, each row represents one cluster of related sequences. A cluster is defined as sequences sharing sufficient identity with each other and with the chosen reference sequences to form a group distinct from other sequences. The first five columns in each table represent the number of sequences in a given cluster for the samples from inbred laboratory mouse strains C3H, LP/J, SJL and two wild caught mice Mmd1 (Corsica) and Mmd2 (mainland France). Shading of these cells correspond to the number of sequences per cluster that were identified per mouse: white for no sequences matching a cluster, light gray for 1-6 sequences, dark gray for more than 6 sequences. Cells shaded in intermediate gray indicates that a cluster was unique to a single mouse. The last four table columns list four different types of MLV (Xmv, XMRV, Pmv or Mpmv), each of which was compared to the mouse sequences generated by the current study. An “X” in these table cells indicates that one or more of the corresponding reference sequences were assigned to the given cluster. When only a single type of MLV reference sequence was assigned to the cluster, the “X” is underlined
Maximum match between polytropic murine leukemia retrovirus (Pmv) and each next generation sequenced MLV region, in 5 mice
| Reference | MLV region | C3H | LP/J | SJL | Mmd1 | Mmd2 |
|---|---|---|---|---|---|---|
| Pmv1 | 1 | 100 | 100 | 100 | 99.5 |
|
| 2,3 | 100 | 100 | 99.7 | 99.7 |
| |
| 5 | 100 | 100 | 100 | 99.5 |
| |
| 6 | 99.8 | 99.8 | 99.8 | 99.8 |
| |
| Pmv5 | 1 | 100 | 100 | 100 | 99.5 |
|
| 2,3 | 100 | 100 | 99.7 | 99.7 |
| |
| 5 | 100 | 100 | 100 | 99.5 |
| |
| 6 | 99.8 | 99.8 | 99.8 | 99.8 |
| |
| Pmv7 | 1 | 99.8 |
| 99.3 | 99.1 | 99.5 |
| 2,3 | 100 |
| 100 | 100 | 100 | |
| 5 | 100 |
| 100 | 99.5 | 100 | |
| 6 | 100 |
| 100 | 99.8 | 100 | |
| Pmv8 | 1 |
|
| 96.4 | 99.3 | 99.8 |
| 2,3 |
|
| 99.7 | 99.7 | 99.7 | |
| 5 |
|
| 100 | 99.5 | 100 | |
| 6 |
|
| 100 | 99.8 | 100 | |
| Pmv9 | 1 | 100 | 100 | 100 | 99.5 | 100 |
| 2,3 | 99.7 | 99.7 | 99.5 | 99.5 | 99.7 | |
| 5 | 100 | 99.5 | 100 | 99.7 | 99.5 | |
| 6 | 99.3 | 99.3 | 99.5 | 98.8 | 99.3 | |
| Pmv10 | 1 |
|
| 99.5 | 99.1 | 100 |
| 2,3 |
|
| 99.7 | 99.7 | 100 | |
| 5 |
|
| 100 | 99.5 | 100 | |
| 6 |
|
| 99.8 | 99.5 | 99.8 | |
| Pmv11 | 1 | 100 |
| 100 | 99.5 | 100 |
| 2,3 | 100 |
| 99.7 | 99.7 | 100 | |
| 5 | 100 |
| 100 | 99.5 | 100 | |
| 6 | 99.8 |
| 99.5 | 99.5 | 99.8 | |
| Pmv12 | 1 | 100 | 100 | 99.8 | 99.3 | 100 |
| 2,3 | 99.5 | 99.7 | 99.5 | 99.5 | 99.5 | |
| 5 | 100 | 100 | 100 | 99.5 | 100 | |
| 6 | 100 | 100 | 100 | 99.8 | 100 | |
| Pmv13 | 1 | 100 | 99.5 | 99.3 | 98.8 |
|
| 2,3 | 100 | 100 | 100 | 100 |
| |
| 5 | 100 | 100 | 100 | 99.5 |
| |
| 6 | 99.8 | 99.8 | 99.5 | 99.5 |
| |
| Pmv14 | 1 |
| 99.8 | 99.5 | 99.3 |
|
| 2,3 |
| 100 | 99.7 | 98.9 |
| |
| 5 |
| 100 | 99.7 | 99.2 |
| |
| 6 |
| 99.8 | 99.8 | 99.5 |
| |
| Pmv15 | 1 | 99.8 | 100 | 99.5 | 99.3 | 99.5 |
| 2,3 | 99.5 | 99.5 | 99.5 | 99.5 | 99.5 | |
| 5 | 100 | 100 | 100 | 99.5 | 100 | |
| 6 | 100 | 100 | 100 | 99.8 | 100 | |
| Pmv16 | 1 | 100 | 100 | 100 | 99.5 |
|
| 2,3 | 100 | 100 | 100 | 100 |
| |
| 5 | 100 | 100 | 100 | 99.5 |
| |
| 6 | 99.8 | 99.8 | 99.8 | 99.8 |
| |
| Pmv18 | 1 | 100 | 100 | 100 | 99.5 | 100 |
| 2,3 | 99.5 | 99.7 | 99.5 | 99.5 | 99.5 | |
| 5 | 100 | 100 | 100 | 99.5 | 100 | |
| 6 | 99.8 | 100 | 99.8 | 99.5 | 99.8 | |
| Pmv19 | 1 |
|
|
| 99.5 |
|
| 2,3 |
|
|
| 100 |
| |
| 5 |
|
|
| 99.5 |
| |
| 6 |
|
|
| 99.8 |
| |
| Pmv20 | 1 | 100 | 100 | 99.8 | 99.3 | 100 |
| 2,3 | 100 | 99.5 | 99.5 | 99.2 | 99.5 | |
| 5 | 100 | 100 | 100 | 99.5 | 100 | |
| 6 | 99.8 | 100 | 99.5 | 99.5 | 100 | |
| Pmv21 | 1 | 99.5 | 99.8 | 99.3 | 98.8 | 99.5 |
| 2,3 | 99.5 | 99.5 | 99.7 | 99.5 | 99.7 | |
| 5 | 100 | 100 | 100 | 99.5 | 100 | |
| 6 | 99.8 | 99.8 | 99.8 | 99.8 | 100 | |
| Pmv22 | 1 | 100 | 100 | 100 | 99.5 | 100 |
| 2,3 | 99.7 | 99.7 | 99.7 | 99.7 | 99.7 | |
| 5 | 100 | 100 | 100 | 99.5 | 100 | |
| 6 | 99.8 | 100 | 99.8 | 99.5 | 99.8 | |
| Pmv23 | 1 | 99.3 | 100 | 99.5 | 98.6 | 99.1 |
| 2,3 | 98.9 | 98.9 | 99.2 | 98.9 | 99.2 | |
| 5 | 99.7 | 100 | 99.7 | 99.2 | 99.7 | |
| 6 | 99.8 | 100 | 99.3 | 99.3 | 99.8 | |
| Pmv24 | 1 | 100 |
| 99.8 | 99.3 |
|
| 2,3 | 100 |
| 100 | 100 |
| |
| 5 | 100 |
| 100 | 99.5 |
| |
| 6 | 99.8 |
| 99.5 | 99.5 |
|
The use of boldface indicates that sequences with 100% identity to a reference were detected for all MLV target regions in a mouse. The Pmv reference sequences are those of Bamunusinghe et al. [4]
Maximum match between polytropic murine leukemia retrovirus (Mpmv) and each next generation sequenced MLV region, in 5 mice
| Reference | MLV region | C3H | LP/J | SJL | Mmd1 | Mmd2 |
|---|---|---|---|---|---|---|
| Mpmv1 | 1 | 99.8 |
| 99.5 | 99.3 | 99.5 |
| 2,3 | 99.7 |
| 99.7 | 99.7 | 100 | |
| 5 | 100 |
| 100 | 99.7 | 100 | |
| 6 | 99.5 |
| 99.8 | 99.3 | 100 | |
| Mpmv2 | 1 | 99.8 | 99 | 98.8 | 98.6 | 100 |
| 2,3 | 100 | 100 | 99.5 | 99.5 | 100 | |
| 5 | 100 | 100 | 100 | 99.7 | 100 | |
| 6 | 100 | 99.8 | 99.5 | 99.5 | 99.8 | |
| Mpmv4 | 1 |
| 99.8 | 99.8 | 99.5 | 100 |
| 2,3 |
| 99.7 | 99.5 | 99.5 | 99.7 | |
| 5 |
| 99.7 | 99.7 | 99.5 | 100 | |
| 6 |
| 100 | 100 | 100 | 100 | |
| Mpmv5 | 1 | 99.5 | 99.3 | 99.3 | 99 | 99.3 |
| 2,3 | 98.9 | 98.9 | 98.9 | 98.7 | 99.2 | |
| 5 | 99.7 | 99.7 | 99.7 | 99.5 | 99.7 | |
| 6 | 99.3 | 99.3 | 99.3 | 99.3 | 99.3 | |
| Mpmv6 | 1 | 100 | 100 | 99.3 | 98.8 | 99.3 |
| 2,3 | 100 | 100 | 99.5 | 99.5 | 99.7 | |
| 5 | 100 | 100 | 99.7 | 99.7 | 99.7 | |
| 6 | 99.8 | 99.3 | 98.6 | 98.6 | 98.8 | |
| Mpmv7 | 1 | 99.5 |
| 99 | 98.8 | 99.8 |
| 2,3 | 100 |
| 99.7 | 99.7 | 100 | |
| 5 | 100 |
| 100 | 99.7 | 100 | |
| 6 | 100 |
| 99.8 | 99.8 | 100 | |
| Mpmv8 | 1 | 99.3 | 99 | 99 | 98.8 | 99.3 |
| 2,3 | 100 | 99.7 | 99.5 | 99.2 | 99.5 | |
| 5 | 100 | 99.7 | 99.7 | 99.5 | 100 | |
| 6 | 100 | 100 | 100 | 100 | 100 | |
| Mpmv9 | 1 | 99.8 | 99 | 98.8 | 98.6 |
|
| 2,3 | 100 | 100 | 99.5 | 99.5 |
| |
| 5 | 100 | 100 | 100 | 99.7 |
| |
| 6 | 100 | 99.8 | 99.8 | 99.8 |
| |
| Mpmv10 | 1 |
|
| 99.5 | 99.3 | 99.5 |
| 2,3 |
|
| 99.5 | 99.5 | 99.7 | |
| 5 |
|
| - | - | - | |
| 6 |
|
| 99.5 | 99 | 99.3 | |
| Mpmv11 | 1 | 100 | 100 | 100 | 99.8 | 100 |
| 2,3 | 100 | 100 | 99.7 | 99.7 | 100 | |
| 5 | 99.7 | 99.7 | 99.7 | 99.5 | 99.7 | |
| 6 | 99.5 | 99.5 | 99.5 | 99.5 | 99.5 | |
| Mpmv12 | 1 | 100 | 99.3 | 99 | 98.8 | 99.8 |
| 2,3 | 100 | 99.7 | 99.5 | 99.2 | 99.7 | |
| 5 | 100 | 100 | 100 | 99.7 | 100 | |
| 6 | 99.8 | 99.5 | 99.5 | 99.5 | 99.5 | |
| Mpmv13 | 1 | 99.8 | 99.8 | 99.8 | 99.5 | 99.8 |
| 2,3 | 99.5 | 99.5 | 99.5 | 99.2 | 99.7 | |
| 5 | 99.5 | 99.5 | 99.5 | 99.2 | 100 | |
| 6 | 100 | 100 | 100 | 100 | 100 |
The use of boldface indicates that sequences with 100% identity to a reference were detected for all MLV target regions in a mouse. The use of italics indicates that more than one Mpmv sequence in a cluster was 100% identical. Mpmv reference sequences are those of Bamunusinghe et al. [4]. A dash indicates that for a target MLV region, the region is deleted in the reference sequence relative to other MLV sequences
Maximum match between xenotropic murine leukemia retrovirus (Xmv) and each next generation sequenced MLV region, in 5 mice
| Reference | MLV region | C3H | LP/J | SJL | Mmd1 | Mmd2 |
|---|---|---|---|---|---|---|
| Xmv8 | 1 | 88.3 | 88.8 | 86.9 | 88.6 | 95.1 |
| 2,3 | 98.7 | 99.7 | 97.1 | 99.5 | 99.5 | |
| 5 | 98.4 | 98.7 | 92.9 | 97.9 | 98.7 | |
| 6 | 97.1 | 99.5 | 90 | 90 | 95.5 | |
| Xmv9 | 1 | 96.1 | 94.2 | 87.1 | 88 | 98.4 |
| 2,3 | 99.2 | 97.9 | 97.3 | 98.7 | 99.2 | |
| 5 | 98.9 | 99.2 | 93.4 | 98.4 | 99.2 | |
| 6 | 90.4 | 89.8 | 86 | 86.4 | 89.9 | |
| Xmv10 | 1 | 88.6 | 80.4 | 84.7 | 79.6 | 98.4 |
| 2,3 | 98.4 | 99.5 | 96.8 | 99.2 | 99.2 | |
| 5 | - | - | - | - | - | |
| 6 | - | - | - | - | - | |
| Xmv12 | 1 | 99.5 | 98.5 | 77.5 | 88.2 | 98.5 |
| 2,3 | 100 | 99.2 | 99.5 | 99.7 | 100 | |
| 5 | 98.9 | 99.5 | 94 | 98.9 | 100 | |
| 6 | 100 | 100 | 98.3 | 98.1 | 99.8 | |
| Xmv13 | 1 | 96.7 | 96.9 | 95.9 | 96.5 | 96.7 |
| 2,3 | 98.1 | 99.2 | 96.5 | 98.9 | 98.9 | |
| 5 | 99.2 | 99.5 | 93.7 | 98.7 | 99.5 | |
| 6 | 97.6 | 99 | 90.3 | 90 | 95.2 | |
| Xmv15 | 1 | 90.8 | 91.1 | 72 | 81.3 | 91.3 |
| 2,3 | 100 | 99.2 | 99.5 | 99.7 | 100 | |
| 5 | 98.9 | 99.5 | 94 | 98.9 | 100 | |
| 6 | 98.8 | 98.8 | 98.5 | 98.3 | 98.5 | |
| Xmv17 | 1 |
| 96.2 | 82.7 | 95.5 |
|
| 2,3 |
| 99.2 | 99.2 | 99.2 |
| |
| 5 |
| - | - | - |
| |
| 6 |
| 99.5 | 97.9 | 97.6 |
| |
| Xmv18 | 1 | 99.5 | 98.2 | 77.5 | 88.2 |
|
| 2,3 | 100 | 99.2 | 99.5 | 99.7 |
| |
| 5 | 98.9 | 99.5 | 94 | 98.9 |
| |
| 6 | 99.8 | 99.8 | 98.1 | 97.9 |
| |
| Xmv19 | 1 | 98.6 | 99.1 | 77.3 | 87.9 | 98.7 |
| 2,3 | 100 | 99.2 | 99.5 | 99.7 | 100 | |
| 5 | 98.9 | 99.5 | 94 | 98.9 | 100 | |
| 6 | 52.2 | 46.9 | 48.5 | 48.9 | 49.8 | |
| Xmv41 | 1 | 96.8 | 97.5 | 85.4 | 95.1 | 97.3 |
| 2,3 | 97.9 | 97.9 | 95.7 | 97.1 | 98.4 | |
| 5 | 96.3 | 96.6 | 93.4 | 93.1 | 96.8 | |
| 6 | 97.7 | 98.4 | 83.6 | 84.2 | 97.2 | |
| Xmv42 | 1 |
|
|
|
|
|
| 2,3 |
|
|
|
|
| |
| 5 |
|
|
|
|
| |
| 6 |
|
|
|
|
| |
| Xmv43 | 1 | 98.3 | 98.5 | 86.3 | 96.3 |
|
| 2,3 | 98.4 | 98.4 | 96.3 | 97.6 |
| |
| 5 | 98.7 | 99.2 | 93.7 | 98.7 |
| |
| 6 | 98.6 | 98.4 | 84.6 | 84.7 |
|
The use of boldface indicates that across all MLV target regions, the generated sequence read was more similar to the Xmv reference than were other Xmv references. Xmv reference sequences are those of Bamunusinghe et al. [4]. A dash indicates that for a target MLV region, the region is deleted in the reference sequence relative to other MLV sequences
Identity to XMRV-like sequences
| MLV Target | C3H | LP/J | SJL | Mmd1 | Mmd2 | PreXMRV1 | PreXMRV2 | XMRV | |
|---|---|---|---|---|---|---|---|---|---|
| PreXMRV1 | 1 | 96.6 | 96.6 | 84.9 | 96.3 | 94.1 | 100 | 90.8 | 98.3 |
| 2,3 | 99.7 | 99.2 | 92 | 99.2 | 95.2 | 100 | 91.2 | 92.3 | |
| 5 | 98.7 | 98.9 | 92.9 | 99.2 | 92.6 | 100 | 91.1 | 92.2 | |
| 6 | 96.5 | 97.4 | 83.4 | 96.5 | 83.3 | 100 | 78.7 | 86.5 | |
| PreXMRV2 | 1 |
|
|
|
|
| 90.8 | 100 | 92.4 |
| 2,3 |
|
|
|
|
| 91.2 | 100 | 98.9 | |
| 5 |
|
|
|
|
| 91.1 | 100 | 89.8 | |
| 6 |
|
|
|
|
| 78.7 | 100 | 91.6 | |
| XMRV | 1 | 97.3 | 97.3 | 86.5 | 97.3 | 95.6 | 98.3 | 92.4 | 100 |
| 2,3 | 98.9 | 98.9 | 98.4 | 98.9 | 98.9 | 92.3 | 98.9 | 100 | |
| 5 | 95.8 | 95.8 | 92.8 | 95.5 | 92.6 | 92.2 | 89.8 | 100 | |
| 6 | 93.1 | 96.8 | 88.6 | 94.9 | 90 | 86.5 | 91.6 | 100 |
The use of boldface indicates that across all MLV target regions, a generated sequence read was more similar to the reference than were other XMRV references. Although in this table, PreXMRV1 could be predicted to be present in C3H, LP/J and Mmd1, Xmv43 exhibited higher identity to several breakpoints than the sequences obtained from the mice in this study. Thus, we cannot conclude that any of these mice carry PreXMRV1
Haplotype diversity of the Xpr1 gene across five mice
| Exon 4 | ECL 3.2 | ECL 3.3 | ECL 4 | ||||||
|---|---|---|---|---|---|---|---|---|---|
| Residue no. | 103 | 105 | 106 | 500 | 503 | 583 | 590 | ||
| amino acid | A/G | T | A/T | K/E | K/N | T/- | D/N | ||
| SJL | C | A | G | A | A | A | G | ||
| C3H | G | G | A | G | A | - | A | ||
| LP/J | G | G | A | A | T | A | G | ||
| Mmd1 | C | A | G | A | T | A | G | ||
| Mmd2 | C | A | G | A | T | A | G |
ECL3 and 4 stand for the third and forth extracellular loop of Xpr1. The residue numbers indicate the positions in the primary sequence of the XPR1 protein, whereas the row below (amino acid) shows which kind of change occurs. The nucleotide changes are also shown below to give an impression on the amount of synonymous and nonsynonymous variation among the five analyzed mice. A dash stands for a missing nucleotide at the respective position relative to all other shown sequences