| Literature DB >> 17550625 |
Aline Flockerzi1, Jochen Maydt, Oliver Frank, Alessia Ruggieri, Esther Maldener, Wolfgang Seifarth, Patrik Medstrand, Thomas Lengauer, Andreas Meyerhans, Christine Leib-Mösch, Eckart Meese, Jens Mayer.
Abstract
BACKGROUND: The human genome comprises numerous human endogenous retroviruses (HERVs) that formed millions of years ago in ancestral species. A number of loci of the HERV-K(HML-2) family are evolutionarily much younger. A recent study suggested an infectious HERV-K(HML-2) variant in humans and other primates. Isolating such a variant from human individuals would be a significant finding for human biology.Entities:
Mesh:
Substances:
Year: 2007 PMID: 17550625 PMCID: PMC1904241 DOI: 10.1186/1742-4690-4-39
Source DB: PubMed Journal: Retrovirology ISSN: 1742-4690 Impact factor: 4.602
Figure 1Nature of HERV-KX sequences. (A) Similarity of 642 HML-2 cDNA sequences and their best matching HML-2 loci in the human genome. The majority of cDNAs displayed between zero and a few differences to the best match, and were thus assignable to specific HML-2 loci. A minority of cDNAs displayed a greater number of dissimilarities to the best match and were thus not assignable with confidence to specific HML-2 loci. HERV-KX sequences were defined as displaying 18 or more nucleotide differences to the best matching HML-2 locus. (B) Sequence divergence of HERV-KX sequences in comparison to selected HML-2 reference sequences from the human genome, depicted as a neighbour joining-tree of the absolute number of nucleotide differences between sequences. For the sake of clarity, phylogenetically more distant HML-2 reference sequences were not included in the tree, as they were less similar to HERV-KX sequences than the reference sequences included in the tree. Proviral reference sequences are given as "xx_xxx" (see text). Positions with gaps were excluded in pairwise sequence comparisons.
Analysis of test sequences with Locus-Assigner and RECCO.
| 1 | 2 | 3 | 4 | 5 | 6 | 7 | 8 | 9 |
| 22_012 | 1 | 22_012 | 282 – 307 | 4 | 0,001 | 4 | ||
| 7_450 | 1 | 7_450 | 282 – 307 | 1 | 0,001 | 1 | ||
| 4_742 | 2 | 4_742 | 317 – 384 | 0 | 0,003 | 0 | ||
| 10_019 | 2 | 10_019 | 388 – 436 | 3 | 0,001 | |||
| 22_012 | 1 | 10_310 | 59 – 191 | 0 | ||||
| 22_012 | 0 | 22_012 | 8 | |||||
| 11_424/3_610/5 | 0 | 11_424/3_610/5_544/7_345 | 6 | |||||
Test sequences were created in silico by combining sequence regions of assignable HML-2 cDNA sequences, with the aim to construct recombination events between transcripts from corresponding HML-2 loci. Pairwise comparisons, using the in-house Bio-Python script Locus-Assigner, were then performed for test and reference sequences. Column 1 denotes the cDNA sequence name. Column 2: best matching HML-2 proviral locus according to pairwise sequence comparisons with Locus-Assigner. The numbers given here disregard the PCR primer binding regions, and therefore are, for HERV-KX sequences B262, B12 and 93, below the <18 nt differences-threshold initially set for HERV-KX (see text). Due to the lack of a HERV taxonomy, and for the sake of simplicity, HML-2 proviral loci have been numbered artificially in this paper, with the first number indicating chromosomal location of the locus, and the second number being a consecutive number without further connotation. Column 3: nucleotide differences to best match. Column 4: number of recombinations according to RECCO. Column 5: proviral loci involved in recombination(s). More than one locus is given when different loci displayed identical (dis)similarities. Column 6: regions where recombination(s) occured, with the 5' end of the RT-PCR product being defined as nucleotide +1. Column 7: nucleotide differences to individual HML-2 loci. Column 8: p-values of particular predicted recombinations. Column 9: nucleotide differences remaining after considering recombination(s) (see text).
Analysis of HERV-KX sequences with Locus Assigner and RECCO.
| 1 | 2 | 3 | 4 | 5 | 6 | 7 | 8 | 9 |
| 7_450 | 3 | 3_610/5_147/7_345 | 59–67 | 0 | 0,059 | |||
| 7_450 | 182–191 | 0 | 0,001 | |||||
| 10_019 | 385–436 | 3 | 0,023 | |||||
| 7_450 | 2 | 7_450 | 182–191 | 6 | 0,001 | |||
| 10_019 | 385–436 | 3 | 0,023 | |||||
| 1_504/K1_15/19_352 | 1 | |||||||
| 7_450 | 1 | 7_450 | 182–191 | 6 | 0,001 | |||
| 10_019 | 14 | |||||||
| 7_450 | 2 | 7_450 | 164–181 | 0 | 0,001 | |||
| 3_385 | 510–551 | 0 | 0,004 | |||||
| 7_450 | 1 | 7_450 | 164–181 | 0 | 0,001 | |||
| 5_544 | 4 | |||||||
| 7_450 | 1 | 7_450 | 192–205 | 1 | 0,001 | |||
| 1_259 | 6 | |||||||
| 3_610 | 1 | 21_019 | 68–181 | 0 | 0,001 | |||
| 7_450 | 0 | |||||||
| 7_450 | 2 | 7_450 | 192–209 | 3 | 0,001 | |||
| 10_019 | 437–466 | 2 | 0,201 | |||||
| 1_504/19_352/K115 | 2 | |||||||
| 7_450 | 1 | 7_450 | 192–209 | 3 | 0,001 | |||
| 10_019 | 13 | |||||||
| 11_424/3_354/1_504/5_544/K115 | 2 | 5_544 | 68–181 | 2 | 0,001 | |||
| 7_450 | 727–752 | 0 | 0,004 | |||||
| 3_385 | 0 | |||||||
| 3_610 | 1 | 3_610/7_345/5_147 | 68–181 | 0 | 0,001 | |||
| 7_450 | 0 | |||||||
| 7_450 | 1 | 7_450 | 238–242 | 0 | 0,001 | |||
| 1_504/1_259 | 4 | |||||||
| 7_450 | 2 | 7_450 | 164–181 | 0 | 0,001 | |||
| 1_504/22_012/6_324 | 583–712 | 0 | 0,122 | |||||
| 7_450 | 0 | |||||||
| 7_450 | 1 | 7_450 | 164–181 | 0 | 0,001 | |||
| 7_450 | 2 | 7_450 | 267–273 | 0 | 0,001 | |||
| 3_385 | 502–505 | 0 | 0,001 | |||||
| 3_014 | 0 | |||||||
| 1_504 | 2 | 3_354/1_504 | 238–242 | 1 | 0,001 | |||
| 7_450 | 583–712 | 0 | 0,004 | |||||
| 3_385 | 0 | |||||||
| 1_259 | 2 | 11_424 | 389–401 | 2 | 0,001 | |||
| 7_450 | 583–624 | 0 | 0,102 | |||||
| 10_019 | 0 | |||||||
| 1_259 | 1 | 11_424 | 389–401 | 2 | 0,001 | |||
| 7_450 | 9 | |||||||
| 11_456 | 2 | 11_456 | 415–422 | 0 | 0,001 | |||
| 7_450 | 583–651 | 0 | 0,003 | |||||
| 11_456 | 0 | |||||||
| 3_496/7_345 | 1 | 10_019 | 214–263 | 2 | 0,001 | |||
| 7_345/1_504 | 8 | |||||||
| 3_014 | 2 | HOM/6_324 | 316–388 | 0 | 0,001 | |||
| 3_014 | 676–764 | 0 | 0,330 | |||||
| 3_385 | 0 | |||||||
| 3_014 | 1 | HOM/6_324 | 316–388 | 0 | 0,001 | |||
| 3_014 | 5 | |||||||
| 1_504 | 2 | 3_354/1_504 | 192–306 | 0 | 0,021 | |||
| 3_385 | 521–553 | 0 | 0,001 | |||||
| 7_450 | 0 | |||||||
| 1_504 | 1 | 3_385 | 521–553 | 5 | 0,001 | |||
| 7_450 | 0 | |||||||
| 11_424 | 3 | 1_259/11_424 | 256–265 | 0 | 0,006 | |||
| 7_450 | 404–414 | 0 | 0,006 | |||||
| (***) | 583–610 | 1 | 0,002 | |||||
| 7_450 | 0 | |||||||
| 3_496 | 1 | 4_742 | 209–226 | 0 | 0,001 | |||
| 5_544 | 4 | |||||||
| 5_544 | 1 | 5 544 | 583–673 | 7 | 0,001 | |||
| 7_450 | 0 | |||||||
| 7_450 | 1 | 7_450 | 583–610 | 1 | 0,001 | |||
| 21 019 | 0 | |||||||
Designation of columns is the same as in Table 1.
(*) RECCO provided alternative explanations with different p-values. The lowest p-value did not neccessarily correspond to the explanation with the least nt differences. By visual inspection of sequences, another explanation may be favored despite a higher p-value.
(**) HERV-KX sequences without 96 bp sequence. RECCO predicted recombination with loci having the 96 bp sequence (see text), and predicted a recombination involving the 96 bp sequence. Regions of recombination events may therefore be shorter then the ones given by RECCO.
(***) Among 10 different proviral loci, RECCO does not specify a particular locus because all 10 loci are very similar or identical in sequence to each other in the concerned region. By visual inspection, a more optimal explanation among the 10 loci may be specified.
Figure 2RECCO-based explanation of HERV-KX sequences as recombination products. Based on RECCO analysis, HERV-KX sequences can be explained as recombination products between transcripts from different HML-2 proviral loci. Three examples of HERV-KX sequences were multiply aligned with HML-2 proviral sequences, the transcripts of which served as template for recombination events. HERV-KX sequence "93" was generated by one recombination event involving two loci. HERV-KX sequence "B270" was generated by two recombination events involving three loci. HERV-KX sequence "94" was generated by two recombination events involving two different proviral templates. Probable recombination regions are indicated by medium grey background. Light and dark grey background indicates proviral templates. Localization of the RT-PCR amplicon and the 96 bp sequence (see text) within the HML-2 gag gene/provirus is depicted above the sequence comparisons.
Figure 3Graphical representation of HERV-KX sequences as recombination products. Recombination source sequences and HERV-X sequences are depicted graphically in the upper and lower part, respectively. Colors depict composition of HERV-KX sequences from the various source sequences. Thus, color changes indicate regions of predicted recombination events (see Table 2). Lack of a 96 bp sequence (see text) is indicated by a gap.