| Literature DB >> 29351742 |
Nicole Grandi1, Marta Cadeddu1, Jonas Blomberg2, Jens Mayer3, Enzo Tramontano4,5.
Abstract
BACKGROUND: The genomes of all vertebrates harbor remnants of ancient retroviral infections, having affected the germ line cells during the last 100 million years. These sequences, named Endogenous Retroviruses (ERVs), have been transmitted to the offspring in a Mendelian way, being relatively stable components of the host genome even long after their exogenous counterparts went extinct. Among human ERVs (HERVs), the HERV-W group is of particular interest for our physiology and pathology. A HERV-W provirus in locus 7q21.2 has been coopted during evolution to exert an essential role in placenta, and the group expression has been tentatively linked to Multiple Sclerosis and other diseases. Following up on a detailed analysis of 213 HERV-W insertions in the human genome, we now investigated the ERV-W group genomic spread within primate lineages.Entities:
Keywords: Comparative genomics; ERV1–1; Endogenous retroviruses; HERV-W; Monkey and ape retroviruses; Syncytin; Viral evolution
Mesh:
Year: 2018 PMID: 29351742 PMCID: PMC5775608 DOI: 10.1186/s12862-018-1125-1
Source DB: PubMed Journal: BMC Evol Biol ISSN: 1471-2148 Impact factor: 3.260
Fig. 1Schematic of the phylogeny of the primate species analyzed in this study. Presence of (H)ERV-W or (H)ERV-W-related sequences in respective species is indicated with a filled or an empty circle, respectively. Primates’ parvorders and infraorders are indicated in italics and bold, respectively. Estimated ages of divergences of evolutionary lineages in millions of years ago are given near tree nodes and were taken from Steiper and Young 2006 [39] (first number) and Perelman et al. 2011 [40] (second number). Species marked with an * lack assembled reference genome sequences
Number of orthologous HERV-W loci in the analyzed Catarrhini primate genome sequences
| Chimpanzee | Gorilla | Orangutan | Gibbon | Rhesus | |
|---|---|---|---|---|---|
| ERV-W loci orthologous to human 211aHERV-W elements | 205 | 207 | 205 | 190 | 131 |
ano reliable sequence information was available for two HERV-W loci in human chromosome Y (see text)
Fig. 2Initial formation of 211 HERV-W loci based on respective orthologs in Catarrhini primate reference genomes. The number of orthologs to HERV-W loci initially formed in a particular primate species is given for each species for proviruses, L1-retrotransposed processed pseudogene and undefined elements (see text for more details). For instance, 20 HERV-W loci were initially formed in the common ancestor of human and Gibbon, and 8 HERV-W processed pseudogenes were formed in the common ancestor of human and Orangutan. Note that the majority of HERV-W loci was initially formed in the common ancestor of human and Rhesus and is thus common to all Catarrhini genomes. Approximate time periods of last common ancestors of Catarrhini primate lineages are given in millions of years ago (MYa) below species names
Numbers and orthologs of ERV-W sequences identified by HERV17 BLAT searches in Catarrhini primate genome sequences
| Chimpanzee | Gorilla | Orangutan | Gibbon | Rhesus | |
|---|---|---|---|---|---|
| 1) ERV-W loci with HERV-W orthologs in human genome | 138 (67%) | 132 (64%) | 122 (60%) | 111(58%) | 69 (53%) |
| 2) ERV-W loci corresponding to human solitary LTRs ( | 1 (17) | 1 (17) | 7 (10) | 10* (8) | 14* (0) |
| 3) ERV-W loci present in human as non-canonical HERV-W (like) | 29 | 27 | 24 | 21 | 20 |
| 4) ERV-W loci lacking an ortholog in human | 3 (3) | 5 (4) | 8 (6) | 4 (2) | 68 (66) |
| TOTAL | 171 | 165 | 160 | 145 | 168 |
1) Number of ERV-W elements with an orthologous locus among the 211 HERV-W loci: respective percentages are given in parenthesis. Two HERV-W loci on human chromosome Y were excluded from the analysis (see text)
2) Numbers of ERV-W elements corresponding to a solitary LTR at the orthologous human position. Numbers in parenthesis indicate the proviral insertions acquired in evolutionarily older primate species that were likewise a solitary LTR in the non-human primates analyzed. “*” indicates species with initial formations of proviruses that recombined to solitary LTRs in subsequent primate species: Gibbon (5) and Rhesus (14)
3) Numbers of ERV-W elements with an ortholog in the human reference genome sequence, yet being less similar to HERV-W. Those sequences were not identified as HERV-W elements in a previous analysis [68]
4) ERV-W loci absent in the orthologous human genome positions. Numbers in parenthesis indicate the proportion of species-specific insertions
Fig. 3Pairwise nucleotide sequence comparisons depicting sequence similarities between HERV-W and ERV1–1 groups. Reference sequences and consensus sequences were compared with each other as follows. a Callithrix jaccus ERV1–1 RepBase sequence and HERV-W RepBase sequence; b Callithrix jaccus and Saimiri boliviensis ERV1–1 proviral consensus sequences as generated in this paper; c Callithrix jaccus ERV1–1 proviral consensus as generated in this paper and HERV-W RepBase reference sequence; d Callithrix jaccus ERV1–1 proviral consensus sequence as generated in this paper and a HERV-W proviral consensus as reported recently [44]. Sequence similarities in dot-plot comparisons are highlighted for sequence regions with at least 50% similarity along a 100 nucleotides sequence window. Proviral gene and LTR regions are depicted
Fig. 4Phylogenetic analysis of ERV1–1 Gag, Pol and Env puteins. ERV1–1 puteins, labeled with an empty triangle, were obtained by identification and conceptual translation of Marmoset ERV1–1 proviral consensus sequence Open Reading Frames (see methods). The other Gammaretroviral putein sequences were retrieved from Vargiu et al. 2016 [38]. HERV-W puteins are marked with a filled triangle. The evolutionary relationships were inferred by using the ML method based on the Poisson model. Phylogenies were tested by using the bootstrap method with 100 replicates each: the obtained bootstrap values are reported near each node (bootstrap values lower than 30% are not shown). Length of branches indicates the number of substitutions per site
Fig. 5Phylogenetic analysis of Chimpanzee, Gorilla, Orangutan and Gibbon ERV-W nucleotide sequences lacking an ortholog in the human genome. Gammaretrovirus-like HERV reference sequences were retrieved from RepBase. For the HERV-W group, both RepBase reference and the consensus sequences generated previously from the proviral dataset [44] were included and marked with a filled square. The ERV1–1 reference sequence from RepBase and the consensus generated from the proviral sequences dataset in this study are marked with an empty square. Evolutionary relationships were inferred by using the ML method and the Kimura-2-parameter model. The resulting phylogeny was tested using the bootstrap method with 100 replicates: the obtained bootstrap values are reported near each node (bootstrap values lower than 30% are not shown). Length of branches indicates the number of substitutions per site