Literature DB >> 25142605

Novel Denisovan and Neanderthal retroviruses.

Adam Lee1, Derek Huntley2, Pakorn Aiewsakun3, Ravinder K Kanda3, Claire Lynn2, Michael Tristem3.   

Abstract

Following the recent availability of high-coverage genomes for Denisovan and Neanderthal hominids, we conducted a screen for endogenized retroviruses, identifying six novel, previously unreported HERV-K(HML2) elements (HERV-K is human endogenous retrovirus K). These elements are absent from the human genome (hg38) and appear to be unique to archaic hominids. These findings provide further evidence supporting the recent activity of the HERV-K(HML2) group, which has been implicated in human disease. They will also provide insights into the evolution of archaic hominids.
Copyright © 2014 Lee et al.

Entities:  

Mesh:

Year:  2014        PMID: 25142605      PMCID: PMC4248912          DOI: 10.1128/JVI.01825-14

Source DB:  PubMed          Journal:  J Virol        ISSN: 0022-538X            Impact factor:   5.103


TEXT

In 2008, an archaeological dig at a cave in the Siberian Altai mountain range led to the discovery of a finger bone belonging to a female hominid, dating to at least ∼50,000 years ago (1, 2). From this, the DNA of a subspecies of Homo sapiens, designated Denisovans, was sequenced (1). Similarly, a draft Neanderthal genome—a sister group to Denisovans—–was sequenced from three individuals in 2010 (3, 4). Using this data, Agoni et al. (5) identified 14 novel human endogenous retrovirus K (HERV-K) proviruses, which were absent from the human genome sequence (assembly hg19). The authors suggested that these HERVs were unique to archaic hominids and that no orthologous insertions would be found in modern humans (5). Subsequently, however, Marchi et al. reported that all of these sequences were actually present, or likely to be present, in some modern humans (6). In this study, we screened the most recently available high-coverage genomes for a Denisovan (4) and an Altai Neanderthal (2) for HERV-K proviruses. We present six novel, endogenized retroviruses, absent from the hg38 human genome, 43 modern-human genomes reported by Lee et al. (7), and a further 358 reported by Marchi et al. (8). These may therefore represent the first proviruses unique to Neanderthal and Denisovan hominids. While endogenized retroviral DNA makes up ∼8% of the human genome, only one group—HERV-K(HML2)—appears to have been active within the past million years. This has been demonstrated by the observation that some members of this group, but not others, are insertionally polymorphic, having been identified in some modern humans (9). Although no active, replication-competent HERV-K(HML2) elements have been identified to date, it remains possible that such elements exist and may cause disease in some modern humans. The high-coverage Neanderthal and Denisovan genomes screened in this study were both derived from fossils found in Denisova Cave (2, 4). These genome sequences consist of short, unassembled DNA reads averaging ∼70 to 200 bp and were sequenced to 52- and 30-fold coverage, respectively (versus 1.3-fold [3] and 1.9-fold [1] coverage for the genomes screened previously by Agoni et al. [5]). They therefore likely represent almost-complete genome coverage. Novel retroviral insertions in archaic hominids can be recognized when orthologous flanking DNA in modern humans is not interrupted by a HERV insertion, manifesting instead as an empty preintegration site. We obtained reads containing 5′ host-virus junctions using a perl script that stringently detected the first 20 bp of the start of the HERV-K(HML2) long terminal repeat (LTR), allowing us to build libraries of reads containing hominid and viral DNA. Flanking sequences were then extracted and BLAST searched against the human genome (hg38), using blastn and a word size of 11. Apparent novel HERV-K(HML2) insertions were identified by a lack of retroviral sequence downstream of a matching flank in the modern-human genome. To confirm that putative novel insertions were not a result of sequencing artifacts, such as template switching, we used three approaches. First, we confirmed that each retrovirus was represented by multiple reads, as this would be unlikely to occur in the event of sequencing error. Second, we attempted to identify the corresponding 3′ flanks for each candidate provirus. This involved obtaining the modern-human sequence directly downstream of the flank-virus breakpoint and locating matching sequence in the Denisovan and Neanderthal genomes. Matching reads containing LTR sequence directly upstream of the 3′ flank were extracted using BLAST, utilizing word sizes of 5 to 7. This enabled matches to be returned, despite small mismatches occurring due to target site duplications (TSDs) at the virus-host junction. Lastly, the presence of matching TSDs was considered additional verification of the virus. Conversely, we then repeated each of these steps with the 3′ end of the HERV-K(HML2) LTR to identify 3′ host-virus junctions. Sequence reads are given in Table S1 in the supplemental material. Using this approach, we identified a total of nine HERV-K(HML2) proviruses present within the Neanderthal and Denisovan genomes—while also absent from hg38—that were not reported by Agoni et al. (5). However, their absence from the hg38 sequence does not necessarily imply absence from all modern humans; such elements could be insertionally polymorphic, as demonstrated by Marchi et al. (6). While we did not directly screen further modern-human genomes, we compared our nine elements against those recovered by Lee et al. (7) and Marchi et al. (8) from their analyses of high-coverage modern-human genomes. This revealed that three of these viruses were present within the data of Lee et al. (7), of which two were also identified by Marchi et al. (8). The remaining six elements therefore appear to be absent from both hg38 and the additional 401 modern-human genomes investigated in previous reports (7, 8). For four of these elements, both the 5′ and 3′ virus-host junctions were identified, while the remaining two—which were both represented by multiple sequence reads—were derived from single ends. Of the six proviruses, three were shared by both Denisovans and Neanderthals, while two were unique to Neanderthals and one to Denisovans (Fig. 1). We note that one of these proviruses, De13, is located approximately 1 kb upstream of an existing HERV-K(HML2) solo LTR in hg38; it also appears to share the same TSD, as well as a similar flanking sequence. This would normally suggest that it is a sequencing artifact. However, it is represented by multiple sequence reads in the Denisovan genome and lies within a highly repetitive region. Its flanking sequence exactly matches the region 1 kb upstream of the known solo LTR in hg38.
FIG 1

(A) Distribution of the archaic hominid insertions within Neanderthals and Denisovans, compared to hg38 and the HERV-K elements recovered in previous studies (6–8). Filled circles denote that an element is present, while open circles denote absence. N.D. indicates that there were no sequence data available. The loci of corresponding empty preintegration sites, mapped to hg38, are also given. If this occurred within a repeat, the class of repeat is listed in parentheses. Single asterisks adjacent to the last three viruses represent those elements that were also recovered in some modern humans by Lee et al. (7), while double asterisks represent those viruses also recovered by Marchi et al. (8). The distribution of these three elements is likely explained by ancestral polymorphism. (B) HERV-K(HML2) flanking sequences for the nine endogenized retroviruses identified here. The 5′ and 3′ flanking regions are shown, together with the proximal and distal ends of the HERV LTR. Nomenclature follows the convention and numbering set by Agoni et al. (5). Reads where flanking sequence was extensive were truncated and are provided in full in Table S1 in the supplemental material. + or − denotes the native orientation of the read against hg38. For proviruses for which both 5′ and 3′ flanks were obtained, the matching TSD sequence is underlined and in boldface.

(A) Distribution of the archaic hominid insertions within Neanderthals and Denisovans, compared to hg38 and the HERV-K elements recovered in previous studies (6–8). Filled circles denote that an element is present, while open circles denote absence. N.D. indicates that there were no sequence data available. The loci of corresponding empty preintegration sites, mapped to hg38, are also given. If this occurred within a repeat, the class of repeat is listed in parentheses. Single asterisks adjacent to the last three viruses represent those elements that were also recovered in some modern humans by Lee et al. (7), while double asterisks represent those viruses also recovered by Marchi et al. (8). The distribution of these three elements is likely explained by ancestral polymorphism. (B) HERV-K(HML2) flanking sequences for the nine endogenized retroviruses identified here. The 5′ and 3′ flanking regions are shown, together with the proximal and distal ends of the HERV LTR. Nomenclature follows the convention and numbering set by Agoni et al. (5). Reads where flanking sequence was extensive were truncated and are provided in full in Table S1 in the supplemental material. + or − denotes the native orientation of the read against hg38. For proviruses for which both 5′ and 3′ flanks were obtained, the matching TSD sequence is underlined and in boldface. As a result of genetic drift, neutral HERV insertions can become fixed in a population within a time frame dependent on population size and generation time. It is estimated that the average time taken to fixation in humans is ∼800,000 years (10). Since modern humans are estimated to have diverged from Denisovan and Neanderthal lineages approximately 553,000 to 589,000 years ago (2), we would expect that some—but not all—of the novel HERV-K(HML2) elements in these archaic hominids would be absent in modern humans. This is consistent with our results; six elements appear to be absent from all of the modern-human genomes investigated to date, whereas others (identified in this and previous reports [5, 7, 8]) are present within some of them. However, it remains possible that these six elements are also present in modern humans, albeit at very low allele frequencies. We suggest that at least some of the six proviruses identified in this study inserted into archaic hominids after their divergence from modern humans; however, it is also possible that they inserted before the divergence of archaic hominids and modern humans, with these ancestral polymorphisms being subsequently lost from modern humans by genetic drift. These findings will help improve our understanding of archaic hominid evolution and provide additional insight into the recent activity of the HERV-K(HML2) retroviral group.
  9 in total

1.  Neandertal and Denisovan retroviruses.

Authors:  Lorenzo Agoni; Aaron Golden; Chandan Guha; Jack Lenz
Journal:  Curr Biol       Date:  2012-06-05       Impact factor: 10.834

2.  A high-coverage genome sequence from an archaic Denisovan individual.

Authors:  Matthias Meyer; Martin Kircher; Marie-Theres Gansauge; Heng Li; Fernando Racimo; Swapan Mallick; Joshua G Schraiber; Flora Jay; Kay Prüfer; Cesare de Filippo; Peter H Sudmant; Can Alkan; Qiaomei Fu; Ron Do; Nadin Rohland; Arti Tandon; Michael Siebauer; Richard E Green; Katarzyna Bryc; Adrian W Briggs; Udo Stenzel; Jesse Dabney; Jay Shendure; Jacob Kitzman; Michael F Hammer; Michael V Shunkov; Anatoli P Derevianko; Nick Patterson; Aida M Andrés; Evan E Eichler; Montgomery Slatkin; David Reich; Janet Kelso; Svante Pääbo
Journal:  Science       Date:  2012-08-30       Impact factor: 47.728

3.  A draft sequence of the Neandertal genome.

Authors:  Johannes Krause; Adrian W Briggs; Tomislav Maricic; Udo Stenzel; Martin Kircher; Nick Patterson; Richard E Green; Heng Li; Weiwei Zhai; Markus Hsi-Yang Fritz; Nancy F Hansen; Eric Y Durand; Anna-Sapfo Malaspinas; Jeffrey D Jensen; Tomas Marques-Bonet; Can Alkan; Kay Prüfer; Matthias Meyer; Hernán A Burbano; Jeffrey M Good; Rigo Schultz; Ayinuer Aximu-Petri; Anne Butthof; Barbara Höber; Barbara Höffner; Madlen Siegemund; Antje Weihmann; Chad Nusbaum; Eric S Lander; Carsten Russ; Nathaniel Novod; Jason Affourtit; Michael Egholm; Christine Verna; Pavao Rudan; Dejana Brajkovic; Željko Kucan; Ivan Gušic; Vladimir B Doronichev; Liubov V Golovanova; Carles Lalueza-Fox; Marco de la Rasilla; Javier Fortea; Antonio Rosas; Ralf W Schmitz; Philip L F Johnson; Evan E Eichler; Daniel Falush; Ewan Birney; James C Mullikin; Montgomery Slatkin; Rasmus Nielsen; Janet Kelso; Michael Lachmann; David Reich; Svante Pääbo
Journal:  Science       Date:  2010-05-07       Impact factor: 47.728

4.  Landscape of somatic retrotransposition in human cancers.

Authors:  Eunjung Lee; Rebecca Iskow; Lixing Yang; Omer Gokcumen; Psalm Haseley; Lovelace J Luquette; Jens G Lohr; Christopher C Harris; Li Ding; Richard K Wilson; David A Wheeler; Richard A Gibbs; Raju Kucherlapati; Charles Lee; Peter V Kharchenko; Peter J Park
Journal:  Science       Date:  2012-06-28       Impact factor: 47.728

5.  Genetic history of an archaic hominin group from Denisova Cave in Siberia.

Authors:  David Reich; Richard E Green; Martin Kircher; Johannes Krause; Nick Patterson; Eric Y Durand; Bence Viola; Adrian W Briggs; Udo Stenzel; Philip L F Johnson; Tomislav Maricic; Jeffrey M Good; Tomas Marques-Bonet; Can Alkan; Qiaomei Fu; Swapan Mallick; Heng Li; Matthias Meyer; Evan E Eichler; Mark Stoneking; Michael Richards; Sahra Talamo; Michael V Shunkov; Anatoli P Derevianko; Jean-Jacques Hublin; Janet Kelso; Montgomery Slatkin; Svante Pääbo
Journal:  Nature       Date:  2010-12-23       Impact factor: 49.962

6.  Genomewide screening reveals high levels of insertional polymorphism in the human endogenous retrovirus family HERV-K(HML2): implications for present-day activity.

Authors:  Robert Belshaw; Anna L A Dawson; John Woolven-Allen; Joanna Redding; Austin Burt; Michael Tristem
Journal:  J Virol       Date:  2005-10       Impact factor: 5.103

7.  The complete genome sequence of a Neanderthal from the Altai Mountains.

Authors:  Kay Prüfer; Fernando Racimo; Nick Patterson; Flora Jay; Sriram Sankararaman; Susanna Sawyer; Anja Heinze; Gabriel Renaud; Peter H Sudmant; Cesare de Filippo; Heng Li; Swapan Mallick; Michael Dannemann; Qiaomei Fu; Martin Kircher; Martin Kuhlwilm; Michael Lachmann; Matthias Meyer; Matthias Ongyerth; Michael Siebauer; Christoph Theunert; Arti Tandon; Priya Moorjani; Joseph Pickrell; James C Mullikin; Samuel H Vohr; Richard E Green; Ines Hellmann; Philip L F Johnson; Hélène Blanche; Howard Cann; Jacob O Kitzman; Jay Shendure; Evan E Eichler; Ed S Lein; Trygve E Bakken; Liubov V Golovanova; Vladimir B Doronichev; Michael V Shunkov; Anatoli P Derevianko; Bence Viola; Montgomery Slatkin; David Reich; Janet Kelso; Svante Pääbo
Journal:  Nature       Date:  2013-12-18       Impact factor: 49.962

8.  Unfixed endogenous retroviral insertions in the human population.

Authors:  Emanuele Marchi; Alex Kanapin; Gkikas Magiorkinis; Robert Belshaw
Journal:  J Virol       Date:  2014-06-11       Impact factor: 5.103

9.  Neanderthal and Denisovan retroviruses in modern humans.

Authors:  Emanuele Marchi; Alex Kanapin; Matthew Byott; Gkikas Magiorkinis; Robert Belshaw
Journal:  Curr Biol       Date:  2013-11-18       Impact factor: 10.834

  9 in total
  10 in total

1.  HERV-K HML-2 diversity among humans.

Authors:  Jack Lenz
Journal:  Proc Natl Acad Sci U S A       Date:  2016-04-08       Impact factor: 11.205

2.  Genome-wide amplification of proviral sequences reveals new polymorphic HERV-K(HML-2) proviruses in humans and chimpanzees that are absent from genome assemblies.

Authors:  Catriona M Macfarlane; Richard M Badge
Journal:  Retrovirology       Date:  2015-04-28       Impact factor: 4.602

Review 3.  Molecular functions of human endogenous retroviruses in health and disease.

Authors:  Maria Suntsova; Andrew Garazha; Alena Ivanova; Dmitry Kaminsky; Alex Zhavoronkov; Anton Buzdin
Journal:  Cell Mol Life Sci       Date:  2015-06-18       Impact factor: 9.261

4.  Discovery of unfixed endogenous retrovirus insertions in diverse human populations.

Authors:  Julia Halo Wildschutte; Zachary H Williams; Meagan Montesion; Ravi P Subramanian; Jeffrey M Kidd; John M Coffin
Journal:  Proc Natl Acad Sci U S A       Date:  2016-03-21       Impact factor: 11.205

5.  The effect of life history on retroviral genome invasions.

Authors:  Ravinder K Kanda; Tim Coulson
Journal:  PLoS One       Date:  2015-02-18       Impact factor: 3.240

6.  Classification and characterization of human endogenous retroviruses; mosaic forms are common.

Authors:  Laura Vargiu; Patricia Rodriguez-Tomé; Göran O Sperber; Marta Cadeddu; Nicole Grandi; Vidar Blikstad; Enzo Tramontano; Jonas Blomberg
Journal:  Retrovirology       Date:  2016-01-22       Impact factor: 4.602

7.  Variation in proviral content among human genomes mediated by LTR recombination.

Authors:  Jainy Thomas; Hervé Perron; Cédric Feschotte
Journal:  Mob DNA       Date:  2018-12-18

8.  Motifome comparison between modern human, Neanderthal and Denisovan.

Authors:  Matyas F Cserhati; Mary-Ellen Mooter; Lauren Peterson; Benjamin Wicks; Peng Xiao; Mark Pauley; Chittibabu Guda
Journal:  BMC Genomics       Date:  2018-06-18       Impact factor: 3.969

Review 9.  The diversity and evolution of retroviruses: Perspectives from viral "fossils".

Authors:  Jialu Zheng; Yutong Wei; Guan-Zhu Han
Journal:  Virol Sin       Date:  2022-01-19       Impact factor: 4.327

Review 10.  High-Throughput Sequencing is a Crucial Tool to Investigate the Contribution of Human Endogenous Retroviruses (HERVs) to Human Biology and Development.

Authors:  Maria Paola Pisano; Nicole Grandi; Enzo Tramontano
Journal:  Viruses       Date:  2020-06-11       Impact factor: 5.048

  10 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.