| Literature DB >> 35891351 |
Yian Li1,2, Guojie Zhang3,4, Jie Cui1.
Abstract
Human endogenous retroviruses (HERVs) are viral "fossils" in the human genome that originated from the ancient integration of exogenous retroviruses. Although HERVs have sporadically been reported in nonhuman primate genomes, their deep origination in pan-primates remains to be explored. Hence, based on the in silico genomic mining of full-length HERVs in 49 primates, we performed the largest systematic survey to date of the distribution, phylogeny, and functional predictions of HERVs. Most importantly, we obtained conclusive evidence of nonhuman origin for most contemporary HERVs. We found that various supergroups, including HERVW9, HUERSP, HSERVIII, HERVIPADP, HERVK, and HERVHF, were widely distributed in Strepsirrhini, Platyrrhini (New World monkeys) and Catarrhini (Old World monkeys and apes). We found that numerous HERVHFs are spread by vertical transmission within Catarrhini and one HERVHF was traced in 17 species, indicating its ancient nature. We also discovered that 164 HERVs were likely involved in genomic rearrangement and 107 HERVs were potentially coopted in the form of noncoding RNAs (ncRNAs) in humans. In summary, we provided comprehensive data on the deep origination of modern HERVs in pan-primates.Entities:
Keywords: evolution; genomic rearrangement; human endogenous retroviruses; origin; pan-primates; vertical transmission
Mesh:
Year: 2022 PMID: 35891351 PMCID: PMC9323773 DOI: 10.3390/v14071370
Source DB: PubMed Journal: Viruses ISSN: 1999-4915 Impact factor: 5.818
Figure 1The distribution, classification, and vertical transmission of F-HERVs in 49 primates. (A) The phylogenetic tree of all F-HERVs identified. The classifications of F-HERVs are indicated in different colors, and “Unknown” represents F-HERVs that cannot be classified according to their phylogenetic relationships with reference sequences or the similarity of their RT domains with reference RT sequences. Ultrafast bootstrap approximation (UFBoot) values over 95 are provided beside the nodes. (B) Histogram showing the number of F-HERVs in each species, and the classifications of HERVs are indicated in different colors. (C) The diagram shows the standards that were used to screen vertical transmission events. The blue boxes, pink boxes, and green boxes represent the flanking sequences (Flanking), LTR sequences (5LTR or 3LTR), and internal sequences (Int) of F-HERVs, respectively. The dashed boxes indicate the BLASTN searches we performed and the thresholds of each search (in red). The white segments represent the mismatch or gap in BLASTN. The detailed procedure of vertical transmission identification is provided in the Materials and Methods. (D) The left panel shows the rooted phylogenetic tree and the divergence times of 50 primates (Tupaia glis was used as the root), and the numbers of possible vertical transmission events are indicated on the corresponding nodes by the area of pies. The right panel shows the numbers of vertical transmission-related F-HERVs in each species we studied. The classifications of the F-HERVs are indicated in different colors. (E) The image shows the alignments of the 5′LTR and flanking sequences (upper) and the 3′LTR and flanking sequences (lower) of a widely vertically transmitted F-HERV-H in 17 species. The names of the species are listed on the left, and red boxes indicate the LTR sequences of F-HERVH. The full alignments of the LTR and flanking sequences are provided in the Supplementary Materials.
Figure 2F-HERVs may be involved in host genomic recombination or transcribed into ncRNAs. (A) The pie chart shows the number of F-HERVs with clustered LTRs (dark red) and HERVs with non-clustered LTRs (blue green). (B) The phylogenetic tree (upper) shows an example of LTR separation in one F-HERV-H of humans. The separation of the two LTRs is indicated in red, and ultrafast bootstrap approximation (UFBoot) values over 95 are provided beside the nodes. The BLASTN results of these two pairs of LTRs are shown in the table (lower) together with the BLASTN results. “qcovhsp” indicates the coverage of the query, and the red arrows indicate better alignments. The detailed alignments are shown in the Supplementary Data. (C) The diagram shows the standards used to screen F-HERV-related host genomic recombination events. The pink boxes and green boxes represent the LTR sequences (5LTR or 3LTR) or internal sequences (int) of F-HERVs, respectively. The dashed boxes show the BLASTN searches that we performed and the thresholds of each search (in red). The details of the genomic rearrangement analysis are provided in the Materials and Methods. (D) Histogram showing the number of recombination-related F-HERVs in each species, and the classifications of the F-HERVs are indicated in different colors. “Unknown” represents F-HERVs that cannot be classified according to their phylogenetic relationships with reference sequences or the similarity of their RT domains with reference RT sequences. (E) Histogram showing the number of F-HERVs that share the same locations and orientations with known ncRNAs on each human chromosome. The classifications of the F-HERVs are indicated in different colors.
The annotation of HERV related ncRNAs in human.
| Chromosome | Start | End | HERVname | Strand | Related-ncRNA |
|---|---|---|---|---|---|
| 1 | 22997488 | 23002547 | Homo_sapiens_1_23000272-23002212-HERVHF | − | NONHSAG057580.1 |
| 1 | 43087974 | 43091095 | Homo_sapiens_1_43087972-43089561-HERVHF | + | NONHSAG056389.1 |
| 1 | 68386003 | 68391994 | Homo_sapiens_1_68388791-68390135-HERVHF | + | NONHSAG001773.3 |
| 1 | 82354581 | 82360561 | Homo_sapiens_1_82356924-82357985-HERVHF | − | ENSG00000233290 |
| 1 | 82955299 | 82961592 | Homo_sapiens_1_82956772-82959208-HERVHF | + | ENSG00000230817 |
| 1 | 209451675 | 209454012 | Homo_sapiens_1_209451456-209452546-HERVHF | + | NONHSAG057153.1 |
| 1 | 224840009 | 224846095 | Homo_sapiens_1_224842067-224843662-HERVHF | − | ENSG00000286719 |
| 1 | 228942542 | 228942868 | Homo_sapiens_1_228948757-228950104-HERVHF | + | NONHSAG057288.1 |
| 1 | 232120241 | 232123943 | Homo_sapiens_1_232120704-232121847-HERVHF | + | NONHSAG004630.2 |
| 1 | 241433890 | 241439885 | Homo_sapiens_1_241436435-241438237-HERVHF | + | ENSG00000287516 |
| 2 | 5000768 | 5003355 | Homo_sapiens_2_5003505-5005037-HERVHF | + | NONHSAG077068.1 |
| 2 | 16791950 | 16797713 | Homo_sapiens_2_16793801-16795145-HERVHF | − | NONHSAG078497.1 |
| 2 | 34789818 | 34796058 | Homo_sapiens_2_34792231-34793755-HERVHF | + | NONHSAG077257.1 |
| 2 | 38080800 | 38086513 | Homo_sapiens_2_38082627-38083973-HERVHF | − | ENSG00000138061 |
| 2 | 67333734 | 67337603 | Homo_sapiens_2_67334137-67335887-HERVHF | + | NONHSAG028042.3 |
| 2 | 69789900 | 69795859 | Homo_sapiens_2_69792965-69796021-HUERSP | − | NONHSAG028084.2 |
| 2 | 77965137 | 77970868 | Homo_sapiens_2_77967627-77968976-HERVHF | + | NONHSAG077496.1 |
| 2 | 192506078 | 192513184 | Homo_sapiens_2_192509946-192511342-HERVHF | + | NONHSAG030125.2 |
| 2 | 215922303 | 215928129 | Homo_sapiens_2_215924899-215926434-HERVHF | + | NONHSAG078151.1 |
| 2 | 224225331 | 224230988 | Homo_sapiens_2_224227814-224229160-HERVHF | + | NONHSAG078214.1 |
| 2 | 237606784 | 237611630 | Homo_sapiens_2_237609479-237610820-HERVHF | + | NONHSAG110040.1 |
| 3 | 21189031 | 21194139 | Homo_sapiens_3_21190643-21192440-HERVHF | − | ENSG00000282987 |
| 3 | 54634482 | 54638068 | Homo_sapiens_3_54636349-54637755-HERVHF | − | ENSG00000265992 |
| 3 | 112418410 | 112423366 | Homo_sapiens_3_112419312-112420768-HERVHF | − | NONHSAG035734.2 |
| 3 | 115798715 | 115799166 | Homo_sapiens_3_115795176-115796709-HERVHF | − | NONHSAG085690.1 |
| 3 | 155274423 | 155278762 | Homo_sapiens_3_155276457-155278448-HERVHF | − | NONHSAG036456.2 |
| 3 | 186660747 | 186663692 | Homo_sapiens_3_186660542-186661888-HERVHF | + | ENSG00000113905 |
| 4 | 3927445 | 3930682 | Homo_sapiens_4_3929901-3931242-HERVHF | + | NONHSAG087348.2 |
| 4 | 17000545 | 17003928 | Homo_sapiens_4_17000127-17001778-HERVHF | + | NONHSAG037572.2 |
| 4 | 24500975 | 24501427 | Homo_sapiens_4_24503534-24505060-HERVHF | + | NONHSAG037630.2 |
| 4 | 27974874 | 27981319 | Homo_sapiens_4_27976550-27977552-HERVHF | + | NONHSAG037691.2 |
| 4 | 92271492 | 92275299 | Homo_sapiens_4_92273363-92274770-HERVHF | − | ENSG00000249152 |
| 4 | 103553770 | 103557353 | Homo_sapiens_4_103555460-103556971-HERVHF | − | ENSG00000250920 |
| 4 | 128640901 | 128644450 | Homo_sapiens_4_128642726-128644003-HERVHF | − | NONHSAG088517.2 |
| 4 | 145698823 | 145703505 | Homo_sapiens_4_145701612-145702617-HERVHF | + | ENSG00000237136 |
| 4 | 152741345 | 152747172 | Homo_sapiens_4_152743196-152744540-HERVHF | − | NONHSAG039129.2 |
| 4 | 175461163 | 175467003 | Homo_sapiens_4_175463047-175464647-HERVHF | − | ENSG00000249945 |
| 5 | 92826033 | 92829706 | Homo_sapiens_5_92826486-92827829-HERVHF | + | ENSG00000248588 |
| 5 | 136303790 | 136307028 | Homo_sapiens_5_136303833-136305180-HERVHF | + | ENSG00000250947 |
| 5 | 161245405 | 161254586 | Homo_sapiens_5_161251016-161252646-HERVHF | + | NONHSAG090654.1 |
| 6 | 16259010 | 16264893 | Homo_sapiens_6_16260854-16262201-HERVHF | − | ENSG00000282024 |
| 6 | 18754142 | 18756902 | Homo_sapiens_6_18755932-18757277-HERVHF | − | NONHSAG043117.2 |
| 6 | 80509795 | 80515805 | Homo_sapiens_6_80511941-80513513-HERVHF | − | NONHSAG113295.1 |
| 6 | 94553917 | 94559610 | Homo_sapiens_6_94555806-94557152-HERVHF | − | NONHSAG044390.2 |
| 6 | 97779489 | 97785327 | Homo_sapiens_6_97782122-97783636-HERVHF | + | ENSG00000271860 |
| 6 | 123582333 | 123588007 | Homo_sapiens_6_123584156-123585562-HERVHF | − | ENSG00000186439 |
| 6 | 125701846 | 125707764 | Homo_sapiens_6_125703727-125705069-HERVHF | − | ENSG00000237742 |
| 6 | 126851224 | 126854456 | Homo_sapiens_6_126851273-126852794-HERVHF | + | NONHSAG044785.3 |
| 6 | 131295347 | 131301206 | Homo_sapiens_6_131297975-131299503-HERVHF | + | NONHSAG093612.2 |
| 6 | 131338799 | 131344566 | Homo_sapiens_6_131340338-131342739-HERVHF | + | NONHSAG093612.2 |
| 6 | 131903830 | 131907420 | Homo_sapiens_6_131904209-131905555-HERVHF | + | ENSG00000236673 |
| 6 | 144923164 | 144928866 | Homo_sapiens_6_144925698-144927040-HERVHF | + | NONHSAG095837.2 |
| 7 | 26024199 | 26029809 | Homo_sapiens_7_26026061-26027405-HERVHF | − | NONHSAG047156.2 |
| 7 | 34300132 | 34300573 | Homo_sapiens_7_34301985-34303122-HERVHF | − | NONHSAG047318.2 |
| 7 | 102975230 | 102978736 | Homo_sapiens_7_102976263-102977254-HERVHF | + | ENSG00000230257 |
| 7 | 125920130 | 125924112 | Homo_sapiens_7_125920071-125921895-HERVHF | + | ENSG00000197462 |
| 7 | 155238821 | 155244070 | Homo_sapiens_7_155240740-155241657-HERVK | − | NONHSAG049243.2 |
| 8 | 71676972 | 71680514 | Homo_sapiens_8_71677433-71678968-HERVHF | + | ENSG00000254277 |
| 8 | 90090224 | 90093794 | Homo_sapiens_8_90091914-90093348-HERVHF | − | ENSG00000104327 |
| 8 | 97200769 | 97206658 | Homo_sapiens_8_97202388-97204973-HERVHF | + | NONHSAG098987.1 |
| 8 | 114284546 | 114287727 | Homo_sapiens_8_114284508-114286044-HERVHF | + | ENSG00000254339 |
| 8 | 132080235 | 132086002 | Homo_sapiens_8_132081909-132083445-HERVHF | − | ENSG00000132297 |
| 9 | 12950832 | 12954130 | Homo_sapiens_9_12950845-12952399-HERVHF | + | NONHSAG101172.2 |
| 9 | 80137297 | 80143055 | Homo_sapiens_9_80139873-80141469-HERVHF | + | NONHSAG052646.2 |
| 9 | 85461120 | 85466955 | Homo_sapiens_9_85461166-85462902-HERVHF | + | NONHSAG052703.2 |
| 9 | 115475420 | 115478923 | Homo_sapiens_9_115475976-115477349-HERVHF | + | NONHSAG053288.3 |
| 10 | 6797081 | 6802954 | Homo_sapiens_10_6798770-6800364-HERVHF | − | NONHSAG005151.3 |
| 10 | 25716420 | 25722928 | Homo_sapiens_10_25718978-25720776-HERVHF | + | ENSG00000280809 |
| 11 | 6366039 | 6371662 | Homo_sapiens_11_6368276-6369885-HERVHF | + | NONHSAG007525.2 |
| 11 | 27629072 | 27632889 | Homo_sapiens_11_27630864-27632291-HERVHF | − | ENSG00000254934 |
| 11 | 94641661 | 94647315 | Homo_sapiens_11_94644134-94645475-HERVHF | + | ENSG00000255666 |
| 11 | 96499960 | 96506627 | Homo_sapiens_11_96501724-96503654-HERVHF | + | ENSG00000183340 |
| 11 | 96590439 | 96593677 | Homo_sapiens_11_96590449-96591982-HERVHF | + | ENSG00000254587 |
| 11 | 130565609 | 130570121 | Homo_sapiens_11_130566060-130567548-HERVHF | − | NONHSAG010050.2 |
| 11 | 130753494 | 130756702 | Homo_sapiens_11_130755373-130756704-HERVHF | − | NONHSAG010050.2 |
| 12 | 4018623 | 4023691 | Homo_sapiens_12_4021109-4022208-HERVHF | + | ENSG00000256969 |
| 12 | 11462168 | 11468022 | Homo_sapiens_12_11463877-11465381-HERVHF | − | ENSG00000121335 |
| 12 | 34269097 | 34274242 | Homo_sapiens_12_34268101-34269869-HERVHF | + | NONHSAG010874.2 |
| 12 | 70444894 | 70450107 | Homo_sapiens_12_70446553-70447732-HERVK | − | NONHSAG011664.2 |
| 12 | 86941530 | 86944748 | Homo_sapiens_12_86941432-86943069-HERVHF | + | NONHSAG064903.1 |
| 13 | 42868001 | 42871007 | Homo_sapiens_13_42869513-42870545-HERVHF | − | NONHSAG013351.2 |
| 13 | 48866771 | 48872457 | Homo_sapiens_13_48868391-48870330-HERVHF | − | NONHSAG067525.1 |
| 13 | 51169866 | 51175008 | Homo_sapiens_13_51172521-51173517-HERVHF | + | NONHSAG013541.3 |
| 13 | 54127417 | 54133159 | Homo_sapiens_13_54129305-54130960-HERVHF | − | ENSG00000234787 |
| 13 | 66142250 | 66147037 | Homo_sapiens_13_66143157-66144503-HERVHF | − | NONHSAG013698.2 |
| 13 | 79276611 | 79279830 | Homo_sapiens_13_79276654-79278001-HERVHF | + | NONHSAG067153.1 |
| 14 | 38193319 | 38196529 | Homo_sapiens_14_38193317-38194783-HERVHF | + | ENSG00000258649 |
| 14 | 41521426 | 41521883 | Homo_sapiens_14_41518469-41520184-HERVHF | + | NONHSAG014802.2 |
| 14 | 48262389 | 48263146 | Homo_sapiens_14_48256895-48258584-HERVHF | − | ENSG00000287492 |
| 15 | 74354141 | 74359786 | Homo_sapiens_15_74355867-74357936-HERVHF | + | ENSG00000260266 |
| 15 | 87831107 | 87837024 | Homo_sapiens_15_87833731-87835137-HERVHF | + | NONHSAG017784.2 |
| 16 | 60078536 | 60084582 | Homo_sapiens_16_60081354-60082700-HERVHF | + | NONHSAG071739.1 |
| 16 | 65229803 | 65233421 | Homo_sapiens_16_65231504-65233039-HERVHF | − | ENSG00000260834 |
| 18 | 28693028 | 28696068 | Homo_sapiens_18_28694974-28696314-HERVHF | − | NONHSAG075074.1 |
| 18 | 56417745 | 56421344 | Homo_sapiens_18_56418118-56419466-HERVHF | + | NONHSAG074828.1 |
| 18 | 57064647 | 57070296 | Homo_sapiens_18_57068491-57069834-HERVHF | − | ENSG00000258609 |
| 18 | 73327171 | 73330369 | Homo_sapiens_18_73327166-73328509-HERVHF | + | ENSG00000261780 |
| 19 | 22568269 | 22575022 | Homo_sapiens_19_22570768-22572352-HERVHF | + | NONHSAG025320.2 |
| 20 | 12756027 | 12759632 | Homo_sapiens_20_12756400-12757916-HERVHF | + | NONHSAG031288.2 |
| 20 | 40269047 | 40274769 | Homo_sapiens_20_40271576-40272881-HERVHF | + | NONHSAG081519.1 |
| 21 | 17124024 | 17127764 | Homo_sapiens_21_17123959-17125734-HERVHF | + | NONHSAG110806.1 |
| 21 | 26227947 | 26233485 | Homo_sapiens_21_26229594-26231104-HERVHF | − | NONHSAG032575.2 |
| 21 | 42800845 | 42803999 | Homo_sapiens_21_42802518-42804296-HERVHF | − | NONHSAG083198.1 |
| X | 71264372 | 71272628 | Homo_sapiens_X_71266645-71268493-HERVHF | + | ENSG00000147140 |
| X | 94698818 | 94701832 | Homo_sapiens_X_94700559-94702091-HERVHF | − | NONHSAG054922.2 |
| X | 111543806 | 111549675 | Homo_sapiens_X_111546380-111547978-HERVHF | + | NONHSAG055109.3 |
| X | 122227333 | 122227787 | Homo_sapiens_X_122224556-122226109-HERVHF | + | NONHSAG055239.2 |