Literature DB >> 28482857

Horizontal gene transfer is not a hallmark of the human genome.

Steven L Salzberg1.   

Abstract

Crisp et al. recently reported that 145 human genes have been horizontally transferred from distant species. Here, I re-analyze those genes listed by Crisp et al. as having the highest certainty of having been horizontally transferred, as well as 17 further genes from the 2001 human genome article, and find little or no evidence to support claims of horizontal gene transfer (HGT).Please see related Research article: https://genomebiology.biomedcentral.com/articles/10.1186/s13059-015-0607-3.

Entities:  

Mesh:

Year:  2017        PMID: 28482857      PMCID: PMC5422933          DOI: 10.1186/s13059-017-1214-2

Source DB:  PubMed          Journal:  Genome Biol        ISSN: 1474-7596            Impact factor:   13.583


Introduction

A recent study by Crisp et al. [1] re-examined a claim, originally made in the landmark 2001 human genome paper, that bacteria had horizontally transferred 223 genes into a vertebrate ancestor of humans [2]. That claim was refuted soon after the original report [3, 4]. Using an alignment-based scoring scheme, the study by Crisp et al. [1] reported that 145 human genes, including 17 of those from the 2001 study, had been horizontally transferred from distant species. Here, I describe a re-analysis of these 17 genes and of the 28 highest-confidence genes newly claimed by Crisp et al. [1] to have been horizontally transferred, taking a more skeptical perspective, and find little or no evidence to support claims of horizontal gene transfer (HGT). Hundreds of eukaryotic genomes and thousands of bacterial genomes have been sequenced in the 15 years since the human genome was published. In their recent report, Crisp et al. [1] argue that, with the availability of this far larger collection of genomes, the likelihood of false HGT findings that are actually the result of gene loss is now greatly reduced. Their reanalysis, which was based on a combination of BLAST searches and phylogenetic trees, identified hundreds of “foreign” genes in animals; this led them to claim that HGT “has occurred on a previously unsuspected scale in metazoans” and that it is a significant factor in animal evolution. In this study, I re-examined the claims of Crisp et al. [1] focusing on the human genes. Instead of using a large-scale, automated analysis, which by its very nature could enrich the results for artifactual findings, I looked at each human gene individually to determine whether the evidence is sufficient to support the conclusion that HGT occurred. An important principal here is that extraordinary claims require extraordinary evidence: there is no doubt that the vast majority of human genes owe their presence in the human genome to the normal process of inheritance by vertical descent. Thus, if other, more mundane processes can explain the alignments of a human gene sequence, these explanations are far more likely than HGT.

Results

For my re-analysis, I re-aligned the 17 human genes that were originally reported as having undergone bacterial-vertebrate transfer (BVT), a finding that has been rejected by our work [3] and that of others [4, 5], but re-claimed by Crisp et al. [1] (Table 1). I found that the evidence does not support HGT for any of them. (One important point worth noting here is that Crisp et al. listed some of these genes as “confirmed” by Salzberg et al. [3]. This was not the case; our previous study invalidated most of the previously claimed HGT events, but was not able to dismiss all of them. Our study made it clear that we did not consider the presence of the remaining genes to be the result of HGT events.) Crisp et al. [1] reported a total of 145 human genes that they claimed to be the result of HGT; 39 of these are labeled in their highest confidence group, class A. Of these 39, seven are included in the first group of 17, leaving 32 newly claimed HGT events. I examined these 32 class A genes (Table 2) and again find no evidence for HGT. A detailed, gene-by-gene description of these analyses can be found in Additional file 1 and the sequences of the genes in Tables 1 and 2 can be found in Additional file 2.
Table 1

Re-analysis of 17 human genes claimed as examples of horizontal gene transfer

Gene group numberEnsembl gene IDGene nameBest metazoan, non-chordate matchBest non-metazoan matchExplanation
SpeciesBit scoreSpeciesBit scoreCrisp et al.This study
1ENSG00000166743, ENSG00000183747, ENSG00000005187, ENSG00000183549Acyl-CoA synthetase medium-chain family members 1, 2A, 3, and 5 Lingula anatina or Aplysia californica 686 Desulfotomaculum thermocisternum or D. kuznetsovii 668Bacterial HGTNo HGT
2ENSG00000047457Ceruloplasmin (ferroxidase) Exaiptasia pallida (sea anemone)898 Klebsormidium flaccidum (alga)543Plant HGTNo HGT
3ENSG00000107618, ENSG00000265203Retinol-binding protein 3None Stackebrandtia nassauensis 152Bacterial HGTGene loss
4ENSG00000181019, ENSG00000124588NAD(P)H dehydrogenase, quinone 1 and 2 L. anatina or Capitella teleta 245 Sinorhizobium fredii 204Bacterial HGTNo HGT
5ENSG00000132744Aspartoacylase (aminocyclase) 3 Saccoglossus kowalevskii 262 Oscillatoria sp. PCC 10802249Bacterial HGTNo HGT
6ENSG00000170961, ENSG00000105509, ENSG00000103044Hyaluronan synthase 1, 2, and 3 Papilio xuthus 78 Lichtheimia ramosa 333Fungal HGTRate variation
7ENSG00000175806Methionine sulfoxide reductase A Lottia gigantea 302 Pleurocapsa sp. PCC 7319306Bacterial HGTNo HGT
8ENSG00000095596Cytochrome P450, family 26, subfamily A, polypeptide 1 C. teleta 396 Geitlerinema sp. PCC 7407302Bacterial HGTNo HGT
9ENSG00000113790Enoyl-CoA, hydratase/3-hydroxyacyl CoA dehydrogenase L. anatina 645 Capsaspora owczarzaki 582Protist HGTNo HGT
10ENSG00000166532Ribosomal modification protein rimK-like family member B A. californica 143 Kitasatospora cheerisanensis 194Archaeal HGTRate variation
11ENSG00000172508Carnosine synthase 1 Crassostrea gigas 614 Perkinsus marinus 151Protist HGTNo HGT

Columns 5 and 7 contain the bitscores of the best BLAST alignment to a gene from the species in columns 4 and 6. Gene group number (column 1) refers to the number used in the main text

Table 2

Re-analysis of genes claimed to be newly discovered human horizontal gene transfers (HGT) in the Crisp et al. [1] study

Gene IDGene nameResults from Crisp et al. [1] studyResults from this study
HGT indexSource HGT taxonBest non-chordate metazoan matchBit scoreBest non-chordate metazoan matchMatching protein IDBit scoreNew HGT index
ENSG00000070269C14orf101337ProtistB4LPG637 Lingula anatina XP_013409033566–192
ENSG00000102805Ceroid-lipofuscinosis, neuronal 584ProtistE3MZU239 Lottia gigantea XP_009059770338–215
ENSG00000116721PRAME family member 172Protista A8MVS244No hits
ENSG00000157358PRAME family member 1545Protista F1SUY589No hits
ENSG00000232423PRAME family member 645Protista F1SUY592No hits
ENSG00000117115, ENSG00000142619, ENSG00000142623, ENSG00000159339Protein-arginine deiminase, types 2,3,1,5464BacteriaB4JS8145 Priapulus caudatus XP_014670176560–51
ENSG000001254585′,3′-nucleotidase, cytosolic135BacteriaE3LP7139 Saccoglossus kowalevskii XP_006825056222–48
ENSG000002053095′,3′-nucleotidase, mitochondrial108BacteriaB5DJB441 L. anatina XP_013404549237–88
ENSG00000133561, ENSG00000133574, ENSG00000179144, ENSG00000196329, ENSG00000213203GTPase, IMAP family members 6, 4, 7, 5, and 151PlantF1QNI494 Crassostrea gigas XP_011437597208–63
ENSG00000136153LIM domain 747ProtistH9JLZ286 S. kowalevskii XP_006813620268–135
ENSG00000136830Family with sequence similarity 129, member B74ProtistG6DB4644 Acropora digitifera XP_0157744621153
ENSG00000140718Fat mass and obesity associated97PlantC9J4C392 S. kowalevskii ALR88588342–153
ENSG00000148288Globoside alpha-1,3-N-acetylgalactosaminyltransferase 191BacteriaA8QE8739 Cimex lectularis XP_014240367.13793
ENSG00000154122Ankylosis, progressive homolog (mouse)129ProtistF5GXN790 S. kowalevskii XP_006818212 388–169
ENSG00000172757Cofilin 131FungiB3S0K880 Helobdella robusta XP_0090112171083
ENSG00000175573Chromosome 11 open reading frame 6860FungiD6WQ6948 S. kowalevskii XP_002740403.1157–49
ENSG00000177181Ribosomal modification protein rimK-like family member A74ArchaeaE1GHX352 Aplysia californica XP_0129361561242
ENSG00000212907Mitochondrially encoded NADH dehydrogenase 4 L69ProtistP1555468 Xenoturbella bocki YP_8509846771
ENSG00000216937Coiled-coil domain containing protein 748ProtistA7SDV064No hits
ENSG00000242265Retrotransposon-derived protein PEG1052FungiG0MWG349No hits
ENSG00000256062ABO blood group (Histo-blood group ABO system transferase)84BacteriaH3INK737 S. kowalevskii XM_006825840218–97

The HGT index, defined by Crisp et al. [1] as the difference in the best bitscore of a BLAST match to a non-metazoan and a metazoan species, is shown along with the bitscore of the best metazoan match. The best metazoan match excluded any matches to the phylum Chordata for these human genes. All of the genes in this table were reported by Crisp et al. [1] as high-confidence (class A) HGT. The recomputed HGT index (last column) is computed by subtracting the bitscore of best non-metazoan found by Crisp et al. [1] from that of the best non-chordate metazoans found by the new searches reported here. “No hits” means that no significant alignments were found to any non-chordate metazoans

aFor PRAME family members 1, 6, and 15, the protist alignment found by Crisp et al. [1] is a false positive caused by contamination. See main text for details

Re-analysis of 17 human genes claimed as examples of horizontal gene transfer Columns 5 and 7 contain the bitscores of the best BLAST alignment to a gene from the species in columns 4 and 6. Gene group number (column 1) refers to the number used in the main text Re-analysis of genes claimed to be newly discovered human horizontal gene transfers (HGT) in the Crisp et al. [1] study The HGT index, defined by Crisp et al. [1] as the difference in the best bitscore of a BLAST match to a non-metazoan and a metazoan species, is shown along with the bitscore of the best metazoan match. The best metazoan match excluded any matches to the phylum Chordata for these human genes. All of the genes in this table were reported by Crisp et al. [1] as high-confidence (class A) HGT. The recomputed HGT index (last column) is computed by subtracting the bitscore of best non-metazoan found by Crisp et al. [1] from that of the best non-chordate metazoans found by the new searches reported here. “No hits” means that no significant alignments were found to any non-chordate metazoans aFor PRAME family members 1, 6, and 15, the protist alignment found by Crisp et al. [1] is a false positive caused by contamination. See main text for details Of the 17 genes from the original human genome paper that Crisp et al. [1] claim are true examples of HGT, my analysis finds that 12 genes fail to pass the authors' own BLAST-based test for HGT, because their closest metazoan match has a bitscore that is greater than the best non-metazoan match (Table 1). Of the 28 genes representing new claims of HGT (Table 2), 26 fail the initial screen for HGT candidates, either because they fail the original BLAST bitscore test, because they represent contaminants in draft genomes, or because they are known mitochondrial or retrotransposed genes. The remaining seven genes (five from Table 1 and two from Table 2) include three close paralogs (HAS1–3) and thus represent four hypothesized HGT events. A combination of gene loss and evolutionary rate variation is more than adequate to explain these genes: among other reasons, the alignments and bitscores are the result of screening more than 20,000 human genes, and one might expect a few genes from this large set to be lost (or to have evolved slightly more rapidly) in the non-chordate genomes. One reason that better BLAST results were found in the current study could well be that this study used data from May 2016, whereas Crisp et al.'s study used data from January 2013. A large number of additional genomes have been deposited in public archives during the three years between the two analyses. These species were not available to the previous study and thus the orthologous genes from these taxa were missed. Insofar as this explanation is correct, it strengthens the argument for gene loss as the explanation for the (very few) human genes that still have better BLAST matches in non-metazoans than in non-chordate metazoans. Another factor is that because only non-chordates are considered, the alignments and bitscores between a human gene and these very distant relatives are necessarily quite weak. This distant relationship makes it more likely that some genes will not be found simply because the sequence has diverged too much for a pairwise alignment to detect it. This study focuses only on human genes, but recent claims of high levels of HGT in other animals have also been reported. The most dramatic claim was the recent report that up to one-sixth of the genes in the tardigrade (Hypsibius dujardini) had been laterally transferred from other species [6], but that claim was quickly shown to be a false result due primarily to contamination of the genome assembly [7]. In Crisp et al. [1], contamination seems to be a likely explanation for the three human genes (PRAME family members 1, 6, and 15) reported as high-confidence HGT events, and a closer scrutiny of other automatically identified HGT candidates might reveal other cases. (Contamination has been reported to create false signals of HGT as far back as 2002 [8].) My re-examination here suggests that HGT is very rare rather than widespread in vertebrate genomes, and that every hypothesized HGT event needs to be subjected to careful scrutiny. As we wrote in 2001 [3], “the argument for lateral gene transfer is essentially a statistical one, necessarily so because of the inherent impossibility of observing events that may have occurred in the distant past”. When searching a large set of genes against an even larger database, one must recognize that such large-scale, automated searches will inevitably find unusual results that include genes that were lost or evolved more rapidly in multiple lineages. Because HGT is such an unlikely event, the results of automated searches should be subjected to individual, close scrutiny with an eye toward explaining them through more mundane processes before concluding that these anomalies represent novel biological discoveries. As demonstrated here, a re-analysis using the latest genome databases shows that other than the well-known mitochondrial genome transfer and retrovirus-mediated events, no genes have been horizontally transferred into the human genome.

Methods

Ensembl identifiers for all genes proposed as examples of HGT were obtained from Crisp et al. [1] and validated by retrieving them from the Ensembl database (www.ensembl.org). Genomes and protein sequences were obtained from the National Center for Biotechnology Information (NCBI; www.ncbi.nlm.nih.gov) and UniProt (www.uniprot.org). Protein sequences were aligned individually using the blastp program and the non-redundant protein database, nr, available through the BLAST server at NCBI (https://blast.ncbi.nih.gov) or for direct download from the same source. To aid analysis, searches were run against the entire database and again with the phylum Chordata (taxon 7711) excluded from the results, which did not affect bitscores. Gene-by-gene analysis of evidence for horizontal gene transfer for all genes in Tables 1 and 2. (DOCX 48 kb) Protein sequences for Ensembl genes in Tables 1 and 2. (FA 24 kb)
  8 in total

1.  Initial sequencing and analysis of the human genome.

Authors:  E S Lander; L M Linton; B Birren; C Nusbaum; M C Zody; J Baldwin; K Devon; K Dewar; M Doyle; W FitzHugh; R Funke; D Gage; K Harris; A Heaford; J Howland; L Kann; J Lehoczky; R LeVine; P McEwan; K McKernan; J Meldrim; J P Mesirov; C Miranda; W Morris; J Naylor; C Raymond; M Rosetti; R Santos; A Sheridan; C Sougnez; Y Stange-Thomann; N Stojanovic; A Subramanian; D Wyman; J Rogers; J Sulston; R Ainscough; S Beck; D Bentley; J Burton; C Clee; N Carter; A Coulson; R Deadman; P Deloukas; A Dunham; I Dunham; R Durbin; L French; D Grafham; S Gregory; T Hubbard; S Humphray; A Hunt; M Jones; C Lloyd; A McMurray; L Matthews; S Mercer; S Milne; J C Mullikin; A Mungall; R Plumb; M Ross; R Shownkeen; S Sims; R H Waterston; R K Wilson; L W Hillier; J D McPherson; M A Marra; E R Mardis; L A Fulton; A T Chinwalla; K H Pepin; W R Gish; S L Chissoe; M C Wendl; K D Delehaunty; T L Miner; A Delehaunty; J B Kramer; L L Cook; R S Fulton; D L Johnson; P J Minx; S W Clifton; T Hawkins; E Branscomb; P Predki; P Richardson; S Wenning; T Slezak; N Doggett; J F Cheng; A Olsen; S Lucas; C Elkin; E Uberbacher; M Frazier; R A Gibbs; D M Muzny; S E Scherer; J B Bouck; E J Sodergren; K C Worley; C M Rives; J H Gorrell; M L Metzker; S L Naylor; R S Kucherlapati; D L Nelson; G M Weinstock; Y Sakaki; A Fujiyama; M Hattori; T Yada; A Toyoda; T Itoh; C Kawagoe; H Watanabe; Y Totoki; T Taylor; J Weissenbach; R Heilig; W Saurin; F Artiguenave; P Brottier; T Bruls; E Pelletier; C Robert; P Wincker; D R Smith; L Doucette-Stamm; M Rubenfield; K Weinstock; H M Lee; J Dubois; A Rosenthal; M Platzer; G Nyakatura; S Taudien; A Rump; H Yang; J Yu; J Wang; G Huang; J Gu; L Hood; L Rowen; A Madan; S Qin; R W Davis; N A Federspiel; A P Abola; M J Proctor; R M Myers; J Schmutz; M Dickson; J Grimwood; D R Cox; M V Olson; R Kaul; C Raymond; N Shimizu; K Kawasaki; S Minoshima; G A Evans; M Athanasiou; R Schultz; B A Roe; F Chen; H Pan; J Ramser; H Lehrach; R Reinhardt; W R McCombie; M de la Bastide; N Dedhia; H Blöcker; K Hornischer; G Nordsiek; R Agarwala; L Aravind; J A Bailey; A Bateman; S Batzoglou; E Birney; P Bork; D G Brown; C B Burge; L Cerutti; H C Chen; D Church; M Clamp; R R Copley; T Doerks; S R Eddy; E E Eichler; T S Furey; J Galagan; J G Gilbert; C Harmon; Y Hayashizaki; D Haussler; H Hermjakob; K Hokamp; W Jang; L S Johnson; T A Jones; S Kasif; A Kaspryzk; S Kennedy; W J Kent; P Kitts; E V Koonin; I Korf; D Kulp; D Lancet; T M Lowe; A McLysaght; T Mikkelsen; J V Moran; N Mulder; V J Pollara; C P Ponting; G Schuler; J Schultz; G Slater; A F Smit; E Stupka; J Szustakowki; D Thierry-Mieg; J Thierry-Mieg; L Wagner; J Wallis; R Wheeler; A Williams; Y I Wolf; K H Wolfe; S P Yang; R F Yeh; F Collins; M S Guyer; J Peterson; A Felsenfeld; K A Wetterstrand; A Patrinos; M J Morgan; P de Jong; J J Catanese; K Osoegawa; H Shizuya; S Choi; Y J Chen; J Szustakowki
Journal:  Nature       Date:  2001-02-15       Impact factor: 49.962

2.  Phylogenetic analyses do not support horizontal gene transfers from bacteria to vertebrates.

Authors:  M J Stanhope; A Lupas; M J Italia; K K Koretke; C Volker; J R Brown
Journal:  Nature       Date:  2001-06-21       Impact factor: 49.962

3.  Microbial genes in the human genome: lateral transfer or gene loss?

Authors:  S L Salzberg; O White; J Peterson; J A Eisen
Journal:  Science       Date:  2001-05-17       Impact factor: 47.728

4.  Contamination in the draft of the human genome masquerades as lateral gene transfer.

Authors:  Eske Willerslev; Tobias Mourier; Anders J Hansen; Bent Christensen; Ian Barnes; Steven L Salzberg
Journal:  DNA Seq       Date:  2002-04

Review 5.  Much ado about bacteria-to-vertebrate lateral gene transfer.

Authors:  Diane P Genereux; John M Logsdon
Journal:  Trends Genet       Date:  2003-04       Impact factor: 11.639

6.  Evidence for extensive horizontal gene transfer from the draft genome of a tardigrade.

Authors:  Thomas C Boothby; Jennifer R Tenlen; Frank W Smith; Jeremy R Wang; Kiera A Patanella; Erin Osborne Nishimura; Sophia C Tintori; Qing Li; Corbin D Jones; Mark Yandell; David N Messina; Jarret Glasscock; Bob Goldstein
Journal:  Proc Natl Acad Sci U S A       Date:  2015-11-23       Impact factor: 11.205

7.  No evidence for extensive horizontal gene transfer in the genome of the tardigrade Hypsibius dujardini.

Authors:  Georgios Koutsovoulos; Sujai Kumar; Dominik R Laetsch; Lewis Stevens; Jennifer Daub; Claire Conlon; Habib Maroon; Fran Thomas; Aziz A Aboobaker; Mark Blaxter
Journal:  Proc Natl Acad Sci U S A       Date:  2016-03-24       Impact factor: 11.205

8.  Expression of multiple horizontally acquired genes is a hallmark of both vertebrate and invertebrate genomes.

Authors:  Alastair Crisp; Chiara Boschetti; Malcolm Perry; Alan Tunnacliffe; Gos Micklem
Journal:  Genome Biol       Date:  2015-03-13       Impact factor: 13.583

  8 in total
  17 in total

1.  The genomes of polyextremophilic cyanidiales contain 1% horizontally transferred genes with diverse adaptive functions.

Authors:  Alessandro W Rossoni; Dana C Price; Mark Seger; Dagmar Lyska; Peter Lammers; Debashish Bhattacharya; Andreas Pm Weber
Journal:  Elife       Date:  2019-05-31       Impact factor: 8.140

Review 2.  Functional horizontal gene transfer from bacteria to eukaryotes.

Authors:  Filip Husnik; John P McCutcheon
Journal:  Nat Rev Microbiol       Date:  2017-11-27       Impact factor: 60.633

3.  Fungal evolution: cellular, genomic and metabolic complexity.

Authors:  Miguel A Naranjo-Ortiz; Toni Gabaldón
Journal:  Biol Rev Camb Philos Soc       Date:  2020-04-17

4.  Universal and taxon-specific trends in protein sequences as a function of age.

Authors:  Jennifer E James; Sara M Willis; Paul G Nelson; Catherine Weibel; Luke J Kosinski; Joanna Masel
Journal:  Elife       Date:  2021-01-08       Impact factor: 8.140

5.  Potential Effects of Horizontal Gene Exchange in the Human Gut.

Authors:  Aaron Lerner; Torsten Matthias; Rustam Aminov
Journal:  Front Immunol       Date:  2017-11-27       Impact factor: 7.561

6.  Grafting or pruning in the animal tree: lateral gene transfer and gene loss?

Authors:  Julie C Dunning Hotopp
Journal:  BMC Genomics       Date:  2018-06-18       Impact factor: 3.969

7.  Desiccation does not drastically increase the accessibility of exogenous DNA to nuclear genomes: evidence from the frequency of endosymbiotic DNA transfer.

Authors:  Xixi Li; Cheng Fang; Jun-Peng Zhao; Xiao-Yu Zhou; Zhihua Ni; Deng-Ke Niu
Journal:  BMC Genomics       Date:  2020-07-01       Impact factor: 3.969

8.  Recurrent Loss, Horizontal Transfer, and the Obscure Origins of Mitochondrial Introns in Diatoms (Bacillariophyta).

Authors:  Wilson X Guillory; Anastasiia Onyshchenko; Elizabeth C Ruck; Matthew Parks; Teofil Nakov; Norman J Wickett; Andrew J Alverson
Journal:  Genome Biol Evol       Date:  2018-06-01       Impact factor: 3.416

9.  Genomes of trombidid mites reveal novel predicted allergens and laterally transferred genes associated with secondary metabolism.

Authors:  Xiaofeng Dong; Kittipong Chaisiri; Dong Xia; Stuart D Armstrong; Yongxiang Fang; Martin J Donnelly; Tatsuhiko Kadowaki; John W McGarry; Alistair C Darby; Benjamin L Makepeace
Journal:  Gigascience       Date:  2018-12-01       Impact factor: 6.524

10.  What is an archaeon and are the Archaea really unique?

Authors:  Ajith Harish
Journal:  PeerJ       Date:  2018-10-18       Impact factor: 2.984

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.