| Literature DB >> 21843315 |
Christian Rödelsperger1, Ralf J Sommer.
Abstract
BACKGROUND: The recent sequencing of nematode genomes has laid the basis for comparative genomics approaches to study the impact of horizontal gene transfer (HGT) on the adaptation to new environments and the evolution of parasitism. In the beetle associated nematode Pristionchus pacificus HGT events were found to involve cellulase genes of microbial origin and Diapausin genes that are known from beetles, but not from other nematodes. The insect-to-nematode horizontal transfer is of special interest given that P. pacificus shows a tight association with insects.Entities:
Mesh:
Substances:
Year: 2011 PMID: 21843315 PMCID: PMC3175473 DOI: 10.1186/1471-2148-11-239
Source DB: PubMed Journal: BMC Evol Biol ISSN: 1471-2148 Impact factor: 3.260
Figure 1. (A) For all P. pacificus genes with >100 codons, RSCU values were computed, and RSCU and GC3 deviation of a single gene from the genomewide average were plotted. Both values show a substantial correlation indicating that RSCU values are strongly biased by GC3 variation. (B) Correlation between GC3 and intronic GC content suggests that this bias is due to genomic variation in GC content and not specific for individual transcripts. (C) GC-normalization of RSCU values strongly reduces the correlation between deviation in GC3 and codon usage bias. (D) Cumulative distribution of P. pacificus orphan genes that were ranked by their deviation from the genomewide average of RSCU, GC-normalized RSCU and GC3 content. Genes with most atypical codon usage profiles show an enrichment in orphan genes (P = 0.0002), and the strongest enrichment is observed for the GC-normalized RSCU values.
Accuracies of taxonomic group predictions based on GC-normalized codon usage profiles
| Taxonomic group | insect* | bacteria* | amoebozoa* | fungi* | nematode* | plant * |
|---|---|---|---|---|---|---|
| Insects | 61.1 ± 0.7 | 6.8 ± 0.5 | 0.3 ± 0.1 | 7 ± 0.4 | 22.0 ± 0.3 | 2.8 ± 0.2 |
| Bacteria | 8.5 ± 0.2 | 85.5 ± 0.4 | 0 ± 0 | 1.9 ± 0.1 | 3.6 ± 0.2 | 0.5 ± 0.1 |
| Amoebozoa | 2.2 ± 0.6 | 1.5 ± 0.3 | 82.9 ± 1.2 | 1.7 ± 0.7 | 10.8 ± 1.7 | 0.9 ± 0.6 |
| Fungi | 10.3 ± 0.6 | 1.3 ± 0.1 | 0.2 ± 0.1 | 71.3 ± 0.7 | 9.1 ± 0.5 | 7.9 ± 0.3 |
| Nematodes | 12.0 ± 0.4 | 5.5 ± 0.2 | 0.5 ± 0.1 | 7.1 ± 0.2 | 71.9 ± 0.6 | 3.0 ± 0.2 |
| Plants | 6.8 ± 0.8 | 2.9 ± 0.4 | 0.3 ± 0.1 | 18.1 ± 0.6 | 13.9 ± 0.9 | 58.0 ± 1.2 |
Accuracy was measured using ten randomized data sets including equal number of sequences per species (see Methods). For each taxonomic group (rows), we calculated the mean percentage of predictions for all possible classifications, the "*" indicates the predicted taxonomic group (columns). For instance, while 61.1% of insect genes are correctly recognized as insect sequences based on their codon usage, 22.0% are mistaken as nematode genes.
Figure 2Prediction of taxonomic groups from codon usage profiles. (A) Distribution of predicted taxonomic groups based on GC-normalized RSCU profiles. Codon usage profiles of P. pacificus genes were compared to the average genomewide profiles of 71 species and assigned to the species with the closest Euclidean distance in profiles. (B) Enrichment of predicted taxonomic groups for P. pacificus orphans with respect to all genes. The strongest enrichment in predicted taxonomic groups is observed for bacteria, however due to the small number of bacteria predictions in general, it is not the most significant enrichment and might be a side effect of the depletion in predicted nematode codon profiles. In contrast, the most significant bias is observed for genes with predicted insect-like codon usage. (C) Relative distance to Drosophila melanogaster GC-normalized RSCU profile with respect to distances to P. pacificus and Aspergillus nidulans. In comparison to P. pacificus genes with homologs in other nematodes, both classes of HGT candidates show a strongly reduced distance to the Drosophila melanogaster profile.
Figure 3Phylogenetic relationships for P. pacificus Diapausins and reverse transcriptases. (A) Maximum likelihood tree of Diapausin genes including six P. pacificus HGT candidates. Node labels indicate bootstrap support values (100 replications). All P. pacificus Diapausins show the closest similarity to a sequence from leaf beetle (Gastrophysa atrocyanea). (B) Phylogenetic tree of reverse transcriptase proteins from P. pacificus and other nematode and insect sequences. The two P. pacificus reverse transcriptases show highest similarity to a sequence from the postman butterfly (Heliconius melpomene). (C) Genes in a 40 kb locus on Contig15 of the P. pacificus Hybrid 1 assembly. The locus is bounded by an endonuclease and a reverse transcriptase and contains eight embedded genes. Seven of these genes do not show homology to any other known sequences. The only non-orphan gene is a copper transporter with homology to C. elegans F58G6.3.
Pfam domains in HGT candidates
| Accession ID | Description | Enrichment | ||
|---|---|---|---|---|
| PF00078.20 | Reverse transcriptase (RNA-dependent DNA polymerase) | 58 | 14.0× | |
| PF03372.16 | Endonuclease/Exonuclease/phosphatase family | 27 | 18.8× | |
| PF00096.19 | Zinc finger, C2H2 type | 14 | 5.1× | |
| PF00067.15 | Cytochrome P450 | 11 | 2.5× | |
| PF00069.18 | Protein kinase domain | 9 | 0.9× | 1 |
| PF07714.10 | Protein tyrosine kinase | 9 | 1.0× | 0.9 |
| PF00097.18 | Zinc finger, C3HC4 type (RING finger) | 8 | 3.1× | |
| PF00089.19 | Trypsin | 6 | 6.4× | |
| PF08036.4 | Diapausin family of antimicrobial peptide | 6 | 38.3× | |
| PF08953.4 | Domain of unknown function (DUF1899) | 6 | 25.6× | |
| PF00076.15 | RNA recognition motif (RRM) | 5 | 1.6× | 0.3 |
| PF00644.13 | Poly(ADP-ribose) polymerase catalytic domain | 5 | 24.0× | |
| PF00856.21 | SET domain | 5 | 5.8× | |
| PF03732.10 | Retrotransposon gag protein | 5 | 24.0× | |
| PF12171.1 | Zinc-finger double-stranded RNA-binding | 5 | 3.8× | |
| PF00092.21 | von Willebrand factor type A domain | 4 | 3.2× | 0.06 |
| PF00106.18 | short chain dehydrogenase | 4 | 1.4× | 0.4 |
| PF00178.15 | Ets-domain | 4 | 8.5× | |
| PF00005.20 | ABC transporter | 3 | 1.4× | 0.6 |
| PF00098.16 | Zinc knuckle | 3 | 3.7× |
The 20 most abundant Pfam domains (e-value <0.001) in 509 HGT candidates defined by cross-species homology are shown. Enrichments factors and FDR corrected p-values were calculated relative to all P. pacificus genes with annotated PFAM domains.