| Literature DB >> 27187689 |
Mary Morgan-Richards1, Simon F K Hills1, Patrick J Biggs2, Steven A Trewick1.
Abstract
Hypotheses of hybrid origin are common. Here we use next generation sequencing to test a hybrid hypothesis for a non-model insect with a large genome. We compared a putative hybrid triploid stick insect species (Acanthoxyla geisovii) with its putative paternal diploid taxon (Clitarchus hookeri), a relationship that provides clear predictions for the relative genetic diversity within each genome. The parental taxon is expected to have comparatively low allelic diversity that is nested within the diversity of the hybrid daughter genome. The scale of genome sequencing required was conveniently achieved by extracting mRNA and sequencing cDNA to examine expressed allelic diversity. This allowed us to test hybrid-progenitor relationships among non-model organisms with large genomes and different ploidy levels. Examination of thousands of independent loci avoids potential problems produced by the silencing of parts of one or other of the parental genomes, a phenomenon sometimes associated with the process of stabilisation of a hybrid genome. Transcript assembles were assessed for evidence of paralogs and/or alternative splice variants before proceeding. Comparison of transcript assemblies was not an appropriate measure of genetic variability, but by mapping reads back to clusters derived from each species we determined levels of allelic diversity. We found greater cDNA sequence diversity among alleles in the putative hybrid species (Acanthoxyla geisovii) than the non-hybrid. The allelic diversity within the putative paternal species (Clitachus hookeri) nested within the hybrid-daughter genome, supports the current view of a hybrid-progenitor relationship for these stick insect species. Next generation sequencing technology provides opportunities for testing evolutionary hypotheses with non-model organisms, including, as here, genomes that are large due to polyploidy.Entities:
Mesh:
Year: 2016 PMID: 27187689 PMCID: PMC4871368 DOI: 10.1371/journal.pone.0154911
Source DB: PubMed Journal: PLoS One ISSN: 1932-6203 Impact factor: 3.240
Fig 1New Zealand stick insects illustrate hybrid speciation hypotheses that arise from evolutionary studies.
(A) Two lineages of stick insects have been sampled across their range in New Zealand and by contrasting maternal relationships from mitochondrial DNA sequences with bi-parental multicopy nuclear markers a role for hybridisation has been inferred. Diploid Clitarchus hookeri (orange squares) has both sexual and asexual populations. No males of any of the Acanthoxyla forms are known (purple squares diploid females, purple circles triploid females). (B) Hybrid species are the product of interspecific mating resulting in genomes that are a mix of the two parental species but are reproductively isolated from both these parent taxa. The resulting allelic diversity is illustrated and compared to the diversity expected within non-hybrids and autopolyploids. When short DNA sequence reads are mapped to parents, related and non-related species, allelic similarities can be used to infer origins.
Fig 2Bioinformatic pipeline for analysis of cDNA from non-model organisms with large polyploid genomes to test hybrid origin hypotheses.
Summary of the cDNA sequences obtained from two New Zealand stick insects using the Illumina pipeline with the length distribution of transcript assemblies following trimming (at a quality of 0.001) and subsequent clustering.
| Yield (kb) | 1,124,720 | 721,022 |
| Raw clusters per tile | 104,803 +/- 9655 | 65,469 +/- 3946 |
| PF clusters per tile (bp) | 60,085 +/- 3433 | 60,085 +/- 3433 |
| Minimum contig length (bp) | 200 | 167 |
| Maximum contig length (bp) | 12,855 | 4,504 |
| Average contig length (bp) | 608.334 | 532.730 |
| Total assembled contig length (bp) | 1,579,843 | 1,410,670 |
| Number of Contigs | 2,597 | 2,648 |
Fig 3Next Generation sequencing results from mRNA of two New Zealand stick insects (A).
Length distributions of transcript assemblies produced from the cDNA sequence of two stick insects were similar. A log length frequency distribution plot used values rounded to 1 decimal place for the longest consensus sequence generated from each cluster. (B) Sequence divergence (measured by SNP density per nucleotide) observed when reads were mapped to ~2,600 loci (transcript assemblies). Loci without variation (SNP-free) were removed. The putative parental Clitarchus hookeri genome contains many loci with low allelic diversity. SNPs detected in less than 10% of the short reads were ignored but reads were included whether or not they passed the strand bias filter within VarScan. Only the longest assembled transcripts generated per cluster were included. (C) SNPs detectable on all transcript assemblies by BWA mapping to ~2,600 Acanthoxyla and Clitarchus transcript assemblies using VarScan with minimum variant frequency of 10% irrespective of strand filter results. The first violin of each color comprises all data, and the second excludes transcript assemblies with no sequence variation (SNP-free). Purple–Acanthoxyla reads mapped onto Acanthoxyla transcript assemblies; Pale green–Clitarchus reads mapped onto Acanthoxyla transcript assemblies; Pale purple–Acanthoxyla reads mapped onto Clitarchus transcript assemblies; Green–Clitarchus reads mapped onto Clitarchus transcript assemblies.
Alpha-actinin cDNA sequence diversity illustrates that allelic diversity of the stick insect Acanthoxyla geisovii is greater than within Clitarchus hookeri as expected of a hybrid.
Within Acanthoxyla geisovii the SNP variant (allele) most similar to Clitarchus hookeri has approximately one third the expression level (inferred from read coverage) because this genome is triploid.
| SequencePosition | SNP (primary/variant) | Read coverage | Variant frequency (%) | SNP (primary/variant) | Read coverage | Variant frequency (%) |
|---|---|---|---|---|---|---|
| 375 | A | 1031 | - | G / A | 824 | |
| 462 | A | 1281 | - | G / A | 704 | |
| 492 | C | 1159 | - | T / C | 631 | |
| 561 | T | 1109 | - | C / T | 543 | |
| 633 | G | 1223 | - | T / G | 548 | |
| 672 | T | 1081 | - | C / T | 569 | |
| 828 | T | 787 | - | C / T | 600 | |
| 928 | T | 1290 | - | C / T | 773 | |
| 1014 | G / T | 802 | G / T | 719 | ||
| 1032 | G / A | 729 | A | 637 | - | |
| 1044 | G | 650 | - | A / G | 594 | |
| 1158 | G | 950 | - | T / G | 671 | |
| 1164 | T | 940 | - | G / T | 707 | |
| 1251 | G | 1272 | - | A / G | 998 | |
| 1285 | T / G | 1398 | T | 1158 | - | |
| 1395 | C | 1437 | - | A / C | 878 | |
| 1404 | A | 1335 | - | T / A | 845 | |
| 1497 | T | 957 | - | C / T | 598 | |
| 1524 | G | 1260 | - | C / G | 787 | |
| 1545 | C | 980 | - | T / C | 712 | |
| 1656 | T | 1067 | - | C / T | 707 | |
| 1662 | G | 1088 | - | A / G | 666 | |
| 1749 | G / A | 1025 | G / A | 952 | ||
| 1809 | C | 1353 | - | T / C | 1216 | |
| 1833 | A | 1426 | - | G / A | 1362 | |
| 1875 | A | 1260 | - | G / A | 1092 | |
| 2019 | T | 1032 | - | C / T | 644 | |
| 2025 | C | 1030 | - | G / C | 657 | |
| 2127 | T | 874 | - | C / T | 649 | |
| 2136 | T | 929 | - | C / T | 706 | |
| 2181 | C | 914 | - | A / C | 793 | |
| 2208 | C | 856 | - | G / C | 704 | |
| 2256 | G | 1087 | - | A / G | 755 | |
| 2433 | A | 1016 | - | G / A | 541 | |
| 2484 | T | 1034 | - | C / T | 506 | |
| 2487 | T | 1084 | - | A / T | 522 | |
Functional groups of assembled transcripts from stick insect cDNA inferred from similarity to an arthropod dataset (Reference) using evolutionary genealogy of groups: Non-supervised Orthologous Groups classification system (eggNOG).
COG = Cluster of Orthologous Groups.
| COG | Reference | Reference fraction | |||||
|---|---|---|---|---|---|---|---|
| S | Function unknown | 8136 | 0.4319 | 296 | 0.2287 | 306 | 0.2116 |
| T | Signal transduction mechanisms | 1733 | 0.092 | 128 | 0.0989 | 131 | 0.0906 |
| K | Transcription | 1360 | 0.0722 | 44 | 0.034 | 43 | 0.0297 |
| O | Posttranslational modification, protein turnover, chaperones | 1122 | 0.0596 | 129 | 0.0997 | 140 | 0.0968 |
| non-singleCOG | 760 | 0.0423 | 46 | 0.0356 | 47 | 0.0327 | |
| G | Carbohydrate transport and metabolism | 631 | 0.0335 | 45 | 0.0348 | 44 | 0.0304 |
| U | Intracellular trafficking, secretion, and vesicular transport | 535 | 0.0284 | 36 | 0.0278 | 39 | 0.027 |
| E | Amino acid transport and metabolism | 503 | 0.0267 | 23 | 0.0178 | 35 | 0.0242 |
| I | Lipid transport and metabolism | 486 | 0.0258 | 46 | 0.0355 | 54 | 0.0373 |
| A | RNA processing and modification | 438 | 0.0233 | 32 | 0.0247 | 44 | 0.0304 |
| J | Translation, ribosomal structure and biogenesis | 416 | 0.0221 | 121 | 0.0935 | 142 | 0.0982 |
| C | Energy production and conversion | 395 | 0.021 | 134 | 0.1036 | 152 | 0.1051 |
| P | Inorganic ion transport and metabolism | 394 | 0.0209 | 32 | 0.0247 | 29 | 0.0201 |
| Z | Cytoskeleton | 378 | 0.0201 | 105 | 0.0811 | 131 | 0.0906 |
| L | Replication, recombination and repair | 374 | 0.0199 | 4 | 0.0031 | 6 | 0.0041 |
| Q | Secondary metabolites biosynthesis, transport and catabolism | 301 | 0.016 | 16 | 0.0124 | 31 | 0.0214 |
| D | Cell cycle control, cell division, chromosome partitioning | 195 | 0.0104 | 8 | 0.0062 | 4 | 0.0028 |
| F | Nucleotide transport and metabolism | 149 | 0.0079 | 2 | 0.0015 | 6 | 0.0041 |
| W | Extracellular structures | 131 | 0.007 | 21 | 0.0162 | 25 | 0.0173 |
| B | Chromatin structure and dynamics | 121 | 0.0064 | 5 | 0.0039 | 7 | 0.0048 |
| H | Coenzyme transport and metabolism | 107 | 0.0057 | 4 | 0.0031 | 11 | 0.0076 |
| M | Cell wall/membrane/envelope biogenesis | 74 | 0.0039 | 9 | 0.007 | 9 | 0.0062 |
| V | Defense mechanisms | 73 | 0.0039 | 7 | 0.0054 | 8 | 0.0055 |
| N | Cell motility | 13 | 0.0007 | 1 | 0.0008 | 1 | 0.0007 |
| Y | Nuclear structure | 12 | 0.0006 | 0 | 0 | 1 | 0.0007 |
| Total | 18837 | 1294 | 1446 |
Fig 4A similar level of genetic divergence within a hybrid stick insect genome as is observed between parental alleles when alleles are compared.
Heat maps of sequence similarity between short-read cDNA sequences mapped to transcript assemblies. (A) Acanthoxyla transcript assemblies with Clitarchus reads (upper right) and Acanthoxyla reads (lower left). (B) Clitarchus transcript assemblies with Clitarchus reads (upper right) and Acanthoxyla reads (lower left).