| Literature DB >> 30957857 |
Victoria G Twort1,2,3, Richard D Newcomb1,4, Thomas R Buckley1,2.
Abstract
Exposure to low temperatures requires an organism to overcome physiological challenges. New Zealand wētā belonging to the genera Hemideina and Deinacrida are found across a wide range of thermal environments and therefore subject to varying selective pressures. Here we assess the selection pressures across the wētā phylogeny, with a particular emphasis on identifying genes under positive or diversifying selection. We used RNA-seq to generate transcriptomes for all 18 Deinacrida and Hemideina species. A total of 755 orthologous genes were identified using a bidirectional best-hit approach, with the resulting gene set encompassing a diverse range of functional classes. Analysis of ortholog ratios of synonymous to nonsynonymous amino acid changes found 83 genes that are under positive selection for at least one codon. A wide variety of Gene Ontology terms, enzymes, and KEGG (Kyoto Encyclopedia of Genes and Genomes) pathways are represented among these genes. In particular, enzymes involved in oxidative phosphorylation, melanin synthesis, and free-radical scavenging are represented, consistent with physiological and metabolic changes that are associated with adaptation to alpine environments. Structural alignment of the transcripts with the most codons under positive selection revealed that the majority of sites are surface residues, and therefore have the potential to influence the thermostability of the enzyme, with the exception of prophenoloxidase where two residues near the active site are under selection. These proteins provide interesting candidates for further analysis of protein evolution.Entities:
Keywords: diversifying selection; metabolic rate; molecular evolution; transcriptome
Mesh:
Year: 2019 PMID: 30957857 PMCID: PMC6486805 DOI: 10.1093/gbe/evz070
Source DB: PubMed Journal: Genome Biol Evol ISSN: 1759-6653 Impact factor: 3.416
Summary of Sample Collection Details and Tissues Sequenced from Deinacrida and Hemideina Species
| Species | Code | Sex | Locality | Latitude/Longitude | Date Collected | Collected by | Tissue(s) Sequenced |
|---|---|---|---|---|---|---|---|
| WT89 | Male | Pig Island, Foveaux straight | 44°56′44.263″S, 168°24′34.097″E, 5 m | January 13 | R. Cole | Muscle | |
| WT68 | Male | Mt Robert | 41°49′55″S 172°48′37″E, 1,300 m | April 12 | K.C. Burns | Muscle | |
| WT104 | Female | Mt Sommers | 43°35'S, 171°19'E | December 13 | M. McDonald | Antennae | |
| WT58 | Male | Tawhiti Rahi, Poor Knights Islands | 35°27′S, 174°44′E, 40 m | April 12 | K. Grayson | Muscle, testis, accessory gland, midgut, antennae | |
| WT74 | Male | Hauturu/Little Barrier Island | 36°13′S, 175°3′E, 10 m | June 12 | K. Lomas, D. Seldon | Muscle | |
| WT57 | Male | Warrenheip, Cambridge | 37°56′S, 175°35′E, 89 m | February 10 | C. Watts | Muscle | |
| WT263 | Unknown | Mt Fyffe | 42°19′S, 173°37′E | April 15 | M. McDonald | Muscle | |
| WT158 | Female | Matukituki River, Mt Aspiring National Park | 44°13′55.077″S, 168°50′2.375″E, 900 m | April 14 | T. Jewell | Muscle | |
| WT72 | Male | Stephens Island | 40°40′12.789″S, 173°59′48.877″E, 220 m | May 12 | L. Moran | Muscle | |
| WT250 | Male | Mt Faraday, Paparoa Range | 42°01.696′, 171°34.785′, 1,297 m | February 15 | T. Buckley, W. Chinn, R. Leschen | Muscle | |
| WT251 | Male | Mt Arthur, Kahurangi National Park | 41°13′0″S, 172°40′0″E, 1,400 m | February 15 | I. Millar | Muscle | |
| WT153 | Male | Denniston Plateau | 41°45′57.784″S, 171°46′26.284″E, 667 m | February 14 | I. Millar, A. Walker | Muscle | |
| WT19 | Male | Karori, Wellington | 41°17.02′S, 174°44.45′E, 200 m | October 11 | P. Richie | Muscle | |
| WT67 | Male | Banks Peninsula, Christchurch | 43°44′S, 173°0′E | March 12 | R. van Heugten | Muscle | |
| WT65 | Male | Rock and Pillar Range, Otago | 45°22′60″S, 170°7′0″E, 1,100 m | July 13 | D. Wharton | Muscle | |
| WT107 | Male | Banks Peninsula, Christchurch | 43°44′S, 173°0′E | July 13 | R. van Heugten | Muscle | |
| WT16 | Male | Birkdale, Auckland | 36°48′0.219″S 174°41′26.641″E, 20 m | September 11 | V. Twort, G. Twort, O. Twort | Muscle | |
| WT99 | Male | Cape Kidnappers, Hawkes Bay | 39°38′24.53″S 177°5′21.44″, 150 m | February 13 | M. Lusk | Muscle |
aSamples for which library construction and sequencing were carried out at BGI.
Summary Statistics of Transcriptome Sequencing, Assembly, and Assessment from Deinacrida and Hemideina Species
| Species | Raw Data (Gb) | No. Cleaned Pairs | No. Transcripts | Total Assembly Length (bp) | BUSCO Results (Complete (Duplicated): Fragmented: Missing) | Reads Mapping to the Final Assembly (%) |
|---|---|---|---|---|---|---|
| 5.794 | 53,028,680 | 158,184 | 85,686,758 | 62(15): 15: 11 | 87.78 | |
| 5.021 | 45,723,934 | 143,006 | 70,585,929 | 50(10): 13: 35 | 87.64 | |
| 3.912 | 35,840,262 | 154,541 | 75,027,101 | 52(10): 14: 33 | 86.68 | |
| 38.025 | 315,605,868 | 491,052 | 238,830,923 | 83(32): 6.5: 9.7 | 86.02 | |
| 8.662 | 79,034,252 | 181,474 | 95,857,769 | 65(15): 10: 23 | 82.06 | |
| 6.527 | 59,526,220 | 136,415 | 63,537,626 | 41(7.3): 15: 43 | 70.03 | |
| 8.838 | 58,759,734 | 276,182 | 124,660,635 | 69(21): 12: 18 | 85.69 | |
| 20.541 | 185,352,010 | 273,395 | 153,195,209 | 84(29): 4.7: 10 | 88.95 | |
| 7.938 | 72,763,410 | 162,088 | 89,502,720 | 67(15): 10: 22 | 80.17 | |
| 10.245 | 68,125,774 | 292,982 | 136,051,565 | 77(25): 9.6: 13 | 85.11 | |
| 9.202 | 61,191,748 | 285,966 | 135,419,927 | 76(25): 8.6: 14 | 82.86 | |
| 19.766 | 176,252,720 | 283,853 | 151,790,647 | 79(26): 6.9: 13 | 81.59 | |
| 5.621 | 52,140,542 | 206,762 | 96,930,441 | 46(10): 16: 37 | 73.97 | |
| 6.101 | 56,290,568 | 171,783 | 90,810,936 | 63(14): 10: 26 | 85.27 | |
| 6.118 | 56,420,004 | 157,489 | 82,514,748 | 59(13): 11: 28 | 86.68 | |
| 20.283 | 184,487,074 | 296,632 | 162,318,013 | 84(28): 4.5: 11 | 88.95 | |
| 5.888 | 53,491,706 | 212,966 | 97,372,708 | 46(11): 17: 35 | 65.02 | |
| 5.55 | 50,694,098 | 134,650 | 65,137,927 | 44(8.1): 14: 40 | 80.33 |
Note.—Data cleaning was carried out to remove reads with ambiguous bases, homopolymer regions and adapters. Mapping back to the transcriptome was carried out with Bowtie2 sensitive option. Completeness was assessed with BUSCO.
aDeinacrida fallai transcriptome was constructed by combining reads from all six tissue libraries together.
. 1.—The effect of increasing data on the completeness of Hemideina and Deinacrida transcriptomes, as assessed by the presence of complete single copy arthropod orthologs (BUSCO).
. 2.—Consensus phylogeny constructed using 755 orthologs obtained from transcriptome sequences from Hemideina and Deinacrida species. Support values are Bayesian posterior probabilities/maximum likelihood bootstrap values/astral local posterior probabilities. Branch lengths are given in coalescent units, as calculated by Astral. Triangles indicate species whose range includes an alpine habitat.
Summary of the Number of Genes Identified as Having Amino Acids under Positive Selection for Various CODEML Site Model Comparisons in PAML
| Model | Number of Genes |
|---|---|
| M0:M3 | 473 |
| M1a:M2a | 83 |
| M7:M8 | 105 |
| M8a:M8 | 159 |
Summary of PAML Site Test Results for the Top Ten Genes under Selection When Ranked by Number of Sites under Positive Selection
| Contig | Annotation | Protein Length (aa) | ω (M0) | M1a:M2a | BEB Sites (M2) | RCSB Ortholog ID | |
|---|---|---|---|---|---|---|---|
| 3,861 | Prophenoloxidase | 727 | 2.54 | 0.43 | 82.85 | 35 | 3HHS |
| 255 | Collagen alpha-2 chain | 1778 | 3.56 | 0.29 | 119.46 | 22 | — |
| 3,721 | Very long-chain-specific acyl-mitochondrial | 635 | 3.19 | 0.34 | 156.29 | 16 | 2UXW |
| 777 | CD63 antigen | 239 | 2.61 | 0.94 | 124.92 | 13 | — |
| 3,921 | Cytoplasmic dynein 1 intermediate chain-like isoform x16 | 664 | 3.48 | 0.14 | 59.89 | 11 | — |
| 4,174 | Glutathione | 242 | 2.57 | 1.01 | 35.74 | 10 | — |
| 2,771 | Superoxide dismutase | 199 | 2.77 | 1.07 | 35.36 | 9 | 2E47 |
| 268 | Nucleoporin nup54 | 560 | 2.72 | 0.24 | 71.99 | 8 | — |
| 1,338 | CD63 antigen-like | 231 | 2.72 | 0.87 | 106.75 | 8 | — |
| 1,060 | Microsomal glutathione | 143 | 3.21 | 0.98 | 28.73 | 7 | — |
Note.—K, ratio of transitions to transversions; ω (M0), ratio of nonsynonymous to synonymous substitutions under the M0 model; M1a:M2a, likelihood ratio test result for the nested model comparison between the M1a and M2a model; aa, amino acid; BEB Sites (M2), number of amino acid detected under positive selection under a Bayes Empirical Bayes model.
P > 99%.
. 3.—The location of sites identified as under positive selection (ω > 1) using Bayes Empirical Bayes analysis under the M2a model in PAML. The structural models of the three genes are used to highlight the location in the 3D protein structure. (A) Prophenoloxidase sites under selection (shown in yellow), (B) Visual representation of prophenoloxidase active site showing the amino acids under selection (yellow), (C) superoxide dismutase positively selected amino acid residues (yellow), (D) very-long-chain-acyl-CoA dehydrogenase 3D representation showing surface residues, and (E) acyl-CoA dehydrogenase ligand (green) binding pocket sites under selection (yellow).