| Literature DB >> 27017527 |
Carrie A Whittle1, Cassandra G Extavour2.
Abstract
Spiders belong to the Chelicerata, the most basally branching arthropod subphylum. The common house spider, Parasteatoda tepidariorum, is an emerging model and provides a valuable system to address key questions in molecular evolution in an arthropod system that is distinct from traditionally studied insects. Here, we provide evidence suggesting that codon usage, amino acid frequency, and protein lengths are each influenced by expression-mediated selection in P. tepidariorum First, highly expressed genes exhibited preferential usage of T3 codons in this spider, suggestive of selection. Second, genes with elevated transcription favored amino acids with low or intermediate size/complexity (S/C) scores (glycine and alanine) and disfavored those with large S/C scores (such as cysteine), consistent with the minimization of biosynthesis costs of abundant proteins. Third, we observed a negative correlation between expression level and coding sequence length. Together, we conclude that protein-coding genes exhibit signals of expression-related selection in this emerging, noninsect, arthropod model.Entities:
Keywords: Chelicerata; amino acids; arachnid; optimal codons; spider
Mesh:
Substances:
Year: 2016 PMID: 27017527 PMCID: PMC5630913 DOI: 10.1093/gbe/evw068
Source DB: PubMed Journal: Genome Biol Evol ISSN: 1759-6653 Impact factor: 3.416
The List of ΔRSCU Values for Spider among the 5% Highest and Lowest Expressed CDS in Parasteatoda tepidariorum Embryonic Tissues
| Amino Acid | Codon | ΔRSCU |
|
|---|---|---|---|
| Ala |
|
|
|
| Ala | GCC | −0.028 | |
| Ala | GCA | −0.190 |
|
| Ala | GCG | −0.190 |
|
| Arg |
|
|
|
| Arg | CGC | −0.079 |
|
| Arg | CGA | −0.100 |
|
| Arg | CGG | −0.074 |
|
| Arg | AGA | −0.109 |
|
| Arg | AGG | −0.020 | |
| Asn |
|
|
|
| Asn | AAC | −0.121 |
|
| Asp |
|
|
|
| Asp | GAC | −0.163 |
|
| Cys | TGT | −0.078 |
|
| Cys | TGC | −0.269 |
|
| Gln | CAA | −0.088 |
|
| Gln | CAG | +0.002 | |
| Glu | GAA | +0.007 | |
| Glu | GAG | −0.045 |
|
| Gly |
|
|
|
| Gly | GGC | −0.058 |
|
| Gly | GGA | −0.095 |
|
| Gly | GGG | −0.127 |
|
| His | CAT | −0.031 | |
| His | CAC | −0.156 |
|
| Ile |
|
|
|
| Ile | ATC | −0.028 | |
| Ile | ATA | −0.181 |
|
| Leu |
|
|
|
| Leu | TTG | +0.089 |
|
| Leu | CTT | +0.021 | |
| Leu | CTC | −0.090 |
|
| Leu | CTA | −0.057 |
|
| Leu | CTG | −0.062 |
|
| Lys | AAA | −0.078 |
|
| Lys |
|
|
|
| Phe |
|
|
|
| Phe | TTC | −0.106 |
|
| Pro |
|
|
|
| Pro | CCC | −0.057 |
|
| Pro | CCA | −0.270 |
|
| Pro | CCG | −0.179 |
|
| Ser |
|
|
|
| Ser | TCC | −0.106 |
|
| Ser | TCA | −0.181 |
|
| Ser | TCG | −0.233 |
|
| Ser | AGT | +0.182 |
|
| Ser | AGC | −0.065 |
|
| Thr |
|
|
|
| Thr | ACC | −0.006 | |
| Thr | ACA | −0.147 |
|
| Thr | ACG | −0.224 |
|
| Tyr |
|
| * |
| Tyr | TAC | −0.168 |
|
| Val |
|
|
|
| Val | GTC | −0.120 |
|
| Val | GTA | −0.044 | |
| Val | GTG | −0.106 |
|
Note.—Putative optimal codons are underlined and in bold.
P value of TTA was lower than TTG.
*P ≤ 0.05 and >0.001; **P ≤ 0.001.
F(A) The mean Fop value for CDS with low (below the 5th percentile), moderate (between 5th and 95th percentile), and high (>95th percentile) FPKM values for genes expressed in embryos. Different letters indicate statistically significant differences using ranked ANOVA (P < 0.001) and Dunn’s paired contrasts (P < 0.05). (B) The mean proportion of T at third codon positions (T3) per CDS and T from associated introns for each expression category. (C) The mean proportion of T at third codon positions for 4-fold amino acids (T4) per CDS and T from associated introns for each expression category. For genes with more than one intron, the introns were concatenated. Different letters in B and C indicate a difference using MWU tests (P < 0.001 for all pairs).
The Mean Frequency (percentage and standard error, SE) of Amino Acids in the 5% of Most Highly and Lowly Expressed Genes in Parasteatoda tepidariorum Embryos
| Amino Acid | Mean Frequency (SE) | Percent Difference |
| S/C Score | Types of Codons | |
|---|---|---|---|---|---|---|
|
|
| |||||
| Met | 3.087 (0.056) | 2.324 (0.037) | 24.71 |
| 64.68 | ATG |
| Gly | 6.002 (0.123) | 5.038 (0.102) | 16.06 |
| 1 | GGN |
| Lys | 8.514 (0.130) | 7.298 (0.081) | 14.29 |
| 30.14 | AAA/AAG |
| Ala | 6.035 (0.092) | 5.228 (0.068) | 13.37 |
| 4.76 | GCN |
| Glu | 6.941 (0.116) | 6.226 (0.070) | 10.30 |
| 36.48 | GAA/GAG |
| Val | 6.371 (0.088) | 5.857 (0.060) | 8.057 |
| 12.28 | GTN |
| Gln | 3.752 (0.073) | 3.540 (0.052) | 5.648 | 37.48 | CAA/CAG | |
| Arg | 5.336 (0.098) | 5.107 (0.063) | 4.275 | 56.34 | CGN/AGA/AGG | |
| Asp | 5.278 (0.086) | 5.120 (0.058) | 2.983 | 32.72 | GAT/GAC | |
| Pro | 4.557 (0.093) | 4.443 (0.077) | 2.496 | 31.8 | CCN | |
| Leu | 8.840 (0.108) | 9.440 (0.081) | −6.79 |
| 16.04 | CTN/TTA/TTG |
| Tyr | 3.293 (0.065) | 3.575 (0.054) | −8.56 |
| 57 | TAT/TAC |
| Phe | 4.330 (0.082) | 4.770 (0.058) | −10.1 |
| 44 | TTT/TTC |
| Thr | 5.077 (0.076) | 5.666 (0.062) | −11.5 |
| 21.62 | ACN |
| His | 2.160 (0.056) | 2.427 (0.052) | −12.3 |
| 58.7 | CAT/CAC |
| Ser | 7.044 (0.102) | 8.089 (0.075) | −14.8 |
| 17.86 | TCN/AGT/AGC |
| Ile | 5.810 (0.081) | 6.712 (0.065) | −15.5 |
| 16.04 | ATT/ATC/ATA |
| Asn | 4.622 (0.073) | 5.474 (0.065) | −18.4 |
| 33.72 | AAT/AAC |
| Trp | 1.019 (0.044) | 1.212 (0.027) | −18.9 |
| 73 | TGG |
| Cys | 1.915 (0.071) | 2.443 (0.059) | −27.5 |
| 57.16 | TGT/TGC |
Note.—Amino acids are listed from the largest positive percent difference to largest negative percent difference among high and low expressed genes.
**P < 0.001 using MWU-tests. The types of codons used for each amino acid are shown.
FBox and whisker plots of Fop relative to expression (Exp.) and CDS length (short, intermediate, long). Different letters within each expression class indicate a statistically significant difference between lengths using ranked ANOVA (P < 0.001) and Dunn’s paired contrasts (P < 0.05).
Functional Clustering of Highly Expressed CDS (all CDS above the 95th percentile of FPKM) in Parasteatoda tepidariorum Embryos Using Their Orthologs in the Model Drosophila melanogaster and the Gene Ontology System DAVID (Huang da et al. 2009)
|
| |
|---|---|
| Cluster 1: Enrichment score 35.68 | |
| Cytosolic part | 4.00 × 10−48 |
| Cytosolic ribosome | 1.80 × 10−44 |
| Ribosomal protein | 6.60 × 10−42 |
| Ribosome | 1.10 × 10−39 |
| Structural constituent of ribosome | 4.60 × 10−38 |
| Ribosomal subunit | 6.00 × 10−31 |
| Ribosome | 8.40 × 10−29 |
| Structural molecule activity | 3.40 × 10−19 |
| Cluster 2: Enrichment score 24.41 | |
| Mitotic spindle organization | 6.40 × 10−36 |
| Spindle organization | 3.50 × 10−32 |
| Microtubule cytoskeleton organization | 4.00 × 10−30 |
| Microtubule-based process | 3.70 × 10−28 |
| Mitotic cell cycle | 2.60 × 10−27 |
| Cytoskeleton organization | 2.90 × 10−26 |
| Cell cycle | 2.30 × 10−18 |
| Cell cycle phase | 1.10 × 10−17 |
| Cell cycle process | 1.10 × 10−17 |
| M phase | 1.30 × 10−17 |
| Cluster 3: Enrichment score 9.26 | |
| Mitochondrial ATP synthesis coupled electron transport | 3.10 × 10−12 |
| ATP synthesis coupled electron transport | 1.20 × 10−11 |
| Respiratory electron transport chain | 3.20 × 10−11 |
| Electron transport chain | 4.20 × 10−10 |
| Cellular respiration | 2.40 × 10−9 |
| Energy derivation by oxidation of organic compounds | 1.50 × 10−8 |
| Respiratory chain | 1.50 × 10−8 |
| Mitochondrial respiratory chain | 3.30 × 10−8 |
| Cluster 4: Enrichment score 7.32 | |
| Hydrogen ion transmembrane transporter activity | 6.00 × 10−9 |
| Monovalent inorganic cation transmembrane transporter activity | 9.20 × 10−9 |
| Inorganic cation transmembrane transporter activity | 1.90 × 10−6 |
Note.—P values are from a modified Fisher’s test, wherein lower values indicate greater enrichment. The four GO clusters with the highest enrichment scores are shown.
FThe Fop of Drosophila melanogaster orthologs of Parasteatoda tepidariorum genes with low, moderate, and high expression levels. The D. melanogaster optimal codons, which preferentially end in G or C, were taken from Duret and Mouchiroud (1999). Different letters indicate a statistically significant difference using ranked ANOVA (P < 0.001) and Dunn’s paired contrast (P < 0.05).