| Literature DB >> 29145825 |
Chen Wu1,2,3, Victoria G Twort4,5,6, Ross N Crowhurst7, Richard D Newcomb4,7, Thomas R Buckley4,5.
Abstract
BACKGROUND: Stick insects (Phasmatodea) have a high incidence of parthenogenesis and other alternative reproductive strategies, yet the genetic basis of reproduction is poorly understood. Phasmatodea includes nearly 3000 species, yet only the genome of Timema cristinae has been published to date. Clitarchus hookeri is a geographical parthenogenetic stick insect distributed across New Zealand. Sexual reproduction dominates in northern habitats but is replaced by parthenogenesis in the south. Here, we present a de novo genome assembly of a female C. hookeri and use it to detect candidate genes associated with gamete production and development in females and males. We also explore the factors underlying large genome size in stick insects.Entities:
Keywords: Clitarchus hookeri; Genome assembly; Parthenogenesis; Phasmatodea; RNA-Seq; Reproduction
Mesh:
Substances:
Year: 2017 PMID: 29145825 PMCID: PMC5691397 DOI: 10.1186/s12864-017-4245-x
Source DB: PubMed Journal: BMC Genomics ISSN: 1471-2164 Impact factor: 3.969
Illumina sequencing output for Clitarchus hookeri whole genome assembly
| Insert size (bp) | Sample × library × lane | Sequencing output (Gb) | Estimated Genome coverage (×) |
|---|---|---|---|
| 200 | CLI525 × 1 × 5, CLI600 × 1 × 1 | 261.2 | 59.4 |
| 350 | CLI525 × 1 × 2 | 88.5 | 20.1 |
| 500 | CLI600 × 1 × 1 | 39.7 | 9.0 |
| 720 | CLI525 × 1 × 1, CLI600 × 2 × 2 | 72.3 | 16.4 |
| 5000 | CLI654 × 1 × 4 | 126.4 | 28.7 |
| 8000 | CLI525 × 1 × 2, CLI654 × 1 × 2 | 145.2 | 33.0 |
| Total | 733.3 | 166.6 |
Genome coverage estimates were obtained using the flow cytometry genome size
Clitarchus hookeri genome assembly statistics
| Size (bp) | Number of scaffolds | |
|---|---|---|
| N90 | 188 | 859,481 |
| N80 | 5446 | 48,684 |
| N70 | 30,351 | 11,639 |
| N60 | 148,642 | 5894 |
| N50 | 255,691 | 3749 |
| Longest | 4,944,527 | – |
| Total (>100 bp) | 4,244,875,252 | 4,114,148 |
| Total (>2 kb) | 3,503,002,174 | 78,458 |
Comparisons of repeats among four Polyneoptera genomes
|
|
|
|
| |||||
|---|---|---|---|---|---|---|---|---|
| Genome size (Mb) | 490 | 1030 | 4240 | 6500 | ||||
| Repeat types | Length (Mb) | P% | Lengh (Mb) | P% | Lengh (Mb) | P% | Length (Mb) | P% |
| DNA | 12.78 | 2.60 | 75.13 | 7.30 | 612.35 | 14.43 | 1480.54 | 22.69 |
| LINE | 22.13 | 4.50 | 34.34 | 3.34 | 55.84 | 1.32 | 1332.72 | 20.42 |
| LTR | 0.72 | 0.15 | 5.89 | 0.57 | 103.59 | 2.44 | 508.68 | 7.80 |
| SINE | 9.78 | 2.00 | 42.88 | 4.17 | 84.08 | 1.98 | 141.18 | 2.16 |
| Simple repeat | 1.39 | 0.28 | 9.00 | 0.87 | 96.82 | 2.28 | 13.03 | 0.20 |
| Other | 0.00 | 0.00 | 1.00 | 0.10 | 0.74 | 0.02 | 0.03 | 0.00 |
| Unknown | 81.32 | 16.50 | 243.05 | 23.61 | 1237.41 | 29.15 | 406.10 | 6.22 |
| Total | 128.41 | 26.00 | 411.29 | 39.96 | 2190.84 | 51.60 | 3840.81 | 58.86 |
Fig. 1Correlation between genome size and gene model features across insects (Holacanthella duospinosa, Locusta migratoria, Clitarchus hookeri, Timema cristinae, Zootermopsis nevadensis, Acyrthosiphon pisum, Pediculus humanus, Nasonia vitripennis, Tribolium costaneum, Bombyx mori, Anopheles gambiae and Drosophila melanogaster) and Daphnia pulex. AS: genome assembly size; ATL: average of transcript length; AIL: average of intron length; ACL: average of coding region length; AEL: average of exon length; AEG: average number per gene. Scale numbering was adjusted by dividing ATL, AIL, ACL and AEL by 104, 104, 103 and 102, respectively
Genomic DNA mapping statistics
| Sample ID | Read pairs | Uniquely mapped | No. of splices | Multiply mapped | Total mapped (%) |
|---|---|---|---|---|---|
| CLI765 | 47,699,036 | 37,219,558 (78.0%) | 12,199,348 | 7,414,350 (15.5%) | 93.6 |
| CLI767 | 50,546,345 | 35,708,416 (70.6%) | 11,153,462 | 12,243,359 (24.2%) | 94.9 |
| CLI768 | 45,978,461 | 34,815,789 (75.7%) | 10,547,208 | 8,859,828 (19.3%) | 95.0 |
| CLI755 | 44,052,337 | 34,808,276 (79.0%) | 10,253,623 | 6,884,797 (15.6%) | 94.6 |
| CLI757 | 45,000,013 | 32,997,927 (73.3%) | 10,140,927 | 8,889,222 (19.8%) | 93.1 |
| CLI760 | 52,261,126 | 40,301,287 (77.1%) | 13,724,663 | 6,631,109 (12.7%) | 89.9 |
Fig. 2Distribution of homologous sequence matches from the Uniprot, SwissProt and Drosophila melanogaster protein databases, respectively
Fig. 3Distribution of homologous sequence matches from the Uniprot, SwissProt and Drosophila melanogaster protein databases for female-biased and male-biased genes, respectively
Summary of top 20 differentially expressed genes from female reproductive tract and testis, respectively
| Gene locus | Hit ID | Hit Name | E-value | Log2 FC | Adjusted p | |
|---|---|---|---|---|---|---|
| Female-biased | XLOC_018823 | FBgn0037405 | CG1077a | 2.38E-06 | −13.06 | 1.58E-12 |
| XLOC_015696 | FBgn0041709 | yellow-ga | 5.04E-19 | −12.65 | 3.91E-11 | |
| XLOC_032619 | FBgn0266435 | CG45065a | 1.72E-151 | −12.36 | 1.64E-10 | |
| XLOC_030196 | FBgn0035328 | yellow-g2a | 4.20E-65 | −12.06 | 2.09E-36 | |
| XLOC_007922 | FBgn0035089 | Phk-3 | 1.59E-08 | −12.01 | 1.45E-91 | |
| XLOC_009044 | FBgn0034145 | CG5065 | 1.75E-13 | −11.93 | 2.21E-45 | |
| XLOC_007550 | FBgn0039084 | CG10175 | 8.89E-79 | −11.9 | 1.73E-50 | |
| XLOC_021021 | FBgn0038799 | MFS9 | 1.96E-09 | −11.89 | 1.59E-09 | |
| XLOC_021022 | FBgn0038799 | MFS9 | 3.49E-45 | −11.85 | 1.88E-21 | |
| XLOC_025391 | FBgn0013680 | mt:ND2 | 6.59E-09 | −11.81 | 1.74E-283 | |
| XLOC_000499 | FBgn0032433 | Oatp33Ea | 3.76E-32 | −11.54 | 2.71E-09 | |
| XLOC_003735 | FBgn0034145 | CG5065 | 3.59E-14 | −11.52 | 6.18E-32 | |
| XLOC_026733 | FBgn0000032 | Acph-1a | 4.73E-69 | −11.46 | 2.48E-41 | |
| XLOC_002774 | FBgn0000261 | Cat | 1.66E-34 | −11.28 | 2.37E-08 | |
| XLOC_005024 | FBgn0039896 | yellow-h | 7.54E-111 | −11.26 | 1.75E-18 | |
| XLOC_024658 | FBgn0030452 | MFS10 | 5.42E-19 | −11.23 | 2.92E-08 | |
| XLOC_016572 | FBgn0038799 | MFS9 | 3.93E-29 | −11.12 | 4.62E-08 | |
| XLOC_032950 | FBgn0000032 | Acph-1a | 3.86E-21 | −10.99 | 6.21E-18 | |
| XLOC_036184 | FBgn0259247 | laccase2 | 2.23E-09 | −10.94 | 3.44E-09 | |
| XLOC_018465 | FBgn0026314 | Ugt35b | 6.37E-96 | −10.91 | 4.59E-18 | |
| Male-biased | XLOC_037165 | FBgn0014869 | Pglym78 | 7.44E-70 | 13.61 | 1.81E-13 |
| XLOC_034828 | FBgn0035273 | CG12020a | 9.64E-35 | 13.59 | 2.02E-13 | |
| XLOC_033633 | FBgn0020412 | JIL-1 | 1.21E-19 | 13.47 | 4.23E-13 | |
| XLOC_003842 | FBgn0032478 | CG5458a | 3.75E-45 | 13.31 | 1.07E-12 | |
| XLOC_039688 | FBgn0038385 | Fbxl7 | 6.21E-07 | 13.29 | 1.15E-12 | |
| XLOC_035079 | FBgn0019982 | Gs1l | 2.23E-53 | 13.29 | 1.19E-12 | |
| XLOC_019177 | FBgn0020412 | JIL-1 | 2.99E-16 | 13.25 | 1.45E-12 | |
| XLOC_039094 | FBgn0052392 | CG32392a | 2.03E-08 | 13.12 | 3.14E-12 | |
| XLOC_010029 | FBgn0005612 | Sox14 | 3.45E-08 | 13.05 | 4.59E-12 | |
| XLOC_016670 | FBgn0033635 | CG7777 | 2.72E-26 | 12.98 | 2.59E-27 | |
| XLOC_035177 | FBgn0004380 | Klp64D | 1.96E-09 | 12.94 | 1.30E-35 | |
| XLOC_024216 | FBgn0264574 | Glut1 | 1.89E-43 | 12.88 | 8.30E-25 | |
| XLOC_004823 | FBgn0035799 | CG14838a | 1.28E-72 | 12.84 | 1.01E-25 | |
| XLOC_010324 | FBgn0034546 | CG13442a | 2.57E-10 | 12.84 | 1.45E-11 | |
| XLOC_019341 | FBgn0036211 | CG5946 | 9.38E-16 | 12.77 | 2.02E-11 | |
| XLOC_002845 | FBgn0020412 | JIL-1 | 6.64E-13 | 12.75 | 2.79E-24 | |
| XLOC_035152 | FBgn0051068 | CG31068a | 1.24E-33 | 12.73 | 2.62E-26 | |
| XLOC_038804 | FBgn0031988 | CG8668 | 1.07E-63 | 12.65 | 7.43E-25 | |
| XLOC_020898 | FBgn0031859 | CG17377a | 2.27E-10 | 12.58 | 1.47E-24 | |
| XLOC_038151 | FBgn0039396 | CCAP-R | 6.78E-28 | 12.57 | 5.83E-11 |
“a” indicates the transcript is also highly expressed in the Drosophila melanogaster ovary and testicle respectively
Fig. 4Bubble plots showing enriched GO terms generated from sex-biased genes. “Z-score” was calculated for each term using the formula: male-biased minus female-biased gene number divided by the square root of sex-biased gene number. The significant GOs are indicated above the yellow line. The bubbles calculated with minus z-scores represent GOs containing more female-biased genes, while GOs with more male-biased genes have z-scores larger than zero. The bubble size represents the number of genes. The GO descriptions on the right are listed from the highest to lowest significance of enrichment. (a) Level one terms; (b) after removing all parental terms
Fig. 5KEGG pathways enriched in gonadal sex-biased genes. Enriched pathways with modified Fisher exact test p value below 0.1 are shown here. Stars indicate pathways with BH adjusted p value below 0.05