| Literature DB >> 26597228 |
Felix Mesak1,2, Andrey Tatarenkov3, John C Avise4.
Abstract
BACKGROUND: Many vertebrate species have the ability to undergo weeks or even months of diapause (a temporary arrest of development during early ontogeny). Identification of diapause genes has been challenging due in part to the genetic heterogeneity of most vertebrate animals.Entities:
Mesh:
Year: 2015 PMID: 26597228 PMCID: PMC4657215 DOI: 10.1186/s12864-015-2210-0
Source DB: PubMed Journal: BMC Genomics ISSN: 1471-2164 Impact factor: 3.969
Fig. 1Images of diapause or hatched Kmar embryos. a Each Kmar embryo enters diapause at stage 32, which is marked by the diminished presence of nutritional oil droplets (bubble features clearly visible at stage 30). Any prolongation of embryonic stage 32 beyond 24 days is considered diapause. b Kmar embryos can either enter diapause or hatch (the biological mechanism that allows an embryo to enter and stay in diapause instead of hatching is unknown)
Alignment of Kmar RNA-Seq reads against various reference genomes
| Reference Genome/Transcriptome | No. of Mapped RNA-Seq Reads: | Percentage: | ||||||||
|---|---|---|---|---|---|---|---|---|---|---|
| RNA01:pre-Dia | RNA02:Dia-R1 | RNA03:Dia-R2 | RNA04:Dia-R3 | RNA05:post-Dia | RNA01:pre-Dia | RNA02:Dia-R1 | RNA03:Dia-R2 | RNA04:Dia-R3 | RNA05:post-Dia | |
| Kmar | 31,590,161 | 39,192,211 | 31,524,390 | 32,811,042 | 30,944,064 | 97.08 | 97.42 | 97.35 | 97.32 | 96.98 |
| Kmar | 30,584,171 | 37,850,616 | 30,651,957 | 31,942,287 | 29,994,937 | 93.99 | 94.08 | 94.66 | 94.74 | 94.01 |
| Kmar | 29,039,228 | 35,715,712 | 29,057,499 | 30,264,954 | 28,112,433 | 89.24 | 88.78 | 89.74 | 89.77 | 88.11 |
| Kmar | 4,078,090 | 4,631,901 | 3,720,990 | 3,781,213 | 3,789,332 | 12.53 | 11.51 | 11.49 | 11.22 | 11.88 |
| Amazon Molly Genomec | 3,735,199 | 4,105,696 | 3,022,398 | 3,096,027 | 2,577,109 | 11.48 | 10.21 | 9.33 | 9.18 | 8.08 |
| Platyfish Genomed | 3,250,075 | 3,733,889 | 2,736,897 | 2,824,479 | 2,278,491 | 9.99 | 9.28 | 8.45 | 8.38 | 7.14 |
| Medaka Genomee | 3,668,978 | 3,476,396 | 2,512,691 | 2,499,786 | 2,198,448 | 11.28 | 8.64 | 7.76 | 7.41 | 6.89 |
| Zebrafish Genomef | 1,841,434 | 1,552,206 | 1,124,437 | 1,073,152 | 982,792 | 5.66 | 3.86 | 3.47 | 3.18 | 3.08 |
| Fugu Genomeg | 1,302,617 | 1,557,208 | 1,097,370 | 1,143,775 | 1,036,617 | 4.00 | 3.87 | 3.39 | 3.39 | 3.25 |
| Mouse Genomeh | 1,324,000 | 998,352 | 763,984 | 712,118 | 639,505 | 4.07 | 2.48 | 2.36 | 2.11 | 2.00 |
| Human Genomei | 1,681,163 | 1,273,803 | 961,456 | 899,559 | 853,593 | 5.17 | 3.17 | 2.97 | 2.67 | 2.68 |
aThis study;
b(Mesak, et al. [32]);
cPoeFor_5.1.2;
dXipmac4.4.2 (Schartl, et al. [40]);
eoryLat2, MEDAKA1 (Kasahara, et al. [41]);
fdanRer7, Zv9 (Howe, et al. [42]);
gfr3, FUGU4 (Aparicio, et al. [43]);
hmm10, GRCm38 (Mouse Genome Sequencing, et al. [44]);
ihg38, GRCh38 (Lander, et al. [45]; Venter, et al. [46]);
Reference genome database can be downloaded from http://www.ensembl.org/info/data/ftp/index.html or http://hgdownload.soe.ucsc.edu/downloads.html
Fig. 2Optimization of Kmar de novo genome assemblies. Raw reads for pair-ended Kmar WGS were assembled at various k-mer values to build a de novo genome. The graph shows a positive correlation between higher k-mer values and Kmar genome size. A similar correlation was found for N50, but peaked at k = 60. Thus, a Kmar genome with the highest N50 value (32,044 bp) that yielded a genome size of 642,279,823 bp was selected to serve as a reference genome for the RNA-Seq data analyses. [Note: k-mer at higher than k = 64 failed to assemble a de novo genome]
Fig. 3Mitochondrial housekeeping gene expression. The graph shows 93 highly abundant Kmar mitochondrial transcripts that were identified from the de novo genome (ABySS) assembly
The percentages of transcripts that exist in various developmental stages of Kmar fish
| Comparison | Transcripts detected in | WGSb ( | RNA-Seqc ( | RNA-Seqd ( |
|---|---|---|---|---|
| Diapause vs. pre-Diapause | Diapause only | 1.92 + 0.38 % | 1.11 + 0.04 % | 0.23 + 0.00 % |
| pre-Diapause only | 0.86 + 0.40 % | 0.62 + 0.15 % | 0.16 + 0.04 % | |
| both Diapause & pre-Diapause | 96.94 + 0.16 % | 98.01 + 0.14 % | 99.59 + 0.04 % | |
| othera | 0.28 + 0.14 % | 0.50 + 0.15 % | 0.02 + 0.00 % | |
| Diapause vs. post-Diapause | Diapause only | 1.05 + 0.48 % | 1.21 + 0.09 % | 0.36 + 0.00 % |
| post-Diapause only | 1.03 + 0.49 % | 1.01 + 0.22 % | 0.17 + 0.04 % | |
| both Diapause & post-Diapause | 97.81 + 1.02 % | 97.68 + 0.18 % | 99.45 + 0.04 % | |
| othera | 0.11 + 0.06 % | 0.09 + 0.02 % | 0.01 + 0.00 % |
atranscripts exist in other developmental stage, i.e. in post-diapause for diapause vs. pre-diapause, and vice versa: in pre-diapause for diapause vs. post-diapause;
bassembly was built by ABySS;
ctranscriptome was assembled by Trans-ABySS;
dtranscriptome was assembled by Trinity
Fig. 4This composite graph shows differentially expressed genes in diapause versus pre- or post-diapause, as identified from the de novo genome and transcriptomes assemblies
Percentages of up- or down-regulated transcripts in diapause versus pre- or post-diapause
| Reference | Total transcripts | No. of differentially regulated transcripts | Diapause/pre-Diapause | Diapause/post-Diapause | ||
|---|---|---|---|---|---|---|
| Up-regulated | Down-regulated | Up-regulated | Down-regulated | |||
| Kmar | 67,374 | 120 | 21.67 % | 9.17 % | 20.00 % | 66.67 % |
| Kmar | 206,747 | 198 | 10.61 % | 11.62 % | 11.62 % | 78.28 % |
| Kmar | 97,979 | 176 | 19.89 % | 15.34 % | 13.64 % | 69.89 % |
Clusters of genes that were up- or down-regulated during diapause
| Gene Cluster | List of Genes | No. of Dataa | Diapause versus | |
|---|---|---|---|---|
| pre-Diapause | post-Diapause | |||
| Cardiac and Skeletal Muscle |
| 21 | UPb (1.5) | UP (1.9) |
| Immune System |
| 27 | UP (2.4) | -c |
| Extracellular Matrices (ECM) |
| 237 | - | DOWNd (−3.2) |
| Respiration |
| 126 | - | DOWN (−1.6) |
| Development |
| 114 | - | DOWN (−1.7) |
| Other Cellular and Enzymatic Processes |
| 63 | - | DOWN (−3.4) |
| Hormones |
| 36 | - | DOWN (−3.5) |
| Toxin-like Proteins |
| 33 | - | DOWN (−4.0) |
| Cytoskeleton |
| 30 | - | DOWN (−2.3) |
| Ion Binding |
| 30 | - | DOWN (−4.3) |
| Protein Modification |
| 114 | - | - |
| Nucleic Acid Modification |
| 93 | - | - |
| Carbohydrate Modification |
| 54 | - | - |
| Environmental Responses |
| 51 | - | - |
| Cell Signaling |
| 30 | - | - |
| Transporter Functions |
| 24 | - | - |
| Unknown | n/a | 381 | - | DOWN (−2.0) |
aNo. of Data are from 3 diapause replicates and 3 genome/transcriptome references (RPKM and fold change data are listed in the Supplementary document);
bAVERAGE(Log2(FoldChange)) ≥ 1.5;
c-1.5 < AVERAGE(Log2(FoldChange)) < 1.5;
dAVERAGE(Log2(FoldChange))≤1.5
Fig. 5Image from interactome analysis that visualize the interaction between four genes (myo18b, mylk2, myh2, and bag3) that are related to cardiac and skeletal muscle function. Inset: the up-regulated cluster of genes related to cardiac and skeletal muscle in diapause versus both pre- and post-diapause. The graph shows fold-changes of transcript abundance
Diapause genes identified from all three references
| Gene Name | No. of Dataa | Diapause versus | |
|---|---|---|---|
| pre-Diapause | post-Diapause | ||
| Dual specificity phosphatase 27 ( | 12 | UPb (2.7) | UP (2.1) |
| Kelch-like family member 38 ( | 12 | UP (4.8) | UP (2.4) |
| Sequestosome 1 ( | 9 | UP (2.1) | UP (2.5) |
| Collagen a-1(IX) chain-like ( | 15 | DOWNc (−2.2) | DOWN (−2.6) |
| Dentin sialophosphoprotein-like ( | 12 | DOWN (−2.3) | DOWN (−2.3) |
| Formin-like protein ( | 9 | DOWN (−2.8) | DOWN (−3.0) |
| Transmembrane 4 L6 family member 5-like ( | 9 | UP (1.6) | DOWN (−3.1) |
| Arginase-2-like ( | 15 | UP (2.9) | − d |
| Rhesus glycoprotein ( | 12 | UP (3.7) | - |
| Elastase-1-like ( | 9 | UP (2.5) | - |
| Interleukin-12 subunit b-like ( | 9 | - | UP (3.5) |
| Galactose-specific lectin nattectin-like ( | 15 | DOWN (−3.9) | UP (3.2) |
| Egl-9 family hypoxia-inducible factor 3 ( | 12 | DOWN (−2.5) | - |
| Hemoglobin subunit b-like ( | 9 | DOWN (−3.6) | - |
| a-Tectorin-like ( | 78 | - | DOWN (−4.6) |
| Mucin-2-like ( | 39 | - | DOWN (−2.8) |
| Cartilage intermediate layer protein 2-like ( | 27 | - | DOWN (−2.1) |
| GATA zinc finger domain-containing protein 14-like ( | 24 | - | DOWN (−2.7) |
| Natterin-3-like | 24 | - | DOWN (−4.6) |
| Interferon-inducible GTPase 5-like ( | 21 | - | DOWN (−3.4) |
| CUB and zona pellucida-like domain-containing protein 1-like ( | 18 | - | DOWN (−4.2) |
| Sperm acrosome membrane-associated protein 4-like ( | 15 | - | DOWN (−3.3) |
| Neoverrucotoxin subunit a-like ( | 12 | - | DOWN (−4.5) |
| Cytolysin RTX-S-2-like ( | 9 | - | DOWN (−3.7) |
| Cytoskeleton-associated protein 4 ( | 9 | - | DOWN (−3.3) |
| Seminal plasma glycoprotein 120 ( | 9 | - | DOWN (−4.1) |
| Retinol binding protein 2 (rbp2) | 9 | - | DOWN (−2.2) |
aNo. of Data are from three diapause replicates and three genome/transcriptome references (RPKM and fold change data are listed in the Supplementary document);
bAVERAGE(Log2(FoldChange)) ≥ 1.5;
cAVERAGE(Log2(FoldChange))≤−1.5;
d-1.5 < AVERAGE(Log2(FoldChange)) < 1.5