| Literature DB >> 30202061 |
Roger Huerlimann1,2, Nicholas M Wade3,4, Lavinia Gordon3,5, Juan D Montenegro5, Jake Goodall3,4, Sean McWilliam4, Matthew Tinning3,5, Kirby Siemering3,5, Erika Giardina3,6, Dallas Donovan3,6, Melony J Sellars3,4, Jeff A Cowley3,4, Kelly Condon3,7, Greg J Coman3,8, Mehar S Khatkar3,9, Herman W Raadsma3,9, Gregory E Maes7,10,11, Kyall R Zenger3,7, Dean R Jerry3,7.
Abstract
The black tiger shrimp (Penaeus monodon) remains the second most widely cultured shrimp species globally; however, issues with disease and domestication have seen production levels stagnate over the past two decades. To help identify innovative solutions needed to resolve bottlenecks hampering the culture of this species, it is important to generate genetic and genomic resources. Towards this aim, we have produced the most complete publicly available P. monodon transcriptome database to date based on nine adult tissues and eight early life-history stages (BUSCO - Complete: 98.2% [Duplicated: 51.3%], Fragmented: 0.8%, Missing: 1.0%). The assembly resulted in 236,388 contigs, which were then further segregated into 99,203 adult tissue specific and 58,678 early life-history stage specific clusters. While annotation rates were low (approximately 30%), as is typical for a non-model organisms, annotated transcript clusters were successfully mapped to several hundred functional KEGG pathways. Transcripts were clustered into groups within tissues and early life-history stages, providing initial evidence for their roles in specific tissue functions, or developmental transitions. We expect the transcriptome to provide an essential resource to investigate the molecular basis of commercially relevant-significant traits in P. monodon and other shrimp species.Entities:
Mesh:
Substances:
Year: 2018 PMID: 30202061 PMCID: PMC6131155 DOI: 10.1038/s41598-018-31148-4
Source DB: PubMed Journal: Sci Rep ISSN: 2045-2322 Impact factor: 4.996
List of shrimp tissue types and early life-history stages used for transcriptome sequencing.
| Shrimp ID | Sex | Tissue | Number of paired-end reads |
|---|---|---|---|
| PM_F_08 | Female | Eyestalk | 18,984,152 |
| Gill | 19,971,115 | ||
| Hepatopancreas | 18,831,682 | ||
| PM_F_02 | Female | Female Gonad | 21,338,933 |
| Haemolymph | 20,105,399 | ||
| Muscle | 20,361,299 | ||
| Stomach | 13,470,106 | ||
| PM_F_04 | Female | Female Gonad | 20,255,448 |
| Gill | 21,362,076 | ||
| Haemolymph | 20,247,206 | ||
| Stomach | 21,461,589 | ||
| PM_F_03 | Female | Female Gonad | 20,759,890 |
| PM_M_02 | Male | Eyestalk | 21,076,111 |
| Hepatopancreas | 19,029,973 | ||
| Male Gonad | 20,669,419 | ||
| Muscle | 20,129,858 | ||
| PM_M_04 | Male | Eyestalk | 22,250,295 |
| Gill | 20,396,956 | ||
| Haemolymph | 21,637,767 | ||
| Hepatopancreas | 20,854,492 | ||
| Male Gonad | 20,600,256 | ||
| Muscle | 22,464,431 | ||
| Stomach | 16,444,377 | ||
| PM_M_06 | Male | Male Gonad | 19,800,274 |
| PM_M_C2 | Male | Lymphoid Organ | 19,873,753 |
| PM_M_C3 | Male | Lymphoid Organ | 20,480,178 |
| PM_F_C1 | Female | Lymphoid Organ | 20,372,862 |
| Pool_E | Embryo | 19,745,313 | |
| Pool_N | Nauplii | 18,310,089 | |
| Pool_Z | Zoea | 19,528,689 | |
| Pool_M | Mysis | 19,744,563 | |
| Pool_PL1 | PL1 | 19,815,103 | |
| Pool_PL4 | PL4 | 18,680,555 | |
| Pool_PL10 | PL10 | 18,773,667 | |
| Pool_PL15 | PL15 | 19,661,826 |
PL = post-larval stages 1 (PL1), 4 (PL4), 10 (PL10), 15 (PL15).
Figure 1Species distribution of successfully annotated sequences across the top 29 species using the SwissProt (A) and arthropod subsection of the non-redundant (B) database.
Figure 2Distribution of sequence annotations based on the SWISS-PROT database across the top 20 GO terms at level 2. Determined across the entire dataset for Biological Process (BP, green), Molecular Function (MF, blue), and Cellular Component (CC, yellow).
Numbers of transcript clusters and cluster annotation rates across transcriptomes determined for the nine adult P. monodon tissue types analysed.
| Tissue type | Total expressed clusters | Uniquely expressed clusters | ||
|---|---|---|---|---|
| Number | % Annotated (SP/nrA) | Number | % Annotated (SP/nrA) | |
| Eyestalk | 11,173 | 67.3/72.8 | 31 | 29.0/48.4 |
| Female Gonad | 9,941 | 74.3/79.7 | 161 | 37.3/45.3 |
| Gill | 12,255 | 63.7/69.8 | 153 | 30.7/39.2 |
| Haemolymph | 10,577 | 66.1/71.4 | 17 | 23.5/29.4 |
| Hepatopancreas | 12,169 | 67.7/73.9 | 316 | 49.7/66.8 |
| Lymphoid Organ | 11,923 | 63.0/68.5 | 24 | 54.2/66.7 |
| Male Gonad | 10,387 | 71.9/77.5 | 71 | 32.4/42.3 |
| Muscle | 11,405 | 66.9/72.4 | 77 | 33.8/48.1 |
| Stomach | 9,939 | 68.6/73.7 | 24 | 12.5/33.3 |
| Constitutive | 4,300 | 82.5/85.9 | — | — |
Total numbers of expressed clusters (>50 normalized read counts), uniquely expressed clusters (normalized read count of >10 in a specific tissue, while having <10 read counts in all other tissues) and constitutively expressed (>50 normalized read counts in all) clusters within all tissues in this study, and their relative annotation statistics. Numbers represent clusters across all three respective tissue replicates. SP = SWISS-PROT database, nrA = non-redundant Arthropod database.
Figure 3Principal component analysis showing the top most highly differentially expressed transcripts of (A) nine tissue types (top 1,000) and (B) eight early life-history stages (top 500). PC = principal component, PL = post-larvae.
Figure 4Heatmap and hierarchical grouping of the top 2,000 differentially expressed genes in the nine different tissue types. A darker colour indicates a higher expression level. Gene expression patterns (rows) were grouped into nine expression groups based on Pearson’s correlation and the three replicates of each tissue type (columns) into nine tissue groups based on Euclidean distance. EY – eyestalk; FG – female gonad; GI – gill; HL – hemolymph; HP – hepatopancreas; LO – lymphoid organ; MG – male gonad; MU – muscle; ST – stomach.
Groupings of the top 2,000 highly variably expressed transcript clusters among all nine adult tissue types based on Pearson’s correlation.
| Groups | Predominant tissue type expression site | Number of clusters | % Annotated (SP/nrA) | % in adult but not larval tissues |
|---|---|---|---|---|
| 1 | Lymphoid Organ | 81 | 64.2%/76.5% | 0.0% |
| 2 | Haemolymph | 139 | 63.3%/84.9% | 1.4% |
| 3 | Female Gonad | 208 | 55.3%/81.7% | 6.7% |
| 4 | Gill | 177 | 53.1%/66.1% | 3.4% |
| 5 | Stomach | 72 | 62.5%/68.1% | 8.3% |
| 6 | Male Gonad | 124 | 29.0%/38.7% | 35.5% |
| 7 | Hepatopancreas | 533 | 66.6%/85.7% | 0.8% |
| 8 | Eyestalk | 75 | 66.7%/73.3% | 1.3% |
| 9 | Muscle | 591 | 75.1%/86.0% | 1.5% |
| Total | — | 2000 | 64.0%/84.6% | 4.3% |
This includes annotation success and tissue type where each group was predominantly expressed, and the percent of clusters in each group found in adult tissue types but not in the larval stages examined. SP = SWISS-PROT database, nrA = non-redundant Arthropod database.
Figure 5Heatmap and hierarchical grouping of the top 500 differentially expressed genes in the eight larval and post-larval stages examined. A darker colour indicates a higher expression level. Gene expression patterns in each larval/post-larval stage (row) were grouped into nine expression groups based on Pearson’s correlation.
Groupings of the top 500 highly variably expressed transcript clusters among the four larval and four post-larval stages based on Pearson’s correlation.
| Groups | Stage(s) with predominant expression | Number of clusters | % Annotated (SP/nrA) | % unique to larvae |
|---|---|---|---|---|
| 1 | Mid larval to PL (Z, M, PL01, PL04, PL10, PL15) | 77 | 75.3/83.1 | 9.1 |
| 2 | Mid Larval (Z, M) | 35 | 42.9/68.6 | 62.9 |
| 3 | Mid PL (PL4, PL10) | 12 | 0.0/25.0 | 33.3 |
| 4 | Late larval to PL (M, PL1, PL4, PL10, PL15) | 152 | 61.2/69.7 | 18.4 |
| 5 | PL15 | 13 | 69.2/92.3 | 76.9 |
| 6 | Late PL (PL4, PL10, PL15) | 38 | 84.2/84.2 | 10.5 |
| 7 | Embryo (E) | 12 | 0.0/16.7 | 58.3 |
| 8 | Early larval (E, N) | 140 | 24.3/37.1 | 85.0 |
| 9 | Larval (E, N, Z, M, PL15) | 21 | 33.3/61.9 | 38.1 |
| Total | — | 500 | 49.6/61.6 | 50.4 |
This includes annotation success, stages in which transcript groups were predominantly expressed and the percent of clusters in each group found in larval stages, but not in the adult tissue types examined.SP = SWISS-PROT database, nrA = non-redundant Arthropod database, E = embryo, N = nauplii, Z = zoea, M = mysis, PL = post larvae (day).
Figure 6Presence of mRNA contigs that encode for RNA polymerase subunits based on KEGG pathway analysis[73–75]. Green shading highlights the presence of gene orthologs in the P. monodon transcriptome.
Figure 7Presence of eukaryotic basal transcription factor sequences based on KEGG pathway analysis[73–75]. Green shading highlights the presence of gene in the P. monodon transcriptome.