| Literature DB >> 31466296 |
Dengdong Wang1,2, Xiyang Chen3,4, Xinhui Zhang4, Jia Li4, Yunhai Yi3,4, Chao Bian4, Qiong Shi3,4,5, Haoran Lin1,2, Shuisheng Li6,7, Yong Zhang8,9, Xinxin You10,11.
Abstract
Giant groupers, the largest grouper type in the world, are of economic importance in marine aquaculture for their rapid growth. At the same time, bacterial and viral diseases have become the main threats to the grouper industry. Here, we report a high-quality genome of a giant grouper sequenced by an Illumina HiSeq X-Ten and PacBio Bioscience Sequel platform. A total of 254 putative antimicrobial peptide (AMP) genes were identified, which can be divided into 34 classes according to the annotation of the Antimicrobial Peptides Database (APD3). Their locations in pseudochromosomes were also determined. Thrombin-, lectin-, and scolopendin-derived putative AMPs were the three largest parts. In addition, expressions of putative AMPs were measured by our transcriptome data. Two putative AMP genes (gapdh1 and gapdh2) were involved in glycolysis, which had extremely high expression levels in giant grouper muscle. As it has been reported that AMPs inhibit the growth of a broad spectrum of microbes and participate in regulating innate and adaptive immune responses, genome sequencing of this study provides a comprehensive cataloging of putative AMPs of groupers, supporting antimicrobial research and aquaculture therapy. These genomic resources will be beneficial to further molecular breeding of this economically important fish.Entities:
Keywords: Epinephelus lanceolatus; antimicrobial peptide; genome sequencing; giant grouper; growth
Mesh:
Substances:
Year: 2019 PMID: 31466296 PMCID: PMC6780625 DOI: 10.3390/md17090503
Source DB: PubMed Journal: Mar Drugs ISSN: 1660-3397 Impact factor: 5.118
Figure 1K-mer distribution of the giant grouper with a k-mer size of 17. The x-axis and y-axis are the sequencing depth and percentage of unique 17-mers, respectively.
Genome assembly statistics of the giant grouper.
| Criteria | Contig | Scaffold |
|---|---|---|
| Number | 3207 | 3187 |
| Total length (bp) | 1,128,030,970 | 1,128,030,990 |
| Longest (bp) | 9,533,321 | 9,533,321 |
| N50 (bp) | 1,469,414 | 1,505,601 |
| N90 (bp) | 209,611 | 210,944 |
| >2 kb | 3182 | 3162 |
Characteristics of pseudochromosomes of Epinephelus lanceolatus.
| Chr | Length (Mb) | Number of Genes | Number of Scaffolds |
|---|---|---|---|
| 1 | 39.05 | 917 | 53 |
| 2 | 41.84 | 899 | 54 |
| 3 | 20.61 | 374 | 18 |
| 4 | 50.74 | 1198 | 78 |
| 5 | 41.95 | 912 | 53 |
| 6 | 50.39 | 978 | 72 |
| 7 | 44.01 | 1062 | 55 |
| 8 | 54.01 | 1051 | 56 |
| 9 | 52.56 | 1127 | 70 |
| 10 | 45.93 | 1209 | 49 |
| 11 | 41.12 | 820 | 41 |
| 12 | 35.95 | 918 | 45 |
| 13 | 54.06 | 1359 | 56 |
| 14 | 47.27 | 947 | 60 |
| 15 | 38.31 | 894 | 35 |
| 16 | 39.82 | 728 | 57 |
| 17 | 40.89 | 858 | 47 |
| 18 | 34.17 | 675 | 58 |
| 19 | 31.81 | 898 | 24 |
| 20 | 22.54 | 491 | 24 |
| 21 | 45.46 | 1007 | 68 |
| 22 | 44.25 | 944 | 80 |
| 23 | 42.07 | 1035 | 43 |
| 24 | 40.81 | 905 | 60 |
| total | 999.69 | 22,206 | 1256 |
Figure 2Circos atlas representation of pseudochromosome information. (I) The length of each pseudochromosome. (II) GC content of 100-kb genomic intervals (GC content from 0.25 to 0.51). (III) Density of gene distribution in each 100-kb genomic interval. (IV) Schematic presentation of major interchromosomal relationships in the giant grouper genome, which represents the collinearity of genes between two chromosomes.
The top 20 transcripts per million (TPM) rankings of antimicrobial peptides (AMPs) or putative AMP precursors in each of the three transcriptomic datasets.
| Ranking | Muscle | Liver | Brain |
|---|---|---|---|
| 1 | GAPDH1 1 (49,379.59) | Hemoglobin1 (24,013.91) | Hemoglobin1 (11,531.97) |
| 2 | GAPDH2 (5017.39) | Hemoglobin12 (23,219.52) | Hemoglobin12 (7262.71) |
| 3 | Hemoglobin12 (2440.09) | sOT2 2 (14,410.88) | sOT2 (1416.06) |
| 4 | Hemoglobin1 (2295.59) | Antiproteinase1 (11,271.67) | β2-Microglobin1 (1136.57) |
| 5 | Ap-s 3 (339.63) | Antiproteinase5 (7898.22) | Neuropeptide5 (858.07) |
| 6 | β2-Microglobin1 (258.26) | Antiproteinase2 (6566.73) | BPTI4 4 (654.21) |
| 7 | β2-Microglobin4 (137.71) | Thrombin1 (5759.76) | Neuropeptide6 (631.31) |
| 8 | Ubiquicidin (86.92) | GAPDH1 (5378.38) | Saposin2 (572.89) |
| 9 | BPTI16 (85.56) | BPTI12 (4143.13) | Lectin12 (535.93) |
| 10 | Saposin2 (81.02) | Thrombin23 (2039.84) | Synuclein (350.56) |
| 11 | BPTI7 (57.05) | Antiproteinase3 (1806.51) | β2-Microglobin4 (340.17) |
| 12 | Lectin25 (43.21) | Thrombin29 (1544.98) | Amyloid2 (250.76) |
| 13 | Thrombin53 (40.79) | Thrombin22 (1313.59) | Amyloid1 (249.58) |
| 14 | Thrombin31 (38.01) | Thrombin46 (1201.45) | Lysozyme2 (208.82) |
| 15 | BPTI15 (32.09) | Thrombin6 (1039.43) | Ubiquicidin (191.88) |
| 16 | CcAMP 5 (30.21) | β2-Microglobin1 (662.14) | Thymosin2 (177.76) |
| 17 | BPTI14 (27.33) | Thrombin64 (621.69) | Thrombin45 (171.27) |
| 18 | Lectin3 (23.87) | Thrombin42 (590.54) | Ubiquitin10 (119.27) |
| 19 | Ubiquitin1 (23.61) | Thrombin47 (554.85) | LEAP-2_2 6 (116.44) |
| 20 | Ubiquitin5 (21.93) | Thrombin18 (473.18) | Lectin19 (103.89) |
1 glyceraldehyde 3-phosphate dehydrogenase; 2 an AMP derived from Pelodiscus sinensis; 3 an AMP purified from Argopecten purpuratus; 4 bovine pancreatic trypsin inhibitor; 5 an AMP from Coridius chinensis; 6 liver-expressed antimicrobial peptide 2. NOTE: TPM values of putative AMPs are exhibited in parentheses.
Figure 3Pseudochromosome lengths, genes involved in AMPs and glycolysis, and Ca2+-regulating genes of the giant grouper. NOTE: Black bars represent AMPs; red bars represent genes involved glycolysis; green bars represent genes involved Ca2+ regulation; and blue bars represent the gapdhs, which are both AMPs and glycolysis.
Figure 4(a) AMP distribution and (b) KEGG metabolic pathway annotation of AMPs.
Location of glycolytic- and Ca2+-regulating genes in the giant grouper pseudochromosome.
| Gene Name | Chr | Gene ID | Function Type |
|---|---|---|---|
|
| Chr1 | longdun_GLEAN_10010987 | Ca2+ regulating |
|
| Chr1 | longdun_GLEAN_10010985 | Ca2+ regulating |
|
| Chr2 | longdun_GLEAN_10005384 | Glycolytic |
|
| Chr3 | longdun_GLEAN_10021266 | Ca2+ regulating |
|
| Chr4 | longdun_GLEAN_10018260 | Ca2+ regulating |
|
| Chr4 | longdun_GLEAN_10017734 | Ca2+ regulating |
|
| Chr5 | longdun_GLEAN_10022627 | Glycolytic |
|
| Chr6 | longdun_GLEAN_10022325 | Glycolytic |
|
| Chr8 | longdun_GLEAN_10012880 | Glycolytic |
|
| Chr8 | longdun_GLEAN_10018335 | Glycolytic |
|
| Chr9 | longdun_GLEAN_10019679 | Glycolytic |
|
| Chr10 | longdun_GLEAN_10019814 | Glycolytic |
|
| Chr11 | longdun_GLEAN_10018530 | Glycolytic |
|
| Chr12 | longdun_GLEAN_10020822 | Glycolytic |
|
| Chr14 | longdun_GLEAN_10008616 | Glycolytic |
|
| Chr15 | longdun_GLEAN_10008973 | Ca2+ regulating |
|
| Chr17 | longdun_GLEAN_10012515 | Glycolytic |
|
| Chr18 | longdun_GLEAN_10002289 | Glycolytic |
|
| Chr19 | longdun_GLEAN_10017174 | Glycolytic |
|
| Chr19 | longdun_GLEAN_10017191 | Glycolytic |
|
| Chr21 | longdun_GLEAN_10010008 | Ca2+ regulating |
|
| Chr22 | longdun_GLEAN_10014462 | Glycolytic |
|
| Chr22 | longdun_GLEAN_10014466 | Glycolytic |
|
| Chr22 | longdun_GLEAN_10014467 | Glycolytic |
1 troponin I; 2 troponin T; 3 phosphoglycerate kinase; 4 troponin C; 5 phosphoglycerate mutase; 6 phosphoglucomutase; 7 phosphoglucose isomerase; 8 pyruvate kinase; 9 phosphofructokinases; 10 fructose-bisphosphate aldolase; 11 ryanodine receptor; 12 enolases; 13 glyceraldehyde phosphate dehydrogenase; and 14 triosephosphate isomerase.
Congruent relationship and transcription levels of the 2 growth-related putative AMP genes.
| Putative AMP Gene Name | Gene ID | Query AMP (AMP ID in APD3 Database) | TPM | ||
|---|---|---|---|---|---|
| Brain | Liver | Muscle | |||
|
| longdun_GLEAN_10017174 | Skipjack tuna GAPDH-related antimicrobial peptide (SJGAP) (2680) | 4.77 | 5378.38 | 49379.59 |
|
| longdun_GLEAN_10014462 | Yellowfin tuna glyceraldehyde-3-phosphate dehydrogenase-related antimicrobial peptide (YFGAP) (2012) | 12.19 | 0.17 | 5017.39 |
Figure 5Structure of the 2 growth-related putative AMP genes (gapdh1 and gapdh2) in zebrafish, giant groupers, and yellowfin tuna. The pink, blue, and green boxes represent coding sequence (CDS) of genes in zebrafish, giant groupers, and yellowfin tuna, respectively. Multiple sequence alignments of partial gapdh1 (a) and gapdh2 (b) that match to YFGAP and SJGAP from zebrafish, yellowfin tuna, and giant grouper were displayed. Blue and yellow marks represent >80% and >50% identity, respectively.