| Literature DB >> 32308726 |
Xiujuan Zhang1, Jiabin Zhou1, Linmiao Li1, Wenzhong Huang1, Hafiz Ishfaq Ahmad1, Huiming Li1, Haiying Jiang1, Jinping Chen1.
Abstract
BACKGROUND: Sturgeons (Acipenseriformes) are polyploid chondrostean fish that constitute an important model species for studying development and evolution in vertebrates. To better understand the mechanisms of reproduction regulation in sturgeon, this study combined PacBio isoform sequencing (Iso-Seq) with Illumina short-read RNA-seq methods to discover full-length genes involved in early gametogenesis of the Amur sturgeon, Acipenser schrenckii.Entities:
Keywords: Amur sturgeon, Acipenser schrenckii; Early gametogenesis; Gonad transcriptome; Isoform sequencing
Year: 2020 PMID: 32308726 PMCID: PMC7147073 DOI: 10.1186/s12983-020-00355-z
Source DB: PubMed Journal: Front Zool ISSN: 1742-9994 Impact factor: 3.172
Fig. 1Histological characteristics of ovary (a and b) and testes (c and d) from 3-year-old A. schrenckii individuals. OI, ovarian lamellae; OL, ovarian lumen; PG, primary growth oocyte; BM, basement membrane; N, oocyte nucleus; Nu, oocyte nucleoli; OG, oogonia; BV, blood vessels; Lo, lobule; SG, spermatogonia; SC, Sertoli cell; The gonadal tissues were stained with hematoxylin and eosin (HE staining). Scale bar in A = 100 μm and in B, C, D = 50 μm
Fig. 2Distribution of nonredundant full-length unigenes from the gonad transcriptome of A. schrenckii by the PacBio Sequel platform
Description of Iso-Seq from the gonads of A. schrenckii by PacBio Sequel platform
| species | SMRT cells | Subreads base (G) | Reads of Insert (ROIs) | nonfull-length reads | Full-length reads | FLNC reads | Consensus transcripts | Mean length (bp) | High-quality consensus transcripts | Low-quality consensus transcripts | Unigenes |
|---|---|---|---|---|---|---|---|---|---|---|---|
| 2 | 50.04 | 1,260,958 | 358,153 | 869,918 | 860,617 | 461,596 | 2782 | 335,067 | 125,969 | 164,618 |
Statistics of full-length unigene annotation performed with seven different databases
| Database | NO. transcripts annotated | Annotated rate (%) |
|---|---|---|
| NO.unigene | 164,618 | – |
| NR | 153,455 | 93.21 |
| KOG | 151,313 | 91.92 |
| Pfam | 134,181 | 81.51 |
| SwissProt | 102,427 | 62.22 |
| KEGG | 87,086 | 52.90 |
| GO | 72,758 | 44.20 |
| eggNOG | 47,463 | 28.83 |
| Total | 154,006 | 93.55 |
Searching for genes putatively involved in early gametogenesis from the full-length gonad transcriptome of A. scipenserki
| Gene Symbol | Gene Description | Length (bp) | NO. Unigene | NO. Unigene with ORF |
|---|---|---|---|---|
| Dmrt1 | double sex and mab-3 related transcription factor 1 | 2893&3843 | 2 | 0 |
| Dmrt2 | double sex and mab-3 related transcription factor 2 | 2291 | 1 | 1 |
| DmrtA1 | double sex and mab-3 related transcription factor A1 | 1422–2140 | 4 | 3 |
| DmrtA2 | double sex and mab-3 related transcription factor A2 | 2297–3868 | 12 | 12 |
| DmrtB1 | doublesex and mab-3 related transcription factor B1 | 2325–2377 | 4 | 0 |
| Amh | anti-müllerian hormone | 2071–3443 | 5 | 5 |
| Foxl2 | forkhead box protein L2 | 1802–2143 | 3 | 3 |
| Bmp15 | bone morphogenetic protein 15 | 1906–3389 | 11 | 7 |
| Cyp19a | cytochrome P450 1A | 1757–3305 | 50 | 49 |
| Ctnnb1 | catenin beta-1 | 3074–3598 | 13 | 13 |
| OCT4 | POU2 | 1830–4694 | 104 | 100 |
| Lhx1 | LIM homeobox 1 | 1213 | 1 | 1 |
| Sf1 | steroidogenic acute regulatory protein | 2674 | 1 | 1 |
| Rspo1 | R spondin 1 | 2372 | 1 | 1 |
| Sox3 | SRY box 3 | 1335–3125 | 56 | 55 |
| Sox4 | SRY box 4 | 2257–3666 | 5 | 4 |
| Sox5 | SRY box 5 | 1500–6536 | 7 | 5 |
| Sox6 | SRY box 6 | 1491 | 1 | 0 |
| Sox7 | SRY box 7 | 1679–2211 | 15 | 13 |
| Sox8 | SRY box 8 | 2619–3152 | 7 | 7 |
| Sox9 | SRY-related HMG box gene 9 | 2657–3491 | 6 | 6 |
| Sox10 | SRY box 10 | 1968 | 1 | 1 |
| Sox18 | SRY box 18 | 2439 | 1 | 1 |
| AR | androgen receptor | 2941–8079 | 6 | 5 |
| Vasa | vasa | 2284–2876 | 30 | 27 |
| ERɑ, ERβ | estrogen receptor | 1956–6770 | 17 | 11 |
| Pdgfrα | platelet-derived growth factor receptor alpha | 3552 | 1 | 0 |
| Hsd11b2 | 11-beta-hydroxysteroid dehydrogenase | 2243–3278 | 3 | 2 |
| Lhx9 | LIM homeobox 9 | 3300&3332 | 2 | 2 |
| Emx2 | homeobox protein EMX2 | 2058 | 1 | 1 |
| Fshr | follicle-stimulating hormone receptor | 4201–5062 | 4 | 4 |
| Gnrhr | gonadotropin-releasing hormone receptor | 815&1002 | 2 | 0 |
| Igf I | insulin-like growth factor I | 2729&4884 | 2 | 2 |
| Dkk1 | dickkopf-related protein 1 | 803–1948 | 9 | 7 |
| Wt1 | wilms tumor protein homolog | 3322 | 1 | 1 |
| Fgfr2 | fibroblast growth factor receptor 2 | 3916 | 1 | 1 |
| VR | vitellogenin receptor | 1224&3294 | 2 | 1 |
| Fem1 | fem-1 homolog A | 1763–4084 | 13 | 10 |
| Gsdf | gonadal soma derived factor | 1602–2461 | 12 | 7 |
| Spo11 | meiotic recombination protein | 1652–1797 | 5 | 3 |
| Fstb2 | follistatin b2 | 3181 | 1 | 1 |
| Ozf6 | oocyte zinc finger protein 6 | 1506–4396 | 47 | 35 |
| Ozf7 | oocyte zinc finger protein 7 | 1487–4252 | 21 | 17 |
| Ozf22 | oocyte zinc finger protein 22 | 3066 | 1 | 0 |
| Hsp | heat shock protein | 479–3651 | 7 | 7 |
| Hsp70 | heat shock protein 70 | 971–4279 | 64 | 59 |
| Hsp75 | heat shock protein 75 | 2289–2505 | 3 | 0 |
| Hsp90 | heat shock protein 90 | 403–7147 | 48 | 35 |
| Kpi1 | kunitz-type protease inhibitor 1 | 2541–4371 | 13 | 10 |
| Gsf1 | gametocyte-specific factor 1 | 592–6520 | 27 | 24 |
| ATRX | ATRX | 1417–8148 | 21 | 16 |
| Sap2 | spermatogenesis-associated protein 2 | 2141–4829 | 26 | 25 |
| Sap5 | spermatogenesis-associated protein 5 | 2419–3622 | 13 | 10 |
| Sap6 | spermatogenesis-associated protein 6 | 2173–4421 | 10 | 10 |
| Sap7 | spermatogenesis-associated protein 7 | 1666–3201 | 14 | 11 |
| Sap13 | spermatogenesis-associated protein 13 | 4707–6505 | 5 | 5 |
| Sap17 | spermatogenesis-associated protein 17 | 644–2371 | 3 | 2 |
| Sap20 | spermatogenesis-associated protein 20 | 3934 | 1 | 1 |
| Sap22 | spermatogenesis-associated protein 22 | 1406–1699 | 4 | 3 |
| Sap24 | spermatogenesis-associated protein 24 | 1691–5433 | 4 | 0 |
Fig. 3Difference analysis between the testes and ovaries of the full-length unigenes from the gonad transcriptome of A. schrenckii by the PacBio Sequel platform.a Venn diagram showing the distribution of testis-specific and ovary-specific unigenes. b and c the significantly enriched KEGG pathways in ovary-biased and testis-biased DEUs, respectively (corrected P < 0.05)
Differential expression genes (DEGs) between ovary and testis of A.schrenckii
| Unigene ID | Nr annotation ID | Gene symbol | Log2(OV/TE)a | FDR |
|---|---|---|---|---|
| F01_cb14613_c0/f2p1/2177 | gi|742,229,200|ref.|XP_010900597.1| | Foxl2 | 4.52 | 0.0005 |
| F01_cb22280_c14221/f1p1/3240 | gi|348,162,225|gb|AEN19340.2| | Cyp19a | 5.87 | 5.40E-07 |
| F01_cb1470_c88/f1p1/2748 | gi|341,579,644|gb|AEK81554.1| | OCT4 | 10.15 | 1.9E-05 |
| F01_cb13001_c172/f9p26/1973 | gi|573,890,381|ref.|XP_006632952.1| | Sox3 | 11.59 | 2.71E-12 |
| F01_cb14611_c1/f1p0/2047 | gi|573,875,677|ref.|XP_006626146.1| | Sox7 | 3.81 | 0.0027 |
| F01_cb3737_c21/f1p0/1926 | gi|573,890,192|ref.|XP_006632858.1| | Bmp15 | 4.75 | 4.3E-05 |
| F01_cb15637_c23/f29p6/1372 | gi|573,884,921|ref.|XP_006630539.1| | Dkk1 | 4.79 | 2.4E-05 |
| F01_cb12452_c41/f1p0/1127 | gi|742,184,514|ref.|XP_010888880.1| | Gsf1 | 4.08 | 0.0023 |
| F01_cb8778_c103/f1p0/1710 | gi|281,485,070|gb|ADA70351.1| | Hsp | 3.69 | 0.0012 |
| F01_cb13963_c503/f1p126/1797 | gi|393,809,558|gb|AFM75819.2| | Hsp70 | 7.16 | 2.6E-06 |
| F01_cb22281_c130670/f1p2/2775 | gi|407,067,884|gb|AFS88930.1| | Hsp90 | 7.38 | 3.07E-08 |
| F01_cb15971_c8/f2p1/1691 | gi|632,957,530|ref.|XP_007894530.1| | Sap24 | 3.48 | 0.0067 |
| F01_cb7465_c17/f5p1/2348 | gi|699,584,302|ref.|XP_009864039.1| | DmrtB1 | −7.16 | 0.0069 |
| F01_cb11611_c2106/f1p6/2113 | gi|299,773,492|gb|ADJ38820.1| | Amh | inf b | 0.0035 |
| F01_cb6093_c9/f1p0/2436 | gi|51,599,123|gb|AAU08212.1| | Sox4 | −4.33 | 0.0057 |
| F01_cb1461_c13/f1p0/2264 | gi|573,890,923|ref.|XP_006633221.1| | Sox5 | −6.84 | 0.0002 |
| F01_cb7554_c7/f1p0/2619 | gi|410,586,767|gb|AFV74660.1| | Sox8 | −4.39 | 0.0065 |
| F01_cb5663_c3/f1p0/3843 | gi|634,859,782|gb|AHZ62758.1| | Sox9 | −5.38 | 0.0003 |
| F01_cb6729_c44/f1p0/3330 | gi|307,548,813|dbj|BAJ19133.1| | Vasa | −4.94 | 0.0053 |
| F01_cb14310_c1/f1p0/2372 | gi|573,887,629|ref.|XP_006631587.1| | Rspo1 | −5.86 | 0.0008 |
| F01_cb223_c11/f1p0/6792 | gi|211,926,878|dbj|BAG82652.1| | ERβ | −4.55 | 0.0052 |
| F01_cb11611_c593/f1p1/1932 | gi|469,923,991|emb|CCP19133.1| | Gsdf | −8.82 | 2E-05 |
| F01_cb11694_c1/f1p0/3278 | gi|556,825,703|gb|AGZ80888.1| | hsd11b2 | −6.49 | 0.0089 |
| F01_cb10604_c20163/f1p0/4596 | gi|573,902,226|ref.|XP_006638838.1| | Fshr | −7.03 | 1.3E-05 |
| F01_cb10604_c21398/f1p1/4095 | gi|573,889,889|ref.|XP_006632708.1| | ATRX | −4.87 | 0.0021 |
| F01_cb10604_c55816/f1p3/3702 | gi|742,250,963|ref.|XP_010867269.1| | Ozf6 | −4.22 | 0.0021 |
| F01_cb10604_c33459/f1p1/4252 | gi|742,242,686|ref.|XP_010862827.1| | Ozf7 | −4.77 | 0.0009 |
| F01_cb22280_c10055/f2p1/3617 | gi|573,907,277|ref.|XP_006641358.1| | Sap2 | −4.11 | 0.0043 |
| F01_cb22280_c4150/f1p0/3622 | gi|573,881,115|ref.|XP_006628756.1| | Sap5 | −6.48 | 0.0006 |
| F01_cb22282_c89869/f1p0/2184 | gi|573,883,855|ref.|XP_006630079.1| | Sap6 | −4.09 | 0.0098 |
aThe relative expression level of genes in ovary compared to that in testis. OV ovary, TE testis
b“inf” indicates tissue-specific expression pattern
DEUs of the early gametogenesis related GO terms and KEGG pathways in A.schrenckii
| Biological process | reproductive process | 1188 | 143 |
| Biological process | reproduction | 1134 | 151 |
| MAPK signaling pathway | ko04010 | 1834 | 153 |
| Oocyte meiosis | ko04114 | 1731 | 277 |
| Progesterone-mediated oocyte maturation | ko04914 | 1398 | 201 |
| Wnt signaling pathway | ko04310 | 1298 | 123 |
| TGF-beta signaling pathway | ko04350 | 768 | 108 |
| GnRH signaling pathway | ko04912 | 712 | 67 |
| Apoptosis | ko04210 | 563 | 45 |
| Hedgehog signaling pathway | ko04340 | 399 | 40 |
| Oxytocin signaling pathway | ko04921 | 203 | 18 |
| Estrogen signaling pathway | ko04915 | 178 | 21 |
| Steroid hormone biosynthesis | ko00140 | 150 | 12 |
| Steroid biosynthesis | ko00100 | 119 | 13 |
| Prolactin signaling pathway | ko04917 | 68 | 3 |
| Ovarian steroidogenesis | ko04913 | 31 | 4 |
Fig. 4Alternative splicing (AS) analysis of the full-length gonad transcriptome of A. schrenckii.a Statistics of the full-length unigenes detected with AS events. b The cluster heatmap (Log2(FPKM+ 1) values) indicates the expression patterns of different alternative isoforms in the testes and ovaries of A. schrenckii. Vasa (unigene ID: F01_cb6729_c68/f1p2/2928) predicted with four alternative isoforms and Fem1 (unigene ID: F01_cb8161_c15/f1p2/1763) with five alternative isoforms were selected as samples. c Distribution of AS events in early gametogenesis related GO terms and signaling pathways
Fig. 5Identification of long noncoding RNAs (lncRNAs) and transcript factor (TF) analysis from the full-length gonad transcriptome of A. schrenckii. a Venn diagram of lncRNA prediction by four programs, including PLEK, CNCI, CPC and Pfam. b The top 20 abundant tems. c Transcript factor Sox family members were screened using AnimalTFDB alignment, SMART protein motif prediction and NR annotation
Characteristics of Sox9 gene sequences and their proteins
| Name | Full-length (bp) | ORF (bp) | 5’UTR (bp) | 3’UTR (bp) | Amino acids (aa) | Molecular weight (kDa) | Isoelectric point |
|---|---|---|---|---|---|---|---|
| Asi_Sox9 (AHZ62758.1) | 2145 | 1467 | 116 | 562 | 488 | 54.15 | 6.51 |
| Asc_Sox9–1 (F01_cb5663_c14/f1p0/2873) | 2873 | 1332 | 367 | 1174 | 443 | 48.98 | 6.58 |
| Asc_Sox9–2 (F01_cb5663_c6/f1p0/3396) | 3413 | 1467 | 347 | 1599 | 488 | 54.19 | 6.51 |
| Asc_Sox9–3 (F01_cb5663_c3/f1p0/3843) | 3429 | 1464 | 383 | 1582 | 487 | 53.94 | 6.48 |
| Asc_Sox9–4 (F01_cb5663_c11/f1p0/3469) | 3491 | 1290 | 604 | 1597 | 429 | 48.04 | 7.48 |
Fig. 6Gene structure of Asc_Sox9–1-4 and expression abundance (FKPM levels) in the testes and ovaries of A. schrenckii. a Achematic diagram of gene structure. The gray part indicates the UTR regions. The region between the black and vertical bars presents the SMART protein motifs. The diamond box shows conserved the HMG domain, and the red square indicates low complexity. b Expression abundance (FPKM levels) of Asc_Sox9–1-4 in testes and ovaries
Fig. 7The phylogenetic tree of the Sox9 protein sequences was constructed using the neighbor-joining method. The node values represent the percent bootstrap confidence level derived from replicates. The accession numbers of the Sox9 proteins are shown in Supplementary Table 9. The five classes are comprised of Mammalia, Aves, amphibian, Reptilian and Osteichthyes. Meanwhile, Sox2 from zebrafish Danio rerio (accession number: BAE48583.1) was chosen as the out-group protein sequence