| Literature DB >> 27966611 |
Abstract
Adults of the lady beetle species Harmonia axyridis (Pallas) are bred artificially en masse for classic biological control, which requires egg-laying by the H. axyridis ovary. Development-related genes may impact the growth of the H. axyridis adult ovary but have not been reported. Here, we used integrative time-series RNA-seq analysis of the ovary in H. axyridis adults to detect development-related genes. A total of 28,558 unigenes were functionally annotated using seven types of databases to obtain an annotated unigene database for ovaries in H. axyridis adults. We also analysed differentially expressed genes (DEGs) between samples. Based on a combination of the results of this bioinformatics analysis with literature reports and gene expression level changes in four different stages, we focused on the development of oocyte reproductive stem cell and yolk formation process and identified 26 genes with high similarity to development-related genes. 20 DEGs were randomly chosen for quantitative real-time PCR (qRT-PCR) to validate the accuracy of the RNA-seq results. This study establishes a robust pipeline for the discovery of key genes using high-throughput sequencing and the identification of a class of development-related genes for characterization.Entities:
Mesh:
Substances:
Year: 2016 PMID: 27966611 PMCID: PMC5155419 DOI: 10.1038/srep39109
Source DB: PubMed Journal: Sci Rep ISSN: 2045-2322 Impact factor: 4.379
Sequencing data statistics.
| Samples | Clean Reads | Clean Data | GC content (%) | % ≥Q30 |
|---|---|---|---|---|
| Stage 1 | 22,364,281 | 5,591,070,250 | 41.06 | 93.16 |
| Stage 2 | 18,460,991 | 4,615,247,750 | 40.42 | 92.42 |
| Stage 3 | 20,636,344 | 5,159,086,000 | 39.83 | 92.7 |
| Stage 4 | 24,531,820 | 6,132,955,000 | 37.67 | 92.47 |
Samples: the name of the sample; Clean Reads: total number of paired-end reads in the clean data; Clean Data: total number of bases in the clean data; GC content: the percentage of G and C in the total bases; % ≥Q30: the percentage of bases whose quality value was ≥30.
Length range: different length ranges of the transcripts/unigenes.
| Length range | Transcripts | Unigenes |
|---|---|---|
| 300–500 | 28,646 (44.598%) | 23,270 (48.938%) |
| 500–1000 | 17,718 (27.585%) | 12,919 (27.169%) |
| 1000–2000 | 10,797 (16.810%) | 6993 (14.707%) |
| 2000+ | 7070 (11.007%) | 4368 (9.186%) |
| Total number | 64,231 | 47,550 |
| Total length | 61,551,345 | 41,919,319 |
| N50 length | 1448 | 1282 |
| Mean length | 958.28 | 881.58 |
Unigene annotation statistics.
| Annotated databases | Unigenes | ≥300 bp | ≥1000 bp |
|---|---|---|---|
| COG | 7296 | 3522 | 3774 |
| GO | 12,985 | 7831 | 5154 |
| KEGG | 8144 | 4634 | 3510 |
| KOG | 17,235 | 9655 | 7580 |
| Pfam | 16,822 | 8692 | 8130 |
| Swiss-Prot | 16,863 | 9072 | 7791 |
| nr | 28,186 | 17,907 | 17,907 |
| All | 28,558 | 18,258 | 10,300 |
Number of differentially expressed genes.
| DEGs Set | DEGs Number | Up-regulated | Down-regulated |
|---|---|---|---|
| S1vsS2 | 962 | 134 | 828 |
| S1vsS3 | 1334 | 524 | 810 |
| S1vsS4 | 9640 | 4994 | 4646 |
The threshold value of significance in S1Vs S2, S1Vs S3, or S1Vs S4 were FPKM fold change >2 and FDR <0.05.
Figure 1Scatter diagrams representing genome-wide expression profiling comparisons between S1 and the other three stage.
Note: Each black point represents a non-significant difference in gene expression, and each red point indicates a significant differentially expressed gene. Abscissa: log10 (Counts); Ordinate: log2 (FC) expression difference multiple values between two gene samples. a: S1vs. S2, b: S1vs. S3, c:S1vs. S4.
Basic information for the identified development-related genes in the H. axyridis (Pallas) adult ovaryin nr.
| Category or Unigene ID | Nucleotide length (bp) | Full length | Protein length (aa) | Homologous function in nr | Homology species & Accession number | E-value | GenBank accession numbers |
|---|---|---|---|---|---|---|---|
| TRINITY_DN20651_c0_g14 | 397 | Yes | 102 | embryonic polarity protein dorsal-like isoform X1 | Polistes Canadensis L XP_015172818.1 | 7E-17 | KY020084 |
| TRINITY_DN19519_c1_g1 | 3186 | Yes | 517 | bone morphogenetic protein receptor type-1B isoform X1 | 0.0 | KY020085 | |
| TRINITY_DN20146_c0_g1 | 4758 | Yes | 821 | bone morphogenetic protein receptor type-2 | 0.0 | KY020086 | |
| TRINITY_DN17181_c3_g1 | 4956 | Yes | 1444 | DE cadherin protein | 0.0 | KY020087 | |
| TRINITY_DN19612_c0_g1 | 2222 | 5′ | 737 | insulin receptor substrate 1-like | 0.0 | KY020088 | |
| TRINITY_DN19612_c0_g9 | 352 | 5′ | 105 | insulin receptor substrate 1 isoform X6 | 3E-13 | KY020089 | |
| TRINITY_DN20112_c0_g2 | 3802 | Yes | 792 | ecdysone-induced protein 75B, isoforms C/D isoform X3 | 0.0 | KY020090 | |
| TRINITY_DN19749_c1_g1 | 2905 | Yes | 379 | fringe glycosyltransferase | 4E-173 | KY020091 | |
| TRINITY_DN20315_c0_g1 | 3282 | Yes | 792 | CREB-binding protein isoform X3 | 0.0 | KY032003 | |
| TRINITY_DN20898_c0_g2 | 453 | 3′ | 142 | histone acetyltransferase KAT2A | 7E-49 | KY020092 | |
| TRINITY_DN20651_c0_g18 | 400 | 3′ | 119 | Dorsal1 | 8E-48 | KY031985 | |
| TRINITY_DN20799_c0_g3 | 3812 | 5′ | 1135 | tight junction protein ZO-2 isoform X3 | 0.0 | KY031986 | |
| TRINITY_DN18711_c0_g1 | 1845 | 5′ | 543 | neurogenic locus Notch protein | 0.0 | KY031987 | |
| TRINITY_DN9215_c0_g1 | 1518 | Yes | 100 | gamma-secretase subunit pen-2 | 2E-42 | KY031988 | |
| TRINITY_DN15150_c0_g1 | 3585 | Yes | 884 | protein strawberry notch isoform X1 | 0.0 | KY031989 | |
| TRINITY_DN21369_c1_g1 | 6522 | Yes | 375 | notch isoform X2 | 0.0 | KY031990 | |
| TRINITY_DN17863_c0_g1 | 4186 | Yes | 1285 | protein jagged-1b isoform X2 | 0.0 | KY031991 | |
| TRINITY_DN20651_c0_g5 | 561 | 5′ | 185 | Dorsal2 | 8E-57 | KY031993 | |
| TRINITY_DN15528_c0_g1 | 1444 | Yes | 279 | suppressor of cytokine signalling 2 | Tribolium castaneum. Herbst XP_008196669.1 | 1E-79 | KY031994 |
| TRINITY_DN17727_c1_g1 | 1766 | Yes | 481 | RNA-binding protein 41 | Tribolium castaneum. Herbst XP_008191526.1 | 5E-39 | KY031995 |
| TRINITY_DN18156_c0_g1 | 3760 | Yes | 380 | sprouty-related protein with EVH-1 domain isoform X3 | 6E-145 | KY031996 | |
| TRINITY_DN20474_c0_g2 | 2362 | Yes | 250 | protein sprouty | 3E-70 | KY031997 | |
| TRINITY_DN20711_c0_g4 | 2083 | 5′ | 589 | phosphatidylinositol 4,5-bisphosphate 3-kinase catalytic subunit delta isoform | 2E-50 | KY031999 | |
| TRINITY_DN20508_c0_g4 | 308 | Yes | 34 | vitellogenin receptor | 3E-35 | KY032000 | |
| TRINITY_DN20502_c0_g1 | 5471 | Yes | 1785 | vitellogenin | 0.0 | KY032001 | |
| TRINITY_DN20736_c0_g1 | 5505 | Yes | 1791 | vitellogenin | 0.0 | KY032002 | |
Figure 2Clustering of development-related genes.
The colour represents the gene expression levels in the samples in different columns. Different columns represent different samples, and different lines represent different genes.
Figure 3GO annotation classification of development-related genes converted to unigenes annotation database of ovaries in H. axyridis adult.
The abscissa is the GO classification, the ordinate left is the gene number percentage, and the right is the number of genes.
Figure 4KEGG pathway annotation classification of development-related genes converted to unigenes annotation database of ovaries in H. axyridis adult.
The abscissa is the gene number, and the ordinate left is the KEGG classification.
Figure 5The correlation between mRNA expression levels using qPCR and mRNA sequencing.
The blue bar denotes expression values based on qPCR, and the red line denotes expression values based on mRNA sequencing.
Sequences of the primers for qRT-PCR analysis of 20 random genes.
| GeneID | Location of primer | Primer | Product size | Tm | Expression Level |
|---|---|---|---|---|---|
| β-actin | F | CTATGTCGGAGCCCATCACT | 112 | 60 | |
| D | AGCAGTTGTAGCTTCTCCGT | ||||
| TRINITY_DN20799_c0_g3 | F | GATCCCAGTGTTGTGATGGC | 113 | 60 | down |
| D | GCGTGTCTGGTTCTGCTATG | ||||
| TRINITY_DN20651_c0_g5 | F | TTATATAACACGCTTCTAAG | 124 | 60 | up |
| D | TCCTGAGGAACTTGTTGTTC | ||||
| TRINITY_DN21369_c1_g1 | F | CAACAACAACGGTACCTGCA | 105 | 60 | down |
| D | CGAGCAAGGGTTGGATAAGC | ||||
| TRINITY_DN17863_c0_g1 | F | TGGTGGGCGATTATGTCTGT | 126 | 60 | down |
| D | GGCAAGCACAGTGATAGTCG | ||||
| TRINITY_DN20651_c0_g13 | F | CCTCACCCTCACAACTTGGT | 105 | 60 | up |
| D | TACATCCCTCACGACCAACC | ||||
| TRINITY_DN17712_c0_g1 | F | AGAGAACCAACCCCATCTGG | 118 | 60 | up |
| D | TCTCATCACCTTCGAGCTCC | ||||
| TRINITY_DN20851_c0_g4 | F | AGCCTTTGTACTCGGTTCTGA | 111 | 60 | down |
| D | GCCAGTGAAATCCGGTCTTC | ||||
| TRINITY_DN19101_c0_g2 | F | GGATCAGCAAATATTTCTGGGGA | 118 | 60 | up |
| D | ACCAGAGGTCCCCAGAAATA | ||||
| TRINITY_DN19788_c0_g2 | F | TAGAGAAGCCAGGCGACAAA | 124 | 60 | up |
| D | CGGCACCCATTAACAGGATC | ||||
| TRINITY_DN18367_c0_g4 | F | CCATCGTAGCGCCATTTACC | 98 | 60 | down |
| D | ATTGGGGTAGTTGGCGAAGT | ||||
| TRINITY_DN13604_c0_g1 | F | GACCAGGGCTTGTTCCAAAG | 124 | 60 | down |
| D | CTCTCTGCTGAACCTTTCGC | ||||
| TRINITY_DN19101_c0_g1 | F | CTGGAATGGTTGGTGGGTTC | 117 | 60 | up |
| D | AACGCTGCCATGTTTCCAAT | ||||
| TRINITY_DN16334_c3_g1 | F | TCTCGTCCCAGTACGATTCG | 113 | 60 | down |
| D | TGACTTTGGCACGTTCACAG | ||||
| TRINITY_DN6859_c0_g1 | F | GATCTCCGTTTCCAATCGGC | 114 | 60 | up |
| D | GACACGCTTGGCATGGATAG | ||||
| TRINITY_DN21302_c0_g2 | F | TGAGGAATGGGAGGAGGTCT | 145 | 60 | down |
| D | CCATTCCTCATCCACCCGG | ||||
| TRINITY_DN13782_c0_g1 | F | TGGTCCGTACAAAGCAAACG | 136 | 60 | up |
| D | ACGGAATCTGTGGGGTCTTT | ||||
| TRINITY_DN27602_c0_g1 | F | ACTTCACACGGATTTGCGAG | 120 | 60 | up |
| D | GTTTGCTAAGATCGCCAGGG | ||||
| TRINITY_DN19725_c0_g1 | F | GGTGGAAGTGAAGCGCAAAA | 96 | 60 | down |
| D | GTCTCCCAAACTCTCCTCGA | ||||
| TRINITY_DN11810_c0_g1 | F | GAGCCAAGCACTTCGAGATG | 115 | 60 | up |
| D | TCCACTGATCAAACTGGCGA | ||||
| TRINITY_DN15723_c0_g2 | F | TGGTGTCTGATTGTTTGGCG | 134 | 60 | up |
| D | TCGTCAGATATGGGTACGTCC |
F indicates forward primer, and D indicates reverse primer. Tm indicates the temperature at which 50% of the primers and complementary sequence form the double-stranded DNA molecule.