| Literature DB >> 31174467 |
Qian Zhao1,2, Weimin Zhong3, Weiyi He1,2, Yiying Li1,2,3, Yaqing Li1,2,4, Tianpu Li1,2,4, Liette Vasseur1,2,5, Minsheng You6,7.
Abstract
BACKGROUND: The diamondback moth (DBM), Plutella xylostella (L.), is a major pest of cruciferous crops worldwide. While the species has become a model for genomics, post-transcriptional mechanisms associated with development and sex determination have not been comprehensively studied and the lack of complete structure of mRNA transcripts limits further research.Entities:
Keywords: Alternative splicing; Development; Diamondback moth (DBM); IsoSeq; RNA-seq; Sex-determination
Mesh:
Substances:
Year: 2019 PMID: 31174467 PMCID: PMC6556048 DOI: 10.1186/s12864-019-5838-3
Source DB: PubMed Journal: BMC Genomics ISSN: 1471-2164 Impact factor: 3.969
Data overview of RNA-seq and IsoSeq
| Sequencing pattern | Transcripts No. | Non-redundant transcripts No. | Gene coverage (%) | Novel transcripts |
|---|---|---|---|---|
| RNA-seq | 913,851 | 217,535 | 76% | 77,648 |
| IsoSeq | 52,093 | 16,398 | 33% | 2652 |
Fig. 1Summary of the sequencing data. a Novel transcripts verified by RT-PCR. b Basic information of the sequencing data based on Iso-seq. Figure on the left showed read length distributions of Iso-seq; Figure on the right showed read length distributions of Iso-seq among exons, introns and intergenic regions. c Basic information of the sequencing data based on RNA-seq. Figure on the left showed read length distributions of RNA-seq. Figure on the right showed read length distributions of RNA-seq among exons, introns and intergenic regions
Fig. 2Examples of falsely-annotated genes in the DBM genome. a Gene structure of mis-annotated genes. The Iso-seq evident was marked by purple and the RNA-seq evident was marked by blue. The box indicated exons and the solid lines represented introns. b The distribution of Pearson’s correction coefficients by pairwise misannotated genes. c Gene expression pattern analysis of the mis-annotated genes. Heatmap of expression changes along developmental stages was shown by hierarchical clustering. Expression values were calculated as log10FPKM. Abbreviations: egg, first for first-stage larvae, second for second-stage larvae, third for third-stage larvae, 4th_f for female forth-stage larvae, 4th_m for male forth-stage larvae, pupa_f for female pupa, pupa_m for male pupa, adult_f for female adult, and adult_m for male adult
Types of alternative splicing events identified with two different sequencing methods
| Sequencing method | IR | ES-EE | AA | AD | Total |
|---|---|---|---|---|---|
| RNA-seq | 378 | 466 | 306 | 317 | 1151 |
| IsoSeq | 496 | 242 | 245 | 224 | 920 |
| Interaction | 66 | 98 | 65 | 53 | 267 |
| Total | 1804 | ||||
Fig. 3Alternative splicing events in P. xylostella. a Four types of AS events identified in this study. b CIRCOS visualization of data at the genome-wide level. Only 9 scaffolds are shown. From outer to inner circle display GC coverage, gene density, novel genes and AS events respectively. The GC content value higher than the average GC content value was marked by red while lower than average was marked by blue. c Examples of AS events in P. xylostella. The box indicated exons and the solid lines represented introns. d Examples of genes showing different types of AS transcripts. The box indicated exons and the solid lines represented introns
Alternative splicing events in different developmental stages and sexes where numerical values represent the instar stages, and f for female and m for male
| Splicing model | egg | 1st | 2nd | 3rd | 4th_f | 4th_m | pupa_f | pupa_m | adult_f | adult_m | Total |
|---|---|---|---|---|---|---|---|---|---|---|---|
| IR | 105 | 64 | 69 | 59 | 58 | 61 | 78 | 79 | 50 | 82 | 705 |
| ES-EE | 74 | 84 | 79 | 71 | 57 | 76 | 89 | 105 | 74 | 104 | 813 |
| AA | 52 | 48 | 45 | 42 | 31 | 42 | 51 | 61 | 50 | 57 | 479 |
| DD | 64 | 64 | 53 | 49 | 43 | 58 | 64 | 74 | 65 | 68 | 602 |
| Total | 259 | 233 | 221 | 201 | 170 | 215 | 254 | 282 | 218 | 277 |
Number of proportion of stage-specific expressed genes with AS events with values 5 times higher than any other stages
| egg | 1st | 2nd | 3rd | 4th_m | 4th_f | pupa_m | pupa_f | adult_m | adult_f | Total | |
|---|---|---|---|---|---|---|---|---|---|---|---|
| AS events | 4 | 0 | 0 | 0 | 2 | 1 | 3 | 1 | 8 | 4 | 23 |
| Stage-specific genes | 81 | 15 | 0 | 6 | 27 | 33 | 68 | 90 | 198 | 105 | 617 |
| Proportion (%) | 4.9 | 0 | 0 | 0 | 7.4 | 3 | 4.4 | 1.1 | 4 | 3.8 | 3.7 |
Fig. 4Expression profiling based on RNA-seq. Heatmap showing expression changes among different developmental stages. Expression values were calculated as log10FPKM. See Fig. 2 for abbreviations of development stages. a Genes showing stage-specific expression patterns. b Genes showing larval-stage specific expression patterns and their function annotations. c Genes showing sex-specific expression patterns. Figure on the left showing female-specific expression. Figure on the right showing male-specific expression
Go annotation of specific proteins with significantly high expression patterns at larval stages and differentially between males and females
| GO ID | Functional description | FDR | |
|---|---|---|---|
| Proteins with specific expression patterns in larvae stages | |||
| Cellular Component | |||
| GO:0009331 | Glycerol-3-phosphate dehydrogenase complex | 0.003500 | 0.041700 |
| GO:1990204 | Oxidoreductase complex | 0.012500 | 0.074800 |
| Molecular Function | |||
| GO:0004252 | Serine-type endopeptidase activity | 0.000005 | 0.000118 |
| GO:0008236 | Serine-type peptidase activity | 0.000010 | 0.000118 |
| GO:0017171 | Serine hydrolase activity | 0.000010 | 0.000118 |
| GO:0004175 | Endopeptidase activity | 0.000058 | 0.000533 |
| GO:0070011 | Peptidase activity, acting on L-amino acid peptides | 0.000311 | 0.002300 |
| Biological Process | |||
| GO:0006508 | Proteolysis | 0.000540 | 0.015665 |
| GO:0071704 | Organic substance metabolic process | 0.001703 | 0.019009 |
| GO:0008152 | Metabolic process | 0.001966 | 0.019009 |
| GO:0005975 | Carbohydrate metabolic process | 0.003261 | 0.020091 |
| GO:0046168 | Glycerol-3-phosphate catabolic process | 0.003464 | 0.020091 |
| Proteins with different AS events and expression patterns between males and females | |||
| Cellular Component | |||
| GO:0043234 | Protein complex | 0.005020 | 0.153015 |
| GO:0031981 | Nuclear lumen | 0.007547 | 0.153015 |
| GO:0098796 | Transcription elongation factor complex | 0.009963 | 0.153015 |
| GO:0034702 | Organelle lumen | 0.012343 | 0.153015 |
| GO:0034703 | intracellular organelle lumen | 0.012343 | 0.153015 |
| Molecular Function | |||
| GO:0005524 | ATP binding | 0.010270 | 0.116441 |
| GO:0016773 | phosphotransferase activity, alcohol group as acceptor | 0.000154 | 0.010854 |
| GO:0005344 | Oxygen transporter activity | 0.004442 | 0.100065 |
| GO:0001883 | Purine nucleoside binding | 0.016417 | 0.119449 |
| GO:0022890 | Inorganic cation transmembrane transporter activity | 0.038358 | 0.165173 |
| Biological Process | |||
| GO:0006464 | Cellular protein modification process | 0.000382 | 0.030106 |
| GO:0006812 | Cation transports | 0.000519 | 0.030106 |
| GO:0006468 | Protein phosphorylation | 0.000578 | 0.030106 |
| GO:0006811 | Ion transport | 0.010780 | 0.030106 |
| GO:0042278 | Purine nucleoside metabolic process | 0.024593 | 0.220958 |
Only p value < 0.05 are listed. FDR False discovery rate. For each of the functions, at most 5 GO terms are listed. The GO terms are all detailed in Additional file 5: Table S4
Fig. 5Isoforms produce functional variants. a Different alternative splicing transcripts of tra and sxl in P. xylostella and its expression patterns. The transcripts of IsoSeq and RNA-seq are merged together to generate the full-length transcripts, which are marked as green. b Genes showing sex-specific expression patterns and different alternative splicing in different sexes. c Functional annotations of these genes