| Literature DB >> 35741786 |
Xiaomao Cheng1, Fei Wang1, Wen Luo1, Jingge Kuang1, Xiaoxia Huang1.
Abstract
Pistacia chinensis Bunge (P. chinensis), a dioecious plant species, has been widely found in China. The female P. chinensis plants are more important than male plants in agricultural production, as their seeds can serve as an ideal feedstock for biodiesel. However, the sex of P. chinensis plants is hard to distinguish during the seedling stage due to the scarcity of available transcriptomic and genomic information. In this work, Illumina paired-end RNA sequencing assay was conducted to unravel the transcriptomic profiles of female and male P. chinensis flower buds. In total, 50,925,088 and 51,470,578 clean reads were obtained from the female and male cDNA libraries, respectively. After quality checks and de novo assembly, a total of 83,370 unigenes with a mean length of 1.3 kb were screened. Overall, 64,539 unigenes (77.48%) could be matched in at least one of the NR, NT, Swiss-Prot, COG, KEGG, and GO databases, 71 of which were putatively related to the floral development of P. chinensis. Additionally, 21,662 simple sequence repeat (SSR) motifs were identified in 17,028 unigenes of P. chinensis, and the mononucleotide motif was the most dominant type of repeats (52.59%) in P. chinensis, followed by dinucleotide (22.29%), trinucleotide (20.15%). The most abundant repeats were AG/CT (13.97%), followed by AAC/GTT (6.75%) and AT/TA (6.10%). Based on these SSR, 983 EST-SSR primers were designed, 151 of which were randomly chosen for validation. Of these validated EST-SSR markers, 25 SSR markers were found to be polymorphic between male and female plants. One SSR marker, namelyPCSSR55, displayed excellent specificity in female plants, which could clearly distinguish between male and female P. chinensis. Altogether, our findings not only reveal that the EST-SSR marker is extremely effective in distinguishing between male and female P. chinensis but also provide a solid framework for sex determination of plant seedlings.Entities:
Keywords: EST-SSR; Pistacia chinensis; sex identification; transcriptome
Mesh:
Substances:
Year: 2022 PMID: 35741786 PMCID: PMC9222763 DOI: 10.3390/genes13061024
Source DB: PubMed Journal: Genes (Basel) ISSN: 2073-4425 Impact factor: 4.141
Figure 1Male (A) and female (B) inflorescences of P. chinensis.
Raw data output of Illumina RNA sequencing.
| Item | Total Raw Reads | Total Clean Reads | Total Clean Nucleotides (nt) | Q20 | Q30 | GC Content |
|---|---|---|---|---|---|---|
| PC | 54,895,796 | 50,925,088 | 7638,763,200 | 98.29% | 96.56% | 43.52% |
| PX | 55,210,442 | 51,470,578 | 7720,586,700 | 98.25% | 96.43% | 43.43% |
Note: PC mean female bulk and PX mean male bulk. Q20 and Q30 percentages are the proportion of nucleotides with quality value larger than 20 and 30, respectively; GC percentage is the proportion of guanine and cytosine nucleotides among total nucleotides.
Quality of the assembled RNA sequencing data.
| Item | Total Number (nt) | Total Length (nt) | Mean Length (nt) | N50 | Distinct Clusters | Distinct Singletons |
|---|---|---|---|---|---|---|
| PC_Contigs | 89,442 | 93,938,149 | 1050 | 1739 | ||
| PX_Contigs | 89,724 | 96,205,186 | 1072 | 1768 | ||
| PC_Unigenes | 65,520 | 80,247,826 | 1225 | 1903 | 29,689 | 35,831 |
| PX_Unigenes | 65,752 | 82,513,165 | 1255 | 1933 | 30,903 | 34,849 |
| All_Unigenes | 83,370 | 110,503,948 | 1325 | 2027 | 42,960 | 40,410 |
Figure 2Sequence length distribution of the unigenes in P. chinensis.
Summary of the annotated and assembled sequences of P. chinensis.
| Sequence File |
|
|
|
|
|
| All Annotated Unigenes | All Assembled Unigenes |
|---|---|---|---|---|---|---|---|---|
| Number of Unigenes | 58,543 | 59,316 | 38,879 | 36,136 | 47,049 | 40,643 | 64,539 | 83,370 |
| Annotated/All-Unigene (%) | 70.22 | 71.15 | 46.63 | 43.34 | 56.43 | 48.75 | 77.48 |
Figure 3Characterization of the homologous sequences of P. chinensis unigenes blasted against the non-redundant database. (A) Frequency distribution of the unigene sequences according to their E values (cut-off value = 1.0 × 10−5). (B) Percentage of the top matched unigene sequences in P. chinensis. (C) Species distribution of the matched homologous sequences with an E-value of 1.0 × 10−5.
Figure 4Gene ontology annotations of the assembled unigenes.
Figure 5Results for the clusters of orthologous groups classification.
Top 20 pathways with highest sequence numbers.
| Rank | Pathway | Genes within the Coverage of Pathway Annotations ( | Pathway ID |
|---|---|---|---|
| 1 | Metabolic pathways | 8208 (22.71%) | ko01100 |
| 2 | Biosynthesis of secondary metabolites | 4045 (11.19%) | ko01110 |
| 3 | Plant-pathogen interaction | 2516 (6.96%) | ko04626 |
| 4 | Plant hormone signal transduction | 1729 (4.78%) | ko04075 |
| 5 | Spliceosome | 1336 (3.7%) | ko03040 |
| 6 | RNA transport | 1203 (3.33%) | ko03013 |
| 7 | Pyrimidine metabolism | 1096 (3.03%) | ko00240 |
| 8 | Purine metabolism | 1086 (3.01%) | ko00230 |
| 9 | Ribosome | 981 (2.71%) | ko03010 |
| 10 | Protein processing in endoplasmic reticulum | 962 (2.66%) | ko04141 |
| 11 | Endocytosis | 811 (2.24%) | ko04144 |
| 12 | Starch and sucrose metabolism | 757 (2.09%) | ko00500 |
| 13 | Ubiquitin mediated proteolysis | 746 (2.06%) | ko04120 |
| 14 | RNA polymerase | 719 (1.99%) | ko03020 |
| 15 | Ribosome biogenesis in eukaryotes | 684 (1.89%) | ko03008 |
| 16 | Glycerophospholipid metabolism | 677 (1.87%) | ko00564 |
| 17 | RNA degradation | 648 (1.79%) | ko03018 |
| 18 | mRNA surveillance pathway | 600 (1.66%) | ko03015 |
| 19 | Phenylpropanoid biosynthesis | 574 (1.59%) | ko00940 |
| 20 | Glycolysis/Gluconeogenesis | 530 (1.47%) | ko00010 |
Sequence searching for the SSR markers of P. chinensis.
| Searching Item | Numbers |
|---|---|
| Total number of examined sequences | 83,370 |
| Total size of examined sequences (bp) | 110,503,948 |
| Total number of identified SSR markers | 21,662 |
| Number of SSR-containing sequences | 17,028 |
| Number of sequences with >1 SSR | 3545 |
| Number of SSR markers found in compound formation | 1376 |
Figure 6Electrophoresis-based visualization of PCR bands amplified with the EST-SSR primer pair PcSSR51. Lanes F and M denote female bulk and male bulk, respectively. (A) Lanes 1–20 and 21–40 represent the PCR products of male and female P. chinensis, respectively. (B) Lanes with white braces 1–102 indicate the PCR products of male P. chinensis; lanes with white braces 1–100 represent the PCR products of female P. chinensis; and white arrowheads denote the female-specific EST-SSR marker amplified with PcSSR55 primer pair.