| Literature DB >> 30011952 |
Krishna Bhattarai1, Weining Wang2, Zhe Cao3, Zhanao Deng4.
Abstract
Impatiens downy mildew (IDM) is a devastating disease to garden impatiens. A good understanding of IDM resistance in New Guinea impatiens is essential for improving garden impatiens resistance to this disease. The present study was conducted to sequence, assemble, annotate and compare the leaf transcriptomes of two impatiens cultivars differing in resistance to IDM, reveal sequence polymorphisms and identify candidate genes for IDM resistance. RNA-Seq was performed on cultivars Super Elfin® XP Pink (SEP) and SunPatiens® Compact Royal Magenta (SPR). De novo assembly of obtained sequence reads resulted in 121,497 unigenes with an average length of 1156 nucleotides and N50 length of 1778 nucleotides. Searching the non-redundant protein and non-redundant nucleotide, Swiss-Prot, Kyoto Encyclopedia of Genes and Genomes and Clusters of Orthologous Groups and Gene Ontology databases, resulted in annotation of 57.7% to 73.6% of the unigenes. Fifteen unigenes were highly similar to disease resistance genes and more abundant in the IDM-resistant cultivar than in the susceptible cultivar. A total of 22,484 simple sequence repeats (SSRs) and 245,936 and 120,073 single nucleotide polymorphisms (SNPs) were identified from SPR and SEP respectively. The assembled transcripts and unigenes, identified disease resistance genes and SSRs and SNPs sites will be a valuable resource for improving impatiens and its IDM resistance.Entities:
Keywords: New Guinea impatiens; RNA-Seq; candidate disease resistance gene; disease resistance; downy mildew; garden impatiens; leaf transcriptome
Mesh:
Year: 2018 PMID: 30011952 PMCID: PMC6073305 DOI: 10.3390/ijms19072057
Source DB: PubMed Journal: Int J Mol Sci ISSN: 1422-0067 Impact factor: 5.923
Summary of HiSeq sequencing data for two impatiens leaf transcriptomes.
| Samples | Total Raw Reads | Total Clean Reads | Total Clean Nucleotides (Mb) | Q20 (%) 1 | N (%) 2 | GC (%) |
|---|---|---|---|---|---|---|
| SunPatiens® Compact Royal Magenta (SPR) | 63,769,252 | 58,266,032 | 5826 | 97.59 | 0.01 | 44.79 |
| Super Elfin® XP Pink (SEP) | 62,996,124 | 58,834,368 | 5883 | 97.50 | 0.01 | 45.11 |
| Total | 126,765,376 | 117,100,400 | 11,710 |
1 Q20 (%) is the proportion of nucleotides with Q20 quality value greater than 20. 2 N (%) is the proportion of unknown nucleotides in clean reads.
Description of the impatiens transcriptome assembly and quality of unigenes and contigs.
| Types | Sample | Total Number | Total Length (nt) | Mean Length (nt) | N50 | Total Consensus Sequences | Distinct Clusters | Distinct Singletons |
|---|---|---|---|---|---|---|---|---|
| Contig | SunPatiens® Compact Royal Magenta (SPR) | 122,166 | 48,323,299 | 396 | 965 | |||
| Super Elfin® XP Pink (SEP) | 104,752 | 44,944,832 | 429 | 1052 | ||||
| Unigene | SunPatiens® Compact Royal Magenta (SPR) | 87,415 | 88,434,291 | 1012 | 1774 | 87,415 | 44,629 | 42,789 |
| Super Elfin® XP Pink (SEP) | 69,369 | 67,514,711 | 973 | 1726 | 69,369 | 27,516 | 41,853 | |
| All | 121,497 | 140,506,651 | 1156 | 1778 | 121,497 | 78,448 | 43,049 |
Annotation of impatiens unigenes using the NT (the non-redundant nucleotide database), NR (the non-redundant protein database), SWISS-PROT, COG (Clusters of Orthologous Groups of proteins), KEGG (Kyoto Encyclopedia of Genes and Genomes) and GO (Gene Ontology Consortium) databases.
| Database | Unigene Annotated | Percentage |
|---|---|---|
| NR | 89,490 | 73.66 |
| NT | 71,482 | 58.83 |
| Swiss-Prot | 59,403 | 48.89 |
| KEGG | 54,521 | 44.87 |
| COG | 37,576 | 30.93 |
| GO | 70,190 | 57.77 |
| Total annotated genes | 91,187 | 75.05 |
| Total unigenes | 121,497 | 100.00 |
Figure 1Distribution of similarities of impatiens unigenes with other plant species based on searching the non-redundant protein database according to (A) E-value (B) Similarity and (C). In relation to other species with unigenes similarity.
Classification of annotated unigene pathways according to categorical processes based on KEGG Pathway Mapping.
| Pathway Category | Number | Percentage |
|---|---|---|
| Cellular Processes | 4 | 3.13 |
| Environmental Information Processing | 3 | 2.34 |
| Genetic Information Processing | 21 | 16.41 |
| Metabolism | 96 | 75.00 |
| Organismal Systems | 4 | 3.13 |
| Total | 128 | 100 |
Figure 2Top 25 pathways to which impatiens leaf transcriptomes were mapped using the Kyoto Encyclopedia of Genes and Genome Database (KEGG).
Number of impatiens unigenes involved in top 25 pathways listed by KEGG database.
| Pathway | Genes with Pathway Annotation | % of Genes |
|---|---|---|
| Metabolic pathways | 11,493 | 24.70 |
| Biosynthesis of secondary metabolites | 5590 | 12.01 |
| Plant-pathogen interaction | 3491 | 7.50 |
| Plant hormone signal transduction | 3224 | 6.93 |
| Spliceosome | 2001 | 4.30 |
| RNA transport | 1760 | 3.78 |
| Endocytosis | 1488 | 3.20 |
| Protein processing in endoplasmic reticulum | 1484 | 3.19 |
| RNA degradation | 1366 | 2.94 |
| Glycerophospholipid metabolism | 1334 | 2.87 |
| Purine metabolism | 1237 | 2.66 |
| Starch and sucrose metabolism | 1186 | 2.55 |
| Ribosome biogenesis in eukaryotes | 1171 | 2.52 |
| Ribosome | 1117 | 2.40 |
| mRNA surveillance pathway | 1033 | 2.22 |
| Pyrimidine metabolism | 1027 | 2.21 |
| Ubiquitin mediated proteolysis | 967 | 2.08 |
| Ether lipid metabolism | 943 | 2.03 |
| Circadian rhythm–plant | 764 | 1.64 |
| ABC transporters | 758 | 1.63 |
| Amino sugar and nucleotide sugar metabolism | 712 | 1.53 |
| Phenylpropanoid biosynthesis | 660 | 1.42 |
| Pentose and glucoronate interconversions | 585 | 1.26 |
| Glycolysis / Gluconeogenesis | 584 | 1.25 |
| Oxidative phosphorylation | 564 | 1.21 |
| Total | 46,539 | 100 |
Figure 3Functional classes of unigenes based on the Cluster of Orthologous Group (COG).
Figure 4Classification of impatiens unigenes based on the Gene Ontology database.
List of impatiens unigenes selected based on more than 2-fold higher abundancy in IDM-resistant impatiens (SPR) compared to IDM-susceptible impatiens (SEP) that were annotated to previously known genes for disease resistance or defense using NR (non-redundant protein), NT (non-redundant nucleotide), SWISS-PROT, KEGG (Kyoto Encyclopedia of Genes and Genomes), COG (Clusters of Orthologous Groups of proteins) and GO (Gene Ontology) databases.
| Unigene ID | Length (bp) | Abundance (FPKM 1) in SPR | Abundance (FPKM 1) in SEP | FDR | Score | Annotation | |
|---|---|---|---|---|---|---|---|
| CL10.Contig1 | 5073 | 47.92 | 0.04 | 7.13 × 10−24 | 2.66 × 10−23 | 164 | Putative disease resistance protein At5g05400 |
| CL12505.Contig5 | 4428 | 2.40 | 0.45 | 4.91 × 10−29 | 2.04 × 10−28 | 689 | Putative disease resistance RPP13-like protein 1 |
| CL12796.Contig1 | 2489 | 9.11 | 0.38 | 8.20 × 10−108 | 7.28 × 10−107 | 103 | LETM1-like protein |
| CL14017.Contig1 | 2600 | 18.20 | 0.95 | 2.35 × 10−212 | 3.47 × 10−211 | 805 | Cation efflux family protein |
| CL14259.Contig1 | 2360 | 69.99 | 0.00 | 0 | 0 | 400 | Disease resistance protein (CC-NB-LRR class) family |
| CL1855.Contig4 | 2257 | 96.38 | 0.02 | 5.25 × 10−09 | 1.17 × 10−08 | 473 | Putative disease resistance RPP13-like protein 1 |
| CL2138.Contig3 | 2505 | 5.82 | 0.00 | 1.69 × 10−88 | 1.32 × 10−87 | 75.5 | Uncharacterized mitochondrial protein AtMg00860 |
| CL3381.Contig3 | 4495 | 22.67 | 0.06 | 0 | 0 | 543 | Probable disease resistance protein At5g45510 |
| CL8803.Contig4 | 2187 | 4.60 | 0.02 | 2.46 × 10−59 | 1.51 × 10−58 | 54.7 | Leucine-rich repeat (LRR) family protein |
| CL8803.Contig5 | 2321 | 2.61 | 0.02 | 1.99 × 10−35 | 9.18 × 10−35 | 73.2 | Leucine-rich repeat (LRR) family protein |
| Unigene2713 | 2869 | 7.12 | 0.00 | 5.52 × 10−94 | 4.49 × 10−93 | 254 | Receptor-like protein 35 |
| Unigene13240 | 2592 | 24.54 | 0.05 | 1.00 × 10−123 | 9.81 × 10−123 | 414 | Putative disease resistance protein At1g59780 |
| Unigene2747 | 2575 | 37.32 | 0.00 | 2.08 × 10−23 | 7.68 × 10−23 | 308 | Putative disease resistance protein At3g14460 |
| Unigene3612 | 3233 | 2.05 | 0.04 | 0.000496 | 0.000733 | 177 | DNA-binding storekeeper protein-related transcriptional regulator |
| Unigene6688 | 2671 | 13.23 | 0.00 | disease resistance protein (TIR-NB-LRR class) |
1 FPKM: Fragments per kilobase of transcripts per million mapped reads.
Figure 5Distribution of simple sequence repeats (SSR) repeat types in impatiens unigenes.
Figure 6Description and classification of single nucleotide polymorphisms (SNPs) identified in SPR and SEP lines.