| Literature DB >> 31332220 |
Fengqi Li1, Chunyan Wu2, Mengzhu Gao1, Mengmeng Jiao1, Cheng Qu1, Asier Gonzalez-Uriarte3, Chen Luo4.
Abstract
The London Planetree (Platanus acerifolia) are present throughout the world. The tree is considered a greening plant and is commonly planted in streets, parks, and courtyards. The Sycamore lace bug (Corythucha ciliata) is a serious pest of this tree. To determine the molecular mechanism behind the interaction between the London Planetree and the Sycamore lace bug, we generated a comprehensive RNA-seq dataset (630,835,762 clean reads) for P. acerifolia by sequencing both infected and non-infected leaves of C. ciliata using the Illumina Hiseq 4000 system. We assembled the transcriptomes using the Trinity De Novo assembly followed by annotation. In total, 121,136 unigenes were obtained, and 80,559 unigenes were successfully annotated. From the 121,136 unigenes, we identified 3,010,256 SNPs, 39,097 microsatellites locus, and 1,916 transcription factors. The transcriptomic dataset we present are the first reports of transcriptome information in Platanus species and will be incredibly useful in future studies with P. acerifolia and other Platanus species, especially in the areas of genomics, molecular biology, physiology, and population genetics.Entities:
Mesh:
Substances:
Year: 2019 PMID: 31332220 PMCID: PMC6646373 DOI: 10.1038/s41597-019-0111-9
Source DB: PubMed Journal: Sci Data ISSN: 2052-4463 Impact factor: 6.444
Characteristics of the Platanus acerifolia transcriptome sequencing project.
| Item | Description |
|---|---|
| Investigation type | Eukaryote transcriptome |
| Sampling date | 5 July 2017 |
| Geographic location | 9°56′32.60″N E116°16′53.73″E |
| Tissue type | Leaves |
| Sequencing technology | Illumina Hiseq 4000 |
| Assembly | Trinity |
| Finishing strategy | Contigs |
| Data accessibility | SRP156640 |
Sequencing statistics for the Corythucha ciliata infection and the non-infected leaves of Platanus acerifolia.
| Sample | Raw reads | Raw bases | Clean reads | Clean bases | Error rate (%) | Q20 (%) | Q30 (%) | GC content (%) |
|---|---|---|---|---|---|---|---|---|
| CK24h_1 | 57,342,752 | 8,658,755,552 | 56,754,744 | 8,464,801,247 | 0.0254 | 97.91 | 93.7 | 47.34 |
| CK24h_2 | 54,566,682 | 8,239,568,982 | 54,031,744 | 8,064,184,328 | 0.025 | 98.1 | 94.13 | 46.27 |
| CK24h_3 | 49,729,386 | 7,509,137,286 | 49,215,194 | 7,349,321,932 | 0.0258 | 97.76 | 93.31 | 46.13 |
| CK48h_1 | 54,194,972 | 8,183,440,772 | 53,612,120 | 7,995,900,899 | 0.026 | 97.68 | 93.17 | 46.45 |
| CK48h_2 | 46,890,842 | 7,080,517,142 | 46,481,784 | 6,938,620,927 | 0.0247 | 98.22 | 94.41 | 46.21 |
| CK48h_3 | 55,931,148 | 8,445,603,348 | 55,345,842 | 8,252,377,339 | 0.0255 | 97.89 | 93.64 | 46.16 |
| T24h_1 | 52,994,650 | 8,002,192,150 | 52,365,790 | 7,812,178,204 | 0.0259 | 97.71 | 93.22 | 46.34 |
| T24h_2 | 49,989,010 | 7,548,340,510 | 49,508,518 | 7,398,086,642 | 0.0249 | 98.13 | 94.2 | 46.39 |
| T24h_3 | 53,964,742 | 8,148,676,042 | 53,420,354 | 7,974,482,782 | 0.0252 | 97.99 | 93.87 | 46.4 |
| T48h_1 | 55,717,534 | 8,413,347,634 | 55,181,552 | 8,233,891,794 | 0.0254 | 97.92 | 93.7 | 46.21 |
| T48h_2 | 53,547,068 | 8,085,607,268 | 52,953,230 | 7,899,595,930 | 0.0256 | 97.84 | 93.52 | 46.3 |
| T48h_3 | 52,456,100 | 7,920,871,100 | 51,964,890 | 7,753,024,901 | 0.0249 | 98.14 | 94.21 | 46.65 |
| Total | 637,324,886 | 96,236,057,786 | 630,835,762 | 94,136,466,925 |
Assembly information of the Platanus acerifolia transcriptome dataset.
| Type | Resource |
|---|---|
| Total transcripts number | 199,080 |
| Total unigenes number | 121,136 |
| Total sequence base | 202,095,905 |
| Largest | 18,931 |
| Smallest | 201 |
| Average length | 1015.15 |
| N50 | 1579 |
| E90N50 | 1762 |
| GC percent | 41.50 |
| Mean mapped reads | 1442.7636918 |
| TransRate score | 0.25922 |
| BUSCO score | 72.1% (3.3%) |
Annotation information of the Platanus acerifolia transcriptome dataset.
| Unigene number (%) | |
|---|---|
| NR | 76,203 (62.91) |
| Swiss-Prot | 52,758 (43.55) |
| Pfam | 48,527 (40.06) |
| COG | 8,849 (7.31) |
| GO | 57,997 (47.88) |
| KEGG | 34,193 (28.23) |
| Total_annotation | 80,559 (66.5) |
| Total | 121,136 (100) |
Fig. 1Species distribution of the NR annotation.
Fig. 2The COG functional categories.
Fig. 3GO classification of the Platanus acerifolia unigenes.
Fig. 4KEGG pathway distribution of the Platanus acerifolia unigenes.
Fig. 5Percentage of different SSR motifs.
Fig. 6Transcription factor family of the Platanus acerifolia unigenes.
| Design Type(s) | transcription profiling design • sequence assembly objective • sequence annotation objective |
| Measurement Type(s) | transcription profiling assay |
| Technology Type(s) | RNA sequencing |
| Factor Type(s) | experimental condition • temporal_interval |
| Sample Characteristic(s) | Platanus × hispanica • leaf |