| Literature DB >> 28503371 |
Bing He1, Yingang Li1,2, Zhouxian Ni1, Li-An Xu1.
Abstract
BACKGROUND: Phoebe chekiangensis is a rare tree species that is only distributed in south-eastern China. Although this species is famous for its excellent wood properties, it has not been extensively studied at the molecular level.Entities:
Keywords: Next-generation sequencing; Phoebe chekiangensis; SNP prediction; Sanger method; Software comparison
Year: 2017 PMID: 28503371 PMCID: PMC5428327 DOI: 10.7717/peerj.3193
Source DB: PubMed Journal: PeerJ ISSN: 2167-8359 Impact factor: 2.984
Geographical locations and main climatic conditions for nine populations of P. chekiangensis.
| Sampling site | Population type | No. of individuals sampled | Longitude (E) | Latitude (N) | Altitude (m) |
|---|---|---|---|---|---|
| Xihu Lake, Hangzhou | Wild population | 30 | 120.06° | 30.12° | 135 |
| Yinzhou, Ningbo | Wild population | 8 | 121.47° | 29.47° | 280 |
| Lin’an | Population of ancient trees | 7 | 119.26° | 30.19° | 355 |
| Taishun | Wild population | 8 | 119.45° | 27.22° | 556 |
| Qixi, Kaihua | Population of ancient trees | 9 | 118.22° | 29.23° | 371 |
| Huabu, Kaihua | Wild population | 8 | 118.16° | 29.01° | 152 |
| Qingyuan | Wild population | 7 | 118.55° | 27.44° | 366 |
| Jiangshan | Wild population | 11 | 118.39° | 28.50° | 173 |
| Wuyuan | Population of ancient trees | 16 | 117.50° | 29.12° | 78 |
Summary of P. chekiangensis base quality.
| Sample | Raw reads | Clean reads | Clean bases | Error (%) | Q20 (%) | Q30 (%) | GC |
|---|---|---|---|---|---|---|---|
| NM_1 | 67,268,601 | 64,118,847 | 6.41G | 0.03 | 98.30 | 93.99 | 47.80 |
| NM_2 | 67,268,601 | 64,118,847 | 6.41G | 0.03 | 97.99 | 93.47 | 47.87 |
Figure 1Length distribution of assembled transcripts.
Summary for the annotation of P. chekiangensis unigenes.
| Number of unigenes | Percentage (%) | Functional categories | |
|---|---|---|---|
| Annotated in NR | 26,693 | 55.59 | |
| Annotated in NT | 10,641 | 22.16 | |
| Annotated in KEGG | 9,132 | 19.02 | 31 |
| Annotated in SwissProt | 19,828 | 41.29 | |
| Annotated in PFAM | 20,268 | 42.21 | |
| Annotated in GO | 21,164 | 44.08 | 51 |
| Annotated in KOG | 12,799 | 26.65 | 26 |
| Annotated in all databases | 3,952 | 8.23 | |
| Annotated in at least one database | 29,714 | 61.88 | |
| Total unigenes | 48,011 |
Figure 2Functional gene ontology classification of P. chekiangensis unigenes.
Figure 3KEGG annotation of P. chekiangensis unigenes.
All of the unigenes were divided into five subgroups: (A) Cellular processes; (B) Environmental information processing; (C) Genetic information processing; (D) Metabolism; (E) Organismal systems.
Figure 4Frequency distribution of SNP densityin P. chekiangensis according to predicted results using SOAPsnp.
Primers and variants of 25 feasible SNPs in P. chekiangensis.
| Unigene | SNP locus | PCR primers (F and R) | Allele | Unigene function |
|---|---|---|---|---|
| comp100159_c0 | 325 | F: GAGGAAAGAAGCTTATGG | T/C | Unknown |
| R: TGCATGCGACTAACAACT | ||||
| comp100433_c0 | 567 | F: TCAGAATTGCTGACTTGT | G/A | DNA-directed RNA polymerase subunit beta |
| R: CATCAATACCAATTGCCAA | ||||
| comp102740_c0 | 416 | F: AGTAGTGTGGATCCAACCC | C/A | Unknown |
| R: ACTATCTCTATGCATATCA | ||||
| comp39317_c0 | 244 | F: TCTAAAATGATGAAAACGA | A/C | Pentatricopeptide repeat-containing protein |
| R: AGCAGTTTGAATACATGG | ||||
| comp42809_c0 | 57;103 | F: CAAGACAAATCTTGGATT | G/A;G/C | Thiopurine S-methyltransferase |
| R: GAAACGGAGATTGAAGTTT | ||||
| comp44031_c0 | 891 | F: TGTTAACTCTAATGGCATC | A/G | Lysine histidine transporter-like 7 |
| R: AAGCATCAGAGAGTGGAG | ||||
| comp45316_c1 | 930 | F: GCCGTTCCCTCGAGCCTTG | A/C | myb proto-oncogene protein |
| R: GAAGAAGATGAGGGTGGG | ||||
| comp41583_c0 | 523 | F: TCCTGCTAATTGTTGAGAC | C/G | Peptidyl-prolyl cis-trans isomerase NIMA-interacting 4 |
| R: TCATAGGTTATCCATAAT | ||||
| comp44876_c0 | 494 | F: CTGCAGAGAAGAAGGAGAG | C/T | Jasmonate ZIM domain-containing protein |
| R: AATGTGATAAGAGCCTTTC | ||||
| comp44881_c0 | 153 | F: GGGTGAGATCTGAAAAGAAA | A/G | Unknown |
| R: GACCGTTGAATTGAAAGG | ||||
| comp45780_c0 | 54;209 | F: CATGCGTTTGAAAGGAAGC | A/G;G/T | RNA 3′ terminal phosphate cyclase |
| R: GTTAGGATGATTGTCATG | ||||
| comp47295_c0 | 846 | F: TCCACCTTACAAGATTTA | C/G | Putative glutamine amidotransferase YLR126C-like |
| R: TACGAAGGCTTCGTCATCA | ||||
| comp48234_c0 | 664 | F: TTCATCATCTGTCGTCGAA | C/T | 30S ribosomal protein 2 |
| R: CTCGGATGCTCAAGAGAAA | ||||
| comp48580_c0 | 331 | F: GTTAAAATGAATTGTTTTT | A/G | Unnamed protein product |
| R: AATGTGTCAAGAATACTAC | ||||
| comp50565_c0 | 295;324 | F: CGCATGGCGTACAGCCCTA | C/A;A/T | Nucleotide binding protein, putative |
| R: TTGAGCAGAAGCTTGACCT | ||||
| comp50815_c0 | 148;252 | F: CGGAGGCTCTCGCGGTCTC | T/G;G/C | Putative lipase ROG1-like |
| R: ACAAAGACAGAAGGCCAG | ||||
| comp531362_c0 | 256;402 | F: CCAAGACTTAAGAAGGGG | T/G;G/A | UPF0481 protein At3g47200-like isoform 1 |
| R: TATCCACCTCCCTATACAG | ||||
| comp5334_c0 | 363 | F: CACGATCGGGCCGAGGAC | C/G | Unknown |
| R: TGCCGGTGCGGCACGAGCT | ||||
| comp5410_c0 | 586 | F: GCAGCTTCTTCTTCTTCT | C/A | Surfeit locus protein |
| R: GATCCAGTGATGAATTGG | ||||
| comp544568_c0 | 257 | F: TCTACTGGAGAGGCCAAC | A/T | Pre-mRNA-splicing factor ATP-dependent RNA helicase |
| R: TCTTCAGGAGCTCTCTGTT |
Figure 5Homogenization distribution curve of P. chekiangensis transcripts.
The vertical axis represents the average values of sequencing depth.
Figure 6Comparison of SNP prediction results when using three different SNP calling packages.