| Literature DB >> 27378489 |
Xin Yao1,2, Yun-Hong Tan1, Ying-Ying Liu3, Yu Song1,2, Jun-Bo Yang4, Richard T Corlett1.
Abstract
Aquifoliaceae is the largest family in the campanulid order Aquifoliales. It consists of a single genus, Ilex, the hollies, which is the largest woody dioecious genus in the angiosperms. Most species are in East Asia or South America. The taxonomy and evolutionary history remain unclear due to the lack of a robust species-level phylogeny. We produced the first complete chloroplast genomes in this family, including seven Ilex species, by Illumina sequencing of long-range PCR products and subsequent reference-guided de novo assembly. These genomes have a typical bicyclic structure with a conserved genome arrangement and moderate divergence. The total length is 157,741 bp and there is one large single-copy region (LSC) with 87,109 bp, one small single-copy with 18,436 bp, and a pair of inverted repeat regions (IR) with 52,196 bp. A total of 144 genes were identified, including 96 protein-coding genes, 40 tRNA and 8 rRNA. Thirty-four repetitive sequences were identified in Ilex pubescens, with lengths >14 bp and identity >90%, and 11 divergence hotspot regions that could be targeted for phylogenetic markers. This study will contribute to improved resolution of deep branches of the Ilex phylogeny and facilitate identification of Ilex species.Entities:
Mesh:
Year: 2016 PMID: 27378489 PMCID: PMC4932625 DOI: 10.1038/srep28559
Source DB: PubMed Journal: Sci Rep ISSN: 2045-2322 Impact factor: 4.379
Comparison of plastid genomic characteristics in seven Ilex species and Helwingia himalaica.
| Total paired-end reads | 1,057,844 | 1,292,586 | 1,135,472 | 467,116 | 1,338,864 | 1,201,978 | 1,459,906 | 914,296 |
| Aligned paired-end reads | 1,045,069 | 1,257,399 | 1,131,722 | 465,040 | 1,327,085 | 1,198,537 | 1,448,390 | 907,908 |
| Mean coverage | 143.8 | 523.2 | 504 | 523.2 | 1006.0 | 562.3 | 342.6 | 188.2 |
| Number of contigs | 230 | 84 | 68 | 47 | 44 | 76 | 106 | 142 |
| Mean length (bp) | 1,649 | 2,419 | 2,834 | 3,505 | 3,654 | 2,564 | 1,982 | 1,820 |
| N50 (bp) | 2,806 | 8,762 | 9,089 | 9,503 | 9,506 | 8,343 | 8,346 | 4,534 |
| Sum contigs length (bp) | 379,388 | 203,271 | 192,740 | 164,767 | 160,803 | 194,865 | 210,138 | 258,537 |
| Size (bp) | 157,610 | 157,900 | 157,741 | 157,621 | 157,611 | 157,671 | 157,918 | 158,362 |
| LSC length (bp) | 86,952 | 87,204 | 87,109 | 87,064 | 86,948 | 87,000 | 87,266 | 87,810 |
| SSC length (bp) | 18,429 | 18,513 | 18,436 | 18,435 | 18,434 | 18,436 | 18,432 | 18,560 |
| IRs length (bp) | 52,229 | 52,183 | 52,196 | 52,122 | 52,229 | 52,235 | 52,220 | 51,992 |
| Protein Genes [unique] | 96[74] | 96[74] | 96[74] | 96[74] | 96[74] | 96[74] | 96[74] | 94[76] |
| tRNA [unique] | 40[26] | 40[26] | 40[26] | 40[26] | 40[26] | 40[26] | 40[26] | 40[26] |
| rRNA [unique] | 8[0] | 8[0] | 8[0] | 8[0] | 8[0] | 8[0] | 8[0] | 8[0] |
| GC content (%) | 37.60 | 37.70 | 37.60 | 37.60 | 37.60 | 37.60 | 37.60 | 37.70 |
Figure 1Gene map of the Ilex chloroplast genome.
Genes shown outside the outer circle are transcribed clockwise and those inside are transcribed counterclockwise. Gray arrows indicate the direction of sequence coding. Genes belonging to different functional groups are color-coded. Dashed area in the inner circle indicates the GC content of the chloroplast genome.
List of genes in the chloroplast genome of Ilex.
| Category | Groups of gene | Name of genes |
|---|---|---|
| Protein synthesis and DNA-replication | Transfer RNAs | |
| Ribosomal RNAs | ||
| Ribosomal protein small subunit | ||
| Ribosomal protein large subunit | ||
| Subunits of RNA polymerase | ||
| Photosynthesis | photosystem I | |
| Photosystem II | ||
| Cythochrome b/f complex | ||
| ATP synthase | ||
| NADH-dehydrogenase | ||
| Large subunit Rubisco | ||
| Miscellaneous group | Translation initiation factor | |
| Acetyl-CoA carboxylase | ||
| Cytochrome c biogenesis | ||
| Maturase | ||
| ATP-dependent protease | ||
| Inner membrane protein | ||
| Pseudogene unknown function | Conserved hypothetical chloroplast ORF |
Figure 2Visualization of the alignment of the seven Ilex chloroplast genomes.
VISTA-based identity plots showing sequence identity with the Helwingia himalaica chloroplast genome as a reference. LSC indicates long single copy region; SSC indicates short single copy region; IRa and IRb indicate two inverted regions. Locations of divergent hotspot regions are labeled above alignment.
Repetitive sequences of Ilex pubescens calculated in REPuter.
| Repeat length (bp) | Repeat bases | Repeat location | Copy of repeat location |
|---|---|---|---|
| 15 | TCTTTCTTTTTTTTT | ||
| 16 | TTTTTTTTTTTTTTTT | ||
| 16 | TTGAAAAAAAAAAAAA | ||
| 16 | TGAAAAAAAAAAAAAA | ||
| 16 | ATTTCTTTTTTAGTTT | ||
| 16 | TTTTTTGAAAAAAAAA | ||
| 16 | GAAAAAAAAAAAAAGA | ||
| 16 | AAAAAAAAAAAAAGAA | ||
| 16 | ATTATTAATTTGTATG | ||
| 16 | TAGTCACTTCTTTTTT | ||
| 16 | CTTTCTTTTTTTTTTC | ||
| 16 | ATTTTTATTTTGTTTT | ||
| 17 | TTTTTTTTTTTTTTATT | ||
| 17 | CTTTTTTGAAAAAAAAA | ||
| 17 | TTTTTTGAAAAAAAAAA | ||
| 17 | TAGTAAAAATAAAAAGA | ||
| 17 | AAGACGAAAAAAAAAAA | ||
| 17 | CTATATATTTTTCCAGT | ||
| 17 | GCTTTTGTTTATAAAAA | ||
| 17 | GATATTGATGCTAGTGA | ||
| 18 | TCCACTCAGCCATCTCTC | ||
| 18 | CGAAAATTCTTTTTTCTC | ||
| 18 | ATTGTATCCATTGAGCAA | ||
| 18 | ATGCAATAGCTAAATGAT | ||
| 18 | CTTTTCTGAGTGAACTAG | ||
| 18 | AGAACTACGAGATCACCC | ||
| 19 | TGCGGGTTCGATTCCCGCT | ||
| 21 | ATGCTGCTGCAGAATAAACCA | ||
| 21 | AAGAGAGGGATTCGAACCCTC | ||
| 21 | AGACAGGATTTGAACCCGTGA | ||
| 23 | TCATTGTTCCACTCTTTGACAAC | ||
| 26 | GTGAGATTTTCATCTCATACGGCTCC | ||
| 26 | TTATTTATTTTATATTCTATTTCAAT | ||
| 29 | TCGATATTGATGATAGTGACGATATTGAT |
Figure 3Phylogenetic relationships of the seven Ilex species and Helwingia himalaica constructed from chloroplast genes.
Numbers near nodes indicate the maximum parsimony bootstrap (left) values for each clade present in the 50% majority-rule consensus tree, Bayesian posterior probability (middle), and maximum likelihood bootstrap (right) values for each clade present in the 50% majority-rule consensus tree.
Figure 4Phylogenetic relationships of the seven Ilex species and Helwingia himalaica based on the SSC region.
In plot ‘a’ numbers near nodes indicate the Bayesian posterior probability (left) and maximum likelihood bootstrap (right) values for each clade present in the 50% majority-rule consensus tree; in plot ‘b’ numbers near nodes indicate the maximum parsimony bootstrap of each clade present in the 50% majority-rule consensus tree.
Sampled species and their voucher specimens used in this study according to the taxonomic treatment in the Flora of China.
| Species | Subgenus | Section | Geographic origin | Voucher | Accession number in GenBank |
|---|---|---|---|---|---|
| XTBG | YX1303 | KX426465 | |||
| Dali, Yunnan | YX1418 | KX426466 | |||
| Xishuangbanna, Yunnan | YX1676 | KX426467 | |||
| Xishuangbanna, Yunnan | YX1680 | KX426468 | |||
| – | – | Xishuangbanna, Yunnan | YX1681 | KX426469 | |
| Dali, Yunnan | YX1723 | KX426470 | |||
| Yichun, Jiangxi | YX1748 | KX426471 | |||
| – | – | Gongshan, Nujiang | YX1678 | KX434807 |