| Literature DB >> 27558458 |
Guo-Feng Jiang1, Damien Daniel Hinsinger1, Joeri Sergej Strijk1.
Abstract
Cycads are among the most threatened plant species. Increasing the availability of genomic information by adding whole chloroplast data is a fundamental step in supporting phylogenetic studies and conservation efforts. Here, we assemble a dataset encompassing three taxonomic levels in cycads, including ten genera, three species in the genus Cycas and two individuals of C. debaoensis. Repeated sequences, SSRs and variations of the chloroplast were analyzed at the intraspecific, interspecific and intergeneric scale, and using our sequence data, we reconstruct a phylogenomic tree for cycads. The chloroplast was 162,094 bp in length, with 133 genes annotated, including 87 protein-coding, 37 tRNA and 8 rRNA genes. We found 7 repeated sequences and 39 SSRs. Seven loci showed promising levels of variations for application in DNA-barcoding. The chloroplast phylogeny confirmed the division of Cycadales in two suborders, each of them being monophyletic, revealing a contradiction with the current family circumscription and its evolution. Finally, 10 intraspecific SNPs were found. Our results showed that despite the extremely restricted distribution range of C. debaoensis, using complete chloroplast data is useful not only in intraspecific studies, but also to improve our understanding of cycad evolution and in defining conservation strategies for this emblematic group.Entities:
Mesh:
Substances:
Year: 2016 PMID: 27558458 PMCID: PMC4997344 DOI: 10.1038/srep31473
Source DB: PubMed Journal: Sci Rep ISSN: 2045-2322 Impact factor: 4.379
Figure 1Circular gene map of the plastid genome of Cycas debaoensis.
Genes drawn within the circle are transcribed clockwise, while those drawn outside are transcribed counter clockwise. Genes are colour-coded according to their functional groups. Inner circle: GC content.
Characteristics of the complete chloroplasts used in the study.
| Species | Length | LSC | SSC | IR | GC% | Coding | tRNA | rRNA | |||
|---|---|---|---|---|---|---|---|---|---|---|---|
| Total | LSC | SSC | IR | ||||||||
| 162,094 | 88,854 | 23,088 | 25,076 | 39.4 | 38.7 | 36.6 | 42.0 | 87 | 37 | 8 | |
| 162,092 | 88,852 | 23,088 | 25,076 | 39.4 | 38.7 | 36.5 | 42.0 | 87 | 37 | 8 | |
| 162,489 | 88,977 | 23,376 | 25,068 | 39.4 | 38.6 | 38.9 | 42.0 | 109 | 39 | 8 | |
| 163,403 | 90,216 | 23,039 | 25,074 | 39.5 | 38.8 | 36.5 | 42.0 | 122 | 38 | 8 | |
The Length, Large Single Copy (LSC) region; Small Single Copy (SSC) region; Inverted Repeats (IR), the GC content of each region, and the number of coding, tRNA and rRNA loci are shown. Data from GenBank for C. debaoensis (KM459003), C. taitungensis17 (NC_009618) and C. revoluta (NC_020319).
Genes present in the C.debaoensis (KU743927) chloroplast genome.
| Group of genes | Gene names | |
|---|---|---|
| 1 | Photosystem I | |
| 2 | Photosystem II | |
| 3 | Cytochrome b/f complex | |
| 4 | ATP synthase | |
| 5 | NADH dehydrogenase | |
| 6 | RubisCO large subunit | |
| 7 | RNA polymerase | |
| 8 | Ribosomal proteins (SSC) | |
| 9 | Ribosomal proteins (LSC) | |
| 10 | Other genes | |
| 11 | Proteins of unknown function | |
| 12 | Ribosomal RNAs | |
| 13 | Transfer RNAs |
(×2) Two gene copies in the IRs.
aGene containing one intron.
bGene containing two introns.
Repeat sequences and their distribution found by REPuter in the C. debaoensis (KU743927) chloroplast genome.
| Repeat1 start (location) | Repeat2 start (location) | Size (bp) | Type | Region | |
|---|---|---|---|---|---|
| 1 | 88,852 ( | 137,037 (IGS | 20,257 | P | LSC; IRb |
| 2 | 28,396 (IGS | 28,470 (IGS | 39 | F | LSC |
| 3 | 55,293 ( | 55,340 (IGS | 32 | P | LSC |
| 4 | 48,603 (IGS | 48,629 (IGS | 31 | F | LSC |
| 5 | 113,680 (IGS | 113,680 (IGS | 30 | P | IRa |
| 6 | 113,680 (IGS | 137,236 (IGS | 30 | F | IRa; IRb |
| 7 | 137,236 (IGS | 137,236 (IGS | 30 | P | IRb |
IGS: Intergenic spacer.
Figure 2Repeat sequences in four chloroplast genomes of Cycas.
REPuter was used to identify repeat sequences with length ≥30 bp and sequence identity ≥90% in the chloroplast genomes. F and P indicate the repeat type F (forward) and P (palindrome), respectively. Repeats with different lengths are indicated in different patterns.
Figure 3Number of simple sequence repeats in four chloroplast genomes of Cycas, classified by repeat type.
mono-: mononucleotide SSRs; di-: dinucleotide SSRs.
Simple sequence repeats in the C. debaoensis (KU743927) chloroplast genome.
| No. | SSR type | SSR | size | start | SSR-containing region |
|---|---|---|---|---|---|
| 2 | p1 | A | 20 | 8307 | IGS ( |
| p1 | A | 20 | 73073 | ||
| 1 | p1 | A | 19 | 123912 | IGS ( |
| 1 | p1 | A | 14 | 32963 | IGS ( |
| 1 | p1 | A | 13 | 68694 | IGS ( |
| 2 | p1 | A | 11 | 101911 | IGS ( |
| p1 | A | 11 | 148078 | IGS ( | |
| 2 | p1 | A | 10 | 1872 | |
| p1 | A | 10 | 11723 | IGS ( | |
| 1 | p1 | T | 19 | 15487 | IGS ( |
| 1 | p1 | T | 18 | 10133 | IGS ( |
| 2 | p1 | T | 16 | 57648 | IGS ( |
| p1 | T | 16 | 84489 | ||
| 3 | p1 | T | 15 | 54643 | IGS ( |
| p1 | T | 15 | 84795 | ||
| p1 | T | 15 | 134345 | IGS ( | |
| 1 | p1 | T | 13 | 74064 | |
| 2 | p1 | T | 12 | 5951 | |
| p1 | T | 12 | 118369 | IGS ( | |
| 4 | p1 | T | 11 | 83227 | |
| p1 | T | 11 | 88618 | IGS ( | |
| p1 | T | 11 | 102859 | IGS ( | |
| p1 | T | 11 | 149026 | IGS ( | |
| 3 | p1 | T | 10 | 63350 | IGS ( |
| p1 | T | 10 | 69919 | IGS ( | |
| p1 | T | 10 | 125905 | ||
| 1 | p1 | G | 14 | 70451 | IGS ( |
| 1 | p1 | G | 11 | 52764 | IGS ( |
| 2 | p1 | G | 10 | 70921 | IGS ( |
| p1 | G | 10 | 149995 | IGS ( | |
| 1 | p1 | C | 14 | 44747 | |
| 1 | p1 | C | 11 | 5229 | |
| 2 | p1 | C | 10 | 17681 | IGS ( |
| p1 | C | 10 | 100943 | IGS ( | |
| 3 | p2 | (TA)14 | 28 | 29832 | IGS ( |
| p2 | (TA)9 | 18 | 1453 | IGS ( | |
| p2 | (TA)6 | 12 | 15670 | IGS ( | |
| 1 | p2 | (GA)6 | 12 | 68377 | IGS ( |
| 1 | p2 | (AT)6 | 12 | 15808 | IGS ( |
Figure 4Maximum Likelihood phylogenetic tree of the available chloroplast sequences in GenBank for Cycadales, plus the chloroplast sequences of Cupressus sempervirens, Taxus mairei, Pinus strobus, Araucaria heterophylla and Ginkgo biloba as outgroups (A). For readability and better understanding of the branches lengths, a zoom on the Cycadales family is shown (B).
Figure 5mVISTA percent identity plot of available cycad chloroplasts, using C. debaoensis as a reference.
Vertical scale indicates the percentage of identity ranging from 50% to 100%. Coding regions are in blue and noncoding regions are in pink. Cladogram redrawn from Fig. 5B; branch lengths are not representative of evolutionary changes; bootstrap support is indicated on the nodes. Black arrows indicate intraspecific variations in C. debaoensis.
Intraspecific SNPs between two individuals of C. debaoensis (KU743927).
| Position | Type | Nucleotide | location | location type |
|---|---|---|---|---|
| 8432 | A/T | IGS | ||
| 9889 | C/T | IGS | ||
| 33707 | G/T | coding | ||
| 44878 | C/G | exon 1 | ||
| 48140 | A/C | IGS | ||
| 60132 | G/T | IGS | ||
| 62218 | A/C | IGS | ||
| 73073 | A/. | intron 2 | ||
| 86731 | C/T | IGS | ||
| 90307 | G/T | coding (IRa) | ||
| 160640 | A/C | coding (IRb) |