| Literature DB >> 28294981 |
Shuyun Zeng1, Tao Zhou2, Kai Han3, Yanci Yang4, Jianhua Zhao5, Zhan-Lin Liu6.
Abstract
Rehmannia is a non-parasitic genus in Orobanchaceae including six species mainly distributed in central and north China. Its phylogenetic position and infrageneric relationships remain uncertain due to potential hybridization and polyploidization. In this study, we sequenced and compared the complete chloroplast genomes of six Rehmannia species using Illumina sequencing technology to elucidate the interspecific variations. Rehmannia plastomes exhibited typical quadripartite and circular structures with good synteny of gene order. The complete genomes ranged from 153,622 bp to 154,055 bp in length, including 133 genes encoding 88 proteins, 37 tRNAs, and 8 rRNAs. Three genes (rpoA, rpoC2, accD) have potentially experienced positive selection. Plastome size variation of Rehmannia was mainly ascribed to the expansion and contraction of the border regions between the inverted repeat (IR) region and the single-copy (SC) regions. Despite of the conserved structure in Rehmannia plastomes, sequence variations provide useful phylogenetic information. Phylogenetic trees of 23 Lamiales species reconstructed with the complete plastomes suggested that Rehmannia was monophyletic and sister to the clade of Lindenbergia and the parasitic taxa in Orobanchaceae. The interspecific relationships within Rehmannia were completely different with the previous studies. In future, population phylogenomic works based on plastomes are urgently needed to clarify the evolutionary history of Rehmannia.Entities:
Keywords: Rehmannia; chloroplast genome; phylogeny; positive selection; repeat
Year: 2017 PMID: 28294981 PMCID: PMC5368707 DOI: 10.3390/genes8030103
Source DB: PubMed Journal: Genes (Basel) ISSN: 2073-4425 Impact factor: 4.096
Sample information of six Rehmannia species in this study.
| Species | Location | Longitude | Latitude | Clean reads | Mean coverage |
|---|---|---|---|---|---|
| Yulin, Shaanxi, China | 110.57 | 37.77 | 3,721,846 | 1583.5× | |
| Yichang, Hubei, China | 10.68 | 31.31 | 31,171,142 | 119.6× | |
| Amsterdam, Holland | 4.88 | 52.36 | 26,976,944 | 137.1× | |
| Shiquan, Shaanxi, China | 108.63 | 32.04 | 26,815,865 | 125.3× | |
| Lishui, Zhejiang, China | 120.15 | 28.64 | 25,724,095 | 131.5× | |
| Chengkou, Chongqing, China | 108.62 | 1.54 | 29,076,484 | 141.8× |
Summary of chloroplast (cp) genomic data of all Lamiales taxa used in the study. The numbers in parenthesis indicate the genes duplicated in the inverted repeat (IR) regions.
| Taxon | Species | GenBank | Length | LSC | SSC | IR | Gene | PCG | tRNA | rRNA | GC (%) |
|---|---|---|---|---|---|---|---|---|---|---|---|
| Orobanchaceae | KX636157 | 153622 | 84605 | 17579 | 25719 | 133 | 88 | 37 (7) | 8 (4) | 38 | |
| KX426347 | 154055 | 84966 | 17675 | 25707 | 133 | 88 | 37 (7) | 8 (4) | 38 | ||
| KX636158 | 153890 | 84837 | 17679 | 25687 | 133 | 88 | 37 (7) | 8 (4) | 37.9 | ||
| KX636161 | 153772 | 84788 | 17652 | 25666 | 133 | 88 | 37 (7) | 8 (4) | 38 | ||
| KX636160 | 153952 | 84899 | 17674 | 25676 | 133 | 88 | 37 (7) | 8 (4) | 37.9 | ||
| KX636159 | 153989 | 84839 | 17680 | 25735 | 133 | 88 | 37 (7) | 8 (4) | 37.9 | ||
| NC_025642 | 94380 | 32648 | 8646 | 26543 | 99 | 30 | 42 (9) | 8 (4) | 36.6 | ||
| KC_128846 | 102657 | 49130 | 8819 | 22354 | 106 | 31 | 36 (7) | 8 (4) | 36.8 | ||
| NC_025651 | 120840 | 62000 | 8516 | 25162 | 123 | 45 | 41 (6) | 8 (4) | 36.7 | ||
| NC_022859 | 155103 | 85594 | 17885 | 25812 | 137 | 85 | 37 (7) | 8 (4) | 37.8 | ||
| HG_738866 | 160910 | 84756 | 18899 | 28627 | 128 | 82 | 37 (7) | 8 (4) | 38.1 | ||
| Lamiaceae | KR_232566 | 152462 | 83355 | 17969 | 25569 | 134 | 86 | 37 (7) | 8 (4) | 38 | |
| NC_020098 | 153953 | 85318 | 17741 | 25447 | 134 | 86 | 37 (7) | 8 (4) | 37.9 | ||
| JX_880022 | 151935 | 83135 | 17727 | 25533 | 134 | 86 | 37 (7) | 8 (4) | 37.8 | ||
| NC_020431 | 151328 | 82695 | 17555 | 25539 | 133 | 87 | 37 (7) | 8 (4) | 38 | ||
| NC_026291 | 155293 | 86078 | 17689 | 25763 | 133 | 87 | 37 (7) | 8 (4) | 37.9 | ||
| KR_233163 | 152731 | 83946 | 17477 | 25654 | 132 | 87 | 36 (7) | 8 (4) | 38.4 | ||
| Scrophulariaceae | KM_590983 | 152425 | 85531 | 17938 | 23478 | 132 | 88 | 36 (6) | 8 (4) | 38.1 | |
| Gesneriaceae | NC_016468 | 153493 | 84698 | 17903 | 25446 | 145 | 85 | 36 (7) | 8 (4) | 37.6 | |
| Acanthaceae | NC_022451 | 150249 | 82459 | 17110 | 25340 | 132 | 87 | 37 (7) | 8 (4) | 38.3 | |
| Lentibulariaceae | NC_021449 | 152113 | 81818 | 14187 | 27904 | 133 | 87 | 37 (6) | 8 (4) | 37.6 | |
| Pedaliaceae | JN_637766 | 153324 | 85170 | 17872 | 25141 | 134 | 87 | 37 (7) | 8 (4) | 38.2 | |
| Oleaceae | GU_931818 | 155889 | 86614 | 17791 | 25742 | 133 | 87 | 37 (7) | 8 (4) | 37.8 |
LSC: large single-copy; SSC: small single-copy; PCG: protein coding genes; tRNA: transfer RNA; rRNA: ribosomal RNA
Figure 1Gene map of Rehmannia chloroplast genomes. Genes shown outside the outer circle are transcribed clockwise and those inside are transcribed counterclockwise. Genes belonging to different functional groups are color coded. Dashed area in the inner circle indicates the GC content of the chloroplast genome. ORF: open reading frame.
Gene list of plastomes of six Rehmannia species.
| Category | Group | Name |
|---|---|---|
| Photosynthesis related genes | Rubisco | |
| Photosystem I | ||
| Assembly/stability of photosystem I | ** | |
| Photosystem II | ||
| ATP synthase | ||
| cytochrome b/f complex | ||
| cytochrome c synthesis | ||
| NADPH dehydrogenase | ||
| Transcription and translation related genes | transcription | |
| ribosomal proteins | ||
| translation initiation factor | ||
| RNA genes | ribosomal RNA | a |
| transfer RNA | *,a
| |
| Other genes | RNA processing | |
| carbon metabolism | ||
| fatty acid synthesis | ||
| proteolysis | ** | |
| Genes of unknown function | conserved reading frames | a
|
* gene with one intron, ** gene with two introns, a gene with two copies.
Figure 2Comparison of the borders of large single-copy (LSC), small single-copy (SSC), and inverted repeat (IR) regions among the chloroplast genomes of six Rehmannia species. The location of two parts of inverted repeat region (IRA and IRB) was referred to Figure 1.
Figure 3Analyses of repeated sequences in Rehmannia plastomes. (A) Number of three repeat types in the six chloroplast genomes; (B) Frequency of repeat sequences by length; (C) Frequency of repeat types; (D) Location of the all repeats from six species.
Figure 4Visualization of alignment of the six Rehmannia species chloroplast genome sequences. VISTA-based identity plots showed sequence identity of six sequenced chloroplast genomes with R. glutinosa as a reference. The sequence similarity of the aligned regions is shown as horizontal bars indicating the average percent identity between 50% and 100% (shown on the y-axis of the graph). The x-axis represents the coordinate in the chloroplast genome. The divergent hotspot region is indicated in the chloroplast genome. Genome regions are color coded as protein coding, rRNA coding, tRNA coding or conserved noncoding sequences (CNS).
Figure 5Percentage of variable characters in aligned noncoding regions of the six plastid genomes.
Figure 6Phylogenetic trees of 23 species as determined from different data partitions. Support values are shown for nodes as Bayesian inference posterior probability (above branches)/maximum likelihood bootstrap (below branches). Branch lengths were calculated through Bayesian analysis, and scale bar denotes substitutions per site. (A) the whole chloroplast genomes; (B) Protein coding genes; (C) LSC region; (D) SSC region. Red represents Rehmannia species; Blue represents other species of Orobanchaceae; Orange represents Lamiaceae species.