| Literature DB >> 35205413 |
Jianshuang Shen1, Xueqin Li1, Xia Chen1, Xiaoling Huang1, Songheng Jin1,2.
Abstract
Carya cathayensis, an important economic nut tree, is narrowly endemic to eastern China in the wild. The complete cp genome of C. cathayensis was sequenced with NGS using an Illumina HiSeq2500, analyzed, and compared to its closely related species. The cp genome is 160,825 bp in length with an overall GC content of 36.13%, presenting a quadripartite structure comprising a large single copy (LSC; 90,115 bp), a small single copy (SSC; 18,760 bp), and a pair of inverted repeats (IRs; 25,975 bp). The genome contains 129 genes, including 84 protein-coding genes, 37 tRNA genes, and 8 rRNA genes. A total of 252 simple sequence repeats (SSRs) and 55 long repeats were identified. Gene selective pressure analysis showed that seven genes (rps15, rpoA, rpoB, petD, ccsA, atpI, and ycf1-2) were possibly under positive selection compared with the other Juglandaceae species. Phylogenetic relationships of 46 species inferred that Juglandaceae is monophyletic, and that C. cathayensis is sister to Carya kweichowensis and Carya illinoinensis. The genome comparison revealed that there is a wide variability of the junction sites, and there is higher divergence in the noncoding regions than in coding regions. These results suggest a great potential in phylogenetic research. The newly characterized cp genome of C. cathayensis provides valuable information for further studies of this economically important species.Entities:
Keywords: Carya cathayensis; chloroplast genome; genome skimming; phylogenetic relationship
Mesh:
Year: 2022 PMID: 35205413 PMCID: PMC8871582 DOI: 10.3390/genes13020369
Source DB: PubMed Journal: Genes (Basel) ISSN: 2073-4425 Impact factor: 4.096
Figure 1The complete C. cathayensis chloroplast (cp) genome. Genes shown outside the outer circle are transcribed clockwise, whereas those shown inside are transcribed counterclockwise. The gray plots in the inner circle represent GC contents. The circular gene map was drawn using OGDRAWv1.2.
Figure 2GC content of the C. cathayensis cp genome.
Annotated genes in the C. cathayensis cp genome.
| Category | Group of Genes | Name of Gene |
|---|---|---|
| Self-replication | Ribosomal RNA | |
| Transfer RNA | ||
| Small subunit of ribosome | ||
| Large subunit of ribosome | ||
| RNA polymerase subunits | ||
| Subunits of photosystem I | ||
| Subunits of photosystem II | ||
| Photosynthesis | Subunits of cytochrome | |
| Subunits of ATP synthase | ||
| Large subunit of RuBisCO |
| |
| Subunits of NADH | ||
| Other gene | Maturase |
|
| Envelope membrane protein |
| |
| Subunit of acetyl-CoA |
| |
| C-type cytochrome synthesis gene |
| |
| Protease | ||
| Unknown function | Conserved open reading frames |
1 Gene containing a single intron; 2 gene containing two introns; 3 two gene copies in the IRs; 4 duplicated gene in the LSC region.
Genes with introns in the C. cathayensis cp genome.
| Gene | Region (bp) | Exon Ⅰ (bp) | Intron Ⅰ (bp) | Exon Ⅱ (bp) | Intron Ⅱ (bp) | Exon Ⅲ (bp) |
|---|---|---|---|---|---|---|
|
| LSC | 144 — | 762 | 411 — | ||
|
| LSC | 71 — | 847 | 292 — | 617 | 227 — |
|
| SSC | 552 — | 1211 | 540 — | ||
|
| IRB | 777 — | 686 | 762 — | ||
|
| IRA | 775 + | 686 | 760 + | ||
|
| LSC | 4 + | 822 | 640 + | ||
|
| LSC | 6 + | 615 | 485 + | ||
|
| LSC | 9 — | 919 | 399 — | ||
|
| IRB | 390 — | 663 | 435 — | ||
|
| IRA | 388 + | 663 | 433 + | ||
|
| LSC | 430 — | 843 | 1619 — | ||
|
| IRB | 114 — | - | 229 + | 537 | 29 + |
|
| IRA | 114 — | - | 231 — | 537 | 29 — |
|
| LSC | 40 — | 894 | 230 — | ||
|
| IRB | 36 + | 801 | 40 + | ||
|
| IRA | 38 — | 801 | 42 — | ||
|
| LSC | 22 + | 715 | 45 + | ||
|
| IRB | 40 + | 950 | 33 + | ||
|
| IRA | 42 — | 950 | 35 — | ||
|
| LSC | 37 — | 2557 | 35 — | ||
|
| LSC | 35 + | 524 | 48 + | ||
|
| LSC | 38 — | 615 | 37 — | ||
|
| LSC | 126 — | 720 | 229 — | 793 | 151 — |
+ Exon is transcribed counterclockwise in Figure 1; — exon is transcribed clockwise in Figure 1; - spliceosomal intron.
Figure 3Codon usage frequency of the C. cathayensis cp genome.
Figure 4ML phylogenetic tree of 46 complete cp genomes resolved by Raxml. Bootstrap values are shown near each node.
Figure 5Comparison of the LSC, SSC, and IR regions among six selected cp genomes in the family Juglandaceae. Genes are denoted by colored boxes. The gaps between the genes and boundaries are proportional to the distances in bps.
Figure 6Variable characters in homologous regions among C. cathayensis and five related species. The homologous regions are oriented according to their locations in the cp genome. The gray arrows above the alignment indicate the gene orientations. The Y-axis shows the identity from 50% to 100%.
Figure 7Gene nucleotide variability (pi) values of six Juglandaceae species. The Y-axis shows the pi values; the X-axis shows the genes.