Joonhyung Jung, Changkyun Kim, Joo-Hwan Kim.
Abstract
BACKGROUND: Commelinaceae (Commelinales) comprise 41 genera and are widely distributed in both the Old and New Worlds, except in Europe. Several morphological and molecular studies have proposed relationships among the genera of this family. However, these relationships remain difficult to resolve because of high morphological variation and low support values. Many researchers now use complete chloroplast genome data to infer the evolution of land plants. In this study, we completed 15 new plastid genome sequences of subfamily Commelinoideae using the MiSeq platform. We used the genome data to reveal structural variations and, for the first time, to reconstruct the problematic positions of genera.
Keywords: Chloroplast genome; Commelinaceae; Nucleotide diversity; Phylogenomics; Plastome
Year: 2021 PMID: 33794772 PMCID: PMC8017861 DOI: 10.1186/s12864-021-07541-1
Source DB: PubMed Journal: BMC Genomics ISSN: 1471-2164 Impact factor: 3.969
Comparison of the features of plastomes from 16 genera of Commelinaceae
| Tribe | Subtribe | LSC, bp (G + C %) | SSC, bp (G + C %) | IR, bp (G + C %) | Total, bp (G + C %) | GenBank accession number | Voucher |
|---|---|---|---|---|---|---|---|
| Tradescantieae | Tradescantiinae | 89,154 (33.3) | 18,278 (30.5) | 26,953 (42.5) | 161,338 (36.1) | MW617987 | JH200402001 |
| Tradescantieae | Tradescantiinae | 91,991 (32.7) | 18,462 (30.2) | 27,236 (42.3) | 164,925 (35.6) | MW617994 | JH170813001 |
| Tradescantieae | Tradescantiinae | 89,446 (33.2) | 18,252 (30.3) | 27,078 (42.5) | 161,854 (36.0) | MW617982 | JH190318001 |
| Tradescantieae | Tradescantiinae | 95,029 (32.6) | 19,024 (30.3) | 27,233 (42.6) | 168,519 (35.5) | MW617995 | JH190730001 |
| Tradescantieae | Coleotrypinae | 94,525 (32.9) | 19,255 (30.4) | 27,385 (42.4) | 168,550 (35.7) | MW617981 | JH191109002 |
| Tradescantieae | Cyanotinae | 96,164 (31.3) | 20,224 (28.0) | 27,241 (42.6) | 170,870 (34.5) | MK133255.1 | – |
| Tradescantieae | Dichorisandrinae | 92,560 (33.2) | 18,856 (30.4) | 27,276 (42.5) | 165,968 (35.9) | MW617983 | JH190310001 |
| Tradescantieae | Dichorisandrinae | 94,583 (32.8) | 18,612 (30.7) | 27,098 (42.5) | 167,391 (35.7) | MW617986 | JH190803001 |
| Tradescantieae | Dichorisandrinae | 94,347 (32.9) | 18,348 (31.1) | 27,194 (42.6) | 167,083 (35.8) | MW617985 | JH190616001 |
| Tradescantieae | Dichorisandrinae | 94,389 (32.9) | 18,606 (31.0) | 27,196 (42.6) | 167,387 (35.8) | MW617992 | XX-0-GENT-19822394 |
| Tradescantieae | Streptoliriinae | 91,528 (33.1) | 19,595 (29.3) | 27,447 (42.0) | 166,017 (35.6) | MW617993 | JH180919003 |
| Tradescantieae | Palisotinae | 93,315 (33.5) | 18,905 (30.8) | 27,074 (42.7) | 166,368 (36.2) | MW617989 | JH190222001 |
| Commelineae | – | 90,295 (33.2) | 19,151 (29.7) | 27,604 (42.2) | 164,654 (35.8) | MW617990 | JH180805001 |
| Commelineae | – | 87,602 (33.2) | 18,354 (29.5) | 27,487 (42.1) | 160,930 (35.8) | MW617991 | JH191109014 |
| Commelineae | – | 87,363 (33.0) | 18,561 (29.1) | 27,096 (42.3) | 160,116 (35.7) | MW617984 | JH180709001 |
| Commelineae | – | 96,248 (31.4) | 20,798 (27.7) | 27,464 (42.1) | 171,974 (34.4) | MW617988 | JH191110010 |
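
The region lengths in this table can be cross-checked arithmetically. A plastome with the usual quadripartite structure contains one LSC, one SSC, and two copies of the IR, so the total length should equal LSC + SSC + 2 × IR. The following is a minimal sketch of that check, not part of the original study; the names `REGION_LENGTHS`, `expected_total`, and `mismatches` are illustrative, and the numbers are copied from the table, keyed by GenBank accession.

```python
# (LSC, SSC, IR, total) lengths in bp, copied from the table above.
# Keys are the GenBank accession numbers given in the same table.
REGION_LENGTHS = {
    "MW617987": (89154, 18278, 26953, 161338),
    "MW617994": (91991, 18462, 27236, 164925),
    "MW617982": (89446, 18252, 27078, 161854),
    "MW617995": (95029, 19024, 27233, 168519),
    "MW617981": (94525, 19255, 27385, 168550),
    "MK133255.1": (96164, 20224, 27241, 170870),
    "MW617983": (92560, 18856, 27276, 165968),
    "MW617986": (94583, 18612, 27098, 167391),
    "MW617985": (94347, 18348, 27194, 167083),
    "MW617992": (94389, 18606, 27196, 167387),
    "MW617993": (91528, 19595, 27447, 166017),
    "MW617989": (93315, 18905, 27074, 166368),
    "MW617990": (90295, 19151, 27604, 164654),
    "MW617991": (87602, 18354, 27487, 160930),
    "MW617984": (87363, 18561, 27096, 160116),
    "MW617988": (96248, 20798, 27464, 171974),
}


def expected_total(lsc, ssc, ir):
    """Length of a quadripartite plastome: LSC + SSC + two IR copies."""
    return lsc + ssc + 2 * ir


def mismatches(rows):
    """Return accessions whose reported total differs from LSC + SSC + 2*IR."""
    return [
        accession
        for accession, (lsc, ssc, ir, total) in rows.items()
        if expected_total(lsc, ssc, ir) != total
    ]


if __name__ == "__main__":
    # An empty list means every reported total is internally consistent.
    print(mismatches(REGION_LENGTHS))
```

For the values listed here the check finds no inconsistent rows, which suggests the table survived extraction with its numbers intact.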
Fig. 1. Representative chloroplast genome of Commelinaceae. The colored boxes represent conserved chloroplast genes. Genes shown inside the circle are transcribed clockwise, whereas genes outside the circle are transcribed counter-clockwise. The small grey bar graph in the inner circle shows the GC content.
Gene composition within chloroplast genomes of Commelinaceae species
| Category | Group of genes | No. |
|---|---|---|
| RNA genes | Ribosomal RNAs | 8 |
| RNA genes | Transfer RNAs | 38 |
| Protein genes | Photosystem I | 5 |
| Protein genes | Photosystem II | 15 |
| Protein genes | Cytochrome | 6 |
| Protein genes | ATP synthases | 6 |
| Protein genes | Large unit of Rubisco | 1 |
| Protein genes | NADH dehydrogenase | 12 |
| Protein genes | ATP-dependent protease subunit P | 1 |
| Protein genes | Envelope membrane protein | 1 |
| Ribosomal proteins | Large units of ribosome | 12 |
| Ribosomal proteins | Small units of ribosome | 15 |
| Transcription/translation | RNA polymerase | 3 |
| Transcription/translation | Initiation factor | 1 |
| Miscellaneous protein | | 2 |
| Hypothetical proteins and conserved reading frames | | 5 |
| Total | | 131 |
a: gene with one intron; b: gene with two introns; X2: duplicated gene; ⍦: pseudogene
Fig. 2. Plots of percent sequence identity of the chloroplast genomes of 16 Commelinaceae species with Hanguana malayana as a reference. The percentage of sequence identities was estimated, and the plots were visualized in mVISTA.
Fig. 3. Comparisons of LSC, SSC, and IR region boundaries among 16 Commelinaceae species.
Fig. 4. Nucleotide diversity (Pi) values in protein-coding genes, tRNA, and rRNA in 16 Commelinaceae species. The dashed lines are the borders of the LSC, IR, and SSC regions.
Fig. 5. The Maximum Likelihood tree of 42 monocots inferred from 77 chloroplast protein-coding genes. Numbers indicate support (maximum parsimony bootstrap (PBP)/maximum likelihood bootstrap (MBP)/posterior probability (PP)). Only support under PBP = 90/MBP = 100/PP = 1.00 is shown. The dashes “-” indicate incongruence between MP and ML/BI trees.