| Literature DB >> 30143661 |
Xiaofang Liao1,2,3, Yanhong Zhao3, Xiangjun Kong2, Aziz Khan2, Bujin Zhou2, Dongmei Liu4, Muhammad Haneef Kashif2, Peng Chen2, Hong Wang5, Ruiyang Zhou6.
Abstract
Plant mitochondrial (mt) genomes are species specific due to the vast of foreign DNA migration and frequent recombination of repeated sequences. Sequencing of the mt genome of kenaf (Hibiscus cannabinus) is essential for elucidating its evolutionary characteristics. In the present study, single-molecule real-time sequencing technology (SMRT) was used to sequence the complete mt genome of kenaf. Results showed that the complete kenaf mt genome was 569,915 bp long and consisted of 62 genes, including 36 protein-coding, 3 rRNA and 23 tRNA genes. Twenty-five introns were found among nine of the 36 protein-coding genes, and five introns were trans-spliced. A comparative analysis with other plant mt genomes showed that four syntenic gene clusters were conserved in all plant mtDNAs. Fifteen chloroplast-derived fragments were strongly associated with mt genes, including the intact sequences of the chloroplast genes psaA, ndhB and rps7. According to the plant mt genome evolution analysis, some ribosomal protein genes and succinate dehydrogenase genes were frequently lost during the evolution of angiosperms. Our data suggest that the kenaf mt genome retained evolutionarily conserved characteristics. Overall, the complete sequencing of the kenaf mt genome provides additional information and enhances our better understanding of mt genomic evolution across angiosperms.Entities:
Mesh:
Substances:
Year: 2018 PMID: 30143661 PMCID: PMC6109132 DOI: 10.1038/s41598-018-30297-w
Source DB: PubMed Journal: Sci Rep ISSN: 2045-2322 Impact factor: 4.379
Figure 1Map of the Hibiscus cannabinus (kenaf) mt genome.
Figure 2Distribution of protein-coding genes in plant mitochondrial genomes. White boxes indicate that the gene is not present in the mt genome. Yellow, green, purple, blue and black boxes indicate that one, two, three, four and six copies exist in the particular mt genome, respectively. Red boxes indicate trans-splicing. Kenaf (Hibiscus cannabinus) is shown in bold.
Repeats (≥60 bp) in the kenaf mt genome.
| No. | Size (bp) | Identity (%) | Copy-1 | Copy-2a | Copy-3a | Typeb | |||
|---|---|---|---|---|---|---|---|---|---|
| Start | End | Start | End | Start | End | ||||
| R1 | 7782 | 100 | 814 | 8595 | 196628 | 188847 | P | ||
| R2 | 1877 | 100 | 62808 | 64684 | 356837 | 354961 | P | ||
| R3 | 1525 | 100 | 60919 | 62443 | 446303 | 444779 | P | ||
| R4 | 842 | 100 | 204427 | 205268 | 367743 | 384565 | F | ||
| R5 | 535 | 100 | 62438 | 63272 | 445775 | 444941 | P | ||
| R6 | 468 | 100 | 287766 | 288233 | 413387 | 412920 | P | ||
| R7 | 433 | 100 | 62845 | 63277 | 136700 | 137132 | 358244 | 357812 | F/P |
| R8 | 394 | 100 | 431948 | 432341 | 569460 | 569853 | F | ||
| R9 | 374 | 100 | 4933 | 5306 | 199917 | 199544 | 380908 | 381281 | P/F |
| R10 | 229 | 100 | 237021 | 237249 | 320344 | 320572 | F | ||
| R11 | 210 | 100 | 28517 | 28726 | 243614 | 243405 | P | ||
| R12 | 204 | 100 | 84633 | 84836 | 413895 | 414098 | F | ||
| R13 | 190 | 100 | 326 | 515 | 204717 | 204528 | 368033 | 367844 | P |
| R14 | 181 | 100 | 284825 | 284645 | 539224 | 539404 | F | ||
| R15 | 174 | 100 | 73673 | 73846 | 539296 | 539469 | F | ||
| R16 | 165 | 100 | 358549 | 358713 | 445775 | 445939 | F | ||
| R17 | 146 | 100 | 247536 | 247681 | 510485 | 510630 | F | ||
| R18 | 137 | 100 | 102129 | 102265 | 458421 | 458557 | F | ||
| R19 | 135 | 100 | 683 | 817 | 204407 | 204273 | P | ||
| R20 | 128 | 100 | 136700 | 136827 | 445775 | 445648 | P | ||
| R21 | 115 | 100 | 683 | 797 | 367743 | 367629 | P | ||
| R22 | 109 | 100 | 45409 | 45517 | 456623 | 456731 | F | ||
| R23 | 109 | 100 | 73673 | 73781 | 284897 | 285005 | F | ||
| R24 | 107 | 100 | 85072 | 85178 | 273126 | 273232 | F | ||
| R25 | 103 | 100 | 176182 | 17714 | 380789 | 380891 | F | ||
| R26 | 97 | 100 | 47563 | 47659 | 538666 | 538762 | F | ||
| R27 | 91 | 100 | 362171 | 362261 | 532778 | 532868 | P | ||
| R28 | 88 | 100 | 34694 | 34781 | 258881 | 258968 | F | ||
| R29 | 88 | 100 | 102046 | 102133 | 137837 | 137924 | P | ||
| R30 | 88 | 100 | 140828 | 140915 | 465024 | 465111 | P | ||
| R31 | 86 | 100 | 62665 | 62750 | 268392 | 268477 | 445997 | 446082 | F/ P |
| R32 | 82 | 100 | 31305 | 31386 | 301188 | 301269 | F | ||
| R33 | 81 | 100 | 368575 | 368655 | 456163 | 456243 | F | ||
| R34 | 80 | 100 | 4243 | 4322 | 200901 | 200980 | 268607 | 268686 | P /F |
| R35 | 77 | 100 | 28446 | 28522 | 243819 | 243895 | P | ||
| R36 | 75 | 100 | 333330 | 333404 | 429353 | 429427 | P | ||
| R37 | 74 | 100 | 11547 | 11620 | 134973 | 135046 | F | ||
| R38 | 73 | 100 | 41569 | 41641 | 438195 | 438267 | F | ||
| R39 | 73 | 100 | 268577 | 268647 | 539445 | 539517 | F | ||
| R40 | 70 | 100 | 189 | 258 | 204978 | 205047 | 368294 | 368373 | P |
| R41 | 66 | 100 | 141384 | 141449 | 452383 | 452448 | P | ||
| R42 | 65 | 100 | 4873 | 4937 | 200286 | 200350 | 380849 | 380913 | P/F |
| R43 | 65 | 100 | 200286 | 380852 | P | ||||
| R44 | 63 | 100 | 141511 | 141573 | 452513 | 452575 | F | ||
| R45 | 63 | 100 | 41704 | 41766 | 438338 | 438400 | F | ||
| R46 | 61 | 100 | 47590 | 475650 | 275165 | 275225 | 538696 | 538756 | P |
| R47 | 61 | 100 | 432343 | 432403 | 569854 | 569914 | F | ||
| R48 | 61 | 100 | 73792 | 73852 | 285008 | 285068 | F | ||
aCompare with copy-1 as control. bThe letters F and P represent forward repeats and palindromic repeats, respectively. The numbers listed in the starting and ending points refer to positions in the kenaf mt genome sequence (GenBank accession MF163174).
Figure 3Analysis of conservative gene clusters between the kenaf mt genome and other higher plant mt genomes.
Figure 4tRNA distribution map in plant mitochondrial genomes. Yellow boxes and green boxes represent mt tRNA genes and chloroplast-like tRNA genes with one copy in plant mtDNA, respectively. The numbers in the cells represent the copy numbers in the plant mtDNA. Blank boxes indicate that the tRNA gene is absent.
Figure 5The original phylogenetic tree of 22 functionally related genes. The genes used are listed in Table S2 and include 17 respiratory complex genes, four cytochrome c biogenesis genes and a cob gene, and the tree was rooted with Marchantia paleacea.
Figure 6NCBI taxonomy common tree of 28 analysed species.