| Literature DB >> 23915326 |
Li Wei1, Yi Xin, Dongmei Wang, Xiaoyan Jing, Qian Zhou, Xiaoquan Su, Jing Jia, Kang Ning, Feng Chen, Qiang Hu, Jian Xu.
Abstract
BACKGROUND: Microalgae are promising feedstock for production of lipids, sugars, bioactive compounds and in particular biofuels, yet development of sensitive and reliable phylotyping strategies for microalgae has been hindered by the paucity of phylogenetically closely-related finished genomes.Entities:
Mesh:
Substances:
Year: 2013 PMID: 23915326 PMCID: PMC3750441 DOI: 10.1186/1471-2164-14-534
Source DB: PubMed Journal: BMC Genomics ISSN: 1471-2164 Impact factor: 3.969
Features of the organelle genomes (Plastid/Mitochondria)
| 117,548/38,057 | 117,634/38,057 | 114,883/41,907 | 114,867/42,206 | 117,463/38,444 | 117,806/38,543 | 117,672/38,791 | |
| 57,360/- | 57,387/- | 56,882/- | 56,925/- | 57,287/- | 57,444/- | 57,352/- | |
| 45,235/- | 45,240/- | 47,364/- | 47,698/- | 45,227/- | 45,259/- | 45,247/- | |
| 7,485/- | 7,496/- | 5,320/- | 5,122/- | 7,476/- | 7,549/- | 7,527/- | |
| 160/63 | 160/63 | 156/64 | 156/64 | 160/63 | 160/63 | 160/63 | |
| 126/35 | 126/35 | 123/36 | 123/36 | 126/35 | 126/35 | 126/35 | |
| 34/28 | 34/28 | 33/28 | 33/28 | 34/28 | 34/28 | 34/28 | |
| 33.6/31.9 | 33.6/31.9 | 33.1/31.4 | 33.0/31.4 | 33.4/31.8 | 33.5/31.7 | 33.4/32.0 | |
| 83.5/87.5 | 83.4/87.5 | 83.6/81.4 | 83.8/80.9 | 83.5/84.7 | 83.7/84.6 | 84.5/84.1 |
Figure 1Plastid and mitochondrial genomes of seven strains. (A) Genome map of the complete pt sequence of N. oceanica IMET1. (B) Genome map of the complete mt sequence of N. oceanica IMET1. Genes shown outside the outer circle are transcribed clockwise and those inside are transcribed counter clockwise. Genes belonging to different functional groups are color-coded. Alignment of the Nannochloropsis plastid (C) and mitochondrial (D) genomes were also shown respectively. Genomic regions are color-coded as protein-coding (blue), rRNA/tRNA-coding (cyan) and conserved noncoding sequences (red). * CCMP527 and CCMP537 do not contain the region. **Two copies of cox1 are present in CCMP527 and CCMP537. ***In CCMP529, trnD-GUC was translocated to the interval between cox2 and rrn16.
Figure 2Comparison of functional complements of organelle genomes. Among Nannochloropsis, brown algae and diatoms, shared and lineage-specific genes from plastid and mitochondrial genomes are compared via Venn diagrams. (A) Shared and lineage-specific genes of different plastid genomes. (B) Shared and lineage-specific genes of different mitochondrial genomes.
Figure 3Hotspots of structural polymorphism that drive the diversification of organelle genomes. The phylogenetic tree on the left was based on whole-genome alignment of the seven complete mt genomes. (A) Structural and sequence polymorphism of inverted repeat in the pt genomes. (B) Structural and sequence polymorphism of the hotspot in the mt genomes. Within each sub-figure, genomic features were drawn proportionally to their actual length. Grey solid lines, inserted for alignment purposes, were not actual sequences.
Figure 4New inter-species phylogenetic markers for (A) Comparison of the sensitivity and specificity among candidate regions/markers for inter-species phylogenetic reconstruction. The regions/markers derived from genic and intergenic sequences of pt and mt of the seven strains were listed on the X axe. The % nucleotide difference of each region/markers was calculated as the index for sensitivity. The average bootstrap values of all branches in the sequence-specific phylogenetic trees (maximum parsimony; MP) were shown as the index for specificity. *: presently used markers. Arrow: candidate phylogenetic markers that provided higher sensitivity than ITS2.(B) Phylogeny of Nannochloropsis strains reconstructed from the presently used and our proposed sequence markers. Phylograms were derived using MP analysis. *: reference trees based on the concatenated sequences of all pt/mt protein-coding genes. **: presently used markers. Blank box: presently used markers whose phylogenetic reconstructions are not consistent with the reference trees. CCMP537 was assigned as the root for each of the trees.
Comparison of candidate markers for interspecies phylotyping in genus
| mt | 297 | 37.71 | 0.34 | 0.397 | 0.046 | |
| pt | 252-261 | 33.72 | 0.00 | 0.409 | 0.038 | |
| mt | 102-195 | 32.83 | 0.00 | 0.500 | 0.048 | |
| mt | 726 | 32.09 | 0.41 | 0.260 | 0.031 | |
| mt | 552 | 32.07 | 0.36 | 0.263 | 0.036 | |
| pt | 204-224 | 31.72 | 0.13 | 0.211 | 0.018 | |
| mt | 525-540 | 31.67 | 0.57 | 0.299 | 0.037 | |
| pt | 447-450 | 29.33 | 0.22 | 0.328 | 0.040 | |
| mt | 156-162 | 29.01 | 0.00 | 0.258 | 0.041 | |
| mt | 381 | 28.35 | 0.79 | 0.167 | 0.023 | |
| pt | 269-274 | 28.00 | 0.00 | 0.253 | 0.032 | |
| pt | 648-673 | 27.99 | 0.46 | 0.234 | 0.021 | |
| nc | 385-499 | 27.67 | 0.52 | - | - | |
| pt | 162-193 | 25.63 | 1.04 | 0.281 | 0.036 | |
| mt | 432 | 24.54 | 0.00 | 0.098 | 0.018 | |
| mt | 1578 | 23.76 | 0.63 | 0.038 | 0.008 | |
| mt | 215-233 | 23.50 | 0.43 | 0.107 | 0.022 | |
| pt | 1260-1272 | 22.90 | 0.00 | 0.111 | 0.010 | |
| pt | 687 | 22.56 | 0.00 | 0.114 | 0.012 | |
| pt | 1479 | 21.37 | 0.00 | 0.084 | 0.008 | |
| pt | 543 | 20.99 | 0.18 | 0.054 | 0.008 | |
| pt | 918-921 | 20.96 | 0.22 | 0.105 | 0.015 | |
| pt | 1326-1335 | 20.37 | 0.00 | 0.062 | 0.008 | |
| pt | 1044 | 20.31 | 0.00 | 0.054 | 0.007 | |
| pt | 1011 | 19.88 | 0.10 | 0.053 | 0.008 | |
| pt | 867 | 19.26 | 0.12 | 0.026 | 0.005 | |
| pt | 627 | 19.14 | 0.00 | 0.029 | 0.005 | |
| pt | 3168 | 19.10 | 0.03 | 0.034 | 0.004 | |
| pt | 1809 | 19.02 | 0.11 | 0.030 | 0.004 | |
| pt | 1155 | 18.87 | 0.09 | 0.020 | 0.004 | |
| mt | 1294-1413 | 18.55 | 0.29 | 0.140 | 0.012 | |
| pt | 1521-1524 | 18.37 | 0.00 | 0.023 | 0.005 | |
| pt | 314-330 | 18.15 | 0.00 | 0.052 | 0.011 | |
| mt | 369 | 17.89 | 0.27 | 0.108 | 0.009 | |
| pt | 391-397 | 17.84 | 0.00 | 0.055 | 0.010 | |
| pt | 255 | 17.65 | 0.00 | 0.119 | 0.025 | |
| mt | 1521 | 17.55 | 0.13 | 0.144 | 0.018 | |
| pt | 294-297 | 16.84 | 0.00 | 0.035 | 0.007 | |
| pt | 504 | 16.47 | 0.00 | 0.059 | 0.009 | |
| pt | 360 | 16.11 | 0.00 | 0.062 | 0.007 | |
| mt | 1161 | 15.25 | 0.00 | 0.207 | 0.024 | |
| pt | 504-512 | 15.04 | 0.00 | 0.091 | 0.019 | |
| mt | 1482 | 14.71 | 0.34 | 0.039 | 0.005 | |
| pt | 645-699 | 14.57 | 0.14 | 0.086 | 0.009 | |
| pt | 372 | 14.25 | 0.00 | 0.107 | 0.011 | |
| mt | 813 | 13.65 | 0.12 | 0.221 | 0.021 | |
| mt | 909 | 13.53 | 0.00 | 0.215 | 0.018 | |
| pt | 1230-1275 | 13.36 | 0.08 | 0.067 | 0.009 | |
| pt | 456-477 | 12.55 | 0.00 | 0.160 | 0.017 | |
| pt | 477-483 | 11.18 | 0.00 | 0.171 | 0.014 | |
| pt | 1464 | 10.04 | 0.00 | 0.213 | 0.021 | |
| pt | 204 | 9.80 | 0.00 | 0.199 | 0.017 | |
| mt | 2235 | 8.18 | 0.18 | 0.365 | 0.038 | |
| mt | 1491-1494 | 6.87 | 0.00 | 0.381 | 0.035 | |
| pt | 1083 | 4.16 | 0.00 | 0.364 | 0.036 | |
| nc | 1790-1792 | 2.51 | 0.16 | - | - | |
Note: “-” indicated that the gene was not encoded by the organelle genomes. *Difference = SNP/Size; **Difference between IMET1 and CCMP531; ***A measure of the similarity between two trees calculated from the two p-distance matrixes that each represents a phylogenetic tree; ****Square deviation of the corresponding p-distance between two matrixes.
Intraspecies phylogenetic markers of the three strains of CCMP531, CCMP1779 and IMET1
| 8512 | G | A | A | - | |
| | 8513 | C | T | T | - |
| | 8555 | - | G | G | - |
| | 8577 | C | A | A | - |
| 8710 | T | C | C | synonymous | |
| | 8720 | G | A | A | synonymous |
| | 8941 | C | A | A | synonymous |
| | 8956 | C | T | T | synonymous |
| 21403 | G | C | C | synonymous | |
| | 21488 | T | C | C | synonymous |
| | 21583 | A | T | T | synonymous |
| | 21637 | A | G | G | synonymous |
| | 21682 | A | G | G | synonymous |
| | 21940 | G | A | A | synonymous |
| 34621 | G | T | T | - | |
| | 34727 | A | G | G | - |
| | 34756 | C | T | T | - |
| | 34801 | A | G | G | - |
| | 34866 | - | T | T | - |
| | 34867 | - | T | T | - |
| | 34870 | - | A | A | - |
| | 34871 | - | A | A | - |
| | 34880 | A | - | - | - |
| | 34885 | A | - | - | - |
| | 34898 | T | C | C | - |
| | 34927 | - | A | A | - |
| | 34934 | C | T | T | - |
| | 34971 | G | A | A | - |
| 34995 | A | A | T | - |
Note: “-” indicated that the mutation was located at a non-protein-coding region.
Figure 5Multiple locus sequence tag for high-resolution phylotyping of three closely related strains(A) Comparison of the sensitivity and specificity among candidate regions/markers for intra-species discrimination. The regions/markers derived from genic and intergenic sequences of pt and mt of the N.oceanica strains (IMET1 and CCMP531) were listed on the X axe. The % nucleotide difference of each region/markers was calculated as the index for sensitivity. The bootstrap values of the IMET1-CCMP531 branches in the sequence-specific phylogenetic trees (maximum parsimony; MP) were shown as the index for specificity. *: presently used markers. Arrow: the three regions (rps11_mt-nad4, rps3_mt and cox2-rrn16_mt) for Multiple-Locus Sequence Typing (MLST). (B) Phylogeny of IMET1, CCMP531 and CCMP1779 as reconstructed from three MLST loci. Grey background: identical loci in two strains. Blank box and *: A base that is different between CCMP1779 and IMET1.
Simple sequence repeat (SSRs) for intra-species discrimination
| 10 | Intergenic | pt | IMET1, 531 | ||
| 10 | Genic | pt | IMET1, 531 | ||
| 10 | Intergenic | pt | IMET1, 531 | ||
| 10 | Intergenic | pt | IMET1, 531 | ||
| 14 | Intergenic | pt | IMET1 | ||
| 14 | Intergenic | pt | IMET1 | ||
| 10 | Intergenic | pt | 531 | ||
| 11 | Genic | mt | IMET1, 531 | ||
| 11 | Genic | mt | IMET1, 531 | ||
| 10 | Intergenic | mt | IMET1, 531 | ||
| 10 | Genic | mt | IMET1, 531 | ||
| 10 | Intergenic | mt | IMET1, 531 | ||
| 10 | Genic | mt | IMET1, 531 | ||
| 10 | Intergenic | mt | IMET1, 531 | ||
| 12 | Genic | mt | 531 |
* SSRs that are specifically present in IMET1 or CCMP531.