Xin Zhao, Kun Tian, Rong L He, Stephen S-T Yau.
Abstract
Prochlorococcus marinus, one of the most abundant marine cyanobacteria in the global ocean, is classified into low-light (LL) and high-light (HL) adapted ecotypes. The two ecotypes differ in their ecophysiological characteristics, most notably in whether they are adapted for growth at high or low light intensities. However, some evolutionary relationships within the Prochlorococcus phylogeny remain unresolved, such as whether the strains SS120 and MIT9211 form a monophyletic group. We use the natural vector (NV) method to represent the sequences in order to reconstruct the phylogeny of Prochlorococcus. The natural vector method is alignment-free and makes no model assumptions. This study adds the covariances of amino acids in a protein sequence to the natural vector. Based on these new natural vectors, we compute the Hausdorff distance between two clades, which represents their dissimilarity. This method enables us to systematically analyze both the dataset of ribosomal proteomes and the dataset of 16S-23S rRNA sequences in order to reconstruct the phylogeny of Prochlorococcus. Furthermore, we apply classification to inspect the relationship between SS120 and MIT9211. From the reconstructed phylogenetic trees and the classification results, we conclude that SS120 does not cluster with MIT9211. This study demonstrates a new method for performing phylogenetic analysis. The results confirm that these two strains do not form a monophyletic clade in the phylogeny of Prochlorococcus.
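The natural vector and Hausdorff distance named in the abstract can be sketched as follows. This is a minimal illustration of the classic 60-dimensional natural vector (count, mean position, and normalized second central moment for each amino acid), assuming 1-based positions; the covariance terms that this study adds are not defined in this record and are therefore omitted. The function names and the toy peptides are hypothetical.

```python
from math import dist

AMINO_ACIDS = "ACDEFGHIKLMNPQRSTVWY"  # the 20 standard residues


def natural_vector(seq):
    """Return the 60-dimensional natural vector of a protein sequence.

    For each amino acid k: the count n_k, the mean position mu_k, and the
    normalized second central moment D2_k = sum((p - mu_k)^2) / (n_k * n).
    Positions are 1-based (an assumption); absent residues contribute zeros.
    """
    n = len(seq)
    counts, means, moments = [], [], []
    for aa in AMINO_ACIDS:
        positions = [i for i, ch in enumerate(seq, start=1) if ch == aa]
        n_k = len(positions)
        mu_k = sum(positions) / n_k if n_k else 0.0
        d2_k = sum((p - mu_k) ** 2 for p in positions) / (n_k * n) if n_k else 0.0
        counts.append(float(n_k))
        means.append(mu_k)
        moments.append(d2_k)
    return counts + means + moments


def manhattan(u, v):
    """Manhattan (L1) distance between two equal-length vectors."""
    return sum(abs(a - b) for a, b in zip(u, v))


def hausdorff(set_a, set_b, metric=dist):
    """Symmetric Hausdorff distance between two non-empty sets of vectors."""
    def directed(xs, ys):
        # Largest distance from a point in xs to its nearest point in ys.
        return max(min(metric(x, y) for y in ys) for x in xs)
    return max(directed(set_a, set_b), directed(set_b, set_a))


# Toy, made-up peptides standing in for two strains' ribosomal proteins.
strain_x = [natural_vector(s) for s in ("MKVLA", "MARTK")]
strain_y = [natural_vector(s) for s in ("MKVLA", "MGSSH")]
print(hausdorff(strain_x, strain_y))                    # Euclidean base metric
print(hausdorff(strain_x, strain_y, metric=manhattan))  # Manhattan base metric
```

Treating each strain as a set of per-protein vectors is what lets strains with different numbers of ribosomal proteins be compared without alignment.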
Keywords: Prochlorococcus; natural vector; phylogenetic analysis; sequence analysis
Year: 2017 PMID: 29299281 PMCID: PMC5743538 DOI: 10.1002/ece3.3535
Source DB: PubMed Journal: Ecol Evol ISSN: 2045-7758 Impact factor: 2.912
Strain names, light adaptation, and number of proteins in the ribosomal protein dataset
| Strain names | Light adaptation | No. of ribosomal proteins |
|---|---|---|
| MED4 | HL | 118 |
| MIT9515 | HL | 114 |
| MIT9312 | HL | 114 |
| AS9601 | HL | 106 |
| MIT9301 | HL | 107 |
| MIT9215 | HL | 109 |
| SS120 | LL | 188 |
| MIT9211 | LL | 105 |
| NATL2A | LL | 113 |
| NATL1A | LL | 106 |
| MIT9303 | LL | 114 |
| MIT9313 | LL | 129 |
Figure 1. Phylogenetic tree reconstructed by the Euclidean distance and the Hausdorff distance based on the natural vectors of ribosomal proteins
Figure 2. Phylogenetic tree reconstructed by the Manhattan distance and the Hausdorff distance based on the natural vectors of ribosomal proteins
Figure 3. Phylogenetic tree reconstructed by the Euclidean distance based on the natural vectors of 16S‐23S rRNA sequences
Figure 4. Phylogenetic tree reconstructed by 3‐mer amino acid composition method based on the full set of ribosomal proteins
Figure 5. Bootstrap values on three phylogenetic trees for Prochlorococcus using natural vector method and single‐linkage method. (a) Phylogenetic tree reconstructed by the Euclidean distance and the Hausdorff distance based on the natural vectors of ribosomal proteins. (b) Phylogenetic tree reconstructed by the Manhattan distance and the Hausdorff distance based on the natural vectors of ribosomal proteins. (c) Phylogenetic tree reconstructed by the Euclidean distance based on the natural vectors of 16S‐23S rRNA sequences
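The single-linkage method mentioned in the Figure 5 caption builds a tree by repeatedly merging the two closest clusters, where the distance between clusters is the smallest pairwise distance between their members. The sketch below is a plain illustration of that general technique, not the authors' pipeline; the function name, the labels, and the distances are hypothetical toy values.

```python
def single_linkage(labels, distance):
    """Agglomerative single-linkage clustering.

    labels:   list of item names.
    distance: dict mapping frozenset({a, b}) to the distance between a and b.
    Returns the merges in order as (cluster_a, cluster_b, height) tuples.
    """
    clusters = [(name,) for name in labels]
    merges = []
    while len(clusters) > 1:
        best = None
        for i in range(len(clusters)):
            for j in range(i + 1, len(clusters)):
                # Single linkage: closest pair across the two clusters.
                d = min(distance[frozenset((a, b))]
                        for a in clusters[i] for b in clusters[j])
                if best is None or d < best[0]:
                    best = (d, i, j)
        d, i, j = best
        merges.append((clusters[i], clusters[j], d))
        merged = clusters[i] + clusters[j]
        clusters = [c for k, c in enumerate(clusters) if k not in (i, j)]
        clusters.append(merged)
    return merges


# Toy distances between three made-up taxa (not values from the study).
toy_labels = ["A", "B", "C"]
toy_distance = {
    frozenset(("A", "B")): 1.0,
    frozenset(("A", "C")): 4.0,
    frozenset(("B", "C")): 2.0,
}
for left, right, height in single_linkage(toy_labels, toy_distance):
    print(left, right, height)
```

In the setting of the captions, the pairwise distances would be the Hausdorff distances between strains, and the merge heights give the branch points of the tree.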
Classification results for the 12 Prochlorococcus strains
| Strains | Accuracy |
|---|---|
| AS9601 | 0.8868 |
| MED4 | 0.8983 |
| MIT9211 | 0.8286 |
| MIT9215 | 0.6881 |
| MIT9301 | 0.6168 |
| MIT9303 | 0.8509 |
| MIT9312 | 0.7982 |
| MIT9313 | 0.6589 |
| MIT9515 | 0.8070 |
| NATL1A | 0.8962 |
| NATL2A | 0.5310 |
| SS120 | 0.9787 |
| Total | 0.7866 |
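The table does not say how the Total row is computed. A quick arithmetic check, using only the values listed above, suggests that it matches the unweighted mean of the twelve per-strain accuracies; this is an observation about the numbers, not a statement from the source. The variable names are illustrative.

```python
# Per-strain accuracies copied from the table above.
accuracies = {
    "AS9601": 0.8868, "MED4": 0.8983, "MIT9211": 0.8286, "MIT9215": 0.6881,
    "MIT9301": 0.6168, "MIT9303": 0.8509, "MIT9312": 0.7982, "MIT9313": 0.6589,
    "MIT9515": 0.8070, "NATL1A": 0.8962, "NATL2A": 0.5310, "SS120": 0.9787,
}

# Unweighted (macro) average over strains.
macro_average = sum(accuracies.values()) / len(accuracies)
print(round(macro_average, 4))  # 0.7866
```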
Most frequent misclassification and the corresponding error rate for each strain
| Strain | Most frequently confused with | Error rate for that strain, % |
|---|---|---|
| AS9601 | MIT9215 | 4.72 |
| MED4 | AS9601 | 5.08 |
| MIT9211 | SS120 | 3.82 |
| MIT9215 | AS9601 | 22.94 |
| MIT9301 | AS9601 | 26.17 |
| MIT9303 | MIT9313 | 12.28 |
| MIT9312 | AS9601 | 12.28 |
| MIT9313 | MIT9303 | 31.78 |
| MIT9515 | MED4 | 10.53 |
| NATL1A | NATL2A | 10.38 |
| NATL2A | NATL1A | 46.02 |
| SS120 | MIT9313 | 1.60 |