| Literature DB >> 29915114 |
Wei Yan1, Shuzhen Wei1, Qiong Wang1, Xilin Xiao1, Qinglu Zeng2,3, Nianzhi Jiao4, Rui Zhang4.
Abstract
Prochlorococcus is the most abundant and smallest known free-living photosynthetic microorganism and is a key player in marine ecosystems and biogeochemical cycles. Prochlorococcus can be broadly divided into high-light-adapted (HL) and low-light-adapted (LL) clades. In this study, we isolated two low-light-adapted clade I (LLI) strains from the western Pacific Ocean and obtained their genomic data. We reconstructed Prochlorococcus evolution based on genome rearrangement. Our results showed that genome rearrangement might have played an important role in Prochlorococcus evolution. We also found that the Prochlorococcus clades with streamlined genomes maintained relatively high synteny throughout most of their genomes, and several regions served as rearrangement hotspots. Backbone analysis showed that different clades shared a conserved backbone but also had clade-specific regions, and the genes in these regions were associated with ecological adaptations.IMPORTANCE Prochlorococcus, the most abundant and smallest known free-living photosynthetic microorganism, plays a key role in marine ecosystems and biogeochemical cycles. Prochlorococcus genome evolution is a fundamental issue related to how Prochlorococcus clades adapted to different ecological niches. Recent studies revealed that the gene gain and loss is crucial to the clade differentiation. The significance of our research is that we interpreted the Prochlorococcus genome evolution from the perspective of genome structure and associated the genome rearrangement with the Prochlorococcus clade differentiation and subsequent ecological adaptation.Entities:
Keywords: Prochlorococcus; genome evolution; genome rearrangement; genomic backbone
Mesh:
Year: 2018 PMID: 29915114 PMCID: PMC6102989 DOI: 10.1128/AEM.01178-18
Source DB: PubMed Journal: Appl Environ Microbiol ISSN: 0099-2240 Impact factor: 4.792
Prochlorococcus strain isolation location, genome characteristics, and assembly statistics
| Genome feature | Value for strain: | |
|---|---|---|
| XMU1403 | XMU1408 | |
| Ecotype/clade | LLI | LLI |
| Isolation location | ||
| Longitude (E) | 124 | 121 |
| Latitude (N) | 20 | 20 |
| Isolation depth (m) | 150 | 150 |
| Sequencing method | HiSeq | HiSeq |
| Read length (bp) | 150 × 2 | 150 × 2 |
| Assembly size (bp) | 1,746,033 | 1,795,147 |
| G+C content (%) | 35.2 | 34.1 |
| No. of contigs | 7 | 6 |
| N50 (bp) | 335,809 | 389,747 |
| No. of protein-coding sequences | 2,042 | 2,059 |
| Completeness | >98 | >98 |
| NCBI accession no. | ||
Completeness is estimated by CheckM (35).
Reference Prochlorococcus genomes used in this study
| Strain | Ecotype | Assembly size (bp) | %GC | No. of coding sequences | NCBI accession no. |
|---|---|---|---|---|---|
| MIT9515 | HLI | 1,704,176 | 30.8 | 1,964 | |
| EQPAC1 | HLI | 1,654,739 | 30.8 | 1,958 | |
| MED4 | HLI | 1,657,990 | 30.8 | 1,962 | |
| MIT0604 | HLII | 1,780,061 | 31.2 | 2,092 | |
| AS9601 | HLII | 1,669,886 | 31.3 | 1,938 | |
| GP2 | HLII | 1,624,310 | 31.2 | 1,878 | |
| MIT9107 | HLII | 1,699,937 | 31 | 1,994 | |
| MIT9116 | HLII | 1,685,398 | 31 | 1,984 | |
| MIT9123 | HLII | 1,697,748 | 31 | 1,999 | |
| MIT9201 | HLII | 1,672,416 | 31.3 | 1,987 | |
| MIT9202 | HLII | 1,691,453 | 31.1 | 2,019 | |
| MIT9215 | HLII | 1,738,790 | 31.1 | 2,043 | |
| MIT9301 | HLII | 1,641,879 | 31.3 | 1,927 | |
| MIT9302 | HLII | 1,745,343 | 31.1 | 2,016 | |
| MIT9311 | HLII | 1,711,064 | 31.2 | 1,978 | |
| MIT9312 | HLII | 1,709,204 | 31.2 | 1,979 | |
| MIT9314 | HLII | 1,690,556 | 31.2 | 1,979 | |
| MIT9321 | HLII | 1,658,664 | 31.2 | 1,962 | |
| MIT9322 | HLII | 1,657,550 | 31.2 | 1,961 | |
| MIT9401 | HLII | 1,666,808 | 31.2 | 1,969 | |
| SB | HLII | 1,669,823 | 31.5 | 1,932 | |
| MIT0801 | LLI | 1,929,203 | 34.9 | 2,278 | |
| NATL1A | LLI | 1,864,731 | 35 | 2,248 | |
| NATL2A | LLI | 1,842,899 | 35.1 | 2,209 | |
| PAC1 | LLI | 1,841,163 | 35.1 | 2,254 | |
| LG | LLII/III | 1,754,063 | 36.4 | 1,989 | |
| MIT0601 | LLII/III | 1,707,342 | 37 | 1,936 | |
| MIT0602 | LLII/III | 1,750,918 | 36.3 | 2,002 | |
| MIT0603 | LLII/III | 1,752,482 | 36.3 | 2,012 | |
| MIT9211 | LLII/III | 1,688,963 | 38 | 1,952 | |
| SS35 | LLII/III | 1,751,015 | 36.4 | 1,988 | |
| SS52 | LLII/III | 1,754,053 | 36.4 | 1,985 | |
| SS120 | LLII/III | 1,751,080 | 36.4 | 1,982 | |
| SS2 | LLII/III | 1,752,772 | 36.4 | 1,986 | |
| SS51 | LLII/III | 1,746,977 | 36.4 | 1,977 | |
| MIT0701 | LLIV | 2,592,571 | 50.6 | 3,082 | |
| MIT0702 | LLIV | 2,583,057 | 50.6 | 3,084 | |
| MIT0703 | LLIV | 2,575,057 | 50.6 | 3,078 | |
| MIT9303 | LLIV | 2,682,675 | 50 | 3,243 | |
| MIT9313 | LLIV | 2,410,873 | 50.7 | 3,002 |
FIG 1Phylogenomic relationships of Prochlorococcus. Shown is a phylogenetic tree reconstructed by concatenated protein sequences of 31 core genes with the maximum likelihood method, using Synechococcus sp. strain WH5701 as an outgroup. Numbers at the nodes represent bootstrap values (1,000 resamplings).
FIG 2Genome comparison among 14 Prochlorococcus complete genomes. The genome comparison was generated by Mauve (53). (A) Rearrangements of locally colinear blocks (LCBs) among Prochlorococcus genomes. LCBs are color coded. LCBs below the black line have an inverse orientation relative to that of MIT9301. (B) Visualization of the backbone and clade-specific regions among Prochlorococcus genomes. Pink-colored segments are conserved among all strains (backbone), whereas other colored segments are conserved in a subset of strains. Segments below the black line have an inverse orientation relative to that of MIT9301. For both panels A and B, the large white regions indicate strain-specific content. The phylogenetic tree was reconstructed based on permutations of 69 LCBs with the maximum likelihood method. Numbers at the nodes represent bootstrap values (1,000 resamplings).
FIG 3Comparison of the backbone, genomic islands, and locally colinear blocks (LCBs) between the HLII (MIT9312) and LLI (NATL1A) clades. The genome comparison was generated by Mauve (53). Pink-colored segments are conserved among all strains within a clade (backbone), and other colored segments are conserved in a subset of strains. Shaded regions indicate genomic islands (ISL). LCBs are color coded. LCBs below the black line have an inverse orientation relative to that of MIT9312. Connecting lines between the two genomes indicate corresponding LCBs.
FIG 4Visualization of the backbone and clade-specific regions among streamlined Prochlorococcus genomes. The genome comparison was generated by Mauve (53). Pink-colored segments are conserved among all strains (backbone), whereas segments in other colors are conserved in a subset of strains. All segments are above the black line because orientation was set relative to each genome for easier visualization. The large white regions indicate strain-specific content. Genome names in dashed boxes indicate draft genomes.
FIG 5Visualization of the backbone and subclade-specific regions among six Prochlorococcus LLI clade genomes. The genome comparison was generated by Mauve (53). Pink-colored segments are conserved among all strains (backbone), whereas segments in other colors are conserved in a subset of strains. Segments below the black line have an inverse orientation relative to PAC1. The large white regions indicate strain-specific content. The phylogenetic tree was reconstructed using concatenated protein sequences of 31 core genes with the maximum likelihood method using MIT9313 as an outgroup (not shown). Numbers at the nodes represent bootstrap values (1,000 resamplings). Vertical red lines indicate contig breaks.
FIG 6Visualization of the backbone and subclade-specific regions among 10 Prochlorococcus LLII/III clade genomes. The genome comparison was generated by Mauve (53). Pink-colored segments are conserved among all strains (backbone), whereas other segments are conserved in a subset of strains. Segments below the black line have an inverse orientation relative to that of SS120. The large white regions indicate strain-specific content. The phylogenetic tree was reconstructed using concatenated protein sequences of 31 core genes with the maximum likelihood method using MIT9313 as an outgroup (not shown). Numbers at the nodes represent bootstrap values (1,000 resamplings). Vertical red lines indicate contig breaks.
Examples of genes found in clade-specific regions
| Section | Gene name | Product | Locus |
|---|---|---|---|
| HL clade-specific region | Possible phosphate-binding protein | MED4_665181_665417 | |
| Phosphate transport system permease protein PstC | MED4_685879_686826 | ||
| Phosphate transport system permease protein PstA | MED4_686833_687726 | ||
| Phosphate transport ATP-binding protein PstB | MED4_687728_688537 | ||
| LLI clade-specific region | Phosphate transport ATP-binding protein PstB | NATL2A_847867_847064 | |
| Phosphate transport system permease protein PstA | NATL2A_848813_847899 | ||
| Phosphate transport system permease protein PstC | NATL2A_849674_848814 | ||
| DNA ligase | NATL2A_955315_954713 | ||
| DNA ligase | NATL2A_959226_959912 | ||
| HLI protein hli11 | NATL2A_967126_966956 | ||
| Ferric siderophore transport system, periplasmic binding protein TonB | NATL2A_1209792_1210433 | ||
| LLI subclade-specific region | HLI protein hli11 | NATL2A_1052907_1052761 | |
| Endonuclease VIII | NATL2A_1060998_1061843 | ||
| LLII/III clade-specific region | Helicase, SNF2/RAD54 family | SS120_49788_46600 | |
| Phage-encoded HLI protein, HliP | SS120_1097431_1097565 | ||
| Possible HLI protein | SS120_1097580_1097789 | ||
| LLII subclade-specific region | DNA double-strand break repair Rad50 ATPase | SS120_645533_642819 | |
| DNA double-strand break repair protein Mre11 | SS120_646724_645540 | ||
| Possible DNA gyrase/topoisomerase IV, subunit | SS120_647907_647629 | ||
| HL and LLI shared region | Urea ABC transporter, ATPase protein UrtE | NATL2A_1539531_1538824 | |
| Urea ABC transporter, ATPase protein UrtD | NATL2A_1540283_1539531 | ||
| Urea ABC transporter, permease protein UrtC | NATL2A_1541395_1540280 | ||
| Urea ABC transporter, permease protein UrtB | NATL2A_1542556_1541399 | ||
| Urea ABC transporter, substrate binding protein UrtA | NATL2A_1543910_1542633 | ||
| Urease accessory protein UreG | NATL2A_1544660_1544043 | ||
| Urease accessory protein UreF | NATL2A_1545331_1544660 | ||
| Urease accessory protein UreE | NATL2A_1545747_1545352 | ||
| Urease accessory protein UreD | NATL2A_1545895_1546818 |
XMU1408 does not contain this region.
MIT9515 does not contain this region.