| Literature DB >> 30136922 |
Luc Cornet1,2, Amandine R Bertrand1,3, Marc Hanikenne3, Emmanuelle J Javaux2, Annick Wilmotte4, Denis Baurain1.
Abstract
Cyanobacteria form one of the most diversified phyla of Bacteria. They are important ecologically as primary producers, for Earth evolution and biotechnological applications. Yet, Cyanobacteria are notably difficult to purify and grow axenically, and most strains in culture collections contain heterotrophic bacteria that were probably associated with Cyanobacteria in the environment. Obtaining cyanobacterial DNA without contaminant sequences is thus a challenging and time-consuming task. Here, we describe a metagenomic pipeline that enables the easy recovery of genomes from non-axenic cultures. We tested this pipeline on 17 cyanobacterial cultures from the BCCM/ULC public collection and generated novel genome sequences for 12 polar or subpolar strains and three temperate ones, including three early-branching organisms that will be useful for phylogenomics. In parallel, we assembled 31 co-cultivated bacteria (12 nearly complete) from the same cultures and showed that they mostly belong to Bacteroidetes and Proteobacteria, some of them being very closely related in spite of geographically distant sampling sites.Entities:
Keywords: Antarctic; Arctic; Cyanobacteria; metagenomics; microbiome; phylogenomic analysis
Mesh:
Substances:
Year: 2018 PMID: 30136922 PMCID: PMC6202449 DOI: 10.1099/mgen.0.000212
Source DB: PubMed Journal: Microb Genom ISSN: 2057-5858
Details of the ULC strains
All details were extracted from the BCCM/ULC website: http://bccm.belspo.be/about-us/bccm-ulc. RT, room temperature; NA, not applicable.
| Assembly | Strain | Name | Type | Prior affiliation | Morphology | Sheath | Deposit date | Habitat | Culture medium | Temperature |
|---|---|---|---|---|---|---|---|---|---|---|
| QBLS00000000 | ULC187 | Non-axenic | Clade F | Filamentous | No | 2012 | Belgium, lake Ri Jaune | BG11 | RT | |
| QBML00000000 | ULC066 | Non-axenic | Clade F | Filamentous | No | 2011 | Canadian Arctic, Bylot Island | BG11 | 12 | |
| QBMK00000000 | ULC068 | Non-axenic | Clade F | Filamentous | No | 2011 | Canadian Subarctic, Québec, Kuujjuarapik | BG11 | 12 | |
| QBMM00000000 | ULC065 | Non-axenic | Clade C1 | Unicellular | No | 2011 | Canadian Arctic, Bylot Island | BG11 | 12 | |
| QBMG00000000 | ULC082 | Non-axenic | Clade C1 | Unicellular | No | 2011 | Antarctica, Livingston Island | BG11 | 12 | |
| QBMF00000000 | ULC084 | Non-axenic | Clade C1 | Unicellular | No | 2011 | Antarctica, Livingston Island | BG11 | 12 | |
| QBMH00000000 | ULC077 | Non-axenic | Clade C3 | Filamentous | No | 2011 | Canadian Arctic, Bylot Island | BG11 | 12 | |
| QBMQ00000000 | ULC007 | Axenic | Clade C3 | Filamentous | No | 2011 | Antarctica, Larsemann Hills | BG11 | 18 | |
| NA | ULC165 | Non-axenic | Clade C3 | Filamentous | Yes | 2012 | Antarctica, Sor Rondane Mountains | BG11 | 12 | |
| QBMC00000000 | ULC129 | Non-axenic | Clade C3 | Filamentous | No | 2011 | Antarctica, Transantarctic Mountains | BG11 | 12 | |
| QBMP00000000 | ULC027 | Non-axenic | Clade C3 | Filamentous | No | 2011 | Antarctica, Larsemann Hills | BG11 | 18 | |
| QBLT00000000 | ULC186 | Non-axenic | Clade C3 | Filamentous | No | 2012 | Belgium, Renipont lake | BG11 | RT | |
| QBMN00000000 | ULC041 | Non-axenic | Clade C3 | Filamentous | No | 2011 | Antarctica, Vestfold Hills | BG11 | 12 | |
| QBMJ00000000 | ULC073 | Non-axenic | Clade C3 | Filamentous | Yes | 2011 | Antarctica, Transantarctic Mountains | BG11 | 18 | |
| QBMS00000000 | ULC335 | Non-axenic | Clade B2 | Unicellular | Yes | 2014 | Belgium, lake Falemprise | BG11 | RT | |
| NA | ULC146 | Non-axenic | Clade B1 | Filamentous heterocystous | Yes | 2012 | Antarctica, Sor Rondane Mountains | BG110 | 18 | |
| NA | ULC179 | Non-axenic | Clade B2 | Filamentous heterocystous | Yes | 2012 | Antarctica, Sor Rondane Mountains | BG110 | 12 |
Details regarding reference proteomes
All details were extracted from the NCBI metadata.
| Assembly | Bioproject | Taxid | Name |
|---|---|---|---|
| GCA_000484535.1 | PRJNA162637 | 1183438 | |
| GCF_000011385.1 | PRJNA58011 | 251221 | |
| GCF_000013205.1 | PRJNA224116 | 321327 | |
| GCF_000013225.1 | PRJNA224116 | 321332 | |
| GCF_000332275.1 | PRJNA224116 | 195250 | |
| GCF_000317065.1 | PRJNA224116 | 82654 | |
| GCF_000332215.1 | PRJNA224116 | 927668 | |
| GCF_000317085.1 | PRJNA224116 | 1173263 | |
| GCF_000332175.1 | PRJNA224116 | 118173 | |
| GCF_000018105.1 | PRJNA224116 | 329726 | |
| GCA_000022045.1 | PRJNA28337 | 395961 | |
| GCF_000505665.1 | PRJNA224116 | 1394889 | |
| GCF_000316685.1 | PRJNA224116 | 195253 | |
| GCF_000775285.1 | PRJNA224116 | 1497020 | |
| GCF_000309945.1 | PRJNA224116 | 864702 | |
| GCF_001895925.1 | PRJNA224116 | 1920490 | |
| GCF_001650195.1 | PRJNA224116 | 1850361 | |
| GCF_000353285.1 | PRJNA224116 | 272134 | |
| GCF_000733415.1 | PRJNA224116 | 1487953 | |
| GCF_000332095.2 | PRJNA224116 | 1173264 | |
| GCF_000763385.1 | PRJNA224116 | 1229172 | |
| GCF_000309385.1 | PRJNA224116 | 118166 | |
| GCF_000155595.1 | PRJNA224116 | 91464 | |
| GCF_000482245.1 | PRJNA224116 | 1385935 | |
| GCF_000316115.1 | PRJNA224116 | 102129 | |
| GCF_000464785.1 | PRJNA224116 | 1255374 | |
| GCF_000175415.3 | PRJNA224116 | 634502 | |
| GCF_000478195.2 | PRJNA224116 | 1348334 | |
| GCF_000332155.1 | PRJNA224116 | 402777 | |
| GCF_000317475.1 | PRJNA224116 | 179408 | |
| GCF_000317105.1 | PRJNA224116 | 56110 | |
| GCF_000317515.1 | PRJNA224116 | 1173027 | |
| GCF_000021825.1 | PRJNA224116 | 65393 | |
| GCA_000307995.2 | PRJEA88171 | 1160280 | |
| GCF_000021805.1 | PRJNA224116 | 41431 | |
| GCF_000737945.1 | PRJNA256120 | 1527444 | |
| GCF_000284135.1 | PRJNA224116 | 1080228 | |
| GCF_000715475.1 | PRJNA224116 | 490193 | |
| GCF_000317655.1 | PRJNA39697 | 292563 | |
| GCF_000332055.1 | PRJNA224116 | 102125 | |
| GCF_000317575.1 | PRJNA224116 | 111780 | |
| GCF_000380225.1 | PRJNA224116 | 1128427 | filamentous cyanobacterium ESFC-1 |
| GCF_000317615.1 | PRJNA224116 | 13035 | |
| GCF_000317495.1 | PRJNA224116 | 1173022 | |
| GCF_000317555.1 | PRJNA224116 | 1173026 | |
| GCF_000317125.1 | PRJNA224116 | 251229 | |
| GCF_000582685.1 | PRJNA224116 | 1469607 | [ |
| GCF_000789435.1 | PRJNA224116 | 1532906 | |
| GCF_000196515.1 | PRJNA224116 | 551115 | ' |
| GCF_000316645.1 | PRJNA224116 | 28072 | |
| GCF_000204075.1 | PRJNA10642 | 240292 | |
| GCA_000340565.3 | PRJNA185469 | 313624 | |
| GCF_000020025.1 | PRJNA224116 | 63737 | |
| GCF_000332295.1 | PRJNA224116 | 643473 | |
| GCF_000346485.2 | PRJNA224116 | 128403 | |
| GCF_000734895.2 | PRJNA224116 | 1337936 | |
| GCF_000332255.1 | PRJNA224116 | 1173021 | cyanobacterium PCC 7702 |
| GCF_000317225.1 | PRJNA224116 | 98439 | |
| GCF_000012525.1 | PRJNA224116 | 1140 | |
| GCF_000586015.1 | PRJNA224116 | 1451353 | |
| GCF_000155635.1 | PRJNA224116 | 180281 | |
| GCA_000015705.1 | PRJNA13496 | 59922 | |
| GCF_000011485.1 | PRJNA224116 | 74547 | |
| GCF_000153805.1 | PRJNA224116 | 313625 |
Assembly statistics, taxonomy, completeness, contamination and coverage of genome bins
The taxonomic label (CM taxon), genome completeness (CM compl.) and contamination level (CM contam.) were computed with CheckM. Sequencing coverage (med) was computed with BBMap, while bin length was extracted from QUAST output. Length (%) represents the proportion of assembled data in a bin with respect to the total amount of data of the corresponding metagenome. In the Nature column, cyanobacterial bins are denoted by C, microbiome bins by M, unclassified bins by U and nobins by No. Genome bins used in phylogenetic inference are marked by an asterisk (*) and discarded bins by a dash (−). NA, not applicable.
| Strain | MetaBAT setting | Bin | CM taxon | Nature | No. of scaffolds | Length (%) | Coverage (med) | CM compl. | CM contam. |
|---|---|---|---|---|---|---|---|---|---|
| ULC335 | Veryspecific | 1 | C | 238 | 20.84 | 10.90 | 98.91 | 0.51 | |
| 2 | M | 67 | 13.73 | 11.12 | 99.29 | 0.12 | |||
| 3 | M | 576 | 12.83 | 4.46 | 65.45 | 0.49 | |||
| 4 | M | 271 | 4.79 | 4.13 | 32.28 | 0 | |||
| 0 | Nobin | No | 23 056 | 47.81 | 1.88 | ||||
| ULC007 | Superspecific | 1 | C | 84 | 91.14 | 26.62 | 98.11 | 0 | |
| 2 | Unclassified | U | 12 | 4.95 | 72.12 | 0 | 0 | ||
| 0 | Nobin | No | 358 | 3.91 | 1.48 | ||||
| ULC027 | Verysensitive | 1 | C | 439 | 21.40 | 6.27 | 90.43 | 0.27 | |
| 2 | M | 190 | 16.16 | 7.71 | 95.02 | 1.16 | |||
| 3 | M | 293 | 12.03 | 6.18 | 60.21 | 2.35 | |||
| 4 | Unclassified | U | 164 | 4.16 | 5.09 | 4.17 | 0 | ||
| 0 | Nobin | No | 24 364 | 46.24 | 1.89 | ||||
| ULC041 | Verysensitive | 1 | C | 287 | 84.76 | 31.38 | 96.2 | 1.63 | |
| 2 | Unclassified | U | 24 | 9.36 | 44.33 | 0 | 0 | ||
| 0 | Nobin | No | 441 | 5.88 | 3.97 | ||||
| ULC065 | Veryspecific | 1 | C | 95 | 22.36 | 38.37 | 99.09 | 0.27 | |
| 2 | M | 332 | 19.33 | 6.19 | 83.73 | 1.23 | |||
| 0 | Nobin | No | 20 555 | 58.31 | 1.73 | ||||
| ULC066 | Superspecific | 1 | C | 67 | 28.81 | 21.86 | 98.82 | 0.47 | |
| 2 | M | 401 | 13.94 | 4.93 | 76.91 | 1.23 | |||
| 3 | M | 152 | 2.86 | 3.48 | 15.86 | 0 | |||
| 0 | Nobin | No | 24 558 | 54.38 | 1.69 | ||||
| ULC068 | Superspecific | 1 | C | 60 | 57.04 | 29.34 | 97.09 | 0.71 | |
| 2 | Unclassified | U | 3 | 2.56 | 22.60 | 0 | 0 | ||
| 0 | Nobin | No | 10 385 | 40.41 | 1.42 | ||||
| ULC073 | Verysensitive | 1 | C | 476 | 22.70 | 10.74 | 92.03 | 1.42 | |
| 2 | M | 65 | 16.26 | 7.99 | 97.92 | 0.67 | |||
| 3 | M | 603 | 15.78 | 4.94 | 70.57 | 5.3 | |||
| 4 | M | 156 | 2.79 | 4.39 | 10.71 | 0 | |||
| 5 | Unclassified | U | 26 | 1.40 | 15.02 | 0 | 0 | ||
| 6 | Unclassified | U | 29 | 1.38 | 6.45 | 0 | 0 | ||
| 0 | Nobin | No | 16 790 | 39.68 | 1.94 | ||||
| ULC077 | Veryspecific | 1 | C | 407 | 47.37 | 15.08 | 97.64 | 0.47 | |
| 0 | Nobin | No | 14 903 | 52.63 | 1.83 | ||||
| ULC082 | Veryspecific | 1 | C | 124 | 11.49 | 19.85 | 97.74 | 0.27 | |
| 2 | M | 529 | 9.77 | 4.50 | 62.77 | 7.54 | |||
| 3 | M | 542 | 8.16 | 3.88 | 46.21 | 9.28 | |||
| 4 | M | 120 | 1.72 | 4.73 | 11.13 | 0 | |||
| 5 | Unclassified | U | 74 | 1.67 | 4.57 | 0 | 0 | ||
| 0 | Nobin | No | 30 077 | 67.18 | 2.15 | ||||
| ULC084 | Superspecific | 1 | M | 232 | 23.15 | 5.67 | 93.61 | 1.73 | |
| 2 | M | 222 | 22.39 | 6.65 | 92.46 | 1.38 | |||
| 3 | C | 116 | 21.88 | 20.78 | 98.55 | 0 | |||
| 0 | Nobin | No | 10 835 | 32.58 | 1.59 | ||||
| ULC129 | Verysensitive | 1 | C | 299 | 38.35 | 18.46 | 98.64 | 0.77 | |
| 0 | Nobin | No | 21 968 | 61.65 | 1.62 | ||||
| ULC146 | Superspecific | 1 | M | 177 | 16.18 | 10.96 | 96.57 | 0.93 | |
| 2 | M | 285 | 12.91 | 6.27 | 94.94 | 0.35 | |||
| 3 | M | 74 | 11.54 | 14.23 | 88.9 | 1.39 | |||
| 4 | M | 98 | 10.85 | 7.64 | 97.46 | 1.09 | |||
| 5 | M | 350 | 7.56 | 6.25 | 75.87 | 0.32 | |||
| 6 | M | 243 | 3.11 | 4.68 | 10.82 | 0 | |||
| 7 | Unclassified | U | 21 | 1.86 | 12.53 | 8.33 | 0 | ||
| 0 | Nobin | No | 28 569 | 35.99 | 1.72 | ||||
| ULC165 | Verysensitive | 1 | M | 53 | 15.37 | 24.76 | 99.54 | 0.8 | |
| 2 | M | 167 | 14.52 | 7.75 | 96.29 | 1.22 | |||
| 3 | M | 473 | 10.01 | 4.40 | 41.41 | 0.47 | |||
| 4 | C | 356 | 6.30 | 3.90 | 24.14 | 1.72 | |||
| 0 | Nobin | No | 19 409 | 53.79 | 2.08 | ||||
| ULC179 | Superspecific | 1 | M | 247 | 18.89 | 16.30 | 98.54 | 60.19 | |
| 2 | M | 261 | 16.95 | 8.86 | 94.78 | 0.94 | |||
| 3 | M | 111 | 13.62 | 21.92 | 98.73 | 0.22 | |||
| 4 | M | 718 | 13.40 | 4.60 | 67.06 | 0.3 | |||
| 5 | M | 68 | 4.70 | 16.67 | 35.78 | 0 | |||
| 6 | M | 170 | 2.16 | 4.18 | 12.58 | 0 | |||
| 7 | Unclassified | U | 16 | 1.69 | 41.33 | 0 | 0 | ||
| 0 | Nobin | No | 13 101 | 28.59 | 1.94 | ||||
| ULC186 | Verysensitive | 1 | C | 412 | 67.38 | 21.10 | 93.18 | 1.64 | |
| 0 | Nobin | No | 6559 | 32.62 | 1.52 | ||||
| ULC187 | Veryspecific | 1 | C | 62 | 62.18 | 33.11 | 99.29 | 0.47 | |
| 0 | Nobin | No | 8482 | 37.82 | 1.43 |
Fig. 1.Phylogenomic tree of 64 broadly sampled Cyanobacteria showing the phylogenetic position of the 15 cyanobacterial genome bins. The Bayesian tree was inferred under the CAT+Γ4 model from a supermatrix made of 675 genes (79 organisms×170 983 amino-acid positions). Cyanobacterial clades (see Table 1) were named according to Shih et al. [52]. Trailing numbers in tip labels give the number of amino-acid positions effectively present in the corresponding concatenated sequence, whereas numbers at nodes are posterior probabilities (PP) computed from two independent chains (only PP values ≤1.0 are shown). Genome bins are shown in red. The location of the alternative root proposed by Tria et al. [70] is indicated by an arrowhead.
Fig. 2.Phylogenomic tree of 196 broadly sampled Bacteria and Archaea showing the phylogenetic position of 27 microbiome genome bins. The Bayesian tree was inferred under the CAT+Γ4 model from a supermatrix made of 53 ribosomal genes (223 organisms×7060 amino-acid positions). PP values ≤1.0 are shown at the corresponding nodes. Surrounding subtrees are excerpts from a large maximum-likelihood tree inferred under the LG4X model from the full supermatrix (3501 organisms ×6613 amino-acid positions; Fig. S1). The 27 microbiome bins are indicated in red. Bacteroidetes bins are shown on a green background, whereas Protebacteria bins are shown on an orange background.
SSU rRNA (16S) gene prediction, taxonomy and coverage
The last-common ancestor (LCA) classification and top hits were retrieved from sina analyses. The bins with SSU rRNA (16S) genes directly predicted from the genome bins (without manual assignment) are indicated by an asterisk (*). Coverage values were computed with BBMap. NA, not applicable.
| Strain | SSUref_128 taxon | SSUref_128 top hit | Bin affiliation | Coverage | |
|---|---|---|---|---|---|
| ULC335 | bin0 | bin1 | 37.00 | ||
| ULC335 | Uncultured | bin0 | 22.23 | ||
| ULC335 | Uncultured bacterium clone N4_091 | bin0 | bin2 | 58.44 | |
| ULC335 | Unclassified | bin0 | 10.25 | ||
| ULC335 | bin0 | 9.64 | |||
| ULC335 | Uncultured bacterium clone ZWB3-3 | bin0 | 7.79 | ||
| ULC007 | bin0 | bin1 | 85.23 | ||
| ULC027 | Unclassified | bin2 | bin2* | 54.08 | |
| ULC041 | bin0 | bin1 | 97.23 | ||
| ULC065 | Uncultured bacterium clone a33 | bin0 | bin2 | 40.66 | |
| ULC065 | bin0 | bin1 | 165.13 | ||
| ULC066 | Uncultured bacterium clone S25 | bin0 | bin3 | 14.15 | |
| ULC066 | Unclassified | bin0 | 21.51 | ||
| ULC066 | FamilyI | bin0 | bin1 | 50.06 | |
| ULC068 | FamilyI | bin0 | bin1 | 68.53 | |
| ULC073 | bin6 | bin6* | 31.87 | ||
| ULC073 | bin0 | bin1 | 33.60 | ||
| ULC073 | Uncultured bacterium clone B3NR69D12 | bin0 | bin2 | 19.58 | |
| ULC077 | Unclassified | bin0 | bin1 | 52.80 | |
| ULC082 | Uncultured | bin0 | 18.85 | ||
| ULC082 | Uncultured alphaproteobacterium clone KWK6S.50 | bin0 | 25.08 | ||
| ULC082 | Unclassified | bin0 | 32.77 | ||
| ULC082 | bin0 | 32.71 | |||
| ULC082 | bin0 | bin1 | 93.93 | ||
| ULC084 | Uncultured alphaproteobacterium | bin0 | bin2 | 31.39 | |
| ULC084 | Uncultured bacterium clone MS81 | bin0 | bin3 | 87.30 | |
| ULC084 | Uncultured bacterium clone B3NR69D12 | bin0 | bin1 | 16.99 | |
| ULC129 | Uncultured bacterium clone GBII-52 | bin0 | bin1 | 52.71 | |
| ULC146 | bin3 | bin3* | 81.39 | ||
| ULC146 | bin0 | bin2 | 25.91 | ||
| ULC146 | bin1 | bin1* | 61.13 | ||
| ULC165 | Unclassified | bin0 | 85.20 | ||
| ULC165 | Unclassified | bin0 | 98.62 | ||
| ULC179 | bin0 | 97.88 | |||
| ULC179 | Unclassified | bin0 | 16.43 | ||
| ULC179 | Uncultured | bin3 | bin3* | 91.23 | |
| ULC186 | FamilyI | bin0 | bin1 | 116.04 | |
| ULC187 | FamilyI | bin0 | bin1 | 81.01 | |