| Literature DB >> 18179699 |
Gwenael Piganeau1, Yves Desdevises, Evelyne Derelle, Herve Moreau.
Abstract
BACKGROUND: With genome sequencing becoming more and more affordable, environmental shotgun sequencing of the microorganisms present in an environment generates a challenging amount of sequence data for the scientific community. These sequence data enable the diversity of the microbial world and the metabolic pathways within an environment to be investigated, a previously unthinkable achievement when using traditional approaches. DNA sequence data assembled from extracts of 0.8 microm filtered Sargasso seawater unveiled an unprecedented glimpse of marine prokaryotic diversity and gene content. Serendipitously, many sequences representing picoeukaryotes (cell size <2 microm) were also present within this dataset. We investigated the picoeukaryotic diversity of this database by searching sequences containing homologs of eight nuclear anchor genes that are well conserved throughout the eukaryotic lineage, as well as one chloroplastic and one mitochondrial gene.Entities:
Mesh:
Year: 2008 PMID: 18179699 PMCID: PMC2395239 DOI: 10.1186/gb-2008-9-1-r5
Source DB: PubMed Journal: Genome Biol ISSN: 1474-7596 Impact factor: 13.583
Phylogenetic distribution of the eukaryotic SSD scaffolds
| Number of SSD sequences | |||||||||||
| Supergroup | Group | 18S rRNA | 28S rRNA | ||||||||
| Total | 38 | 41 | 11 | 4 | 30 | 12 | 13 | 15 | 11 | 4 | |
| Rhizaria | Cercozoa | 1 | - | - | - | 2 | 1 | - | - | - | - |
| Polycystinea | 3 | 1 | - | - | - | - | - | - | - | - | |
| Chromalveolates | Apicomplexa | 1 | 5 | - | 1 | 1 | - | - | - | - | - |
| Ciliophora | 1 | - | 1 | - | 1 | - | 3 | 3 | - | - | |
| Dinophyceae | 10* (3) | 11 | 1 | - | - | - | 3 | 3 | 1 | - | |
| Stramenopiles | 1 (1) | 6 (4) | 2 | - | - | 1 | 1 | 2 | 2 | 1 | |
| Excavates | Euglenozoa | 1 | - | - | 1 | - | 1 | - | 1 | - | - |
| Heterolobosea | - | - | - | - | 2 | - | - | - | - | - | |
| Plantae | Chlorophyta | 3 (2) | 3 (3) | 2 | - | x | 1* | 1 | 1 | 4 | 1* |
| Streptophyta | 1 | 1 | 3 | 2* | 7* | 2 | 2* | 2* | 3* | - | |
| Rhodophyta | - | - | - | - | 4 | - | - | - | - | 2 | |
| Unikonts | Ichthyosporea | - | 1 (1) | - | - | 1 | - | - | - | - | x |
| Arthropoda | 7 | 7 (5) | 1* | - | - | 1 | - | - | - | x | |
| Bryozoa | 2 | 1 (1) | - | - | - | - | - | - | - | x | |
| Cnidaria | 4 | 2* (1) | - | - | - | - | - | - | - | x | |
| Fungi | 1 | 1 | - | - | 10 | 3 | - | - | - | x | |
| Platyhelminthes | 1 | - | - | - | 2 | - | - | - | - | x | |
| Urochordata | - | 1 (1) | 1 | - | - | 1 | 1 | 1 | 1 | x | |
| Unknown | 1 (32) | 1 (25) | - | - | - | 1 | - | 2 | - | - | |
The number of scaffolds for which taxonomic affiliation was confirmed by phylogenetic analysis (Additional data files 1 and 2) is indicated in brackets. The taxonomic affiliation of the largest scaffold is indicated by an asterisk. The groups for which the anchor gene has no representative in the GenBank database are indicated by x.
Figure 1Phylogenetic position of the SSD Ostreococcus-like sequence as inferred from the 18S rRNA sequences in [30]. Outgroup sequence, Bathycoccus; OT95, Ostreococcus tauri (clade C); RCC356, RCC344 and MIC106, surface strains (clade A); RCC393 and RCC143, deep strains (clade B); RCC501, surface strain (clade D). Numbers on branches are support values (posterior probability).
Comparison of the sequence features of the picoeukaryotic scaffolds retrieved from the SSD
| Number of scaffolds | Average length* (bp) | Length* of largest scaffold (Kbp) | Average distance between gap (bp) | Average AT content (%) (minimum-maximum) | |
| All SSD | 232,141 | 2,165 | 205.9 | 920 | 61.4 (16.4-99.2) |
| 16S rRNA | 558 | 3,942 | 76.1 | 1,103 | 59.9 (38.6-74.7) |
| 18S rRNA | 38 | 2,673 | 24.3 | 1,028 | 51.9 (32.5-71.1) |
| 28S rRNA | 41 | 1,760 | 4.3 | 880 | 50.5 (34.7-66.7) |
| 11 | 11,301 | 86.6 | 1,518 | 46 (33.7-67.2) | |
| 4 | 4,163 | 8.8 | 1,342 | 41.6 (38.5-44.6) | |
| 30 | 1,724 | 2.7 | 862 | 61.9 (34.9-76.7) | |
| 12 | 2,844 | 17.2 | 910 | 41.5 (30.1-60.3) | |
| 13 | 1,986 | 6.7 | 864 | 48.6 (31.3-73.7) | |
| 15 | 1,887 | 6.7 | 832 | 46.2 (30.5-73.7) | |
| All nuclear | 128 | 2,910 | 86.6 | 1,253 | 51.4 (30.1-76.7) |
| 11 | 3,962 | 19.4 | 1,230 | 66.3 (58.6-70.5) | |
| 4 | 4,173 | 11.3 | 2,033 | 64.7 (59.4-67.6) |
*Excluding gaps.
Figure 2AT frequency distribution in the 128 eukaryotic SSD scaffolds retrieved (white bars) versus AT frequency distribution in the total SSD scaffolds (black bars).