| Literature DB >> 30379893 |
Samuele Bovo1, Anisa Ribani1, Valerio Joe Utzeri1, Giuseppina Schiavo1, Francesca Bertolini2, Luca Fontanesi1.
Abstract
Honey bees are considered large-scale monitoring tools due to their environmental exploration and foraging activities. Traces of these activities can be recovered in the honey that also may reflect the hive ecological micro-conditions in which it has been produced. This study applied a next generation sequencing platform (Ion Torrent) for shotgun metagenomic analysis of honey environmental DNA (eDNA). The study tested a methodological framework to interpret DNA sequence information useful to describe the complex ecosystems of the honey bee colony superorganism, its pathosphere and the heterogeneity of the agroecological environments and environmental sources that left DNA marks in the honey. Analysis of two honeys reported sequence reads from five main organism groups (kingdoms or phyla): arthropods (that mainly included reads from Apis mellifera, several other members of the Hymenotpera, in addition to members of the Diptera, Coleoptera and Lepidoptera, as well as aphids and mites), plants (that clearly confirmed the botanical origin of the two honeys, i.e. orange tree blossom and eucalyptus tree blossom honeys), fungi and bacteria (including common hive and honey bee gut microorganisms, honey bee pathogens and plant pathogens), and viruses (which accounted for the largest number of reads in both honeys, mainly assigned to Apis mellifera filamentous virus). The shotgun metagenomic approach that was used in this study can be applied in large scale experiments that might have multiple objectives according to the multi-kingdom derived eDNA that is contained in the honey.Entities:
Mesh:
Substances:
Year: 2018 PMID: 30379893 PMCID: PMC6209200 DOI: 10.1371/journal.pone.0205575
Source DB: PubMed Journal: PLoS One ISSN: 1932-6203 Impact factor: 3.240
Sequenced and aligned reads of the analyzed honeys.
| Honey | Raw reads | Mapped reads | Retained reads |
|---|---|---|---|
| Orange tree blossom honey | 339,581 | 279,709 | 270,439 |
| Eucalyptus tree blossom honey | 467,951 | 286,484 | 264,903 |
† Reads have been retained after pruning reads matching sequence entries labelled with taxids belonging to “environmental samples” class. A few other retained sequences (<1,000 per honey) were discarded later since they did not match the considered kingdoms or phyla (see Table A in S1 File).
Fig 1Effect of minimum sequence identity cut-off values on the number of mapped sequenced reads (considering the genus and species levels).
A) Orange tree blossom honey, genus level; B) Orange tree blossom honey, species level; C) Eucalyptus tree blossom honey, genus level; D) Eucalyptus tree blossom honey, species level. Colors are as follow: blue, arthropods; yellow, plants; gray, fungi; orange, bacteria; green, viruses. Data are not reported for viruses at the genus level since this level is missing for some of them, leading to unreliable statistics. Plots are reported for a sequence identity ranging from 100 to 70% (from the left to the right) as lower values did not change the number of assigned sequences (as evidenced from the plateau reached at all levels).
Lowest percentage of sequence identity below which the comparison against the NCBI nt database did not report any decrease in the number of annotated reads and identified taxa.
| Orange tree blossom honey | Eucalyptus tree blossom honey | |||||||
|---|---|---|---|---|---|---|---|---|
| Level: species | Level: genus | Level: species | Level: genus | |||||
| Datasets | Reads | Taxa | Reads | Taxa | Reads | Taxa | Reads | Taxa |
| Arthropods | 77 | 83 | 77 | 87 | 75 | 77 | 75 | 77 |
| Plants | 78 | 78 | 78 | 81 | 77 | 77 | 77 | 79 |
| Fungi | 77 | 88 | 77 | 82 | 75 | 75 | 75 | 78 |
| Bacteria | 74 | 74 | 74 | 76 | 73 | 75 | 73 | 75 |
| Viruses | 78 | 87 | - | - | 75 | 81 | - | - |
† Percentage of sequence identity.
‡ Data are not reported for viruses since the genus level is missing for some of them, leading to unreliable statistics.
Number of putative MOTUs at the species and genera levels characterizing the two honey samples.
Data are presented for each dataset, considering three different thresholds of sequence identity (<75%, that means without any threshold; ≥ 75%; ≥ 97%).
| Honey | Orange tree blossom honey | Eucalyptus tree blossom honey | |||||||||||
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| Sequence identity | < 75% | ≥ 75% | ≥ 97% | < 75% | ≥ 75% | ≥ 97% | |||||||
| Dataset | Level | N. of reads | N. of MOTUs | N. of reads | N. of MOTUs | N. of reads | N. of MOTUs | N. of reads | N. of MOTUs | N. of reads | N. of MOTUs | N. of reads | N. of MOTUs |
| 96 | 23 | 96 | 23 | 53 | 7 | 2500 | 58 | 2497 | 56 | 1620 | 20 | ||
| 104 | 21 | 104 | 21 | 58 | 7 | 2709 | 42 | 2706 | 40 | 1793 | 11 | ||
| 442 | 98 | 442 | 98 | 141 | 38 | 579 | 89 | 578 | 88 | 165 | 36 | ||
| 538 | 86 | 538 | 86 | 209 | 38 | 614 | 74 | 613 | 73 | 191 | 32 | ||
| 129 | 42 | 129 | 42 | 49 | 13 | 1497 | 101 | 1495 | 101 | 467 | 40 | ||
| 180 | 33 | 180 | 33 | 59 | 12 | 2747 | 54 | 2746 | 54 | 958 | 23 | ||
| 17465 | 688 | 17452 | 683 | 2514 | 121 | 78332 | 846 | 78199 | 834 | 36648 | 229 | ||
| 19867 | 280 | 19843 | 278 | 2834 | 65 | 83343 | 321 | 83192 | 318 | 37939 | 92 | ||
| 248139 | 4 | 248139 | 4 | 188360 | 3 | 172260 | 14 | 172258 | 14 | 99540 | 4 | ||
| - | - | - | - | - | - | - | - | - | - | - | - | ||
† Data are not reported for viruses since the genus level is missing for some of them, leading to unreliable statistics.
Fig 2Proportion of reads assigned to the five organism groups in the orange tree and eucalyptus tree blossom honeys using different sequence identity.
Orange tree blossom honey: sequence identity ≥ 75% (A) and ≥ 97% (B); eucalyptus tree blossom: sequence identity ≥ 75% (C) and ≥ 97% (D). Percentages are computed considering only reads annotated at the taxonomic rank of species (see Table 3).
Predominant species identified in the two analyzed honeys considering the ≥75% sequence identity level based on reads accounting >5% overall reads of the identified organism groups.
| Honey | Dataset | Taxid | Scientific name | N. of reads | % of reads | ‰ of readsTOT | |
|---|---|---|---|---|---|---|---|
| 7460 | 41 | 39.05 | 0.15 | ||||
| 7176 | 8 | 7.62 | 0.03 | ||||
| 28612 | 8 | 7.62 | 0.03 | ||||
| 7029 | 7 | 6.67 | 0.03 | ||||
| 2711 | 72 | 11.92 | 0.27 | ||||
| 29760 | 51 | 8.44 | 0.19 | ||||
| 85681 | 41 | 6.79 | 0.15 | ||||
| 3880 | 37 | 6.13 | 0.14 | ||||
| 1108849 | 27 | 14.52 | 0.10 | ||||
| 4956 | 19 | 10.22 | 0.07 | ||||
| 500148 | 15 | 8.06 | 0.06 | ||||
| 1906742 | 3,479 | 17.36 | 12.86 | ||||
| 2033 | 3,075 | 15.34 | 11.37 | ||||
| 148814 | 1,989 | 9.92 | 7.35 | ||||
| 1100043 | 248,133 | 100.00 | 917.52 | ||||
| 7460 | 1,820 | 66.50 | 6.87 | ||||
| 7461 | 165 | 6.03 | 0.62 | ||||
| 597456 | 162 | 5.92 | 0.61 | ||||
| 71139 | 185 | 28.20 | 0.70 | ||||
| 223129 | 112 | 17.07 | 0.42 | ||||
| 4686 | 55 | 8.38 | 0.21 | ||||
| 4956 | 456 | 16.26 | 1.72 | ||||
| 1365886 | 282 | 10.05 | 1.06 | ||||
| 4896 | 225 | 8.02 | 0.85 | ||||
| 148814 | 37,752 | 44.11 | 142.51 | ||||
| 542 | 11,721 | 13.70 | 44.25 | ||||
| 1100043 | 172,213 | 99.97 | 650.10 | ||||
† Data considers as background the specific number of reads representing the five subsets and the honeys, as reported in Table A in S1 File (columns “sequence identity <75%”).
‡ Data considers as background the total number of annotated reads of that honey, as reported in Table A in S1 File (columns “sequence identity <75%”).
Fig 3Proportion of reads assigned to distinct bacteria families (derived by the sum of reads of the species belonging to the family) in the two analyzed honeys.
Orange tree blossom honey: sequence identity set at ≥75% (A) and ≥97% (C); eucalyptus tree blossom honey: sequence identity set at ≥75% (B) and ≥97% (D). “Others” indicates all other families detected that did not pass the 1% of reads over all bacteria reads.