Literature DB >> 30110939

Overview of Trends in the Application of Metagenomic Techniques in the Analysis of Human Enteric Viral Diversity in Africa's Environmental Regimes.

Cecilia Oluseyi Osunmakinde1, Ramganesh Selvarajan2, Timothy Sibanda3, Bhekie B Mamba4, Titus A M Msagati5.   

Abstract

There has been an increase in the quest for metagenomics as an approach for the identification and study of the diversity of human viruses found in aquatic systems, both for their role as waterborne pathogens and as water quality indicators. In the last few years, environmental viral metagenomics has grown significantly and has enabled the identification, diversity and entire genome sequencing of viruses in environmental and clinical samples extensively. Prior to the arrival of metagenomics, traditional molecular procedures such as the polymerase chain reaction (PCR) and sequencing, were mostly used to identify and classify enteric viral species in different environmental milieu. After the advent of metagenomics, more detailed reports have emerged about the important waterborne viruses identified in wastewater treatment plant effluents and surface water. This paper provides a review of methods that have been used for the concentration, detection and identification of viral species from different environmental matrices. The review also takes into consideration where metagenomics has been explored in different African countries, as well as the limitations and challenges facing the approach. Procedures including sample processing, experimental design, sequencing technology, and bioinformatics analysis are discussed. The review concludes by summarising the current thinking and practices in the field and lays bare key issues that those venturing into this field need to consider and address.

Entities:  

Keywords:  enteric viruses; metagenomics; viral diversity; virus identification

Mesh:

Substances:

Year:  2018        PMID: 30110939      PMCID: PMC6115975          DOI: 10.3390/v10080429

Source DB:  PubMed          Journal:  Viruses        ISSN: 1999-4915            Impact factor:   5.048


1. Introduction

By definition, metagenomics refers to the direct study of microbes’ genetic material in their natural habitat [1,2]. It is an approach that allows for the identification of both cultivable and uncultivable microbes in a mixed community, based on a genomic technique [2,3,4]. The application of metagenomics was first reported in the late 19th century, when Norman Pace’s laboratory conceived the notion of gross extraction of deoxyribonucleic acid (DNA) from a sample with a mixture of nucleic acid. Since then, significant progress has been made in metagenomics in different types of environmental compartments. The presence of nucleic acid has been identified from diverse environment such as soil, ocean, sediment, groundwater, as well as in clinical samples. Currently, metagenomics studies are being explored in marine environments [4,5,6] and in disease diagnosis [7,8] to name but a few. In addition, other metagenomics studies have been conducted in terms of human and animal genetics [9,10,11], veterinary medicine [12,13,14] the textile industry [2,15], food and pharmaceutical products [16], biosensors [17], and agriculture biotechnology [18]. Metagenomic approaches have become an emerging and alternative tool for the study of viral taxonomy and varieties in the functional compositions within the aquatic environments, via next generation sequencing (NGS) technology [19]. The merits and opportunities obtained from metagenomics include the study and discovery of microbial genomes that could not be determined previously, due to certain cultivation difficulties. NGS is a genomic sequencing technique that enables massive parallel sequencing of the small fragments of the entire genetic material obtained from a microbial community, which generates massive data output in only one run, through the use of a high-throughput instrumentation [1,20] NGS sequencing technologies are spread out under different sequencing platforms, though they follow the same experimental work flow [21,22]. The general experimental workflow for metagenomics study applying NGS is presented in Figure 1.
Figure 1

Schematic diagram of the experimental workflow of different next generation sequencing (NGS) platforms. (KEGG—Kyoto Encyclopedia of Genes and Genomes; SEED—Database contains all publicly available genome sequences).

Metagenome analysis by NGS involves several distinct steps, with the most important step being the extraction of high quality total DNA from a sample. This is followed by fragmentation and appropriate adapter ligation on the desired platform for the library preparation and sequencing [1,23]. The analysis of the pieces of fragments and voluminous data generated from the different high-throughput platforms, is done by sorting and assembling them into contigs through bioinformatics tools, which is usually the most challenging and tedious task when undertaking metagenomics projects [1,24]. The filtering of the raw sequences is the first step before downstream analysis, and this is achieved through the elimination of low-quality reads and adapters which were attached to the primer sequences. For instance tools like Btrim, Cutadapt, AdapterRemoval, FASTX toolkit and Krakeen are very efficient tools for filtering of low-quality read sequences, removal of adapters and barcodes and for a detailed quality control on raw reads [25]. The genomes are assembled together to form a contigs using various assembly tools. Over the years, quite a lot of assembly tools or algorithms have been developed that depend solely on specific parameters for the assembling of the raw reads [1,22,24]. The assembling of the raw reads are either through a reference-guide genome assembly or through a de novo genome assembly [26]. Assembling tools such as SSAKE, Edena, Velvet, VCAKE, SOAPdenovo, De Bruijn graph-based assemblers and the latest addition to the group EULER has been used to assemble reads each with its own strength and weakness [27,28,29]. After the assembling, the sequences are mapped or aligned against a reference database that contains genomes that are specific to taxonomic classification. In this regard, tools and software packages such as Newbler, MIRA, AMOS, Botiwe, BLAT, Bfast, BWA, NovoAlign and MetaAMOS are commonly used in metagenomics for performing referenced-based assemblies [26,30]. The taxonomic designations and phylogenetic tree analysis of the organisms are done using sequences already deposited on the public sequence database that are specifically designed for the nucleotide and protein translations, with examples such as the European molecular biology laboratory (EMBL), GenBank, Basic Local Alignment Search Tool (BLAST), Reference Sequence (RefSeq) and the SWISS-PROT [31,32]. Numerous tool programme and software packages such as ARB [33], Naïve Bayes Classification (NBC) [34,35], k-SLAM [36], CLARK [34,36] MEGAN [34], SILVAngs [26], MetaPhlAn [34], Kraken [34,36,37], CARMA [26], interpolated Markov models [36,38], to name just a few, have been used. Bioinformatics tools are playing significant roles in all fields, in medicines for the treatment and cure of some notable diseases [31,32], drug discovery and testing [39,40], microbial genome [31,32], gene discovery and therapy [31,32,41], agriculture [31,42], antibiotic resistance [43,44,45], alternative energy source [32] and also in the study of climate changes [45]. Viruses undergo a vital part in the environment such as recycling of carbon in the marine environment, infecting and destroying bacteria in aquatic microbial communities [46,47,48]. The existence and great quantity of viruses on Earth has been pointed out, hence this has increased awareness about their wide diversity [46]. Generally, viruses are known to be intracellular parasites made up of a nucleic acid core. The viruses are enclosed by a protein coat known as capsid that is capable of replication through adsorption, penetration, uncoating, viral genome replication, maturation and release, which is only possible within the living cells of bacteria, animals and plants [49,50]. Viruses depend on their host’s cells’ metabolism, for energy, enzymes, and precursors, in order to replicate and multiply. A virion is made up of a protein coat and genomic information, encoded in DNA/RNA. Viruses are categorized on the basis of their dimension, mode of replication, chemical configuration and morphology [50], as well as to establish whether they are single stranded or double stranded, linear or circular [49]. The main function of the virion is to deliver its genome into the host cell for expression and replication of itself [49]. Viruses are host specific and they depend on the host organism to supply the complex metabolic and biosynthetic machinery of eukaryotic or prokaryotic cells [50]. For viruses to propagate successfully in any cell, the virion must be able to identify and bind to its cellular receptor, as well as replicate its own genome. Studies have shown that the most prominent viral species within the aquatic ecosystem are human enteric viruses (HEV) [51,52,53], which have the ability to survive in the intestinal tract of humans and animals [54,55]. At present over 140 enteric viral serotypes that are acknowledged to infect humans, and the major illness associated with HEV is gastrointestinal illness [50]. HEV have also been implicated in acute illnesses, such as meningitis, conjunctivitis, hepatitis, poliomyelitis, respiratory diseases and severe fever [50,51]. These groups of viruses are easily transported and transmitted via adsorption phenomena, in the following way: from one contaminated water point to another (especially through the fecal–oral route) [50,52,56], from wastewater treatment plants’ effluents [51,57,58], due to agriculture runoff [51,55], leaking septic tank systems [51,59], and recreational and food products [51,60]. Although HEV cannot reproduce themselves outside their host’s cells, they still have the potential to stay alive for extended periods of time within the aquatic environment [50,61]. Moreover, some serotypes have a strong resistance to chlorine disinfection, which is the most common treatment used at many wastewater treatment facilities [50,53]. The resistance towards chlorine treatment may be due to their high resistant protein coat. However, after treatment, the effluents are released into the aquatic ecosystems, as they are the main sources for drinking water, aquaculture and recreation [61]. The outbreak of HEV disease in both developed and undeveloped nations, has been globally documented by the World Health Organization (WHO) [62]. In the United Kingdom for instance, the effects of these outbreaks has led to a huge strain on the healthcare system, economic burden, and also decreased productivity in affected persons [63]. Table 1 shows some known and identified HEV that are a threat to the global aquatic ecosystem.
Table 1

Human enteric viruses (HEV) that have been identified in various aquatic environments.

FamilyGenusCollective NamesAdverse EffectsReferences
Picornaviridae Enterovirus (ssRNA) Hepatovirus, Hepevirus, Sapovirus (ssRNA) Poliovirus, Echovirus, Coxsackievirus A, BHepatitis A, ESapporo-like virusMeningitis, Paralysis, Myocarditis, respiratory infections, gastroenteritisInfectious Hepatitis[51,52,62,64,65,66,67]
Reoviridae Rotaviridae (dsRNA) Human rotavirusGastroenteritis[51,52,62,64,65,66,67]
Adenoviridae Mastadenovirus (dsDNA) Human AdenovirusConjunctivitisGastroenteritisRespiratory diseases[51,52,62,64,65,66,67]
Caliciviridae Calicivirus (ssRNA) Polyomavirus (dsDNA) Human calicivirusNorwalk virusPolyomavirusGastroenteritis, FeverProgressive Multifocal leukoencephalopathy,Urinary tract diseases[51,52,62,64,65,66,67]
Astroviridae Mamastrovirus Parovirus Human astrovirusHuman parvovirusGastroenteritis[51,52,62,64,65,66,67]
Coronoviridae Coronavirus (ssRNA) Human coronavirusGastroenteritisRespiratory diseases[51,52,62,64,65,66,67]
Circovirus Torovirus (ssDNA) Human TorovirusGastroenteritis[51,52,62,64,65,66,67]
In South Africa, hepatitis A, adenoviruses, astroviruses, noroviruses, enteroviruses, rotaviruses and bacteriophages, have been detected in surface water [50,68,69], wastewater treatment plants [70,71], and in treated drinking water sources [59,70,72,73] in some provinces in South Africa. The identification and quantification of HEV in South Africa was mostly done using conventional and traditional methods in both clinical and environmental samples. Figure 2 shows the different provinces in South Africa where HEV have been studied and identified in different aquatic environments. Over the years, Taylor and his co-workers have extensively investigated the consecutive outbreaks and presence of some HEV outbreaks from some patients through the exposure to surface waters, dams, WWTPs [74,75,76,77]. Techniques such as metagenomics, is still an emerging technique for the identification and diversification of HEV in both environmental and clinical samples in South Africa. There is little knowledge pertaining to the viral content and diversity in wastewater systems in South Africa, which demonstrates the need to survey viral communities using metagenomics. Based on the limitations of the existing molecular methods that target specific viruses, and specific bacterial indicators, new methodologies such as metagenomics are vital for the identification of unique or unlooked-for viruses in the aquatic ecosystems.
Figure 2

Chart showing the percentage of HEV study in different province of South African aquatic ecosystems between 1993–2015 [58,69,73,74,77].

2. Conventional Methods for the Identification of HEV in Environmental Samples

Sample volume in addition to sampling method are the most challenging steps required in the identification of HEV in environmental samples [78,79]. For the initial concentration of viruses, the adsorption elution principle has been widely applied for the primary concentration of enteric viruses from water, based on the fact that viruses mechanisms are linked to the surface charge [80,81]. In line with the distinguishing viral particle surface capabilities, they have the potential to eagerly adsorb to a number of materials [82]. However, in recent years, a wide range of concentration procedures and techniques have been implemented for the primary and secondary concentration of viruses in water samples. This entails the adsorption of virus-related particles or phages onto the surface of a filter membrane, through the interaction of electrostatic charges, followed by elution with the appropriate buffer system [82,83,84,85,86,87]. Alternatively, the concentration of viral particles could also be based upon size exclusion of the particles, rather than the electrostatic interactions of the filters on the viruses [82], with varying adsorbent material and elution buffers. In Africa, some of these concentration techniques have been used and reported [59,70,72,88]. Table 2 provides a short summary of the conventional and improved concentration procedures used for the recovery of HEV in environmental samples.
Table 2

Different concentration techniques used for the concentration, recovery and isolation of viruses in environmental samples.

VirusTechniqueAdvantagesDisadvantagesReferences
Enterovirus Membrane adsorption techniqueSimple, speed, sensitiveLow efficiency of virus adsorptionEasy clogging of membrane filter[89]
Enterovirus Aqueous polymer two phase separationSimple and cost effectiveRequires small sample volumesLimited serotypes identifiedInhibitory action of salts[82,89,90]
Poliovirus, Herpesvirus, Echovirus Adsorption to precipitable salts, iron oxide, and polyelectrolytesRequires large sample volume, simple, time effectiveSpecific to certain viruses and water samples[81,82]
Poliovirus Soluble alginate filterSimple,Non-cytotoxicClogging of filters,Pre-filtration required, time consuming[81,82,89]
Poliovirus Continuous-flow ultracentrifugationOpportunity for diversityExpensive instrumentation, time consuming[81,82,89]
Bacteriophage Forced-flow electrophoresis and electro-osmosisSmall sample volumes, less processing timeSmall sample volume[82,89]
Enterovirus Hydro extractionGood recoveriesSmall sample volume[82,89]
Poliovirus Gauze samplerLarge sample volume, cost effectiveLow efficiencyMinimal recovery of viruses[82,89]
Poliovirus, Norovirus, Enterovirus Electropositive FiltrationLarge sample volume, pre-conditioning step not required, Cost effectiveNot effective for selected environmental samples including marine water and sediments, expensive[82,83,91,92]
Poliovirus, Echovirus, Reovirus, Coxsackievirus Electronegative FiltrationVarieties of adsorbent materials, available, High recoveriesConditioning of large volumes of water is difficult,Acidification protocol may lead to the formation of precipitates, Filter clogs easily, Expensive[80,82]
Poliovirus, Enterovirus, Rotavirus Glass woolLess expensive,Pre-conditioning of water sample is not requiredNot suitable for large sample volume[70,71,93,94]
Poliovirus, Echovirus, Hepatitis A Ultrafiltration (Tangential flow, Dead-end flow, Vortex)No pre-conditioning steps requiredExpensive, retreatment of fibres important[82,95,96]
Calcivirus, Hepatitis A UltracentrifugationLess time consuming,Large volumes of water are concentrated to millilitresClarification step required,Loss of viruses through the use of membrane filters, expensive[82,89,97]

2.1. Culture Based Methods

In vitro growth methods such as cell culture are the most pronounced traditional standards used to identify and detect the occurrence of HEV in environmental samples [82,98,99]. Cell culture is a technique whereby a microorganism’s cells are grown at a carefully controlled condition outside of the living animal [100]. It is a very time consuming, laborious and expensive approach that usually demands prior knowledge of the targeted species [51,70]. The limiting factor with this method is that there are some viral species that are not capable of producing any cytopathic effect when propagated on a cell line [51]. HEV detection has also been explored using the integrated cell culture polymerase chain reaction (ICC-PCR), this technique has also been used for the discovery of HEV in ecological samples [65,101,102]. The merit of this technique is that it gives room for several modifications of the protocols, enhanced the direct analysis and monitoring of HEV in environmental samples [103,104,105,106]. Epifluoroescence and transmission microscopy, is another type of conventional technique that has been explored for the abundance, morphological and enumeration studies of viral entities within the aquatic environments [107,108]. Here, the virus-like particles are counted using fluorescent nucleic acid stains through visualisation [107,108,109,110]. Flow cytometry and vortex flow filtration (VFF) have also been used for the quantification and counting of virus-like particles and prokaryotes in aquatic environments [98,111,112]. Figure 3 exhibits the numerous molecular approaches that have been used in the diagnostics and identification of HEV in environmental samples.
Figure 3

Schematic illustration of various molecular techniques applied for the identification of HEV from different environmental samples.

2.2. Polymerase Chain Reaction Methods (PCR Assays)

Polymerase chain reaction (PCR) is a sensitive conventional assay technique that is used on targeted amplification of the viral DNA or RNA over a range of magnitude to produce thousands or millions of copies [51,106]. PCR methods are designed to amplify a single specific nucleic acid sequence a million times under three distinctive steps that include denaturation, annealing and extension. For denaturation to take place, the target DNA is subjected to a high temperature in other for the DNA strands to be separated. Annealing of the primers to the target DNA allows the DNA to polymerase and selectively amplify the target DNA at a lower temperature [51]. PCR assays are very sensitive, highly specific, and particularly attractive for detection of non-cultivable infectious agents thereby making it an attractive method for the detection of target pathogens [51]. A comprehensive array of PCR systems exists for rapid detection and confirmation of the presence of HEV in different environmental samples. These samples include water sediments [113,114], wastewater treatment plants (WWTP) [59,115], treated and untreated sewage [115,116], groundwater [117], and surface water [69,102]. A wide range of primers have been designed for the precise detection of many HEV and an immediate overview of these is presented in Table 3. The chief limitation of the PCR techniques is that they are incapable of distinguishing between active and inactive targets, and are found to be prone to inhibition due to the interaction with DNA or interference with the DNA polymerase which increases false negative results. In addition, different primer sequences make it inappropriate for use, especially with the discovery of unique viruses. Previous information of the viral sequence is, therefore, a pre-requisite for any PCR reaction. Various modifications of the PCR assay have been used for detection of HEV, and they include the nested [118], multiplex [119,120,121], real time [106], and reverse-transcription polymerase chain reaction [118], all displaying their own merits and demerits.
Table 3

Review and summary of published primers for PCR Assays.

HEVPrimers and Labelled TaqMan ProbesTarget RegionReferences
Hepatitis A virusHAV68 (F): 5′-TCA CCG CCG TTT GCC TAG-3′HAV240 (R): 5′-GGA GAG CCC TGG AAG AAA G3′HAV150 (P): 5′-FAM-CCT GAA CCT GCA GGA ATT AA-MGBNFQ-3′capsid gene VP1/P2B[69,73,116,122,123]
EnterovirusEV1 (F): 5′-CCCTGAATGCGGCTAAT-3′EV1 (R): 5′-TGTCACCATA AGCAGCCA-3′EV-BHQ (P): 5′-FAM-ACGGACACCCAAAGTAGTCGGTTC-MGBNFQ-35′ Non-coding region[57,58,69,73,124,125]
RotavirusJVK (F): 5′-CAGTGGTTGATGCTCAAGATGGA-3′JVK (R): 5′-TCATTGTAATCATATTGAATACCCA-3′JVK (P): 5′-FAM-ACAACTGCAGCTTCAAAAGAAGWGT-MGBNFQ-3′NSP3 gene[69,73,126]
NorovirusesGIGIIJV13I (F) 5′-TCA TCA TCA CCA TAG AAI GAG-3′JV12Y (R) 5′-ATA CCA CTA TGA TGC AGA YTA-3′JV13I (F) 5′-TCA TCA TCA CCA TAG AAI GAG-3′G1 (R) 5′-TCN GAA ATG GAT GTT GG-3′JV12Y (F) 5′-ATA CCA CTA TGA TGC AGA YTA-3′Noro11(R) 5′-AGC CAG TGG GCG ATG GAA TTC-3′Polymerase region[73,127]
AdenovirusesJTVX(F) 5′-GGACGCCTCGGAGTACCTGAG-3′JTVX(R) 5′-ACIGTGGGGTTTCTGAACTTGTT-3′JTVX(P):5′-FAM-CTGGTGCAGTTCGCCCGTGCCA-MGBFQ-3′Hexon gene[58,128,129]
AstrovirusHAst.(F): TCAACGTGTCCGTAAMATTGTCAHAstV. (R):TGCWGGTTTTGGTCCTGTGAHAstV.probe1(FAM): CAACTCAGGAAACAGGHAstV.probe2 (FAM): CAACTCAGGAAACAAGORF 1b-VPg region ssRNA[130]
Sapovirus GI, II and IVSapo (F) A: ACCAGGCTCTCGCCACCTASapo (F) B: ATTTGGCCCTCGCCACCTASapo (R): GCCCTCCATYTCAAACACTAWTTTSapo.probeA (FAM) CTGTACCACCTATGAACCASapo.probeB (FAM) TTGTACCACCTATGAACCASapo.probe C (FAM) TGTACCACCTATAAACCASapo.probe D (FAM) TGCACCACCTATGAACRdRp-VP1 region[130,131]
SalivirusF: 5′-TCTGCTTGGTGCCAACCTC-3′R: 5′-CCARGCACACACATGAGRGGATAC-3′Probe: 5′-FAM- TGCGGGAGTGCTCTMGB- NFQ-3′VP1 region or 3CD region[132,133]
KlassevirusKLA-F; 5′-TCTGCT TGGTGCCAACCTC-3′KLA-R; 5′-CCARGC ACACACATGAGRGGATAC-3′KLA-TP; 5′FAM-TGCGGGAGTGCTCT-MGB-NFQ-3′VP0/VP3 regions[133]
Human ParechovirusF: 5′-CCA AAA TTC RTG GGG TTC-3′R: 5′-AAA CCY CTR TCT AAA TAW GC-3′VP1 capsid gene or 3CD region[134,135]
Aichi virusF: ACA CTC CCA CCT CCAGCC AGT AR: GGA AGA GCT GGG TGT CAA GA3CD junction region[134,135]
The presence of norovirus, astrovirus, enterovirus have been established have been established in surface water, ground water and wastewater samples via multiplex and nested PCR [51,120,136]. Other modified PCR techniques developed are the reverse-transcriptase polymerase chain reaction (RT-PCR) and real-time or quantitative polymerase chain reaction (qRT-PCR). The RT-PCR are able to amplify and detect HEV viruses that possess only the RNA genomic information [27,49,69,89,109,110,111,112]. These techniques has been implemented for the identification of different groups of the HEV in various environments [78,83,84,106,117,137,138,139,140,141,142]. These techniques also offer better rates of detection, and great sensitivity and accuracy. In addition, they are precise, they reduce experiment time and the possible source of contamination is reduced [51,78]. A summary of the numerous molecular techniques, principles, merits and limitations is presented in Table 4.
Table 4

Summary of the Pros and Cons of molecular methods for HEV identification.

TechniquePrincipleAdvantageDisadvantagesReferences
Cell cultureCytopathic effects potential for virusesDirect isolation of a variety of cultivable viruses to high titresHighly skilledRequires controlled conditionsExpensive and time consuming[51,99,105]
Electron microscopetransmission electron microscopyElectron beam used to illuminate viruses.Counting of the viral particles and morphologyPrior knowledge of organism not requiredDNA provides high resolution imageIt requires technical skills and expertisePoor detection limitHigh concentrationsHigh cost of maintenance and training of the instrument[97,109,110,143]
Flow cytometryDirect and rapid assays for the determination of cell numbers and morphologyHigh speed and velocitySkill generation and refrigeration a pre-requisite, expensive[112,144]
Vortex flow filtrationCounting and quantifying virus-like particlesHigh recoveryReduces filter cloggingExpensive method[107,112,144]
PCR AssayAmplification assays based on specific primers and enzyme to generate more copies of DNASequence dependentCost effectiveHigh sensitivity and specificityCannot detect new viral speciesRisk of contaminationFalse positive results[51,66,106]
ICC-PCRViral particle is amplified via host cell assaysLess vulnerable to PCR inhibition Identify non-cytopathic virusesDoes not detect non-culturable viruses, Requires multiple cell linesTime consuming, More costly than direct PCR detection[51,66,105,106]
Multiple PCRSimultaneous amplification of sequences of several pathogenic microorganisms in a reaction mixtureSequence dependent,Cost and time effective, High sensitivity and specificCannot detect new viral speciesChallenges with optimisation and sensitivityfor all targeted speciesContaminationNon-specific amplification inenvironmental samples[51,106,136,145]
Nested/Semi Nested PCRDistinct pair of primers amplifies enormous region of DNAThe amplified PCR product is now used as a template for the next round of amplificationIncreased sensitivityPotential risk of contamination and carry-over[51,106,120,146]
RT-PCR(Reverse-transcriptase PCR)Amplification is achieved by converting DNA to complementary DNA (cDNA) in a reverse transcriptionprocedureSpeed sensitivityContaminationSpecificityRepeatabilitySequence knowledge is a perquisite, expensive, Possible reaction inhibition, and there is a need for experts for the interpretation as well as the accuracy of results[51,66,78,106,137,147,148,149]
qRT-PCR (quantitative real-time PCR)Quantifies and measures amplification of DNA using dyes or fluorescent dyes or probesElimination of gel electrophoresis applicable for both culturable and unculturable microorganisms
Microarray technologyDetection is done by means of radio-labelled probes or fluorescent tagsKnown viral sequencesExpensiveReproducibility test results are poor[150,151,152,153,154]
NASBAIsothermal amplification of RNASensitive, rapid simpleResistant to matrix influenceCan be used only for organisms, which are already known[106,155,156,157,158,159,160,161]
Immunology-based methodFormation of antigen—antibody through recognition and bindingHigh sensitivitySpecificity speed Easy automation and equipmentQC assurance dependentRisk of interferencesExpensive[162,163]
Biosensor-based methodsanalytical device that identifies analytes via an electrical signalDetects non-polar moleculesHigh specificityReaction time is shortRelies on specific antibodies or DNAProbesNecessary chemical inactivation for the recognition sites[164,165,166,167,168]
NGSParallel sequencing of multiple small fragments of DNA to determine its sequence using high-throughput instrumentationFast and easy to approach for DNA sequencingLarge sequencing data per runExpensive equipment[169]

2.3. Viral Metagenomics

Viral metagenomics is a modern genomic technique used for studying viral communities in their natural habitat, without the isolation and laboratory cultivation of single species [170,171,172,173]. The sequencing of the genomic DNA information using metagenomics can be achieved either through the PCR amplicon sequencing or via shotgun metagenomics. The PCR amplicon approach, is mainly used for targeted species, the identification and characterization of the specific genomic regions is done through the use of specific primers [174,175]. The second approach, shotgun metagenomics, is a technique whereby unculturable and difficult microbes are analysed and studied extensively without prior knowledge of the state of these communities [174,176]. There has not been an individual gene marker that is peculiar to most viral genomes, like the 16S RNA used to denote the bacteria genome [1,171], hence, this has limited the understanding and investigation of viruses by amplicon sequencing and ribosomal DNA profiling [1]. Studies on viral metagenomic have revealed that a lot of the generated sequences are not similar or matching to known viruses, hence the need for viral metagenomic analysis in the virology field [171,177]. Specifically, viral metagenomics has provided the detection of viral species presumed to be a potential threat to human health [130,178], means for virus discovery [179], and the characterization of the viral population [171,180]. Figure 4A, B provide an overview of the number of research articles on metagenomic studies on human virome in diverse parts of the world. They also indicate how the number of research articles has risen from around 200 articles in 2002, to more than 12,000 articles in 2017. Due to this, more metagenomic datasets of viruses have been established [171,177]. Africa is still far behind in terms of research articles being produced, with approximately 50 articles available, to date.
Figure 4

(A) Overview of the research publications of viral metagenomic studies around the world, (B) Overview of the publication of viral metagenomic studies in Africa.

The first-generation sequencing is a chain-termination technique, where sequencing is achieved by the selective incorporation of chemical analogues of deoxyribonucleotide triphosphates (dNTPs), the monomers for DNA strand synthesis [181,182], with an approximate reads of approximately 1200 bp long [183]. This technique has been used to characterize the presence of the different groups of human adenoviruses (HAdVs) in environmental samples [184]. The main setback of this technology is that it is a low throughput, thereby limiting it as a means for diagnosis, and is labour intensive and slow [181,183]. In 2004, the revolution and activation of an improved sequencing knowledge began through the introduction of the second-generation sequencing platform [181,185]. The second-generation platform includes 454 Roche platform, Ion Torrent Personal Genome Machine, AB SOLiD and Illumina Solexa sequencers [22,23,181,185,186]. The 454 sequencing platform has been used to examine the diversity of human RNA viruses present in Lake Needwood, a freshwater lake in Maryland, USA, with results indicating the presence of four different types of viruses [187]. Likewise 454 platform was able to detect and study the dominant DNA and RNA viral species in reclaimed water, the study showed that both the reclaimed and portable water was dominated by phages [188], it has also be used as a monitoring tool for identification of viral agents of animal, plant and human diseases in freshwater samples [189]. Ion Torrent platform has also been explored for the sequencing and microbial profiling of multiple viral groups from animal samples and sediments from the Athabasca River [190,191]. The Illumina Solexa technology system seems to be the most favoured platform over other existing second-generation platforms. The sequencing of microbes is based on the sequence by synthesis (SBS), with upgraded system versions [22,185,192]. Illumina systems have been used to sequence viruses from both clinical and environmental samples [193,194]. Table 5 shows the strength and weakness of the second- and third-generation platforms. The rudimentary workflow for second-generation sequencing is shown in Figure 4.
Table 5

Summary of the various features of the different second-generation platforms indicating strength and weakness.

PlatformAmplification TechniqueChemistryRead LengthOutput and DurationAdvantagesDisadvantagesReferences
Roche 454Emulsion PCRPyro-sequencing400–700 bp100–700 Mb10–23 hLong read length,short run timesHigh error rate[22,23,186,196,198,199,200]
AB SOLiDEmulsion PCRLigation35 bp80–360 Gb between 6–8 daysLow error rateShort readsLong run time[22,23,185,186,195,196,199]
Ion Torrent (PGM)Emulsion PCRProton detection100–400100–64 Gb for 2–7 hLess sequencing time, reduces costsShort readsHomopolymer errors[22,186,192]
Illumina Solexa(MiSeq, HiSeq)Bridge PCRReversible terminators100–300600 Gb5 h to 3 days runHigh throughput, Cost and time effective, minimal error rateShort readsDecrease in quality of reads towards the ends[22,186,192]
Pacific Bioscience (SMRT)Single molecule real time (SMRT)Fluorescently labelled nucleotides4000–5000 nts200 Mb–1 Gb generated under few hoursData generation is monitored in real-time, AccurateExpensive,high error rates[22,23,186,195,196]
Helicos TM Genetic Analysis Systemnon-amplified DNA templatesFluorescently labelled nucleotides24–70 bp35 Gb for a few hoursAccurateExpensive, low data output[23,186,196]
Oxford Nanopore (MinION)Single molecule real time (SMRT)Reversible terminators90 Mbp of data with 16,000 reads6 kb–60 kbAccurateExpensive, high error rate, low throughput[22,23,186,195,196]
Recently, the emerging third-generation sequencing technologies that are being introduced in the genomic scientific world are the Pacific Biosciences Single Molecule Real Time (SMRT) sequencing, Nanopore sequencing by Oxford Nanopore, and the Helicos TM Genetic Analysis System [23,169,186,195,196]. The technology has the potential of generating high read lengths of up to 100,000 bp within hours, and is very expensive to acquire [186,195,196]. The most recent third-generation technology is Nanopore Technology, which involves the use of a small device or membrane with a pore size of approximately 1.5–2 nm [186]. The distinguishing feature of all the third-generation sequencing platforms is that the technique does not require an amplification step during the library preparation [196]. In addition, the read lengths are between 25–15,000 bp, with a run time of approximately 30 min, when compared with the second-generation platforms [195,196]. Pacific Biosciences Single Molecule Real Time technologies has explored some microbial populations [197]. Currently, these technologies are being developed and upgraded, but they have not been exclusively explored to the fullest for the determination and analysis of the HEV, probably due to cost of set-up and lack of technical skills.

3. Metagenomics and Its Application in Africa

In certain countries, viral metagenomic studies have increased gradually [171,201,202]. It is emerging as an alternative technique for viral identification, diversity and abundance, in a range of environmental samples which includes the ocean environment [48,170,203], surface freshwater bodies and lakes [187,204], ballast water [202], wastewater plants [205,206], reclaimed water [188], the atmosphere [207], plants [208], aquaculture [209], and in clinical samples such as feces [210], blood [211], and in some animals [212]. In the face of the advances in the biological world, where the cost of sequencing is gradually reducing, developing countries such as South Africa are still a long way from benefiting from the technology. Over the years, environmental metagenomic studies in South Africa have focused mainly on studying diversity and abundance of bacteria in different aquatic ecosystems and extreme environments [213]. In 2015, Tekere and co-workers carried out a metagenomic analysis study in a thermal hot spring in Limpopo. The aim was to define the genetic and phylogenetic diversity of thermophiles in this environment. The community composition, distribution and abundance of the thermophiles living in the different hot spring waters, and biofilms of South Africa, were assessed [149,213,214,215]. In addition, the abundance of halophilic bacteria were also identified from a salt pan in the Limpopo province [216]. In 2018, Abia and co-workers used metagenomics to analyse the functional profiles of some bacterial populations in sediments as well as in surface water samples. It was observed that the abundance and diversity of bacterial is attributed mainly to the occurrence of an unapproved informal settlement with poor infrastructure. The functional profiling revealed that bacteria could be a possible pathway in human diseases [217]. In addition to the natural environments, man-made extreme environments such as industrial wastewater, was also explored for bacteria diversity [218]. Metagenomics is progressing slightly in Kenya, since it has been observed that arthropods—which are referred to as blood-feeding agents for viruses—could cause an exceptional health concern [219]. The intercontinental virome diversity studies on the culex mosquitoes were done using samples from Kenya and China and analysed using NGS. The study revealed that mosquitoes are vital vectors as well as the fact that viruses are harbored by these arthropods [219]. The study also indicated the presence of some specific vertebrates, invertebrates, plants, and protozoa as well as uncategorized assembly of viruses [219]. Another part of Africa that metagenomics is also gaining momentum in is Namibia. Metagenomics has been employed to better understand virus abundance, ecology and diversity in the soil samples [220]. The enumeration of these viral particles on different types of soils has shown that viral abundance can range from 1.5 × 108 to 6.4 × 108 per gram of soil [220,221]. NGS has also been used to determine the diverse ecological patterns in the Namib Desert, the cold Miers Valley, and the Antarctica hyper arid deserts, so as to understand the response to, and microbial adaptation to, environmental stressors [222]. Likewise, comparative metagenomic studies have been conducted on the mechanisms that are likely responsible for the stress response in hypoliths in extremely hot hyper-arid desert soils [223]. In Kampala, Uganda, the diversity and richness of some HEV was investigated from wastewater samples and surface water using viral metagenomics. In this study, numerous human and vertebrate viruses were discovered, such as Herpesvirales, Iridoviridae, Poxviridae, Circoviridae, Parvoviridae, Bunyaviridae from the effluent samples [178]. Through the study, it was also established that the discharge from the wastewater treatment plant appears to influence the quality of the surface water through high viral concentrations levels. Although in this study, only the sampling and filtering of the water samples was done in Uganda, the NGS analysis, and data interpretation of the sample was done at Michigan State University in the United States. This was probably due to the fact that most of the infrastructure, cost and manpower associated with the metagenomic study and pipeline were not available. In South Africa, a study of viral diversity using metagenomics has not been explored to the fullest, except in few environments. In Kogelberg Biosphere Reserve in South Africa, the unique plant viral biodiversity was explored in a vegetation in the western province using metaviromic technique. The recovered DNA from the soil samples was sequenced under the Illumina Platform with some bioinformatics analysis carried out which detected biodiversity among the Caudovirales group [224]. The functional and phylogenetic analysis of the metaviromes revealed a high percentage of phages while distinct viromes from known isolates were left. New and emerging phage related protein sequences were also identified in this research study, thereby presenting a prospect for more research studies in such environments to explore more viral diversity using metagenomics. Metagenomics was also explored in South Africa, in Western Cape province, to determine the unique interaction of viruses’ diversity in an African hot spring community; this was achieved via electron microscopy and sequencing [225]. In this study, the metaviromes analysis was able to detect the presence of salterproviruses using a polymerase B gene phylogeny [225]. The diversified presence of phages, as well as novel archaea viruses, was also discovered in the hot spring. Likewise, a research group in the Eastern Cape province employed the approach of viral metagenomics to screen, identify, and recover, the prevalent species of Human Adenovirus (HAdV) present in sewage and mussel samples, which are associated with human infections [226]. In this study, the metaviromes indicated the predominant presence of HAdV-17 in mussel samples. This is an indication that it is not only the environmental samples that should be the most important priority; both food products and clinical samples should be screened thoroughly. The manifestation of HAdV-D17 in the seafood samples raises an alarm round the ecological health state of the river as well as the extent of contamination existing in the Swartkops River estuary [226]. Table 6 demonstrates the trends of the metagenomics approach using different sequencing platforms in Africa.
Table 6

Recent studies and application of metagenomics in some African countries.

CountryMicrobeNGS PlatformEnvironmentReferences
South AfricaBacteriathermophilesRoche 454Hot spring[149]
South AfricaBacteriaIllumina MiSeqSurface waterSediments, Industrial wastewater[215,217]
NamibiaVirus Soil, deserts[220,221]
KenyaMosquitoIlluminaClinical sample[219]
South AfricaVirusIlluminaHot spring[225]
UgandaViruses (HEV)IlluminaSurface water, WWTP[178]
South AfricaViruses (HADV)IlluminaSewageMussels[226]
South AfricaViruses (Caudovirales, phages)Illumina MiSeqSoil[224]

4. Open Research Work and Implications for Environmental Genomes

More insight into virology ecology has expanded since the commencement of viral metagenomics. At present, in South Africa, conventional molecular techniques have mainly been used in the isolation, quantification and identification of HEV. In all these conventional approaches used thus far, our knowledge of the different species of viruses in the environment has been limited. More information about the occurrence, abundance, diversity and ecological richness of these microbes remain unexplored due to lack of skills and technology. Characterization of viral communities through conventional methods or protocols is often biased, as they do not allow for total viral community analyses. Some of these techniques are peculiar to a gene or organism, tedious and specific since no specific molecular assay has the potential to determine all viruses present in a sample in one single run. NGS has received huge success and application in viral ecology in various matrices, where other techniques have had setbacks. Based on literature and scientific reports, identification of HEV using metagenomics is still an upcoming approach in resource-poor settings like underdeveloped or developing regions. The non-stop monitoring of bio-indicators in wastewater systems using metagenomics could also attribute to evaluating the distribution patterns of viral infections, as well as the microbial risk assessment, which can make available early advice of any potential disease outbreaks. The South African aquatic systems have the prospect of an almost unimaginable microbial diversity, despite the water scarcity syndrome been experienced in recent years. Techniques such as viral metagenomics can be used to improve surveillance of viral pathogens, to understand the evolution and diminishing viral species due to climate changes, and for diversity in food security and public health.

5. Conclusions and Future Perspectives

Since the introduction of metagenomics and NGS, the field has gained momentum, giving room and opportunity for the characterization of all possible microbes in a sample. Since there is not much development in the areas of cutting-edge technologies in developing nations, the quest for information regarding the state of our water systems continues to deteriorate. Emerging and recurring viral species may not be the only setbacks facing developing countries, but a problem that the entire world faces. This is due to the fact that these viruses have a mysterious way of contaminating and polluting the world’s entire aquatic ecosystem. It is proposed that the investigation about the prevalence of possible microorganisms within the aquatic system is essential because diverse activities are carried out in various parts of the world. The relatively high cost of modern molecular technologies, as well as computational human expertise for the analysis of the data generated, have greatly contributed to the slow growth of the viral microbial ecological research community in Africa. NGS is undeniably a key technology; however, the implementation of this technique is still a challenge in Africa. A wide range of challenges are defying researchers in Africa, such as limited scientific resources, limited human skills, insufficient training and lack of access to genome sequencing facilities. In addition, we recommend that more energy should be directed towards instituting more water and safety programmes in emerging nations, as this may help to break the barriers and restrictions that are swallowing up the scientific community.
  199 in total

1.  Introduction. Microbiological food safety.

Authors:  Jianghong Meng; Michael P Doyle
Journal:  Microbes Infect       Date:  2002-04       Impact factor: 2.700

2.  Integrated cell culture/PCR for detection of enteric viruses in environmental samples.

Authors:  Kelly A Reynolds
Journal:  Methods Mol Biol       Date:  2004

3.  Optimization of procedures for counting viruses by flow cytometry.

Authors:  Corina P D Brussaard
Journal:  Appl Environ Microbiol       Date:  2004-03       Impact factor: 4.792

4.  ARB: a software environment for sequence data.

Authors:  Wolfgang Ludwig; Oliver Strunk; Ralf Westram; Lothar Richter; Harald Meier; Arno Buchner; Tina Lai; Susanne Steppi; Gangolf Jobb; Wolfram Förster; Igor Brettske; Stefan Gerber; Anton W Ginhart; Oliver Gross; Silke Grumann; Stefan Hermann; Ralf Jost; Andreas König; Thomas Liss; Ralph Lüssmann; Michael May; Björn Nonhoff; Boris Reichel; Robert Strehlow; Alexandros Stamatakis; Norbert Stuckmann; Alexander Vilbig; Michael Lenke; Thomas Ludwig; Arndt Bode; Karl-Heinz Schleifer
Journal:  Nucleic Acids Res       Date:  2004-02-25       Impact factor: 16.971

5.  Roles of viruses in the environment.

Authors:  Forest Rohwer; David Prangishvili; Debbie Lindell
Journal:  Environ Microbiol       Date:  2009-11       Impact factor: 5.491

Review 6.  High-throughput sequencing technologies.

Authors:  Jason A Reuter; Damek V Spacek; Michael P Snyder
Journal:  Mol Cell       Date:  2015-05-21       Impact factor: 17.970

7.  Detection of enteroviruses in groundwater with the polymerase chain reaction.

Authors:  M Abbaszadegan; M S Huber; C P Gerba; I L Pepper
Journal:  Appl Environ Microbiol       Date:  1993-05       Impact factor: 4.792

Review 8.  Inadequately treated wastewater as a source of human enteric viruses in the environment.

Authors:  Anthony I Okoh; Thulani Sibanda; Siyabulela S Gusha
Journal:  Int J Environ Res Public Health       Date:  2010-06-14       Impact factor: 3.390

Review 9.  Traditional and Modern Cell Culture in Virus Diagnosis.

Authors:  Ali Hematian; Nourkhoda Sadeghifard; Reza Mohebi; Morovat Taherikalani; Abbas Nasrolahi; Mansour Amraei; Sobhan Ghafourian
Journal:  Osong Public Health Res Perspect       Date:  2016-01-08

10.  Surveillance of adenoviruses and noroviruses in European recreational waters.

Authors:  A Peter Wyn-Jones; Annalaura Carducci; Nigel Cook; Martin D'Agostino; Maurizio Divizia; Jens Fleischer; Christophe Gantzer; Andrew Gawler; Rosina Girones; Christiane Höller; Ana Maria de Roda Husman; David Kay; Iwona Kozyra; Juan López-Pila; Michele Muscillo; Maria São José Nascimento; George Papageorgiou; Saskia Rutjes; Jane Sellwood; Regine Szewzyk; Mark Wyer
Journal:  Water Res       Date:  2010-10-29       Impact factor: 11.236

View more
  2 in total

1.  Metavirome Sequencing to Evaluate Norovirus Diversity in Sewage and Related Bioaccumulated Oysters.

Authors:  Sofia Strubbia; Julien Schaeffer; Bas B Oude Munnink; Alban Besnard; My V T Phan; David F Nieuwenhuijse; Miranda de Graaf; Claudia M E Schapendonk; Candice Wacrenier; Matthew Cotten; Marion P G Koopmans; Françoise S Le Guyader
Journal:  Front Microbiol       Date:  2019-10-17       Impact factor: 5.640

2.  Profiling Bacterial Diversity and Potential Pathogens in Wastewater Treatment Plants Using High-Throughput Sequencing Analysis.

Authors:  Cecilia Oluseyi Osunmakinde; Ramganesh Selvarajan; Bhekie B Mamba; Titus A M Msagati
Journal:  Microorganisms       Date:  2019-10-29
  2 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.