| Literature DB >> 35099622 |
Camille S Delavaux1,2,3, Robert J Ramos4,5, Sidney L Sturmer6, James D Bever4,5.
Abstract
Arbuscular mycorrhizal fungi (AMF; Glomeromycota) are difficult to culture; therefore, establishing a robust amplicon-based approach to taxa identification is imperative to describe AMF diversity. Further, due to low and biased sampling of AMF taxa, molecular databases do not represent the breadth of AMF diversity, making database matching approaches suboptimal. Therefore, a full description of AMF diversity requires a tool to determine sequence-based placement in the Glomeromycota clade. Nonetheless, commonly used gene regions, including the SSU and ITS, do not enable reliable phylogenetic placement. Here, we present an improved database and pipeline for the phylogenetic determination of AMF using amplicons from the large subunit (LSU) rRNA gene. We improve our database and backbone tree by including additional outgroup sequences. We also improve an existing bioinformatics pipeline by aligning forward and reverse reads separately, using a universal alignment for all tree building, and implementing a BLAST screening prior to tree building to remove non-homologous sequences. Finally, we present a script to extract AMF belonging to 11 major families as well as an amplicon sequencing variant (ASV) version of our pipeline. We test the utility of the pipeline by testing the placement of known AMF, known non-AMF, and Acaulospora sp. spore sequences. This work represents the most comprehensive database and pipeline for phylogenetic placement of AMF LSU amplicon sequences within the Glomeromycota clade.Entities:
Keywords: Amplicon sequencing; Bioinformatics; Environmental DNA; Glomeromycota; Large subunit; Phylogenetics
Mesh:
Substances:
Year: 2022 PMID: 35099622 PMCID: PMC8907093 DOI: 10.1007/s00572-022-01068-3
Source DB: PubMed Journal: Mycorrhiza ISSN: 0940-6360 Impact factor: 3.387
Pipeline steps and associated outputs
| Step | Description | Major output |
|---|---|---|
| Initial processing | Primer (LROR/FLR2) and Illumina adaptor removal | Trimmed sequences |
| Visualize quality to determine sequence length cutoff | Quality plots | |
| Quality filter | DADA2 pipeline: filter, clean, and merge paired-end reads | ASV table and.fasta file |
| Pre-screening BLAST against AMF reference database | BLAST filtered ASV table and.fasta file | |
| OTU clustering | BLAST filtered OTU table and.fasta file | |
| Align sequences | Align reference and study sequences for forward (R1) and reverse (R2) reads | Forward and reverse.fasta alignments |
| Concatenate forward and reverse alignments | Concatenated.fasta alignment | |
| Tree building | Place study sequences (in batches) into backbone reference tree | RAxML trees |
| Extract AMF | Extract sequences in the Glomeromycota clade | Glomeromycota clade filtered OTU or ASV table and.fasta file |
| Extract AMF families | Extract sequences within each of the 11 Glomeromycota family clades | Family clade filtered OTU or ASV tables and.fasta files |
Fig. 1The rRNA gene region with commonly used AMF primers. Primers used in studies from which database sequences were extracted. The scale represents basepair number. The SSU is represented by 18S, while the LSU is represented by 28S. ITS1 and ITS2 are represented by the grey regions either side of 5.8S, with ITS1 to the left and ITS2 to the right. Full sequences can be found in Table S1
Fig. 2The new reference tree for placement of AMF via LSU amplicon sequences. This expanded tree includes 174 AMF across 11 families (indicated by shades of blue) and sequences representing the major neighboring clades of the Basidiomycota, Ascomycota, Mucoromycota, Mortierellomycota, and Chytridomycota as well as two animal and two plant outgroups (indicated by shades of grey)
Fig. 3An example of non-homologous sequences corrected by the new pipeline. The same 15 OTUs were placed in the tree without an initial BLAST screening (A; shaded in purple), but including this step removed these non-homologous sequences, with only OTU 10 retained (B)