| Literature DB >> 23587339 |
Xin Zhou1, Yiyuan Li, Shanlin Liu, Qing Yang, Xu Su, Lili Zhou, Min Tang, Ribei Fu, Jiguang Li, Quanfei Huang.
Abstract
BACKGROUND: Next-generation-sequencing (NGS) technologies combined with a classic DNA barcoding approach have enabled fast and credible measurement for biodiversity of mixed environmental samples. However, the PCR amplification involved in nearly all existing NGS protocols inevitably introduces taxonomic biases. In the present study, we developed new Illumina pipelines without PCR amplifications to analyze terrestrial arthropod communities.Entities:
Year: 2013 PMID: 23587339 PMCID: PMC3637469 DOI: 10.1186/2047-217X-2-4
Source DB: PubMed Journal: Gigascience ISSN: 2047-217X Impact factor: 6.524
Figure 1Schematic pipelines of conventional and PCR-independent NGS biodiversity analyses.
Sample composition, sequencing information and recovery rates of the bulk insect sample
| Number of Individuals | 73 |
| Number of | 69 |
| Number of MOTUs (2%) | 37 |
| Raw data size (Gb) | 15.5 |
| High quality data size (Gb) | 13.2 |
| Discovery rate (with reference) | 97% |
| Discovery rate (no reference) | 92% |
| Assembly coverage rate (% MT genomes) | 74% |
| Total length and percentage of | 51,768 (96%) |
| Number of assembled mitochondrial genes 2 | 613 |
1 The total length (bp) of assembled COI genes and the percentage of assembled COI sequences for 37 full-length COI genes (~ 1,530 bp for each species).
2 Note that a small portion of the genes were assembled into two scaffolds.
Results of assembly for the 35 detected scaffolds and 370 mitochondrial scaffolds
| Minimum | 403 | 106 |
| Average | 3,544 | 1,065 |
| N50 | 5,674 | 1,721 |
| N90 | 1,890 | 444 |
| Maximum | 15,326 | 15,326 |
Figure 2Assembly results of mitochondrial genes. Green bars represent successfully annotated and aligned genes and black bars represent gaps. Additional genes (non-COI) are also assembled and annotated (Table 1), but cannot be aligned to same scaffolds containing COI. These are not shown in the figure. The suffix (A / B) indicates the corresponding reference was assembled into different scaffolds.
Figure 3Rarefaction curves for discoveries of (A) MOTUs and (B) biomass. With increasing sequence quantity, discovery rates for MOTUs and biomass are both improved, using a reference-based method (red dashed line) and a reference independent method (blue solid line).
Figure 4Correlation between biomass and data volume. The sequence volume for each given taxon was measured by number of base pairs. The corresponding biomass was estimated based on the body-length. Hollow circles represents the taxa with a single individual, while solid circles represents the taxa with multiple individuals. Number of individuals in the corresponding MOTU is also labeled on the solid circle. Small taxa were typically sequenced at lower sequencing volumes. But those represented by multiple individuals had higher sequencing volume. The increase of sensitivity for these small insects was likely a consequence of the increased total biomass for the given species.