| Literature DB >> 29187708 |
Miho Hirai1, Shinro Nishi1,2, Miwako Tsuda2, Michinari Sunamura2,3, Yoshihiro Takaki1,2,4, Takuro Nunoura1,2.
Abstract
Shotgun metagenomics is a low biased technology for assessing environmental microbial diversity and function. However, the requirement for a sufficient amount of DNA and the contamination of inhibitors in environmental DNA leads to difficulties in constructing a shotgun metagenomic library. We herein examined metagenomic library construction from subnanogram amounts of input environmental DNA from subarctic surface water and deep-sea sediments using two library construction kits: the KAPA Hyper Prep Kit and Nextera XT DNA Library Preparation Kit, with several modifications. The influence of chemical contaminants associated with these environmental DNA samples on library construction was also investigated. Overall, shotgun metagenomic libraries were constructed from 1 pg to 1 ng of input DNA using both kits without harsh library microbial contamination. However, the libraries constructed from 1 pg of input DNA exhibited larger biases in GC contents, k-mers, or small subunit (SSU) rRNA gene compositions than those constructed from 10 pg to 1 ng DNA. The lower limit of input DNA for low biased library construction in this study was 10 pg. Moreover, we revealed that technology-dependent biases (physical fragmentation and linker ligation vs. tagmentation) were larger than those due to the amount of input DNA.Entities:
Keywords: marine microbiome; metagenomics; subnanogram DNA
Mesh:
Substances:
Year: 2017 PMID: 29187708 PMCID: PMC5745018 DOI: 10.1264/jsme2.ME17132
Source DB: PubMed Journal: Microbes Environ ISSN: 1342-6311 Impact factor: 2.912
Read quality and GC contents of shotgun metagenomic libraries.
| Library ID | Input DNA (pg) | Library construction kit | PCR cycles | Library conc. (nM) | Sequenced reads | Low quality reads (%) | Low complexity reads (%) | Duplication reads (%) | Processed reads (%) | GC Read abundance | |
|---|---|---|---|---|---|---|---|---|---|---|---|
|
| |||||||||||
| Average GC% | Peak positions | ||||||||||
| MR12E02_M840_0cm_K1 | 1,000 | KAPA HP | 14 | 64.03 | 2,799,226 | 11.26 | 0.01 | 0.02 | 88.74 | 53.2% | 56.5% |
| MR12E02_M840_0cm_K2 | 100 | KAPA HP | 18 | 30.14 | 2,551,950 | 16.95 | 0.02 | 0.09 | 83.05 | 53.4% | 56.5% |
| MR12E02_M840_0cm_K3 | 10 | KAPA HP | 21 | 11.25 | 2,513,767 | 15.98 | 0.02 | 1.5 | 84.02 | 53.3% | 56.5% |
| MR12E02_M840_0cm_K4 | 1 | KAPA HP | 25 | 2.87 | 2,108,350 | 36.25 | 0.4 | 6.66 | 63.75 | 53.7% | 57.5% |
| MR12E02_M840_0cm_O1 | 1,000 | Ovation SP+ | 11 | 7.52 | 3,036,979 | 20.6 | <0.01 | 0.21 | 79.4 | 52.3% | 56.5% |
| MR12E02_M840_0cm_N1 | 1,000 | Nextera XT | 12 | 26.30 | 2,210,354 | 18.93 | 0.01 | <0.01 | 81.07 | 54.0% | 55.5% |
| MR12E02_M840_0cm_N2 | 100 | Nextera XT | 14 | 7.93 | 2,860,624 | 11.87 | <0.01 | <0.01 | 88.13 | 57.0% | 59.5% |
| MR12E02_M840_0cm_N3 | 10 | Nextera XT | 17 | 7.41 | 3,174,413 | 8.33 | <0.01 | 0.05 | 91.67 | 56.3% | 58.5% |
| MR12E02_M840_0cm_N4 | 1 | Nextera XT | 20 | 3.68 | 3,902,705 | 10.75 | <0.01 | 0.91 | 89.25 | 56.2% | 58.5% |
| KR1401_A2_0cm_K1 | 1,000 | KAPA HP | 14 | 20.36 | 3,330,923 | 8.91 | 0.12 | 0.27 | 91.09 | 57.0% | 35.5%, 61.5% |
| KR1401_A2_0cm_K2 | 100 | KAPA HP | 18 | 41.59 | 3,601,533 | 11.57 | 0.05 | 0.27 | 88.43 | 56.5% | 35.5%, 61.5% |
| KR1401_A2_0cm_K3 | 10 | KAPA HP | 21 | 20.91 | 2,405,665 | 15.3 | 0.16 | 1.8 | 84.7 | 56.4% | 35.5%, 61.5% |
| KR1401_A2_0cm_K4 | 1 | KAPA HP | 25 | 3.54 | 2,822,057 | 34.39 | 0.28 | 12.39 | 65.61 | 57.1% | 36.5%, 60.5% |
| KR1401_A2_0cm_O1 | 1,000 | Ovation SP+ | 11 | 5.9 | 3,429,835 | 22.72 | 0.01 | 0.28 | 77.28 | 55.2% | 35.5%, 60.5% |
| KR1401_A2_0cm_N1 | 1,000 | Nextera XT | 12 | 18.36 | 2,950,863 | 10.65 | 0.01 | 0.01 | 89.35 | 53.7% | 37.5%, 59.5% |
| KR1401_A2_0cm_N2 | 100 | Nextera XT | 14 | 9.65 | 3,072,133 | 10.55 | <0.01 | <0.01 | 89.45 | 58.8% | 61.5% |
| KR1401_A2_0cm_N3 | 10 | Nextera XT | 17 | 4.80 | 3,451,066 | 8.6 | <0.01 | 0.06 | 91.4 | 58.9% | 61.5% |
| KR1401_A2_0cm_N4 | 1 | Nextera XT | 20 | 3.97 | 4,123,537 | 11.19 | <0.01 | 0.85 | 88.81 | 58.0% | 61.5% |
| MR1404_22_0m_K1 | 1,000 | KAPA HP | 14 | 19.78 | 3,641,322 | 12.45 | 0.03 | 0.34 | 87.55 | 42.9% | 34.5%, 48.5% |
| MR1404_22_0m_K2 | 100 | KAPA HP | 18 | 23.95 | 2,928,383 | 16.23 | 0.07 | 0.33 | 83.77 | 43.2% | 34.5%, 49.5% |
| MR1404_22_0m_K3 | 10 | KAPA HP | 21 | 13.16 | 2,932,866 | 20.14 | 0.05 | 6.08 | 79.86 | 43.9% | 34.5%, 49.5% |
| MR1404_22_0m_K4 | 1 | KAPA HP | 25 | 1.62 | 3,555,563 | 55.01 | 0.05 | 33.49 | 44.99 | 46.1% | 35.5%, 49.5% |
| MR1404_22_0m_N1 | 1,000 | Nextera XT | 12 | 6.71 | 2,868,815 | 14.84 | 0.03 | 0.01 | 85.16 | 46.5% | 38.5%, 49.5% |
| MR1404_22_0m_N2 | 100 | Nextera XT | 14 | 2.08 | 3,626,261 | 9.03 | 0.01 | 0.06 | 90.97 | 49.5% | 40.5%, 49.5% |
| MR1404_22_0m_N3 | 10 | Nextera XT | 17 | 0.74 | 4,261,060 | 10.06 | 0.01 | 1.37 | 89.94 | 48.0% | 38.5%, 48.5% |
| MR1404_22_0m_N4 | 1 | Nextera XT | 20 | 0.89 | 3,967,406 | 15.09 | <0.01 | 5.14 | 84.91 | 49.5% | 46.5%, 66.5% |
Percentages of low quality, low complexity, and duplication reads were estimated from the number of reads detected by Trimmomatic, PRINSEQ, and Picard, respectively.
GC distribution of sequence reads are shown in Fig. 1.
Fig. 1GC distribution of MiSeq reads for each metagenomic library constructed from environmental DNA assemblages extracted from upper bathyal sediment (MR12E02_M840_0cm), abyssal sediment (KR1401_A2_0cm), and subarctic surface water (MR1404_22_0m).
Fig. 2A k-mer clustering analysis of MiSeq reads for each metagenomic library constructed from environmental DNA assemblages extracted from upper bathyal sediment (UBS: MR12E02_M840_0cm), abyssal sediment (AS: KR1401_A2_0cm), and subarctic surface water (SSW: MR1404_22_0m).
Fig. 3Compositions of SSU rRNA gene tag communities and SSU rRNA gene sequence communities identified from metagenomic libraries obtained from environmental DNA assemblages extracted from upper bathyal sediment (UBS: MR12E02_M840_0cm), abyssal sediment (AS: KR1401_A2_0cm), and subarctic surface water (SSW: MR1404_22_0m). Numbers in parentheses indicate the number of sequences identified in each library.
Fig. 4Clustering analysis of SSU rRNA gene tag communities and SSU rRNA gene sequence communities, identified from metagenomic libraries obtained from environmental DNA assemblages extracted from upper bathyal sediment (UBS: MR12E02_M840_0cm), abyssal sediment (AS: KR1401_A2_0cm), and subarctic surface water (SSW: MR1404_22_0m).