| Literature DB >> 32489396 |
Sheng Zhao1, Cuicui Zhang2, Jianqiang Mu1, Hui Zhang1, Wen Yao3, Xinhua Ding4, Junqiang Ding5, Yuxiao Chang1.
Abstract
BACKGROUND: Next generation sequencing (NGS) has been widely used in biological research, due to its rapid decrease in cost and increasing ability to generate data. However, while the sequence generation step has seen many improvements over time, the library preparation step has not, resulting in low-efficiency library preparation methods, especially for the most time-consuming and labor-intensive steps: size-selection and quantification. Consequently, there can be bottlenecks in projects with large sample cohorts.Entities:
Keywords: All-in-one sequencing (AIO-seq); Library preparation; Population genetic research; RNA-seq; Whole genome sequencing
Year: 2020 PMID: 32489396 PMCID: PMC7247233 DOI: 10.1186/s13007-020-00615-3
Source DB: PubMed Journal: Plant Methods ISSN: 1746-4811 Impact factor: 4.993
Fig. 1Flowchart for the All-in-One sequencing (AIO-seq) method. a The libraries were prepared using Tn5 transposase. b The process of mechanical fragmentation was used to prepare the libraries. c In the traditional protocol, the size selection, and quantification were processed using a one sample one tube method. d With the AIO-seq method, the library analyzed by the Agilent 2100 Bioanalyzer will give the fragment distribution pattern and the ratio of the target region (between the two blue lines) to the total library. e The concentration of the total library could be obtained by Qubit™ 4.0 Fluorometer. f The target region concentrations (TRC) were calculated within each library by multiplying the proportion of the target region from (d) and the total library concentration from (e). g Mixing the libraries in one tube according to the calculated TRC and their expected yields of the sequence data. h–i One size selection by Sage HT. j Quantification of the selected fragment by qPCR and sequencing
Library pooling of 14 rice samples using AIO-seq
| Group No. | Sample ID | Con. of lib (ng/μL)a | Ratio of 420–520 bp (%)b | Target region concentration (TRC, ng/μL) | Proportion of expected data yield in groupc | Mass of mixed target region (ng)c | Vol. for mixing (μL)d | Final data yield (Gb) |
|---|---|---|---|---|---|---|---|---|
| A | 1 | 21.00 | 11 | 2.31 | 1/7 | 20.0 | 8.66 | 6.38 |
| 2 | 25.60 | 13 | 3.33 | 1/7 | 20.0 | 6.01 | 6.02 | |
| 3 | 19.60 | 13 | 2.55 | 1/7 | 20.0 | 7.85 | 7.20 | |
| 4 | 20.40 | 12 | 2.45 | 1/7 | 20.0 | 8.17 | 6.97 | |
| 5 | 24.80 | 13 | 3.22 | 1/7 | 20.0 | 6.20 | 7.03 | |
| 6 | 22.20 | 12 | 2.66 | 1/7 | 20.0 | 7.51 | 6.50 | |
| 7 | 24.20 | 12 | 2.90 | 1/7 | 20.0 | 6.89 | 6.74 | |
| B | 1 | 20.20 | 10 | 2.02 | 1/7 | 20.0 | 9.90 | 5.79 |
| 2 | 25.80 | 13 | 3.35 | 1/7 | 20.0 | 5.96 | 5.88 | |
| 3 | 22.00 | 12 | 2.64 | 1/7 | 20.0 | 7.58 | 6.44 | |
| 4 | 24.60 | 12 | 2.95 | 1/7 | 20.0 | 6.78 | 5.95 | |
| 5 | 21.40 | 13 | 2.78 | 1/7 | 20.0 | 7.19 | 5.46 | |
| 6 | 24.60 | 13 | 3.20 | 1/7 | 20.0 | 6.25 | 5.96 | |
| 7 | 26.20 | 13 | 3.41 | 1/7 | 20.0 | 5.87 | 5.74 |
aCon. of lib means the initial concentration of library for each individual assayed by Qubit™ 4.0 Fluorometer
bAnalyzed by Agilent 2100 Bioanalyzer, indicates the ratio of the fragment between 420 and 520 bp, which will be size selected for sequencing
cFor each group, seven samples were processed together and each sample was expected to have an equal yield of data, thus the proportion of expected data yielded for each sample was 1/7, and we mixed equal amounts of the target regions (20.0 ng) for each
dVol. for mixing represents the volume of each library that needed to be pooled. Both the “Mass of mixed target region” and “Vol. for mixing” could be proportionately scaled up or down
Fig. 2Data yield distribution and sequence coverage. Distribution of the data yields among the seven samples in group A (a) and the seven samples in group B (d). The coverage distribution as a percent of the Nipponbare reference genome among the samples in groups A (b) and B (e). The Poisson distribution which is expected if there were no bias is also shown with λ equal to the expected coverage among the genome. Coverage with respect to the G + C content of the reference in the 10 kb bins along the whole genome among samples in groups A (c) and B (f). For (a) and (d), the horizontal line and small open rectangle within the box plot indicate median and mean value, respectively. The extension of vertical lines indicates minimum and maximum observations excluding outliers. The black diamonds stand for the observations
Fig. 3The distribution of data yields among the mixed samples in different groups. a, b Group C includes 30 and Group D includes 55 genomic DNA libraries sequenced in a single lane of a flowcell, separately. c In group E, a mixed library including 32 samples. Even data yields were expected among the 26 rice libraries and 6 maize libraries, respectively; and the data of each maize was expected to be approximately 5 × that of each rice sample. d In group F, a mixed library comprising seven tea libraries, two and five samples were expected to have even data yields, respectively; and the data output of each sample in the group of five was expected to be twice as much of each sample from the group of two. e Group G includes 6 RNA-seq libraries sequenced twice in two lanes. There are expectations of uneven data yields among samples within a mixed library. The horizontal line and small open rectangle within the box plot indicate median and mean value, respectively. The extension of vertical lines indicates minimum and maximum observations excluding outliers. The black diamonds stand for the observations
Fig. 4Map construction and collinearity analysis of a maize BC1F4 population. a Recombination bin map of the 110 maize BC1F4 lines from a cross between CML486 and Lx9801. Red: Lx9801 genotype; blue: CML486 genotype; yellow: heterozygote. b A high-density genetic map constructed with 2264 bin markers from AIO-seq. c Collinearity analysis between the genetic map and the physical map of B73 reference genome (version 4). The corresponding relationship and the position relationship between the maize chromosomes (Chr) and the linkage groups (LG) of the genetic map are shown
QTLs identified for plant height, ear height and leaf angle using a high-density genetic map
| Traita | Year | Chromosome | QTL | Location (cM) | Region (Mb) | LODb | R2 (%)c | Addictive effectd |
|---|---|---|---|---|---|---|---|---|
| PH | 2016 | 3 | 36.0–44.9 | 200.8–210.7 | 3.4 | 10.0 | 5.1 | |
| 2016 | 3 | 95.4–100.8 | 6.9–9.7 | 3.5 | 12.3 | − 5.8 | ||
| 2017 | 4 | 74.6–81.8 | 23.0–78.4 | 3.3 | 9.9 | 5.2 | ||
| 2017 | 6 | 81.6–87.7 | 37.8–91.6 | 2.8 | 8.3 | − 5.1 | ||
| 2017 | 6 | 87.7–93.2 | 15.1–37.8 | 4.0 | 12.4 | − 6.2 | ||
| 2016 | 8 | 106.4–110.6 | 12.7–13.5 | 3.7 | 11.2 | − 6.4 | ||
| 2017 | 8 | 123.4–139.8 | 1.4–5.1 | 2.6 | 7.5 | − 4.3 | ||
| EH | 2017 | 3 | 46.3–56.4 | 183.2–199.6 | 3.2 | 9.3 | − 3.7 | |
| 2016 | 4 | 33.1–42.3 | 184.8–200.8 | 3.2 | 10.3 | − 4.5 | ||
| 2017 | 5 | 21.0–28.7 | 210.5–215.6 | 3.6 | 10.4 | − 4.4 | ||
| 2017 | 10 | 88.2–94.3 | 148.6–149.8 | 4.8 | 15.4 | − 5.1 | ||
| LA | 2016, 2017 | 1 | 120.3–132.3 | 24.4–68.6 | 3.0–10.6 | 6.7–26.8 | − (2.4–4.1) | |
| 2016 | 1 | 133.6–135.0 | 19.3–24.4 | 5.7 | 14.9 | − 3 | ||
| 2016 | 1 | 141.5–149.3 | 11.6–15.2 | 3.1 | 6.9 | − 2.3 | ||
| 2016, 2017 | 2 | 120.0–122.4 | 11.3–13.0 | 3.6–4.7 | 8.7–11.7 | − (2.3–2.7) | ||
| 2016, 2017 | 2 | 128.0–143.4 | 1.9–9.1 | 4.6–8.4 | 12.4–20.8 | − (2.7–3.6) | ||
| 2016 | 3 | 88.2–90.7 | 12.0–20.6 | 5.4 | 11.0 | − 2.6 | ||
| 2016 | 7 | 44.2–49.0 | 131.2–142.0 | 3.9 | 8.1 | 2.4 | ||
| 2016 | 7 | 52.6–59.8 | 110.8–126.7 | 3.7 | 7.9 | 2.2 |
aTrait is the name of the component of the plant architecture: PH for plant height, EH for ear height, LA for leaf angle
bLOD means logarithm of odds
cR2 indicated the percentage of phenotypic variation explained by QTL
dPositive or negative addictive effect value indicates that the allele from Lx9801 or CML486 increases the phenotypic value, respectively
Comparison of the labor input between traditional method and simplified AIO-seq for 96 RIL samples
| Step | Traditional pipelinea | Simplified AIO-seq pipeline | ||
|---|---|---|---|---|
| Time needed (day) | Hands-on time (hour) | Time needed (day) | Hands-on time (hour) | |
| Library preparationb | 1.5 | 8 | 0.5 | 5 |
| Size profile analysis | 0.5 | 2 | 0 | 0 |
| Size selection | 0.5 | 6 | 0.1 | 0.2 |
| qPCR of library | 0.2 | 5 | 0.1 | 0.2 |
| Total | 2.7 | 21 | 0.7 | 5.4 |
aTraditional pipeline is started from mechanical or enzymatic fragmentation
bLibrary preparation includes steps from genomic DNA to the finish of the PCR