| Literature DB >> 27493679 |
Guo-Qian Yang1, Yun-Mei Chen1, Jin-Peng Wang2, Cen Guo1, Lei Zhao1, Xiao-Yan Wang3, Ying Guo1, Li Li2, De-Zhu Li3, Zhen-Hua Guo3.
Abstract
BACKGROUND: The double digest restriction-site associated DNA sequencing technology (ddRAD-seq) is a reduced representation sequencing technology by sampling genome-wide enzyme loci developed on the basis of next-generation sequencing. ddRAD-seq has been widely applied to SNP marker development and genotyping on animals, especially on marine animals as the original ddRAD protocol is mainly built and trained based on animal data. However, wide application of ddRAD-seq technology in plant species has not been achieved so far. Here, we aim to develop an optimized ddRAD library preparation protocol be accessible to most angiosperm plant species without much startup pre-experiment and costs.Entities:
Keywords: Genotype-by-sequencing; MiddRAD; Next-generation sequencing; RAD-seq; ddRAD
Year: 2016 PMID: 27493679 PMCID: PMC4973087 DOI: 10.1186/s13007-016-0139-1
Source DB: PubMed Journal: Plant Methods ISSN: 1746-4811 Impact factor: 4.993
Fig. 1Library preparation flowchart of MiddRAD protocol B and the original ddRAD protocol. a MiddRAD protocol B. b The original ddRAD library constructing flowchart, this procedure is adopted by Peterson et al. Protocol B contains nine steps while the original ddRAD protocol contains 12 steps. Only size-selected fragments are as templates for PCR amplification in protocol B
Total number of fragments produced by in silico digestion of 23 species
| Species |
|
|
|
|
|
|
|---|---|---|---|---|---|---|
|
| 2352 | 2795 | 3,710,380 | 271,860 | 88,220 | 70,512 |
|
| 7498 | 9006 | 16,747,131 | 803,332 | 306,391 | 162,398 |
|
| 4876 | 5769 | 7,169,615 | 329,150 | 147,647 | 97,378 |
|
| 4994 | 5903 | 8,028,433 | 380,967 | 162,506 | 91,994 |
|
| 12,306 | 14,912 | 3,237,571 | 535,195 | 106,924 | 166,584 |
|
| 4592 | 5255 | 4,221,765 | 245,762 | 99,163 | 59,272 |
|
| 4786 | 5413 | 7,434,539 | 329,065 | 144,069 | 102,802 |
|
| 1450 | 1737 | 3,342,214 | 116,754 | 62,340 | 32,319 |
|
| 6599 | 7483 | 5,467,255 | 433,922 | 170,591 | 129,105 |
|
| 9469 | 10,906 | 10,160,790 | 944,853 | 308,661 | 164,996 |
|
| 3376 | 3979 | 6,027,724 | 226,779 | 114,082 | 80,088 |
|
| 8978 | 10,334 | 7,341,396 | 716,454 | 218,488 | 177,248 |
|
| 11,702 | 13,558 | 10,008,594 | 787,796 | 234,856 | 120,412 |
|
| 6458 | 7804 | 7,904,182 | 483,300 | 199,616 | 103,224 |
|
| 3412 | 4135 | 4,915,092 | 247,403 | 113,511 | 78,634 |
|
| 8364 | 9943 | 9,894,486 | 849,552 | 307,001 | 174,810 |
|
| 3116 | 4351 | 3,505,137 | 237,185 | 90,631 | 67,280 |
|
| 15,086 | 18,005 | 12,249,900 | 596,412 | 255,692 | 124,564 |
|
| 10,042 | 11,855 | 4,881,316 | 595,046 | 138,534 | 171,045 |
|
| 51,097 | 58,916 | 23,112,695 | 3,233,281 | 734,959 | 707,912 |
|
| 21,528 | 23,978 | 8,200,062 | 1,217,504 | 301,472 | 329,922 |
|
| 15,347 | 17,861 | 4,683,693 | 757,165 | 145,882 | 225,478 |
|
| 85,902 | 2797 | 21,096,385 | 4,784,940 | 807,008 | 886,527 |
Number of fragments between 400–700 bp produced by in silico digestion of 23 species
| Species |
|
|
|
|
|
|
|---|---|---|---|---|---|---|
|
| 132 | 287 | 42,400 | 42,261 | 14,803 | 11,512 |
|
| 521 | 524 | 51,073 | 105,780 | 42,596 | 18,519 |
|
| 441 | 226 | 20,508 | 45,611 | 23,272 | 13,128 |
|
| 386 | 383 | 32,588 | 50,912 | 21,822 | 11,769 |
|
| 796 | 1948 | 50,586 | 60,014 | 18,552 | 21,701 |
|
| 276 | 235 | 19,405 | 32,969 | 14,169 | 8258 |
|
| 433 | 481 | 30,274 | 45,223 | 20,689 | 14,832 |
|
| 103 | 83 | 10,113 | 13,958 | 8173 | 3791 |
|
| 389 | 913 | 51,225 | 55,068 | 26,123 | 17,209 |
|
| 783 | 607 | 80,253 | 140,632 | 49,260 | 24,250 |
|
| 242 | 209 | 16,355 | 31,259 | 17,838 | 12,145 |
|
| 622 | 876 | 64,497 | 95,233 | 36,744 | 26,304 |
|
| 1285 | 972 | 61,985 | 121,796 | 35,348 | 16,701 |
|
| 541 | 472 | 42,743 | 65,689 | 29,666 | 13,827 |
|
| 277 | 152 | 15,658 | 37,530 | 17,857 | 11,229 |
|
| 655 | 666 | 65,729 | 129,175 | 53,775 | 26,496 |
|
| 274 | 123 | 14,466 | 34,514 | 14,194 | 9910 |
|
| 2250 | 421 | 42,919 | 93,748 | 36,617 | 16,313 |
|
| 696 | 1451 | 61,098 | 75,621 | 23,537 | 24,080 |
|
| 2309 | 7086 | 367,267 | 458,669 | 130,850 | 116,861 |
|
| 1923 | 2740 | 128,588 | 161,131 | 56,433 | 56,091 |
|
| 896 | 2434 | 81,751 | 96,716 | 26,592 | 31,528 |
|
| 3344 | 289 | 521,797 | 517,204 | 122,210 | 131,141 |
Fig. 2In silico digestion genome sequences of 23 plant species by AvaII + MspI. The line graph shows tag number at every 100 bp interval from 0–1500 bp (a). Both total tag number (b) and tag number within 400–700 bp (c) are correlated positively with genome size
Fig. 3A comprehensive analysis of data produced by library B. This includes raw reads and clean reads number (a), correct restriction sites ratio (b), GC-content (c), adapter reads ratio (d), data quality of O. sativa read1 (e) and O. sativa read2 (f)
Summary of alignment statistics of sequencing data
| Individual no. | Scaffolds (%) | CDS (%) | Repeats (%) | |||
|---|---|---|---|---|---|---|
| Read1 | Read2 | Read1 | Read2 | Read1 | Read2 | |
|
| ||||||
|
| 63.21 | 61.35 | – | – | – | – |
|
| 15.83 | 13.95 | 3.31 | 3.10 | 0.49 | 0.39 |
|
| ||||||
|
| 88.22 | 82.50 | 19.39 | 19.64 | 6.85 | 7.70 |
|
| 90.66 | 84.86 | 2.38 | 2.83 | 10.95 | 8.82 |
|
| 83.80 | 79.93 | 2.38 | 2.87 | 0.11 | 0.17 |
|
| 3.37 | 3.71 | 1.08 | 1.11 | 0.00 | 0.22 |
|
| 62.23 | 61.97 | – | – | – | – |
|
| 64.15 | 62.59 | – | – | – | – |
|
| 23.19 | 21.72 | 1.40 | 1.59 | 0.10 | 0.15 |
|
| 25.56 | 24.66 | 1.39 | 1.64 | 0.11 | 0.16 |
As CDS and repeats region were not available for D. latiflorus survey genome sequences, MiddRAD data of D. latiflorus individuals were only mapped to the assembled scaffolds
MiddRAD-seq data summary in rice and maize
| Genome information |
|
|
|---|---|---|
| Genome size (Mb) | 383 | 2300 |
| % of repeats in genome | 39.11 | 85.00 |
| GC content (%) | 43.56 | 46.83 |
| Expected information | ||
| Enzyme pairs |
|
|
| Expected RAD tag size range (bp) | 460–680 | 500–680 |
| Expected no. of RAD tags | 60,925 | 284,179 |
| Tags density per 100 kb | 15.92 | 12.36 |
| % of tags in CDS | 25.10 | 3.04 |
| Observed information | ||
| Raw reads | 14,732,449 | 7,414,009 |
| Clean reads | 14,146,516 | 7,337,556 |
| Observed no. of tags | 66,547 | 290,001 |
| Tag average depth | 212.58 | 25.30 |
| Tags per 100 kb | 17.38 | 12.61 |
| Simplification ratio (%) | 2.43 | 1.77 |
| % of tags in CDS | 31.49 | 1.15 |
| % of tags in repeats | 15.83 | 31.44 |
Fig. 4Comparison of the real sequencing data with in silico predicted results. a Each of the 12 rice chromosomes is expected to produce 3521–6414 fragments while each actually generates 4121–7404 tags with the Pearson correlation coefficient r = 0.8374. b The 10 maize chromosomes each is expected to produce 20,555–41,216 fragments while each is observed to generate 21,491–42,376 tags with the Pearson correlation coefficient r = 0.9792
A comprehensive data analysis of P. edulis and A. semialata
| Species | Raw reads no. | Percentage of adapter reads (%) | Percentage of reads with correct restriction sites (%) | GC content (%) | Clean reads no. | Tag no. | Average tag depth | ||
|---|---|---|---|---|---|---|---|---|---|
| Read1 | Read2 | Read1 | Read2 | ||||||
|
| 8,142,517 | 1.19 | 2.45 | 97.94 | 96.39 | 52 | 8,045,315 | 128,803 | 30.18 |
|
| 14,651,272 | 2.32 | 3.96 | 96.08 | 94.85 | 53 | 14,299,253 | 98,869 | 96.18 |
Fig. 5Histograms of data generated by each barcode and sub-library. a Distribution of reads across 40 barcoded samples in a single lane for the D. latiflorus F1 population. Each barcode was used twice. White bars represent data generated by each barcode the first time and black bars represent data generated by each barcode the second time. b Distribution of reads number across four indexed sub-libraries for the D. latiflorus F1 population
Fig. 6Maximum likelihood phylogenetic reconstruction of three bamboo species. Two clades are formed: the first contains the genus Dendrocalamus. D. latiflorus individual 1 is sister to D. latiflorus individual 2. In the second clade, P. rubicunda is sister to P. vivax, which themselves form a monophyletic clade