| Literature DB >> 31139509 |
Kevin Bethune1, Cédric Mariac1, Marie Couderc1, Nora Scarcelli1, Sylvain Santoni2, Morgane Ardisson2, Jean-François Martin3, Rommel Montúfar4, Valentin Klein1, François Sabot1, Yves Vigouroux1, Thomas L P Couvreur1.
Abstract
PREMISE: Third-generation sequencing methods generate significantly longer reads than those produced using alternative sequencing methods. This provides increased possibilities for the study of biodiversity, phylogeography, and population genetics. We developed a protocol for in-solution enrichment hybridization capture of long DNA fragments applicable to complete plastid genomes. METHODS ANDEntities:
Keywords: DNA probes; MinION; de novo assembly; long‐range PCR; whole plastome sequencing
Year: 2019 PMID: 31139509 PMCID: PMC6526642 DOI: 10.1002/aps3.1243
Source DB: PubMed Journal: Appl Plant Sci ISSN: 2168-0450 Impact factor: 1.936
Output data obtained from MinION plastome‐enriched library sequencing
| Species | DNA | Probe origin | Total no. of reads (bp) | Median read length (bp) | Longest read (bp) | % of plastome reads | X‐fold | Longest plastome read (bp) | Median plastome read length (bp) |
|---|---|---|---|---|---|---|---|---|---|
|
| Fresh |
| 17,129 | 4627 | 26,128 | 70.8 | 5.3 | 25,828 | 4264 |
|
| Fresh |
| 81,361 | 3695 | 24,804 | 98.2 | 12.4 | 24,504 | 3398 |
|
| Fresh |
| 105,760 | 4914 | 25,468 | 97.0 | 156.3 | 25,167 | 4623 |
|
| Fresh |
| 141,250 | 3783 | 19,378 | 94.4 | 13.8 | 19,078 | 3489 |
|
| Silica gel |
| 202,924 | 2486 | 13,103 | 15.7 | 25.0 | 12,805 | 2129 |
|
| Silica gel |
| 83,833 | 2322 | 10,705 | 87.5 | 94.8 | 10,405 | 1997 |
|
| Silica gel |
| 202,925 | 2437 | 15,132 | 79.0 | 21.6 | 14,832 | 2158 |
The percentage of plastome mapped reads was calculated using Burrows–Wheeler alignment to indicated reference plastomes.
Figure 1Schematic representation of the protocol used for long‐fragment capture of plastomes (adapted from Mariac et al., 2014).
Figure 2Long‐fragment capture results for Oryza sativa. (A) Panicule of Oryza sativa (© IRD ‐ Jean‐Pierre Montoroi. Reprinted with permission.). (B) Number of reads per read length before mapping. (C) Plastome coverage after mapping. Black bars indicate approximate position of both inverted repeat (IR) regions. (D) Percentage of useful reads mapped to the O. sativa reference plastome (KT289404.1) between non‐enriched (N, light gray) and enriched (E, dark gray) libraries.
Figure 3Percentage of useful reads mapped to their respective reference plastome (see Table 1) between Illumina non‐enriched (N, light gray) and MinION enriched (E, dark gray) protocols for the six non‐model species in our study. Photo credits: Cenchrus americanus (CC0 Public Domain; https://pxhere.com/fr/photo/706162); Pennisetum glaucum (© IRD ‐ Cédric Mariac, reprinted with permission); Digitaria exilis (© IRD ‐ A. Barnaud, reprinted with permission); Podococcus acaulis (© IRD ‐ Thomas Couvreur, reprinted with permission); Raphia textilis (© IRD ‐ Thomas Couvreur, reprinted with permission); Phytelephas aequatorialis (© IRD ‐ Thomas Couvreur, reprinted with permission).
Figure 4Long‐fragment capture results for six non‐model plant species. (A) Distribution of read lengths before mapping. (B) Plastome coverage results from the enriched long‐read capture protocol. Black bars indicate approximate position of both inverted repeat (IR) regions.
De novo assembly results from real and simulated data in the number of contigs, and coverage and identity percentages to the respective reference plastome genomes (see Table 1)
| Species | Minimum overlap (bp) | Plastome contigs | Coverage % | Identity % | Longest contig (bp) |
|---|---|---|---|---|---|
|
| 3000 | 2 | 92.32 | 99.14 | 109,087 |
|
| 3000 | 2 | 99.91 | 98.52 | 81,053 |
|
| 3000 | 2 | 99.97 | 99.18 | 125,727 |
|
| 1000 | 17 | 81.24 | 98.86 | 22,803 |
|
| 1000 | 10 | 83.87 | 98.84 | 21,797 |
|
| 1000 | 13 | 87.60 | 98.31 | 20,700 |
| Simulated assembly | 1000 | 4 | 99.72 | 99.05 | 107,633 |
Minimum overlap between reads as defined in Flye.
The simulated data were based on the output results of Phytelephas aequatorialis.
| Species | Country of origin | Voucher/accession no. | DNA quality | Type of library preparation | Probes used | No. of passed reads | Reference | % reads mapped reference | Average coverage depth | Standard deviation | % mismatch | % reference uncovered | % 10× | % 50× |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
|
| Japan | (no voucher) NIP – IRD Montpellier | Band, >20 kbp | Chloroplast enrichment |
| 17,129 | KT289404.1 | 70.80 | 362.69 | 0.36 | 11.80 | 0.000 | 100.000 | 99.996 |
| Shotgun | None | 151,234 | KT289404.1 | 13.32 | 34.15 | 0.25 | 0.20 | 0.000 | 99.898 | 4.146 | ||||
|
| Senegal | (no voucher) CG14 – IRD Montpellier | Band, >20 kbp | Chloroplast enrichment |
| 81,361 | KM088021.1 | 98.23 | 2033.68 | 0.43 | 9.60 | 0.019 | 99.934 | 99.295 |
| Shotgun | None | 1,201,104 | KM088021.1 | 7.94 | 99.56 | 0.44 | 1.80 | 0.034 | 99.546 | 88.834 | ||||
|
| Senegal | (no voucher) PE01455 – IRD Montpellier | Band, >20 kbp | Chloroplast enrichment |
| 105,757 | KJ490012.1 | 97.03 | 3318.44 | 0.39 | 10.20 | 0.000 | 99.992 | 99.981 |
| Shotgun | None | 525,428 | KJ490012.1 | 0.62 | 3.19 | 0.70 | 2.20 | 10.150 | 0.950 | 0.000 | ||||
|
| Mali | (no voucher) CM05784 – IRD Montpellier | Band, >20 kbp | Chloroplast enrichment |
| 141,248 | NC_024176.1 | 94.42 | 3311.87 | 0.29 | 11.40 | 0.000 | 100.000 | 99.700 |
| Shotgun | None | 8,665,102 | NC_024176.1 | 6.85 | 593.29 | 0.33 | 0.50 | 0.045 | 99.846 | 99.462 | ||||
|
| Gabon | Couvreur TLP 556 (WAG) | Band, 15–20 kbp | Chloroplast enrichment |
| 249,416 | NC_027276.1 | 15.72 | 497.68 | 0.47 | 10.20 | 0.454 | 96.163 | 93.096 |
| Shotgun | None | 308,230 | NC_027276.1 | 0.63 | 1.14 | 2.63 | 4.60 | 56.858 | 0.597 | 0.085 | ||||
|
| Angola | Lautenschläger 1086 (K) | Slight degradation, 5–16 kbp | Chloroplast enrichment |
| 83,832 | NC_020365.1 | 87.49 | 893.98 | 0.47 | 12.00 | 0.352 | 96.068 | 91.972 |
| Shotgun | None | 57,007,740 | NC_020365.1 | 0.92 | 381.41 | 3.77 | 6.20 | 0.200 | 99.018 | 98.258 | ||||
|
| Ecuador | Couvreur 1191 (QCA) – TAGUA F1 | Smear, 1–10 kbp fragments | Chloroplast enrichment |
| 202,924 | NC_029957.1 | 79.04 | 1713.58 | 0.53 | 10.30 | 0.004 | 99.522 | 96.383 |
| Shotgun | None | 699,918 | NC_029957.1 | 3.65 | 22.68 | 1.58 | 3.50 | 0.000 | 95.083 | 1.380 |