| Literature DB >> 25788971 |
Leonardo Galindo-González1, David Pinzón-Latorre1, Erik A Bergen1, Dustin C Jensen2, Michael K Deyholos3.
Abstract
BACKGROUND: Detection of induced mutations is valuable for inferring gene function and for developing novel germplasm for crop improvement. Many reverse genetics approaches have been developed to identify mutations in genes of interest within a mutagenized population, including some approaches that rely on next-generation sequencing (e.g. exome capture, whole genome resequencing). As an alternative to these genome or exome-scale methods, we sought to develop a scalable and efficient method for detection of induced mutations that could be applied to a small number of target genes, using Ion Torrent technology. We developed this method in flax (Linum usitatissimum), to demonstrate its utility in a crop species.Entities:
Keywords: EMS; Ion Torrent; Linum usitatissimum; Reverse genetics
Year: 2015 PMID: 25788971 PMCID: PMC4363359 DOI: 10.1186/s13007-015-0062-x
Source DB: PubMed Journal: Plant Methods ISSN: 1746-4811 Impact factor: 4.993
Figure 1Two-step PCR strategy adopted for high throughput sequencing. On the first-step PCR the specific gene section (target) is amplified with a forward primer (blue) bearing a universal tag (red), and a reverse primer (blue) carrying a trP1 adapter (green). For the second-step PCR the amplicons of the first step are amplified with a reverse primer that matches the trP1 adapter (green) and a forward bar-coded (pink) primer (brown) for each desired pool of individuals and genes.
Sequencing statistics of the three experiments performed
|
|
|
|
|
|
|
|
|---|---|---|---|---|---|---|
| Pilot | 314 | 74% | 119.38 | 678,532 | 47.35 | 176 |
| Proof of concept-1 | 314 | 79% | 71.80 | 459,888 | 60.31 | 156 |
| Proof of concept-2 | 314 | 76% | 85.40 | 543,659 | 63.02 | 157 |
| Scale up | 316 | 74% | 649.00 | 3,403,220 | 92.04 | 190 |
1After filtering polyclonal wells, test fragments, adapter dimers sequences and low quality reads.
Read statistics of the three experiments performed
|
|
|
|
|
|
|
|---|---|---|---|---|---|
| Pilot | S20 | N/A | 115,998 | 17.09 | 4,103.69 |
| S411 | N/A | 67,971 | 10.02 | 2,392.08 | |
| S900 | N/A | 137,314 | 20.23 | 4,794.04 | |
| Proof of concept-1 | Lus10004720 |
| 91,755 | 19.95 | 2,975.50 |
| G25305* |
| 94,242 | 20.49 | 2,942.75 | |
| Lus10031470 |
| 38,154 | 8.30 | 1,059.81 | |
| Lus10043035 |
| 53,216 | 11.57 | 1,621.20 | |
| Proof of concept-2 | Lus10004720 |
| 87,889 | 16.17 | 2,815.02 |
| G25305* |
| 143,832 | 26.46 | 4,404.28 | |
| Lus10031470 |
| 47,620 | 8.76 | 1,255.62 | |
| Lus10043035 |
| 63,282 | 11.64 | 1,941.29 | |
| Scale up | Lus10016751 | ALS-1 | 1,443,997 | 42.43 | 44,199.34 |
| Lus10029955 | ALS-2 | 424,425 | 12.47 | 13,027.62 | |
| G24175* | CLE | 534,255 | 15.70 | 16,146.73 | |
| Lus10017825 | UGT | 729,799 | 21.44 | 20,472.63 |
*Gene Id correspond to first draft assembly of flax (unpublished).
1PME = Pectinmethylesterase, ALS = acetolactate synthase, CLE = cyclic peptide, UGT = glucoronosyl/glucosyl transferase.
2Percentage from total number of reads in Table 1.
Figure 2Sequence coverage and frequency of variants in gene sections of the pilot experiment. DNA from the cultivar Macbeth was diluted (1:64 or 1:96) in CDC Bethune DNA in several simulated pools as described in methods. Each line in the sequence coverage graphs represents one of 28 pools. The frequencies of the variants are plotted against the position in the respective reference sequence section. Each symbol in the frequency graphs represents a frequency of at least 0.1% for each of the 28 pools in each position. A. S20, B. S900; no graphs for S411 are shown since no variants were detected in that sequence.
Figure 3Coverage of four PME genes in two technical replicates. The average sequence coverage from 28 pools in each one of the base positions for the four PME genes is shown.
SNVs found in four PME genes
|
|
|
|
|
|
|
|---|---|---|---|---|---|
|
| 33 | G > A | No | N/A | N/A |
|
| 96 | G > A | Yes | Heterozygous | Non-synonymous |
|
| 25 | G > A | No | N/A | N/A |
|
| 54 | G > A | Yes | Heterozygous | Non-synonymous |
|
| 81 | T > A* | No | N/A | N/A |
|
| 88 | C > T | Yes | Heterozygous | Synonymous |
|
| 97 | A > G* | No | N/A | N/A |
|
| 139 | C > T | Yes | Homozygous | Synonymous |
|
| 189 | C > T | No | N/A | N/A |
|
| 154 | C > T | Yes | Homozygous | Synonymous |
|
| 34 | A > G* | N/A | N/A | N/A |
|
| 57 | A > G* | No | N/A | N/A |
|
| 115 | G > A | No | N/A | N/A |
N/A – sequence could not be obtained by Sanger.
*Mutations not expected by EMS, but discovered using the technique.
Figure 4Alignment of amino acid sections from individuals bearing non-synonymous mutations (Table 3 ) to the original non-mutated sequences. Gene IDs are followed by an identifier given to the sequenced individuals. Circles below the alignment indicate enzyme active sites. Blue background indicates the amino acid change.
SNVs found in four genes of interest
|
|
|
|
|
|
|
|
|
|
|---|---|---|---|---|---|---|---|---|
| ALS-1 | 119 | C > T | Yes | Heterozygous | A/V | Yes | 2 homozygous | Yes++ |
| ALS-1 | 140 | C > T | No | N/A | P/L | N/A | N/A | No |
| CLE | 89 | G > A | Yes | Heterozygous | G/D | Yes | 3 heterozygous | Yes+++ |
| CLE | 94 | G > A | Yes | Heterozygous | E/K | Yes | 1 homozygous, 1 heterozygous | No |
| CLE | 134 | G > A | Yes | Heterozygous | R/H | Yes | 1 homozygous | Yes+++ |
| ALS-2 | 26 | G > A | No | N/A | G/E | N/A | N/A | No |
| ALS-2 | 43 | G > A | Yes | Heterozygous | E/K | Yes | 2 homozygous | No |
| ALS-2 | 100 | G > A | No | N/A | E/K | N/A | N/A | No |
| ALS-2 | 161 | C > T | Yes | Heterozygous | A/V | Yes | 2 homozygous | Yes+++ |
| ALS-2 | 161 | C > T | Yes | Homozygous | A/V | Yes | 2 homozygous | Yes+ |
| UGT | 27 | C > T | No | N/A | P/S | N/A | N/A | No |
| UGT | 33 | C > T | No | N/A | H/Y | N/A | N/A | No |
| UGT | 81 | C > T | Yes | Homozygous | L/F | Yes | 3 homozygous | Yes+++ |
| UGT | 99 | G > A* | Yes | Heterozygous | E/STOP | Yes | 3 heterozygous | N/A |
| UGT | 99 | G > A* | Yes | Heterozygous | E/STOP | Yes | 2 heterozygous | N/A |
| UGT | 184 | G > A | Yes | Heterozygous | G/E | Yes | 1 homozygous, 1heterozygous | No |
*Mutation was found by looking at intersecting pools with frequencies below the set threshold.
1Six individuals from progeny examined per mutation.
2The frequencies of the variants were used to run a Phyton script which automatically detects the source individual bearing the mutation (Additional file 8). Parameters used in the script were: 2 lower SD cutoff, 10 upper SD cutoff, 1 min. pools. N/A rows were not picked by the script since they were found by a different methodology. The confirmed points by the script had confidence intervals of: +75%, ++85% and +++99%.
Mutations on position 161 on ALS-2 and 99 on UGT were found in two different individuals.