| Literature DB >> 30931948 |
Camilla Beate Hill1, Debbie Wong2, Josquin Tibbits2, Kerrie Forrest2, Matthew Hayden2,3, Xiao-Qi Zhang4, Sharon Westcott5, Tefera Tolera Angessa4, Chengdao Li6,7.
Abstract
In barley and other cereal crops, phenological diversity drives adaptation to different cultivation areas. Improvement of barley yield and quality traits requires adaptation to specific production areas with introgression of favorable alleles dependent upon precise identification of the underlying genes. Combining targeted sequence capture systems with next-generation sequencing provides an efficient approach to explore target genetic regions at high resolution, and allows rapid discovery of thousands of genetic polymorphisms. Here, we apply a versatile target-capture method to detect genome-wide polymorphisms in 174 flowering time-related genes, chosen based on prior knowledge from barley, rice, and Arabidopsis thaliana. Sequences were generated across a phenologically diverse panel of 895 barley varieties, resulting a high mean depth coverage of ~25x allowing reliable discovery and calling of insertion-deletion (InDel) and single nucleotide polymorphisms (SNPs). Sequences of InDel and SNPs from the targeted enrichment were utilized to develop 67 Kompetitive Allele Specific PCR (KASP) markers for validation. This work provides researchers and breeders a comprehensive molecular toolkit for the selection of phenology-related traits in barley.Entities:
Mesh:
Year: 2019 PMID: 30931948 PMCID: PMC6472358 DOI: 10.1038/s41597-019-0011-z
Source DB: PubMed Journal: Sci Data ISSN: 2052-4463 Impact factor: 6.444
Fig. 1Schematic overview of the experimental and data analysis pipeline using GYDLE and custom algorithms. (a) Library preparation and target enrichment using MYbaits technology (MYcroarray®, Ann Arbour, MI, USA). (b) Quality control and filtering workflow using nuclear v3.2.6 (GYDLE Inc., Montreal, Canada). (c) SNP and (d) InDel calling and genotyping pipelines using a combination of nuclear v3.2.6, gym-build v2.6.16, coverage v2.6.16 and findsnp v2.6.16 (GYDLE) software and custom algorithms written in Perl.
Sequencing and alignment values for total read number and alignment rate. The table lists the arithmetic mean, the minimal and maximal values as well as the standard deviation (Std dev) for the total set (first line) and the respective subsets of spring, winter/facultative, two-row, and six-row type barley.
| Total reads (raw) | Total reads (filtered) | Total reads (aligned) | Aligned reads (%) | |
|---|---|---|---|---|
|
| ||||
| All | 1,237,129 | 1,236,974 | 250,585 | 22.52 |
| Two-row | 1,241,038 | 1,240,944 | 250,895 | 22.50 |
| Six-row | 1,218,497 | 1,217,753 | 251,282 | 22.81 |
| Winter/facultative | 1,046,415 | 1,045,641 | 253,353 | 23.78 |
| Spring | 1,257,319 | 1,257,312 | 250,377 | 22.51 |
|
| ||||
| All | 84,134 | 84,074 | 8,436 | 2.77 |
| Two-row | 84,134 | 84,074 | 8,436 | 2.77 |
| Six-row | 179,215 | 179,022 | 19,027 | 2.92 |
| Winter | 366,368 | 365,904 | 29,087 | 5.35 |
| Spring | 84,134 | 84,074 | 8,436 | 2.77 |
|
| ||||
| All | 14,327,607 | 14,324,144 | 1,220,662 | 52.89 |
| Two-row | 14,327,607 | 14,324,144 | 1,220,662 | 52.89 |
| Six-row | 10,017,062 | 10,014,212 | 741,609 | 43.68 |
| Winter/facultative | 2,094,153 | 2,093,049 | 646,642 | 42.95 |
| Spring | 14,327,607 | 14,324,144 | 1,220,662 | 52.89 |
|
| ||||
| All | 1,366,658 | 1,365,983 | 168,172 | 10.69 |
| Two-row | 1,385,747 | 1,385,044 | 168,421 | 10.59 |
| Six-row | 1,207,885 | 1,207,492 | 169,592 | 11.59 |
| Winter/facultative | 408,999 | 408,654 | 152,969 | 10.86 |
| Spring | 1,488,381 | 1,487,633 | 172,289 | 10.88 |
Quality measures for the target enrichment for the total population. The table lists mean values for genome-wide and target coverage, the enrichment factor calculated from that, the normalized mean target coverage, the fractions of the genome and the target covered by any read and by at least 10 reads for the total barley set.
| Quality measure | Total |
|---|---|
| Total number of reads (filtered) | 21,236,974 |
| Number of reads that map to target region | 548,074 |
| Target region size (bp) | 589,819 |
| Haploid genome size (bp) ( | 4,045,300,851 |
| Mean genome-wide coverage | 0.00015 |
| Mean target coverage | 0.2 |
| Mean target length (bp) | 3087.2 |
| Enrichment factor | 1,389.4 |
| Mean fraction of target covered (%) | 88 |
Fig. 2Consequences of genetic polymorphisms identified by targeted enrichment sequencing of flowering time genes categorized by Ensembl Variant Effect Predictor. A total of 6,030 SNP variants are categorized and percentages of potential consequences is given a) for all consequences. For InDels, a total of 1,581 variants are categorized and percentages of potential consequences is given c) for all consequences. See Ensembl Variant documentation for explanation of consequence categories (http://www.ensembl.org/info/genome/variation/predicted_data.html#consequences).
| Design Type(s) | genetic polymorphism analysis objective • strain comparison design |
| Measurement Type(s) | genetic variation |
| Technology Type(s) | DNA sequencing |
| Factor Type(s) | cultivar |
| Sample Characteristic(s) | Hordeum vulgare subsp. vulgare • leaf |