| Literature DB >> 29214201 |
Pingchuan Li1, Bill Biligetu1, Bruce E Coulman1, Michael Schellenberg2, Yong-Bi Fu3.
Abstract
Crested wheatgrass [Agropyron cristatum L. (Gaertn.)] is an important cool-season forage grass widely used for early spring grazing. However, the genomic resources for this non-model plant are still lacking. Our goal was to generate the first set of next generation sequencing data using the genotyping-by-sequencing technique. A total of 272 crested wheatgrass plants representing seven breeding lines, five cultivars and five geographically diverse accessions were sequenced with an Illumina MiSeq instrument. These sequence datasets were processed using different bioinformatics tools to generate contigs for diploid and tetraploid plants and SNPs for diploid plants. Together, these genomic resources form a fundamental basis for genomic studies of crested wheatgrass and other wheatgrass species. The raw reads were deposited into Sequence Read Archive (SRA) database under NCBI accession SRP115373 (https://www.ncbi.nlm.nih.gov/sra?term=SRP115373) and the supplementary datasets are accessible in Figshare (10.6084/m9.figshare.5345092).Entities:
Keywords: Crested wheatgrass; Diploid; Genotyping-by-sequencing; Raw sequence data; Tetraploid
Year: 2017 PMID: 29214201 PMCID: PMC5712052 DOI: 10.1016/j.dib.2017.09.030
Source DB: PubMed Journal: Data Brief ISSN: 2352-3409
List of 17 crested wheatgrass accessions used in the study.
| AC Parkland | – | FOR483 | Canada | 2x |
| Fairway | CN32968 | FOR533 | Canada | 2x |
| PGR 16452 | CN43215 | – | Kazakhstan | 2x |
| PGR 16454 | CN43217 | – | Iran | 2x |
| S9542 | – | S9542 | Canada | 2x |
| AC Goliath | CN108673 | – | Canada | 4x |
| AC Newkirk | – | FOR552 | Canada | 4x |
| Karabalykskij 202 | CN31068 | – | Kazakhstan | 4x |
| Kirk | CN108662 | – | Canada | 4x |
| PGR 16830 | CN43478 | – | Kazakhstan | 4x |
| S8959E | – | FOR917 | Canada | 4x |
| S9491 | – | S9491 | Canada | 4x |
| S9514 | – | S9514 | Canada | 4x |
| S9516 | – | S9516 | Canada | 4x |
| S9544 | – | S9544 | Canada | 4x |
| S9556 | – | S9556 | Canada | 4x |
| Vysokij 9 | CN30995 | – | Siberia | 4x |
CN number is the accession identification in Plant Gene Resources of Canada, Agriculture and Agri-Food Canada (AAFC), while the alternative accession labels including FOR or S are from the joint forage breeding program of the University of Saskatchewan and AAFC.
The MiSeq sequence data profile and empirical genomic coverage (EgC) for 17 crested wheatgrass accessions.
| AC Parkland | 6,823,899 | 5,968,413 | 5,948,980 | 12,749×239 | 3,057,232 | 0.044 | SAMN07502767 – SAMN07502782 |
| Fairway | 8,838,311 | 7,023,400 | 5,312,458 | 18,204×240 | 4,376,232 | 0.063 | SAMN07502815 – SAMN07502846 |
| PGR16452 | 8,415,793 | 6,919,568 | 5,252,776 | 34,236×241 | 8,283,160 | 0.120 | SAMN07502847 – SAMN07502878 |
| PGR16454 | 7,750,962 | 6,400,207 | 6,070,265 | 27,981×241 | 6,750,278 | 0.098 | SAMN07502879 – SAMN07502894 |
| S9542 | 7,905,221 | 6,248,364 | 6,048,368 | 18,528×235 | 4,371,062 | 0.063 | SAMN07502991 – SAMN07503006 |
| AC Goliath | 7,084,059 | 5,981,098 | 5,961,130 | 12,227×241 | 2,947,930 | 0.022 | SAMN07502735 – SAMN07502750 |
| AC Newkirk | 6,832,714 | 5,621,605 | 5,605,880 | 9471×239 | 2,265,760 | 0.017 | SAMN07502751 – SAMN07502766 |
| Karabalykskij 202 | 6,826,807 | 5,412,874 | 5,134,977 | 8754×238 | 2,089,266 | 0.015 | SAMN07502799 – SAMN07502814 |
| Kirk | 6,103,771 | 5,172,052 | 5,153,987 | 8580×241 | 2,069,305 | 0.015 | SAMN07502911 – SAMN07502926 |
| PGR16830 | 7,208,410 | 5,978,683 | 5,668,841 | 10,722×238 | 2,560,384 | 0.019 | SAMN07502895 – SAMN07502910 |
| S8959E | 6,312,324 | 5,373,354 | 5,353,217 | 9546×241 | 2,300,750 | 0.017 | SAMN07502927 – SAMN07502942 |
| S9491 | 8,412,288 | 6,951,698 | 6,910,099 | 14,807×240 | 3,563,450 | 0.026 | SAMN07502943 – SAMN07502958 |
| S9514 | 8,194,719 | 6,883,223 | 6,836,397 | 9328×240 | 2,238,810 | 0.017 | SAMN07502959 – SAMN07502974 |
| S9516 | 8,010,454 | 6,687,373 | 6,635,121 | 11,730×240 | 2,815,457 | 0.021 | SAMN07502975 – SAMN07502990 |
| S9544 | 8,496,678 | 7,132,507 | 6,896,082 | 16,800×236 | 3,968,754 | 0.029 | SAMN07503007 – SAMN07503022 |
| S9556 | 7,440,684 | 6,235,134 | 6,026,824 | 17,085×237 | 4,050,705 | 0.030 | SAMN07503023 – SAMN07503038 |
| Vysokij 9 | 6,874,703 | 5,872,351 | 5,855,738 | 12,528×239 | 2,999,840 | 0.022 | SAMN07502783 – SAMN07502798 |
Only the forward (R1) sequence reads from all individual plants of each accession were used for contig assembly.
The number of sticky ends of PstI 'TGCA' found in 5′ end.
The length representing the average contig length in base pairs.
Based on the average genome size of crested wheatgrass estimated through flow cytometry with those of Triticum durum and Triticum aestivum.
| Subject area | |
| More specific subject area | |
| Organism | |
| Type of data | |
| How data was acquired | |
| Data format | |
| Experimental factors | |
| Experimental features | |
| Data source location | |
| Data accessibility |