| Literature DB >> 36164303 |
Richard M Sharpe1, Benjamin Killian1,2, Tyson Koepke1, Rishikesh Ghogare1, Nnadozie Oraguzie3, Matthew Whiting3, Lee A Meisel4, Herman Silva5, Amit Dhingra1,6.
Abstract
Prunus avium cv. 'Stella' total cellular DNA was isolated from emerging leaf tissue and sequenced using Roche 454 GS FLX Titanium, and Illumina HiSeq 2000 High Throughput Sequencing (HTS) technologies. Sequence data were filtered and trimmed to retain nucleotides corresponding to Phred score 30, and assembled with CLC Genomics Workbench v.6.0.1. A total of 107,531 contigs were assembled with 185 scaffolds with a maximum length of 132,753 nucleotides and an N50 value of 4,601. The average depth of coverage was 135.87 nucleotides with a median depth of coverage equal to 31.50 nucleotides. The draft 'Stella' genome presented here covers 77.8% of the estimated 352.9Mb P. avium genome and is expected to facilitate genetics and genomics research focused on identifying genes and quantitative trait loci (QTL) underlying important agronomic and consumer traits.Entities:
Keywords: Genome; High-throughput sequencing; Prunus avium; Rosaceae
Year: 2022 PMID: 36164303 PMCID: PMC9508403 DOI: 10.1016/j.dib.2022.108611
Source DB: PubMed Journal: Data Brief ISSN: 2352-3409
Amount of Prunus avium cv. ‘Stella’ genomic data generated using 454 and Illumina sequencing platforms.
| Species | Data Type | Amount | Coverage (x) |
| 454-single | 1Gb | 4.44 | |
| 454-8kb paired | 63.7Mb | 0.28 | |
| 454-20kb paired | 116.5Mb | 0.52 | |
| Illumina (100bp PE) | 17.2Gb | 76.44 | |
| 81.68 | |||
| Subject | Omics: Genomics |
| Specific subject area | Draft genome of |
| Type of data | Genomic sequence |
| How data were acquired | PacBio, Illumina and Roche 454 reads were assembled with CLC Genomics Workbench v.6.0.1 |
| Data format | Raw read data – fastq format |
| Parameters for data collection | A total of 107,531 contigs were assembled with 185 scaffolds with a maximum length of 132,753 nucleotides (nt) and an N50 value of 4,601. Average depth of coverage was 135.87nt; median depth of coverage 31.50nt. The assembly was generated from Paired-End Illumina reads. |
| Description of data collection | Data were collected from processing the raw reads produced via the different sequencing platforms using the CLC Genomics Workbench v. 6.0.1. |
| Data source location | Washington State University |
| Data accessibility | Data hosted on NCBI ( |
| Related research article |