Literature DB >> 36164303

Draft genome data of Prunus avium cv 'Stella'.

Richard M Sharpe1, Benjamin Killian1,2, Tyson Koepke1, Rishikesh Ghogare1, Nnadozie Oraguzie3, Matthew Whiting3, Lee A Meisel4, Herman Silva5, Amit Dhingra1,6.   

Abstract

Prunus avium cv. 'Stella' total cellular DNA was isolated from emerging leaf tissue and sequenced using Roche 454 GS FLX Titanium, and Illumina HiSeq 2000 High Throughput Sequencing (HTS) technologies. Sequence data were filtered and trimmed to retain nucleotides corresponding to Phred score 30, and assembled with CLC Genomics Workbench v.6.0.1. A total of 107,531 contigs were assembled with 185 scaffolds with a maximum length of 132,753 nucleotides and an N50 value of 4,601. The average depth of coverage was 135.87 nucleotides with a median depth of coverage equal to 31.50 nucleotides. The draft 'Stella' genome presented here covers 77.8% of the estimated 352.9Mb P. avium genome and is expected to facilitate genetics and genomics research focused on identifying genes and quantitative trait loci (QTL) underlying important agronomic and consumer traits.
© 2022 The Authors.

Entities:  

Keywords:  Genome; High-throughput sequencing; Prunus avium; Rosaceae

Year:  2022        PMID: 36164303      PMCID: PMC9508403          DOI: 10.1016/j.dib.2022.108611

Source DB:  PubMed          Journal:  Data Brief        ISSN: 2352-3409


Specification Table

Value of the Data

The sequence data from the first self-fertile named sweet cherry cultivar generated via mutation breeding will be useful for genomics research. The genomic data from ‘Stella’ cultivar are valuable for understanding the genetic diversity of sweet cherry, since this fruit crop experienced a genetic bottleneck during its domestication. Plant breeders, bioinformatics scientists, genomics and genetics scientists and biotechnologists will benefit from these data. These data can be used in developing trait-linked molecular markers and quantitative trait loci to develop superior cultivars. The collective information will also aid in performing functional characterization of genes.

Data Description

Cultivated Sweet Cherry varieties are an outcome of the domestication of the wild cherry Prunus avium L., and are thought to have originated in the region between the Black and Caspian Seas [1,2]. The data from Prunus avium cv. ‘Stella’ was generated using two different next generation sequencing platforms. Single, 8 kb paired-end (PE), and 20 kb PE reads were generated using pyrosequencing on the 454 GS FLX instrument, and 100 bp PE reads were generated using Illumina 2000 instrument. A total of 18.38 GB of data were generated and with the genome size estimated to be 352.9 Mb, these data represent 81.68X coverage of the genome. Specific details about the data are summarized in Table 1.
Table 1

Amount of Prunus avium cv. ‘Stella’ genomic data generated using 454 and Illumina sequencing platforms.

SpeciesData TypeAmountCoverage (x)

Prunus avium cv ‘Stella’ (sweet cherry)454-single1Gb4.44
454-8kb paired63.7Mb0.28
454-20kb paired116.5Mb0.52
Illumina (100bp PE)17.2Gb76.44
Total18.38Gb81.68
Amount of Prunus avium cv. ‘Stella’ genomic data generated using 454 and Illumina sequencing platforms. A draft genome assembly of Prunus avium cv ‘Stella’ was developed using CLC genomics that consisted of 107,531 assembled contigs organized into185 scaffolds with a maximum length of 132,753 nucleotides (nt) and an N50 value of 4,601. Average depth of coverage was 135.87nt, and median depth of coverage 31.50nt. The assembly summary report is attached as Supplementary file 1. In addition contig information in terms of consensus length, total read count and reads in pairs and average coverage is summarized in Supplementary file 2.

Experimental Design, Materials and Methods

Leaf material and Genomic DNA Purification

Leaf material from ‘Stella’ sweet cherry cultivar was collected from WSU IAREC, Prosser. Total genomic DNA was extracted from young leaf tissue using cetyltrimethylammonium bromide (CTAB) phenol chloroform extraction method [3]. Extracted DNA pellets were air dried and suspended in 50 μl of nuclease-free water and incubated at 37°C with DNase free RNase for 30 min. RNase was inactivated by incubating the tubes at 65°C for 10min. DNA was quantified using Nanodrop 8000 spectrophotometer (Thermo Scientific, Waltham, MA, USA) and 50 ng of extracted genomic DNA was electrophoresed on a 1% agarose gel and compared to lambda DNA dilution series (100, 80, 60, 40, 20, 10 ng) to confirm quality and quantity.

Genome Sequencing and Assembly

The ‘Stella’ sweet cherry genome was sequenced using multiple sequencing platforms. The data were primarily generated via Illumina sequencing platform where 76 × Illumina data were obtained from 2 × 100 standard Illumina HiSeq 2000 sequencing. Read files were obtained after initial sorting and filtering of the data via Illumina's standard data processing. Additional sequencing data were generated on the 454-sequencing platform accounting for 1.18 Gb data. A reference-based assembly of cherry genomic Illumina and 454 data was performed using CLC Genomics assembler v 7.0 with the peach genome v 2.0 as the reference [4] and using the default parameters: length fraction = 0.5, Similarity fraction = 0.8. Additionally, a de novo assembly was generated based on Illumina reads using CLC Genomics v 7.0 with the default Illumina assembly parameters: Minimum contig length = 200, mismatch cost = 2, Insertion cost = 3, Deletion cost = 3, Length fraction = 0.5, Similarity fraction = 0.8. This assembly generated 96,080 contiguous sequences with an N50 of 4,130 from a total of 136,453,160 high quality reads.

Ethics Statement

This work did not involve human subjects, animal experiments and data collected from social media platforms.

CRediT authorship contribution statement

Richard M. Sharpe: Data curation, Writing – original draft. Benjamin Killian: Investigation. Tyson Koepke: Investigation. Rishikesh Ghogare: Data curation, Investigation. Nnadozie Oraguzie: Funding acquisition. Matthew Whiting: Funding acquisition, Supervision. Lee A. Meisel: Funding acquisition, Supervision. Herman Silva: Funding acquisition, Supervision. Amit Dhingra: Conceptualization, Funding acquisition, Supervision, Methodology, Resources, Writing – review & editing.

Declaration of Competing Interest

The authors declare that they have no known competing financial interests or personal relationships which have, or could be perceived to have, influenced the work reported in this article.
SubjectOmics: Genomics
Specific subject areaDraft genome of Prunus avium cv ‘Stella’ using short and long-read high throughput sequencing technologies to facilitate sweet cherry genetics, genomics and breeding research
Type of dataGenomic sequence
How data were acquiredPacBio, Illumina and Roche 454 reads were assembled with CLC Genomics Workbench v.6.0.1
Data formatRaw read data – fastq formatAnalyzed data – fasta format
Parameters for data collectionA total of 107,531 contigs were assembled with 185 scaffolds with a maximum length of 132,753 nucleotides (nt) and an N50 value of 4,601. Average depth of coverage was 135.87nt; median depth of coverage 31.50nt. The assembly was generated from Paired-End Illumina reads.
Description of data collectionData were collected from processing the raw reads produced via the different sequencing platforms using the CLC Genomics Workbench v. 6.0.1.
Data source locationWashington State UniversityPullman, WA.United States of America46°43′52.57" N -117°10′46.63″ W
Data accessibilityData hosted on NCBI (https://www.ncbi.nlm.nih.gov/):NCBI accession - SRR4280447, Type: Illumina HiSeq 2000https://www.ncbi.nlm.nih.gov/sra/SRR4280447NCBI accession - SRR4280448, Type: 454 GS FLX Titaniumhttps://www.ncbi.nlm.nih.gov/sra/?term=SRR4280448
Related research articleHewitt S, Kilian B, Hari R, Koepke T, Sharpe R, Dhingra A (2017) Evaluation of multiple approaches to identify genome-wide polymorphisms in closely related genotypes of sweet cherry (Prunus avium L.). Comput Struct Biotechnol J. doi:10.1016/j.csbj.2017.03.002
  4 in total

1.  Proteomic analysis of the major soluble components in Annurca apple flesh.

Authors:  Carmine Guarino; Simona Arena; Luciana De Simone; Chiara D'Ambrosio; Simona Santoro; Mariapina Rocco; Andrea Scaloni; Mauro Marra
Journal:  Mol Nutr Food Res       Date:  2007-02       Impact factor: 5.914

2.  The high-quality draft genome of peach (Prunus persica) identifies unique patterns of genetic diversity, domestication and genome evolution.

Authors:  Ignazio Verde; Albert G Abbott; Simone Scalabrin; Sook Jung; Shengqiang Shu; Fabio Marroni; Tatyana Zhebentyayeva; Maria Teresa Dettori; Jane Grimwood; Federica Cattonaro; Andrea Zuccolo; Laura Rossini; Jerry Jenkins; Elisa Vendramin; Lee A Meisel; Veronique Decroocq; Bryon Sosinski; Simon Prochnik; Therese Mitros; Alberto Policriti; Guido Cipriani; Luca Dondini; Stephen Ficklin; David M Goodstein; Pengfei Xuan; Cristian Del Fabbro; Valeria Aramini; Dario Copetti; Susana Gonzalez; David S Horner; Rachele Falchi; Susan Lucas; Erica Mica; Jonathan Maldonado; Barbara Lazzari; Douglas Bielenberg; Raul Pirona; Mara Miculan; Abdelali Barakat; Raffaele Testolin; Alessandra Stella; Stefano Tartarini; Pietro Tonutti; Pere Arús; Ariel Orellana; Christina Wells; Dorrie Main; Giannina Vizzotto; Herman Silva; Francesco Salamini; Jeremy Schmutz; Michele Morgante; Daniel S Rokhsar
Journal:  Nat Genet       Date:  2013-03-24       Impact factor: 38.330

3.  Population structure and genetic bottleneck in sweet cherry estimated with SSRs and the gametophytic self-incompatibility locus.

Authors:  Stéphanie Mariette; Muriel Tavaud; Uraiwan Arunyawat; Gaëlle Capdeville; Muriel Millan; Franck Salin
Journal:  BMC Genet       Date:  2010-08-20       Impact factor: 2.797

4.  Protocol: a simple method for extracting next-generation sequencing quality genomic DNA from recalcitrant plant species.

Authors:  Adam Healey; Agnelo Furtado; Tal Cooper; Robert J Henry
Journal:  Plant Methods       Date:  2014-06-27       Impact factor: 4.993

  4 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.