| Literature DB >> 30197923 |
Louise Brousseau1,2, Sabine Nidelet1, Réjane Streiff1.
Abstract
Here, we introduce new whole-genome shotgun sequencing and annotation data describing the autosomal vs. Z-heterosomal localization of nuclear genomic scaffolds of the moth species Ostrinia scapulalis. Four WGS libraries (corresponding to 2 males and 2 females) were sequenced with an Illumina HiSeq2500 sequencing technology, and the so-called 'AD-ratio' method was applied to distinguish between autosomal and Z-heterosomal scaffolds based on sequencing depth comparisons between homogametic (male) and heterogametic (female) libraries. A total of 25,760 scaffolds (corresponding to 341.69 Mb) were labelled as autosomal and 1273 scaffolds (15.29 Mb) were labelled as Z-heterosomal, totaling about 357 Mb. Besides, 4874 scaffolds (29.07 Mb) remain ambiguous because of a lack of AD-ratio reproducibility between the two replicates. The annotation method was evaluated a posteriori, by comparing depth-based annotation with the exact localization of known genes. Raw genomic data have been deposited and made accessible via the EMBL ENA BioProject id PRJEB26557. Comprehensive annotation is made accessible via the LepidoDB database (http://bipaa.genouest.org/sp/ostrinia_scapulalis/download/genome/v1.2/).Entities:
Keywords: AD-ratio; Autosomes; Depth analysis; Genome; HiSeq2500; NGS; Ostrinia scapulalis; Sex-chromosome; Structural annotation; Z-heterosome
Year: 2018 PMID: 30197923 PMCID: PMC6127984 DOI: 10.1016/j.dib.2018.08.011
Source DB: PubMed Journal: Data Brief ISSN: 2352-3409
Fig. 1Total assembly length (~ 420 Mb) partitioning into autosomal (~ 342 Mb), Z-heterosomal (~ 15 Mb), putatively Z-heterosomal (~ 29 Mb) and un-annotated (~ 33 Mb) genomic regions.
| Subject area | Biology |
| More specific subject area | Genomics, Bioinformatics |
| Type of data | DNA sequence reads and annotation table |
| How data was acquired | Shotgun whole genome sequencing (Illumina HiSeq2500) |
| Data format | Raw (2 × 125 raw reads) and Analyzed (Scaffolds annotation) |
| Experimental factors | Wild-type specimens collected in the field |
| Experimental features | Genome: DNA sequencing |
| Data source location | Abbeville, France (50° 8′11.03′′N; 1°49′49.22′′E) |
| Data accessibility | Raw reads are accessible through the EMBL ENA BioProject id PRJEB26557 ( |
| Annotation data are made accessible | |
| Related article | B. Gschloessl, F. Dorkeld, P. Audiot, A. Bretaudeau, C. Kerdelhué, R. Streiff (2018) |