Naoki Yoshida, Chikara Kaito.
Abstract
In this article, we report the first de novo transcriptome assembly of the African bullfrog Pyxicephalus adspersus. A total of 75,320,390 clean reads were obtained from African bullfrog mRNA using the Illumina paired-end sequencing platform. De novo assembly yielded 136,958 unigenes, in which 30,039 open reading frames (ORFs) were detected. This dataset provides basic information for molecular-level analysis of this species, which enters a state of dormancy called estivation under dry conditions at ordinary temperatures.
Keywords: African bullfrog; Pyxicephalus adspersus; RNA-Seq; Transcriptome; de novo assembly
Year: 2020 PMID: 32211462 PMCID: PMC7082503 DOI: 10.1016/j.dib.2020.105388
Source DB: PubMed Journal: Data Brief ISSN: 2352-3409
Statistics of sequencing reads, transcripts and unigenes of the African bullfrog.
| Category | Statistic | Value |
|---|---|---|
| Sequencing | Total raw reads | 75,604,146 |
| Sequencing | Total clean reads | 75,320,390 |
| Assembled transcripts | Total assembled transcripts | 165,449 |
| Assembled transcripts | Total assembled bases | 153,613,259 |
| Assembled unigenes | Total assembled unigenes | 136,958 |
| Assembled unigenes | Total assembled bases | 100,205,174 |
| Assembled unigenes | GC % | 40.7 |
| Assembled unigenes | N50 unigene length (bp) | 1,428 |
| Assembled unigenes | Mean unigene length (bp) | 731 |
| Detected ORFs | Total detected ORFs | 30,039 |
| Detected ORFs | N50 ORF length (aa) | 587 |
| Detected ORFs | Mean ORF length (aa) | 341 |
| Detected ORFs | Max ORF length (aa) | 7,907 |
| Detected ORFs | Min ORF length (aa) | 100 |
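For readers less familiar with the N50 statistic reported in this table, the following is a minimal sketch of how N50 and mean length are commonly computed from a list of sequence lengths. The function names and the toy lengths are illustrative assumptions and are not taken from the authors' pipeline; only the two totals passed to `mean_length` come from the table.

```python
def n50(lengths):
    """Return the N50 of a collection of sequence lengths.

    N50 is the length L such that sequences of length >= L together
    contain at least half of all assembled bases.
    """
    if not lengths:
        raise ValueError("lengths must not be empty")
    total = sum(lengths)
    running = 0
    # Walk from the longest sequence down until half of the bases are covered.
    for length in sorted(lengths, reverse=True):
        running += length
        if 2 * running >= total:
            return length


def mean_length(total_bases, n_sequences):
    """Mean sequence length as total bases divided by sequence count."""
    return total_bases / n_sequences


# Toy example (hypothetical lengths, not from the dataset).
toy_lengths = [2, 2, 2, 3, 3, 4, 8, 8]
toy_n50 = n50(toy_lengths)

# Re-derive the mean unigene length from the totals reported in the table.
reported_mean = mean_length(100_205_174, 136_958)
```

With the reported totals, the ratio is about 731.6 bp, close to the 731 bp listed for the mean unigene length.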
Statistics of BUSCO completeness of the assembled transcripts of the African bullfrog against the four gene sets.
| BUSCO statistic | Tetrapoda | Vertebrata | Metazoa | Eukaryote |
|---|---|---|---|---|
| Total BUSCO groups | 3950 | 2586 | 978 | 303 |
| Complete | 3410 | 2356 | 963 | 300 |
| Single | 2346 | 1701 | 724 | 226 |
| Duplicate | 1064 | 655 | 239 | 74 |
| Fragment | 262 | 140 | 10 | 3 |
| Missing | 278 | 90 | 5 | 0 |
| Completeness % | 86.3 | 91.1 | 98.5 | 99.0 |
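The completeness percentages in this table can be reproduced from the counts above them. The sketch below assumes that completeness is defined as complete BUSCOs (single plus duplicate) divided by the total number of BUSCO groups, which is consistent with the reported values; the dictionary layout and function names are illustrative only.

```python
# Counts copied from the BUSCO table; the structure itself is an illustration.
busco = {
    "Tetrapoda":  {"total": 3950, "single": 2346, "duplicate": 1064, "fragment": 262, "missing": 278},
    "Vertebrata": {"total": 2586, "single": 1701, "duplicate": 655,  "fragment": 140, "missing": 90},
    "Metazoa":    {"total": 978,  "single": 724,  "duplicate": 239,  "fragment": 10,  "missing": 5},
    "Eukaryote":  {"total": 303,  "single": 226,  "duplicate": 74,   "fragment": 3,   "missing": 0},
}


def completeness_percent(counts):
    """Complete BUSCOs (single + duplicate) as a percentage of all groups."""
    complete = counts["single"] + counts["duplicate"]
    return round(100 * complete / counts["total"], 1)


def is_consistent(counts):
    """Check that the four categories add up to the total number of groups."""
    parts = counts["single"] + counts["duplicate"] + counts["fragment"] + counts["missing"]
    return parts == counts["total"]


summary = {name: completeness_percent(c) for name, c in busco.items()}
all_consistent = all(is_consistent(c) for c in busco.values())
```

Under this assumption the four computed percentages match the table, and the category counts sum to the total for every gene set.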
Fig. 1. Similarity of the African bullfrog ORFs to proteins in the UniProt databases. A) BLASTp search of the African bullfrog transcripts against the UniProt database (all proteins or X. tropicalis proteins); the number of transcripts was counted according to E-value. B) Distribution of the identity of the African bullfrog ORFs with an E-value lower than 1E-05 against the UniProt database (all proteins or X. tropicalis proteins).
Fig. 2. Gene ontology terms assigned to the African bullfrog ORFs. ORFs with significant hits (E-value lower than 1E-05) in a BLASTx search against the UniProt X. tropicalis protein database were subjected to gene ontology analysis.
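Both figures rely on selecting hits with an E-value lower than 1E-05. As a rough illustration of that filtering step, the sketch below parses BLAST tabular output and keeps the lowest-E-value hit per query. It assumes the standard 12-column tabular format (`-outfmt 6`, with percent identity in column 3 and E-value in column 11) and a best-hit-per-query rule; the source does not state which output format or hit-selection rule was used, and the example rows are hypothetical.

```python
import io

EVALUE_CUTOFF = 1e-5  # threshold quoted in the figure captions


def best_hits(handle, cutoff=EVALUE_CUTOFF):
    """Return {query: (subject, percent_identity, evalue)} for hits below cutoff.

    Assumes tab-separated BLAST output in the standard 12-column layout:
    qseqid sseqid pident length mismatch gapopen qstart qend sstart send evalue bitscore
    """
    best = {}
    for line in handle:
        line = line.strip()
        if not line or line.startswith("#"):
            continue  # skip blank lines and comment lines
        fields = line.split("\t")
        query, subject = fields[0], fields[1]
        identity = float(fields[2])
        evalue = float(fields[10])
        if evalue >= cutoff:
            continue  # keep only hits with an E-value lower than the cutoff
        # Keep the hit with the smallest E-value for each query.
        if query not in best or evalue < best[query][2]:
            best[query] = (subject, identity, evalue)
    return best


# Hypothetical rows for illustration only (not from the dataset).
example = io.StringIO(
    "orf1\tP1\t85.0\t200\t30\t0\t1\t200\t1\t200\t1e-50\t300\n"
    "orf1\tP2\t60.0\t180\t72\t0\t1\t180\t1\t180\t1e-10\t90\n"
    "orf2\tP3\t40.0\t50\t30\t0\t1\t50\t1\t50\t0.01\t30\n"
    "orf3\tP4\t99.5\t400\t2\t0\t1\t400\t1\t400\t0.0\t800\n"
)
hits = best_hits(example)
identities = sorted(identity for _, identity, _ in hits.values())
```

The `identities` list is the kind of input from which an identity distribution like the one in Fig. 1B could be drawn.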
Specifications Table
| Subject | Biochemistry, Genetics and Molecular Biology (General) |
| Specific subject area | Transcriptomics |
| Type of data | Table, Figure |
| How data were acquired | Illumina HiSeq 2500 sequencing platform. The obtained data were subjected to |
| Data format | Illumina HiSeq 2500 Raw data in FASTQ format, |
| Parameters for data collection | Tissues of inner organs, intestines, muscles, and skin were collected from the young adult frog. |
| Description of data collection | Total RNA isolated from 11 tissues was mixed in equal amounts and sequenced on the Illumina HiSeq 2500 platform. |
| Data source location | DNA Data Bank of Japan (DDBJ), Shizuoka, Japan |
| Data accessibility | Data are with the article. The raw sequencing data have been deposited in the DDBJ Sequence Read Archive (SRA): DRR164258 ( |