| Literature DB >> 33051458 |
Lu Ma1,2, Haijian Sun1, Xiuguang Mao3,4.
Abstract
Echolocating bats are fascinating for their ability to 'see' the world in the darkness. Ultrahigh frequency hearing is essential for echolocation. In this study we collected cochlear tissues from constant-frequency (CF) bats (two subspecies of Rhinolophus affinis, Rhinolophidae) and frequency-modulated (FM) bats (Myotis ricketti, Vespertilionidae) and applied PacBio single-molecule real-time isoform sequencing (Iso-seq) technology to generate the full-length (FL) transcriptomes for the three taxa. In total of 10103, 9676 and 10504 non-redundant FL transcripts for R. a. hainanus, R. a. himalayanus and Myotis ricketti were obtained respectively. These data present a comprehensive list of transcripts involved in ultrahigh frequency hearing of echolocating bats including 26342 FL transcripts, 24833 of which are annotated by public databases. No further comparative analyses were performed on the current data in this study. This data can be reused to quantify gene or transcript expression, assess the level of alternative splicing, identify novel transcripts and improve genome annotation of bat species.Entities:
Mesh:
Year: 2020 PMID: 33051458 PMCID: PMC7554033 DOI: 10.1038/s41597-020-00686-w
Source DB: PubMed Journal: Sci Data ISSN: 2052-4463 Impact factor: 6.444
Detailed information about Iso-seq libraries.
| Sample | Tissue | OD260/280 | OD260/230 | 28 S/18 S | Completeness (RIN) | SRA IDs | TSA IDs |
|---|---|---|---|---|---|---|---|
| FL-CF-Rhai | cochleae | 2.13 | 1.8 | 1 | 7 | SRR12062845 | GIRV00000000 |
| FL-CF-Rhim | cochleae | 2.13 | 1.81 | 0.9 | 7 | SRR12062844 | GIRW00000000 |
| FL-FM-Myo | cochleae | 2.16 | 1.86 | 1.3 | 7.6 | SRR12062843 | GIRX00000000 |
Statistics of the four FL transcriptomes generated in this study.
| Sample | FL-CF-Rhai | FL-CF-Rhim | FL-FM-Myo | FL-CF-FM |
|---|---|---|---|---|
| Subreads number | 3444947 | 3255638 | 3403451 | |
| Total base (bp) | 6448987299 | 6504282447 | 7190237257 | |
| Mean length (bp) | 1872 | 1998 | 2113 | |
| CCS number | 137159 | 137160 | 152251 | |
| Mean CCS read length (bp) | 2443 | 2628 | 2732 | |
| Number of Passes (mean) | 22 | 20 | 19 | |
| Reads with 5 and 3 Primers (in percent) | 112912 (82.32%) | 107080 (78.07%) | 123700 (81.25%) | |
| Non-Concatamer reads with 5 and 3 Primers | 111976 (81.64%) | 105919 (77.22%) | 122411 (80.4%) | |
| FLNC (Non-Concatamer Reads with 5 and 3 Primers and Poly-A Tail) | 111806 (81.52%) | 105713 (77.07%) | 122222 (80.28%) | |
| Number of transcripts | 10384 | 9984 | 10932 | 31300 |
| Number of non-redundant transcripts | 10103 | 9676 | 10504 | 26342 |
| Total base (bp) | 22746072 | 22932622 | 26578852 | 63358581 |
| Mean length (bp) | 2251 | 2370 | 2530 | 2405 |
Annotation statistics for each of the four FL transcriptomes.
| Database | FL-CF-Rhai | FL-CF-Rhim | FL-FM-Myo | FL-CF-FM |
|---|---|---|---|---|
| Nr | 9555 (94.58%) | 9069 (93.73%) | 10067 (95.84%) | 24793 (94.12%) |
| UniProt | 9324 (92.29%) | 8825 (91.21%) | 9894 (94.19%) | 24198 (91.86%) |
| At least one annotation | 9564 (94.66%) | 9079 (93.83%) | 10090 (96.06%) | 24833 (94.27%) |
Fig. 1Overview of the sequencing data collection (a) and analysis pipeline (b).
Completeness of each of the four FL transcriptomes assessed by benchmarking universal single-copy ortholog (BUSCO) analysis.
| FL-CF-Rhai | FL-CF-Rhim | FL-FM-Myo | FL-CF-FM | |
|---|---|---|---|---|
| Complete BUSCOs (C) | 1458 (35.5%) | 1363 (33.2%) | 1526 (37.2%) | 2122 (51.7%) |
| Complete and single-copy BUSCOs (S) | 1082 (26.4%) | 997 (24.3%) | 1053 (25.7%) | 917 (22.3%) |
| Complete and duplicated BUSCOs (D) | 376 (9.2%) | 366 (8.9%) | 473 (11.5%) | 1205 (29.4%) |
| Fragmented BUSCOs (F) | 179 (4.4%) | 202 (4.9%) | 194 (4.7%) | 232 (5.7%) |
| Missing BUSCOs (M) | 2467 (60.1%) | 2539 (61.9%) | 2384 (58.1%) | 1750 (42.6%) |
| Total BUSCO groups searched | 4104 (100.0%) | 4104 (100.0%) | 4104 (100.0%) | 4104 (100.0%) |
| Measurement(s) | cochlea • transcriptome • sequence feature annotation |
| Technology Type(s) | isoform sequencing • sequence annotation |
| Factor Type(s) | species |
| Sample Characteristic - Organism | Rhinolophus affinis • Myotis ricketti |