| Literature DB >> 35840572 |
Gulab Khedkar1, Chiaki Kambayashi2, Hiromasa Tabata2, Ikuyo Takemura2, Ryuhei Minei2, Atsushi Ogura2, Atsushi Kurabayashi3,4.
Abstract
Blindsnakes of infraoder Scolecophidia (order Squamata) are the most basal group of extant snakes, comprising of more than 450 species with ecological and morphological features highly specialized to underground living. The Brahminy blindsnake, Indotyphlops braminus, is the only known obligate parthenogenetic species of snakes. Although the origin of I. braminus is thought to be South Asia, this snake has attracted worldwide attention as an alien species, as it has been introduced to all continents except Antarctica. In this study, we present the first draft genome assembly and annotation of I. braminus. We generated approximately 480 Gbp of sequencing data and produced a draft genome with a total length of 1.86 Gbp and N50 scaffold size of 1.25 Mbp containing 89.3% of orthologs conserved in Sauropsida. We also identified 0.98 Gbp (52.82%) of repetitive genome sequences and a total of 23,560 protein-coding genes. The first draft genome of I. braminus will facilitate further study of snake evolution as well as help to understand the emergence mechanism of parthenogenetic vertebrates.Entities:
Mesh:
Year: 2022 PMID: 35840572 PMCID: PMC9287396 DOI: 10.1038/s41597-022-01530-z
Source DB: PubMed Journal: Sci Data ISSN: 2052-4463 Impact factor: 8.501
Fig. 1Live specimen of Indotyphlops braminus.
Statistics of the sequencing data of Indotyphlops braminus.
| platform | Average length (bp) | Raw bases (Gbp) | Raw reads | SRA accession |
|---|---|---|---|---|
Illumina Hiseq | 126 | 131.294 | 1,042,013,868 | DRR374855[ |
| 151 | 137.379 | 909,796,440 | DRR374853[ | |
| 150.5 | 153.493 | 1,020,049,008 | DRR374854[ | |
| Total | — | 422.166 | 2,971,859,316 | — |
Oxford Nanopore MinION | 4,810.6 | 14.518 | 3,017,890 | DRR374856[ |
| 6,524.3 | 18.457 | 2,829,032 | DRR374857[ | |
| 6,218.9 | 15.241 | 2,450,721 | DRR374858[ | |
| 7,048.1 | 9.732 | 1,380,757 | DRR374859[ | |
| Total | — | 57.948 | 9,678,400 | — |
Fig. 2The k-mer distribution (k = 21) of Indotyphlops braminus. The 21-mer distribution was calculated by GenomeScope based on 422 Gbp Illumina short-reads data. K-mer coverages (x axis) were plotted against the value of coverage multiplying frequency (y axis).
Statistics of the genome assembly.
| Scaffolds | 4,851 |
|---|---|
| Maximum length (bp) | 7,047,253 |
| Total length (bp) | 1,856,433,866 |
| N50 (bp) | 1,247,154 |
| GC% | 41.96 |
| BUSCO complete (%) | 89.3 |
| BUSCO single-copy (%) | 87.4 |
| BUSCO duplicated (%) | 1.9 |
Statistics of repeat elements in the genome of Indotyphlops braminus.
| Repeat elements | Copies | Length (bp) | Percent (%) |
|---|---|---|---|
| SINE | 114704 | 14502484 | 0.78 |
| LINE | 1120299 | 371407334 | 20.01 |
| LTR elements | 81181 | 80025476 | 4.31 |
| DNA elements | 165204 | 33202527 | 1.79 |
| Unclassified | 2257178 | 461138007 | 24.84 |
| Small RNA | 10022 | 791165 | 0.04 |
| Satellites | 245 | 33662 | 0 |
| Simple repeats | 314143 | 15855113 | 0.85 |
| Low complexity | 45366 | 3575448 | 0.19 |
| Total | 4108342 | 980531216 | 52.82 |
Statistics of the gene model of Indotyphlops braminus.
| Number of protein-coding genes | 23,560 |
| Average CDS length (bp) | 20,067.5 |
| Average exon number per gene | 7.7 |
| Average exon length (bp) | 188.2 |
| Average intron length (bp) | 2,788.1 |
| BUSCO complete (%) | 72.9 |
| BUSCO single-copy (%) | 71.4 |
| BUSCO duplicated (%) | 1.5 |
Fig. 3Krona chart representing taxonomic composition of Indotyphlops braminus gene model. Taxonomy charts, which consist of all taxa (left) and Sauropsida (right), are shown.
| Measurement(s) | Whole genome sequence |
| Technology Type(s) | Next generation DNA sequencing |
| Factor Type(s) | Whole genome of Indotyphlops braminus |
| Sample Characteristic - Organism | Indotyphlops braminus |
| Sample Characteristic - Environment | Soil and rotten wood |
| Sample Characteristic - Location | Japan and India |