| Literature DB >> 34792158 |
Yang Mei1, Dong Jing1, Shenyang Tang1, Xi Chen1, Hao Chen1, Haonan Duanmu1, Yuyang Cong1, Mengyao Chen1, Xinhai Ye1, Hang Zhou1, Kang He1, Fei Li1.
Abstract
Insects are the largest group of animals on the planet and have a huge impact on human life by providing resources, transmitting diseases, and damaging agricultural crop production. Recently, a large amount of insect genome and gene data has been generated. A comprehensive database is highly desirable for managing, sharing, and mining these resources. Here, we present an updated database, InsectBase 2.0 (http://v2.insect-genome.com/), covering 815 insect genomes, 25 805 transcriptomes and >16 million genes, including 15 045 111 coding sequences, 3 436 022 3'UTRs, 4 345 664 5'UTRs, 112 162 miRNAs and 1 293 430 lncRNAs. In addition, we used an in-house standard pipeline to annotate 1 434 653 genes belonging to 164 gene families; 215 986 potential horizontally transferred genes; and 419 KEGG pathways. Web services such as BLAST, JBrowse2 and Synteny Viewer are provided for searching and visualization. InsectBase 2.0 serves as a valuable platform for entomologists and researchers in the related communities of animal evolution and invertebrate comparative genomics.Entities:
Mesh:
Substances:
Year: 2022 PMID: 34792158 PMCID: PMC8728184 DOI: 10.1093/nar/gkab1090
Source DB: PubMed Journal: Nucleic Acids Res ISSN: 0305-1048 Impact factor: 16.971
Data summary of InsectBase 1.0 and 2.0
| Feature | Units | v1.0 | v2.0 | Fold Increase |
|---|---|---|---|---|
| Genomes | Species | 138 | 815 | 5.9 |
| Transcriptomes | Runs | 116 | 25 805 | 222.4 |
| Coding sequences | Transcripts | 160 905 | 15 045 111 | 93.5 |
| UTRs | - | 678 881 | 7 781 686 | 11.4 |
| miRNAs | - | 7544 | 112 162 | 14.9 |
| lncRNAs | - | 2439 | 1 293 430 | 530.3 |
| Pathways | - | 78 | 419 | 5.4 |
| Gene families | - | 54 | 164 | 3.0 |
| HGT genes | - | - | 215 986 | New |
| Insect viruses | - | - | 1524 | New |
| miRNA–mRNA interactions | - | - | 197 533 | New |
| lncRNA–mRNA interactions | - | - | 5 147 543 | New |
Figure 1.Main modules of InsectBase 2.0. It provides information about an organism, genome, transcriptome, chromosome, gene information about protein coding genes, miRNA and lncRNA, gene family, HGT genes, insect pathways, insect viruses, online tools, links and additional services.
Figure 2.Enhanced user interface features of InsectBase 2.0. (A) Species: basic information, file downloading and publications related to each species. (B) Chromosome: information for each chromosome. (C) Protein coding gene: detailed information about each protein coding gene. (D) JBrowse2: genome browser of each annotated genome. (E) Genome synteny: visualization of synteny of 155 chromosome-level genomes.