| Literature DB >> 34718719 |
Jinding Liu1,2,3, Fei Yin2, Kun Lang1,2, Wencai Jie4, Suxu Tan3, Rongjing Duan5, Shuiqing Huang1,2, Wen Huang3.
Abstract
RNA-seq has been widely used in experimental studies and produced a massive amount of data deposited in public databases. New biological insights can be obtained by retrospective analyses of previously published data. However, the barrier to efficiently utilize these data remains high, especially for those who lack bioinformatics skills and computational resources. We present MetazExp (https://bioinfo.njau.edu.cn/metazExp), a database for gene expression and alternative splicing profiles based on 53 615 uniformly processed publicly available RNA-seq samples from 72 metazoan species. The gene expression and alternative splicing profiles can be conveniently queried by gene IDs, symbols, functional terms and sequence similarity. Users can flexibly customize experimental groups to perform differential and specific expression and alternative splicing analyses. A suite of data visualization tools and comprehensive links with external databases allow users to efficiently explore the results and gain insights. In conclusion, MetazExp is a valuable resource for the research community to efficiently utilize the vast public RNA-seq datasets.Entities:
Mesh:
Year: 2022 PMID: 34718719 PMCID: PMC8728262 DOI: 10.1093/nar/gkab933
Source DB: PubMed Journal: Nucleic Acids Res ISSN: 0305-1048 Impact factor: 16.971
Figure 1.Overview of MetazExp. (A) Data processing outline and structure of the database. (B) Searching MetazExp and comparative analysis using MetazExp generate a variety of output and visualization to allow better utilization of the database.
Summary of metazoan genomes and RNA-seq experiments collected in MetazExp.
| Class | Species | Annotation database | Volume (GB) | Study | Experiments | Run |
|---|---|---|---|---|---|---|
| Brachiopoda |
| Ensembl | 58.93 | 1 | 16 | 16 |
| Chelicerata |
| Ensembl | 484.58 | 25 | 129 | 189 |
|
| Ensembl | 529.37 | 17 | 119 | 132 | |
| Cnidaria |
| Ensembl | 1373.85 | 32 | 978 | 1463 |
| Coleoptera |
| Ensembl | 316.34 | 10 | 54 | 55 |
|
| Ensembl | 363.05 | 6 | 94 | 94 | |
|
| Ensembl | 2217.59 | 46 | 902 | 935 | |
| Crustacea |
| Ensembl | 2941.55 | 27 | 1024 | 1025 |
|
| Ensembl | 1064.81 | 18 | 237 | 239 | |
| Ctenophora |
| Ensembl | 796.45 | 10 | 172 | 185 |
| Diptera |
| Ensembl | 2911.93 | 44 | 474 | 509 |
|
| Ensembl | 7121.16 | 109 | 1907 | 1975 | |
|
| Ensembl | 489.54 | 5 | 121 | 121 | |
|
| Ensembl | 156.24 | 7 | 23 | 23 | |
|
| Ensembl | 179.12 | 5 | 24 | 25 | |
|
| Ensembl | 3616.64 | 99 | 691 | 855 | |
|
| Ensembl | 169.08 | 4 | 44 | 45 | |
|
| Ensembl | 36.76 | 2 | 11 | 11 | |
|
| Ensembl | 51.70 | 3 | 10 | 10 | |
|
| Ensembl | 594.51 | 23 | 139 | 152 | |
|
| Ensembl | 287.13 | 13 | 50 | 50 | |
|
| Ensembl | 174.36 | 3 | 38 | 38 | |
|
| Ensembl | 319.16 | 12 | 215 | 330 | |
|
| Ensembl | 262.72 | 1 | 199 | 303 | |
|
| Ensembl | 70067.54 | 1158 | 25 672 | 27 751 | |
|
| Ensembl | 509.99 | 12 | 169 | 234 | |
|
| Ensembl | 2043.96 | 26 | 496 | 554 | |
|
| Ensembl | 282.70 | 17 | 107 | 109 | |
|
| Ensembl | 1292.55 | 47 | 443 | 455 | |
|
| Ensembl | 359.79 | 22 | 180 | 238 | |
|
| Ensembl | 600.69 | 26 | 168 | 227 | |
|
| Ensembl | 63.64 | 1 | 4 | 4 | |
|
| Ensembl | 75.25 | 1 | 8 | 8 | |
|
| Ensembl | 53.89 | 1 | 6 | 6 | |
|
| Ensembl | 530.46 | 20 | 136 | 136 | |
|
| Ensembl | 143.84 | 2 | 22 | 22 | |
|
| Ensembl | 176.74 | 2 | 22 | 22 | |
|
| Ensembl | 253.58 | 2 | 22 | 22 | |
|
| Ensembl | 161.39 | 5 | 61 | 70 | |
|
| Ensembl | 233.13 | 4 | 31 | 31 | |
|
| Ensembl | 719.12 | 17 | 142 | 142 | |
|
| Ensembl | 234.55 | 3 | 154 | 154 | |
|
| Ensembl | 135.60 | 1 | 7 | 7 | |
|
| Ensembl | 67.67 | 2 | 18 | 18 | |
| Echinodermata |
| Ensembl | 1044.15 | 16 | 294 | 295 |
| Hemiptera |
| Ensembl | 2068.17 | 38 | 442 | 442 |
|
| Refseq | 828.28 | 22 | 196 | 202 | |
|
| Ensembl | 269.11 | 8 | 35 | 35 | |
|
| Ensembl | 140.50 | 12 | 31 | 31 | |
|
| Ensembl | 289.87 | 8 | 40 | 40 | |
| Hymenoptera |
| Ensembl | 11022.15 | 142 | 2345 | 2457 |
|
| Ensembl | 263.80 | 11 | 86 | 128 | |
|
| Ensembl | 873.55 | 17 | 210 | 230 | |
|
| Ensembl | 991.26 | 19 | 321 | 321 | |
| Isoptera |
| Ensembl | 392.02 | 6 | 71 | 73 |
| Lepidoptera |
| Ensembl | 6725.34 | 121 | 959 | 981 |
|
| Ensembl | 936.32 | 12 | 155 | 156 | |
|
| Refseq | 712.46 | 16 | 114 | 119 | |
|
| Ensembl | 654.89 | 4 | 432 | 643 | |
|
| Refseq | 716.55 | 18 | 88 | 89 | |
| Mollusca |
| Ensembl | 2696.20 | 45 | 828 | 901 |
|
| Ensembl | 476.87 | 9 | 127 | 127 | |
|
| Ensembl | 323.12 | 3 | 117 | 208 | |
| Myriapoda |
| Ensembl | 220.68 | 4 | 13 | 13 |
|
| Ensembl | 1601.12 | 13 | 249 | 250 | |
|
| Ensembl | 34256.93 | 588 | 9225 | 10 084 | |
|
| Ensembl | 104.13 | 2 | 21 | 21 | |
|
| Ensembl | 475.57 | 12 | 204 | 205 | |
|
| Ensembl | 112.70 | 4 | 22 | 22 | |
| Platyhelminthes |
| Ensembl | 2653.00 | 31 | 1143 | 1144 |
| Porifera |
| Ensembl | 178.09 | 6 | 298 | 341 |
| Rotifera |
| Ensembl | 74.16 | 2 | 10 | 10 |
|
|
|
|
|
|
Figure 2.Accessing MetazExp. MetazExp can be accessed through two primary querying methods. (A) On the search page, the search box contains multiple text search options to look for specific genes, genes in a protein family, genes within a gene ontology term or a KEGG pathway. (B) An example is shown for the pathway search result, where the circadian rhythm KEGG pathway in Drosophila is displayed. Red boxes indicate genes that are present in the database. (C) Alternatively, MetazExp can be accessed by blast searching a nucleotide or protein sequence. (D) Search result is displayed in an interactive table with links to external databases and to MetazExp to retrieve expression information.
Figure 3.Visualizing data in the MetazExp. (A) Genome browser showing alternative splicing events and transcript models in the gene per in Drosophila. (B) Interactive bar chart showing gene expression differences across different genotypes and conditions. The bar chart can be clicked to expand to visualize expression in replicates (experiments).
Figure 4.Case study using the MetazExp. A tissue specific expression and alternative splicing analysis was performed with RNA-seq data in the SRA using MetazExp. (A) PCA plot of overall gene expression for 15 samples covering five tissues in silkworms. (B) Hierarchical clustering based on overall gene expression. (C Same data but plotted for PCA of alternative splicing. (D) Hierarchical clustering based on alternative splicing (PSI).