| Literature DB >> 35710683 |
Ruirui Hu1,2, Rui Yao2, Lei Li2, Yueren Xu2, Bingbing Lei2, Guohao Tang3, Haowei Liang3, Yunjiao Lei2, Cunyuan Li2,4, Xiaoyue Li2, Kaiping Liu2, Limin Wang1, Yunfeng Zhang1, Yue Wang2, Yuying Cui2, Jihong Dai2, Wei Ni2, Ping Zhou5, Baohua Yu6, Shengwei Hu7,8.
Abstract
With the rapid development of high-throughput sequencing technology, the amount of metagenomic data (including both 16S and whole-genome sequencing data) in public repositories is increasing exponentially. However, owing to the large and decentralized nature of the data, it is still difficult for users to mine, compare, and analyze the data. The animal metagenome database (AnimalMetagenome DB) integrates metagenomic sequencing data with host information, making it easier for users to find data of interest. The AnimalMetagenome DB is designed to contain all public metagenomic data from animals, and the data are divided into domestic and wild animal categories. Users can browse, search, and download animal metagenomic data of interest based on different attributes of the metadata such as animal species, sample site, study purpose, and DNA extraction method. The AnimalMetagenome DB version 1.0 includes metadata for 82,097 metagenomes from 4 domestic animals (pigs, bovines, horses, and sheep) and 540 wild animals. These metagenomes cover 15 years of experiments, 73 countries, 1,044 studies, 63,214 amplicon sequencing data, and 10,672 whole genome sequencing data. All data in the database are hosted and available in figshare https://doi.org/10.6084/m9.figshare.19728619 .Entities:
Mesh:
Year: 2022 PMID: 35710683 PMCID: PMC9203544 DOI: 10.1038/s41597-022-01444-w
Source DB: PubMed Journal: Sci Data ISSN: 2052-4463 Impact factor: 8.501
Fig. 1Overview of the AnimalMetagenome DB construction method. (a) Metadata collection for animal metagenomes. (b) Standardization of attributes. (c) Database platform construction. (d) The Animal Metagenome DB web implementation.
List of attributes present in the AnimalMetagenome DB.
| Attributes | Definition |
|---|---|
| Project ID | Project ID from the NCBI BioProject database. |
| Study accession | Study ID from the NCBI SRA database. |
| Experiment ID | Metagenomic library ID from the NCBI SRA database. |
| PubMed ID | Article’s pubmed ID, if available. |
| Project title | Title of the project. |
| Creation date | Date when the project was created. |
| Project description | Project’s abstract. |
| Sample ID | Sample ID from the NCBI BioSample database. |
| Sample site | Origin of the sample based on the host body site. |
| Sex | Physical sex of the host. |
| Age | Age of the host at the time of sampling. |
| Collection date | Date of sample collection. |
| Condition | The information about the host’s experimental treatments. |
| Pheotype | Phenotype of the host. |
| Breed | Breed of animal. |
| Instrument | Sequencing platform. |
| Library strategy | Strategies for building metagenomic libraries. |
| Total size | Total number of reads present in the library. |
| Total bases | Total number of base pairs present in the library. |
| Species | Animal’s species name. |
| Geographic location | Location (country) where the sample was collected. |
| Latitude | Geographic coordinate of latitude in decimal degrees where the sample was collected. |
| Longitude | Geographic coordinate of longitude in decimal degrees where the sample was collected. |
| Study purpose | To classify the study purpose of the project based on the Animal QTLdb database. The classification of the bioproject type was made as to facilitate project clustering. |
Fig. 2Selective statistics of the AnimalMetagenome DB content. (a) The distribution of sequencing platforms in the database. (b) The distribution of metagenome samples from different host sources. (c) The distribution of sample sites (top 5). (d) The distribution of wild animals at the class level. (e) The distribution of metagenome samples collected in different countries (top 10). (f) The distribution of different project research types.
Fig. 3The AnimalMetagenome DB user interface. (a) The “Browse” page allows users to browse data. (b) The “Search” page allows users to select samples according to nine attributes. (c) The “Map” page allows users to select samples according to their geographical location on the world map.
| Measurement(s) | Metagenome metadata |
| Technology Type(s) | Collection and integration the metagenomic information of multiple animal species |
| Factor Type(s) | animal |
| Sample Characteristic - Organism | animal |
| Sample Characteristic - Environment | metagenome |
| Sample Characteristic - Location | United States of America • People’s Republic of China • Canada |