| Literature DB >> 30380119 |
Meng-Wei Shi1, Na-An Zhang1, Chuan-Ping Shi1, Chun-Jie Liu2, Zhi-Hui Luo1, Dan-Yang Wang1, An-Yuan Guo2, Zhen-Xia Chen1.
Abstract
Many animal species present sex differences. Sex-associated genes (SAGs), which have female-biased or male-biased expression, have major influences on the remarkable sex differences in important traits such as growth, reproduction, disease resistance and behaviors. However, the SAGs resulting in the vast majority of phenotypic sex differences are still unknown. To provide a useful resource for the functional study of SAGs, we manually curated public RNA-seq datasets with paired female and male biological replicates from the same condition and systematically re-analyzed the datasets using standardized methods. We identified 27,793 female-biased SAGs and 64,043 male-biased SAGs from 2,828 samples of 21 species, including human, chimpanzee, macaque, mouse, rat, cow, horse, chicken, zebrafish, seven fly species and five worm species. All these data were cataloged into SAGD, a user-friendly database of SAGs (http://bioinfo.life.hust.edu.cn/SAGD) where users can browse SAGs by gene, species, drug and dataset. In SAGD, the expression, annotation, targeting drugs, homologs, ontology and related RNA-seq datasets of SAGs are provided to help researchers to explore their functions and potential applications in agriculture and human health.Entities:
Year: 2019 PMID: 30380119 PMCID: PMC6323940 DOI: 10.1093/nar/gky1040
Source DB: PubMed Journal: Nucleic Acids Res ISSN: 0305-1048 Impact factor: 16.971
Figure 1.Overall design of SAGD. SAGD curated metadata information from Expression Atlas, SRA and GEO, and selected groups of RNA-seq datasets with female and male biological replicates with the same project/species/tissue/stage combination. All RNA-seq raw data was processed using a standard pipeline. SAGD includes ‘Browse’, ‘Search’, ‘Download’ and ‘Submission’.
Statistics of RNA-seq datasets and SAGs in each species
| SAG_F4 | SAG_M5 | ||||||||
|---|---|---|---|---|---|---|---|---|---|
| Project | Sample | Tissue | Stage | Group | #2 | %3 | # | % | |
|
| 1 | 39 | 4 | 1 | 4 | 45 | 0.2 | 43 | 0.2 |
|
| 1 | 6 | 1 | 1 | 1 | 3,583 | 10.8 | 5,465 | 16.4 |
|
| 2 | 28 | 2 | 2 | 3 | 1,249 | 2.7 | 3,408 | 7.3 |
|
| 1 | 6 | 1 | 1 | 1 | 1,855 | 5.7 | 3,384 | 10.4 |
|
| 1 | 6 | 1 | 1 | 1 | 2,799 | 8.5 | 4,485 | 13.6 |
|
| 1 | 4 | 1 | 1 | 1 | 3 | 0.0 | 14 | 0.0 |
|
| 1 | 4 | 1 | 1 | 1 | 295 | 1.9 | 2,406 | 15.2 |
|
| 3 | 160 | 9 | 1 | 9 | 2,906 | 16.4 | 7,425 | 41.9 |
|
| 1 | 4 | 1 | 1 | 1 | 1,001 | 6.8 | 2,433 | 16.6 |
|
| 2 | 12 | 3 | 1 | 3 | 2,991 | 17.6 | 5,866 | 34.6 |
|
| 2 | 11 | 2 | 1 | 2 | 930 | 6.0 | 2,800 | 18.2 |
|
| 1 | 4 | 1 | 1 | 1 | 417 | 2.8 | 2,370 | 15.7 |
|
| 1 | 4 | 1 | 1 | 1 | 595 | 3.7 | 2,526 | 15.5 |
|
| 1 | 24 | 2 | 1 | 2 | 10 | 0.0 | 20 | 0.1 |
|
| 1 | 87 | 9 | 1 | 9 | 2,349 | 9.4 | 469 | 1.9 |
|
| 16 | 1,800 | 60 | 4 | 66 | 4,871 | 8.3 | 17,223 | 29.5 |
|
| 1 | 12 | 1 | 1 | 1 | 26 | 0.1 | 131 | 0.4 |
|
| 7 | 276 | 20 | 3 | 20 | 606 | 1.1 | 1,092 | 2.0 |
|
| 1 | 12 | 1 | 1 | 1 | 68 | 0.2 | 125 | 0.4 |
|
| 1 | 6 | 1 | 1 | 1 | 973 | 3.3 | 1,898 | 6.4 |
|
| 3 | 323 | 12 | 2 | 21 | 221 | 0.7 | 460 | 1.4 |
|
| 38 | 2,828 | 93 | 6 | 150 | 27,793 | 4.6 | 64,043 | 10.6 |
Notes: 1Duplicates were removed before summing up.
2Number of the SAGs.
3Percent of the genes in the genome.
4Female-biased genes.
5Male-biased genes.
Figure 2.An overview of SAGD. (A) The homepage of SAGD. (B) Browse by gene. (C) Browse by species. The species tree was plotted by TimeTree (www.timetree.org) (29) with modifications. (D) Browse by drug. (E) Browse by dataset. (F) Information of each SAG.