| Literature DB >> 32424068 |
Lingzhao Fang1,2,3,4, Wentao Cai2,5, Shuli Liu1,5, Oriol Canela-Xandri3,4, Yahui Gao1,2, Jicai Jiang2, Konrad Rawlik3, Bingjie Li1, Steven G Schroeder1, Benjamin D Rosen1, Cong-Jun Li1, Tad S Sonstegard6, Leeson J Alexander7, Curtis P Van Tassell1, Paul M VanRaden1, John B Cole1, Ying Yu5, Shengli Zhang5, Albert Tenesa3,4, Li Ma2, George E Liu1.
Abstract
By uniformly analyzing 723 RNA-seq data from 91 tissues and cell types, we built a comprehensive gene atlas and studied tissue specificity of genes in cattle. We demonstrated that tissue-specific genes significantly reflected the tissue-relevant biology, showing distinct promoter methylation and evolution patterns (e.g., brain-specific genes evolve slowest, whereas testis-specific genes evolve fastest). Through integrative analyses of those tissue-specific genes with large-scale genome-wide association studies, we detected relevant tissues/cell types and candidate genes for 45 economically important traits in cattle, including blood/immune system (e.g., CCDC88C) for male fertility, brain (e.g., TRIM46 and RAB6A) for milk production, and multiple growth-related tissues (e.g., FGF6 and CCND2) for body conformation. We validated these findings by using epigenomic data across major somatic tissues and sperm. Collectively, our findings provided novel insights into the genetic and biological mechanisms underlying complex traits in cattle, and our transcriptome atlas can serve as a primary source for biological interpretation, functional validation, studies of adaptive evolution, and genomic improvement in livestock.Entities:
Mesh:
Year: 2020 PMID: 32424068 PMCID: PMC7263193 DOI: 10.1101/gr.250704.119
Source DB: PubMed Journal: Genome Res ISSN: 1088-9051 Impact factor: 9.043
Figure 1.General characteristics of the cattle gene atlas. (A) Clustering analysis of all 723 RNA-seq samples using t-SNE (t-Distributed Stochastic Neighbor Embedding) procedure. (CNS) Central neural system. (B) Examples of tissue-specific genes in brain (GRM5), liver (SLC22A9), white blood cells (FCRL3), uterus (TDGF1), and testes (TRIM9). The y-axis is the raw gene expression, that is, fragments per kilobase per million mapped reads (FPKM). (C) Gene Ontology enrichment analysis of tissue-specific genes (the top 5% of genes based on t-statistics). The value in each bar is the fold of enrichment. (D) The enrichment analysis of cattle tissue-specific genes with human tissue-specific genes. The P-value is obtained using a hypergeometric test.
Figure 2.Comparison of dN/dS ratios of tissue-specific genes across all the 91 tissues and cell types between human and cattle. The red line represents the averaged dN/dS ratio of all orthologous genes between human and cattle. For each tissue, we compared tissue-specific genes of this tissue against the remaining genes using a two-tailed t-test; (*) P < 0.01.
Figure 3.The relationships between 45 complex traits and 91 tissues and cell types. The color corresponds to enrichment degrees (i.e., −log10P) that are computed using a sum-based GWAS signal enrichment analysis based on the top 5% tissue-specific genes and a 50-kb extension. (*) Corrected-P (FDR) < 0.1.
Figure 4.Validation of trait-tissue associations using DNA methylation data across seven tissues. Each dot represents a trait. The y-axis is for GWAS signal enrichments (−log10P) obtained using tissue-specific DNA methylated regions, whereas the x-axis is for GWAS signal enrichments obtained using tissue-specific expressed genes. The r is for Pearson's correlation.
Figure 5.Relationships between milk production traits and brain regions. (A) Milk production traits have a significantly higher GWAS signal enrichments (−log10P) than other types of traits in 14 brain regions (CNS), except for feed efficiency (i.e., residual feed intake [RFI]). We calculate P-values between groups using Student's t-test. (B) Two fine-mapped genes, TRIM46 (top; posterior probability of causality [PPC] = 0.59) and RAB6A (bottom; PPC = 0.79), for protein percentage and milk yield, respectively, are specifically highly expressed in CNS compared with all other tissues and cell types. (C) The associations of milk production traits with brain regions and four brain endocrine tissues (i.e., stalk median eminence [SME], anterior pituitary, posterior pituitary, and pineal gland) based on the GWAS signal enrichments of tissue-specific genes detected within these brain-relevant tissues. (*) Corrected-P (FDR) < 0.1.
Figure 6.Associations of male reproduction and health traits with blood and immune tissues and cell types. (A) Reproduction traits have a significantly higher GWAS signal enrichment (−log10P) than other types of traits in blood/immune tissues. We calculate P-values between groups using Student's t-test. (B) Enrichments of tissues and cell types with sire conception rate (SCR) and ketosis (KETO) disease. (C) Expression patterns of two fine-mapped genes across all 91 tissues and cell types, C6 (top; PPC = 1 for somatic cell score [SCS]) and CCDC88C (bottom; PPC = 1 for day of first birth [DFB]). (D) Associations of male reproduction and health traits with blood/immune tissues and cell types and four intestinal parts based on the GWAS signal enrichments of tissue-specific genes detected within these immune-relevant tissues and cell types. (*) Corrected-P (FDR) < 0.05.