| Literature DB >> 27920798 |
Brittney N Keel1, Amanda K Lindholm-Perry1, Warren M Snelling1.
Abstract
Genomic structural variations are an important source of genetic diversity. Copy number variations (CNVs), gains and losses of large regions of genomic sequence between individuals of a species, have been associated with a wide variety of phenotypic traits. However, in cattle, as well as many other species, relatively little is understood about CNV, including frequency of CNVs in the genome, sizes, and locations, chromosomal properties, and evolutionary processes acting to shape CNV. In this work, we focused on copy number variation in the bovine genome, with the aim to detect CNVs in Bos taurus coding sequence and explore potential evolutionary mechanisms shaping these CNV. We identified and characterized CNV regions by utilizing exome sequence from 175 influential sires used in the Germplasm Evaluation project, representing 10 breeds. We examined various evolutionary and functional aspects of these CNVs, including selective constraint on CNV-overlapped genes, centrality of CNV genes in protein-protein interaction networks, and tissue-specific expression of CNV genes. Patterns of CNV in the Bos taurus genome reveal that reduced functional constraint and mutational bias may play a prominent role in shaping this type of structural variation.Entities:
Keywords: cattle genome; copy number variation; deletion; duplication; gene expression; network centrality; next-generation sequencing; selective constraint
Year: 2016 PMID: 27920798 PMCID: PMC5118444 DOI: 10.3389/fgene.2016.00207
Source DB: PubMed Journal: Front Genet ISSN: 1664-8021 Impact factor: 4.599
Breeds of sequenced bulls from the USMARC germplasm evaluation (GPE) population used in this study.
| Hereford | 17 |
| Angus | 19 |
| Simmental | 16 |
| Limousin | 15 |
| Charolais | 17 |
| Gelbvieh (German Yellow) | 17 |
| Red Angus | 15 |
| Shorthorn | 3 |
| Braunvieh (Brown Swiss) | 1 |
| Brahman | 2 |
| Charolais × Angus | 2 |
| Gelbvieh × Hereford | 6 |
| Simmental × Hereford | 3 |
| Simmental × Angus | 4 |
| Hereford × Angus | 13 |
| Limousin × Hereford | 5 |
| Gelbvieh × Angus | 6 |
| Red Angus × Hereford | 7 |
| Charolais × Hereford | 3 |
| Limousin × Angus | 4 |
Gene expression data sets.
| GSE73699 | Mesenteric fat | Microarray | 15 |
| GSE73261 | Spleen* | Microarray | 16 |
| GSE73159 | Duodenum | Microarray | 16 |
| Jejunum | Microarray | 16 | |
| Ileum | Microarray | 16 | |
| GSE41637 | Brain | RNAseq | 3 |
| Colon | RNAseq | 3 | |
| Heart | RNAseq | 3 | |
| Kidney* | RNAseq | 3 | |
| Liver* | RNAseq | 3 | |
| Lung* | RNAseq | 3 | |
| Skeletal muscle | RNAseq | 3 | |
| Spleen* | RNAseq | 3 | |
| Testes | RNAseq | 2 | |
| GSE55435 | Hypothalamus* | RNAseq | 8 |
| Pituitary gland | RNAseq | 7 | |
| Uterus | RNAseq | 8 | |
| Endometrium | RNAseq | 6 | |
| Ovary | RNAseq | 8 | |
| Subcataneous fat | RNAseq | 8 | |
| Liver* | RNAseq | 8 | |
| Longissimus dorsi muscle | RNAseq | 8 | |
| GSE71153 | Rumen | RNAseq | 16 |
| ERP005899 | Adipose | RNAseq | 7~14 pooled |
| Duodenum* | RNAseq | 7~14 pooled | |
| Hypothalamus* | RNAseq | 7~14 pooled | |
| Kidney* | RNAseq | 7~14 pooled | |
| Lung* | RNAseq | 7~14 pooled |
Tissues marked with *were present in multiple studies.
Figure 1CNVRs in GPE bulls. Plot shows the CNVRs identified from the 175 sequenced GPE bull genomes in Circos format (Krzywinski et al., 2009). The outer ideogram runs clockwise from chromosome 1 to chromosome X with labels in Mb of physical distance. The copy number data is represented in the inner tracks. The two innermost tracks show scatter plots of the CNVRs, where the red track shows copy number loss and the green track shows copy number gain. The size of the dot in the scatter plot is proportional to the number of samples containing the CNVR. The other track shows a heat map which indicates the parts of the genome that contain copy number gain and loss. This plot simply collapses the scatter plot values onto a single radial position.
Significantly over- and underrepresented GO slim terms in the set of CNV genes.
| Extracellular transport | 53 | 6 | 0.24 | + | 4.16E-05 |
| Response to toxic substance | 45 | 5 | 0.20 | + | 5.08E-04 |
| Response to stimulus | 2880 | 43 | 12.89 | + | 9.08E-12 |
| Response to interferon-gamma | 58 | 4 | 0.26 | + | 3.50E-02 |
| Amino acid transport | 81 | 5 | 0.36 | + | 8.45E-03 |
| Macrophage activation | 131 | 6 | 0.59 | + | 7.18E-03 |
| Sensory perception of smell | 667 | 30 | 2.99 | + | 9.14E-20 |
| Sensory perception of chemical stimulus | 875 | 32 | 3.92 | + | 1.23E-18 |
| Sensory perception | 1108 | 32 | 4.96 | + | 1.19E-15 |
| Neurological system process | 1593 | 36 | 7.13 | + | 1.18E-14 |
| System process | 1809 | 36 | 8.10 | + | 6.23E-13 |
| Single-multicellular organism process | 2189 | 36 | 9.80 | + | 2.01E-10 |
| Multicellular organismal process | 2199 | 36 | 9.85 | + | 2.30E-10 |
| G-protein coupled receptor signaling pathway | 789 | 13 | 3.53 | + | 1.21E-02 |
| Regulation of biological process | 2260 | 34 | 10.12 | + | 1.36E-08 |
| Biological regulation | 2636 | 34 | 11.80 | + | 8.17E-07 |
| Metabolic process | 6613 | 14 | 29.61 | + | 3.88E-02 |
| n/a | |||||
| MHC protein complex | 19 | 3 | 0.09 | + | 5.59E-03 |
| Heterotrimeric G-protein complex | 38 | 4 | 0.17 | + | 1.72E-03 |
| Integral to membrane | 1478 | 37 | 6.62 | + | 3.11E-17 |
| Membrane | 2433 | 37 | 10.89 | + | 2.19E-10 |
| Plasma membrane | 1458 | 24 | 6.53 | + | 1.01E-06 |
| Cell part | 4063 | 6 | 18.19 | + | 1.97E-02 |
| Intracellular | 3993 | 6 | 17.88 | − | 2.58E-02 |
Number of genes in the background Bos taurus GO slim annotation set with given GO term. Total number of annotated genes is shown in parentheses.
Number of CNV genes with given GO term. Total number of CNV genes with annotations in the background Bos taurus GO slim annotation set is shown in parentheses.
dN/dS analysis.
| All CNV genes | 0.1418 | 2.29E-09 | 0.5589 | 1.51E-07 | 0.2813 | 2.81E-06 |
| Duplication genes | 0.1601 | 1.45E-05 | 0.5135 | 0.0072 | 0.3151 | 3.01E-05 |
| Deletion genes | 0.1308 | 0.0142 | 0.5814 | 0.0083 | 0.2308 | 0.1068 |
| Mixed genes | 0.1235 | 1.36E-04 | 0.5681 | 4.79E-05 | 0.2702 | 0.0068 |
| Neutral genes | 0.0793 | – | 0.4288 | – | 0.1843 | – |
Median nonsynonymous (dN), synonymous (dS), and dN/dS rates are shown. P-values compare copy number variable genes with copy number neutral genes using a one-tailed Wilcoxon rank-sum test.
Number of tissue-specific genes with copy number variation.
| Testes | 531 | 4 |
| Brain | 318 | 0 |
| Spleen | 81 | 1 |
| Duodenum | 6 | 0 |
| Colon | 40 | 1 |
| Liver | 69 | 0 |
| Lung | 45 | 0 |
| Kidney | 75 | 0 |
| Ovary | 0 | 0 |
| Endometrium | 0 | 0 |
| Uterus | 0 | 0 |
| Rumen | 117 | 0 |
| Mesenteric fat | 8 | 0 |
| Adipose | 0 | 0 |
| Hypothalamus | 0 | 0 |
| Heart | 14 | 0 |
| Skeletal muscle | 37 | 0 |
| Pituitary gland | 0 | 0 |
| Subcutaneous fat | 0 | 0 |
| Longissimus dorsi muscle | 0 | 0 |
| Jejunum | 6 | 0 |
| Ileum | 8 | 0 |