| Literature DB >> 27381368 |
Yang Zhou1,2, Yuri T Utsunomiya3, Lingyang Xu1,4, El Hamidi Abdel Hay1, Derek M Bickhart1, Tad S Sonstegard1, Curtis P Van Tassell1, Jose Fernando Garcia3,5, George E Liu1.
Abstract
We compared CNV region (CNVR) results derived from 1,682 Nellore cattle with equivalent results derived from our previous analysis of Bovine HapMap samples. By comparing CNV segment frequencies between different genders and groups, we identified 9 frequent, false positive CNVRs with a total length of 0.8 Mbp that were likely caused by assembly errors. Although there was a paucity of lineage specific events, we did find one 54 kb deletion on chr5 significantly enriched in Nellore cattle. A few highly frequent CNVRs present in both datasets were detected within genomic regions containing olfactory receptor, ATP-binding cassette, and major histocompatibility complex genes. We further evaluated their impacts on downstream bioinformatics and CNV association analyses. Our results revealed pitfalls caused by false positive and lineage-differential copy number variations and will increase the accuracy of future CNV studies in both taurine and indicine cattle.Entities:
Mesh:
Year: 2016 PMID: 27381368 PMCID: PMC4933914 DOI: 10.1038/srep29219
Source DB: PubMed Journal: Sci Rep ISSN: 2045-2322 Impact factor: 4.379
Figure 1Comparisons of CNVR coverage between Nellore and HapMap cattle.
Y-axis: the length percentage of CNVR coverage; X-axis: autosomes from chr1 to chr29 and A: average of all autosomes. (a) The coverage of CNVRs of different sizes on autosomes in Nellore cattle; CNVRs were assigned as large (length ≥ 100 kb); medium (10 kb ≤ length < 100 kb) or small (length ≤ 10 kb). (b) The coverage of CNVRs of different sizes on autosomes in HapMap cattle; (c) The counts of CNVs per Mbp in Nellore and HapMap cattle; (d) The coverage of CNVRs derived from CNV segments with a high frequency (>0.1) in HapMap cattle.
Figure 2False positive CNVRs caused by cattle genome assembly errors.
(a) F between male and female of Nellore cattle at CNV segment level; (b) F between male and female of HapMap cattle at CNV segment level; (c) BAF and logR ratio distribution of SNP probes near FP_CNVR7. Five upstream and downstream probes were included to represent the basal level; (d) BAF and logR ratio distribution of SNP probes within FP_CNVR2; (e) Male-specific PCR validation results. Although there were nonspecific bands in some samples, specific amplifications of target bands (labeled) were only present in males.
False positive CNVRs caused by genome assembly errors.
| False positive CNVRs | UMD3.1Chr | Start | End | Btau_4.6.1Chr | Overlapped gene | Overlapped with QTLs |
|---|---|---|---|---|---|---|
| FP_CNVR1 | chr1 | 83,218,713 | 83,238,102 | chrY | Body weight (yearling) | |
| FP_CNVR2 | chr2 | 55,587,169 | 55,598,367 | chrUn | ||
| FP_CNVR3 | chr4 | 45,054,839 | 45,072,215 | chrX | Stillbirth (direct), Calving ease (maternal) | |
| FP_CNVR4 | chr4 | 75,028,395 | 75,104,623 | chrUn | ||
| FP_CNVR5 | chr5 | 22,514,133 | 22,563,988 | chrY | Intramuscular fat, Shear force, Longissimus muscle area, Fat thickness at the 12th rib, Kidney fat weight, Carcass weight, Body weight (slaughter), Chest width, Length of productive life, Milk fat percentage, Milk fat yield (daughter deviation), Body condition score, Somatic cell score (DYD), Conformation score, Rump angle | |
| FP_CNVR6 | chr17 | 15,448,739 | 15,477,125 | chrX | Marbling score | |
| FP_CNVR7 | chr17 | 25,056,695 | 25,119,996 | chrY | ||
| FP_CNVR8 | chr17 | 50,746,686 | 50,962,760 | chrY | Trans-16-C18:1 fatty acid content, Clinical mastitis (DYD), Shear force, Somatic cell score (DYD) | |
| FP_CNVR9 | chr17 | 51,115,979 | 51,433,906 | chrY | Trans-16-C18:1 fatty acid content, Clinical mastitis (DYD), Shear force, Metabolic body weight, Dry matter intake, Abomasum displacement, Body weight (weaning) |
Figure 3Comparisons of CNV segments between Nellore and taurine cattle.
(a) F between Nellore and taurine cattle at CNV segment level. The red dots represent the false positive CNV segments. (b) BAF and logR ratio distribution of SNP probes in Diff_CNVR7. Five upstream and downstream probes were included to represent the basal level.
Figure 4Effects of false positive CNVs on association study for PWG.
Y-axis: Percentage of individuals with loss type; X-axis: Significantly associated CNV segments (p < 0.01). The first 14 CNV segments were located in false positive CNVRs.