| Literature DB >> 27277319 |
Gemma M Jenkins1, Michael E Goddard2, Michael A Black3, Rudiger Brauning4, Benoit Auvray3, Ken G Dodds4, James W Kijas5, Noelle Cockett6, John C McEwan4.
Abstract
BACKGROUND: Copy number variants (CNVs) are a type of polymorphism found to underlie phenotypic variation, both in humans and livestock. Most surveys of CNV in livestock have been conducted in the cattle genome, and often utilise only a single approach for the detection of copy number differences. Here we performed a study of CNV in sheep, using multiple methods to identify and characterise copy number changes. Comprehensive information from small pedigrees (trios) was collected using multiple platforms (array CGH, SNP chip and whole genome sequence data), with these data then analysed via multiple approaches to identify and verify CNVs.Entities:
Keywords: Array CGH; Copy number variants; SNP; Sequence; Sheep
Mesh:
Year: 2016 PMID: 27277319 PMCID: PMC4898393 DOI: 10.1186/s12864-016-2754-7
Source DB: PubMed Journal: BMC Genomics ISSN: 1471-2164 Impact factor: 3.969
Characteristics of CNVs predicted true by the model (n = 9,789) and filtered to remove artefacts
| Variable | Mean | Median | Std dev | Min | Max |
|---|---|---|---|---|---|
| absl2ra | 0.54 | 0.43 | 0.36 | 0.25 | 3.47 |
| length (bp) | 30,332.02 | 8,706 | 107,369.37 | 1,003 | 2,522,449 |
| Datapointsb | 14.99 | 9 | 23.91 | 3 | 446 |
aabsl2r is the absolute log2 ratio of the CNV. bnumber of CGH array probes in the CNV
Fig. 1CNVR frequency across animals
Fig. 2Number of CNVRs by chromosome length. Labels correspond to chromosome number
Fig. 3Cumulative density plot of the distances separating CNVRs. The red line reflects the observed pairwise distances between CNVRs, while the blue line reflects the simulated (expected if CNVRs are uniformly distributed in the genome) distances separating CNVRs
Cross platform verification results. Number of CNV calls that were verified and not verified
| Verification platform | ||||
|---|---|---|---|---|
| 385K CGH array | Illumina OvineSNP50 BeadChip - cnvPartition | Illumina OvineSNP50 BeadChip - DNAcopy | Sequence analysis ~ 10X coverage | |
| Verified | 17 (1.34 %) | 3 (0.04 %) | 101 (1.36 %) | 714 (61.34 %) |
| Not verified | 1,251 | 7,413 | 7,315 | 450 |
| Total | 1,268 | 7,416 | 7,416 | 1,164 |
Comparison between CNVRs observed in this study and CNVRs observed in the literature
| Number of studies CNVR observed in | Number of CNVR | Number of these CNVR identified in this study (%) |
|---|---|---|
| 1 | 1,802 | 246 (13.7) |
| 2 | 255 | 82 (32.2) |
| 3 | 66 | 24 (36.4) |
| 4 | 20 | 16 (80.0) |
| 5 | 7 | 6 (85.7) |
| 6 | 4 | 4 (100) |
Fig. 4Selection of CGH array probes to cover OvineSNP50 BeadChip SNP positions. Two methods were used to select probe sets to cover SNPs. The first method (a) involved designing at least one probe to cover the SNP position, with two probes in flanking regions. The second method (b) involved designing a probe within the 300 bp region surrounding the SNP and two probes in flanking regions
Fig. 5Pedigree of International Mapping Flock (IMF) animals assayed on the Roche NimbleGen 2.1M CGH array. Some animals (green) appear in more than one pedigree. Segment calls from animals IMF66, IMF91, IMF95, IMF108 and IMF112 (red) were removed from the analysis due to failed 2.1M CGH arrays
Description of the pseudo-maximum likelihood derived mixture model for estimating copy number in sequence data
| Copy number | Distribution | Mixture weights | Mean | Variance |
|---|---|---|---|---|
| 0 | Half normal, centered on zero |
|
|
|
| 1 | Normal |
|
|
|
| 2 | Normal | 1 − |
|
|
| 3 | Normal |
|
|
|
| 4 | Normal |
|
|
|
Starting values of parameters estimated by pseudo-maximum likelihood
| Variable | Starting value | Lower bound | Upper bound |
|---|---|---|---|
|
|
| -∞ | ∞ |
|
|
| 0 | ∞ |
|
| 1.5 | 0.01 | ∞ |
|
| 0.01 | 0 | 0.05 |
|
| 0.025 | 0 | 0.2 |
|
| 0.001 | 0 | 0.2 |
|
| 0.001 | 0 | 0.05 |