| Literature DB >> 23865711 |
Julia Metzger1, Ute Philipp, Maria Susana Lopes, Artur da Camara Machado, Michela Felicetti, Maurizio Silvestrelli, Ottmar Distl.
Abstract
BACKGROUND: Copy number variants (CNVs) have been shown to play an important role in genetic diversity of mammals and in the development of many complex phenotypic traits. The aim of this study was to perform a standard comparative evaluation of CNVs in horses using three different CNV detection programs and to identify genomic regions associated with body size in horses.Entities:
Mesh:
Year: 2013 PMID: 23865711 PMCID: PMC3720552 DOI: 10.1186/1471-2164-14-487
Source DB: PubMed Journal: BMC Genomics ISSN: 1471-2164 Impact factor: 3.969
Figure 1Chromosomal distribution of CNVs detected by different detection algorithms. (A) Detection results of CNVPartition. (B) Detection results of PennCNV. (C) Detection results of QuantiSNP.
Chromosomal enrichment of CNVs detected by three different algorithms
| 1 | 2.1 | 4.3 | 18.5 |
| 2 | 0.6 | 7.6 | 31.0 |
| 3 | 0.5 | 3.6 | 29.1 |
| 4 | 0.8 | 4.7 | 20.7 |
| 5 | 0.5 | 4.2 | 18.6 |
| 6 | 1.8 | 4.2 | 22.0 |
| 7 | 0.9 | 6.6 | 19.4 |
| 8 | 0.5 | 3.9 | 15.1 |
| 9 | 1.2 | 4.7 | 15.1 |
| 10 | 1.3 | 7.1 | 26.3 |
| 11 | 2.5 | 4.0 | 25.9 |
| 12 | 8.6 | 15.7 | 54.7 |
| 13 | 1.4 | 91.8 | 36.3 |
| 14 | 0.3 | 3.6 | 19.9 |
| 15 | 0.9 | 4.7 | 10.2 |
| 16 | 0 | 2.1 | 10.2 |
| 17 | 1.7 | 7.7 | 21.3 |
| 18 | 1.5 | 5.3 | 14.1 |
| 19 | 0 | 3.6 | 3.1 |
| 20 | 2.3 | 2.7 | 5.2 |
| 21 | 0 | 4.4 | 19.1 |
| 22 | 0 | 3.0 | 28.6 |
| 23 | 26.4 | 2.9 | 15.0 |
| 24 | 1.0 | 2.5 | 4.9 |
| 25 | 3.6 | 6.2 | 22.5 |
| 26 | 0 | 3.0 | 11.0 |
| 27 | 0 | 79.1 | 90.9 |
| 28 | 0.4 | 70.7 | 65.4 |
| 29 | 1.4 | 1.5 | 8.6 |
| 30 | 3.6 | 4.3 | 36.8 |
| 31 | 0 | 4.1 | 3.9 |
| Total coverage | 1.7 | 9.0 | 22.0 |
The proportion of the chromosome covered by detected CNVs is shown.
Figure 2Overlapping CNVs from the three CNV-detection programs used in analysis. 50 CNVs could be detected by all three algorithms.
CNV regions from 50 CNVs of three detection algorithms in comparison with recent studies
| 1 | 155487276-155656642 | Loss, gain | 162, 13 | 155487276-155656642 | Loss | 78 | 155360077–155640397 | Loss, gain | 3 |
| | | | | | | | 155905880–155936772 | Loss | 1 |
| | | | | | | | 155936817–155937275 | Gain | 2 |
| | | | | | | | 155972933–155973160 | Gain | 1 |
| | | | | | | | 156313490–156314132 | Loss | 1 |
| | | | | | | | 156657191–156690829 | Loss | 1 |
| | | | | | | | 156690978–156714131 | Gain | 6 |
| 1 | 178798269- 179550475 | Loss, gain | 10, 5 | 178798269-178815370 | Loss | 1 | ─ | | |
| | | | 178798269-179550475 | Gain | 3 | | | | |
| 2 | 106062109-106063373 | Gain | 20 | 106062109-106064917 | Loss | 7 | ─ | | |
| 3 | 65705932- 65951800 | Loss, gain | 27, 29 | 65705932-66065643 | Gain | 50 | ─ | | |
| 4 | 52424614-52612016 | Gain | 20 | 52194368-52612016 | Gain | 1 | ─ | | |
| 5 | 37840041-37916448 | Loss | 16 | 37840041-37916448 | Loss | 5 | 37878731-37954286 | Loss, gain | 9, 8 |
| 5 | 88243192-88258862 | Gain | 57 | 88243192-88349479 | Gain | 27 | ─ | | |
| 6 | 26086675-26126581 | Gain | 42 | 26102028-26126581 | Gain | 1 | ─ | | |
| 6 | 72032729-72607543 | Loss, gain | 136, 11 | 71956823-72607543 | Loss | 71 | 71977669-73390784 | Loss, gain | 5, 1 |
| 7 | 31406445-31520977 | Loss | 17 | 31406445-31529855 | Loss | 10 | ─ | | |
| 7 | 52610482-52677786 | Gain | 37 | 52610482-52677786 | Gain | 39 | 52617898-52737112 | Loss, gain | 5 ,4 |
| 7 | 73083306-73197149 | Loss, gain | 14, 3 | 73083306-73197149 | Loss | 3 | 73056824-73699458 | Loss, gain | 14, 14 |
| 8 | 4280605-4621044 | Loss, gain | 37, 40 | 4183178-4430473 | Loss | 2 | 4381193-4648028 | Gain | 2 |
| 9 | 31574454-31574969 | Loss | 4 | ─ | ─ | ─ | ─ | | |
| 10 | 674485-1271225 | Loss, gain | 1, 6 | 674485-1141923 | Gain | 10 | ─ | | |
| 11 | 54645681-54812394 | Loss | 47 | 54640169-54812394 | Loss | 1 | ─ | | |
| 12 | 12524489-14777981 | Loss, gain | 108, 116 | 12524489-13488187 | Loss | 32 | 12314908-15866355 | Loss, gain | 12, 13 |
| | | | 13945011-14777981 | Loss | 114 | | | | |
| 18 | 11660478-12399073 | Gain | 2 | ─ | ─ | ─ | ─ | | |
| 20 | 32059082-32210308 | Loss, gain | 24, 3 | 32059082-32250493 | Gain | 9 | 32031829-32033708 | Loss | 1 |
| 24 | 32416012-32628728 | Loss, gain | 1, 11 | 32416012-32628728 | Gain | 20 | ─ | | |
| 25 | 26318531-27125754 | Loss, gain | 137, 4 | 26318531-26942120 | Loss | 99 | 26274719-26980393 | Loss, gain | 2, 2 |
Overlapping CNV regions from each study are shown in base pairs. Copy number (CN) and number of samples are displayed for all studies.
Comparison between 754 CNV regions detected by CGH analyses and the SNP array detection results of CNVPartition, PennCNV and QuantiSNP
| 1 | 67 | 11 | 11 | 24 |
| 2 | 31 | 3 | 3 | 35 |
| 3 | 35 | 0 | 3 | 29 |
| 4 | 27 | 7 | 11 | 15 |
| 5 | 44 | 0 | 0 | 11 |
| 6 | 29 | 7 | 4 | 14 |
| 7 | 50 | 2 | 4 | 12 |
| 8 | 24 | 4 | 4 | 38 |
| 9 | 17 | 0 | 6 | 29 |
| 10 | 31 | 0 | 0 | 48 |
| 11 | 39 | 3 | 5 | 28 |
| 12 | 15 | 7 | 13 | 47 |
| 13 | 22 | 0 | 55 | 55 |
| 14 | 32 | 0 | 3 | 34 |
| 15 | 33 | 0 | 0 | 3 |
| 16 | 23 | 0 | 0 | 13 |
| 17 | 22 | 0 | 5 | 14 |
| 18 | 27 | 0 | 0 | 19 |
| 19 | 14 | 0 | 0 | 0 |
| 20 | 47 | 4 | 9 | 17 |
| 21 | 18 | 0 | 0 | 33 |
| 22 | 21 | 0 | 0 | 29 |
| 23 | 17 | 6 | 6 | 24 |
| 24 | 10 | 0 | 0 | 0 |
| 25 | 14 | 29 | 29 | 43 |
| 26 | 4 | 0 | 0 | 0 |
| 27 | 7 | 0 | 57 | 86 |
| 28 | 5 | 0 | 100 | 80 |
| 29 | 6 | 17 | 0 | 17 |
| 30 | 14 | 0 | 0 | 50 |
| 31 | 9 | 0 | 0 | 0 |
| total | 754 | 3 | 7 | 25 |
All CNV regions that show an overlap of at least 50% with the CNV region detected by CGH are displayed for each chromosome. The highest overlap can be seen with QuantiSNP.
Distribution of detected CNVs among different breeds
| Anglo-Arabian | 165 | 1 | 1 | 0 | 0 | 0 | 0 |
| Arabian | 148 | 87 | 713 | 16 | 15 | 1 | 1 |
| Brandenburger | 168 | 1 | 1 | 0 | 0 | 0 | 0 |
| Hanoverian | 168 | 458 | 2508 | 442 | 262 | 11 | 10 |
| Holsteiner | 168 | 6 | 51 | 2 | 2 | 2 | 2 |
| Lusitano | 160 | 47 | 263 | 40 | 28 | 0 | 0 |
| Maremanno | 160 | 44 | 177 | 17 | 13 | 1 | 1 |
| Oldenburg | 168 | 11 | 54 | 4 | 4 | 0 | 0 |
| Rhenish-German Cold-Blood | 165 | 2 | 9 | 0 | 0 | 0 | 0 |
| Rhinelander horse | 168 | 11 | 49 | 0 | 0 | 0 | 0 |
| German Riding Pony | 148 | 1 | 7 | 0 | 0 | 0 | 0 |
| Selle Francais | 165 | 1 | 2 | 0 | 0 | 0 | 0 |
| Thoroughbred | 163 | 13 | 47 | 2 | 2 | 1 | 1 |
| Trakehner | 168 | 3 | 13 | 0 | 0 | 0 | 0 |
| Westphalian | 168 | 29 | 112 | 13 | 12 | 2 | 2 |
| Zweibrücker | 168 | 1 | 6 | 0 | 0 | 0 | 0 |
| Total | ─ | 717 | 4013 | 536 | 338 | 21 | 17 |
The total number of CNVs, the number of CNVs derived from the 50 CNVs of all three detection programs (intersection) and the number of animals which showed these CNVs in at least one algorithm is displayed. The last two columns show the more stringent category with the number of CNVs that were detected in the same horse in all three algorithms and with the number of horses under these conditions.
Number of CNVs derived from the 50 calls of all three detection algorithms that overlap among different breeds
| | | | | | | | | | | | | | |
| 8 | | | | | | | | | | | | | |
| 7 | 4 | | | | | | | | | | | | |
| 8 | 1 | 1 | | | | | | | | | | | |
| 1 | 1 | 0 | 0 | | | | | | | | | | |
| 1 | 1 | 1 | 1 | 0 | | | | | | | | | |
| 3 | 1 | 1 | 1 | 0 | 1 | | | | | | | | |
| 4 | 0 | 2 | 0 | 0 | 0 | 1 | | | | | | | |
| 12 | 6 | 4 | 2 | 1 | 1 | 1 | 0 | | | | | | |
| 10 | 5 | 3 | 2 | 1 | 1 | 2 | 1 | 6 | | | | | |
| 10 | 4 | 3 | 3 | 1 | 1 | 1 | 1 | 5 | 8 | | | | |
| 3 | 2 | 2 | 2 | 0 | 1 | 1 | 0 | 1 | 2 | 2 | | | |
| 4 | 3 | 4 | 1 | 0 | 1 | 1 | 1 | 3 | 1 | 2 | 1 |
On the whole 13 breeds shared their CNV with another breed (Han Hanoverian, Lus Lusitano, Mar Maremanno, Old Oldenburg, RDK Rhenish-German Cold-Blood, Wes Westphalian, Hol Holsteiner, TB-H Thoroughbred, AV Arabian, RHD Rhinelander horse, Bran Brandenburger, Trak Trakehner, RPon German Riding Pony).
Gene ontology analysis of significantly over- and underrepresented genes
| Response to stress | 586 | 17 | 3.60 | 1.34E-07 | 2.28E-05 |
| G-protein coupled receptor protein signaling pathway | 1042 | 22 | 6.41 | 3.99E-07 | 6.82E-05 |
| Muscle contraction | 613 | 17 | 3.77 | 2.50E-07 | 4.28E-05 |
| Sensory perception | 864 | 13 | 5.31 | 2.65E-03 | 4.53E-01 |
| Phagocytosis | 101 | 6 | 0.62 | 4.25E-05 | 7.28E-03 |
| Macrophage activation | 352 | 8 | 2.16 | 1.58E-03 | 2.71E-01 |
| Response to stimulus | 1921 | 22 | 11.81 | 3.19E-03 | 5.46E-01 |
| Cyclic nucleotide metabolic process | 109 | 6 | 0.67 | 6.46E-05 | 1.10E-02 |
| Nerve-nerve synaptic transmission | 122 | 7 | 0.75 | 1.20E-05 | 2.06E-03 |
| Synaptic transmission | 756 | 12 | 4.65 | 2.51E-03 | 4.30E-01 |
| Cell cycle | 1867 | 3 | 11.48 | 2.47E-03 | 4.22E-01 |
| Transcription from RNA polymerase II promoter | 2343 | 2 | 14.41 | 3.30E-05 | 5.64E-03 |
| Regulation of transcription from RNA polymerase II promoter | 1780 | 1 | 10.95 | 1.37E-04 | 2.33E-02 |
| Transcription | 2353 | 2 | 14.47 | 3.11E-05 | 5.31E-03 |
| Primary metabolic process | 8037 | 16 | 49.43 | 3.67E-11 | 6.28E-09 |
| Nucleobase, nucleoside, nucleotide and nucleic acid metabolic process | 3987 | 8 | 24.52 | 2.75E-05 | 4.70E-03 |
| Cellular component organization | 1496 | 2 | 9.20 | 4.22E-03 | 7.21E-01 |
| Mesoderm development | 1621 | 2 | 9.97 | 2.14E-03 | 3.67E-01 |
| Proteolysis | 1230 | 1 | 7.56 | 3.69E-03 | 6.30E-01 |
| Metabolic process | 8351 | 17 | 51.36 | 1.78E-11 | 3.04E-09 |
| Protein metabolic process | 3295 | 3 | 20.26 | 6.38E-07 | 1.09E-04 |
The software PANTHER was used for the evaluation of 45 CNV regions detected by all three algorithms. In five CNV regions we could not detect any genes for evaluation.
Genome-wide associated CNVs for body size in horses
| 1 | 156,012,982 | 156,870,455 | 0.006 (0.001) | | | | 1 | 111 |
| 1 | 156,657882 | 156,870,455 | | 0.010 (0.003) | | | 1 | 80 |
| 1 | 156,657,882 | 156,818,876 | | | | 0.057 (0.018) | 1 | 132 |
| 8 | 4,430,473 | 4,430,473 | 0.001 (0.001) | | | 0.081 (0.023) | 1 | 12/61 |
| 8 | 4,579,478 | 4,621,044 | | 0.0002 (0.0002) | | | 1 | 33 |
| 9 | 29,889,627 | 29,892,897 | 0.006 (0.001) | 0, 1 | 37 |
The chromosome-wide (EMP1; in brackets) and genome-wide corrected significant P-values (EMP2) of the associated CNV regions detected by PLINK for the different algorithms and the number of animals showing a CNV in this region are given. For the intersection of three detection programs the P-value at the threshold of significance is displayed. CNVPartition did not show any significant association.
Figure 3Verification of detected CNVs by qPCR. The horizontal line represents the relative quantification level 1. Two copies are shown by squares, one copy by triangles, zero copies by circles and three copies by rhombus. (A) The CNV region on ECA1 at 155.63 Mb was validated in nineteen horses (solid symbols). One horse showed a homozygous deletion instead of two copies (unfilled symbol). (B) The CNV region on ECA1 at 156.69 Mb was validated in all twenty horses (solid symbols). (C) The CNV region on ECA8 at 4.50 Mb was validated in sixteen horses (solid symbols). Four horses showed different copies as detected by SNP chip analysis (unfilled symbols).