| Literature DB >> 26531252 |
Magretha Diane Wang1,2, Kennedy Dzama3, Charles A Hefer4, Farai C Muchadeyi5.
Abstract
BACKGROUND: Copy number variations (CNVs) are modifications in DNA structure comprising of deletions, duplications, insertions and complex multi-site variants. Although CNVs are proven to be involved in a variety of phenotypic discrepancies, the full extent and consequence of CNVs is yet to be understood. To date, no such genomic characterization has been performed in indigenous South African Nguni cattle. Nguni cattle are recognized for their ability to sustain harsh environmental conditions while exhibiting enhanced resistance to disease and parasites and are thought to comprise of up to nine different ecotypes.Entities:
Mesh:
Year: 2015 PMID: 26531252 PMCID: PMC4632335 DOI: 10.1186/s12864-015-2122-z
Source DB: PubMed Journal: BMC Genomics ISSN: 1471-2164 Impact factor: 3.969
Fig. 1Cross-validation plot demonstrating the effect of different K-values on the cross-validation error
Fig. 2Boxplot demonstrating the population structure of the Nguni cattle for k = 2 to k =10
Fig. 3A linear graph demonstrating the bayesian information criterion against the number of clusters
Fig. 4A DPCA plot demonstrating the group clustering with the subfigure 1 and 2 exhibiting discriminant eigenvalues and PCA eigenvalues
Haplotype block chromosomal distribution and characteristics
| CHR | CHRLN | SNP | HPB | MinL | MaxL | AvL | HPBLn | PCN |
|---|---|---|---|---|---|---|---|---|
| 1 | 161428367 | 2637 | 43 | 2809 | 199730 | 10561715 | 4330346 | 2.68 |
| 2 | 141965563 | 1691 | 33 | 1641 | 190889 | 77656.88 | 2485053 | 1.75 |
| 3 | 126844711 | 1716 | 38 | 108 | 192734 | 84265.08 | 3117846 | 2.46 |
| 4 | 123809850 | 1358 | 22 | 3406 | 198460 | 99861.67 | 2097117 | 1.69 |
| 5 | 125249322 | 1349 | 21 | 148 | 191441 | 80559.43 | 1691769 | 1.35 |
| 6 | 122519025 | 1438 | 21 | 2660 | 197610 | 82146.24 | 1725092 | 1.41 |
| 7 | 113029157 | 1328 | 31 | 1969 | 199428 | 88447.73 | 2653463 | 2.35 |
| 8 | 116846264 | 1306 | 30 | 84 | 197993 | 95895.72 | 2781006 | 2.38 |
| 9 | 108503706 | 1253 | 23 | 449 | 173897 | 61200.77 | 1346440 | 1.24 |
| 10 | 105982576 | 1059 | 22 | 10923 | 194738 | 123553.18 | 2718192 | 2.56 |
| 11 | 109987751 | 1071 | 15 | 9816 | 191201 | 96847.67 | 1452730 | 1.32 |
| 12 | 85119472 | 2140 | 25 | 620 | 194123 | 75741.70 | 1817826 | 2.14 |
| 13 | 84213851 | 1220 | 16 | 382 | 196561 | 57314.19 | 917043 | 1.09 |
| 14 | 81216349 | 1121 | 25 | 108 | 160297 | 48024.25 | 1152607 | 1.42 |
| 15 | 84472747 | 1015 | 10 | 6788 | 176328 | 73205.13 | 585651 | 0.69 |
| 16 | 77710258 | 852 | 22 | 178 | 188386 | 74494.59 | 1638903 | 2.11 |
| 17 | 76280064 | 987 | 12 | 2032 | 194454 | 65094.33 | 781144 | 1.02 |
| 18 | 65811054 | 769 | 13 | 5603 | 193175 | 63212.5 | 758563 | 1.15 |
| 19 | 64845320 | 864 | 14 | 1014 | 197085 | 82993.29 | 1161920 | 1.79 |
| 20 | 75686341 | 756 | 12 | 1527 | 195895 | 72733.91 | 800085 | 1.06 |
| 21 | 69078422 | 755 | 10 | 12797 | 173099 | 86163.67 | 775483 | 1.12 |
| 22 | 61598339 | 819 | 9 | 9271 | 176207 | 49994 | 399961 | 0.65 |
| 23 | 52334015 | 1980 | 11 | 2414 | 142956 | 48732.73 | 536071 | 1.02 |
| 24 | 64508398 | 1950 | 12 | 95 | 195261 | 65562.08 | 786757 | 1.22 |
| 25 | 44081797 | 1650 | 11 | 1343 | 180067 | 56038.46 | 616434 | 1.40 |
| 26 | 51826547 | 2017 | 11 | 281 | 187010 | 85746.2 | 857473 | 1.65 |
| 27 | 48460478 | 1784 | 8 | 151 | 191285 | 70888.75 | 567118 | 1.17 |
| 28 | 45964680 | 1890 | 10 | 675 | 140455 | 41588.63 | 332719 | 0.72 |
| 29 | 51812796 | 1538 | 11 | 3683 | 172838 | 64737.73 | 712126 | 1.37 |
| Tot | 40313 | 541 | 84 | 199730 | 79686.58 | 41596938 |
Chromosome number (CHR), chromosome length (CHRLN), number of SNPs (SNP) and HPBs (HPB), minimum length (MinL), maximum length (MaxL), average length (AvL) and total length (HPBLN) of HPBs and percentage of chromosome covered by HPBs (PCN)
Summary statistics of four CNV detection filtering models
| MDL | GCWF | DLRS | GCMDL | ANMLs | QCPS | ANMLsCNVs | CNVRs | AvL |
|---|---|---|---|---|---|---|---|---|
| 1 | 0.040 | 0.300 | Yes | 492 | 379 | 281 | 353 | 259283.62 |
| 2 | 0.040 | 0.300 | No | 492 | 326 | 231 | 334 | 270939.14 |
| 3 | 0.070 | 0.318 | Yes | 492 | 453 | 361 | 501 | 237869.23 |
| 4 | 0.070 | 0.318 | No | 492 | 462 | 352 | 486 | 240572.18 |
The stringencys (GCWF and DLRS), the number of animals (ANMLs), the number of animals that passed (QCPS), the number of animals with CNVs present in their genome (ANMLsCNVs), the number of CNVRs and the average length (AvL) of the CNVs identified within Nguni cattle
Chromosomal distribution of CNVs identified in 492 Nguni cattle
| CHR | CNVRs | CNVLN | PERCN | MinL | MaxL | AvL |
|---|---|---|---|---|---|---|
| 1 | 34 | 4533994 | 2.81 | 36419 | 680994 | 133352.76 |
| 2 | 16 | 1884357 | 1.33 | 44214 | 260334 | 117772.31 |
| 3 | 19 | 4020748 | 3.17 | 53857 | 949810 | 211618.32 |
| 4 | 23 | 3218422 | 2.60 | 48441 | 397435 | 139931.39 |
| 5 | 15 | 1655058 | 1.32 | 47847 | 257875 | 110337.20 |
| 6 | 25 | 4303075 | 3.51 | 31128 | 953806 | 172123.00 |
| 7 | 11 | 1792440 | 1.59 | 52476 | 306135 | 162949.09 |
| 8 | 6 | 794463 | 0.68 | 76217 | 237689 | 132410.50 |
| 9 | 11 | 1230570 | 1.13 | 30336 | 289059 | 111870.00 |
| 10 | 7 | 822052 | 0.78 | 44415 | 184185 | 117436.00 |
| 11 | 13 | 1265163 | 1.15 | 52654 | 199903 | 97320.23 |
| 12 | 19 | 2775332 | 3.26 | 48596 | 392714 | 146070.11 |
| 13 | 6 | 1295356 | 1.54 | 86589 | 522669 | 215892.67 |
| 14 | 12 | 2133059 | 2.63 | 48512 | 741197 | 177754.92 |
| 15 | 11 | 1539814 | 1.82 | 51632 | 390973 | 139983.09 |
| 16 | 10 | 1379434 | 1.78 | 40032 | 242142 | 137943.40 |
| 17 | 10 | 2570441 | 3.37 | 74327 | 1285287 | 257044.10 |
| 18 | 3 | 298969 | 0.45 | 63682 | 161641 | 99656.33 |
| 19 | 3 | 415596 | 0.64 | 106928 | 182010 | 138532.00 |
| 20 | 11 | 1615406 | 2.13 | 49902 | 378113 | 146855.09 |
| 21 | 9 | 964270 | 1.40 | 42434 | 156070 | 107141.11 |
| 22 | 6 | 1942282 | 3.15 | 73778 | 1171794 | 323713.67 |
| 23 | 4 | 506937 | 0.97 | 42345 | 211284 | 126734.25 |
| 24 | 12 | 1744861 | 2.70 | 38738 | 343135 | 145405.08 |
| 25 | 4 | 1369746 | 3.11 | 66262 | 1041448 | 342436.50 |
| 26 | 11 | 1958085 | 3.78 | 73168 | 518655 | 178007.73 |
| 27 | 7 | 784830 | 1.62 | 50958 | 261955 | 112118.57 |
| 28 | 7 | 1354237 | 2.95 | 117087 | 414660 | 193462.43 |
| 29 | 9 | 1179028 | 2.28 | 54840 | 367944 | 131003.11 |
| Total | 334 | 51348025 | 30336 | 1285287 | 153736.60 |
CNVR count (CNVRs), total length (CNV,LN), percentage of chromosome length (PERCN) and minimum (MinL) maximum (MaxL) and average (AvL) lengths of CNVRs identified on each of the 29 Btau chromosomes of 492 Nguni cattle
Fig. 5A venn diagram showing the overlap of CNVs identified by each of the four models in 492 Nguni cattle
Summary statistics of CNV deletions and duplications
| CNa | ANMLs | CNVs | MinL | MaxL | AvL |
|---|---|---|---|---|---|
| 0 | 16 | 7 | 44415 | 76444 | 53931.94 |
| 1 | 406 | 308 | 36419 | 1053438 | 143300.88 |
| 3 | 179 | 142 | 30336 | 953806 | 164468.69 |
| 4 | 1 | 1 | 102466 | 102466 | 102466 |
The number (CNVs), minimum length (MinL), maximum length (MaxL) and average length (AvL) of CNVs
adouble deletion (CN = 0), single deletion (CN = 1), single duplication (CN = 3) and double duplication (CN = 4)
Copy numbers and gene names of CNVRs present in 5 or more individuals
| CNVR | CNa | Total | GEN | |||
|---|---|---|---|---|---|---|
| 0 | 1 | 3 | 4 | |||
| chr17:73713062-74998349 | 8 | 13 | 21 | CHCHD10 IGLL1 LOC527441 SLC5A1 VPREB3 ZNF280A ZNF280B ZNF70 DERL3 GSTT1 GSTT3 GSTT4 MIF SLC2A11 SMARCB1 DDT GGT1 GGT5 SUSD2 C17H22orf13 LOC531152 MIR2323 RTDR1 SNRPD3 SPECC1L UPB1 | ||
| chr1:104798012-105264358 | 16 | 1 | 17 | SI | ||
| chr24:28154039-28497174 | 13 | 3 | 16 | CDH2 | ||
| chr7:75305297-75370366 | 1 | 8 | 5 | 14 | GABRG2 | |
| chr5:3260057-3434356 | 7 | 4 | 2 | 13 | ATXN7L3B | |
| chr6:43037439-43089739 | 12 | 1 | 13 | GBA3 | ||
| chr19:49657396-49784054 | 10 | 2 | 12 | LYAR NSG1 OTOP1 STX18 TMEM128 WDR1 ZBTB49 | ||
| chr6:108998175-109951981 | 5 | 7 | 12 | NOT_FOUND | ||
| chr9:3651455-4439872 | 10 | 1 | 1 | 12 | PECAM1 POLG2 | |
| chr1:32509969-32781614 | 1 | 7 | 3 | 11 | ||
| chr6:71910076-72118486 | 11 | 11 | CHIC2 | |||
| chr28:21101833-21762976 | 5 | 5 | 10 | CTNNA3 | ||
| chr22:59487979-60960603 | 8 | 1 | 9 | |||
| chr6:53514737-53692295 | 9 | 9 | ACAD9 C22H3orf37 CNBP COPG1 EFCC1 GATA2 ISY1 MIR2374 RAB7A RPN1 EFCC1 IQSEC1 ISY1 CHCHD4 HDAC11 NUP210 TMEM43 WNT7A XPC | |||
| chr14:54875898-55141942 | 8 | 8 | ANGPT1 | |||
| chr25:41191025-42687812 | 5 | 3 | 8 | BRAT1 CARD11 GNA12 GRIFIN LFNG MIR2390 MIR2890 | ||
a double deletion (CN = 0), single deletion (CN = 1), single duplication (CN = 3) and double duplication (CN = 4)
Summary statistics of CNVs identified in five Nguni cattle subpopulations
| Pop | ANMLS | ANMLsCNVs | CNVRs | Av/An | MinL | MaxL | AvL | No. Gen. |
|---|---|---|---|---|---|---|---|---|
| A | 103 | 62 | 121 | 1.71 | 42164 | 1066850 | 171789.26 | 39 |
| B | 57 | 27 | 39 | 0.98 | 62327 | 741252 | 186667.09 | 5 |
| C | 53 | 26 | 39 | 1.26 | 50170 | 518655 | 167637.18 | 65 |
| D | 23 | 6 | 8 | 0.39 | 82202 | 180684 | 146892.13 | 50 |
| E | 25 | 12 | 20 | 1.44 | 42164 | 1066850 | 223319.41 | 195 |
| Total | 261 | 133 | 268 | 1.32 | 42164 | 1066850 | 178994.23 | 339 |
The number of animals (ANMLs), animals with CNVs (ANMLsCNVs), CNVRs (CNVRs), the average number of CNVRs per animal (Av/An) the minimum (MinL), average (AvL) and maximum (MaxL) lengths of CNVs and the number of genes (No. Gen)
CNVR genes of Nguni cattle subpopulations
| Pop | No. Gen. | GEN |
|---|---|---|
| A B C E | 2 | PCDH7 CHIC2 |
| B C E | 1 | ATXN7L3B |
| A C E | 1 | NXNL2 |
| A B C | 4 | TNFAIP8 CTNNA3 SI LOC780933 |
| C E | 2 | KCND3 ATP5G3 |
| B E | 1 | ARL6 |
| A E | 17 | RAB40C KLHL1 CISD1 IPMK PWWP2B MRPL28 VPREB3 DECR2 TRNT1 PCDH10 ARL4C ZNF70 NME4 CHCHD10 IGLL1 TMEM8A OTOP1 |
| A D | 2 | CLRN1 LRFN5 |
| B C | 2 | HPS3 LOC514194 |
| A C | 8 | BICD2 CENPP ATG2B CDH12 BDKRB1 BDKRB2 ZWINT MR1 |
| A B | 11 | GABRG2 PDLIM1 LOC509513 DCTD NDST4 CDH2 C28H10orf35 COL13A1 PROM1 ADCY1 TMPRSS15 |
| E | 14 | GRAP2 SERPINB8 CADPS2 HERC4 ENTHD1 KCND2 PPP1R14C FKBP5 MSX1 CTSD FARS2 HTATSF1 NUP210 SORBS2 |
| D | 2 | ASPH FSTL5 |
| C | 44 | NUP35 URB2 HCK INSL6 PDPN PLGRKT PECAM1 ZC3H7B GDA MMS22L C6H4orf32 RHAG CPS1 TM9SF4 POFUT1 GLYATL3 SERINC1 GBE1 TM4SF18 IL1R2 C23H6orf141 CYB5R1 WBSCR17 CDH10 PHYHIPL ATF2 CNTNAP3 ADCY8 ANKRD50 CRISP2 FAM204A MRPS31 CD274 SPAM1 CELF4 KCMF1 CRISP3 HMGXB4 CDC73 KIF3B CELF2 RAB21 LACTB2 RANGAP1 |
| B | 28 | KATNBL1 MPPED2 C15H11orf70 FAM5C SH3BP4 HLTF C21H14orf49 TYW3 PAQR3 CHRM5 MIR1256 GJA1 RPL37A GPC5 CLN5 UBE2U OXR1 FAM98A COX7C SMAD4 ACSL1 LPHN2 TNNI3K CRYZ EMC7 PET112 DHX29 CADM1 |
| A | 149 | TBC1D19 PTGER3 SEC62 LOC527441 NR3C2 CA8 PFKP DDT STUB1 GGT1 AMPH FBXL16 WDR24 C15H11orf96 PRKAR2B TMEM128 RPUSD1 FAF1 NPRL3 LARGE GRB10 AXIN1 LUC7L C11H2orf28 PDIA2 PROP1 MSLN PLEKHA3 NOL4 PDGFD LYAR SPECC1L RNF185 AMY2B SUSD2 QRFPR POLR3K RFC3 ARL4A ACSL6 WFIKKN1 CLN8 ACYP2 SLC22A18 GBA3 MIR2390 FUBP3 SLC5A1 SNRPD3 C25H16orf13 SELM FGGY OTX2 KCTD16 PTGS2 CARD11 C1QTNF7 ARHGDIG DDI1 HAGHL MIF NAP1L4 MTRR H2AFY2 ALX1 ERICH1 CHTF18 FGF9 WDR1 PLEKHA1 GNG13 SRSF6 RRAGC ADIG SEMA3A UPB1 FZD1 SORCS3 NARFL LUZP2 SMARCB1 C15H11orf58 HBA SELPLG BCHE ZNF703 TMEM119 HBQ1 RGS11 MGAT4C LIN7C ITFG3 LMF1 OSTN TMEM225 GSTT4 ASS1 NRG3 ALKBH3 STAB2 CTXN3 RHBDF1 PATZ1 C21H14orf2 SNRNP25 INO80D PRR5L DRG1 ZBTB49 C17H22orf13 SLC25A21 METRN FAM173A ZNF280A KCNJ3 RHOT2 ST6GAL2 PPAP2B INPP5J GSTT3 GSTT1 QTRTD1 GGT5 HTRA1 CARS SEMA3C LOC615200 SOX2 CFTR ZNF280B PHLDA2 LPHN3 LYPLAL1 HBM LSAMP NXPH2 KCNQ1 LIMK2 SLC2A11 FAM195A GRIN3A CDKN1C DRD1 AGPAT9 PIK3IP1 DERL3 SMTN LOC516108 XRCC2 |
The number of genes (No. Gen), gene names (GEN) and the subpopulation (Pop)
Fig. 6Panther pie chart of the (a) biological pathways, (b) cellular components and (c) molecular functions represented within genes of CNVRs that overlap or share breakpoints with HPBs
Fig. 7Geographic origin of the 492 Nguni cattle sampled in the current study