| Literature DB >> 31443466 |
Fengwei Zhang1, Kaixing Qu2, Ningbo Chen1, Quratulain Hanif3, Yutang Jia4, Yongzhen Huang1, Ruihua Dang1, Jicai Zhang2, Xianyong Lan1, Hong Chen1, Bizhi Huang5, Chuzhao Lei6.
Abstract
We report genome characterization of three native Chinese cattle breeds discovering ~34.3 M SNPs and ~3.8 M InDels using whole genome resequencing. On average, 10.4 M SNPs were shared amongst the three cattle breeds, whereas, 3.0 M, 4.9 M and 5.8 M were specific to LQ, WN and WS breeds, respectively. Gene ontology (GO)analysis revealed four immune response-related GO terms were over represented in all samples, while two immune signaling pathways were significantly over-represented in WS cattle. Altogether, we found immune related genes (PGLYRP2, ROMO1, FYB2, CD46, TSC1) in the three cattle breeds. Our study provides insights into the genetic basis of Chinese indicine adaptation to the tropic and subtropical environment, and provides a valuable resource for further investigations of genetic characteristics of the three breeds.Entities:
Keywords: InDel; SNP; cattle; whole-genome resequencing
Year: 2019 PMID: 31443466 PMCID: PMC6769757 DOI: 10.3390/ani9090596
Source DB: PubMed Journal: Animals (Basel) ISSN: 2076-2615 Impact factor: 2.752
Summary of sequencing and mapping results in 15 samples.
| Breed | Sample ID | SRR ID | Total Reads | Aligned Reads Rate (%) | Duplication Rate (%) | Average Read Depth |
|---|---|---|---|---|---|---|
| WS | WS1 | SRR6024561 | 132599505 | 131966081 (99.52%) | 5.24% | 6.7367× |
| WS2 | SRR6024562 | 210348316 | 208908938 (99.32%) | 7.57% | 10.9981× | |
| WS3 | SRR6024569 | 213171893 | 209992479 (98.51%) | 7.56% | 11.0636× | |
| WS4 | SRR6024575 | 220677796 | 219684659 (99.55%) | 7.58% | 10.6536× | |
| WS5 | SRR6024576 | 185678168 | 184843226 (99.55%) | 6.60% | 9.3981× | |
| WS6 | SRR6024577 | 220875686 | 219906218 (99.56%) | 7.17% | 11.5984× | |
| WS7 | SRR6024578 | 124034762 | 123493668 (99.56%) | 5.40% | 6.5332× | |
| WN | WN4 | SRR5507199 | 453240314 | 443641290 (97.88%) | 10.12% | 22.9631× |
| WN8 | SRR5507198 | 453327732 | 448362891 (98.9%) | 9.91% | 23.1603× | |
| WN9 | SRR5507195 | 189321051 | 187411280 (98.99%) | 5.63% | 9.6483× | |
| WN10 | SRR5507196 | 203146450 | 201321637 (99.1%) | 6.00% | 10.1569× | |
| WN11 | SRR5507197 | 229615463 | 228096174 (99.34%) | 6.33% | 11.7816× | |
| LQ | LQ5 | SRR5507190 | 229615463 | 228096174 (99.34%) | 6.33% | 11.5007× |
| LQ12 | SRR5507189 | 219526027 | 217652138 (99.15%) | 6.14% | 11.1222× | |
| LQ15 | SRR5507188 | 208719494 | 207367555 (99.35%) | 5.91% | 10.6481× |
Functional annotation of the detected SNP variants in three cattle breeds.
| Fields | LQ | WN | WS |
|---|---|---|---|
| Total number | 19,178,051 | 23,091,150 | 23,431,130 |
| 3 prime UTR | 390,813 | 481,642 | 500,217 |
| 5 prime UTR | 110,542 | 137,007 | 146,969 |
| Downstream gene | 4,673,996 | 5,721,422 | 5,884,594 |
| Initiator codon | 25 | 31 | 39 |
| Intergenic region | 10,333,593 | 12,409,887 | 12,561,984 |
| Intragenic | 13,622,957 | 16,715,251 | 17,111,225 |
| Intron | 85,061,834 | 100,821,955 | 105,226,537 |
| Missense | 138,168 | 173,641 | 212,294 |
| Non coding transcript exon | 172,858 | 212,090 | 223,022 |
| Non coding transcript | 32,436,546 | 39,244,542 | 39,287,003 |
| Splice acceptor | 717 | 900 | 887 |
| Splice donor | 857 | 1152 | 1129 |
| Splice region | 59,437 | 74,604 | 79,809 |
| Start lost | 246 | 273 | 329 |
| Stop gained | 1108 | 1253 | 1476 |
| Stop lost | 211 | 273 | 263 |
| Stop retained | 221 | 246 | 258 |
| Synonymous | 278,880 | 348,230 | 528,134 |
| Upstream gene | 4,538,595 | 5,552,247 | 5,651,124 |
| Novel | 4,354,976 | 5,288,934 | 5,226,594 |
| Known | 14,823,074 | 17,802,215 | 18,204,535 |
Figure 1Distribution of insertions and deletions length in all InDels.
Figure 2Venn diagram describes overlap and unique SNPs/InDels between the three breeds (LQ, WN, WS). The numbers show specific SNPs/InDels for each breed or overlapping SNPs/InDels between any two breeds or among three breeds. (a) The identified shared and specific SNPs for each breed. (b) The identified shared and specific InDels for each breed.
Distribution of SnpEff annotation InDel variants in three cattle breeds.
| Fields | LQ | WN | WS |
|---|---|---|---|
| Total number | 2,153,542 | 2,586,758 | 2,471,063 |
| 3 prime UTR | 54,892 | 67,465 | 65,837 |
| 5 prime UTR | 9904 | 12,229 | 12,421 |
| Bidirectional gene fusion | 80 | 83 | 139 |
| Conservative inframe deletion | 561 | 814 | 1021 |
| Conservative inframe insertion | 434 | 546 | 781 |
| Disruptive inframe deletion | 1039 | 1292 | 1735 |
| Disruptive inframe insertion | 483 | 583 | 955 |
| Downstream gene | 585,215 | 719,779 | 693,994 |
| Frameshift variant | 4168 | 5516 | 9559 |
| Gene fusion | 198 | 178 | 232 |
| Intergenic region | 1152,941 | 1,382,145 | 1,318,068 |
| Intragenic | 1,485,117 | 1,817,239 | 1,775,709 |
| Intron | 9,407,833 | 11,108,642 | 10,992,364 |
| Non coding transcript exon | 17,425 | 21,355 | 21,201 |
| Non coding transcript | 3,688,414 | 4,441,858 | 4,170,309 |
| Splice acceptor | 384 | 451 | 471 |
| Splice donor | 368 | 412 | 477 |
| Splice region | 6396 | 7787 | 8542 |
| Start lost | 58 | 64 | 54 |
| Stop gained | 57 | 73 | 131 |
| Stop lost | 32 | 73 | 61 |
| Upstream gene | 54,6024 | 666,968 | 642,273 |
Gene Ontology (GO) reveals three cattle sharing high or moderate effects >5 SNPs.
| GO Biological Process Complete | Count | Fold Enrichment | FDR |
|---|---|---|---|
| Smell | |||
| Sensory perception of smell (GO:0007608) | 104 | 3.05 | 2.78 × 10−18 |
| Detection of chemical stimulus involved in sensory perception of smell (GO:0050911) | 101 | 3.03 | 5.15 × 10−18 |
| Sensory perception of chemical stimulus (GO:0007606) | 104 | 2.96 | 4.27 × 10−18 |
| Detection of chemical stimulus involved in sensory perception (GO:0050907) | 101 | 2.99 | 6.22 × 10−18 |
| Immune responses | |||
| Complement activation (GO:0006956) | 9 | 7.5 | 3.92 × 10−3 |
| Humoral immune response (GO:0006959) | 11 | 4.72 | 1.62 × 10−2 |
| Complement activation, classical pathway (GO:0006958) | 6 | 9.21 | 3.29 × 10−2 |
| Activation of immune response (GO:0002253) | 16 | 3.01 | 4.58 × 10−2 |
| Metabolic process | |||
| Metabolic process (GO:0008152) | 114 | 0.57 | 2.55 × 10−10 |
| Organic substance metabolic process (GO:0071704) | 98 | 0.56 | 2.16 × 10−9 |
| Cellular metabolic process (GO:0044237) | 87 | 0.54 | 2.92 × 10−9 |
| Primary metabolic process (GO:0044238) | 95 | 0.57 | 5.67 × 10−8 |
| Organonitrogen compound metabolic process (GO:1901564) | 64 | 0.55 | 4.18 × 10−5 |