| Literature DB >> 28323836 |
Nedenia Bonvino Stafuzza1, Adhemar Zerlotini2, Francisco Pereira Lobo2, Michel Eduardo Beleza Yamagishi2, Tatiane Cristina Seleguim Chud1, Alexandre Rodrigues Caetano3, Danísio Prado Munari1, Dorian J Garrick4, Marco Antonio Machado5, Marta Fonseca Martins5, Maria Raquel Carvalho6, John Bruce Cole7, Marcos Vinicius Gualberto Barbosa da Silva5.
Abstract
Whole-genome re-sequencing, alignment and annotation analyses were undertaken for 12 sires representing four important cattle breeds in Brazil: Guzerat (multi-purpose), Gyr, Girolando and Holstein (dairy production). A total of approximately 4.3 billion reads from an Illumina HiSeq 2000 sequencer generated for each animal 10.7 to 16.4-fold genome coverage. A total of 27,441,279 single nucleotide variations (SNVs) and 3,828,041 insertions/deletions (InDels) were detected in the samples, of which 2,557,670 SNVs and 883,219 InDels were novel. The submission of these genetic variants to the dbSNP database significantly increased the number of known variants, particularly for the indicine genome. The concordance rate between genotypes obtained using the Bovine HD BeadChip array and the same variants identified by sequencing was about 99.05%. The annotation of variants identified numerous non-synonymous SNVs and frameshift InDels which could affect phenotypic variation. Functional enrichment analysis was performed and revealed that variants in the olfactory transduction pathway was over represented in all four cattle breeds, while the ECM-receptor interaction pathway was over represented in Girolando and Guzerat breeds, the ABC transporters pathway was over represented only in Holstein breed, and the metabolic pathways was over represented only in Gyr breed. The genetic variants discovered here provide a rich resource to help identify potential genomic markers and their associated molecular mechanisms that impact economically important traits for Gyr, Girolando, Guzerat and Holstein breeding programs.Entities:
Mesh:
Year: 2017 PMID: 28323836 PMCID: PMC5360315 DOI: 10.1371/journal.pone.0173954
Source DB: PubMed Journal: PLoS One ISSN: 1932-6203 Impact factor: 3.240
Summary of sequencing and assembly results for the seven bulls representing Gyr, Girolando, Guzerat and Holstein cattle breeds.
| Sample | Total reads | Mapped reads | Coverage |
|---|---|---|---|
| Gyr_1 | 407,432,426 | 98.98% | 15.01 |
| Gyr_2 | 420,236,440 | 98.75% | 15.49 |
| Girolando_1 | 297,636,324 | 99.07% | 10.97 |
| Girolando_2 | 446,757,536 | 98.90% | 16.46 |
| Girolando_3 | 345,972,522 | 99.05% | 12.75 |
| Guzerat_1 | 352,056,808 | 98.97% | 12.97 |
| Guzerat_2 | 445,595,922 | 98.88% | 16.42 |
| Holstein_1 | 338,028,986 | 99.02% | 12.46 |
| Holstein_2 | 326,956,582 | 99.71% | 12.05 |
| Holstein_3 | 380,512,652 | 99.60% | 14.02 |
| Holstein_4 | 291,946,026 | 99.71% | 10.76 |
| Holstein_5 | 304,111,802 | 94.35% | 11.21 |
Summary of SNVs and InDels identified in this study for Gyr, Girolando, Guzerat and Holstein breeds.
| Breed | SNVs | InDels | ||
|---|---|---|---|---|
| total | novel | total | novel | |
| Gyr | 15,941,804 | 970,823 | 1,833,387 | 342,899 |
| Girolando | 13,286,669 | 626,921 | 1,413,047 | 227,176 |
| Guzerat | 16,743,392 | 894,177 | 1,975,563 | 370,095 |
| Holstein | 8,944,009 | 626,269 | 1,348,564 | 228,835 |
Fig 1Distribution of InDel lengths (bp) for Gyr, Girolando, Guzerat, and Holstein cattle breeds.
Description of insertion-to-deletion, heterozygous-to-homozygous (Het/Hom) and transition-to-transversion (Ti/Tv) ratios for InDels and SNVs.
| Sample | Insertions/Deletions | SNVs | |||||
|---|---|---|---|---|---|---|---|
| Insertion/Deletion | Insertion Het/Hom | Deletion Het/Hom | InDel Het/Hom | Total Het/Hom | Ti/Tv | Het/Hom | |
| Gyr_1 | 0.94 | 1.05 | 1.38 | 1.70 | 1.26 | 2.33 | 1.23 |
| Gyr_2 | 0.92 | 1.08 | 1.42 | 1.66 | 1.37 | 2.37 | 1.34 |
| Girolando_1 | 0.96 | 2.47 | 3.58 | 4.26 | 3.74 | 2.37 | 3.73 |
| Girolando_2 | 0.94 | 2.49 | 3.51 | 3.78 | 4.01 | 2.30 | 4.02 |
| Girolando_3 | 0.97 | 1.91 | 2.56 | 3.15 | 2.65 | 2.28 | 2.65 |
| Guzerat_1 | 0.95 | 1.09 | 1.46 | 1.84 | 1.32 | 2.34 | 1.29 |
| Guzerat_2 | 0.94 | 1.19 | 1.54 | 1.92 | 1.39 | 2.33 | 1.35 |
| Holstein_1 | 0.98 | 1.34 | 1.63 | 2.06 | 1.68 | 2.06 | 1.68 |
| Holstein_2 | 0.99 | 1.37 | 1.71 | 2.11 | 1.76 | 2.12 | 1.76 |
| Holstein_3 | 0.98 | 1.37 | 1.71 | 2.05 | 1.79 | 2.16 | 1.79 |
| Holstein_4 | 1.00 | 1.49 | 1.91 | 2.33 | 1.89 | 2.12 | 1.88 |
| Holstein_5 | 0.96 | 0.85 | 0.94 | 1.19 | 0.95 | 1.64 | 0.93 |
Fig 2Venn diagram showing the overlap of all identified SNVs and InDels in the Gyr, Girolando, Guzerat, and Holstein genomes.
Summary of functional classification of SNVs and InDels identified in Gyr, Girolando, Guzerat and Holstein breeds.
Some SNVs and InDels are represented in multiple categories.
| Classification | Gyr | Girolando | Guzerat | Holstein |
|---|---|---|---|---|
| Splice acceptor variant | 385 | 348 | 376 | 276 |
| Splice donor variant | 368 | 346 | 370 | 225 |
| Stop gained | 535 | 481 | 598 | 534 |
| Frameshift variant | 1,479 | 1,364 | 1,491 | 987 |
| Stop lost | 47 | 35 | 42 | 27 |
| Start lost | 107 | 100 | 102 | 56 |
| Inframe insertion | 277 | 235 | 238 | 92 |
| Inframe deletion | 358 | 306 | 302 | 153 |
| Missense variant | 53,519 | 48,073 | 54,433 | 26,931 |
| Protein altering variant | 74 | 72 | 66 | 50 |
| Splice region variant | 15,337 | 12,870 | 15,172 | 6,620 |
| Synonymous variant | 78,030 | 66,437 | 77,527 | 27,820 |
| Stop retained variant | 52 | 46 | 51 | 26 |
| Coding sequence variant | 396 | 360 | 366 | 221 |
| Mature miRNA variant | 120 | 111 | 132 | 81 |
| 5_prime_UTR_variant | 8,643 | 7,518 | 7,945 | 2,245 |
| 3_prime_UTR_variant | 42,266 | 36,082 | 42,233 | 19,609 |
| Non-coding transcript exon variant | 8,260 | 7,019 | 8,610 | 4,523 |
| Intron variant | 5,648,353 | 4,701,181 | 5,845,387 | 2,956,134 |
| Non-coding transcript variant | 8,628 | 7,368 | 8,979 | 4,710 |
| Upstream gene variant | 893,243 | 765,141 | 892,485 | 458,765 |
| Downstream gene variant | 899,298 | 766,050 | 910,644 | 457,038 |
| Intergenic variant | 12,442,695 | 10,192,691 | 13,244,468 | 7,556,938 |
Gene Ontology (GO) terms and KEGG pathways enriched (FDR<0.10) in Gyr, Girolando, Guzerat and Holstein breeds.
| Terms | Gyr | Girolando | Guzerat | Holstein | ||||
|---|---|---|---|---|---|---|---|---|
| Genes | FDR | Genes | FDR | Genes | FDR | Genes | FDR | |
| GO:0007186 G-protein coupled receptor signaling pathway | 612 | 5.35E-25 | 615 | 6.38E-37 | 632 | 4.24E-34 | 531 | 7.81E-63 |
| GO:0007608 Sensory perception of smell | 173 | 1.24E-09 | 173 | 6.66E-13 | 182 | 1.47E-15 | 153 | 1.01E-21 |
| GO:0050907 Detection of chemical stimulus involved in sensory perception | 135 | 1.84E-09 | 135 | 2.88E-12 | 137 | 7.68E-11 | 124 | 3.30E-22 |
| GO:0050911 Detection of chemical stimulus involved in sensory perception of smell | - | - | 45 | 0.03039 | 46 | 0.05485 | 45 | 8.99E-09 |
| GO:0006508 Proteolysis | 139 | 0.02278 | - | - | - | - | - | - |
| GO:0007165 Signal transduction | 297 | 0.02108 | - | - | - | - | 207 | 0.01465 |
| GO:0016021 Integral component of membrane | 2878 | 2.32E-11 | 2736 | 5.33E-13 | 2874 | 5.97E-11 | 1929 | 3.42E-15 |
| GO:0005886 Plasma membrane | 1643 | 1.17E-18 | 1618 | 2.55E-31 | 1680 | 5.21E-27 | 1192 | 6.07E-34 |
| GO:0005887 Integral component of plasma membrane | 589 | 0.00130 | - | - | - | - | - | - |
| GO:0045095 Keratin filament | - | - | 48 | 0.02374 | - | - | - | - |
| GO:0004984 Olfactory receptor activity | 661 | 2.86E-36 | 681 | 3.35E-59 | 694 | 7.46E-55 | 613 | 1.51E-102 |
| GO:0004930 G-protein coupled receptor activity | 764 | 5.20E-32 | 775 | 9.03E-49 | 796 | 1.60E-46 | 672 | 3.52E-83 |
| GO:0004888 Transmembrane signaling receptor activity | 165 | 3.78E-11 | 163 | 1.19E-12 | 167 | 2.98E-12 | 144 | 1.48E-20 |
| GO:0005549 Odorant binding | 143 | 3.36E-07 | 143 | 1.68E-09 | 145 | 3.69E-08 | 125 | 8.74E-16 |
| GO:0005524 ATP binding | 871 | 6.08E-05 | 812 | 0.05890 | 880 | 5.89E-06 | 621 | 4.21E-09 |
| GO:0042626 ATPase activity, coupled to transmembrane movement of substances | - | - | - | - | - | - | 36 | 0.00680 |
| bta04740 Olfactory transduction | 705 | 0.00217 | 723 | 7.39E-12 | 736 | 7.06E-09 | 656 | 1.51E-50 |
| bta04512 ECM-receptor interaction | - | - | 68 | 0.10062 | 71 | 0.04028 | - | - |
| bta02010 ABC transporters | - | - | - | - | - | - | 33 | 0.00254 |
| bta01100 Metabolic pathways | 812 | 0.02570 | - | - | - | - | - | - |
Comparison of BovineHD BeadChip array genotypes to sequencing SNVs.
| Sample | Sequencing calls | Concordant | Discordant |
|---|---|---|---|
| Gyr_1 | 99.42% | 267,258 | 1,558 |
| Gyr_2 | 99.45% | 196,537 | 1,091 |
| Girolando_1 | 98.46% | 145,365 | 2,274 |
| Girolando_2 | 98.92% | 153,157 | 1,670 |
| Girolando_3 | 98.66% | 185,638 | 2,527 |
| Guzerat_2 | 99.39% | 247,931 | 1,526 |
*Percentage of detectable array SNPs that were identified as SNV by sequencing