| Literature DB >> 34828308 |
Sèyi Fridaïus Ulrich Vanvanhossou1, Isabella Jasmin Giambra1, Tong Yin1, Kerstin Brügemann1, Luc Hippolyte Dossa2, Sven König1.
Abstract
This study investigated polymorphisms in the milk protein genes CSN1S1, CSN2, CSN1S2, CSN3, LALBA, and LGB, and casein haplotypes in Beninese indigenous cattle. Considering 67 animals, DNA sequencing of the genes' exons, flanking regions and parts of the 5'-upstream regions identified 1058 genetic variants including 731 previously unknown. In addition, four novel milk protein variants were detected, including CSN3K (p.Ala66Val), LALBAF (p.Arg58Trp), LGBB1 (p.Ala134Val) and LGBK (p.Thr92Asnfs*13). CSN3K is caused by a novel SNP (BTA6:85656526C>T, exon 4) whereas LALBAF and LGBB1 are due to rs714688595C>T (exon 1) and rs109625649C>T (exon 4), respectively. Regarding LGBK, a frameshift insertion of one adenine residue at BTA11:103257980 (exon 3) induces a premature translation termination resulting in a 46% reduction of the reference protein sequence. The casein polymorphisms formed five main CSN1S1-CSN2-CSN1S2-CSN3 haplotypes including B-A1-A-B, B-A1-A-A and C-A2-A-B which are predominant in the investigated cattle breeds. Moreover, in silico analyses of polymorphisms within the 5'- and 3'- untranslated regions of all six milk proteins revealed effects on microRNA and transcription factor binding sites. This study suggests a large genetic variation of milk protein genes in Beninese cattle, which should be investigated in further studies for their effects on milk production, including quality and yield traits.Entities:
Keywords: African taurine and indicine breeds; casein and whey protein genes; casein haplotype; genetic variants; microRNA; transcription factor binding sites
Mesh:
Substances:
Year: 2021 PMID: 34828308 PMCID: PMC8625544 DOI: 10.3390/genes12111702
Source DB: PubMed Journal: Genes (Basel) ISSN: 2073-4425 Impact factor: 4.096
Reference gene sequences used to investigate milk protein gene polymorphisms in the Beninese indigenous cattle breeds.
| Gene Name | Gene Sequence | |||
|---|---|---|---|---|
| Sequence ID | Location | Strand 1 | Exons ( | |
|
| ENSBTAG00000007695 | 6:85410518:85429868 | + | 19 |
|
| ENSBTAG00000002632 | 6:85448564:85458344 | − | 9 |
|
| ENSBTAG00000005005 | 6:85529305:85549156 | + | 18 |
|
| ENSBTAG00000039787 | 6:85645254:85659526 | + | 5 |
|
| ENSBTAG00000005859 | 5:31182832:31213745 | + | 4 |
|
| ENSBTAG00000014678 | 11:103255224:103264876 | + | 7 |
1 + = forward strand, − = reverse strand.
Number and percentage distribution (in brackets) of the sequence variants (SNP, InDel) detected within the exon sequences, flanking intron sequences and parts of the 5′-flanking region of the six milk protein genes (CSN1S1, CSN2, CSN1S2, CSN3, LALBA, LGB) in Beninese indigenous cattle breeds.
|
|
|
|
|
|
| |
|---|---|---|---|---|---|---|
|
| 20 (8.06) | 8 (5.83) | 26 (7.53) | 15 (5.23) | 3 (11.54) | 1 (2.22) |
|
| 228 (91.94) | 114 (94.17) | 313 (92.47) | 275 (94.77) | 23 (88.46) | 44 (97.78) |
|
| 248 | 120 | 332 | 287 | 26 | 45 |
1 InDel = insertions/deletions.
Figure 1Distribution of known and novel DNA variants within the exon sequences, flanking intron sequences and parts of the 5′-upstream regions of milk protein genes (CSN1S1, CSN2, CSN1S2, CSN3, LALBA, LGB) in Beninese indigenous cattle breeds.
Figure 2Overall percentage of the DNA variant types detected within the exon sequences (missense, synonymous, frameshift), flanking intron sequences and parts of the 5′-upstream regions of the six milk protein genes (CSN1S1, CSN2, CSN1S2, CSN3, LALBA, LGB) in Beninese indigenous cattle breeds. # The respective distributions of variant types for each gene (see above Figure 1) are presented in Table S4.
Description (ID, location and allele frequencies) of the variants causing protein polymorphisms of the six milk protein genes (CSN1S1, CSN2, CSN1S2, CSN3, LALBA, LGB), their effects on the protein sequence and the designations of consequent protein variants, and frequencies within the different Beninese local breeds and across all investigated cattle.
| Genes | SNP | Gene Transcript | Variant Frequency | ||||||||
|---|---|---|---|---|---|---|---|---|---|---|---|
| ID 1 | BTA:bp 2 | Allele | Exon | Protein 3 | Amino Acid 4 | Variant 5 | Crossbreed | Lagune | Somba | Total | |
|
| rs43703010 | 6:85427427 | A | 17 | 192 (207) | Glu | B | 0.62 | 0.97 | 0.93 | 0.81 |
| G | Gly | C | 0.38 | 0.03 | 0.08 | 0.19 | |||||
|
| rs715383373 | 6:85450908 | T | 7 | 197 (247) | Val | A1 | 0.97 | 1.00 | 1.00 | 0.99 |
| C | Ala | L | 0.03 | 0.00 | 0.00 | 0.01 | |||||
| rs43703011 | 6:85451298 | A | 7 | 67 (117) | His | A1 | 0.39 | 0.80 | 0.63 | 0.57 | |
| C | Pro | A2 | 0.61 | 0.20 | 0.37 | 0.43 | |||||
|
| rs441966828 | 6:85533780 | C | 3 | 8 (23) | Ser | A | 0.98 | 1.00 | 1.00 | 0.99 |
| T | Phe | B | 0.02 | 0.00 | 0.00 | 0.01 | |||||
|
| ss7626433428 | 6:85656526 | C | 4 | 66 (87) | Ala | B | 1.00 | 0.75 | 1.00 | 0.97 |
| T | Val | K | 0.00 | 0.25 | 0.00 | 0.03 | |||||
| rs43703015 | 6:85656736 | T | 4 | 136 (157) | Ile | B | 0.50 | 0.75 | 0.80 | 0.72 | |
| C | Thr | A | 0.50 | 0.25 | 0.20 | 0.28 | |||||
| rs43703016 | 6:85656772 | C | 4 | 148 (169) | Ala | B | 0.50 | 0.50 | 0.80 | 0.69 | |
| A | Asp | A | 0.50 | 0.50 | 0.20 | 0.31 | |||||
|
| rs714688595 | 5:31184282 | C | 1 | 10 (58) | Arg | B | 0.96 | 1.00 | 0.88 | 0.94 |
| T | Trp | F | 0.04 | 0.00 | 0.13 | 0.06 | |||||
| rs722550244 | 5:31184283 | G | 1 | 10 (58) | Arg | B | 0.88 | 1.00 | 1.00 | 0.95 | |
| A | Gln | A | 0.12 | 0.00 | 0.00 | 0.05 | |||||
| rs465119286 | 5:31184696 | A | 2 | 41(89) | Ile | B | 0.88 | 1.00 | 0.95 | 0.93 | |
| G | Val | E | 0.12 | 0.00 | 0.05 | 0.07 | |||||
|
| rs109625649 | 11:103259232 | C | 4 | 118 (134) | Ala | B | 0.89 | 0.50 | 0.81 | 0.80 |
| T | Val | B1 | 0.11 | 0.50 | 0.19 | 0.20 | |||||
| ss7626433430 | 11:103257980 | - | 3 | Thr92 | - | B | 0.96 | 1.00 | 1.00 | 0.98 | |
| A | Thr92Asnfs*13 | - | K | 0.04 | 0.00 | 0.00 | 0.02 | ||||
1 Variant ID: the rs# accession number of known variants were retrieved from the Ensembl database; the ss# accession number of the new variants identified from our sequencing were assigned by the European Variation Archive; 2 bp = SNP position according to the ARS-UCD1.2 genome map; 3 Amino acid (AA) position in mature protein. The corresponding AA positions in the protein sequence including signal peptide are given in parentheses; 4 Ala: Alanine, Arg: arginine, Asp: Aspartic acid, Gln: glutamine, Glu: glutamic acid, Gly: Glycine, His: histidine, Ile: isoleucine, Phe: Phenylalanine, Pro: proline, Ser: serine, Thr: Threonine, Trp: Tryptophan, Val: Valine; 5 Milk protein variants were inferred following previous nomenclature [9,14,15,52]. The underlined variants have been newly annotated.
Frequencies of the bovine casein haplotypes (CSN1S1-CSN2-CSN1S2-CSN3) in the different Beninese breeds and across all investigated cattle.
| Haplotypes 1 | Crossbreed | Lagune | Somba | Total |
|---|---|---|---|---|
|
| 0.01 | 0.50 | 0.47 | 0.39 |
|
| 0.20 | 0.18 | 0.25 | 0.22 |
|
| 0.39 | 0.29 | 0.20 | 0.20 |
|
| <0.01 | <0.01 | 0.02 | 0.19 |
|
| 0.38 | <0.01 | 0.02 | <0.01 |
1 Only haplotypes with a minimum frequency of 5% in one breed are considered.
Potential effects of SNP positioned within the 5′- and 3′-untranslated regions (UTR) of bovine milk protein genes on the binding of micro-RNA (miRNA) and transcription factors (TF) in Beninese cattle breeds.
| Gene | Location | Allele | ID 3 | Position | miRNA | TF | ||
|---|---|---|---|---|---|---|---|---|
| Loss 4 | Gain 4 | Loss 4 | Gain 4 | |||||
|
| 6:85411136 | C/T | rs379920406 | 5′-UTR | - | bta-miR-2291 | - | - |
| 6:85411147 | G/A | rs517257790 | 5′-UTR | - | - | - | - | |
| 6:85411192 | C/T | rs520777414 | 5′-UTR | - | - | - | - | |
| 6:85411221 | A/G | rs133040184 | 5′-UTR | bta-miR-133c | bta-miR-9-5p | - | - | |
| 6:85411236 | A/G | rs137119956 | 5′-UTR | bta-miR-2446 | - | E2F-6 | - | |
| 6:85411427 | A/C | rs135615686 | 5′-UTR | - | - | RAR | - | |
| 6:85411578 | A/G | rs109817504 | 5′-UTR | - | - | ATF-3; B-ATF; c-Jun; SMARCC1; SMARCC2 | - | |
| 6:85411618 | C/G | rs109757609 | 5′-UTR | bta-miR-2284e | bta-miR-7864 | - | Sp1; IRF-4 | |
| 6:85411677 | G/A | rs134534951 | 5′ UTR | - | - | STAT5A | - | |
| 6:85411780 | T/C | rs110163054 | 5′-UTR | bta-miR-15a; bta-miR-562 | - | - | - | |
| 6:85413195 | C/T | rs110899610 | 5′-UTR | bta-miR-2325b; bta-miR-2420; bta-miR-453 | bta-miR-376a | - | - | |
| 6:85428962 | G/A | rs466704456 | 3′-UTR | bta-miR-874 | bta-miR-219-5p; bta-miR-2355-3p | - | - | |
| 6:85429024 | C/T | ss7626432754 | 3′-UTR | - | - | - | - | |
| 6:85429077 | A/G | rs435231328 | 3′-UTR | bta-miR-338 | - | - | - | |
| 6:85429241 | A/G | rs211141048 | 3′-UTR | bta-miR-380-3p | - | SMAD4 | - | |
| 6:85429252 | C/A | rs716970086 | 3′-UTR | bta-miR-371 | - | SMAD4 | - | |
|
| 6:85449252 | C/A | rs440770944 | 3′-UTR | bta-miR-2464-3p | - | - | p63 |
|
| 6:85531897 | T/C | rs109900747 | 5′-UTR | - | bta-miR-452 | Sp1; HNF-4alpha; HNF-4gamma; FOXO1 | c-Ets-1 |
| 6:85548443 | A/G | rs211156498 | 3′-UTR | bta-miR-2318; bta-miR-2479; bta-miR-2480 | bta-miR-2463; bta-miR-452 | CP2-L1 | - | |
| 6:85548520 | A/G | rs109274107 | 3′-UTR | - | bta-miR-1388-3p | - | - | |
|
| 6:85656841 | T/A | rs109787476 | 3′-UTR | bta-miR-496 | bta-miR-2284w | ZNF274 | |
| 6:85658779 | C/A | ss7626433382 | 3′-UTR | bta-miR-193a-3p | - | - | - | |
| 6:85658873 | G/A | rs134516686 | 3′-UTR | - | - | - | - | |
|
| 5:31183709 | G/C | rs444727593 | 5′-UTR | - | bta-miR-2475 | - | - |
| 5:31183736 | G/A | rs458466372 | 5′-UTR | bta-miR-2284r; bta-miR-2284s | - | - | - | |
| 5:31183766 | T/C | rs471361585 | 5′-UTR | - | bta-miR-2370-5p | - | - | |
| 5:31183789 | A/G | rs440012037 | 5′-UTR | bta-miR-7864 | - | - | ||
| 5:31183806 | C/T | rs717249686 | 5′-UTR | - | - | - | KLF4 | |
| 5:31183848 | C/A | rs460157851 | 5′-UTR | - | bta-miR-2472; bta-miR-6517 | - | AP-2alpha; IRF-9, STAT2;STAT1 | |
| 5:31183921 | A/G | rs448925171 | 5′-UTR | bta-miR-2325c; bta-miR-6120-3p | bta-miR-218; bta-miR-2452 | - | - | |
| 5:31183924 | T/C | rs719631407 | 5′-UTR | bta-miR-6120-3p | bta-miR-218 | - | - | |
| 5:31184082 | T/C | rs462561324 | 5′-UTR | - | - | - | - | |
| 5:31185981 | C/T | rs718675014 | 3′-UTR | - | bta-miR-2425-5p; bta-miR-2467-3p | N-Myc; N-Myc | c-Myc | |
| 5:31186027 | A/G | rs110359174 | 3′-UTR | bta-miR-143 | - | - | - | |
|
| 11:103255847 | G/A | rs516356159 | 5′-UTR | - | - | - | RCoR2 |
| 11:103255918 | C/T | rs41255682 | 5′-UTR | bta-miR-1777a; bta-miR-2454-5p | - | SIX5 | - | |
| 11:103255930 | G/C | rs41255683 | 5′-UTR | - | - | - | STAT1; STAT3 | |
| 11:103260824 | G/A | rs1116405113 | 3′-UTR | bta-miR-7860 | - | - | - | |
1 bp = SNP position; 2 ref/alt = reference/alternate allele, 3 Variant ID: the rs# accession number of known variants were retrieved from the Ensembl database; the ss# accession number of the new variants identified from our sequencing were assigned by the European Variation Archive; 4 loss of binding, gain of binding.