| Literature DB >> 26305794 |
Joaquim Manoel da Silva1, Poliana Fernanda Giachetto2, Luiz Otávio Campos da Silva3, Leandro Carrijo Cintra2, Samuel Rezende Paiva4, Alexandre Rodrigues Caetano5, Michel Eduardo Beleza Yamagishi2.
Abstract
High density genotyping panels have been used in a wide range of applications. From population genetics to genome-wide association studies, this technology still offers the lowest cost and the most consistent solution for generating SNP data. However, in spite of the application, part of the generated data is always discarded from final datasets based on quality control criteria used to remove unreliable markers. Some discarded data consists of markers that failed to generate genotypes, labeled as missing genotypes. A subset of missing genotypes that occur in the whole population under study may be caused by technical issues but can also be explained by the presence of genomic variations that are in the vicinity of the assayed SNP and that prevent genotyping probes from annealing. The latter case may contain relevant information because these missing genotypes might be used to identify population-specific genomic variants. In order to assess which case is more prevalent, we used Illumina HD Bovine chip genotypes from 1,709 Nelore (Bos indicus) samples. We found 3,200 missing genotypes among the whole population. NGS re-sequencing data from 8 sires were used to verify the presence of genomic variations within their flanking regions in 81.56% of these missing genotypes. Furthermore, we discovered 3,300 novel SNPs/Indels, 31% of which are located in genes that may affect traits of importance for the genetic improvement of cattle production.Entities:
Mesh:
Substances:
Year: 2015 PMID: 26305794 PMCID: PMC4549312 DOI: 10.1371/journal.pone.0136035
Source DB: PubMed Journal: PLoS One ISSN: 1932-6203 Impact factor: 3.240
Fig 1Regions defined for obtaining estimates of genomic variation.
P1 represents the 50bp Illumina probe target sequence. P2 corresponds to the 50bp adjacent to P1 on the distal side of the assayed SNP. S1 and S2 are symmetrical to P1 and P2, respectively.
Fig 2Frequency of missing genotypes in Nelore cattle in a total of 1,709 samples tested with the Illumina Bovine HD.
Fig 3Distribution of SFNBs across bovine chromosomes.
Fig 4Functional characterization of 3,183 SNP markers derived from the Illumina Bovine HD panel that consistently generated missing genotypes in the Nelore breed (SFNBs).
Fig 5Functional characterization of 8,837 SNPs and INDELs identified within 100bp regions flanking SNPs assayed in the HD panel.
Fig 6Number of non-redundant SFNBs in regions flanking SNPs assayed in the Illumina Bovine HD panel (see S1 Table).
Fig 7Mean frequency and standard deviation of the nearest Nelore SNPs within 50bp and 100bp size bins.
Fig 8GO annotation of biological processes affected by genes that were identified by SFNBs from the Illumina Bovine HD panel.
Detailed information for 32 SNPs located within coding regions flanking SFNBs which resulted in non-synonymous substitutions.
An extreme case of non-synonymous mutations in gene EP300. Large numbers of olfactory receptor genes were found to contain SNPs that result in non-synonymous substitutions.
| HD Illumina SNP | SNP Position | Neighbor SNP Position | Allele | Ensembl Gene ID | Ensembl Transcript ID | Associated Gene Name | Description | Amino acids | Codons | Strand |
|---|---|---|---|---|---|---|---|---|---|---|
| BovineHD0100043813 | 1:150972066 | 1:150972082 | T | ENSBTAG00000036019 | ENSBTAT00000050509 | RIPPLY3 | ripply transcriptional repressor 3 | R/L | cGc/cTc | 1 |
| BovineHD0100043813 | 1:150972066 | 1:150972082 | T | ENSBTAG00000036019 | ENSBTAT00000050510 | RIPPLY3 | ripply transcriptional repressor 3 | R/L | cGc/cTc | 1 |
| BovineHD0100043813 | 1:150972066 | 1:150972144 | T | ENSBTAG00000036019 | ENSBTAT00000050510 | RIPPLY3 | ripply transcriptional repressor 3 | R/W | Cgg/Tgg | 1 |
| BovineHD0100043813 | 1:150972066 | 1:150972145 | A | ENSBTAG00000036019 | ENSBTAT00000050510 | RIPPLY3 | ripply transcriptional repressor 3 | R/Q | cGg/cAg | 1 |
| BovineHD0100043813 | 1:150972066 | 1:150972159 | A | ENSBTAG00000036019 | ENSBTAT00000050510 | RIPPLY3 | ripply transcriptional repressor 3 | D/N | Gat/Aat | 1 |
| BovineHD0300002957 | 3:8944823 | 3:8944722 | T | ENSBTAG00000024849 | ENSBTAT00000007838 | LOC100138271 | oviduct-specific glycoprotein-like | T/I | aCt/aTt | 1 |
| BovineHD0500032585 | 5:112843452 | 5:112843390 | A | ENSBTAG00000016198 | ENSBTAT00000021556 | EP300 | EP300 interacting inhibitor of differentiation | L/I | Ctc/Atc | 1 |
| BovineHD0500032585 | 5:112843452 | 5:112843447 | A | ENSBTAG00000016198 | ENSBTAT00000021556 | EP300 | EP300 interacting inhibitor of differentiation | A/T | Gcc/Acc | 1 |
| BovineHD0500032585 | 5:112843452 | 5:112843478 | T | ENSBTAG00000016198 | ENSBTAT00000021556 | EP300 | EP300 interacting inhibitor of differentiation | A/V | gCa/gTa | 1 |
| BovineHD0500032585 | 5:112843452 | 5:112843481 | A | ENSBTAG00000016198 | ENSBTAT00000021556 | EP300 | EP300 interacting inhibitor of differentiation | R/K | aGa/aAa | 1 |
| BovineHD0500032585 | 5:112843452 | 5:112843524 | T | ENSBTAG00000016198 | ENSBTAT00000021556 | EP300 | EP300 interacting inhibitor of differentiation | E/D | gaG/gaT | 1 |
| BovineHD0500032585 | 5:112843452 | 5:112843540 | A | ENSBTAG00000016198 | ENSBTAT00000021556 | EP300 | EP300 interacting inhibitor of differentiation | V/I | Gtt/Att | 1 |
| BovineHD0700005952 | 7:21542670 | 7:21542589 | A | ENSBTAG00000010373 | ENSBTAT00000013697 | CACTIN | cactin, spliceosome C complex subunit | R/Q | cGg/cAg | 1 |
| BovineHD1100012858 | 11:44163513 | 11:44163632 | C | ENSBTAG00000022461 | ENSBTAT00000056779 | SEPT10 | septin 10 | M/T | aTg/aCg | 1 |
| BovineHD1100027304 | 11:93678380 | 11:93678280 | C | ENSBTAG00000045527 | ENSBTAT00000063061 | LOC539172 | olfactory receptor, family 1, subfamily J, member 4-like | K/R | aAg/aGg | -1 |
| BovineHD1200020012 | 12:72533573 | 12:72533699 | C | ENSBTAG00000047181 | ENSBTAT00000035476 | LOC530803 | aTP-binding cassette, sub-family C (CFTR/MRP), member 4-like | K/E | Aag/Gag | -1 |
| BovineHD1200020012 | 12:72533573 | 12:72533707 | G | ENSBTAG00000047181 | ENSBTAT00000035476 | LOC530803 | aTP-binding cassette, sub-family C (CFTR/MRP), member 4-like | L/P | cTg/cCg | -1 |
| BovineHD1300021565 | 13:74675560 | 13:74675685 | C | ENSBTAG00000039693 | ENSBTAT00000066021 | LOC618696 | BPTI/Kunitz family of serine protease inhibitors | I/T | aTt/aCt | 1 |
| BovineHD1500014805 | 15:51516836 | 15:51516834 | C | ENSBTAG00000015357 | ENSBTAT00000028579 | OR52B4 | olfactory receptor, family 52, subfamily B, member 4 | H/R | cAt/cGt | -1 |
| BovineHD1700000025 | 17:59578 | 17:59498 | A | ENSBTAG00000048129 | ENSBTAT00000064742 | OR5I1 (Similar) | olfactory receptor, family 5, subfamily I, member 1 | S/F | tCt/tTt | -1 |
| BovineHD1700000025 | 17:59578 | 17:59511 | T | ENSBTAG00000048129 | ENSBTAT00000064742 | OR5I1 (Similar) | olfactory receptor, family 5, subfamily I, member 1 | E/K | Gag/Aag | -1 |
| BovineHD1700000025 | 17:59578 | 17:59600 | A | ENSBTAG00000048129 | ENSBTAT00000064742 | OR5I1 (Similar) | olfactory receptor, family 5, subfamily I, member 1 | S/L | tCa/tTa | -1 |
| BovineHD1800015304 | 18:52121851 | 18:52121877 | A | ENSBTAG00000001260 | ENSBTAT00000031940 | PINLYP | Phospholipase A2 inhibitor and LY6/PLAUR domain containing | V/M | Gtg/Atg | 1 |
| BovineHD1900002770 | 19:9851703 | 19:9851769 | A | ENSBTAG00000003280 | ENSBTAT00000004247 | MGC137055 | uncharacterized protein MGC137055 | V/I | Gtc/Atc | 1 |
| BovineHD1900015325 | 19:54647984 | 19:54647899 | A | ENSBTAG00000018661 | ENSBTAT00000042576 | TMC6 | transmembrane channel-like 6 | P/H | cCt/cAt | 1 |
| BovineHD2000016716 | 20:59531547 | 20:59531546 | A | ENSBTAG00000021972 | ENSBTAT00000061278 | DNAH5 | dynein, axonemal, heavy chain 5 | V/M | Gtg/Atg | 1 |
| BovineHD2100005821 | 21:20129619 | 21:20129578 | C | ENSBTAG00000045929 | ENSBTAT00000036189 | LOC512440 | myeloid-associated differentiation marker-like | M/V | Atg/Gtg | -1 |
| BovineHD2300007434 | 23:27155608 | 23:27155565 | G | ENSBTAG00000006864 | ENSBTAT00000045468 | LOC781663 | complement C4-A-like | S/G | Agc/Ggc | 1 |
| BovineHD2300008405 | 23:29518476 | 23:29518485 | T | ENSBTAG00000038562 | ENSBTAT00000054810 | LOC784787 | olfactory receptor, family 10, subfamily C, member 1-like | A/V | gCt/gTt | 1 |
| BovineHD2300015228 | 23:52445428 | 23:52445402 | G | ENSBTAG00000047485 | ENSBTAT00000064488 | OR5M10 | olfactory receptor, family 5, subfamily M, member 10 | V/L | Gtg/Ctg | -1 |
| BovineHD2600015134 | 26:16287928 | 26:16287953 | C | ENSBTAG00000023955 | ENSBTAT00000007005 | LOC505468 | cytochrome P450 family 2 subfamily C polypeptide 18-like | V/A | gTg/gCg | 1 |
| BovineHD2900011969 | 29:39707691 | 29:39707754 | C | ENSBTAG00000046686 | ENSBTAT00000063468 | LOC616004 | pregnancy-associated glycoprotein 1-like | E/G | gAa/gGa | -1 |
Fig 9IGV screenshot image.
The double vertical lines indicate the BovineHD0500032585 SNP position. Colored positions indicate flanking SNPs. There are 6 non-synonymous SNPs and 1 synonymous SNP (4th column from left).