| Literature DB >> 19144169 |
Stéphanie Robin1, Sandrine Tacher, Maud Rimbault, Amaury Vaysse, Stéphane Dréano, Catherine André, Christophe Hitte, Francis Galibert.
Abstract
BACKGROUND: Evolution has resulted in large repertoires of olfactory receptor (OR) genes, forming the largest gene families in mammalian genomes. Knowledge of the genetic diversity of olfactory receptors is essential if we are to understand the differences in olfactory sensory capability between individuals. Canine breeds constitute an attractive model system for such investigations.Entities:
Mesh:
Substances:
Year: 2009 PMID: 19144169 PMCID: PMC2635374 DOI: 10.1186/1471-2164-10-21
Source DB: PubMed Journal: BMC Genomics ISSN: 1471-2164 Impact factor: 3.969
Figure 1Distribution profile of the 732 SNP + indels.
Figure 2Distribution of SNP within the 6 breeds.
OR genes with no SNP in one or several breeds.
| 0 | 0 | 0 | 0 | 0 | 0 | 6 | |
| 0 | 0 | 0 | 0 | 0 | 0 | 6 | |
| 0 | 0 | 0 | 0 | 0 | 0 | 6 | |
| 0 | 0 | 0 | 0 | 0 | 0 | 6 | |
| 0 | 0 | 0 | 0 | 0 | 1 | 5 | |
| 0 | 0 | 0 | 0 | 0 | 1 | 5 | |
| 0 | 0 | 0 | 6 | 0 | 1 | 4 | |
| 0 | 6 | 0 | 0 | 0 | 10 | 4 | |
| 0 | 0 | 1 | 2 | 2 | 0 | 3 | |
| 0 | 2 | 1 | 0 | 1 | 0 | 3 | |
| 1 | 1 | 0 | 0 | 0 | 1 | 3 | |
| 0 | 1 | 1 | 0 | 1 | 2 | 2 | |
| 0 | 1 | 2 | 0 | 1 | 1 | 2 | |
| 0 | 1 | 4 | 0 | 1 | 2 | 2 | |
| 0 | 2 | 2 | 0 | 2 | 2 | 2 | |
| 1 | 0 | 2 | 1 | 0 | 2 | 2 | |
| 1 | 1 | 2 | 0 | 1 | 0 | 2 | |
| 2 | 2 | 0 | 0 | 2 | 2 | 2 | |
| 2 | 2 | 2 | 0 | 1 | 0 | 2 | |
| 0 | 1 | 2 | 1 | 2 | 1 | 1 | |
| 0 | 1 | 5 | 2 | 2 | 3 | 1 | |
| 0 | 2 | 2 | 2 | 2 | 2 | 1 | |
| 0 | 2 | 2 | 2 | 2 | 2 | 1 | |
| 0 | 4 | 3 | 2 | 5 | 3 | 1 | |
| 0 | 4 | 7 | 1 | 4 | 19 | 1 | |
| 0 | 6 | 6 | 6 | 6 | 6 | 1 | |
| 0 | 7 | 5 | 5 | 7 | 5 | 1 | |
| 0 | 8 | 7 | 7 | 7 | 8 | 1 | |
| 0 | 13 | 12 | 3 | 15 | 14 | 1 | |
| 1 | 1 | 1 | 0 | 1 | 1 | 1 | |
| 1 | 1 | 1 | 1 | 0 | 2 | 1 | |
| 2 | 1 | 1 | 0 | 2 | 1 | 1 | |
| 2 | 2 | 2 | 0 | 2 | 1 | 1 | |
| 3 | 2 | 2 | 2 | 0 | 4 | 1 | |
| 3 | 2 | 3 | 1 | 0 | 2 | 1 | |
| 4 | 0 | 4 | 2 | 4 | 3 | 1 | |
| 4 | 3 | 3 | 2 | 3 | 0 | 1 | |
| 5 | 5 | 5 | 5 | 0 | 5 | 1 | |
| 5 | 6 | 3 | 0 | 6 | 3 | 1 | |
| 6 | 6 | 8 | 0 | 7 | 7 | 1 | |
| 7 | 7 | 8 | 7 | 7 | 0 | 1 |
Numbers (0 to 19) refer to the number of SNP per OR gene and per breed. This table also highlights the range of polymorphism of certain OR genes within the six breeds (see, for example, CfOR0006 or CfOR04C05). This table is a subset of additional file 2.
Figure 3Boxplot of cluster sizes (1, 2) and boxplot of SNP contents (3, 4). Boxplot 1 shows the cluster sizes of the 22 least polymorphic OR genes (≤ 2 SNP). This boxplot should be compared with boxplot 2, showing the cluster sizes of the 27 OR genes with the largest number of SNP (≥ 10 SNP). Boxplot 3 corresponds to the SNP contents of 20 OR genes located in clusters with up to five OR genes. It should be compared with boxplot 4, corresponding to the 18 OR genes located in the largest cluster (243 OR genes). Arrows indicate the median values in the four boxplots.
Overrepresentation of minor alleles in specific breeds.
| T/C | 78 | 0 | 0 | 0 | 0 | 0 | 0.096 | ||
| T/A | 189 | 0 | 0 | 0 | 0 | 0 | 0.096 | ||
| T/C | 530 | 0 | 0 | 0 | 0 | 0 | 0.115 | ||
| A/G | 294 | 0.062 | 0.125 | 0.125 | 0 | 0.125 | 0.188 | ||
| A/G | 518 | 0 | 0.062 | 0 | 0 | 0.125 | 0.156 | ||
| G/A | 295 | 0.125 | 0 | 0 | 0 | 0 | 0.125 |
Numbers correspond to the frequency of the allele identified as the minor allele in the whole population. The second allele in the SNP column is always the minor allele in the whole population. As expected, the minor allele at the whole population level is also the minor allele in most breeds. Exceptions in which the minor allele at the whole population level is the major allele in one breed are indicated in bold typeface. This table is a subset of additional file 3.
SNP distribution within breeds.
| 199 | 6 |
| 120 | 5 |
| 91 | 4 |
| 50 | 3 |
| 79 | 2 |
| 193 | 1 |
Number of SNP shared by different pairs of breeds.
| 9 | 3 | 0 | 4 | 1 | |
| 14 | 1 | 3 | 19 | ||
| 0 | 8 | 4 | |||
| 7 | 5 | ||||
| 1 |
Number of SNP shared by different trios of breeds.
| 9 | |
| 2 | |
| 1 | |
| 1 | |
| 1 | |
| 9 | |
| 3 | |
| 5 | |
| 1 | |
| 1 | |
| 8 | |
| 3 | |
| 5 | |
| 1 |
Figure 4Variability in OR gene polymorphism level. Cumulative number of OR genes (y axis) plotted against N values (x axis). The graph shows that more than 50% of OR genes are highly polymorphic, with an N value even smaller than that for anonymous sequences (see Table 6), whereas ~10% are barely polymorphic (N > 5000) (see additional file 2). Note that six OR genes with a very high N value were off-scale and were not plotted on this graph.
Mean N values for OR genes and other sequences.
| 103762 | 733 | 926 | 617 | 594 | 778 | 634 | 628 | 577 | ND | |
| 25545 | 214 | 746 | 577 | 521 | 656 | 552 | 615 | 515 | 1728 | |
| 3685 | 3 | 29480 | 29480 | 9213 | 5669 | 10284 | 8189 | 8631 | ND | |
| 4766 | 10 | 2948 | 2487 | 1993 | 2334 | 2183 | 2373 | 1992 | ND | |
| 18716 | 97 | 864 | 943 | 848 | 735 | 878 | 863 | 732 | ND |
Mean N values were calculated as indicated in the Method section for the complete set of 109 OR genes and for a subset of 27 OR genes analysed in Boxer. Mean N values for exons (outside OR genes), introns (outside OR genes) and for anonymous intergenic sequences were calculated on the 48 dogs cohort. ND: not determined.
Distribution of the 580 SNP (307 silent and 273 missense) between the extracellular (EC), transmembrane (TM) and intracellular (IC) domains.
| 12 | 7 | 19 | 20 | |
| 7 | 17 | 24 | 21 | |
| 8 | 3 | 11 | 8 | |
| 7 | 8 | 15 | 27 | |
| 5 | 8 | 13 | 17 | |
| 4 | 5 | 9 | 7 | |
| 9 | 5 | 14 | 15 | |
| 7 | 16 | 23 | 20 | |
| 13 | 12 | 25 | 45 | |
| 12 | 6 | 18 | 21 | |
| 20 | 11 | 31 | 16 | |
| 5 | 15 | 20 | 35 | |
| 5 | 9 | 14 | 13 | |
| 4 | 12 | 16 | 22 | |
| 12 | 9 | 21 | 20 | |
Pseudoallele frequency (PAF).
| indel | 351 | 0.062 | 0.562 | 0.188 | 0.438 | 0.125 | 0.062 | 0.24 | |
| NS | 823 | 0.688 | 0.312 | 0.375 | 0.438 | 0.562 | 0.562 | 0.49 | |
| indel | 468 | 0.188 | 0.062 | 0 | 0 | 0 | 0 | 0.042 | |
| NS | 737 | 0.812 | 0.875 | 0.812 | 1 | 0.938 | 0.438 | 0.812 | |
| indel | 27 | 0 | 0 | 0.062 | 0 | 0 | 0 | 0.01 | |
| indel | 89 | 0 | 0.062 | 0.062 | 0 | 0 | 0 | 0.021 | |
| NS | 49 | 0 | 0 | 0.062 | 0 | 0 | 0 | 0.01 | |
| indel | 20 | 0.625 | 1 | 0.875 | 0.812 | 0.562 | 0.938 | 0.802 | |
| indel | 70 | 0 | 0 | 0 | 0 | 0 | 0.125 | 0.021 | |
| NS | 306 | 1 | 1 | 1 | 0.938 | 1 | 1 | 0.99 | |
| indel | 289 | 0.062 | 0 | 0 | 0 | 0.188 | 0 | 0.042 | |
| indel | 536 | 0 | 0 | 0 | 0.188 | 0 | 0.062 | 0.042 | |
| NS | 790 | 0 | 0.062 | 0.062 | 0 | 0 | 0 | 0.021 | |
| NS | 84 | 0 | 0 | 0 | 0 | 0 | 0.125 | 0.021 | |
| NS | 360 | 0.812 | 0.375 | 0.125 | 0.188 | 0.438 | 0.062 | 0.333 | |
| NS | 1 | 0.875 | 0.5 | 0.375 | 0.25 | 0.75 | 0.375 | 0.521 | |
| indel | 362 | 0.125 | 0.75 | 0.375 | 0.75 | 1 | 0.125 | 0.521 | |
| indel | 204 | 0 | 0.125 | 0 | 0 | 0 | 0.25 | 0.062 | |
| indel | 633 | 0.125 | 0.188 | 0.438 | 0.25 | 0.188 | 0 | 0.198 | |
| indel | 89 | 1 | 1 | 0.625 | 0.428 | 0.75 | 1 | 0.798 | |
| indel | 306 | 0 | 0 | 0 | 0.143 | 0 | 0 | 0.024 | |
| NS | 658 | 0.25 | 0.188 | 0 | 0.188 | 0.312 | 0.062 | 0.167 |
PAF was calculated independently for each breed and for the whole population. Pseudoallele distribution varies considerably between breeds, with some pseudoalleles present in all breeds and some in only one or a few breeds: 9 are common to all breeds, 5 are found in only one breed and 8 are shared by two to five breeds, giving 10, 12, 13, 10, 10 and 11 OR pseudogenes for GSD, BM, ESS, Grey, LR and Pek, respectively. NS: nonsense, indel: insertion/deletion.
Figure 5Relationship between SNP and haplotype number. Distances between points were calculated with R software (maximum distances) [43] and used to cluster OR genes. With k = 4, a group of 5 OR genes (in light blue) with a large number of SNP but a small number of haplotypes was identified, together with a group of 11 OR genes (in green) with a large number of haplotypes and a small number of SNP. We excluded from this last group the 4 OR genes with only one SNP and 2 haplotypes. Note that an individual point may correspond to more than one OR gene.
Number of breed-specific haplotypes and number of times represented.
| 11 | 41 | 43 | 24 | 25 | 61 | 1 | |
| 8 | 14 | 13 | 10 | 9 | 12 | 2 | |
| 6 | 2 | 2 | 1 | 4 | 8 | 3 | |
| 3 | 2 | 7 | 2 | 4 | 10 | 4 | |
| 1 | 0 | 0 | 0 | 0 | 2 | 5 | |
| 0 | 0 | 0 | 0 | 0 | 2 | 6 | |
| 1 | 0 | 0 | 2 | 0 | 1 | 9 | |
| 0 | 0 | 0 | 0 | 0 | 1 | 11 | |
There are 11 specific haplotypes, each present once, in GSD and 1 specific haplotype present 11 times in Pek.
Intra OR r2 values.
| 0.698 | 0.552 | 0.559 | 0.652 | 0.572 | 0.525 | 0.334 | |
| 1027 | 1701 | 1557 | 1181 | 1447 | 1903 | 3368 |
r2 values were calculated for pairs of SNP (MAF > 0.05) located within coding OR exons.
Percentage of SNP pairs with a D' value > 0.8.
| ND | ND | 75% | 89% | 82% | 66% | 38% | |
| 64% | 79% | 78% | ND | 64% | ND | 66% | |
| 100% | ND | 78% | ND | 100% | 58% | 55% | |
| 89% | 76% | 69% | 87% | 61% | ND | 52% | |
| ND | 73% | 82% | ND | 64% | 81% | 45% | |
These values were identified within 5 clusters of 104 to 182 kb (these clusters are described in additionnal file 5). ND: not determined (too few SNP pairs).