| Literature DB >> 22190937 |
Abstract
Early childhood caries affects 28% of children aged 2-6 in the US and is not decreasing. There is a well-recognized need to identify susceptible children at birth. Caries-free adults neutralize bacterial acids in dental biofilms better than adults with severe caries. Saliva contains acidic and basic proline-rich proteins (PRPs) which attach to oral streptococci. The PRPs are encoded within a small region of chromosome 12. An acidic PRP allele (Db) protects Caucasian children from caries but is more common in African Americans. Some basic PRP allelic phenotypes have a three-fold greater frequency in caries-free adults than in those with severe caries. Early childhood caries may associate with an absence of certain basic PRP alleles which bind oral streptococci, neutralize biofilm acids, and are in linkage disequilibrium with Db in Caucasians. The encoding of basic PRP alleles is updated and a new technology for genotyping them is described.Entities:
Year: 2011 PMID: 22190937 PMCID: PMC3235478 DOI: 10.1155/2011/953412
Source DB: PubMed Journal: Int J Dent ISSN: 1687-8728
Figure 1Dietary sucrose intake and dental caries severity. Each point on the graph represents a different country. Mean DMFT of the population of each country is graphed against mean sugar consumption of 12-year old children. The findings were available from World Health Organization activities in oral epidemiology and published in 1982 [10]. The graph was assembled by the author [9].
All possible combinations of expressed PRH1 alleles. The three proteins encoded by the PRH1 locus are on the left two columns and the two proteins encoded by the PRH2 locus in Caucasians on the right two columns. Because this locus is expressed from both parental genes (12A and 12B), there are six possible protein (allelic) combinations of Pa, Pif, and Db and three possible combinations of Pr1 and Pr2. This gives a total of 18 possible combinations (polymorphisms) among individuals. A single Caucasian individual has one of the six combinations encoded by the PRH1 locus paired with one of the three combinations encoded by the PRH2 locus [9].
| PRH1 locus1 | PRH2 locus1 | ||
|---|---|---|---|
| Chromosome 12A Protein | Chromosome 12B Protein | Chromosome 12A Protein | Chromosome 12B Protein |
|
|
| PRP-1 | PRP-1 |
|
|
| PRP-1 | PRP-2 |
|
|
| PRP-2 | PRP-2 |
|
|
| ||
|
|
| ||
|
|
| ||
1Total = 6 PRH1 allelic combinations X 3 PRH2 combinations = 18.
Figure 2Major genes encoding the acidic proline-rich proteins. The genes are shown 5′– 3′. Dark or gray rectangles represent exons. The unfilled rectangles indicate untranslated portions of exons or whole exons which are upstream of the translational start (AUG) codon. Vertical double-pointing arrow indicates the position of the translation start codon in exon 1. Exons are numbered thereafter. The PRH1 gene shown is for alleles Pa and Pif. Db is encoded by the same structured gene containing an additional 63-base insert in exon 3. The two alleles of PRH2 encode separate proteins, Pr1 and Pr2. The transcripts were obtained by inserting the gene name at http://www.genecards.org/ and then selecting for the transcript diagram at ensembl.org. The transcripts are modified to read 5′–3′ and annotated.
Figure 3Major genes encoding the basic proline-rich proteins. Annotations and data were obtained as described in the legend to Figure 2. Gene PRB1 (top) is polymorphic due to tandem repeats in exon 4, which encodes up to 15 sets of the repeating 20-amino acid sequence in Table 3. The alleles may be long, PRB1L (15 repeats in exon 4–ochre), or short PRB1S (9 or fewer repeats in exon 4–dark red). A medium allele (PRB1M) contains about 12 repeats (not shown). Note that exon 4 of PRB1L was modified from ensembl transcript: PRB1-001 to encode 15 repeats instead of 12. At least one allele of PRB1 is transcribed but not translated. Gene PRB2 is similar to PRB1 in organization, size, and number of repeats, except that the repeats and most of the translated sequence occurs in exon 3. The encoded repeating amino acid sequence (Table 3) is slightly different from the repeat sequence of PRB1. Gene PRB3 occurs as three major alleles. Unlike PRB1 and 2, there are at least one and sometimes two introns within the exon coding sequence repeat region beginning in exon 3. Exons 3 and 4 (and 5 if present) encode 10 tandem repeats of 21 amino acids. The depicted first and second alleles (Ensembl PRB1-001 and PRB3-002) encode essentially identical proteins despite the additional intron in the second, an alternate form of the long allele (PRB3L). An allele missing internal residues 158–220 (4 tandem repeats) is known as a short allele (PRB3S; not depicted). The third depicted allele is the alternative short allele (PRB3_1) which has a C-terminal deletion of 67 amino acids due to a base deletion after the first third of exon 4 (missing 3 tandem repeats) and its C-terminus consists of a sequence of 12 amino acids poorly homologous to the terminal residues of PRB3S. In some individuals, this allele is transcribed but not translated. Gene PRB4 occurs also as three major alleles with different introns as depicted in the figure. The protein is mostly encoded in exon 3 but may differ in length. The longest allele contains 9.5 tandem repeats of 21 amino acids that are slightly different from the repeats encoded by gene PRB3. The whole protein is P10163 (UniProtKB/Swiss-Prot). The proteins reported by Esembl.org (PRB4_1 and PRB4_2.1) are identical but missing residues 113 through 154 and 164 through 184 (missing 3 repeats). The middle transcript variant has an intron within the center of exon 3, resulting in a shortened protein due to loss of its central portion (encoded residues 113 through 181; missing 6 repeats).
List of PRB protein repeats.
| PRB1 | |
| 9, 12, or 15 repeats of a 20-amino acid sequence: P-P-G-K-P-Q-G-P-P-[PAQ]-Q-[GE]-[GD]-[NKS]-[KSQRN]-[PRQS]-[QS] [GPS]-[PQAR]-[PSR] | |
|
| |
| PRB2 | |
| 9, 12, or 15 repeats of a 20-amino acid sequence: P-P-G-K-P-Q-G-P-P-P-Q-G-[GD]-[NKS]-[KSQ]-[PRS]-[QRS] [GPS]-[PSAR]-[PSR] | |
|
| |
| PRB3 | |
| 6, 7, or 10 repeats of a 21-amino acid sequence: [RH]-P-G-K-P-[EQ]-G-[PQS]-P-[PS]-Q-[GE]-G-N-[QK]-[SP]-[QR]-[GR]-P-P-P | |
|
| |
| PRB4 | |
| 3.5, 6.5, or 9.5 repeats of a 21-amino acid sequence: AA tandem repeats of K-P-[EQ]-[GR]-[PR]-[PR]-P-Q-G-G-N-Q-[PS]-[QH]-[RG]-[PT]-P-P-[PH]-P-G with the last repeat being truncated at residue 11–N) | |
Basic PRP genes and products1.
| Locus | Name of encoded protein2 | Partial | Residues3 |
|---|---|---|---|
| PRB1: | Basic salivary proline-rich protein 1 | 1–392 | |
| Pe (II-2) | 17–91 | ||
| Ps2 | 92–392 | ||
| IB-9 (PmF) | 91–152 | ||
| Ps1 (deletion)4 | 92–153; 213–392 | ||
| Con2 (deletion)4 | 152–194; 214–274 | ||
| IB-6 (PmS, P-I)5 | 275–392 | ||
|
| |||
| PRB2: | Basic salivary proline-rich protein 2 | 1–416 | |
| (Con1 glycoprotein) | IB-1 | 17–112 | |
| IB-7 (P-G) | 113–1716 or 1747 | ||
| Con1 | 175–299 | ||
| IB-8C (P-F) | 299–359 | ||
| IB-4 (P-H) | 361–416 | ||
|
| |||
| PRB3: | Basic salivary proline-rich protein 3 | 1–309 | |
| (Parotid salivary glycoprotein G1) | G1 | 17–309 | |
|
| |||
| PRB4: | Basic salivary proline-rich protein 4 | 1–310 | |
| (Parotid o protein) | Protein N1 | 17–39 | |
| (Salivary proline-rich protein II-1) | Glycosylated Pr A | 40–177 | |
| IB-5 (P-D) | 241–310 | ||
1From Azen et al. [70, Table 1] and from Ayad et al. [31, Table 3] updated from the Protein Knowledgebase website (UniProt.org) which was accessed from GeneCards or Epasy websites.
2There is extensive polymorphism (see text). Many proteins encoded by PRB1 and PRB2 [60], Ps1, Ps2, PmS PmF, Pe, Con2 are not included in the Protein Knowledgebase, or (for Con1) the composition is improperly indicated as an alternative name for the whole gene product, which it is not (see Figure 5).
3First set of numbers for each gene indicates the full length encoded polypeptide of each gene. The numbers beneath indicate the fragments of the polypeptide that are commonly found in parotid saliva secretions. The N-terminal 16 amino acids comprise a secretion signal which is removed prior to secretion (Figure 4).
4Deletion of repeating sequences 154–212 or 195–213
5Sequence is identical to IB-8c plus IB-4. IB-4 (P-H) is not normally released from IB-6 because residues 333–336 are QSAR (Figure 4). IB-4 (P-H) is normally derived from PRB2 where the last 4 C-terminal amino acids of IB-8c (356–360) are RSAR (see text, Section 6). At this time, Uniprot.org appears to have incorrectly listed P-H as produced by PRB1, but IB4 correctly from PRB2. P-H is an alternative name for IB4 and would only be produced from PRB1 if the codon for Q (residue 333) is mutated to R, an uncommon A to G mutation, or an even less common double mutation.
6,7Termination codon in PRB2S, or cleavage in PRB2M (see legend to Figure 5).
Figure 4Allelic variations of PRB1 gene. Uppermost sequence is PRB1L from the Expasy website. The N-terminal 16 amino acids indicate the secretion signal which is cleaved within the parotid gland cells before the protein is secreted. The red amino acids (residues 22 and 34) are encoded by the joining of exons 1 and 2, and of exons 2 and 3 (Figure 3). The 15 repeats of the 20-amino acid repeating sequence (Table 3) start at residue 53 and are coded alternately orange and black. A single light blue amino acid separates some of the repeats. Residues 41–53 (orange before the first repeat) comprise the last 11 amino acids of a truncated repeat sequence. Exon 3 is connected to the variable length exon 4 (Figure 3) at residue 113 (red). Lower sequences show the positions of the various alleles listed in Table 4. Row 1. The N-terminal portion of Pe (orange) is encoded by exons 1 and 2. The portion of Pe encoded with other allelic proteins on exon 3 (listed with alternative names in Table 4) is shown in red. Row 2. Pe_Ps2 (red and yellow highlight) Row 3. Pe_Ps1 (red and pale green) Row 4. Pe_Ps0 (red and black) Row 5. Pe_PmF_PmS (red, purple and light blue) Row 6. Pe_PmF_Con2_IB-6 (red, purple, light blue, and pale blue). Rows 7 and 8 are translations of alleles reported by Ensembl.org, Ensembl_protein_1, and Ensembl_protein_2 (black). Both alleles likely express Pe and truncated forms of Ps1. Amino acid numbering for each allele or each furin-cleaved segment of each allele is given in Table 4. Gaps indicate deleted sequences; arrows indicate where allelic sequences differ, and underlines indicate furin cleavage sites (Section 6). The asterisk indicates the early termination of allele Ps0 in which residue 150 (R, encoded CGA) is mutated to UGA (stop). Peptide Pe is expressed, but the slightly shorter PmF-like protein is not detected [63].
Figure 5Allelic variations of PRB2 gene. Upper sequence is PRB2L from the Uniprot.org website. The secretion signal and two amino acids connecting the exon-encoded sequences are indicated in Figure 3 as described for Figure 4. Magenta (residues 41–51) indicates a split repeat sequence whose last 11 amino acids precede the 15 repeats of a 20-amino acid segment (Table 4). The sequences are indicated by alternating black and red colors. A green-colored amino acid (S) separates precedes the first and some later repeats. Magenta (residue 362–370) indicates the first 9 amino acids of the split sequence. Lower sequence indicates different allelic products. The first 17 residues of peptide IB-1 (purple) are encoded by exon2 and its downstream residues by exon 3. The products are color coded: IB-1, purple; IB-7, brown; Con1, purple; IB8c, blue; IB-4, green. Asterisk indicates R (CGA) mutated to UGA (stop) with truncation of the allele at IB-7 or IB4 [75] and is indicated by asterisks at the appropriate arginine residues. The amino acid numbering within each allele or each furin-cleaved segment of each allele is given in Table 4. Underlines indicate furin cleavage sites (Section 6).
(a)
| Protein |
|
| 2Fold increase | 3% min | 4% max |
|---|---|---|---|---|---|
| amylase | 650 | 2600 | 4.0 | 72.1% | 63.6% |
| cystatin | 2 | 4 | 2.0 | 0.2% | 0.1% |
| PRPs | 230 | 1251 | 5.4 | 25.5% | 30.6% |
| mucins | 0 | 0 | 0.0 | 0.0% | 0.0% |
| sIgA | 20 | 230 | 11.5 | 2.2% | 5.6% |
|
| |||||
| Total | 902 | 4085 | |||
(b)
| Protein |
|
| Fold increase | % min | % max |
|---|---|---|---|---|---|
| amylase | 0 | 0 | 0.0 | 0.0% | 0.0% |
| cystatin | 92 | 280 | 3.0 | 19.0% | 12.6% |
| PRPs | 270 | 1335 | 4.9 | 55.9% | 59.8% |
| mucins | 80 | 560 | 7.0 | 16.6% | 25.1% |
| sIgA | 41 | 56 | 1.4 | 8.5% | 2.5% |
|
| |||||
| Total | 483 | 2231 | |||
(c)
| Protein |
|
| Fold increase | % min | % max |
|---|---|---|---|---|---|
| amylase | 380 | 500 | 1.3 | 46.4% | 23.8% |
| cystatin | 240 | 280 | 1.2 | 29.3% | 13.3% |
| PRPs | 90 | 180 | 2.0 | 11.0% | 8.6% |
| mucins | 90 | 700 | 7.8 | 11.0% | 33.3% |
| sIgA | 19 | 439 | 23.1 | 2.3% | 20.9% |
|
| |||||
| Total | 819 | 2099 | |||
1Data rearranged from of [91, Table 1].
2Fold increase is the minimal (min) to maximal (max) concentration (μg/mL), giving the minimal concentration a value of 1.
3% min-% of min total content
4% max-% of max total content.