Adel F Alharbi, Nongfei Sheng, Katie Nicol, Nicklas Strömberg, Edward J Hollox.
Abstract
Discovering loci under balancing selection in humans can identify loci with alleles that affect response to the environment and disease. Genome variation data have identified the 5' region of the DMBT1 gene as undergoing balancing selection in humans. DMBT1 encodes the pattern-recognition glycoprotein DMBT1, also known as SALSA, gp340, or salivary agglutinin. DMBT1 binds to a variety of pathogens through tandemly arranged scavenger receptor cysteine-rich (SRCR) domains, the number of which is polymorphic in humans. We show that the signal of balancing selection is driven by one haplotype usually carrying a shorter SRCR repeat and another usually carrying a longer SRCR repeat. DMBT1 encoded by a shorter SRCR repeat allele does not bind a cariogenic and invasive Streptococcus mutans strain, in contrast to the long SRCR allele, which shows binding. Our results suggest that balancing selection at DMBT1 is due to host-microbe interactions of encoded SRCR tandem repeat alleles.
Keywords: biological sciences; evolutionary mechanisms; genetics
Year: 2022 PMID: 35494225 PMCID: PMC9038570 DOI: 10.1016/j.isci.2022.104189
Source DB: PubMed Journal: iScience ISSN: 2589-0042
Figure 1. Evidence for balancing selection at the human DMBT1 gene
The DMBT1 gene is shown in blue, with three tracks above representing Tajima’s D measured from sequenced genomes from three populations (European-Americans from Utah (CEU) in green, Chinese from Beijing (CHB) in blue, Yoruba from Ibadan (YRI) in orange); image taken from the 1000 Genomes selection browser (https://hsb.upf.edu/). Below the DMBT1 gene, the SNP rs11523871 and two CNVs that affect the copy number of the SRCR repeats are shown. Above the tracks showing Tajima’s D are different sources of evidence of balancing selection, namely the beta statistic (Siewert and Voight, 2017), the NCD statistic (purple; Bitarello et al., 2018), trans-specific variants (Leffler et al., 2013), and composite likelihood ratio tests (DeGiorgio et al., 2014).
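For readers unfamiliar with the statistic plotted in the tracks, the following is a minimal sketch of how Tajima’s D can be computed from a set of aligned haplotypes, using the standard textbook formula. It is not the pipeline behind the selection browser; the function name `tajimas_d` and the 0/1 string encoding of haplotypes are assumptions made for illustration only.

```python
from math import sqrt


def tajimas_d(haplotypes):
    """Tajima's D for equal-length '0'/'1' haplotype strings.

    Hypothetical helper for illustration. Returns None when no site is
    segregating, because D is undefined in that case.
    """
    n = len(haplotypes)
    if n < 4:
        # The variance term is zero for n = 2 or 3, so D is undefined.
        raise ValueError("need at least 4 haplotypes")
    length = len(haplotypes[0])
    if any(len(h) != length for h in haplotypes):
        raise ValueError("haplotypes must have equal length")

    # Number of copies of the '1' allele at each site.
    counts = [sum(h[i] == "1" for h in haplotypes) for i in range(length)]
    segregating = [k for k in counts if 0 < k < n]
    s = len(segregating)
    if s == 0:
        return None

    # Mean number of pairwise differences (pi).
    n_pairs = n * (n - 1) / 2
    pi = sum(k * (n - k) for k in segregating) / n_pairs

    # Constants from Tajima (1989).
    a1 = sum(1 / i for i in range(1, n))
    a2 = sum(1 / i**2 for i in range(1, n))
    b1 = (n + 1) / (3 * (n - 1))
    b2 = 2 * (n * n + n + 3) / (9 * n * (n - 1))
    c1 = b1 - 1 / a1
    c2 = b2 - (n + 2) / (a1 * n) + a2 / a1**2
    e1 = c1 / a1
    e2 = c2 / (a1**2 + a2)

    return (pi - s / a1) / sqrt(e1 * s + e2 * s * (s - 1))


# Toy data: two sites at intermediate frequency versus two singletons.
balanced = ["11", "11", "00", "00"]
singletons = ["10", "01", "00", "00"]
print(tajimas_d(balanced), tajimas_d(singletons))
```

In this toy input, the intermediate-frequency sites give a positive D and the singleton sites give a negative D. An excess of intermediate-frequency variants, and hence a positive D, is the pattern expected under balancing selection.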
Figure 2. Association of rs11523871 and DMBT1 SRCR repeat copy number
(A and B) For two populations, CEPH (A) and YRI (B), the distributions of DMBT1 SRCR repeat domain copy numbers associated with the rs11523871-A allele (blue, above the x axis) and rs11523871-C (red, below the x axis) are shown. The y axis shows the number of observations in the two samples (CEPH n = 263, YRI n = 116).
Figure 3. DMBT1 gene expression and rs11523871 genotype
(A) Tissue expression of DMBT1 across 54 tissues, ordered by mean expression level, from RNAseq data. Data and image from the GTEx Portal Locus Browser v.8.
(B) Violin plots show rs11523871 genotype and expression level for the three tissues showing a statistically significant relationship. Median and interquartile range are shown by the white line and grey box.
(C) Boxplots showing rs11523871 genotype and expression level in duodenum from 41 healthy patients, normalized against two different housekeeping genes. Left boxplot shows data normalized to RPLP0 expression; right boxplot shows data normalized to UBC expression. Boxplots indicate median, interquartile range, and range.
DMBT1 protein isoforms and SRCR repeat diploid copy number
| Sample | CNV1 diploid copy number | CNV2 diploid copy number | SRCR repeat diploid copy number | Inferred Secretor status | DMBT1 (gp340) protein isoform | Approximate isoform size | |
|---|---|---|---|---|---|---|---|
| 1 | 2 | 7 | 27 | + | III | 389 | + |
| 2 | 2 | 4 | 24 | − | I | 345 | − |
| 3 | 2 | 2 | 22 | + | I | 345 | − |
| 4 | 2 | 6 | 26 | + | II | 375 | + |
| 5 | 2 | 4 | 24 | + | II | 375 | + |
| 6 | 1 | 4 | 20 | − | IV | 287,345 | − |
| 7 | 2 | 5 | 25 | + | II | 375 | + |
| 8 | 2 | 9 | 29 | + | III | 389 | + |
Protein isoform data from (Eriksson et al., 2007).
See Figure 5.
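The eight rows of the table above are consistent with a simple additive relationship between the two CNV copy numbers and the total SRCR repeat diploid copy number. The sketch below checks that relationship against the tabulated values. The constants (a baseline of 12, four repeats per CNV1 copy, one repeat per CNV2 copy) are inferred here from the rows themselves and are not stated in the source; the function name `inferred_srcr_copies` is hypothetical.

```python
# Rows copied from the table: (sample, CNV1, CNV2, SRCR diploid copy number).
TABLE_ROWS = [
    (1, 2, 7, 27),
    (2, 2, 4, 24),
    (3, 2, 2, 22),
    (4, 2, 6, 26),
    (5, 2, 4, 24),
    (6, 1, 4, 20),
    (7, 2, 5, 25),
    (8, 2, 9, 29),
]


def inferred_srcr_copies(cnv1, cnv2):
    """Total diploid SRCR copies under the inferred additive model.

    Assumption: constants 12, 4 and 1 are fitted to the rows above,
    not taken from the paper.
    """
    return 12 + 4 * cnv1 + cnv2


# Collect any rows where the inferred model disagrees with the table.
mismatches = [
    row for row in TABLE_ROWS
    if inferred_srcr_copies(row[1], row[2]) != row[3]
]
print("rows not explained by the model:", mismatches)
```

An empty mismatch list means the additive model reproduces every row shown. It does not establish that the same constants hold for samples outside this table.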
Figure 5. Differential binding of S. mutans by DMBT1 isoforms in saliva
Blots of SDS-PAGE gels with saliva from different individuals with DMBT1 size isoforms I–IV.
(A and B) Blots were (A) probed using a biotinylated S. mutans SpaP A, Cnm strain and (B) probed with DMBT1-specific antibodies. The position of DMBT1 on both blots is indicated. Note that DMBT1 isoform size differences are not seen, as they are not resolved at the SDS-PAGE gel density used before the blotting.
Figure 4. Identification of transcripts spanning the SRCR repeats in DMBT1
The DMBT1 allele from the genome assembly, with 14 tandemly arranged SRCR repeats highlighted in blue, is shown at the top of the figure, with GRCh38 coordinates as a scale immediately underneath. The sequence alignments of single-molecule sequencing reads mapping to the DMBT1 gene are shown below, and at the bottom is the general feature format (GFF) file derived from the sequence alignments. In the GFF image, black boxes indicate complete SRCR repeats, with the total number of tandemly repeated SRCR repeats for each transcript highlighted in red to the left of that transcript.
| REAGENT or RESOURCE | SOURCE | IDENTIFIER |
|---|---|---|
| Anti-DMBT1 | D. Malamud, University of Pennsylvania | mAb143 |
| Sample collection, Biobank Department of Odontology, Umeå University | 472 | |
| YRI Genomic DNA samples | Coriell cell repositories | MGP00013 |
| CEU DNA samples from three generation pedigrees | CEPH | n/a |
| Streptavidin-POD | GE Healthcare | L1058765 |
| NHS-LC-biotin | Pierce | 21336P |
| Superscript III First Strand Synthesis Supermix Kit | Thermo Fisher | 18080400 |
| Primers and TaqMan hydrolysis probes for ddPCR of UBC, RPLP0 and DMBT1 | Biorad | 10031276 and 10031279 |
| 2x ddPCR Supermix for Probe (no dUTP) | Biorad | 1863024 |
| sequence-specific cDNA-PCR Sequencing kit | Oxford Nanopore | SQK-PCS-109 |
| Maxwell 16 Cell DNA purification kit | Promega | AS1020 |
| Maxwell 16 LEV Simply-RNA Cell kit | Promega | AS1270 |
| Human Genome Diversity Project variation vcf files | Wellcome Trust Sanger Institute | ftp.sanger.ac.uk |
| GTEx RNAseq expression data | gtexportal.org | |
| NCI-H292 cell line | European collection of authenticated cell cultures | 91091815 |
| 5' [PHOS] ACTTGCCTGTCGCTCTATCTT | This paper | |
| 5’TCAGTGATGGTGAATGTTTGTCA-3’ | This paper | |
| 5’GACCTTACCTTCTGCTACAGTCGG-3’ | This paper | |
| 5’TGTGAGTGATTTATTTCGGCATTC-3’ | This paper | |
| 5’GACCTTACCTTCTGCTACAGTCGA-3’ | This paper | |
| 5′ATTGATTCACTTCACGGATCAAG 3′ | This paper | Positive control |
| 5′TCTAAGAAATTCCCATGACAGGT 3′ | This paper | Positive control |
| ONT Guppy v3.3.3 | ||
| NanoPlot v.1.32.1 | ||
| Pychopper | ||
| Minimap2 v2.17 | ||
| SAMTools v1.9 | ||
| SHAPEIT2v837 | ||