| Literature DB >> 21387465 |
Robert J Hardwick1, Lee R Machado, Luciana W Zuccherato, Suzanne Antolinos, Yali Xue, Nyambura Shawa, Robert H Gilman, Lilia Cabrera, Douglas E Berg, Chris Tyler-Smith, Paul Kelly, Eduardo Tarazona-Santos, Edward J Hollox.
Abstract
Beta-defensins are a family of multifunctional genes with roles in defense against pathogens, reproduction, and pigmentation. In humans, six beta-defensin genes are clustered in a repeated region which is copy-number variable (CNV) as a block, with a diploid copy number between 1 and 12. The role in host defense makes the evolutionary history of this CNV particularly interesting, because morbidity due to infectious disease is likely to have been an important selective force in human evolution, and to have varied between geographical locations. Here, we show CNV of the beta-defensin region in chimpanzees, and identify a beta-defensin block in the human lineage that contains rapidly evolving noncoding regulatory sequences. We also show that variation at one of these rapidly evolving sequences affects expression levels and cytokine responsiveness of DEFB103, a key inhibitor of influenza virus fusion at the cell surface. A worldwide analysis of beta-defensin CNV in 67 populations shows an unusually high frequency of high-DEFB103-expressing copies in East Asia, the geographical origin of historical and modern influenza epidemics, possibly as a result of selection for increased resistance to influenza in this region.Entities:
Mesh:
Substances:
Year: 2011 PMID: 21387465 PMCID: PMC3263423 DOI: 10.1002/humu.21491
Source DB: PubMed Journal: Hum Mutat ISSN: 1059-7794 Impact factor: 4.878
Figure 1Evolutionary analysis of assembled beta-defensin copy number blocks. A: Recombination network of DNA sequences from a 3.2-kb region within the beta-defensin CNV block. The scale indicates number of nucleotide differences. B: Regions of the beta-defensin copy number repeat showing unusually high divergence of clade II. Peaks indicate regions that showed a divergence/diversity ratio in the top 5%.
Figure 2Characterization of DEFB103 promoters from different clades. Promoter efficiency of 1.9 kb upstream region of DEFB103 in HaCat keratinocyte epithelial cell lines, with pGL3 empty vector as negative control, and pGL3 with SV40 enhancer as a positive control, using a luciferase reporter gene assay. Three promoters, two from clade I corresponding to the two sequences assembled in hg18 human genome, and one from clade II, were tested under three conditions: nonstimulated (NS), interferon-gamma stimulation (IFN-gamma) and Escherichia coli lipopolysacharride stimulation (LPS). Statistically significant differences in expression between clades are shown, after correction for multiple comparisons (*P<0.05, **P<0.01).
Figure 3Principal component analysis of global variation of beta-defensin diploid copy number. A: Frequency distribution of worldwide beta-defensin copy number. B: Biplot of the first two principal components of beta-defensin copy number variation data, representing 76% of the total variation in the dataset. Outlier populations are highlighted. C: Frequency distributions of beta-defesin copy number in outlier populations.
Outlier Populations in Copy Number Distribution
| Population | Copy number frequency difference | Significance ( |
|---|---|---|
| Karitiana | 4 copy increase | <0.0001 |
| Kalash | 6 copy increase | 0.0024 |
| Bantu | 6 copy increase | 0.0061 |
| Quechua | 4 copy decrease | 0.0071 |
| Pashan | 4 copy increase | 0.0086 |
| Mbuti | 6 copy increase | 0.0271 |
Outliers were identified visually from principal component analysis (Fig. 3B) and particular changes in copy number frequency identify. Significance was assessed by a Monte Carlo randomization test against the null hypothesis that the difference is due to random sampling from the larger continental population.
Frequency of Clade II Copies in Different Continental Groups
| Continent | Number of individuals | Clade I copy count | Clade II copy count | Clade II frequency |
|---|---|---|---|---|
| Sub-Saharan Africa | 268 | 1,174 | 66 | 0.05 |
| America | 332 | 1,256 | 128 | 0.09 |
| Central-South Asia | 230 | 839 | 158 | 0.16 |
| East Asia | 515 | 1,676 | 458 | 0.21 |
| Europe | 206 | 769 | 132 | 0.15 |
Figure 4Global distribution map of clade II copy frequency. Frequency of clade II copies is shown as a contour map with frequency intervals colored according to the legend on the right of the map, and sampling locations shown as crosses (see also Supp. Table S1).