| Literature DB >> 35205228 |
Daryn A Stover1,2, Genevieve Housman3, Anne C Stone4, Michael S Rosenberg5, Brian C Verrelli1,5.
Abstract
Bone strength and the incidence and severity of skeletal disorders vary significantly among human populations, due in part to underlying genetic differentiation. While clinical models predict that this variation is largely deleterious, natural population variation unrelated to disease can go unnoticed, altering our perception of how natural selection has shaped bone morphologies over deep and recent time periods. Here, we conduct the first comparative population-based genetic analysis of the main bone structural protein gene, collagen type I α 1 (COL1A1), in clinical and 1000 Genomes Project datasets in humans, and in natural populations of chimpanzees. Contrary to predictions from clinical studies, we reveal abundant COL1A1 amino acid variation, predicted to have little association with disease in the natural population. We also find signatures of positive selection associated with intron haplotype structure, linkage disequilibrium, and population differentiation in regions of known gene expression regulation in humans and chimpanzees. These results recall how recent and deep evolutionary regimes can be linked, in that bone morphology differences that developed among vertebrates over 450 million years of evolution are the result of positive selection on subtle type I collagen functional variation segregating within populations over time.Entities:
Keywords: BMD; COL1A1; adaptation; bone disease; exon duplication; osteoporosis; type I collagen
Mesh:
Substances:
Year: 2022 PMID: 35205228 PMCID: PMC8871609 DOI: 10.3390/genes13020183
Source DB: PubMed Journal: Genes (Basel) ISSN: 2073-4425 Impact factor: 4.141
Figure 1phyloP conservation scores plotted across human COL1A1: phyloP scores are shown for amino acid mutations in disease severity categories 1 (least severe) and 4 (most severe), and in the 1000 Genomes Project dataset across different protein domains (see Table S2). Mutations noted in “red” occur at glycine residues, whereas mutations noted in “blue” occur at non-glycine residues. As phyloP deviates from “0”, the positive and negative scores reflect evolutionary conservation and acceleration, respectively, compared to a neutral model.
Figure 2COL1A1 patterns of variation in humans and chimpanzees: The ~17-kb COL1A1 locus on chr17 (hg19 assembly) is shown with coding regions as black boxes interspersed with introns. (a) phyloP vertebrate track conservation scores for 50 introns (mean and SE plotted at their midpoints). As scores deviate from “0”, the positive and negative scores reflect evolutionary conservation and acceleration, respectively, at nucleotide sites compared to a neutral model. (b) Plot of intronic SNP pairwise F values between human populations of African and East Asian ancestry. (c) H3K27Ac marks, associated with regulatory elements, with plot reflecting the density calculated as the number of sequenced H3K27Ac tags overlapping a 25 bp window centered at that position. (d) Plot of the 45 intron SNPs fixed between the two core haplogroups in 40 western chimpanzee sequences.
Figure 3Global pattern of linkage disequilibrium in human COL1A1: Linkage disequilibrium pairwise plot of 39 non-coding SNPs (MAF > 5%) in the phased 1000 Genomes Project dataset is plotted as a function of nucleotide distance (see Table S16 for SNP numbering). The ~17 kb COL1A1 locus on chr17 (hg19 assembly) is shown with coding regions as black boxes interspersed with introns. Shaded boxes in red reflect the strength of pairwise correlations among SNPs.
Figure 4Neighbor-joining tree of 118 chimpanzee COL1A1 haplotypes: Haplotypes and their relative phylogenetic positions for each of the four subspecies are shown as color blocks, with bonobo as an outgroup. Evolutionary distances reflect the number of base substitutions per variable site.