| Literature DB >> 27230431 |
Jose Luis Acosta1,2, Alma Cristal Hernández-Mondragón3, Laura Carolina Correa-Acosta3, Sandra Nathaly Cazañas-Padilla3, Berenice Chávez-Florencio3, Elvia Yamilet Ramírez-Vega3, Tulia Monge-Cázares3, Carlos A Aguilar-Salinas4, Teresa Tusié-Luna5, Laura Del Bosque-Plata6.
Abstract
BACKGROUND: Genetic variations of the TCF7L2 gene are associated with the development of Type 2 diabetes (T2D). The associated mutations have demonstrated an adaptive role in some human populations, but no studies have determined the impact of evolutionary forces on genetic diversity in indigenous populations from Mexico. Here, we sequenced and analyzed the variation of the TCF7L2 gene in three Amerindian populations and compared the results with whole-exon-sequencing of Mestizo populations from Sigma and the 1000 Genomes Project to assess the roles of selection and recombination in diversity.Entities:
Keywords: Genetic association; Recombination hotspots; Sweeps selection; TCF7L2 gene; Type 2 diabetes
Mesh:
Substances:
Year: 2016 PMID: 27230431 PMCID: PMC4880969 DOI: 10.1186/s12863-016-0372-7
Source DB: PubMed Journal: BMC Genet ISSN: 1471-2156 Impact factor: 2.797
Mutations identified in the Indigenous population and shared with the Sigma and 1000 Genome Projects. * Frequencies sorted by cases: controls; cases are defined as patients with type 2 diabetes
| 1000 Genomes Project | Sigma Project | Indigenous people | ||||||||||
|---|---|---|---|---|---|---|---|---|---|---|---|---|
| MAF | Genotypes | MAF | Genotypesa | Variant | Gene | Gene coordenate | dbSNP138 | Chr | Start | End | Ref | Obs |
| 0.1984 | 2007(G/G)/451(G/T)/46(T/T) | Intronic 6 | Gene | 191193 | rs56913138 | Chr10 | 114901201 | 114901201 | G | T | ||
| 0.000399 | 2503(G/G)/1(G/T)/0(T/T) | Intronic 6 | Gene | 193976 | rs188695269 | Chr10 | 114903984 | 114903984 | G | A | ||
| 0.992412 | 19(T/T)/303(T/C)/2182(C/C) | Intronic 9 | Gene | 201071 | rs176632 | Chr10 | 114911079 | 114911079 | T | C | ||
| 0.002396 | 2498(C/C)/6(C/G)/0(G/G) | Intronic 9 | Gene | 209587 | Chr10 | 114919595 | 114919595 | C | T | |||
| 0.129793 | 2179(C/C)/301(C/A)/24(A/A) | 0.0896552 | C/C = 1564:1618 C/A = 331:320 A/A = 10:16 | Exonic 15 | Gene | 215361 | rs77961654 | Chr10 | 114925369 | 114925369 | C | A |
| 0.000985222 | C/C = 1901:1950 C/G = 4:4 | Exonic 15 | Gene | 215363 | rs61724286 | Chr10 | 114925371 | 114925371 | C | G | ||
aGenotypes
Frequencies sorted cases: controls, the cases defined by patients with type 2 diabetes
Novel mutations identified in the Indigenous population
| Indigenous population | ||||||||
|---|---|---|---|---|---|---|---|---|
| Variant | Gene | Gene coordenate | dbSNP138 | Chr | Start | End | Ref | Obs |
| Intronic 4 | TCF7L2 | 14402 | Chr10 | 114724410 | 114724410 | T | A | |
| Intronic 4 | TCF7L2 | 14404 | Chr10 | 114724412 | 114724412 | G | C | |
| Intronic 6 | TCF7L2 | 191411 | Chr10 | 114901419 | 114901419 | A | T | |
| Intronic 6 | TCF7L2 | 191412 | Chr10 | 114901420 | 114901420 | T | G | |
| Intronic 6 | TCF7L2 | 193490 | Chr10 | 114903498 | 114903498 | A | C | |
| Intronic 6 | TCF7L2 | 193491 | Chr10 | 114903499 | 114903499 | C | A | |
| Intronic 7 | TCF7L2 | 193908 | Chr10 | 114903916 | 114903916 | C | A | |
| Intronic 7 | TCF7L2 | 193913 | Chr10 | 114903921 | 114903921 | T | A | |
| Intronic 7 | TCF7L2 | 193940 | Chr10 | 114903948 | 114903948 | G | C | |
| Intronic 7 | TCF7L2 | 193951 | Chr10 | 114903959 | 114903959 | G | T | |
| Intronic 7 | TCF7L2 | 193967 | Chr10 | 114903975 | 114903975 | G | A | |
| Intronic 7 | TCF7L2 | 193975 | Chr10 | 114903983 | 114903983 | G | T | |
| Intronic 7 | TCF7L2 | 193982 | Chr10 | 114903990 | 114903990 | C | A | |
| Intronic 7 | TCF7L2 | 193987 | Chr10 | 114903995 | 114903995 | G | C | |
| Intronic 7 | TCF7L2 | 193988 | Chr10 | 114903996 | 114903996 | C | A | |
| Intronic 7 | TCF7L2 | 193995 | Chr10 | 114904003 | 114904003 | G | C | |
| Intronic 7 | TCF7L2 | 194007 | Chr10 | 114904015 | 114904015 | T | A | |
| Intronic 7 | TCF7L2 | 194012 | Chr10 | 114904020 | 114904020 | G | T | |
| Intronic 7 | TCF7L2 | 195524 | Chr10 | 114905532 | 114905532 | C | G | |
| Intronic 11 | TCF7L2 | 207481 | Chr10 | 114917489 | 114917489 | G | C | |
| Intronic 11 | TCF7L2 | 207483 | Chr10 | 114917491 | 114917491 | T | A | |
| Intronic 11 | TCF7L2 | 207485 | Chr10 | 114917493 | 114917493 | T | C | |
| Intronic 11 | TCF7L2 | 207486 | Chr10 | 114917494 | 114917494 | G | A | |
| Intronic 11 | TCF7L2 | 207487 | Chr10 | 114917495 | 114917495 | G | A | |
| Intronic 11 | TCF7L2 | 207488 | Chr10 | 114917496 | 114917496 | G | C | |
| Intronic 11 | TCF7L2 | 207492 | Chr10 | 114917500 | 114917500 | G | T | |
| Intronic 11 | TCF7L2 | 207495 | Chr10 | 114917503 | 114917503 | T | G | |
| Intronic 11 | TCF7L2 | 207497 | Chr10 | 114917505 | 114917505 | G | C | |
| Intronic 11 | TCF7L2 | 207499 | Chr10 | 114917507 | 114917507 | C | A | |
| Intronic 11 | TCF7L2 | 207500 | Chr10 | 114917508 | 114917508 | A | T | |
| Intronic 11 | TCF7L2 | 207501 | Chr10 | 114917509 | 114917509 | A | G | |
| Intronic 11 | TCF7L2 | 207502 | Chr10 | 114917510 | 114917510 | T | A | |
| Intronic 11 | TCF7L2 | 207504 | Chr10 | 114917512 | 114917512 | G | A | |
| Intronic 13 | TCF7L2 | 209536 | Chr10 | 114919544 | 114919544 | A | T | |
| Intronic 13 | TCF7L2 | 209555 | Chr10 | 114919563 | 114919563 | T | A | |
| Intronic 13 | TCF7L2 | 209562 | Chr10 | 114919570 | 114919570 | C | T | |
| Intronic 13 | TCF7L2 | 209571 | Chr10 | 114919579 | 114919579 | A | G | |
| Intronic 15 | TCF7L2 | 215224 | Chr10 | 114925232 | 114925232 | A | T | |
| Intronic 15 | TCF7L2 | 215229 | Chr10 | 114925237 | 114925237 | G | T | |
| Intronic 15 | TCF7L2 | 215230 | Chr10 | 114925238 | 114925238 | G | C | |
Fig. 1Dynamics of selective sweeps, recombination rate and linkage disequilibrium (LD) in the chr10:113925369–115925369 region (a) and (b), The y axis on the left side is the negative logarithm of the p-value of association of case-controls of the Sigma Project samples. The y axis on right side is the recombination rate (cM/Mb) calculated by the 1000 Genomes Project. The recombination peaks are represented by the solid blue line. r2 was 0–1 and was calculated for the African population only. The color spectrum of the spheres is a function of the r2 value. (c) The first segment is the genomic coordinates of chromosome 10 based on mutations identified in the Indigenous population and the selective sweep peaks determined only from the Sigma Project samples. The dashed black circles indicate overlap of the recombination hotspots with the selective sweep peaks. The second segment represent the location in chromosome 10 of the hits of the Sigma Project mutations against the GWAS databases. The third segment includes mutations associated with T2D. Red indicates associated mutations, and blue indicates mutations inside the limit cutoff of the Manhattan plot. The fourth segment is the genomic context of the analyzed region, with the exons and introns of each gene. The x axis shows the coordinates of chromosome 10
Fig. 2Distribution and patterns of SNPs in the TCF7L2 gene. At the top is the x axis, which comprises the coordinates of chromosome 10. a) Distribution of SNPs identified in the Indigenous population along the TCF7L2 gene; mutations identified in the Sigma Project and UCSC gene description, i.e., the number of exons and introns. b) Human mRNAs, the location of mRNA along the TCF7L2 gene; Splice ETs, chromosomal position of Human ETs with splice; Human ETs, distribution of human Ets; Poly(A) sites, dispersion of sites associated along a tissue-specific region; UniGene, alignments of all mRNA and EST tracks from TCF7L2 transcription; TFBS (Transcription Factor Binding Sites) conserved, in the figure, each box represents one site conserved in the human/mouse/rat alignment; TargetScan miRNA Regulatory Sites, this track shows conserved mammalian microRNA regulatory target sites in the 3’ UTR regions of TCF7L2. c) Genomic location of variants used in the identification of selective sweeps in the Neanderthal genome. Dashed red circles indicate mutations that are under selective sweeps. d) Genetic differentiation and patterns of variation in the TCF7L2 gene identified in the 1000 Genomes Project for all populations
Fig. 3Network median joining. Haplotype network based on the sequences of variants (rs7903146, rs7896811, rs11196199, rs114863326, rs11196203, rs72826075 and rs35011184) of the samples of the 1000 Genomes Project associated with T2D. The nodes are the haplotype profiles and mv-nodes are the missing haplotypes. The outgroup included nucleotides from chimpanzee, root, macaque and orangutan. Each color indicates a possible haplotype profile; the color was randomly assigned. The size of the sphere is related to the frequency of the haplotype patterns. The dotted blue line includes the haplotypes from the African population, and the dotted green line contains the haplotype patterns of the European population. The bottom left of the figure shows the populations included in the network. The bottom right lists the mutations associated with T2D from the Sigma Project.
Fig. 4NeighborNet. Network analysis was based on 6061 mutations in the TCF7L2 gene identified from the samples of the 1000 Genomes Project. The dashed red circles indicate the reticulations between the nodes. The letters a, b and c are clusters with phylogenetic coherence. The scale is located at the bottom of the network and shows their substitutions by site