| Literature DB >> 18366708 |
Clare C Constantine1, Lyle C Gurrin, Christine E McLaren, Melanie Bahlo, Gregory J Anderson, Chris D Vulpe, Susan M Forrest, Katrina J Allen, Dorota M Gertig.
Abstract
BACKGROUND: We report our experience of selecting tag SNPs in 35 genes involved in iron metabolism in a cohort study seeking to discover genetic modifiers of hereditary hemochromatosis.Entities:
Mesh:
Substances:
Year: 2008 PMID: 18366708 PMCID: PMC2289803 DOI: 10.1186/1471-2350-9-18
Source DB: PubMed Journal: BMC Med Genet ISSN: 1471-2350 Impact factor: 2.103
HealthIron SNPs selected for typing by gene, incomplete cover means not all tag SNPs were included.
| No. SNPs | HUGO | Full or other name(s) | Chr | |
| 16 | * | ACO1 | IRP1; IRE-binding protein; Aconitase 1 | 9p21.2 |
| 1 | CALR | Calreticulin | 19p13.13 | |
| 14 | CD163 | Hemoglobin scavenger receptor | 12p13.31 | |
| 25 | CP | Ceruloplasmin | 3q25.1 | |
| 29 | CUBN | Cubulin (incomplete cover) | 10p13 | |
| 27 | * | CYBRD1 | Dcytb | 2q31.1 |
| 6 | DHCR7 | Smith-Lemli-Opitz syndrome | 11q13.4 | |
| 3 | EXOC6 | SEC15 (incomplete cover) | 10q23.33 | |
| 16 | FLVCR | Feline leukemia virus subgroup C receptor | 1q32.3 | |
| 7 | * | FTH1 | H-ferritin | 11q12.3 |
| 1 | FTL | L-Ferritin | 19q13.33 | |
| 7 | FXN | Frataxin | 9q21.11 | |
| 2 | GAST | Gastrin | 17q21.2 | |
| 1 | GSTP1 | GSTP1 I105V | 11q13.2 | |
| 5 | * | HAMP | Hepcidin | 19q13.12 |
| 5 | * | HCP1 | MGC9564, heme carrier protein 1 | 17q11.2 |
| 18 | * | HEPH | Hephaestin | Xq12 |
| 9 | HEPHL1 | Hephaestin-like 1; Eleutherin | 11q21 | |
| 7 | HFE | Hemochromatosis | 6p22.2 | |
| 4 | * | HFE2 | Hemojuvelin | 1q25 |
| 10 | HMOX1 | HO-1Hemoxygenase 1 | 22q12.3 | |
| 2 | HMOX2 | HO-2 Hemoxygenase 2 | 16p13.3 | |
| 2 | HP | Haptoglobin | 16q22.2 | |
| 6 | HPX | Hemopexin | 11p15.4 | |
| 18 | * | IREB2 | IRP2 IRE-binding protein 2 | 15q24.1 |
| 1 | * | PGRMC1 | Progesterone receptor membrane component 1 | Xq24 |
| 3 | * | PGRMC2 | Progesterone receptor membrane component 2 | 4q28.2 |
| 13 | SLC11A2 | Divalent metal-ion transporter 1; DCT1 | 12q13.12 | |
| 20 | SLC25A37 | Frascati | 8p21.2 | |
| 12 | SLC40A1 | Ferroportin; IREG1; MTP1 | 2q32.2 | |
| 11 | STEAP3 | nm1058 (incomplete cover) | 2q14.2 | |
| 45 | * | TF | Transferrin | 3q22.1 |
| 6 | * | TFR2 | Transferrin receptor 2 | 7q22.1 |
| 30 | * | TFRC | Transferrin receptor 1 | 3q29 |
| 2 | TNF | TNFalpha | 6p21.33 | |
| 384 |
* indicates genes where NHBLI RS&G Caucasian data was used.
Figure 1Scatter plot of SNP validation scores with Gentrain scores. Previously successful SNPs are given a score of 1.1, design scores between 0 and 1 are calculated by a proprietary algorithm based on the surrounding 200 bp.
Comparison of number of tag SNPs generated by LDselect and Tagger in Haploview using Caucasian datasets (HAPMAP, NHLBI, Seattle SNPs). A "test" in Tagger refers to a combination of one to three tag SNPs used to capture one or more SNPs. The number of tests is the number of tag SNPs combinations required to capture all SNPs with a minimum value of r2 of 80%. HapMap Phase 1 data was all that was available at the time of SNP selection.
| HapMap | 27 | 13/15 | 15 | 2 | 0.48 | 0.56 | |
| HapMap | 11 | 8/8 | 8 | 0 | 0.73 | 0.73 | |
| HapMap | 10 | 5/5 | 5 | 0 | 0.50 | 0.50 | |
| NHLBI RS&G | 62 | 14/14 | 16 | 2 | 0.23 | 0.26 | |
| HapMap | 17 | 6/8 | 8 | 2 | 0.35 | 0.47 | |
| NHLBI RS&G | 99 | 16/19 | 17 | 1 | 0.16 | 0.17 | |
| NHLBI RS&G | 13 | 5/5 | 5 | 0 | 0.38 | 0.38 | |
| NHLBI RS&G | 7 | 4/4 | 4 | 0 | 0.57 | 0.57 | |
| NHLBI RS&G | 6 | 3/3 | 3 | 0 | 0.50 | 0.50 | |
| HapMap | 40 | 6/6 | 6 | 0 | 0.15 | 0.15 | |
| HapMap | 12 | 4/5 | 5 | 1 | 0.33 | 0.42 | |
| HapMap | 13 | 4/4 | 4 | 0 | 0.31 | 0.31 | |
| HapMap | 6 | 3/4 | 4 | 1 | 0.50 | 0.67 | |
| HapMap | 11 | 7/7 | 7 | 0 | 0.64 | 0.64 | |
| NHLBI RS&G | 43 | 17/20 | 20 | 3 | 0.40 | 0.47 | |
| SeattleSNPs | 101 | 29/31 | 31 | 2 | 0.29 | 0.31 | |
| NHLBI RS&G | 6 | 4/4 | 4 | 0 | 0.67 | 0.67 | |
| HapMap | 12 | 6/6 | 6 | 0 | 0.50 | 0.50 | |
| NHLBI RS&G | 117 | 33/38 | 36 | 3 | 0.28 | 0.31 | |
Effect of varying r2 on number of tagSNPs selected using Tagger and MAF≥3% for six genes using NHLBI resequencing data. First number is tagSNPs using 2 and 3 multi-marker SNP tagging, second number is number of tests (the number of unique combinations of tag SNPs required to represent all SNPs).
| r2 | ||||||
| 0.50 | 10/10 | 9/9 | 19/21 | 8/8 | 10/11 | 22/22 |
| 0.55 | 10/10 | 10/10 | 20/21 | 9/9 | 11/11 | 22/22 |
| 0.60 | 11/11 | 12/14 | 21/23 | 10/10 | 11/11 | 25/25 |
| 0.65 | 12/13 | 14/16 | 22/23 | 10/11 | 11/11 | 26/26 |
| 0.70 | 14/16 | 15/15 | 22/23 | 11/12 | 11/11 | 27/28 |
| 0.75 | 16/18 | 15/15 | 23/25 | 13/14 | 12/12 | 29/29 |
| 0.80 | 17/19 | 15/15 | 23/26 | 15/17 | 12/12 | 33/36 |
| 0.85 | 19/20 | 16/16 | 26/27 | 16/18 | 12/12 | 37/38 |
| 0.90 | 19/21 | 18/21 | 27/32 | 17/18 | 12/12 | 42/43 |
| 0.95 | 22/23 | 21/25 | 31/35 | 19/20 | 13/13 | 46/51 |
| 1.00 | 23/36 | 21/25 | 34/37 | 22/27 | 14/15 | 50/52 |
Number of tag SNPs from NHLBI resequencing versus HapMap data (Phase 1) for Europeans using MAF ≤ 3% and r2 0.8.
| Gene | NHLBI RS&G | HapMap Phase 1 |
| 26 | 15 | |
| 16 | 5 | |
| 17 | 8 | |
| 5 | 0 | |
| 4 | 1 | |
| 3 | 1 | |
| 13 | 6 | |
| 5 | 0 | |
| 11 | 5 | |
| 5 | 1 | |
| 2 | 2 | |
| 20 | 7 | |
| 4 | 2 | |
| 36 | 6 |
Figure 2Regions sequenced in three resequencing Caucasian data sets: (i) HealthIron in red; (ii) NHLBI RS&G in green; (iii) SeattleSNPs in black. The HapMap Phase 1 Caucasian (European) SNPs with MAF ≥ 3% rs numbers are shown. The TF gene appears in blue with the exons shown as bars. The arrows indicated the direction of transcription.
The number of SNPs identified for the transferrin gene from the four different data sources with different coverage. Figure 1 shows the coverage of each data set, N is the number of Caucasian individuals sequenced/typed.
| Resequencing | HapMap | HapMap | |||
| HealthIron N = 188 | NHLBI RS&G N = 47 | SeattleSNPs N = 23 | N = 30 trios Phase 1 | N = 30 trios Mar 2006 | |
| Total No. SNPs | 31 | 59 | 128 | 12 | 81 |
| No. SNPs ≥ 3% MAF | 17 | 43 | 101 | 11 | 38 |
| Tag SNPs with Tagger | 13 | 20 | 31 | 7 | 17 |
Figure 3Frequency distribution of captured and uncaptured SNPs from Seattle resequencing of . The large number of captured SNPs in the 40–45% range represents the strong block of LD which is captured by a single tagSNP.
Figure 4A graphical representation of linkage disequilibrium patterns for the transferrin gene from SNP data on Caucasian populations: (a) HapMap (11 SNPs with MAF ≥ 3%); (b) HealthIron (12 SNPs); (c) NHLBI RS&G (43 SNPs); (d) SeattleSNPs (101 SNPs). These LD displays were generated using the default settings in HaploView.
Number of tag SNPs/total SNPs with MAF ≥ 3% (ratio) in each of five populations using LDSelect with NHLBI resequencing data. The last column shows number of tag SNPs and total SNPs combined across the five populations using multiPopTagSelect.
| Gene | European N = 47 | Hipanic N = 48 | Chinese N = 47 | African American N = 24 | Yoruban N = 24 | Multiple |
| 26/58 (0.45) | 29/56 (0.52) | 28/57 (0.49) | 69/111 (0.62) | 65/110 (0.59) | 97/145 (0.67) | |
| 15/62 (0.24) | 20/58 (0.34) | 17/62 (0.27) | 28/57 (0.49) | 34/65 (0.52) | 46/85 (0.54) | |
| 17/99 (0.17) | 11/75 (0.15) | 8/83 (0.10) | 18/82 (0.22) | 18/91 (0.20) | 27/114 (0.24) | |
| 5/13 (0.38) | 10/17 (0.59) | 4/7 (0.57) | 11/19 (0.58) | 8/14 (0.57) | 17/27 (0.63) | |
| 4/7 (0.57) | 4/7 (0.57) | 7/10 (0.70) | 8/10 (0.80) | 9/11 (0.82) | 11/15 (0.73) | |
| 3/6 (0.50) | 5/9 (0.56) | 4/9 (0.44) | 11/16 (0.69) | 11/15 (0.73) | 17/22 (0.77) | |
| 13/38 (0.34) | 10/36 (0.28) | 9/10 (0.90) | 16/58 (0.28) | 21/53 (0.40) | 49/95 (0.52) | |
| 5/7 (0.71) | 5/8 (0.63) | 4/7 (0.57) | 8/10 (0.80) | 5/7 (0.71) | 10/13 (0.77) | |
| 11/52 (0.21) | 11/47 (0.23) | 15/63 (0.24) | 21/94 (0.22) | 18/95 (0.19) | 42/134 (0.31) | |
| 5/10 (0.50) | 2/2 (1.00) | 0/0 | 8/11 (0.73) | 10/13 (0.77) | 15/19 (0.79) | |
| 2/4 (0.50) | 4/8 (0.50) | 7/9 (0.78) | 10/13 (0.77) | 11/17 (0.65) | 16/22 (0.73) | |
| 20/44 (0.45) | 24/50 (0.48) | 17/49 (0.35) | 34/61 (0.56) | 42/61 (0.69) | 61/87 (0.70) | |
| 4/6 (0.67) | 3/5 (0.60) | 5/7 (0.71) | 5/5 (1.00) | 6/7 (0.86) | 8/9 (0.89) | |
| 36/119 (0.30) | 34/115 (0.30) | 22/93 (0.24) | 43/135 (0.32) | 44/110 (0.40) | 87/187 (0.47) | |
| Av No. tags | 11.9 | 12.3 | 10.5 | 20.7 | 21.6 | 35.9 |
Figure 5Pattern of linkage disequilibrium across six genes and five population samples using Haploview default settings (with blocks removed).
Number of SNPs on different genotyping arrays within each gene region of NHLBI resequencing. Note that in all cases the entire region was not resequenced and so the first two columns are conservative and refer only to the sample of 47 people of European descent.
| Gene | NHLBI reseq total SNPs | NHLBI tagSNPs MAF≥3% r2 0.8 | Region included in basepairs | Affy 100K | Affy 500K | Affy GW5 | Affy GW6 | Illumina Humhap 300 | Illumina Humhap 550 | Illumina Human 1M |
| 276 | 26 | 69937 | 1 | 11 | 12 | 24 | 14 | 26 | 41 | |
| 139 | 16 | 39535 | 0 | 3 | 3 | 8 | 7 | 12 | 19 | |
| 185 | 17 | 40442 | 4 | 6 | 6 | 15 | 11 | 12 | 18 | |
| 43 | 4 | 6314 | 0 | 2 | 2 | 3 | 1 | 1 | 4 | |
| 65 | 3 | 11169 | 0 | 0 | 0 | 0 | 1 | 2 | 7 | |
| 177 | 13 | 101478 | 1 | 11 | 10 | 32 | 5 | 7 | 21 | |
| 38 | 5 | 7437 | 0 | 1 | 1 | 2 | 0 | 1 | 5 | |
| 241 | 11 | 64450 | 1 | 15 | 11 | 21 | 3 | 7 | 24 | |
| 33 | 5 | 12051 | 0 | 0 | 0 | 1 | 1 | 1 | 7 | |
| 55 | 2 | 21640 | 0 | 1 | 1 | 4 | 2 | 3 | 9 | |
| 158 | 20 | 33189 | 0 | 23 | 18 | 37 | 8 | 11 | 23 | |
| 56 | 4 | 21649 | 0 | 0 | 0 | 1 | 3 | 3 | 13 | |
| 314 | 36 | 36402 | 2 | 4 | 4 | 10 | 7 | 8 | 14 |