Literature DB >> 21179114

A genome-wide admixture scan for ancestry-linked genes predisposing to sarcoidosis in African-Americans.

B A Rybicki1, A M Levin, P McKeigue, I Datta, C Gray-McGuire, M Colombo, D Reich, R R Burke, M C Iannuzzi.   

Abstract

Genome-wide linkage and association studies have uncovered variants associated with sarcoidosis, a multiorgan granulomatous inflammatory disease. African ancestry may influence disease pathogenesis, as African-Americans are more commonly affected by sarcoidosis. Therefore, we conducted the first sarcoidosis genome-wide ancestry scan using a map of 1384 highly ancestry-informative single-nucleotide polymorphisms genotyped on 1357 sarcoidosis cases and 703 unaffected controls self-identified as African-American. The most significant ancestry association was at marker rs11966463 on chromosome 6p22.3 (ancestry association risk ratio (aRR)=1.90; P=0.0002). When we restricted the analysis to biopsy-confirmed cases, the aRR for this marker increased to 2.01; P=0.00007. Among the eight other markers that demonstrated suggestive ancestry associations with sarcoidosis were rs1462906 on chromosome 8p12, which had the most significant association with European ancestry (aRR=0.65; P=0.002), and markers on chromosomes 5p13 (aRR=1.46; P=0.005) and 5q31 (aRR=0.67; P=0.005), which correspond to regions we previously identified through sib-pair linkage analyses. Overall, the most significant ancestry association for Scadding stage IV cases was to marker rs7919137 on chromosome 10p11.22 (aRR=0.27; P=2 × 10(-5)), a region not associated with disease susceptibility. In summary, through admixture mapping of sarcoidosis we have confirmed previous genetic linkages and identified several novel putative candidate loci for sarcoidosis.

Entities:  

Mesh:

Year:  2010        PMID: 21179114      PMCID: PMC3058725          DOI: 10.1038/gene.2010.56

Source DB:  PubMed          Journal:  Genes Immun        ISSN: 1466-4879            Impact factor:   2.676


Introduction

Sarcoidosis, a multiorgan granulomatous inflammatory disease, likely results from an exaggerated T cell response to an airborne antigen 1. The first genome-wide linkage study of sarcoidosis in 63 German families found linkage of sarcoidosis to chromosome 6p212. Subsequently, a novel sarcoidosis risk variant at rs2076530 was identified, which predicted a premature splice site at exon 5 of the class II gene BTNL2, a B7 family co-stimulatory molecule3. The increased sarcoidosis risk associated with this variant was replicated in another German population 4, by our group in European Americans 5, and in a Dutch and British population 6. We also studied variation at exon 5 of BTNL2 in African-American sarcoidosis cases and controls and found variants other than rs2076530 associated with sarcoidosis 5. Additional genome-wide studies have confirmed that sarcoidosis is a complex genetic disease with multiple loci having modest effects. In the Sarcoidosis Genetic Analysis (SAGA) study 7 of 516 African-American sib pairs with at least one affected sib and 359 pairs with both sibs affected, our most significant linkage peaks spanned chromosomes 5p15.2 and 5q11.2 8. A second analysis of this same study population using disease sub-phenotypes found that chest radiographic resolution had one of the highest LOD scores among the pulmonary and extrathoracic involvement phenotypes analyzed 9. The latest evidence for a genetic predisposition to sarcoidosis comes from the first sarcoidosis genome-wide association scan (GWAS), conducted in a German population, that had its most significant disease association map to Annexin A11 (ANXA11), a gene with complex and essential functions in several biological pathways, including cell apoptosis and proliferation 10. Study of the genetics of sarcoidosis in African Americans, who are more commonly and severely affected by this disease 11, suggests that focusing on genes of African ancestry may uncover novel risk variants. A higher incidence of sarcoidosis in African Americans compared with whites has long been noted 12,13 with an over three-fold increased incidence of sarcoidosis in African Americans compared with whites, 35.5 per 100,000 vs. 10.9 per 100,000 11. Other studies outside of the United States that compared sarcoidosis incidence in populations of West African ancestry with whites further support a role of African genes in sarcoidosis susceptibility. In South Africa, sarcoidosis incidence was reported as 23.2 per 100,000 in blacks and 3.7 per 100,000 in Whites 14. In Great Britain, Edmondstone and Wilson 15 reported an annual sarcoidosis incidence of 19.8 per 100,000 in blacks and 1.5 per 100,000 in whites. The eight to ten fold higher incidence of sarcoidosis in blacks in the South African and Great Britain studies is notable because of decreased European ancestry in these two populations compared with African Americans. Average European admixture proportions are about 10 percent in African Caribbeans living in Great Britain compared with 18 percent for African Americans 16. Among the disease outcomes in sarcoidosis that have a strong link with African ancestry, persistence of radiographic evidence for disease among patients of African descent appears to be a consistent finding 15. In a multi-site case control study, after two years of observation, African-American cases were 80% more likely to have a worsening chest x-ray, although this finding was not statistically significant 17. In general, self reported African ethnicity is strongly related to increased sarcoidosis risk and more severe disease outcomes. The recent development of genome-wide ancestry informative marker panels 18,19 now makes scanning the genome for disease genes linked to ancestry in African-American populations feasible. Mapping by admixture linkage disequilibrium (MALD) involves screening the genome of individuals of mixed ancestry who have a disease or trait of interest for chromosomal regions that have a greater percentage of alleles than expected from a parental population. The theory behind MALD is that affected individuals are more likely to carry alleles from the higher risk parental population in chromosomal segments that also harbor disease loci. Advantages to using admixture mapping over genome-wide association mapping include the ability to find effects where there is marked allelic heterogeneity across ethnic groups 20 or where a risk allele is fixed in either parental population. Admixture mapping can also more readily detect risk effects emanating from genes due to rare copy number variants than can a dense SNP map relying only on allelic association. MALD in African-American populations has been used to localize susceptibility loci for prostate cancer 21, kidney disease 22, peripheral artery disease 23, type 2 diabetes 24, obesity 25,26, interleukin-6 levels 27 and blood lipid levels 28. To determine whether African ancestry is linked with sarcoidosis genetic susceptibility, we conducted a whole genome admixture scan using a panel of ancestry-informative markers in a large sample of African-American sarcoidosis cases and unaffected controls. We hypothesized that ancestry-linked genes may predispose to either disease susceptibility and/or severity, as measured by radiographic progression.

Results

Overall Results

The MALD scan estimated associations between increased or decreased African ancestry based on 1,384 SNPs typed on a sample of African-American unrelated sarcoidosis cases (n=1,026) and controls (n=316). The mean African ancestry estimate of the analytic sample was 82.6% with sarcoidosis cases having a slightly higher level of African ancestry than controls (82.9% vs. 81.5%; p=0.03). We detected nine distinct chromosomal regions that had suggestive ancestry associations with sarcoidosis: four with excess African and five with excess European ancestry (table 1). These regions were identified based on affected-only test p values < 0.01 and the same ancestry differential based on the ancestry association test (figure 1). For consistency, the ancestry risk ratios (aRRs) estimated from the more powerful affected-only test are reported in the text. While none met the genome-wide significance level of 10−5 29, the most prominent ancestry linkage peak located at chromosome 6p24.3-6p12.1 had as its most significant marker rs11966463 with an aRR of 1.90 and a Z score of 3.78 (p=0.0002), indicative of increased risk of sarcoidosis associated with excess African ancestry at this locus. The second most prominent peak was on chromosome 17p13.3-17p13.1 with marker rs8070464 having an aRR of 1.68 (Z = 3.51; p=0.0005) also indicative of increased risk of sarcoidosis associated with excess African ancestry. The other two regions that showed excess African ancestry among sarcoidosis cases were chromosomes 4q31.21-4q34.1 and 5p13.3-5p13.2. The five regions that showed excess European ancestry among sarcoidosis cases were chromosomes 2p13.3-2p13.1, 2q35-2q26.3, 5q23.1-5q31.2, 6q23.3-6q25.2 and 8p21.3-8p11.21. Chromosome 8p21.3-8p11.21 was the most significant of the five (p=0.002), but all were comparable in magnitude. In general, ancestry association Z-scores were, as expected, of lower magnitude than affected-only test Z scores, but in the same direction with the same peak SNPs (figure 1). Of note, was a 10 mB region on chromosome Xq24-Xq25, which had the most statistically significant affected-only test p value, but none of the SNPs in this peak attained our ancestry association p value criterion of 0.05. Analyses were rerun for the subset of 910 unrelated cases with a definite (i.e., histologically confirmed) diagnosis of sarcoidosis, and the results were nominally changed (table 2). In these analyses, the most significant finding remained at rs11966463 and increased in significance to p=0.00007. Results in the definite case group for the other eight regions were nominally changed with the possible exception of chromosome 8p21.3-11.21, where the aRR increased from 0.67 to 0.77 and the p value increased from 0.003 to 0.06.
Table 1

Markers with strongest association between sarcoidosis and genetic ancestry based on a genome-wide ADMIXMAP analysis of unrelated sarcoidosis 1,026 cases and 316 controls

Affected-onlyAncestry Association
GENOMICPOSITIONMarkeraRR95%CIP valueaOR95%CIP value
2p13.3-2q12.1Rs14445430.670.52-0.880.0040.690.54-0.890.003
2q35-2q36.3Rs46746590.690.53-0.910.0080.720.55-0.930.01
4q31.21-4q34.1Rs15300441.531.14-2.050.0051.270.94-1.710.1*
5p13.3-5p13.2Rs353971.461.13-1.900.0051.521.16-1.990.003
5q23.1-5q31.2Rs305330.670.51-0.890.0050.720.56-0.940.01
6p24.3-6p12.1Rs119664631.901.36-2.640.00021.491.07-2.080.02
6q23.3-6q25.2Rs2764970.670.52-0.870.0030.680.53-0.880.003
8p21.3-8p11.21Rs14629060.650.5-0.850.0020.730.57-0.940.02
17p13.3-17p13.1Rs80704641.681.26-2.250.00051.371.01-1.850.04

Note: Regions were selected satisfied all of the following criteria: (1) at least one SNP with an Affected-only p-value < 0.01 (i.e. absolute Z-score > 2.576); (2) more than 3 SNPs with Affected-only p-values < 0.05; and (3) at least one SNP with an Ancestry Association p-value < 0.05.

(aRR = ancestry risk ratio; aOR = ancestry odds ratio)

adjacent SNP had p value <0.05

Figure 1

Genome-wide Ancestry Associations with Sarcoidosis susceptibility based on Affected-Only and Ancestry Association Z scores.

Table 2

Markers with strongest association between sarcoidosis and genetic ancestry based on a genome-wide ADMIXMAP analysis of unrelated sarcoidosis 910 cases with a definite diagnosis and 316 controls

Affected-onlyAncestry Association
GENOMICPOSITIONMarkeraRR95%CIP valueaOR95%CIP value
2p13.3-2q12.1rs14445430.630.47-0.80.0010.670.52-0.860.002
2q35-2q36.3rs46746590.710.53-0.90.020.720.56-0.940.02
4q31.21-4q34.1rs15300441.451.06-1.970.021.200.89-1.630.2
5p13.3-5p13.2rs353971.551.17-2.050.0021.561.18-2.050.002
5q23.1-5q31.2rs305330.620.46-0.820.0010.700.54-0.910.007
6p24.3-6p12.1rs119664632.011.42-2.840.000071.531.09-2.140.02
6q23.3-6q25.2rs2764970.770.58-1.010.060.710.55-0.920.01
8p21.3-8p11.21rs14629060.630.47-0.840.0010.710.55-0.920.01
17p13.3-17p13.1rs80704641.621.19-2.20.0021.330.98-1.810.07

Note: Regions were selected satisfied all of the following criteria: (1) at least one SNP with an Affected-only p-value < 0.01 (i.e. absolute Z-score > 2.576); (2) more than 3 SNPs with Affected-only p-values < 0.05; and (3) at least one SNP with an Ancestry Association p-value < 0.05.

(aRR = ancestry risk ratio; aOR = ancestry odds ratio)

Results in Demographic and Phenotypic Subsets

Of the nine sex-specific aRRs shown in table 3, only the sex-specific aRRs in the 5q23.1-5q31.2 region were statistically significantly different from each other (p=0.003). Female cases had ancestry risk ratios of greater magnitude than male cases for six of the nine ancestry peaks, with the only exceptions being chromosomes 5p13.3-5p13.2, 6q23.3-6q25.2, and 17p13.3-17p13.1 where the female-specific risk ratios were of slightly lower magnitude than for males. For four of the six peaks where the female-specific aRR was of greater magnitude (chromosomes 2q35-2q36.3, 4q31.21-4q34.1, 5q23.1-5q31.2 and 6p24.3-6p12.1), the excess ancestry risk appeared to be almost exclusive to female subjects. In addition to the loci identified in the unstratified analysis, marker rs2476230 on chromosome 13q32.2-13q33.3 had an elevated aRR (1.60; p=0.003) only in female subjects and marker rs1131877 on chromosome 14q32.2-14q32.33 had an elevated aRR (2.63; p=0.002) only in male subjects.
Table 3

Markers with strongest association between sarcoidosis and genetic ancestry from a genome-wide ADMIXMAP analysis of unrelated sarcoidosis cases stratified by sex*

All Cases(n=1026)Males(n=252 )Females(n=774 )
GENOMICPOSITIONMarkeraRR95%CIP valueaRR95%CIP valueaRR95%CIP value
2p13.3-2q12.1rs485269610.670.51-0.870.0030.790.46-1.360.4030.620.46-0.840.002
2q35-2q36.3rs1247107810.650.45-0.960.0291.110.56-2.190.7640.530.35-0.820.004
4q31.21-4q34.1rs15300441.531.14-2.050.0050.940.53-1.680.8301.81.29-2.510.005
5p13.3-5p13.2rs353971.461.13-1.900.0051.560.93-2.620.0941.461.08-1.960.013
5q23.1-5q31.2rs305330.670.51-0.890.0051.070.62-1.860.8020.580.43-0.790.0005
6p24.3-6p12.1rs119664631.91.36-2.640.00021.360.72-2.560.3482.071.43-2.990.0001
6q23.3-6q25.2rs2764970.670.52-0.870.0030.560.33-0.960.0340.720.53-0.970.028
8p21.3-8p11.21rs14629060.650.5-0.850.0020.680.4-1.170.160.650.48-0.880.005
13q32.2-13q33.32rs24762301.341.03-1.750.0330.840.49-1.430.5241.601.18-2.170.003
14q32.2-14q32.333rs11318771.270.91-1.780.1552.631.42-4.870.0021.010.7-1.470.957
17p13.3-17p13.1rs80704641.681.26-2.250.00051.841.05-3.250.0351.671.2-2.320.003

Note: Regions selected where either significant in all cases or a sex-stratified subset based on criteria outlined in the table 2 footnote

Most significant SNP in region in females was different than most significant SNP in all cases

Significant region only for females

Significant region only for males

(aRR = ancestry risk ratio estimated from the affected-only analysis)

When stratifying cases by chest radiographic phenotype and analyzing using the affected-only test, seven additional distinct ancestry peaks that were detected, including chromosomes 2p24.3-2p16.1, 4q21.1-4q24, 10p12.2-10q11.23, 16q22.1-1623.2, 18p11.32-18p11.31, 18q23 and 19p12-19q13.41 (table 4). Of the nine original ancestry peaks found in the full case sample, four (2p13.3-2p13.1, 5p13.3-5p13.2, 6q23.3-6q25.2 and 17p13.3-17p13.1) had aRRs of much greater magnitude and more significant associations in the more severe Scadding stage IV case group, despite this case subset being less than 20 percent of the size of the full case sample. A formal test for heterogeneity revealed three loci with significant differences in aRRs between the extreme radiographic phenotypes of stage IV sarcoidosis and radiographically resolved disease (table 5). All three loci were European ancestry-associated loci in the full case group with much stronger associations with European ancestry in stage IV cases. For example, in stage IV chest radiograph cases, chromosome 10p11.22 had the aRR of greatest magnitude (aRR=0.27; p=2 × 10−5) for any of the phenotypic subsets analyzed.
Table 4

Markers with strongest association between sarcoidosis chest radiographic phenotypes and genetic ancestry based on a genome-wide ADMIXMAP analysis of unrelated sarcoidosis cases

All Cases(n=1026)Radiographically Resolved Disease(n=295 )Radiographically Persistent Disease(n=525 )Scadding Stage IV Disease(n=186 )
GENOMICPOSITIONMarkeraRR95%CIP valueaRR95%CIP valueaRR95%CIP valueaRR95%CIP value
2p13.3-2q12.1rs13201490.710.54-0.920.0110.970.59-1.600.910.670.47-0.970.0360.290.15-0.550.0001
2q35-2q36.3rs46746590.690.55-0.910.0080.760.46-1.250.2720.620.42-0.900.0130.660.35-1.240.195
2p24.3-2p16.1rs13165210.770.56-1.060.1100.670.39-1.170.1610.670.44-1.030.0700.330.17-0.680.003
4q21.21-4q24rs100035771.270.98-1.660.0751.430.88-2.330.1531.701.17-2.460.0051.400.75-2.590.291
4q31.21-4q34.1rs15300441.531.14-2.050.0051.520.9-2.580.1171.450.97-2.170.0681.660.87-3.170.128
5p13.3-5p13.2rs400301.411.09-1.830.0081.971.24-3.150.0041.370.96-1.940.0822.251.25-4.050.007
5q23.1-5q31.2rs305330.670.51-0.890.0050.790.48-1.290.3480.710.48-1.040.0790.470.25-0.890.021
6p24.3-6p12.1rs119664631.901.36-2.640.00022.041.15-3.670.0061.611.04-2.490.0061.130.55-2.310.740
6q23.3-6q25.2rs2764970.670.52-0.870.0030.590.36-0.960.0340.630.44-0.910.0130.380.21-0.700.002
8p21.3-8p11.21rs14629060.650.5-0.850.0020.920.56-1.50.7230.680.47-0.980.0370.790.43-1.460.446
10p12.2-10q11.23rs79191370.820.64-1.070.3490.910.56-1.480.7080.620.43-0.90.0370.270.15-0.492×10−5
16q22.1-16q23.2rs124442040.780.58-1.040.0941.210.72-2.040.4710.730.49-1.090.1210.320.16-0.630.001
17p13.3-17p13.1rs80704641.681.29-2.250.00051.660.97-2.820.0621.681.12-2.510.0112.601.36-4.970.004
18p11.32-18p11.31rs5682731.340.95-1.890.0970.740.39-1.410.3621.861.19-2.910.0070.910.43-1.920.802
18q23rs47989540.820.57-1.170.2790.610.33-1.130.1150.490.3-0.810.0050.810.37-1.790.599
19p12-19q13.41rs29741901.451.1-1.910.0091.811.08-3.040.0251.821.25-2.650.0021.490.8-2.770.201

Note: Regions selected where either significant in all cases or a phenotype-stratified subset based on criteria outlined in the table 2 footnote

(aRR = ancestry risk ratio estimated from the affected-only anlaysis)

Table 5

Genomic regions with evidence for loci with significant heterogeneity between stage IV and radiographically resolved sarcoidosis cases based on a genome-wide ADMIXMAP analysis*

Radiographically Resolved Disease(n=295 )Scadding Stage IV Disease*(n=186 )Stage 4 vs. ResolvedTest of Heterogeneity

GENOMICPOSITIONMarkeraRR95%CIP valueaRR95%CIP valueZ scoreP value
2p13.3-2q12.1rs2607141.100.66-1.820.7180.330.18-0.614×10−4−2.970.003
10p12.2-10q11.23rs79191370.910.56-1.480.7080.270.15-0.492×10−5−3.070.002
16q22.1-16q23.2rs124442041.210.72-2.040.4720.320.16-0.630.001−3.020.002

Note: For a region to be included in table 5, there had to be at least two consecutive markers with heterogeneity p-values < 0.01.

(aRR = ancestry risk ratio estimated from the affected-only analysis)

Finally, we ran several validity checks on our results. The main analysis for disease susceptibility was redone using ANCESTRYMAP 30 with resulting African and European ancestry associations virtually identical to what we found using ADMIXMAP. We also redid the main analysis for 100 resamples of our full dataset, in which we relaxed the sampling criteria such that any individual irrespective of affection status could be randomly selected from a family. Analysis of these 100 different samples of unrelated individuals showed essentially the same patterns of ancestry associations across the genome.

Exclusion Map

We prepared an exclusion map for the inclusive (definite and highly probable; n=1,026 cases) case definition (supplementary figure 1) based on the affected-only test results. At least 90% of the genome can be excluded as having a European effect on risk of 2.12 or more, and at least 90% can be excluded as having an African effect on risk of 2.10 or more. Further, among the subset of definite cases (n=910), at least 90% of the genome can be excluded as having a European effect on risk of 2.23 or more, and at least 90% can be excluded as having an African effect on risk of 2.19 or more. The power of the subanalysis for definite cases is less than that for all cases because of the smaller sample size.

Discussion

We used admixture mapping to search the genome for sarcoidosis susceptibility genes linked to ancestry in African Americans. The chromosomal regions we identified complement and confirm previous genomic regions identified by past genome-wide linkage 31,8 and association 10 studies in African Americans and whites. The chromosome 5p13.3-5p13.2, 5q23.1-5q31.2 ancestry linkage signals overlap with the chromosome 5 sarcoidosis linkages we previously reported in African American sib pairs 8. Moreover, the chromosome 5p and 5q regions have also been implicated in Crohn’s disease. The chromosome 5p13 region linked to Crohn’s disease has been characterized as a 1.25 megabase gene desert, but disease associated variants in this region are associated with expression levels of PTGER4 32, the closest gene that maps to this region. Chromosome 5q31 contains the cytokine gene cluster with unique variants associated with Crohn’s disease 33 and psoriasis 34,35. The chromosome 6p24.3-6p12.1 ancestry peak overlaps with the chromosome 6p21 sarcoidosis linkage signal originally reported by Schurmann et al. 31. We also found novel regions containing putative sarcoidosis susceptibility loci on chromosomes 2p13.3-2q12.1, 2q35-2q36.3, 4q31.21-4q34.1, 6q23.3-6q25.2, 8p21.3-8p11.21 and 17p13.3-17p13.1. Of the nine ancestry linkage peaks we identified in the overall risk analysis, the African ancestry peak on chromosome 6p24.3-6p12.1 was the most statistically significant (p=7 × 10−5 for rs11966463 in definite cases) and was close to the genome-wide significance level of 10−5 for admixture mapping 29. This region spans 50 megabases and includes the entire major histocompatibility complex on chromosome 6p21. Variants of the class II HLA-DR and DQ genes are located in this region and known to be associated with sarcoidosis susceptibility and progression 36,37,38,39. In addition, the exon 5 SNP rs2076530 of the BTNL2 gene 200 kilobases telomeric from HLA-DRB1 was shown to be the primary disease-associated variant in a German sample that was driving the linkage signal previously identified in this sample 3. Increased allelic heterogeneity across the HLA region in African Americans 40,41 was the likely explanation for why we replicated this association in the ACCESS white case-control sample but not in two different African-American samples 5. The numerous previously reported associations of genetic variation in the HLA region with sarcoidosis and the linkage to this region suggest that the prominent African ancestry peak at 6p22 we observed in African-American sarcoidosis cases could be due to underlying disease causing genes in the HLA region. To appropriately estimate disease-ancestry linkage on the X chromosome, we had to account for the known unequal contribution of male and female European founders to the present day African-American gene pool 42,43. In our study sample, we estimated ψ, the female/male odds ratio for European versus African founders, to be 3.1, which falls within the range of ψ estimates (2.3-4.3) derived from European admixture estimates based on mitochondrial and Y chromosome lineages in other African-American samples. Despite the incorporation of ψ into the estimation of X chromosome admixture proportions, we still found two broad peaks on the X chromosome at which the observed African ancestry on the X chromosome of cases was much higher than the expected value given autosomal admixture proportions and ψ. A similar excess of African ancestry was observed over the same regions in controls, however, suggesting that the frequency of African ancestry in these regions is higher than elsewhere as a result of drift or selection for African derived DNA on the X chromosome in African Americans rather than a true disease association. We failed to find evidence for increased African or European ancestry influencing sarcoidosis susceptibility on chromosome 10q22.3. This is the region containing the ANXA11 (annexin A11) gene with a common nonsynonymous SNP (rs1049550, T > C, R230C) found to be the most strongly associated sarcoidosis SNP in a German GWAS of sarcoidosis susceptibility 10. None of the ANAX11 sarcoidosis associated variants show significant allelic diversity between African and European populations. This highlights the main limitation of admixture mapping in that putative disease risk variants with similar frequencies in the ancestral African and European populations that comprise the current day African-American population are undetectable by admixture mapping. A female preponderance in sarcoidosis cases is well documented 11,44 and some evidence suggests that female cases may also incur greater granuloma burden in early stage disease 45. In terms of underlying genetic liability, polygenic complex disease theory is that individuals with a lower population risk who contract the disease in question (e.g., males for sarcoidosis) have on average more genetic liability 46. We previously found a greater familial risk associated with African-American male sarcoidosis cases compared with females 47, but the ACCESS case-control study consisting of both white and African-American subjects found no difference in sex-specific familial risk 48. Others have reported sex-specific effects of alleles predisposing to sarcoidosis susceptibility or disease manifestations 49,50,51,52. Our sex-stratified analyses showed that some ancestry linkage signals were consistent for males and females, whereas others were sex-specific. Ancestry signals on chromosomes 2q35-2q36.3, 4q31.21-4q34, 5q23.1-5q31.2, 6p24.3-6p12.1, and 13q32.2-13q33.3 appeared to be exclusive to females, whereas only one significant signal was exclusive to males, on chromosome 14q32.2-14q32.33, however the aRR at this locus at 2.63 was of the greatest magnitude of the sex-specific ancestry signals. Given the 3:1 ratio in female to male sarcoidosis cases in our analytic dataset, it is unclear whether the abundance of significant ancestry signals in female-only analyses simply reflected a greater statistical power or a more robust genetic effect. The etiology of sex differences in sarcoidosis remains unknown, but since female and male sex hormones affect the immune system in different ways 53, one might suspect that some sex-specific genetic effects underlying sarcoidosis risk exist. In terms of genetic risk linked with ancestry, our results suggest that focusing on female cases may provide a more direct path toward sarcoidosis gene finding. Consideration of phenotype plays a critical role in the search for sarcoidosis susceptibility genes. For instance, the acute form of sarcoidosis known as Lofgren’s syndrome has a strong association with a distinct HLA subtype 54. Other studies have shown clear differences in genetic predisposition to acute and chronic forms of sarcoidosis 55,37. In a reanalysis of the SAGA sib pair linkage data according to phenotype 9, we showed that phenotypes defined by specific pulmonary manifestations of disease and affected organs had linkage signals that were different from those based on disease status alone, but more importantly the linkage signals we found were of greater magnitude suggesting a less diluted genetic effect. Persistence of radiographic defined lung disease is a sarcoidosis phenotype that appears to have a strong association with African ancestry 15,17,56. We analyzed case subsets defined by radiographic persistence of disease. For those with radiographic persistence, we also analyzed the subset of cases with evidence of fibrotic (stage IV) disease. In doing so, we found that several loci had aRRs for stage IV disease that were of greater magnitude than any found for disease susceptibility with the strongest ancestry association signal at chromosome 10p12.2-10q11.23 (aRR=0.27; p=0.00002). Stage IV cases were more than three times more likely to carry two European alleles at this locus in comparison to none (i.e. two African alleles). A recent combined genome-wide association study of Crohn’s disease and sarcoidosis cases found the SNP with the strongest association at chromosome 10p12.2 57. The results of these phenotype-stratified analyses show the genetic heterogeneity of sarcoidosis and the need to stratify cases into more homogeneous subsets in gene finding efforts. Our study had several limitations. The African-American sample we analyzed was recruited from sites across the United States, which likely contributed to a greater variation in European admixture proportions than would be observed if cases were drawn from a more circumscribed geographic area 43. Although African admixture proportions varied between 28 and 99% within the study sample, variation in average African admixture was similar across study sources. Ascertainment schemes also varied across the three source populations, with two of the three studies ascertaining on families. In practice, having a majority of cases with relatives allowed Mendelian inconsistent genotypes to be removed resulting in a more accurate analytic set of genetic data. Admixture mapping cannot detect disease variants with similar allele frequencies in the two parental populations, but disease variants differentially fixed in the parental populations may only be detected through admixture mapping. In addition, admixture mapping is robust to allelic heterogeneity, an issue that arose in trying to replicate BTNL2 associations in African-American samples 5. The greater HLA diversity in African-Americans may have limited our ability to find sarcoidosis linkage to this region using traditional methods, but the strong African ancestry signal on chromosome 6p22.2 suggests African alleles in this region may indeed play a role in sarcoidosis susceptibility. We did find one African ancestry-disease associated locus at chromosome 6p24.3-6p12.1 that was close to the genome-wide significance level of 10−5 29 in cases with histologically confirmed disease, but our results suggest that multiple ancestry-linked loci exist harboring genes with alleles of both African and European origin that predispose to sarcoidosis. Next steps will include prioritizing ancestry-linked regions for fine mapping based on level of association and consistency of results in phenotypic subsets. While several ancestry peaks were found at chromosomal regions where known sarcoidosis genes exist, most were in regions that, if due to true genetic effects, may reveal novel sarcoidosis genes that help us better understand and treat this perplexing disease. Loci identified by this ancestry scan may also be relevant in other immune-related diseases, particularly where African-American race is a disease risk factor. Finally, understanding the effects of ancestry on locus-specific risk should aid in future sarcoidosis gene finding efforts in African Americans, such as genome-wide association studies, which will attempt to isolate modest genetic effects possibly confounded by ancestry.

MATERIALS AND METHODS

Study Sample and Case Phenotyping

DNA samples and subject data were collected from three previously conducted studies: 1) a multi-site case control study 58; 2) a multi-site affected sib-pair study 7 and; 3) a single institution family-based study 38. The current study, as well as the three previous studies all underwent human subjects review with the original informed consents allowing usage of data and genetic material for future studies. All study participants self-identified as being African-American, with the single-institution study drawing participants from the Detroit, Michigan area, whereas the two multi-site studies drew participants from across the United States. Figure 2 depicts the initial sample composition submitted for genotyping and how the final analytic sample of 1,357 sarcoidosis cases and 703 controls was derived. Of the 2,494 samples genotyped, 15 were excluded because of poor genotyping quality. Another 15 samples were excluded because the reported sex did not match genotyped sex. A total of 108 samples were found to be duplicated across studies (i.e., the same individual participated in more than one study), and therefore the duplicate sample(s) with the poorer genotyping quality was dropped from the analytical dataset. Nine samples were dropped due to allele-sharing with another sample that indicated unreported monozygotic twin status. Finally, 10 samples were dropped either for having no genes of European or African ancestry or for having evidence of the first European admixture within the last generation and, therefore, uninformative for MALD as they posses no admixed gametes.
Figure 2

Flow diagram of sample composition by study of origin and where samples dropped out to form final analysis sample.

Among the 1,357 cases, 717 (52.8%) were members of nuclear families with one or more sibs and/or parents genotyped. Because of the different ascertainment schemes for the two family studies that contributed to the final analytic sample, sibships included both single and multiple affected individuals. When available, parents were genotyped, but approximately half of the nuclear families (50.7%) had no genotype data on parents. The diagnostic criteria for sarcoidosis varied among the different study populations from which the samples were derived. Overall, for an individual to be classified as a sarcoidosis case, he or she had to satisfy the diagnostic criteria for definite or highly probable sarcoidosis used in the SAGA study 9. In short, definite cases had histological confirmation of non-caseating granulomas and evidence of disease involvement in either the thorax or two or more other organ systems. Highly probable cases lacked histologic confirmation, but had characteristic chest radiographs (bilateral symmetrical hilar adenopathy) and either a history of erythema nodosum or at least two years observation during which time no other disease was found to explain radiographic abnormalities. Where possible, cases were phenotyped as to the persistence or absence of radiographic evidence for lung disease two years after date of diagnosis. The procurement of these data were largely done retrospectively with the exception of a subset of cases from the ACCESS study where study protocol dictated a two-year follow-up exam for cases recruited during the first two years of enrollment 58. For cases that presented with a Scadding stage IV chest radiograph (evidence of lung fibrosis or scarring), no follow-up chest x-ray was needed for phenotyping since a stage IV x-ray indicates permanent changes. Table 6 summarizes the demographic and clinical characteristics of the analytic sample.
Table 6

Demographic and Clinical Characteristics of Sarcoidosis Cases and Controls used in Admixture Mapping Study

Full SampleAnalytic Subset of Unrelateds
FactorCases (n=1,357)Controls (n=703)Cases (n=1,026)Controls (n=316)
Age*36.7 ± 9.744.0 ± 11.237.4 ± 9.641.7 ± 11.5
Male Sex25.4%32.7%24.6%28.2%
Source of Sample:
 Henry Ford35.9%45.9%44.5%9.2%
 ACCESS18.8%40.8%24.9%90.5%
 SAGA38.7%12.4%24.9%0.3%
 More than one study7.0%0.9%5.8%0.0%
Diagnostic Category:
 Definite78.1%84.22%
 Highly Probable19.2%13.8%
 Unknown2.7%1.9%
Chest Radiographic Follow-Up Status:
 Resolved in 2 years23.7%23.1%
 Persistent ≥ 2 year48.3%46.8%
 Stage IV14.6%14.7%
 Unknown28.0%30.1%

Age at diagnosis for cases; Age at study enrollment for controls

Marker Panel Description

We used the Illumina Bead Station platform to assay 1,509 SNPs previously identified as highly informative for distinguishing West African versus European ancestry. The panel of markers is described in documentation on the Illumina website (http://www.illumina.com/products/african_american_admixture_panel.ilmn). After performing both automated and manual genotype calling and quality checks using the Illumina BeadStudio software, we removed data on 49 markers with poor genotype clustering. An additional 23 markers were removed due to a high level (>5%) of missing genotype data (n=8) or genotype frequencies significantly different (p<10−5) than expected under Hardy-Weinberg equilibrium (n=15). Finally, we dropped 53 markers that were in significant (p<0.0001) linkage disequilibrium with one or more adjacent markers. This resulted in a final analytic map of 1,384 markers with an average spacing of 2.9 cM. The median inter-marker distance was 2.4 cM with 95 percent of the markers located no farther than 7.2 cM from an adjacent SNP.

Statistical Methods

Data were analyzed using the ADMIXMAP statistical program (http://homepages.ed.ac.uk/pmckeigu/admixmap/) 29. ADMIXMAP uses a hybrid of Bayesian and traditional modeling to compare observed vs. expected ancestry across the genome. Both affected-only and ancestry association (i.e., case-control) implementations were used. All models were adjusted for age and sex. Prior allele frequencies for the African and European ancestral populations were obtained from the International HapMap project’s 59 West AfricanYoruba and European samples, respectively. Analyses were performed on the entire sample and on case subsets stratified by sex, chest radiographic phenotype and disease status (i.e., definite vs. highly probably cases). As the current versions of admixture mapping methods do not model family data, an unrelated set of cases and controls had to be selected for analysis. The majority of the controls selected for the analyses were derived from the ACCESS portion of the study sample. In addition, for families containing only unaffected sibs (due to lack of DNA from affected individuals in the original pedigree); the unaffected sib with the oldest age of enrollment was selected as a control. Sarcoidosis cases selected for analysis included all ACCESS cases as well as the proband case from each family from the Henry Ford and SAGA studies. For instances in which the proband was not genotyped (due to a lack of DNA or removed as part of the sample quality control process), the case with the earliest age of diagnosis was selected. Stratified analyses used the same case selection criteria (proband or earliest age at diagnosis). This sampling procedure yielded a unique set of 1,026 cases and 316 controls for analysis. Analyses of mitochondrial and Y chromosomal lineages among unadmixed founders who contributed to the African-American gene pool has shown that the female/male sex ratio was lower in Europeans than in Africans 42,43. The average European admixture proportion is thus lower on X chromosomes than on autosomes, since male founders contribute half as many X chromosomes as female founders. To correctly model X chromosome admixture proportions, we extended the statistical model to specify a global parameter ψ for the female/male odds ratio in European versus African founders. For any given value of autosomal admixture proportions realized by the Markov chain Monte Carlo sampler in a given individual, the X chromosome admixture proportions can be calculated as follows. We specify a 2 × 2 frequency table for the most recent unadmixed ancestors of the individual, in which the rows and columns index population ancestry and sex respectively. The row marginal frequencies are the autosomal admixture proportions and the column marginal frequencies reflect the required equal numbers of male and female ancestors. The cell frequencies are calculated from the odds ratio ψ and the marginal frequencies. The X chromosome European admixture proportions are calculated as a weighted average of the frequencies of European ancestry among females and males, with respective weights in the ratio 2 to 1. The model is specified with a Gaussian prior for logeψ, and ψ was sampled from its posterior distribution using a Metropolis algorithm. We allowed ψ to be estimated from our data by specifying a weak prior (i.e., low precision) for the value of logeψ, and set the prior mean to the null value of zero (i.e., equal proportion of male and female African founders). Initial analyses were conducted using the affected-only test because of its greater statistical power, but were followed by the ancestry association test to validate affected-only test results. Significant regions satisfied all of the following criteria: (1) at least one SNP had an affected-only p-value < 0.01 (i.e. absolute Z-score > 2.576); (2) more than 3 SNPs had affected-only p-values < 0.05; and (3) at least one SNP had an ancestry association p-value < 0.05. All locus-specific Z-scores (for both ancestry association and affected-only tests), ancestry odds ratios (aORs; estimated based on the ancestry association test), and ancestry risk ratios (aRRs; estimated from the affected-only test) were calculated with respect to African ancestry, with the aRRs interpreted as the ratio of the risks at the locus of interest associated with carriage of two African alleles versus carriage of two European alleles. Therefore, positive Z-scores and corresponding aRRs > 1 are interpretable as increased risk associated with increased locus-specific African ancestry and negative Z-scores and corresponding aRRs < 1 are interpretable as decreased risk associated with increased locus-specific African ancestry (i.e. increased risk associated with increased European ancestry). To formally test for differences in the results by sex and the extremes of the radiographic phenotype (stage IV radiographic persistent sarcoidosis vs. radiographically resolved disease), we computed a test of heterogeneity of the aRRs estimated from the affected-only analyses from each stratum as the difference in the log affected-only odds ratios (θ) divided by the information (V) for each locus. For strata A and B, the test of heterogeneity was expressed as follows: As with the locus specific affected-only and ancestry association tests, the test of heterogeneity is also distributed as a standard normal deviate. P-values were two-sided for all tests performed. Additionally, we prepared an exclusion map for African and European risk loci based on the affected-only test results from all sarcoidosis cases by the method described by Hoggart et al.29 to show which regions of the genome could be excluded from harboring a locus with a specific aRR effect size at the traditional LOD score exclusion threshold of −2 (i.e. 100 times more likely under the null aRR of one in comparison to a specific aRR different than one). Briefly, given the observed score and information from the affected-only test for each SNP used in the analysis, the log-likelihood function (expressed in term of the loge(aRR)) was solved for the loge(aRR) at the value of loge(1/100) (i.e. 100 times more likely under the null of no ancestry association). As the log-likelihood is quadratic in loge(aRR), there are two possible values for the loge(aRR), one greater than and one less than zero. Because the analysis was performed with respect to increased risk associated with increased locus specific African ancestry, the value greater than zero corresponds to the exclusion threshold loge(aRR) at or above which we can exclude an African ancestry increased risk locus, and the value less than zero corresponds to the exclusion threshold loge(aRR) at or below which we can exclude a African ancestry decreased risk locus or, alternatively, a European ancestry increased risk locus. Supplemental Figure 1. Genome-wide exclusion maps for African and European log ancestry ratios (aRR) at a LOD score of −2. The threshold log aRRs for exclusion were calculated as greater than zero for African and less than zero for European ancestry for ease of inclusion in the same figure. Dotted lines correspond to log aRRs of + and − 0.69 (or aRR of 2 and 0.5 respectively). Log aRR values greater than (for African ancestry) and less than (for European ancestry) the observed value at each marker are excluded at a LOD score of −2 given our observed study data.
  58 in total

1.  Severe pulmonary sarcoidosis is strongly associated with the haplotype HLA-DQB1*0602-DRB1*150101.

Authors:  Christina E M Voorter; Marjolein Drent; Ella M van den Berg-Loonen
Journal:  Hum Immunol       Date:  2005-07       Impact factor: 2.850

2.  Admixture mapping identifies 8q24 as a prostate cancer risk locus in African-American men.

Authors:  Matthew L Freedman; Christopher A Haiman; Nick Patterson; Gavin J McDonald; Arti Tandon; Alicja Waliszewska; Kathryn Penney; Robert G Steen; Kristin Ardlie; Esther M John; Ingrid Oakley-Girvan; Alice S Whittemore; Kathleen A Cooney; Sue A Ingles; David Altshuler; Brian E Henderson; David Reich
Journal:  Proc Natl Acad Sci U S A       Date:  2006-08-31       Impact factor: 11.205

3.  Sarcoidosis is associated with a truncating splice site mutation in BTNL2.

Authors:  Ruta Valentonyte; Jochen Hampe; Klaus Huse; Philip Rosenstiel; Mario Albrecht; Annette Stenzel; Marion Nagy; Karoline I Gaede; Andre Franke; Robert Haesler; Andreas Koch; Thomas Lengauer; Dirk Seegert; Norbert Reiling; Stefan Ehlers; Eberhard Schwinger; Matthias Platzer; Michael Krawczak; Joachim Müller-Quernheim; Manfred Schürmann; Stefan Schreiber
Journal:  Nat Genet       Date:  2005-02-27       Impact factor: 38.330

4.  Analysis of BTNL2 genetic polymorphisms in British and Dutch patients with sarcoidosis.

Authors:  P Spagnolo; H Sato; J C Grutters; E A Renzoni; S E Marshall; H J T Ruven; A U Wells; A Tzouvelekis; C H M van Moorsel; J M M van den Bosch; R M du Bois; K I Welsh
Journal:  Tissue Antigens       Date:  2007-09

5.  HLA-DQB1*0201: a marker for good prognosis in British and Dutch patients with sarcoidosis.

Authors:  Hiroe Sato; Jan C Grutters; Panagiotis Pantelidis; A Neil Mizzon; Tariq Ahmad; Arend-Jan Van Houte; Jan-Willem J Lammers; Jules M M Van Den Bosch; Kenneth I Welsh; Roland M Du Bois
Journal:  Am J Respir Cell Mol Biol       Date:  2002-10       Impact factor: 6.914

6.  Admixture mapping of quantitative trait loci for BMI in African Americans: evidence for loci on chromosomes 3q, 5q, and 15q.

Authors:  Analabha Basu; Hua Tang; Donna Arnett; C Charles Gu; Tom Mosley; Sharon Kardia; Amy Luke; Bamidele Tayo; Richard Cooper; Xiaofeng Zhu; Neil Risch
Journal:  Obesity (Silver Spring)       Date:  2009-02-19       Impact factor: 5.002

7.  Racial differences in sarcoidosis granuloma density.

Authors:  Robert R Burke; Chad H Stone; Suzanne Havstad; Benjamin A Rybicki
Journal:  Lung       Date:  2008-08-21       Impact factor: 2.584

8.  Hospitalization for patients with sarcoidosis: 1979-2000.

Authors:  Marilyn G Foreman; David M Mannino; Lois Kamugisha; Gloria E Westney
Journal:  Sarcoidosis Vasc Diffuse Lung Dis       Date:  2006-06       Impact factor: 0.670

9.  Genome-wide linkage and admixture mapping of type 2 diabetes in African American families from the American Diabetes Association GENNID (Genetics of NIDDM) Study Cohort.

Authors:  Steven C Elbein; Swapan K Das; D Michael Hallman; Craig L Hanis; Sandra J Hasstedt
Journal:  Diabetes       Date:  2008-10-07       Impact factor: 9.461

10.  The 5q31 variants associated with psoriasis and Crohn's disease are distinct.

Authors:  Yonghong Li; Monica Chang; Steven J Schrodi; Kristina P Callis-Duffin; Nori Matsunami; Daniel Civello; Nam Bui; Joseph J Catanese; Mark F Leppert; Gerald G Krueger; Ann B Begovich
Journal:  Hum Mol Genet       Date:  2008-07-09       Impact factor: 6.150

View more
  14 in total

1.  Replication of genetic loci for sarcoidosis in US black women: data from the Black Women's Health Study.

Authors:  Yvette Cozier; Edward Ruiz-Narvaez; Craig McKinnon; Jeffrey Berman; Lynn Rosenberg; Julie Palmer
Journal:  Hum Genet       Date:  2013-03-31       Impact factor: 4.132

2.  Efficient generalized least squares method for mixed population and family-based samples in genome-wide association studies.

Authors:  Jia Li; James Yang; Albert M Levin; Courtney G Montgomery; Indrani Datta; Sheri Trudeau; Indra Adrianto; Paul McKeigue; Michael C Iannuzzi; Benjamin A Rybicki
Journal:  Genet Epidemiol       Date:  2014-05-20       Impact factor: 2.135

3.  Admixture mapping identifies a quantitative trait locus associated with FEV1/FVC in the COPDGene Study.

Authors:  Margaret M Parker; Marilyn G Foreman; Haley J Abel; Rasika A Mathias; Jacqueline B Hetmanski; James D Crapo; Edwin K Silverman; Terri H Beaty
Journal:  Genet Epidemiol       Date:  2014-08-11       Impact factor: 2.135

Review 4.  Granuloma genes in sarcoidosis: what is new?

Authors:  Annegret Fischer; Benjamin A Rybicki
Journal:  Curr Opin Pulm Med       Date:  2015-09       Impact factor: 3.155

5.  Association of HLA-DRB1 with Sarcoidosis Susceptibility and Progression in African Americans.

Authors:  Albert M Levin; Indra Adrianto; Indrani Datta; Michael C Iannuzzi; Sheri Trudeau; Jia Li; Wonder P Drake; Courtney G Montgomery; Benjamin A Rybicki
Journal:  Am J Respir Cell Mol Biol       Date:  2015-08       Impact factor: 6.914

6.  Association of ANXA11 genetic variation with sarcoidosis in African Americans and European Americans.

Authors:  A M Levin; M C Iannuzzi; C G Montgomery; S Trudeau; I Datta; P McKeigue; A Fischer; A Nebel; B A Rybicki
Journal:  Genes Immun       Date:  2012-11-15       Impact factor: 2.676

7.  Extending admixture mapping to nuclear pedigrees: application to sarcoidosis.

Authors:  Paul M McKeigue; Marco Colombo; Felix Agakov; Indrani Datta; Albert Levin; David Favro; Courtney Gray-Montgomery; Michael C Iannuzzi; Benjamin A Rybicki
Journal:  Genet Epidemiol       Date:  2013-01-31       Impact factor: 2.135

8.  Admixture mapping of prostate cancer in African Americans participating in the North Carolina-Louisiana Prostate Cancer Project (PCaP).

Authors:  Jeannette T Bensen; Zongli Xu; Paul M McKeigue; Gary J Smith; Elizabeth T H Fontham; James L Mohler; Jack A Taylor
Journal:  Prostate       Date:  2013-09-04       Impact factor: 4.104

9.  Genome-wide association study of African and European Americans implicates multiple shared and ethnic specific loci in sarcoidosis susceptibility.

Authors:  Indra Adrianto; Chee Paul Lin; Jessica J Hale; Albert M Levin; Indrani Datta; Ryan Parker; Adam Adler; Jennifer A Kelly; Kenneth M Kaufman; Christopher J Lessard; Kathy L Moser; Robert P Kimberly; John B Harley; Michael C Iannuzzi; Benjamin A Rybicki; Courtney G Montgomery
Journal:  PLoS One       Date:  2012-08-27       Impact factor: 3.240

10.  Expression and sub-cellular localization of an epigenetic regulator, co-activator arginine methyltransferase 1 (CARM1), is associated with specific breast cancer subtypes and ethnicity.

Authors:  Melissa B Davis; Xinyu Liu; Shiyao Wang; Jaxk Reeves; Andrey Khramtsov; Dezheng Huo; Olufunmilayo I Olopade
Journal:  Mol Cancer       Date:  2013-05-10       Impact factor: 27.401

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.