Myopia in African Americans Is Significantly Linked to Chromosome 7p15.2-14.2.

Claire L Simpson1,2, Anthony M Musolf2, Roberto Y Cordero1, Jennifer B Cordero1, Laura Portas2, Federico Murgia2, Deyana D Lewis2, Candace D Middlebrooks2, Elise B Ciner3, Joan E Bailey-Wilson1, Dwight Stambolian4.   

Abstract

Purpose: The purpose of this study was to perform genetic linkage analysis and association analysis on exome genotyping from highly aggregated African American families with nonpathogenic myopia. African Americans are a particularly understudied population with respect to myopia.
Methods: One hundred six African American families from the Philadelphia area with a family history of myopia were genotyped using an Illumina ExomePlus array and merged with previous microsatellite data. Myopia was initially measured as mean spherical equivalent (MSE) and converted to a binary phenotype where individuals were identified as affected, unaffected, or unknown. Parametric linkage analysis was performed on both individual variants (single-nucleotide polymorphisms [SNPs] and microsatellites) as well as gene-based markers. Family-based association analysis and transmission disequilibrium test (TDT) analysis modified for rare variants were also performed.
Results: Genetic linkage analysis identified 2 genomewide significant variants at 7p15.2 and 7p14.2 (in the intergenic region between MIR148A and NFE2L3 and in the noncoding RNA LOC401324) and 2 genomewide significant genes (CRHR2 and AVL9) both at 7p14.3. No genomewide results were found in the association analyses.
Conclusions: This study identified a significant linkage peak in African American families for myopia at 7p15.2 to 7p14.2, the first potential risk locus for myopia in African Americans. Interesting candidate genes are located in the region, including PDE1C, which is highly expressed in the eyes, and known to be involved in retinal development. Further identification of the causal variants at this linkage peak will help elucidate the genetics of myopia in this understudied population.

Year:  2021        PMID: 34241624      PMCID: PMC8287048          DOI: 10.1167/iovs.62.9.16

Source DB:  PubMed          Journal:  Invest Ophthalmol Vis Sci        ISSN: 0146-0404            Impact factor:   4.799


More people in the world are afflicted with myopia than any other eye disorder. The World Health Organization defines uncorrected refractive errors like myopia as visual impairments and estimates that about 153 million people worldwide are living with uncorrected refractive errors. One quarter of the American population is myopic, and prevalence is rising. Lower-income and disadvantaged populations are particularly at risk because they lack the finances to correct the impairment and thus suffer more than more affluent populations. Myopia is a complex disease caused by both genetic and environmental factors, making analysis of the phenotype challenging. Multiple environmental factors have been identified, including education level and time spent outside. Multiple genetic factors have also been identified as contributing to myopia risk; genetic studies of myopia consist of both family-based linkage studies and population-based association studies. Each method has its advantages and disadvantages. Population-based association studies, specifically genomewide association studies (GWAS), are more effective at identifying common variants with a small to moderate effect on the trait. Many GWAS have found genomewide significant variants associated with myopia and its quantitative phenotype, refractive error. Family-based linkage studies are effective at finding rare, highly penetrant variants. Variants that are rare in the population at large may be common within an individual family. Family-based studies can also offer better coverage of the genome via longer haplotypes. Haplotypes in population-based studies have been broken apart by generations of recombination so only a small number of variants are in linkage disequilibrium (LD). By contrast, the number of meioses that can occur within a given family is quite small, creating longer haplotypes within a family that can be used to tag rare or ungenotyped causal variants in LD with the linked variants. The drawback is that additional variants along the linked haplotype mean that identifying the actual causal variant is more difficult. Genomewide significant linked loci have been identified for both refractive error and myopia. Multiple studies have reported linkage using common myopia.
African Americans have been particularly understudied for myopia. Initial studies showed that African Americans had a lower prevalence of myopia than Caucasians; more recent studies show myopia prevalence in African American children is approximately equal to that in Caucasian children. African American children have been shown to have both a higher percentage of new myopia cases and a higher odds ratio for myopia risk than Caucasian children. Other African groups show various prevalences in children: 3% in Ghana and South Africa, and 10% for African Caribbean children in England. A more recent study by Jiang et al. showed that a parent with myopia was associated with an increase of myopia risk in children of multiple populations, including African Americans. A 2020 review by Grzybowski et al. concluded that myopia prevalences in children are rising in Asia, Europe, and North America but are under 10% in South America and Africa. Despite ample evidence for myopia prevalence, there has been a paucity of genetic studies with African samples, with zero GWAS and only a handful of microsatellite-based linkage studies.
Our study is the first genetic analysis using SNP genotypes in African American families with a history of myopia. We used an exome-based microarray for increased coverage of rare variants. A subset of the families used in this study were part of a previous study that found significant linkage to 7p15 with refractive error using microsatellites. In that earlier study, using myopia affection as the phenotype did not result in replication of the signal, nor was the signal replicated in meta-analysis with other populations.

Methods

Patient Recruitment

We collected data from 517 individuals from 106 African American families in the Philadelphia metropolitan area as part of the Family Myopia Study. Prospective participants were identified through database review, mailings, clinical visits, interviews, and referrals from private doctors. Eligible families were required to have at least three participants, including at least one parent with myopia, and one myopic sibling. All study participants provided informed consent and protocols adhered to the tenets of the Declaration of Helsinki. The study was approved by the institutional review boards of the University of Pennsylvania and the National Human Genome Research Institute. All participants received a comprehensive eye examination, including visual acuity, slit lamp biomicroscopy, dilated fundus examination, and manifest refraction. Patients older than 41 years had their refraction measured with manifest refraction, whereas patients 41 years and younger had their refraction measured using cycloplegic refraction. The measurement used in this study was mean spherical equivalent (MSE), measured in diopters (D), which is calculated by adding the spherical component to one-half the cylindrical component and averaging for both eyes.
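The MSE definition above can be written out as a short calculation. The following is a minimal sketch, not the study's own code: the function names and the example refraction values are hypothetical, and it assumes each eye's refraction is given as a (sphere, cylinder) pair in diopters.

```python
from statistics import mean
from typing import Tuple

Refraction = Tuple[float, float]  # (sphere, cylinder), both in diopters


def spherical_equivalent(sphere_d: float, cylinder_d: float) -> float:
    """Spherical equivalent of one eye: sphere plus half the cylinder."""
    return sphere_d + 0.5 * cylinder_d


def mean_spherical_equivalent(right_eye: Refraction, left_eye: Refraction) -> float:
    """MSE of a participant: the per-eye spherical equivalents averaged over both eyes."""
    return mean([spherical_equivalent(*right_eye), spherical_equivalent(*left_eye)])


# Hypothetical participant: right eye -2.00 / -0.50, left eye -1.50 / -1.00
example_mse = mean_spherical_equivalent((-2.00, -0.50), (-1.50, -1.00))
print(f"MSE = {example_mse:.3f} D")
```

For the hypothetical values shown, the per-eye spherical equivalents are −2.25 D and −2.00 D, giving an MSE of −2.125 D.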

Genotyping and Quality Control

Five hundred seventeen subject DNA samples were genotyped using an Illumina ExomePlus array at the Center for Inherited Disease Research (CIDR) at Johns Hopkins University (Baltimore, MD, USA). Variants were filtered to a mean call rate of 99%, and any variant with a quality score of 0.15 or less was set to missing. Monomorphic markers were removed using PLINK. Sib-pair was used to identify Mendelian inconsistencies. Markers with a Mendelian inconsistency in a single family were removed from that family; markers with multiple Mendelian errors were removed from all families. PLINK and Prest-Plus were used to verify familial relationships by calculating identity by descent (IBD) values. Ten ungenotyped individuals were added to the data set to connect disjointed pedigrees. The existence of these individuals was confirmed by family history, but they were either unwilling or unavailable to participate. Their phenotypes and genotypes were coded as unknown. The single-nucleotide polymorphism (SNP) data were merged with a previous set of 367 microsatellite genotypes; 493 individuals from the exome-based array had microsatellite data. The final data set consisted of 527 individuals with 98,631 markers.

Myopia Affection Classification

Subjects were classified as affected, unaffected, or unknown with respect to myopia. Individuals with MSE ≤ −1.0 D were coded as affected. Adult participants (21 years and over) with an MSE ≥ 0.0 D were coded as unaffected, and adults with an MSE between −1.0 and 0.0 D were coded as unknown in order to avoid potential misclassification errors. We used extra caution when coding children as unaffected, because normal childhood developmental changes can result in misclassification. Children aged 5 to 10 years were coded as unaffected only if their MSE was ≥ 2.0 D and as unknown between −1.0 and 2.0 D. Children aged 11 to 20 years were coded as unaffected if their MSE was ≥ 1.5 D and as unknown between −1.0 and 1.5 D. All children were coded as affected if their MSE was ≤ −1.0 D. These thresholds were based on ophthalmological guidelines as to what levels of childhood refraction are most indicative of myopia in adulthood, and the large number of children deemed to be unknown is designed to allow for uncertainty in these projections. The final data set contained a total of 527 individuals (295 affected, 100 unaffected, and 132 unknown/missing). The data were 58.06% female individuals (306 female individuals to 221 male individuals). The average MSE was −2.78 D with a standard deviation of 3.60. The mean age of the entire data set was 40.37 years with a standard deviation of 19. The mean age of the adults (21 years and above) was 47.42 years (standard deviation of 14.83) and the mean age of the children (20 years and below) was 14.15 years (standard deviation of 3.62).
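The age-dependent coding rules above can be summarized as a small decision function. This is an illustrative sketch only; the function name and return labels are hypothetical. The text does not say how children under 5 years or individuals with a missing age or MSE were handled, so coding them as unknown here is an assumption.

```python
from typing import Optional


def classify_myopia(age_years: Optional[float], mse_d: Optional[float]) -> str:
    """Return 'affected', 'unaffected', or 'unknown' following the rules in the text."""
    if mse_d is None:
        return "unknown"          # assumption: no refraction means no phenotype
    if mse_d <= -1.0:
        return "affected"         # same cutoff for adults and children
    if age_years is None:
        return "unknown"          # assumption: unaffected status needs a known age
    if age_years >= 21:
        unaffected_cutoff = 0.0   # adults
    elif age_years >= 11:
        unaffected_cutoff = 1.5   # children aged 11 to 20
    elif age_years >= 5:
        unaffected_cutoff = 2.0   # children aged 5 to 10
    else:
        return "unknown"          # assumption: under 5 is not covered by the text
    return "unaffected" if mse_d >= unaffected_cutoff else "unknown"


# Hypothetical participants as (age in years, MSE in diopters)
for age, mse in [(34, -2.5), (34, -0.5), (34, 0.25), (8, 1.0), (15, 1.5)]:
    print(age, mse, classify_myopia(age, mse))
```

The wide "unknown" band for children is what makes the coding conservative: a child is counted as unaffected only when clearly hyperopic for their age.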

Allele Frequency Estimation

Allele frequencies for the data set were calculated in Sib-pair. Estimating allele frequencies directly from an ethnically homogeneous data set properly controls type I error rates in parametric linkage studies.

Parametric Linkage Analysis

We performed both variant-based and gene-based parametric linkage analyses. In linkage analyses, all individuals (including those with missing/unknown phenotypes) are included in the analyses. This is because individuals with missing/unknown phenotypes but with genotype information provide information about allele transmission (thus contributing to the overall LOD score of the variant), and even individuals with no phenotype or genotype information may be needed to provide relationship information to connect relatives with data and avoid disjointed pedigrees. Analyses assumed an autosomal dominant mode of inheritance, with a disease allele frequency of 0.01 and a penetrance of 0.9 for disease allele carriers and 0.1 for noncarriers. The 0.1 penetrance for noncarriers (the phenocopy rate) allows for a 10% chance that an individual has myopia for reasons other than a high-risk variant (e.g., environmental factors or polygenic inheritance). Variant-based analyses were two-point linkage analyses performed between the phenotype and each individual SNP, using TwoPointLods. Gene-based analyses used the collapsed haplotype pattern (CHP) method in SEQLinkage to build multi-allelic pseudomarkers, each of which corresponded to a gene. Two-point linkage analysis was then performed on the pseudomarkers using Merlin. We performed two sets of gene-based analyses: one analysis using rare variants only (MAF ≤ 0.05) and one analysis using all variants.
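To make the disease model concrete, the sketch below works out what the stated parameters imply. It is an illustration rather than part of the analysis pipeline: the function names are hypothetical, and it assumes Hardy-Weinberg proportions at the disease locus, which the text does not state explicitly.

```python
DISEASE_ALLELE_FREQ = 0.01    # from the text
PENETRANCE_CARRIER = 0.9      # from the text (dominant: one or two risk alleles)
PENETRANCE_NONCARRIER = 0.1   # from the text (phenocopy rate)


def carrier_probability(q: float) -> float:
    """P(at least one disease allele), assuming Hardy-Weinberg proportions."""
    return 1.0 - (1.0 - q) ** 2


def implied_prevalence(q: float, f_carrier: float, f_noncarrier: float) -> float:
    """Population prevalence implied by the penetrance model."""
    p_carrier = carrier_probability(q)
    return p_carrier * f_carrier + (1.0 - p_carrier) * f_noncarrier


def carrier_given_affected(q: float, f_carrier: float, f_noncarrier: float) -> float:
    """Bayes' rule: probability that a randomly drawn affected person is a carrier."""
    joint = carrier_probability(q) * f_carrier
    return joint / implied_prevalence(q, f_carrier, f_noncarrier)


prevalence = implied_prevalence(DISEASE_ALLELE_FREQ, PENETRANCE_CARRIER, PENETRANCE_NONCARRIER)
posterior = carrier_given_affected(DISEASE_ALLELE_FREQ, PENETRANCE_CARRIER, PENETRANCE_NONCARRIER)
print(f"implied prevalence: {prevalence:.4f}")
print(f"P(carrier | affected): {posterior:.4f}")
```

Under these assumptions the model implies a prevalence of about 11.6%, and only about 15% of affected individuals drawn at random from the population would carry the high-risk allele. This shows how much room the 0.1 phenocopy rate leaves for myopia from other causes.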

Family-Based Association Analysis

We used the family-based association test (FBAT) to perform variant-based and gene-based association analyses. We also used the rare variant transmission disequilibrium test (RV-TDT), choosing the most informative trio from each extended family: two parents (one affected and one unaffected) and an affected child, taking the trio with the highest genotyping rate.
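The trio selection described above could be sketched as follows. Everything here is hypothetical scaffolding: the `Person` record, its field names, and the choice to rank trios by the mean call rate of their three members are assumptions made for illustration, since the text does not specify how the genotyping rate of a trio was computed.

```python
from dataclasses import dataclass
from typing import Dict, List, Optional, Tuple


@dataclass(frozen=True)
class Person:
    pid: str
    father: Optional[str]   # ID of the father, or None for founders
    mother: Optional[str]   # ID of the mother, or None for founders
    status: str             # "affected", "unaffected", or "unknown"
    call_rate: float        # fraction of markers with a genotype call


Trio = Tuple[Person, Person, Person]  # (father, mother, child)


def best_trio(family: List[Person]) -> Optional[Trio]:
    """Pick the affected-child trio with one affected and one unaffected parent
    that has the highest mean call rate (assumed ranking criterion)."""
    by_id: Dict[str, Person] = {p.pid: p for p in family}
    best: Optional[Trio] = None
    best_rate = -1.0
    for child in family:
        if child.status != "affected":
            continue
        father = by_id.get(child.father) if child.father else None
        mother = by_id.get(child.mother) if child.mother else None
        if father is None or mother is None:
            continue  # both parents must be present in the pedigree
        if {father.status, mother.status} != {"affected", "unaffected"}:
            continue  # exactly one affected and one unaffected parent
        rate = (father.call_rate + mother.call_rate + child.call_rate) / 3.0
        if rate > best_rate:
            best, best_rate = (father, mother, child), rate
    return best


# Hypothetical nuclear family with two eligible affected children
family = [
    Person("F", None, None, "affected", 0.99),
    Person("M", None, None, "unaffected", 0.97),
    Person("C1", "F", "M", "affected", 0.90),
    Person("C2", "F", "M", "affected", 0.98),
    Person("C3", "F", "M", "unaffected", 1.00),
]
selected = best_trio(family)
print([p.pid for p in selected] if selected else None)
```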

Functional Annotation

All variants were annotated using wANNOVAR and CRAVAT, which provide information about SNP location, function, and frequency across multiple populations. They also provide protein predictions from multiple programs such as SIFT, PolyPhen2, CADD, and REVEL.

Gene Expression in Human Ocular Tissues

To identify high-priority candidate genes, we examined ocular tissue expression of the significant/suggestive genes from our analyses. Gene expression in human ocular tissues was inspected in the publicly available web resources eyeIntegration and the Ocular Tissue Database. The eyeIntegration database provides the largest RNA-seq based transcriptome database of healthy human eye tissues and hundreds of Genotype-Tissue Expression (GTEx) tissue samples. The Ocular Tissue Database contains microarray expression data of 10 normal human ocular tissues. We compared the expression of our significant and suggestive genes in the eyes against two reference tissues (whole blood and pan-body synthetic subtissues). The pan-body synthetic set consisted of a stratified sample of 54 tissues in the GTEx data set.

Results

Variant-Based Linkage Results

Two variants exhibited genomewide significant linkage to myopia, with genomewide significant defined as (H)LOD ≥ 3.3 and genomewide suggestive as (H)LOD ≥ 1.9, in accordance with the recommendations of Lander and Kruglyak. The SNP rs4719841 is in the intergenic region between MIR148A and NFE2L3 at 7p15.2 (HLOD = 4.34), and rs235397 is located in the noncoding RNA LOC401324 at 7p14.2 (HLOD = 3.43; Fig. 1A). Twenty-five suggestively linked SNPs were found at 7p15.2-14.2 (Fig. 1B). Thirty-three suggestive SNPs were found on other chromosomes, with the largest concentration at 1p36.1-36.11 (Table 1).
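For readers unfamiliar with heterogeneity LOD scores, the sketch below shows one common way an HLOD can be derived from per-family LOD scores, and how the thresholds above are applied. It assumes the standard admixture formulation evaluated at a fixed recombination fraction, maximized over the proportion of linked families (alpha) on a simple grid; the text does not describe the internals of the software used, so this is not necessarily how the reported HLODs were computed. The function names and the per-family LOD values are hypothetical.

```python
import math
from typing import Sequence

SIGNIFICANT_THRESHOLD = 3.3   # Lander and Kruglyak, as used in the text
SUGGESTIVE_THRESHOLD = 1.9


def hlod(family_lods: Sequence[float], grid_steps: int = 1000) -> float:
    """Maximize sum_i log10(alpha * 10**LOD_i + 1 - alpha) over alpha in [0, 1]."""
    best = 0.0  # alpha = 0 always gives a score of exactly 0
    for step in range(1, grid_steps + 1):
        alpha = step / grid_steps
        total = sum(math.log10(alpha * 10.0 ** lod + (1.0 - alpha)) for lod in family_lods)
        best = max(best, total)
    return best


def classify_lod(score: float) -> str:
    """Label a (H)LOD score using the genomewide thresholds."""
    if score >= SIGNIFICANT_THRESHOLD:
        return "significant"
    if score >= SUGGESTIVE_THRESHOLD:
        return "suggestive"
    return "neither"


# Hypothetical per-family LOD scores: some families support linkage, others do not
family_lods = [1.2, 0.9, -0.8, 1.1, -0.5, 0.7]
cumulative_lod = sum(family_lods)
heterogeneity_lod = hlod(family_lods)
print(f"cumulative LOD = {cumulative_lod:.2f}, HLOD = {heterogeneity_lod:.2f}")
print(classify_lod(4.34), classify_lod(3.01), classify_lod(1.5))
```

Because alpha = 1 reproduces the cumulative LOD and alpha = 0 gives zero, an HLOD computed this way is never smaller than either value, which is consistent with the HLOD column being at least as large as the cumulative LOD column in the gene-based results.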
Figure 1.

HLOD scores for variant-based two-point linkage analysis. (A) The genomewide HLOD scores (B) the HLOD scores for chromosome 7. In both, the lines at 3.3 and 1.9 represent the respective significant and suggestive thresholds as suggested by Lander and Kruglyak.

Table 1.

All Significant and Suggestive HLOD Scores From Variant Based Linkage

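CHR | rsID | POS | HLOD | GENE | FUNC | EXON | FREQ | SIFT | POLYPH | FATHMM | CADD | REVEL
7 | rs4719841 | 25997536 | 4.34 | MIR148A; NFE2L3 | Intergenic | . | 0.27 | . | . | . | . | .
7 | rs235397 | 35372749 | 3.42 | LOC401324 | ncRNA | . | 0.20 | . | . | . | . | .
7 | rs6462100 | 28754095 | 3.01 | CREB5 | Intronic | . | 0.40 | . | . | . | . | .
7 | rs7797330 | 30895010 | 2.84 | INMT-MINDY4 | ncRNA | . | 0.38 | . | . | . | . | .
7 | rs7779240 | 27562660 | 2.82 | EVX1; HIBADH | Intergenic | . | 0.15 | . | . | . | . | .
20 | rs3746736 | 23424613 | 2.75 | CSTL1 | Exonic | nonsyn | 0.20 | T | B | T | 0.003 | 0.086
9 | rs10757225 | 21555445 | 2.73 | MIR31HG | ncRNA | . | 0.18 | . | . | . | . | .
2 | rs1920511 | 41792845 | 2.59 | SLC8A1; LINC01913 | Intergenic | . | 0.33 | . | . | . | . | .
7 | rs10270663 | 34786398 | 2.58 | NPSR1-AS1 | ncRNA | . | 0.20 | . | . | . | . | .
7 | rs1427483 | 33959239 | 2.49 | BMPER | Intronic | . | 0.29 | . | . | . | . | .
9 | rs61757558 | 117118379 | 2.48 | AKNA | Exonic | nonsyn | 0.06 | D | B | T | 22.6 | 0.019
7 | rs2270219 | 31877261 | 2.42 | PDE1C | Intronic | . | 0.23 | . | . | . | . | .
7 | rs3735400 | 36438709 | 2.40 | ANLN | Exonic | nonsyn | 0.12 | D | D | T | 29.7 | 0.204
7 | rs6462088 | 28504566 | 2.40 | CREB5 | Intronic | . | 0.24 | . | . | . | . | .
7 | rs2011974 | 32611392 | 2.35 | AVL9 | Intronic | . | 0.34 | . | . | . | . | .
6 | rs214950 | 152708310 | 2.29 | SYNE1 | Exonic | nonsyn | 0.15 | D | B | T | 7.324 | 0.104
7 | rs10266620 | 31957550 | 2.29 | PDE1C | Intronic | . | 0.26 | . | . | . | . | .
1 | rs7550997 | 26596080 | 2.27 | CEP85 | Exonic | nonsyn | 0.18 | T | B | T | 15.09 | 0.043
1 | rs8564 | 26605069 | 2.27 | CEP85 | UTR3 | . | 0.18 | . | . | . | . | .
1 | rs7544 | 26607726 | 2.27 | SH3BGRL3 | UTR3 | . | 0.18 | . | . | . | . | .
1 | rs10493030 | 26561856 | 2.27 | CEP85 | Intronic | . | 0.18 | . | . | . | . | .
1 | rs10902732 | 26606174 | 2.27 | SH3BGRL3; CEP85 | Intergenic | . | 0.18 | . | . | . | . | .
1 | rs11247900 | 26612460 | 2.27 | UBXN11 | Exonic | syn | 0.18 | . | . | . | . | .
1 | rs11577318 | 26601570 | 2.27 | CEP85 | Intronic | . | 0.18 | . | . | . | . | .
1 | rs17163746 | 26564230 | 2.27 | CEP85 | Intronic | . | 0.18 | . | . | . | . | .
1 | rs17163749 | 26568165 | 2.26 | CEP85 | Intronic | . | 0.18 | . | . | . | . | .
10 | rs61729846 | 5920244 | 2.26 | ANKRD16 | Exonic | nonsyn | 0.20 | D | D | T | 26.1 | 0.690
18 | rs387462 | 3458997 | 2.26 | TGIF1 | downstream | . | 0.36 | . | . | . | . | .
7 | rs6952967 | 31795856 | 2.24 | PDE1C | Intronic | . | 0.47 | . | . | . | . | .
6 | rs791183 | 160610124 | 2.23 | SLC22A1; SLC22A2 | Intergenic | . | 0.40 | . | . | . | . | .
7 | rs1420123 | 29647662 | 2.16 | PRR15; LOC646762 | Intergenic | . | 0.27 | . | . | . | . | .
7 | rs1029602 | 24571485 | 2.15 | NPY; MPP6 | Intergenic | . | 0.38 | . | . | . | . | .
7 | rs4291168 | 31178749 | 2.14 | ADCYAP1R1; NEUROD6 | Intergenic | . | 0.18 | . | . | . | . | .
20 | rs6036107 | 22403287 | 2.13 | LOC284788; LINC00261 | Intergenic | . | 0.33 | . | . | . | . | .
1 | rs2228579 | 1223385 | 2.13 | SCNN1D | Exonic | nonsyn | 0.33 | T | B | T | 0.003 | 0.036
6 | rs34544438 | 167438292 | 2.12 | FGFR1OP | Exonic | nonsyn | 0.07 | T | B | T | 0.268 | 0.119
7 | rs1285407 | 9266388 | 2.11 | NXPH1; PER4 | Intergenic | . | 0.12 | . | . | . | . | .
7 | rs12113424 | 35423720 | 2.11 | LOC401324; HERPUD2 | Intergenic | . | 0.19 | . | . | . | . | .
2 | rs13424561 | 73868446 | 2.11 | NAT8 | Exonic | nonsyn | 0.11 | T | B | T | 0.166 | 0.01
3 | rs36117895 | 11400019 | 2.10 | ATG7 | Exonic | nonsyn | 0.12 | D | P | T | 25.3 | 0.227
9 | rs10511687 | 20764870 | 2.10 | FOCAD | Exonic | nonsyn | 0.32 | T | B | T | 14.75 | 0.069
7 | rs6415258 | 32192596 | 2.10 | PDE1C | Intronic | . | 0.27 | . | . | . | . | .
7 | rs212837 | 26695215 | 2.08 | C7orf71; SKAP2 | Intergenic | . | 0.35 | . | . | . | . | .
1 | rs12138111 | 26590432 | 2.07 | CEP85 | Intronic | . | 0.19 | . | . | . | . | .
18 | rs77600482 | 3460731 | 2.06 | TGIF1; GAPLINC | Intergenic | . | 0.15 | . | . | . | . | .
9 | rs10973446 | 37638744 | 2.05 | TOMM5; FRMPD1 | Intergenic | . | 0.45 | . | . | . | . | .
18 | rs381234 | 3464650 | 2.02 | TGIF1; GAPLINC | Intergenic | . | 0.38 | . | . | . | . | .
7 | rs731844 | 34150264 | 2.02 | BMPER | Intronic | . | 0.39 | . | . | . | . | .
5 | rs7715811 | 13769974 | 2.01 | DNAH5 | Intronic | . | 0.39 | . | . | . | . | .
5 | rs1502050 | 13779743 | 2.01 | DNAH5 | Intronic | . | 0.38 | . | . | . | . | .
7 | rs10224983 | 34180326 | 1.98 | BMPER | Intronic | . | 0.15 | . | . | . | . | .
7 | rs961652 | 34111660 | 1.95 | BMPER | Intronic | . | 0.30 | . | . | . | . | .
7 | rs16480 | 24311069 | 1.95 | STK31; NPY | Intergenic | . | 0.38 | . | . | . | . | .
7 | rs2033670 | 22929061 | 1.94 | SNHG26; FAM126A | Intergenic | . | 0.33 | . | . | . | . | .
10 | rs7071768 | 129903016 | 1.93 | MKI67 | Exonic | nonsyn | 0.47 | T | B | T | 0.001 | 0.022
1 | rs10908292 | 36764770 | 1.92 | THRAP3 | Intronic | . | 0.39 | . | . | . | . | .
20 | rs5741809 | 36956026 | 1.92 | BPI | Exonic | nonsyn | 0.14 | T | B | T | 0.001 | 0.011
7 | rs2392246 | 33571828 | 1.91 | BBS9 | Intronic | . | 0.12 | . | . | . | . | .
7 | rs976681 | 24530016 | 1.91 | NPY; MPP6 | Intergenic | . | 0.36 | . | . | . | . | .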

The list of all significant and suggestive variants from the variant-based linkage analyses, sorted by HLOD. Here, the headers represent: CHR = chromosome, rsID = rsID of the SNP, POS = physical position of the SNP in base pairs, HLOD = heterogeneity LOD score across all 106 families, GENE = gene location of the SNP (if intergenic, the two closest genes), FUNC = function of the SNP (e.g., exonic, intronic), EXON = if exonic, the exonic function of the SNP (nonsyn = nonsynonymous, syn = synonymous), FREQ = frequency of the variant in gnomAD Africans, SIFT = SIFT prediction (T = tolerated, D = damaging), POLYPH = PolyPhen2 prediction (B = benign, P = possibly damaging, D = damaging), FATHMM = FATHMM prediction (T = tolerated), CADD = CADD phred score (≥ 10 corresponds to the 10% most deleterious substitutions in the genome, ≥ 20 corresponds to the 1% most deleterious substitutions in the genome, etc.), REVEL = REVEL score (corresponds to the proportion of trees in the random forest algorithm that classified the variant as pathogenic). A period (.) indicates that no value is available.

Functional Annotation of Variants

The SNP rs4719841 is intergenic and has an MAF of 0.2656 in the gnomAD database and 0.26 in our data set. The SNP rs235397, in the noncoding RNA LOC401324 at 7p14.2, has an MAF of 0.2 in both gnomAD and our data set. The only exonic variant in 7p15.2-14.2 was rs3735400 (HLOD = 2.22). This variant is in ANLN (7p14.2), is nonsynonymous, has a Combined Annotation Dependent Depletion (CADD) score over 29, is predicted to be damaging by SIFT and PolyPhen2, and has an MAF of 0.1201 in gnomAD and 0.13 in our data set.

Gene-Based Linkage Results

The gene-based analysis using rare variants (MAF ≤ 0.05) produced no genomewide significant results and four genomewide suggestive genes (Fig. 2A). The two top genes were located at 7p14.3: INMT-MINDY4 (HLOD = 2.35) and MINDY4 (HLOD = 1.99), which are listed in Table 2 under their former symbols INMT-FAM188B and FAM188B.
Figure 2.

Genomewide HLOD scores for gene-based two-point linkage analysis. (A) The gene-based HLOD scores using only the rare variants (MAF ≤ 0.05) and (B) the gene-based HLOD scores using all variants. The lines at 3.3 and 1.9 represent the respective significant and suggestive thresholds as suggested by Lander and Kruglyak.

Using all variants, the genomewide significant signal is recovered at the 7p14.3 band (Fig. 2B). This makes sense, as the variants identified in the variant-based analysis were common (MAF > 0.05). The genomewide significant genes were CRHR2 (HLOD = 4.06) and AVL9 (HLOD = 3.99). There were an additional 23 suggestive genes, with 7 genes in the 7p14.3-14.2 region (Table 2, Supplementary Fig. S1).
Table 2.

All Significant and Suggestive Genes From the Gene-based Linkage Analysis

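CHR | POS (cM) | GENE | CUMUL LOD | HLOD | VARIANTS
7 | 48.34 | CRHR2 | 3.98 | 4.06 | All
7 | 50.78 | AVL9 | 3.99 | 3.99 | All
9 | 59.79 | DNAI1 | 2.81 | 2.81 | All
7 | 52.59 | NPSR1-AS1 | 2.74 | 2.74 | All
7 | 52.05 | BMPER | 1.86 | 2.73 | All
20 | 54.18 | BPIFA2 | 2.64 | 2.65 | All
7 | 54.00 | SEPT7 | 2.34 | 2.61 | All
20 | 28.26 | PAK7 | 1.75 | 2.47 | All
1 | 49.16 | EPHB2 | 2.44 | 2.44 | All
9 | 43.87 | FOCAD | 2.14 | 2.37 | All
18 | 8.08 | SMCHD1 | 2.37 | 2.37 | All
3 | 39.61 | EFHB | 1.77 | 2.36 | All
7 | 48.59 | INMT-FAM188B | 2.35 | 2.35 | Rare
13 | 36.59 | CSNK1A1L | 2.15 | 2.35 | All
6 | 92.24 | COL12A1 | 2.34 | 2.34 | All
13 | 52.86 | SETDB2 | 1.94 | 2.30 | All
7 | 50.30 | PDE1C | 2.21 | 2.21 | All
20 | 55.10 | CEP250 | 1.93 | 2.10 | All
14 | 97.59 | SERPINA9 | 1.35 | 2.09 | All
4 | 9.39 | EVC | 0.92 | 2.08 | All
7 | 54.69 | KIAA0895 | 1.34 | 2.04 | All
7 | 46.21 | LOC646762 | 1.49 | 1.99 | All
7 | 48.61 | FAM188B | 1.99 | 1.99 | Rare
9 | 111.62 | ZNF462 | 1.95 | 1.95 | All
4 | 9.39 | EVC | 1.93 | 1.93 | Rare
6 | 178.62 | PACRG | 0.72 | 1.92 | All
1 | 49.16 | EPHB2 | 1.92 | 1.92 | All
7 | 50.30 | PDE1C | 0.31 | 1.91 | All
9 | 123.13 | AKNA | 1.90 | 1.90 | Rare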

The list of all significant and suggestive genes from the gene-based linkage analyses, as sorted by HLOD. Here the headers represent: CHR = chromosome, POS = genetic position in cM of the gene, GENE = gene, CUMUL LOD = cumulative LOD score for the gene across all 106 families, HLOD = heterogeneity LOD score for the gene across all 106 families, VARIANTS = type of variants used in this test (All = all variants were used, Rare = only rare variants (MAF ≤ 0.05) were used).

Association Results

No genomewide significant (P < 5 × 10^−8) results were found in the association analyses. The most significant FBAT variant was rs887468 (P = 1.2 × 10^−5) in PSORS1C3 at 6p21.33. The most significant FBAT gene-based result was KCNS1 (P = 4.97 × 10^−5) at 20q12. The most significant RV-TDT result was the intergenic SNP rs11929331 (P = 2.5 × 10^−4) between LINC01994 and ATP11B at 3q26.44.

Gene Expression in Human Ocular Tissue

The eyeIntegration and Ocular Tissue databases revealed that the vast majority of the significant/suggestive genes were expressed in ocular tissues (Supplementary Table S1, Supplementary Figs. S2, S3). The suggestive genes (including ANLN and PDE1C) from the variant-based and gene-based analyses were found to have higher expression in most tissues of the eyes compared to whole blood and pan-body synthetic subtissues (Fig. 3, Supplementary Table S1). Other significant and suggestive genes (including AVL9, BBS9, and BMPER) have higher expression in most of the ocular tissues compared to at least one reference tissue group (Fig. 4, Supplementary Table S1). The Ocular Tissue Database showed that EPHB2, PDE1C, NPY, EVC, and COL12A1 have good expression in the adult human sclera, a tissue that may play a role in myopia pathogenesis (see Supplementary Figs. S2, S3). Both databases are primarily derived from European ancestry individuals, so expression may vary in African Americans, but there are no known databases with African American eye tissue expression data.
Figure 3.

Pan-human tissue differential expression of suggestive genes from the variant-based and gene-based linkage analyses. The x-axis shows the different types of tissues used in the test. The y-axis shows the log2 fold change of gene expression. The differential expression is shown relative to the reference tissue (whole blood).

Figure 4.

Pan-human tissue differential expression of additional significant and suggestive genes in the two-point linkage analysis. The x-axis shows the different types of tissues used in the test. The y-axis shows the log2 fold change of gene expression. The differential expression is being shown relative to the reference tissue (whole blood).

Discussion

We have identified a genomewide significant linkage between four markers (2 SNPs and 2 genes) at 7p15.2-14.2 and myopia in African American families. This is the largest genetic analysis of myopia in the African American population and the first using SNP genotypes. This is the first study to report a myopia risk locus in African Americans. African Americans have been severely understudied with respect to myopia and refractive error. We note that the 7p15 linkage signal is not entirely novel, as we have previously identified a genomewide significant linkage to refractive error at 7p15 in a subset of these African American families and replicated it in a Caucasian data set. Both of those studies used a different phenotype (quantitative refractive error scores), and the African American study used only microsatellite data and pointedly did not find significant linkage anywhere when myopia affection (the trait used in this study) was the analyzed trait. The risk locus identified at 7p14.2 is entirely novel; it was not identified in the microsatellite study. Further, the increased granularity of the SNP data (especially with the addition of the rare exonic variants) allowed us to locate regions and genes where the linked variants are accumulating, which cannot be done in a study with sparse microsatellite data only. This allowed us to go beyond defining a single linked region and offer specific genes within that linked region that might be causal, which is discussed in detail in the following paragraphs. The risk locus identified at 7p15 is known as MYP17, and, despite multiple replications, the causal gene remains unknown. Our significant linkage signals in the variant-based analysis are in the intergenic region between the transcription factor NFE2L3 and the noncoding RNA MIR148A at 7p15.2 and in the noncoding RNA LOC401324 at 7p14.2. None of these genes have any previously known connection to eye disorders.
The significant genes found by the gene-based analysis (CRHR2 and AVL9) were located slightly downstream at 7p14.3 and have no known association with eye disease. AVL9 and CRHR2 expression was enriched in most ocular tissues versus reference tissue groups. Linkage peaks are often broad, and locus heterogeneity and other factors add to the uncertainty. Thus, the true causal variant(s) may lie anywhere within the linked region, and it is important to examine the entire linked 7p region for candidate genes; doing so identifies multiple good candidates. The rs3735400 variant (HLOD = 2.4) in the anillin (ANLN) gene was predicted to be damaging by both SIFT and PolyPhen2 and had a CADD score over 29. Thus, the variant is likely to have a deleterious effect on gene/protein function. Anillin is an actin binding protein that is involved in cell migration, cell growth, and cytokinesis; it regulates actin cytoskeletal dynamics and has been implicated in multiple cancers. Its expression was shown to be elevated in eye tissue. PDE1C (7p14.3) exhibited suggestive evidence of linkage in the gene-based test (HLOD = 2.2) and had four suggestive intronic variants. PDE1C is a phosphodiesterase that has been reported to be involved in avian and rodent retinal development. PDE1C expression is significantly higher in both adult and fetal retina tissue and in adult and cell line RPE compared to whole blood and pan-body tissues. BMPER (7p14.3) was found to be suggestively linked in both the variant-based and gene-based analyses. BMPER has been shown to be functionally involved in eye organogenesis in mice. BBS9, which was found to have a genomewide suggestive intronic variant (HLOD = 1.9), is known to cause Bardet-Biedl syndrome, a symptom of which is retinal degeneration. Both BMPER and BBS9 were expressed at higher levels in ocular tissue than in the reference set.
In addition to the significantly linked region on 7p, we also identified several genomewide suggestive linkage signals, the most interesting being located at 1p36. This suggestive signal is a replication of a well-documented myopia/refractive error risk locus, MYP14, at 1p36. This is the first time that this linkage to myopia has been documented in African Americans. The causal gene at MYP14 remains elusive. Six of the 9 suggestive SNPs at 1p36 were intronic SNPs located in CEP85. CEP85 is involved in centrosome disjunction and has no known connection to eye disease. Other centrosomal proteins in the CEP family have been implicated in eye disease, like CEP250 in Usher syndrome (which has retinitis pigmentosa as one of its symptoms). The association analyses did not identify any genomewide significant signals. The top variant was in an intron of the psoriasis gene PSORS1C3. The top gene was KCNS1 at 20q12, a voltage-gated potassium channel protein that is functional in lens epithelia. The top SNP in the RV-TDT analysis was in an intergenic region between LINC01994 and ATP11B at 3q26.44. The 3q26 region is the site of the known myopia locus MYP8, and ATP11B has been shown to be present in mouse retina. It is not surprising that the linkage and association results tended to disagree, because they are testing for different things. The family-based association analysis relies on the risk allele being shared across families, either identical by state (IBS) or IBD. Linkage tracks co-segregation of haplotypes and the phenotype within a family but does not require that these haplotypes contain alleles IBS across different families. Although we have given evidence for some interesting candidate genes for the 7p and 1p signals, any implication of causality is speculative at this point.
Several genes, including the eye development genes PDE1C and BMPER, are good candidates for future follow-up studies, but additional studies will be needed to determine which variants are responsible for the observed linkage peaks. We also note that any novel linkage findings will need further replication in independent data sets. This study's major limitation was the coverage of the exome-enriched microarray. The causal variant may not have been identified; it is more likely that ungenotyped variants are segregating on the linked haplotypes in each family, in LD with the more common variants identified here. This haplotype tagging ability is one of the advantages of family-based linkage studies. It explains why most of the linked variants on the 7p haplotype are common: they are most likely tagging rare variants that were not genotyped on this limited microarray. It also explains why we identified only a genomewide suggestive signal at 7p when using only rare variants in the gene-based tests and why the genomewide significant signal was recovered upon returning the common variants to the analysis. Thus, targeted sequencing of the linked regions, particularly on 7p and 1p, is a necessary future step in determining the causal variant(s). The expression databases helped to narrow down our candidate genes. However, these findings should be interpreted with caution because the samples in the transcriptome databases came from healthy human eye tissues, and their expression profiles may differ from those of diseased eyes. We also do not have a refractive error phenotype for the eyes used in these databases. Further, the tissues used in these databases were primarily from individuals of European ancestry, so any expression results must be evaluated within that context. We have identified a genomewide significant linkage of myopia to the 7p15.2-14.2 region in African Americans. This is the first significant myopia risk locus found in African Americans.
This finding supports a previous study that linked 7p15 with refractive error. We also identified several genomewide suggestive signals, including replication of the MYP14 locus at 1p36 for the first time in African Americans. We note that linkage analyses like this one identify large linked regions, and that this study aimed to identify good candidate genes/variants and to determine which families are most informative. We plan further whole genome sequencing of our most informative families, which will give us increased coverage of the genome, particularly of intronic regions. Deep coverage of these linked regions may elucidate the causal genes/variants.