| Literature DB >> 29505555 |
Yong Liu1,2,3, Yu Cao1,2, Yaxiong Li1,3, Dongyun Lei4, Lin Li1,3, Zong Liu Hou1,3, Shen Han1,3, Mingyao Meng1, Jianlin Shi1, Yayong Zhang1,3, Yi Wang1,3, Zhaoyi Niu1, Yanhua Xie1, Benshan Xiao1, Yuanfei Wang1, Xiao Li1, Lirong Yang1, Wenju Wang1,3, Lihong Jiang2.
Abstract
BACKGROUND Recently, mutations in several genes have been described to be associated with sporadic ASD, but some genetic variants remain to be identified. The aim of this study was to use whole-exome sequencing (WES) combined with bioinformatics analysis to identify novel genetic variants in cases of sporadic congenital ASD, followed by validation by Sanger sequencing. MATERIAL AND METHODS Five Han patients with secundum ASD were recruited, and their tissue samples were analyzed by WES, followed by verification by Sanger sequencing of tissue and blood samples. Further evaluation using blood samples included 452 additional patients with sporadic secundum ASD (212 male and 240 female patients) and 519 healthy subjects (252 male and 267 female subjects) for further verification by a multiplexed MassARRAY system. Bioinformatic analyses were performed to identify novel genetic variants associated with sporadic ASD. RESULTS From five patients with sporadic ASD, a total of 181,762 genomic variants in 33 exon loci, validated by Sanger sequencing, were selected and underwent MassARRAY analysis in 452 patients with ASD and 519 healthy subjects. Three loci with high mutation frequencies, the 138665410 FOXL2 gene variant, the 23862952 MYH6 gene variant, and the 71098693 HYDIN gene variant were found to be significantly associated with sporadic ASD (P<0.05); variants in FOXL2 and MYH6 were found in patients with isolated, sporadic ASD (P<5×10^-4). CONCLUSIONS This was the first study that demonstrated variants in FOXL2 and HYDIN associated with sporadic ASD, and supported the use of WES and bioinformatics analysis to identify disease-associated mutations.Entities:
Mesh:
Substances:
Year: 2018 PMID: 29505555 PMCID: PMC5849354 DOI: 10.12659/msm.908923
Source DB: PubMed Journal: Med Sci Monit ISSN: 1234-1010
33 variants in 25 ASD related genes from WES data.
| Gene | The information of variants | The position of mutation site on a chromosome | The number of sample | Harmful prediction by SIFT/Ployphen-2/Mutation Taster | Bioinformatic methods | ||
|---|---|---|---|---|---|---|---|
| Venn analysis | GO analysis | Literature review | |||||
| TTN | NM_003319: exon178: c.C70564T: p.R23522C | 179406045 | ASD-5 | D/D/D | + | ||
| TTN | NM_003319: exon154: c.G47144A: p.R15715Q | 179436520 | ASD-4 | D/D/D | + | ||
| TTN | NM_003319: exon86: c.C21134G: p.T7045S | 179480499 | ASD-2 | D/B/N | + | ||
| TTN | NM_003319: exon186: c.C76340T: p.P25447L | 179397807 | ASD-3 | D/D/D | + | ||
| TTN | NM_003319: exon154: c.G48662T: p.G16221V | 179435002 | ASD-2 | D/D/D | + | + | |
| HYDIN | NM_001270974: exon47: c.G7930A: p.E2644K | 70952188 | ASD-1 | T/B/D | + | + | |
| HYDIN | NM_001198542: exon16: c.A2207C: p.H736P | 71098693 | ASD-2, ASD-3 | T/P/D | + | + | |
| IGSF3 | NM_001542: exon3: c.A619G: p.S207G | 117156600 | ASD-2, ASD-5 | T/B/D | + | ||
| IGSF3 | NM_001007237: exon10: c.G3300C: p.E1100D | 117122048 | ASD-4 | T/B/D | + | ||
| ZFPM | NM_153813: exon10: c.C2359T: p.P787S | 88600725 | ASD-2 | T/B/D | + | + | |
| ZFPM | NM_153813: exon10: c.G2524A: p.A842T | 88600890 | ASD-5 | D/P/N | + | + | |
| MYH6 | NM_002471: exon11: c.G985T: p.E329X | 23871923 | ASD-1 | D/./A | + | ||
| MYH6 | NM_002471: exon22: c.G2851T: p.E951X | 23862952 | ASD-3 | D/./A | + | + | |
| FMO5 | NM_001144830: exon4: c.T572A: p.I191N | 146684019 | ASD-1 | D/D/D | + | ||
| NSD1 | NM_022455: exon5: c.A2608G: p.R870G | 176638008 | ASD-1 | ./D/N | + | ||
| OBSCN | NM_001098623: exon27: c.A7301G: p.H2434R | 228467050 | ASD-1 | T/D/D | + | ||
| FOXL2 | NM_023067: exon1: c.C155G: p.A52G | 138665410 | ASD-1 | T/P/D | + | ||
| NUP188 | NM_015354: exon28: c.C3047T: p.P1016L | 131756681 | ASD-1 | D/D/D | + | ||
| SOX17 | NM_022454: c.A595T | 55371905 | ASD-1 | T/B/N | + | ||
| XPO1 | NM_003400: exon24: c.G2985C: p.K995N | 61708404 | ASD-1 | T/P/D | + | + | |
| KIAA0196 | NM_014846: exon18: c.A2186C: p.N729T | 126062819 | ASD-3 | T/D/D | + | ||
| EP300 | NM_001429: exon13: c.G2261A: p.R754H | 41545061 | ASD-3 | T/D/D | + | ||
| ZNF638 | NM_001014972: exon2: c.A254G: p.E85G | 71576338 | ASD-3 | D/D/D | + | ||
| BBS1 | NM_024649: exon11: c.G1067A: p.R356H | 66291310 | ASD-5 | T/B/D | + | ||
| ARL6 | NM_001278293: exon5: c.G283T: p.D95Y | 97503827 | ASD-5 | D/P/D | + | + | |
| ACVR1 | NM_001105: exon4: c.A275G: p.E92G | 158636905 | ASD-5 | T/P/D | + | ||
| VEGFA | NM_001025366: exon6: c.C997T: p.R333X | 43748503 | ASD-5 | T/./D | + | ||
| USP44 | NM_001042403: exon6: c.A2134C: p.S712R | 95911935 | ASD-3 | D/B/D | + | ||
| FGB | NM_001184741: exon3: c.A263C: p.Y88S | 155487774 | ASD-3 | T/B/N | + | ||
| BCOR | NM_001123384: exon13: c.T4728G: p.D1576E | 39913231 | ASD-4 | T/B/N | + | ||
| SGCD | NM_001128209: exon8: c.A845G: p.Q282R | 156186376 | ASD-4 | T/D/D | + | + | |
| BBS9 | NM_014451: exon19: c.A2389G: p.M797V | 33573776 | ASD-1 | T/B/D | + | ||
| FIBP | NM_004214: exon8: c.G850A: p.A284T | 65652097 | ASD-2 | T/B/D | + | + | |
According to SIFT, related gene was noted on tolerated (T, score >0.5) or deleterious (D, score <0.5). According to Ployphen-2, related gene was noted on probably damaging (D, Polyphen-2 ≥0.909), possibly damaging (P, 0.447 ≤Polyphen-2 <0.909), and benign (B, Polyphen-2 <0.447). According to Mutation Taster, related genes were noted on disease-causing automatic (A), disease-causing (D), polymorphism (N), and polymorphism automatic (P) ().
Figure 1Schematic representation of the study design and protocol. First, all variants were annotated using the ANNOVAR bioinformatics software. The normal population variant databases, including 1,000 Genomes Project, the Single Nucleotide Polymorphism database (DbSNP), and the National Heart, Lung, and Blood Institute (NHLBI) databases were used to exclude the common variants occurring with more than 1% minor allele frequency (MAF). Second, the rare variants obtained in the previous step were further analyzed using Venn analysis, gene ontology (GO) analysis, and literature review. Third, three function predictor scores, SIFT [12], Polymorphism Phenotyping v2 (PolyPhen-2) [13], and MutationTaster [14] were used to predict whether the substitution of amino acids affected the function of the protein, and 33 single nucleotide variants (SNVs) were selected and validated as positive mutations by Sanger sequencing. Finally, a multiplexed MassARRAY system was performed to verify the 33 variants and three mutations with high mutation frequency, which were found to be strongly associated with atrial septal defect (ASD) (P<0.05).
Figure 2Selected gene variants in patients with atrial septal defect (ASD) were validated by MassARRAY analysis. (A) The mutation frequency of the 138665410 variant in the FOXL2 gene. (B) The mutation frequency of the 23862952 variant in the MYH6 gene. (C) The mutation frequency of the 71098693 variant in the HYDIN gene. Shown in patients with ASD (left) and healthy subjects (right) in a scatter chart. These loci with high mutation frequency were found to be strongly associated with ASD (P<0.05). Among these loci, variants in FOXL2 and MYH6 were only found in patients with ASD (P<5×10−4). * P<0.05, **** P<0.0005.
Figure 3Further selected gene variants in patients with atrial septal defect (ASD) were validated by MassARRAY analysis. (A) The mutation frequency of the 61708404 variant in the XPO1 gene. (B) The mutation frequency of the 39913231 variant in the BCOR gene. (C) The mutation frequency of the 65652097 variant in the FIBP gene. (D) The mutation frequency of the 88600890 variant in the ZFPM gene. (E) The mutation frequency of the 156186376 variant in the SGCD gene. (F) The mutation frequency of the 179480499 variant in the TTN gene. Shown in patients with atrial septal defect (ASD) (left) and healthy subjects (right) in a scatter chart. There was no significant difference in these six loci between patients with ASD and healthy subjects (P>0.05). (NS, P>0.05).
Characteristics of 5 ASD patients.
| Number of sample | Gender | Age | Diagnosis and diameter |
|---|---|---|---|
| ASD-1 | Female | 6 months | Secundum ASD, 22 mm |
| ASD-2 | Male | 13 months | Secundum ASD, 10 mm |
| ASD-3 | Female | 19 months | Secundum ASD, 16 mm |
| ASD-4 | Female | 23 months | Secundum ASD, 12 mm |
| ASD-5 | Male | 17 months | Secundum ASD, 11 mm |
Sequences of the primers involved in Mass-Array.
| Variants under test | Forward (5′-3′) | Reverse (3′-5′) | Product (bp) |
|---|---|---|---|
| 23871923 | AGCTTGTAGACGCCAGCTTT | CTGGGTTCACTCCTTGGTCC | 433 |
| 33573776 | ACAGCCAGCTGAACATACCC | GGGATGTCAGAGGGGAAACG | 281 |
| 55371905 | CGCAAGCAGGTGAA | CCAGCGTAGTCCGAGA | 359 |
| 61708404 | TATCCGAATAACATCTC | GAATGTCTTGAACCC | 450 |
| 70952188 | TGGGGACTGCAAAGGTTGTT | ATCGTCTCCTACCCGGTGAA | 420 |
| 131756681 | ATGAAGTGACCACCAAA | GAGAAGACCACAGAGCC | 390 |
| 138665410 | GGTCCAGCGTCCAGTAGTTGC | GCACAGTCAAGGAGCCAGAAGG | 330 |
| 146684019 | TCTATTAGCACCAATCACC | GAAGTTCAAAGGGCAGTA | 315 |
| 176638008 | AAAACCAGGGATTCAAGT | CAGTAAGCCAGGTAGGGA | 465 |
| 228467050 | ATGGTTTGCCCTACACTCCT | TGTCCTCTTGTGGCTCTGC | 89 |
| 65652097 | GACCAGGTTGGGTGTCAGAG | GGGACTTGGTTGATGGAGGG | 248 |
| 88600725 | CCACGAGACCTACACCG | CCAGGCGGAAATGCT | 545 |
| 117156600 | GACAAGAGGCTCAGAGGG | CTGTGGCTTTGTGACTGC | 479 |
| 179435002 | AATGCCTTCTTTATCCCG | AACCAGCCGCTTAGTTTG | 310 |
| 179480499 | CCCATAACTTCAACTCTA | ACACCTCATTTGCCATCT | 461 |
| 23862952 | GCTGCTCCAGCTTGACCTTA | TCTCCTCCCCTCCCCTAGAT | 686 |
| 41545061 | CCTCTTCAGCCCTCTTCACC | GAGACACTGGAGCTTGACCG | 281 |
| 71098693 | GAACTCACCTGAGGCTGGAC | GTATCTCGCTTGGGGCTCTG | 224 |
| 71576338 | ACTTCAAAGGCCACGAGCAC | ATTAGGTAATCGGCGCCCCA | 526 |
| 95911935 | ACAAGTTCATCATCCGAGCC | CAGGGTAACACTGCCCAATCT | 631 |
| 126062819 | TGAAATGGGATCAATAGG | CAAAGCAGTTGCTGGAAG | 369 |
| 155487774 | CCCAAATCCTTCATCTAA | CAGTCTAACGGTTCCAAT | 286 |
| 179397807 | TACTTGCGTGGCTCTGGT | AAGGCTTGGATTATTATGCTCT | 403 |
| 39913231 | TCACCCCTGAGCCACAGATA | GAGGGAAAGTGGATGGTGGG | 303 |
| 117122048 | TTGCATGGGTAATCAAGGGTCA | GCTGGACTGTAGCATCGTGT | 610 |
| 156186376 | ACGCCTACAGGAACGAGG | GGAATGTTGGGACGGATG | 450 |
| 179436520 | GAAGTGTCCCGTTTCTCA | CAGCCTTCAAACTCCTGT | 409 |
| 43748503 | TGGCTTTGCTTTGGTCGTTC | AAACCAGTTGGGTGAGCAGG | 520 |
| 66291310 | AGCACCCCAAGTACTGCATC | CGGGGTGTGGATGACATTGA | 419 |
| 88600890 | CGACGGCCCCATCGACCTGA | CGTGCCCTTGCTGGGAGTCTGG | 520 |
| 97503827 | TCGGTGTAATAGGGTTATGGTATT | TTCTTCTTTGGCCACAACCATT | 399 |
| 158636905 | ACACGGACCCAGGACAAC | TTCCCTCAATGAAGGTGAAACT | 403 |
| 179406045 | TTTGATTGTGGTGGTGAT | CTCCCAAGTGACTGGATAT | 341 |
Quality of data obtained by WES.
| Sample name | Raw reads | Raw data (G) | Effective (%) | Error (%) | Q20 (%) | Q30 (%) |
|---|---|---|---|---|---|---|
| ASD-1 | 28464122 | 7.12 | 95.73 | 0.03; 0.04 | 95.76; 93.25 | 91.57; 87.61 |
| ASD-2 | 18423348 | 4.61 | 95.92 | 0.03; 0.04 | 95.79; 93.10 | 91.62; 87.36 |
| ASD-3 | 22964836 | 5.74 | 95.87 | 0.03; 0.04 | 95.81; 93.28 | 91.67; 87.70 |
| ASD-4 | 17576512 | 4.39 | 95.78 | 0.03; 0.04 | 95.72; 92.50 | 91.48; 86.34 |
| ASD-5 | 18637077 | 4.66 | 95.88 | 0.04; 0.04 | 95.12; 93.79 | 90.10; 87.89 |
Quantity of variants before and after filtering by 1000Genome, dbSNP, and NHLBI.
| Number of sample | Quantity of variants before filtering | Heterozygous variants | Homozygous variants | Quantity of variants after filtering |
|---|---|---|---|---|
| ASD-1 | 36774 | 21711 | 15063 | 159 |
| ASD-2 | 36031 | 20896 | 15135 | 142 |
| ASD-3 | 36443 | 21416 | 15027 | 143 |
| ASD-4 | 36240 | 21528 | 14712 | 133 |
| ASD-5 | 36274 | 21477 | 14797 | 136 |
| Total | 181762 | 107028 | 74734 | 713 |
13 variants from WES data of 5 ASD patients by Venn analysis.
| Gene | The information of variants | The position of mutation site on a chromosome | The number of sample | Harmful prediction by SIFT/Ployphen2/Mutation Taster |
|---|---|---|---|---|
| TTN | NM_003319: c.C70564T | 179406045 | ASD-5 | D/D/D |
| TTN | NM_003319: c.G47144A | 179436520 | ASD-4 | D/D/D |
| TTN | NM_003319: c.C21134G | 179480499 | ASD-2 | D/B/N |
| TTN | NM_003319: c.G48662T | 179435002 | ASD-2 | D/D/D |
| TTN | NM_003319: c.C76304T | 179397807 | ASD-3 | D/D/D |
| HYDIN | NM_001270974: c.G7930A | 70952188 | ASD-1 | T/B/D |
| HYDIN | NM_001198542: c.A2207C | 71098693 | ASD-2, ASD-3 | T/P/D |
| IGSF3 | NM_001542: c.A619G | 117156600 | ASD-2, ASD-5 | T/B/D |
| IGSF3 | NM_001007237: c: G3300C | 117122048 | ASD-4 | T/B/D |
| ZFPM | NM_153813: c.C2359T | 88600725 | ASD-2 | T/B/D |
| ZFPM | NM_153813: c.G2524A | 88600890 | ASD-5 | D/P/N |
| MYH6 | NM_002471: c.G985T | 23871923 | ASD-1 | D/./A |
| MYH6 | NM_002471: c.G2851T | 23862952 | ASD-3 | D/./A |
According to SIFT, related gene was noted on tolerated (T, score >0.5) or deleterious (D, score <0.5). According to PolyPhen-2, related gene was noted on probably damaging (D, Polyphen-2 ≥0.909), possibly damaging (P, 0.447 ≤Polyphen-2 <0.909), and benign (B, Polyphen-2 <0.447). According to MutationTaster, related genes were noted on disease-causing automatic (A), disease-causing (D), polymorphism (N), and polymorphism automatic (P) ().
21 variants associated with cardiac development or CHD obtained by Gene Ontology analysis.
| Gene | The information of variants | The position of mutation site on a chromosome | The number of sample | Harmful prediction by SIFT/Ployphen2/Mutation Taster |
|---|---|---|---|---|
| BMP4 | NM_001202: c.C125T | 54418816 | ASD-5 | T/P/D |
| TTN | NM_003319: c.G48662T | 179435002 | ASD-2 | D/D/D |
| DSG4 | NM_001134453: c.G790A | 28971146 | ASD-5 | D/D/D |
| XPO1 | NM_003400: c.G2985C | 61708404 | ASD-1 | T/P/D |
| TGFBR3 | NM_001195683: c.G1763A | 92181893 | ASD-5 | T/B/N |
| HYDIN | NM_001270974: c.G7930A | 70952188 | ASD-1 | TTN |
| HYDIN | NM_001270974: c.G7930A | 70952188 | ASD-1 | T/B/D |
| TENM4 | NM_001098816: c.G8189C | 78369224 | ASD-1 | ./B/N |
| SGCD | NM_001128209: c.A845G | 156186376 | ASD-4 | T/D/D |
| MYH6 | NM_002471: c.G2851T | 23862952 | ASD-3 | D/./A |
| SMARCA4 | NM_001128845: c.A602T | 11097111 | ASD-5 | T/P/D |
| ZFPM | NM_153813: c.C2359T | 88600725 | ASD-2 | T/B/D |
| ZFPM | NM_153813: c.G2524A | 88600890 | ASD-5 | D/P/N |
| FOXL2 | NM_023067: c.C155G | 138665410 | ASD-1 | T/P/D |
| DNAH7 | NM_018897: c.T6053C | 196741332 | ASD-1 | T/D/D |
| DNAH17 | NM_173628: c.C12828A | 76422625 | ASD-1 | D/P/D |
| FIBP | NM_004214: c.G850A | 65652097 | ASD-2 | T/B/D |
| ATP2B2 | NM_001001331: c.A182G | 10491046 | ASD-3 | T/D/D |
| ZNF638 | NM_001014972: c.A254G | 71576338 | ASD-3 | D/D/D |
| ALMS1 | NM_015120: c.A6808G | 73680465 | ASD-3 | D/D/N |
| MCHR1 | NM_005297: c.G671A | 41077334 | ASD-3 | D/D/D |
According to SIFT, related gene was noted on tolerated (T, score >0.5) or deleterious (D, score <0.5). According to PolyPhen-2, related gene was noted on probably damaging (D, Polyphen-2 ≥0.909), possibly damaging (P, 0.447 ≤Polyphen-2 <0.909), and benign (B, Polyphen-2 <0.447). According to MutationTaster, related genes were noted on disease-causing automatic (A), disease-causing (D), polymorphism (N), and polymorphism automatic (P) ().
20 variants associated with cardiac development or CHD obtained by literature review and protein database.
| Gene | The information of variants | The position of mutation site on a chromosome | The number of sample | Harmful prediction by SIFT/Ployphen2/Mutation Taster |
|---|---|---|---|---|
| ACVR1 | NM_001105: c.A275G | 158636905 | ASD-5 | T/P/D |
| ARL6 | NM_001278293: c.G283T | 97503827 | ASD-5 | D/P/D |
| BBS1 | NM_024649: c.G1067A | 66291310 | ASD-5 | T/B/D |
| VEGFA | NM_001025366: c.C997T | 43748503 | ASD-5 | T/./D |
| BBS9 | NM_014451: c.A2389G | 33573776 | ASD-1 | T/B/D |
| FMO5 | NM_001144830: c.T572A | 146684019 | ASD-1 | D/D/D |
| FOXL2 | NM_023067: c.C155G | 138665410 | ASD-1 | T/P/D |
| NSD1 | NM_022455: c.A2608G | 176638008 | ASD-1 | ./D/N |
| NUP188 | NM_015354: c.C3047T | 131756681 | ASD-1 | D/D/D |
| OBSCN | NM_001098623: c.A7301G | 228467050 | ASD-1 | T/D/D |
| SOX17 | NM_022454: c.A595T | 55371905 | ASD-1 | T/B/N |
| XPO1 | NM_003400: c.G2985C | 61708404 | ASD-1 | T/P/D |
| BCOR | NM_001123384: c.T4728G | 39913231 | ASD-4 | T/B/N |
| SGCD | NM_001128209: c.A845G | 156186376 | ASD-4 | T/D/D |
| EP300 | NM_001429: c.G2261A | 41545061 | ASD-3 | T/D/D |
| FGB | NM_001184741: c.A263C | 155487774 | ASD-3 | T/B/N |
| KIAA0196 | NM_014846: c.A2186C | 126062819 | ASD-3 | T/D/D |
| USP44 | NM_001042403: c.A2134C | 95911935 | ASD-3 | D/B/D |
| ZNF638 | NM_001014972: c.A254G | 71576338 | ASD-3 | D/D/D |
| FIBP | NM_004214: c.G850A | 65652097 | ASD-2 | T/B/D |
According to SIFT, related gene was noted on tolerated (T, score >0.5) or deleterious (D, score <0.5). According to PolyPhen-2, related gene was noted on probably damaging (D, Polyphen-2 ≥0.909), possibly damaging (P, 0.447 ≤Polyphen-2 <0.909), and benign (B, Polyphen-2 <0.447). According to MutationTaster, related genes were noted on disease-causing automatic (A), disease-causing (D), polymorphism (N), and polymorphism automatic (P) ().