| Literature DB >> 21695249 |
Andreas Keller1, Christian Harz, Mark Matzas, Benjamin Meder, Hugo A Katus, Nicole Ludwig, Ulrike Fischer, Eckart Meese.
Abstract
High-throughput sequencing opens avenues to find genetic variations that may be indicative of an increased risk for certain diseases. Linking these genomic data to other "omics" approaches bears the potential to deepen our understanding of pathogenic processes at the molecular level. To detect novel single nucleotide polymorphisms (SNPs) for glioblastoma multiforme (GBM), we used a combination of specific target selection and next generation sequencing (NGS). We generated a microarray covering the exonic regions of 132 GBM associated genes to enrich target sequences in two GBM tissues and corresponding leukocytes of the patients. Enriched target genes were sequenced with Illumina and the resulting reads were mapped to the human genome. With this approach we identified over 6000 SNPs, including over 1300 SNPs located in the targeted genes. Integrating the genome-wide association study (GWAS) catalog and known disease associated SNPs, we found that several of the detected SNPs were previously associated with smoking behavior, body mass index, breast cancer and high-grade glioma. Particularly, the breast cancer associated allele of rs660118 SNP in the gene SART1 showed a near doubled frequency in glioblastoma patients, as verified in an independent control cohort by Sanger sequencing. In addition, we identified SNPs in 20 of 21 GBM associated antigens providing further evidence that genetic variations are significantly associated with the immunogenicity of antigens.Entities:
Mesh:
Substances:
Year: 2011 PMID: 21695249 PMCID: PMC3112142 DOI: 10.1371/journal.pone.0018158
Source DB: PubMed Journal: PLoS One ISSN: 1932-6203 Impact factor: 3.240
Figure 1Coverage plot for one exonic part of SART1 for the four control samples.
X-axis shows the genomic coordinate inside the exon, y-axis denotes the number of reads covering this position and the four lines correspond to the four genomic samples.
Overall number of detected SNPs and SNPs detected in target genes.
| Sample ID | DNA type | No. of Overall SNPs [thereof known] | No. of SNPs in target genes [thereof known] |
| patient A | leuko | 2753 [444] | 581 [197] |
| patient B | leuko | 1527 [235] | 272 [112] |
| patient A | tumor | 2346 [613] | 415 [265] |
| patient B | tumor | 2356 [420] | 516 [239] |
Number of known SNPs in brackets.
Figure 2Venn diagram of detected SNPs in the 132 targeted genes in the four samples.
Distribution of detected SNPs among the samples.
| sample 1 | sample 2 | only in sample 1 [n] | only in sample 2 [n] | in both samples [n] | Overlap [%] |
| patient A leuko | patient A tumor | 1924 | 1517 | 829 | 19.4 |
| patient B leuko | patient B tumor | 729 | 1558 | 798 | 25.9 |
| patient A leuko | patient B leuko | 2086 | 860 | 667 | 18.5 |
| patient A tumor | patient B tumor | 1654 | 1664 | 692 | 17.3 |
| patient A leuko | patient B tumor | 2044 | 1647 | 709 | 16.1 |
| patient B leuko | patient A tumor | 920 | 1739 | 607 | 18.6 |
SNPs with different genotype in tumor and leukocytes of the same patient.
| patient | gene | position | chromosome | reference | sample | coverage | A | C | G | T | genotype | dbSNP | Hgvs |
| A | B4GALNT1 | 56310248 | 12 | C | Leu | 154 | 30;70 | 0;54 | 0;0 | 0;0 | AC | rs715930 | c.[666C>A]+[ = ] |
| Tu | 123 | 61;48 | 7;6 | 0;0 | 0;0 | AA | rs715930 | c.[666C>A]+[666C>A] | |||||
| LOC643266 | 47216571 | 22 | G | Leu | 96 | 27;0 | 0;0 | 69;0 | 0;0 | GA | rs12172263 | c.[820+466G>A]+[ = ] | |
| Tu | 115 | 115;0 | 0;0 | 0;0 | 0;0 | AA | rs12172263 | c.[820+466G>A]+[820+466G>A] | |||||
| LOC644033 | 4412 | 8 | G | Leu | 127 | 0;0 | 0;0 | 21;24 | 42;40 | TG | na | c.[26–95G>T]+[ = ] | |
| Tu | 74 | 0;0 | 0;0 | 7;4 | 31;32 | TT | na | c.[26–95G>T]+[26–95G>T] | |||||
| PARP1 | 224639987 | 1 | A | Leu | 102 | 23;24 | 0;0 | 0;55 | 0;0 | GA | rs1805414 | c.[852A>G]+[ = ] | |
| Tu | 61 | 4;4 | 0;0 | 11;42 | 0;0 | GG | rs1805414 | c.[852A>G]+[852A>G] | |||||
| B | CTDSP2 | 56503706 | 12 | C | Leu | 118 | 0;0 | 51;16 | 0;0 | 11;40 | CT | na | c.[762C>T]+[ = ] |
| Tu | 3580 | 3;0 | 64;82 | 4;1 | 1235;2190 | TT | na | c.[762C>T]+[762C>T] | |||||
| PAXIP1 | 154391302 | 7 | A | Leu | 114 | 13;16 | 0;0 | 26;59 | 0;0 | GA | rs935037 | c.[1440A>G]+[ = ] | |
| Tu | 286 | 0;0 | 0;0 | 64;222 | 0;0 | GG | rs935037 | c.[1440A>G]+[1440A>G] | |||||
| MDM1 | 67006894 | 12 | G | Leu | 52 | 27;10 | 0;0 | 8;7 | 0;0 | AG | rs962976 | c.[308G>A]+[ = ] | |
| Tu | 99 | 74;8 | 0;0 | 13;4 | 0;0 | AA | rs962976 | c.[308G>A]+[308G>A] |
*HGVC = Human Genome Variation Society.
Supporting Information Legends.
Figure 3Identification, verification and distribution of the SART1 SNP rs660118.
A. Next generation sequencing reads of one sample for the SNP rs660118. The blue box at top shows the base and amino acid change. B. Sanger traces of the four genomic samples showing a homozygous allele of the SNP in the middle position. C. Sequence logo for the part surrounding the SNP computed from Sanger sequencing of the initial samples and the independent cohort. At the bottom, the human reference sequence is presented. Glioblastoma patients show higher counts for the allele ‘C’ as compared to unaffected controls.