| Literature DB >> 36249933 |
Mona Saad1, Marc Shebaby2, Cybel Mehawej3, Wissam Faour1.
Abstract
G-quadruplexes (G4s) are non-canonical DNA and RNA secondary structures that control gene regulation. A single nucleotide polymorphism (SNP) is a small genetic variation occurring within a DNA sequence and accounting for the variabilities between individuals. While the majority of SNPs, especially those frequent in the population, are considered as benign genetic variations, few others can lead to diseases. SNPs occurring in G4 sequences were reported to modulate gene regulation. In order to find overlaps between predicted G4 sequences and SNPs located in the genomic regions, we developed two complementary computational python codes (SNP-locator and G4-overlap). The codes map a mutation to the overlapping/closest G4 sequences, based on the genetic variant name and the FASTA format of the corresponding gene. We validated these two codes on a set of 31 SNP variants occurring in cytochromes P450 genes and podocytes-marker genes. Out of 31 SNPs, 28 were accurately located using the mentioned codes.•SNP-locator code locates any SNP in promoters, upstream regulatory regions, exons and introns.•The SNP-locator code requires the FASTA genomic sequence of the studied gene and the genetic variant nomenclature at the cDNA level.•G4-overlap code maps the SNP to the overlapping or the closest G4 sequence.Entities:
Keywords: G-quadruplexes (G4s); Overlapping G4s; Python; Single nucleotide polymorphisms (SNPs)
Year: 2022 PMID: 36249933 PMCID: PMC9563633 DOI: 10.1016/j.mex.2022.101875
Source DB: PubMed Journal: MethodsX ISSN: 2215-0161
Fig. AThe input and output of the SNP-locator code applied to the SNP variant c.506-1G>A located in intron 3 of CYP2D6 gene.
Fig. BThe text file including the results of the SNP-locator code for the studied genes that will be used by the G4-overlap code.
Fig. CThe G4 sequences predicted by the G4Hunter tool; the first occurrence of CYP2D6 in “seqnames” should be followed by “>”.
Fig. DThe inputs and outputs of the G4-overlap code for a group of SNP gene variants.
| Subject area: | Bioinformatics |
| More specific subject area: | |
| Name of your method: | |
| Name and reference of original method: | |
| Resource availability |