| Literature DB >> 34101725 |
Efres Belmonte-Reche1, Israel Serrano-Chacón2, Carlos Gonzalez2, Juan Gallo1, Manuel Bañobre-López1.
Abstract
Quadruplex structures have been identified in a plethora of organisms where they play important functions in the regulation of molecular processes, and hence have been proposed as therapeutic targets for many diseases. In this paper we report the extensive bioinformatic analysis of theEntities:
Mesh:
Substances:
Year: 2021 PMID: 34101725 PMCID: PMC8186786 DOI: 10.1371/journal.pone.0250654
Source DB: PubMed Journal: PLoS One ISSN: 1932-6203 Impact factor: 3.240
Fig 1A. Results with G4-iM Grinder depend on the quadruplex definitions introduced to the algorithm. Sizes of G- or C-runs, loops and the entire sequence, together with an acceptable number of bulges within the runs are part of the definitions. B. The structures found with GiG under the definitions proposed by the user can be evaluated for their in vitro probability of formation. More positive scores mean that the sequence is more capable of forming G4s, whilst more negative values mean that it is more capable of forming iMs. C. Left, Quadruplex definitions used by GiG’s search engine in this work. C. Right, Total results found within the SARS-CoV-2 by configuration and score criteria. D. PQS and PiMS densities (per 100000 nucleotides) found per different configuration and score criteria for 19 viruses. The G and C content (as a percentage) is shown under each virus. X scale is in logarithmic scale (base 10). Results are categorized by their |score|: intense colours (blue for PQS, yellow for PiMS) are the most probable to form in vitro (|score| ≥ 40), lighter bars are the density of structures with at least a |score| ≥ 20 and grey bars are the densities without the score filter.
Fig 2A. Top. Percentage of conservation of each PQS found along the genome of the SARS-CoV-2. Each point represents one PQS. The PQS score is given by the fill colour of the points, where lower |scores| are greyer, and bluer points have higher |scores|. Bottom. PQS count density plot related to the genome position (counts per 200 nucleotides). Grey coloured density plots are all the results found, whilst blue density plots are the results found with at least a |score| ≥ 20. B. Distribution of the biological features of the SARS-CoV-2 by its genomic position. UTR regions are in red, CDS and genes region are in green, and nps of the orf1ab gene are in purple. Orange dots are mature protein regions of the CDS. C, Top. PiMS count density plot related to the genome position (counts per 200 nucleotides). Grey coloured density plots are all the results found, whilst yellow density plots are the results found with at least a |score| ≥ 20. Bottom. Percentage of conservation of each PiMS found along the genome of the SARS-CoV-2. Each point represents one PiMS. The PiMS score is given by the fill colour of the points, where lower |scores| are greyer, and higher |scores| are more yellow. D. Top scoring PQS (Score ≥ 30, entry 1 to 7) and PiMS (Score ≤ -30, entry 8 to 17) found in the SARS-CoV-2 ordered by their localization in the genome. G-runs are in blue, C-runs are in yellow, loops are in red and bulges within the runs are in green. For each entry, the biological feature column lists the genomic landmark that hosts the potential quadruplex. The percentage of conservation is also given.
Fig 3A. Sequences found in the SARS-CoV-2 reference genome (those with a starting position) and some of the variants identified in specific lineages for four high scoring candidates. Mutations are underlined. B. Centre, SARS-CoV-2 phylogenetic tree by clade and lineage of the sequences analysed. Lineages with less than 100 genomes were grouped (suffix x). Inner segment, Lineage: Mean PQS count (A), Mean PQS count with |score| ≥ 20 (B) and PQS divergence from the reference genome (C). Centre segment, Mean lineage percentage sequence identity with the reference genome (dots) compared to the overall mean found for the 17312 sequences analysed (black line). Outer segment, Clade: Mean PQS count (A), Mean PQS count with |score| ≥ 20 (B) and PQS divergence from the reference genome (C). R-packages used: ggtree [66] and circlize [67].
Fig 4A. The candidates examined in vitro through biophysical assays. The in vitro column states if the sequence forms a quadruplex (Y for Yes, N for No). B, NMR spectra of the two RNA-G4s analyzed at different temperatures (pH 7.0). C, NMR spectra of the DNA-iM analyzed at different temperatures (pH 5.3). D, CD analysis of the two RNA-G4 analyzed.