| Literature DB >> 35419164 |
Hafiz Ishfaq Ahmad1, Gulnaz Afzal2, Sehrish Sadia3, Ghulam Haider3, Shakeel Ahmed4, Saba Saeed5, Jinping Chen6.
Abstract
Oxidative stress is a type of stress that damages DNA and can occur from both endogenous and exogenous sources. Damage to DNA caused by oxidative stress can result in base modifications that promote replication errors and the formation of sites of base loss, which pose unique challenges to the preservation of genomic integrity. However, the adaptive evolution of the DNA repair mechanism is poorly understood in vertebrates. This research aimed to explore the evolutionary relationships, physicochemical characteristics, and comparative genomic analysis of the Nei-like glycosylase gene family involved in DNA base repair in the vertebrates. The genomic sequences of NEIL1, NEIL2, and NEIL3 genes were aligned to observe selection constraints in the genes, which were relatively low conserved across vertebrate species. The positive selection signals were identified in these genes across the vertebrate lineages. We identified that only about 2.7% of codons in these genes were subjected to positive selection. We also revealed that positive selection pressure was increased in the Fapy-DNA-glyco and H2TH domain, which are involved in the base excision repair of DNA that has been damaged by oxidative stress. Gene structure, motif, and conserved domain analysis indicated that the Nei-like glycosylase genes in mammals and avians are evolutionarily low conserved compared to other glycosylase genes in other "vertebrates" species. This study revealed that adaptive selection played a critical role in the evolution of Nei-like glycosylase in vertebrate species. Systematic comparative genome analyses will give key insights to elucidate the links between DNA repair and the development of lifespan in various organisms as more diverse vertebrate genome sequences become accessible.Entities:
Mesh:
Substances:
Year: 2022 PMID: 35419164 PMCID: PMC9001079 DOI: 10.1155/2022/1144387
Source DB: PubMed Journal: Oxid Med Cell Longev ISSN: 1942-0994 Impact factor: 6.543
Results of positive selection tests for NEIL genes.
| Gene | Models | lnL | LRT | PAML | SLAC | FUBAR | MEME |
|---|---|---|---|---|---|---|---|
| NEIL1 | M1a | -11259.31 | 0 | 89,90,107,117,143,367,405,471 | 67, 107, 143, 405, 408, 444 | 67, 117, 143, 251, 367, 471 | 90, 117, 143, 405, 471 |
| M2a | -11259.31 | ||||||
| M7 | -11180.39 | 6.54 | |||||
| M8 | -11177.12 | ||||||
| NEIL2 | M1a | -7282.58 | 1.48 | 65, 67, 84, 94, 103, 105, 116, 191, 256, 301, 329 | 103, 191, 236, 329 | 65, 103, 116, 154, 191, 301, 329 | 103, 105, 191, 256, 329 |
| M2a | -7281.84 | ||||||
| M7 | -7273.27 | 9.16 | |||||
| M8 | -7268.69 | ||||||
| NEIL3 | M1a | -17093.12 | 0 | 36, 46, 51, 56, 62, 89, 108, 113, 185, 258, 309, 336, 437 | 46, 62, 108, 113, 185, 309 | 36, 51, 62, 89, 113, 185, 309, 437 | 51, 62, 113, 185, 309, 437 |
| M2a | -17093.12 | ||||||
| M7 | -16962.42 | 0.48 | |||||
| M8 | -16962.18 |
Figure 1Illustration of protein domain structures using DOG 1.0 illustrator. The phosphorylation sites of the NEIL1 protein are shown in the molecular structure and conserved domain analyses. Positively selected amino acid sites were found in conserved domains, especially the H2TH domain. Positively identified locations were drawn into the 3D structure using human NEIL1 as a reference. Selecton analyses of human NEIL1 are color-coded and compared to sequences from aligned nucleotide coding sequences. Yellow and brown highlights represent positive selection, the neutral selection is represented by grey and white highlights, and purple highlights on codons represent purifying selection.
Figure 2Illustration of protein domain structures using DOG 1.0 illustrator. The phosphorylation and ubiquitylation sites of the NEIL2 protein are shown in the molecular structure and conserved domain analyses. Positively selected amino acid sites were found in conserved domains, especially in the H2TH domain. Positively identified locations were drawn into the 3D structure using human NEIL2 as a reference. Selecton analyses of human NEIL2 are color-coded and compared to sequences from aligned nucleotide coding sequences. Yellow and brown highlights represent positive selection, the neutral selection is represented by grey and white highlights, and purple highlights on codons represent purifying selection.
Figure 3Illustration of protein domain structures using DOG 1.0 illustrator. The phosphorylation sites of the NEIL3 protein are shown in the molecular structure and conserved domain analyses. Positively selected amino acid sites were found in conserved domains, especially the H2TH domain. Positively identified locations were drawn into the 3D structure using human NEIL3 as a reference. Selecton analyses of human NEIL3 are color-coded and compared to sequences from aligned nucleotide coding sequences. Yellow and brown highlights represent positive selection, the neutral selection is represented by grey and white highlights, and purple highlights on codons represent purifying selection.
Figure 4(a) Maximum likelihood estimations of dN/dS at each site of NEIL1, together with estimated profile confidence intervals. The dN/dS = 1 (neutrality) is depicted as a horizontal gray line. Boundaries between partitions (if present) are shown as vertical dashed lines. Statistical significance is evaluated based on the asymptotic χ2 distribution. This analysis includes site to site synonymous rate variation. Profile approximate confidence intervals for site-level dN/dS ratios have been computed. (b) Maximum likelihood estimations of dN/dS at each site of NEIL2, together with estimated profile confidence intervals. The dN/dS = 1 (neutrality) is depicted as a horizontal gray line. Boundaries between partitions (if present) are shown as vertical dashed lines. Statistical significance is evaluated based on the asymptotic χ2 distribution. This analysis includes site to site synonymous rate variation. Profile approximate confidence intervals for site-level dN/dS ratios have been computed. (c) Maximum likelihood estimations of dN/dS at each site of NEIL3, together with estimated profile confidence intervals. The dN/dS = 1 (neutrality) is depicted as a horizontal gray line. Boundaries between partitions (if present) are shown as vertical dashed lines. Statistical significance is evaluated based on the asymptotic χ2 distribution. This analysis includes site to site synonymous rate variation. Profile approximate confidence intervals for site-level dN/dS ratios have been computed.
Detailed site-by-site results from the FEL analysis. Statistical significance is evaluated based on the asymptotic χ2 distribution. This analysis includes site to site synonymous rate variation. Profile approximate confidence intervals for site-level dN/dS ratios have been computed.
| Gene | Codons |
|
|
| LRT |
| Total branch length | dN/dS LB | dN/dS MLE | dN/dS UB | Selection type |
|---|---|---|---|---|---|---|---|---|---|---|---|
| NEIL1 | 114 | 0.00 | 1.519 | 34.71 | 2.169 | 0.1408 | 296.96 | 3,827.26 | 10,00 | 10,000 | Diversifying |
| 143 | 0.00 | 0.538 | 0.405 | 3.391 | 0.0655 | 3.464 | 7,865.16 | 10,00 | 10,000 | Diversifying | |
| 268 | 0.00 | 1.002 | 0.612 | 1.897 | 0.1684 | 5.234 | 3,827.16 | 10,00 | 10,000 | Diversifying | |
| 408 | 0.00 | 1.123 | 0.713 | 4.603 | 0.0319 | 6.100 | 8,626.24 | 10,00 | 10,000 | Diversifying | |
| 419 | 0.00 | 0.495 | 0.321 | 1.662 | 0.1973 | 2.746 | 3,826.84 | 10,00 | 10,000 | Diversifying | |
| 448 | 0.22 | 0.885 | 0.551 | 1.873 | 0.1711 | 4.711 | 1.347 | 4.058 | 13.044 | Diversifying | |
| 455 | 0.24 | 1.397 | 0.855 | 2.04 | 0.1533 | 7.314 | 2.417 | 5.894 | 15.896 | Diversifying | |
| NEIL2 | 77 | 0.00 | 0.682 | 0.404 | 4.036 | 0.0445 | 1.437 | 6,186.34 | 10,00 | 10,000 | Diversifying |
| 78 | 0.00 | 2.266 | 1.678 | 4.706 | 0.0301 | 5.968 | 8,078.11 | 10,00 | 10,000 | Diversifying | |
| 87 | 0.70 | 3.768 | 2.879 | 3.02 | 0.0822 | 10.24 | 2.525 | 5.348 | 11.869 | Diversifying | |
| 131 | 0.00 | 1.33 | 0.851 | 5.615 | 0.0178 | 3.027 | 7,599.97 | 10,00 | 10,000 | Diversifying | |
| 195 | 0.00 | 0.871 | 0.605 | 3.42 | 0.0644 | 2.153 | 6,810.13 | 10,00 | 10,000 | Diversifying | |
| 257 | 0.00 | 0.857 | 0.558 | 4.19 | 0.0407 | 1.984 | 6,810.14 | 10,00 | 10,000 | Diversifying | |
| 265 | 0.00 | 0.648 | 0.415 | 3.472 | 0.0624 | 1.477 | 6,186.32 | 10,00 | 10,000 | Diversifying | |
| NEIL3 | 32 | 0.00 | 1.281 | 10000 | 1.093 | 0.2957 | 85553.36 | 3,827.32 | 10,00 | 10,000 | Diversifying |
| 55 | 0.00 | 0.519 | 0.406 | 1.487 | 0.2227 | 3.469 | 7,599.63 | 10,00 | 10,000 | Diversifying | |
| 114 | 0.00 | 1.519 | 55.08 | 2.169 | 0.1408 | 471.262 | 3,827.26 | 10,00 | 10,000 | Diversifying | |
| 143 | 0.00 | 0.538 | 0.405 | 3.391 | 0.0655 | 3.464 | 7,865.16 | 10,00 | 10,000 | Diversifying | |
| 268 | 0.00 | 1.002 | 0.612 | 1.897 | 0.1684 | 5.234 | 3,827.16 | 10,00 | 10,000 | Diversifying | |
| 271 | 0.00 | 0.694 | 0.281 | 1.615 | 0.2038 | 2.407 | 1,464.80 | 10,00 | 10,000 | Diversifying | |
| 408 | 0.00 | 1.123 | 0.713 | 4.603 | 0.0319 | 6.100 | 8,626.24 | 10,00 | 10,000 | Diversifying | |
| 419 | 0.00 | 0.495 | 0.321 | 1.662 | 0.1973 | 2.746 | 3,826.84 | 10,00 | 10,000 | Diversifying | |
| 444 | 0.74 | 4.696 | 3.046 | 1.594 | 0.2067 | 26.059 | 2.105 | 6.370 | 47.341 | Diversifying | |
| 448 | 0.22 | 0.885 | 0.551 | 1.873 | 0.1711 | 4.711 | 1.347 | 4.058 | 13.044 | Diversifying | |
| 455 | 0.24 | 1.397 | 0.855 | 2.04 | 0.1533 | 7.314 | 2.417 | 5.894 | 15.896 | Diversifying | |
| 472 | 0.57 | 2.486 | 1.641 | 1.205 | 0.2724 | 14.039 | 1.59 | 4.361 | 13.636 | Diversifying |
α: synonymous substitution rate at a site; β: nonsynonymous substitution rate at a site; α = β: the rate estimate under the neutral model; LRT: likelihood ratio test statistic for beta = alpha, versus beta &neq; alpha; P value: asymptotic P value for evidence of selection, i.e., beta &neq; alpha; total branch length: the total length of branches contributing to inference at site used to scale dN-dS; dN/dS LB: 95% profile likelihood CI lower bound for dN/dS; dN/dS MLE: point estimate for site dN/dS; dN/dS UB: 95% profile likelihood CI upper bound for dN/dS.
Figure 5The common target genes of DNA repair pathways. GO annotation and KEGG were used to analyze these target genes.
Go terms and clusters with enriched keywords (one per cluster). This is the number of genes in the user-supplied lists that belong to the ontology term.
| GO | Category | Description | Count | % |
|
|
|---|---|---|---|---|---|---|
| WP4946 | WikiPathways | DNA repair pathways, full network | 17 | 85 | -38.21 | -33.87 |
| GO:0006281 | GO biological processes | DNA repair | 20 | 100 | -36.36 | -32.31 |
| GO:0006284 | GO biological processes | Base-excision repair | 13 | 65 | -33.31 | -29.57 |
| R-HSA-5649702 | Reactome gene sets | Replacement pathway | 7 | 35 | -22.77 | -19.46 |
| R-HSA-5693532 | Reactome gene sets | DNA double-strand break repair | 12 | 60 | -22.18 | -18.91 |
| GO:0006302 | GO biological processes | Double-strand break repair | 12 | 60 | -21.4 | -18.17 |
| GO:0090305 | GO biological processes | Hydrolysis | 12 | 60 | -19.86 | -16.66 |
| GO:0006287 | GO biological processes | Base-excision repair | 7 | 35 | -19.23 | -16.06 |
| WP4016 | WikiPathways | Response via ATR | 8 | 40 | -15.64 | -12.52 |
| GO:0006979 | GO biological processes | Response to oxidative stress | 10 | 50 | -14.01 | -10.94 |
| GO:0006266 | GO biological processes | DNA ligation | 5 | 25 | -13.73 | -10.68 |
| GO:0006289 | GO biological processes | Nucleotide-excision repair | 5 | 25 | -9.6 | -6.86 |
| GO:0010332 | GO biological processes | Response to gamma radiation | 5 | 25 | -9.56 | -6.83 |
| GO:0043504 | GO biological processes | Mitochondrial DNA repair | 3 | 15 | -8.3 | -5.8 |
| R-HSA-110362 | Reactome gene sets | Excision repair | 3 | 15 | -7.68 | -5.24 |
| M258 | Canonical pathways | PID BARD1 PATHWAY | 3 | 15 | -6.05 | -3.78 |
| GO:0016311 | GO biological processes | Dephosphorylation | 3 | 15 | -2.97 | -0.87 |
Figure 6Coevolution of conserved domain residues. The circular connection diagram focuses on the residues with cutoffs of the NEIL1, NEIL2, and NEIL3 genes. The graphic shows the positions of amino acids in the protein. The colors of the curves denote covariance scores among spots.
Figure 73D multidimensional scaling (MDS) scatterplots of covarying sites in NEIL1, NEIL2, and NEIL3 genes. (A) Highlighted red are the residues corresponding to the positively selected residue. Both black and red spots labeled showing the residues.
Figure 8Surface representation of the human NEIL1, NEIL2, and NEIL3 proteins colored conferring to sequence conservation. The color ramping from white (low score) to red (high score) allows to quickly identify areas of weak and robust structural conservation of the proteins.