Literature DB >> 31518347

Petri net-based model of the human DNA base excision repair pathway.

Marcin Radom1,2, Magdalena A Machnicka3,4, Joanna Krwawicz5,6, Janusz M Bujnicki3,7, Piotr Formanowicz1,2.   

Abstract

Cellular DNA is daily exposed to several damaging agents causing a plethora of DNA lesions. As a first aid to restore DNA integrity, several enzymes got specialized in damage recognition and lesion removal during the process called base excision repair (BER). A large number of DNA damage types and several different readers of nucleic acids lesions during BER pathway as well as two sub-pathways were considered in the definition of a model using the Petri net framework. The intuitive graphical representation in combination with precise mathematical analysis methods are the strong advantages of the Petri net-based representation of biological processes and make Petri nets a promising approach for modeling and analysis of human BER. The reported results provide new information that will aid efforts to characterize in silico knockouts as well as help to predict the sensitivity of the cell with inactivated repair proteins to different types of DNA damage. The results can also help in identifying the by-passing pathways that may lead to lack of pronounced phenotypes associated with mutations in some of the proteins. This knowledge is very useful when DNA damage-inducing drugs are introduced for cancer therapy, and lack of DNA repair is desirable for tumor cell death.

Entities:  

Year:  2019        PMID: 31518347      PMCID: PMC6743755          DOI: 10.1371/journal.pone.0217913

Source DB:  PubMed          Journal:  PLoS One        ISSN: 1932-6203            Impact factor:   3.240


Introduction

Living organisms as well as their functional parts such as organs, tissues, cells, etc. are very complex systems. These systems are composed of numerous basic building blocks connected by a dense network of interactions and mutual dependencies. A sum of such networks determines the structure and the functionality of the system. Therefore, in order to fully understand the nature of living organisms, it does not suffice to analyze properties of the building blocks, but it is also necessary to analyze the networks of interactions among them. Thus far, various computational methods for analyzing complex systems have been developed, and some of them can be applied in the analysis of biological systems [1, 2]. However, these systems have their own specificity, from which follows that in some cases it can be difficult to use methods developed earlier for other systems. The first and necessary step in the analysis of a complex system is to build its formal model. This model can be expressed in a language of some branch of mathematics, and for instance, differential equations are often used for this purpose. Recently, Petri net theory has emerged as a promising approach for modeling and analysis of biological systems [1]. These nets have been described for the first time by C. A. Petri in the early 1960s in the context of theoretical computer science and for many years this was the main area of their applications [3, 4]. Since this time the theory of Petri nets has been intensively studied and many methods of the analysis of their properties have been developed. In the mid 1990s it was realized that Petri nets could also be used for modeling and analysis of biological systems [5, 6] and during the last two decades the possibilities and limitations of Petri net applications in this area were studied [1, 7, 8]. On the one hand, a strong advantage of Petri net-based models is their intuitive graphical representation. On the other hand, the nets can be analyzed using precise mathematical methods. Moreover, contrary to differential equations, they do not require the knowledge about precise values of some parameters of the modeled system, what can be a very important property in many cases since these values are often difficult to determine for biological phenomena. The integrity of genetic information depends on the interplay between the DNA base modification (caused by enzymatic reactions), DNA damage (caused by cellular and env<span class="Chemical">ironmental chemical reactions), and all DNA-binding proteins and readers of information that work together to make the whole system functional. The DNA damage repair is an exemplary complex biological system, in which a large number of elements (mostly enzymes that perform chemical reactions) cooperate with each other to protect the cell from consequences of DNA damage events. Base damage can be mutagenic and/or cytotoxic and can lead to mutagenesis or cell death, respectively, when unrepaired DNA lesions undergo replication or transcription. To avoid such consequences, correct repair of different types of DNA damage is assured by the existence of several DNA repair subsystems maintained by the cell in order to survive the DNA damage. In <span class="Species">humans these subsystems, called pathways, include DNA damage signaling (DDS), direct reversal repair (DRR), base excision repair (BER), nucleotide excision repair (NER), mismatch repair (MMR), homologous recombination repair (HRR), nonhomologous end-joining (NHEJ) and translesion synthesis (TLS). Details concerning DNA damage and repair in several living organisms were collected and included in the public databases developed by our groups (http://repairtoire.genesilico.pl/pathways/ [9], https://www.ibb.waw.pl/pl/Sekretariat%20Naukowy/Projekty%20strukturalne/dnatraffic [10]). The DNA repair processes have already been a subject of computational modeling. A broad range of modeling approaches gave opportunities to gain a variety of insights into DNA damage and repair. Some of these models were built based on detailed kinetic data available from experimental studies. Examples include the stochastic model of NHEJ, which uses molecule numbers, reaction rate constants and kinetic rate constants to explain the dynamic response to damage induced by different levels of gamma irradiation in human fibroblasts [11]. Computational modeling was also conducted for some sub-pathways of BER, where e.g., differential equations were used to model the removal of 8-oxoguanine (8-oxoG) via base excision repair initiated by human OGG1 glycosylase [12]. In this case the availability of kinetic data for both wild type enzymes and their mutant variants allowed for the prediction of the kinetics and capacity of the whole sub-pathway in the presence of some single-nucleotide polymorphisms which have been associated with cancer. Other examples of kinetic models for BER include the works of [13] and [14]. Valuable insights into DNA repair processes were also delivered by models which did not require incorporation of very detailed data as in the cases discussed above. For example, Monte Carlo simulations of radiation-induced damage and its repair were used to investigate the repair of clustered damage via BER and NER pathways [15, 16]. These approaches allowed to estimate the probabilities of correct repair, fractions of damages converted into double-strand breaks (DSBs) and fractions of repairs ending with mutations. In this work, we present an extensive study of a Petri net model of the BER repair pathways, which we have built based on available literature and from biological pathway databases. In contrast to models built by others, our approach involves modeling the repair of many different types of DNA damage through the activity of a collection of DNA glycosylases initiating BER and other enzymes performing further steps of the repair. With such a comprehensive model formal analysis becomes possible using methods provided by the Petri nets theory. Here we have performed a t-invariant based analysis, involving <span class="Gene">MCT sets and t-clusters, decomposing the model into precisely identified and named functional modules. Understanding their interaction provides valuable insights into the complex DNA repair process. We have also performed an in silico knockout analysis as a prediction of system behavior under a collection of disturbances. The organization of the p<span class="Gene">aper is as follows. Section 2 provides an overview of Petri nets, including the analytical analysis of t-invariants. In Section 3 the results of the net analysis are presented. The p<span class="Gene">aper ends with discussion and conclusions given in Section 4.

Methods

Petri nets

A Petri net is a mathematical object based on a weighted directed bipartite graph. There are two disjoint sets of vertices called places and transitions. Places correspond to passive components of a modeled system (e.g., chemical compounds) while transitions model active components (e.g., chemical reactions or other elementary subprocesses). Places can be connected with transitions and transitions with places by arcs, which describe causal relations between elementary components of the system. The arcs are labeled by weights being positive integer numbers. Places can hold objects called tokens, which represent discrete quantities of the passive components. In a graphical representation of Petri nets places are denoted as circles, transitions as rectangles or bars, arcs as arrows and tokens as dots or positive integer numbers within places. If a weight of an arc is equal to one, usually it is not shown in the graphical representation; otherwise it is represented as a number labeling the arc. The bipartite graph underlying a Petri net determines its structure (which should correspond to a structure of the modeled system), but most of the fundamental properties of nets of this type follow from their dynamics which is related to tokens. The flow of tokens through the net from one place to another via transitions corresponds to the flow of information, substances etc. through the modeled system. The distribution of tokens over all places, called marking, represents a state of the system. The flow of tokens is governed by a transition firing rule. According to this rule, a transition is enabled if the number of tokens in each of its pre-places, i.e., the ones which directly precede this transition, is equal to at least the weight of the arc connecting such a place with the transition. An enabled transition may fire, which means that tokens flow from its pre-places to its post-places, i.e., the ones which are its immediate successors. The number of flowing tokens is equal to the weight of the arc. There are two exceptions to this rule, i.e., transitions without pre-places are continuously enabled while transitions without post-places do not produce tokens. These types of transitions can be used to model interactions of the system with its env<span class="Chemical">ironment. A read arc is a special kind of bidirectional arc, corresponding to a pairs of arcs. Tokens in a place connected by a read arc to a transition are not consumed when firing this transition, but they are required to enable it. Part of the results in the following p<span class="Gene">aper comes from the analysis of the net simulation. In such a simulation one must set specific rules governing the firing of the transitions. Our model is based on a classical Petri net, where in general only three scenarios for transition firing are available: in a single step of the simulation only one of the enabled transitions can fire, all enabled transitions can fire simultaneously or, in a third case, only some, randomly chosen enabled transitions will fire. The latter scenario has been used in our simulations. <span class="Chemical">Specifically, each enabled transition has 50% chance of firing and the sequence of enabled transitions selected for firing in a single simulation step is (each time) chosen randomly. More complex firing scenarios could be available if, e.g., stochastic Petri net has been used. It would however require assigning a so called firing rates to each transition, what in general would be a difficult task requiring knowledge about more or less specific probability of each reaction involved in the modeled system. The assumption of 50% firing chance is therefore a trade-off, allowing simple yet still valuable stochastic simulations of the model. As a summary one can say that in a single step some random enabled transitions (with 50% chance for firing) are randomly assigned to the firing list. One must note that they will not necessarily all fire, because competing transitions on such a list can share the same pre-places and consume “activation” tokens from each other depending on the firing sequence (i.e., when a transition is on that list, but because of firing of the other ones such a transition loses the necessary minimum of tokens in its pre-places, it stops being active and cannot fire). However, such a firing sequence is random in each simulation step, so a sufficient stochastic scenario is provided to study the dynamic behavior of the model. More advanced analysis methods for simulation of a net would be available if a stochastic Petri net (SPN) or continuous Petri net (CPN) have been used to model the DNA repair process. The latter would require the formulation of ODEs for the reaction flows. For the SPN model, the so-called firing rates need to be assigned for each transition—in the stochastic Petri net they influence the transition chances of firing in the simulation. Both approaches have advantages (e.g., a more complex and accurate simulation) and disadvantages. As for the latter, it is still difficult to find specific and accurate values which could describe the ODE in the CPN model or firing rates for all transitions for the SPN model. Further, in a continuous Petri net an invariant, MCT and cluster-based analysis explained in the next section is not available, while in this work such an analysis allowed us to divide the Petri net model into some functional subnets, characterized by the common functionality.

t-invariants analysis

Besides the graphical representation of Petri nets also a more formal one, called incidence matrix, is used. Entries of such matrix A = [a] are integer numbers, and entry a is equal to a difference between numbers of tokens in places p before and after firing transition t. Among many properties of Petri nets, the ones related to t-invariants are especially important in the context of an analysis of biological systems models. A t-invariant is a vector x of integer numbers being a solution to the equation For every t-invariant x there is a set of transitions supp(x) = {tj:xj>0,j = 1,2,…,m} called its support. t-invariants correspond to some subprocesses of the modeled biological system which do not change its state. Hence, analyzing relations among them may lead to discoveries of properties of the system [17]. The net is covered by t-invariants if every transition belongs to a support of at least one of them. In such a situation every transition contributes to at least one process of the modeled biological system [18-20]. Therefore, when an analysis is based on t-invariants the net should be covered by them. In order to find relations between t-invariants they are grouped into sets called t-clusters, using standard clustering algorithms. Also transitions can be grouped into sets called maximal common transition sets (MCT sets) what helps in the analysis of t-invariants [17, 21]. Both t-clusters and MCT sets correspond to some functional modules of the biological system. A more precise description of a Petri net elements, invariants and MCT sets is available as Appendix in S1 Appendix.

Results

The model of the human base excision repair pathway

The Petri net presented in this work is a model of the <span class="Species">human base excision repair pathway. The model was built based on data from the publicly available databases: REPAIRtoire—a database of DNA repair pathways [9], DNAtraffic [10] and Reactome [22, 23]. Subsequently, the computational model was supplemented with several new aspects of the apurinic/apyrimidinic (AP) site processing which are not included in the above-mentioned databases but were reported in the literature: displacement of OGG1 glycosylase at the AP site by AP endonuclease [24-26] or NEIL1 glycosylase [27, 28], displacement of <span class="Gene">NTH1 glycosylase by <span class="Gene">AP endonuclease [29], Polβ-mediated <span class="Chemical">LP-BER [30, 31], mechanism of the repair initiated by NEIL2 and NEIL3 [32, 33]. The model has been built and analyzed using Holmes software [34] and is available in <span class="Chemical">SPPED (Snoopy), XML (SMBL), .project (Holmes) and PDF formats as S1, S2, S3 and S4 Files respectively. The model can be subdivided into 4 subnets, described below:

Subnet 1: Introduction of DNA damage

This part of the model represents the creation of different types of DNA damage from undamaged DNA.

Subnet 2: AP site formation

The biggest part of the model represents the recognition of DNA damage by DNA glycosylases, followed by the removal of damaged bases by cleavage. This part of the model can be further subdivided into 11 modules, each of which represents the activity of one DNA glycosylase (Table 1).
Table 1

Human DNA glycosylases studied in the Petri net model.

Glycosylase name in the modelBiological meaningand physiological DNA substratesType of action:MonofunctionalorBifunctional(β-elimination)(β,δ-elimination)Protein familyReferences
1NTH1 (NHTL1)Human homolog of Endonuclease IIITg/dsDNA; 5-OHC/dsDNA; 5-OHU/dsDNA; 5,6-diHT; FaPyG; Cg; 5,6-diOHUBifunctional(δ-elimination)HhH[10, 3538]
2SMUG (SMUG1)Human single-strand Selective Monofunctional Uracil-DNA Glycosylase 1dU; 5-OHU; 5-hmU; 5-fUMonofunctionalUDG[10, 36, 3941]
3MBD4Methyl-CpG Binding Domain Endonuclease 45-fU; U and T from CpG islands; 5-hmU/ssDNA, Tg opposite G, 5-hmU from U:GMonofunctionalHhH[10, 42, 43]
4MPG (ANPG, AAG)N-Methylpurine-DNA Glycosylase3-mA; 7-mA; N7-mG; N3-mG; 1,N6εA; 1,N(2)εG; 1-mA; 1-mG; 3-meC; HxMonofunctionalMPG[10, 4448]
5MYH (MUTYH)MutH E. coli HomologA opposite 8-oxoG, C or G/ds DNA; 2-oxoA; 2-hAMonofunctionalHhH[10, 36, 41]
6NEIL1Endonuclease VIII-Like 15-OHC; 5-OHU/ssDNA and dsDNA; 8-oxoG opposite C, G, T; 5,6-diHT, 5,6-diOHU; FaPyA; FaPyG; FaPy7mG; Tg; DNA-psoralen,8-oxoA opposite CBifunctional(β,δ-elimination)Endonuclease VIII-like[10, 36, 39, 49, 50]
7NEIL2Endonuclease VIII-Like 25-OHC; 5-OHU; 5,6-diHU; 5,6-diHT; 5,6-di-OHU; 8-oxoG present inside a bubble (ssDNA); FaPyABifunctional(β,δ-elimination)Endonuclease VIII-like[10, 36, 49, 51]
8NEIL3Endonuclease VIII-Like 3FaPyA ssDNA; FaPyG ssDNA; 5-OHU ssDNA; 5-OHC ssDNA; Tg; 8-oxoA, spiroiminodihydantoin, guanidinohydantoinBifunctional(β-elimination)Endonuclease VIII-like[33, 36, 52]
9OGG18-Oxoguanine DNA Glycosylase8-oxoG opposite C, G, T; 8-oxoA from A:C; FaPyG; FaPy7-mGBifunctional(β-elimination)HhH[10, 36, 39, 5355]
10TDGT/G Mismatch-Specific Thymine DNA GlycosylaseU and T from CpG islands; 5-hmU from U:G; 5-OHC; 5-fC; 5-caC; 5-FC; 3,N(4)εC; Tg opposite G, 8-oxoA opposite C, G and TMonofunctionalUDG[41, 43, 5663]
11UNG2 (UNG)Uracil-DNA Glycosylase 2 (nuclear) UNG1 (mitochondrial)U in ssDNA and dsDNAMonofunctionalUDG[10, 36, 64, 65]

A–adenine; C–cytosine; G–guanine; T–thymine; U–uracil; Br–bromo; ca–carboxyl; diH–dihydro; diOH–dihydroxy; ε –etheno; f–formyl; F–fluoro; FaPy– 2,6-diamino-4-hydroxy-5-N-methylformamidopyrimidyne; g–glycol; Hx–hypoxanthine, m–methyl; OH–hydroxyl (h); oxo–dihydro; ss–single-stranded; ds–double-stranded.

A–adenine; C–cytosine; G–guanine; T–<span class="Chemical">thymine; U–uracil; Br–bromo; ca–carboxyl; diH–dihydro; diOH–dihydroxy; ε –etheno; f–formyl; F–fluoro; FaPy– 2,6-diamino-4-hydroxy-5-N-methylformamidopyrimidyne; g–glycol; Hx–hypoxanthine, m–methyl; OH–hydroxyl (h); oxo–dihydro; ss–single-stranded; ds–double-stranded. DNA glycosylases and their substrates are represented by places. The substrates were divided into groups, based on a subset of DNA glycosylases that recognize them (Table 2).
Table 2

Damages to the DNA taken into account in the model.

Group nameDamage detectedGlycosylases
1U:G and UU from dsDNA, U from U:GUNG2, SMUG, TDG
2U:A and U ssDNAU from ssDNA, U from U:AUNG2, SMUG
35-hmU:G5-hmU from U:GSMUG, MBD4, TDG
45-hmU ssDNA and 5-fU5-hmU from ssDNA and 5-fUSMUG, MBD4
55-OHU ssDNA5-OHU ssDNASMUG, NEIL1, NEIL2, NEIL3
65-OHU paired with A/Gand 5-hmU paired with A5-OHU from U:G and U:A,5-hmU from U:ASMUG
7T/U:CpGU from U:G in CpG sitesand T from T:G in CpG sitesMBD4
88-oxoA:C8-oxoA from A:CTDG, OGG1, NEIL1, NEIL2
95-OHC in dsDNA5-OHC dsDNATDG, NTH1, NEIL1, NEIL2
108-oxoA paired with T/G8-oxoA from A:T and A:GTDG, NEIL2
11T paired with T/C/G/O6-mG, 5-fC,5-caC; 3,N(4)εCT paired with T/C/G/O6-mG, 5-fC,5-caC and 3,N(4)εCTDG
12FaPyG:CFaPyG from FaPyG:COGG1, NTH1, NEIL1
13A paired with G/8-oxoG/C and 2-OH/oxoA paired with T or GA from A:G, A:8-oxoG and A:C, 2-OHA/2oxo-A from A:GMYH
141/3/7-mA, 1/3/7-mG, 3-mC, 1,N6εA, hypoxanthine, 1,N(2)εG1-mA, 3-mA, 7-mA, 1m-G, 3-mG, 7-mG, 3-mC, 1,N6εA, 1,N(2)εG, hypoxanthineMPG
155-OHU in dsDNA; 5,6-diHT and 5,6-diOHU5-OHU dsDNA; 5,6-diHT and 5,6-diOHUNTH1, NEIL1, NEIL2
16Tg paired with GTg from Tg:GMBD4, TDG, NTH1, NEIL1
17FaPyG paired with A/G/TFaPyG from FaPyG:A, FaPyG:G and FaPyG:TOGG1, NTH1
188-oxoG paired with C/G/T; FaPy-7mG8-oxoG from G:C, G:G and G:T; FaPy-7mGOGG1, NEIL1
195-OHC ssDNA5-OHC ssDNANEIL1, NEIL2, NEIL3
20Tg in ssDNATg ssDNANEIL1, NEIL3
21Tg paired with A, Tg in dsDNATg from Tg:A, Tg dsDNANTH1, NEIL1, NEIL3
22FaPyA, FaPyG, spiroiminodihydantoin, guanidinohydantoin from ssDNAFaPyA, FaPyG, spiroiminodihydantoin and guanidinohydantoin from ssDNANEIL3
23Cg from dsDNACg dsDNANTH1
24FaPyAFaPyANEIL1, NEIL2
25DNA-psoralenDNA-psoralenNEIL1
265,6-diHU; 8-oxoG in ssDNA5,6-diHU, 8-oxoG ssDNANEIL2
Each group of DNA damage is represented by a separate place. For each glycosylase-substrate pair this part of the model contains a transition representing the damage recognition process, which leads to the glycosylase-damaged DNA complex formation and cleavage of the damaged base, which in turn results in the AP site formation. Fig 1 presents the submodule for the <span class="Gene">OGG1 DNA glycosylase (full names of places and transitions of this submodule are given in Table 3).
Fig 1

The part of the BER Petri net model presenting AP site formation by the OGG1 glycosylase.

Recognition of a specific error group, the cleavage and formation of the DNA repair complex is shown for different error molecules. A general, schematic diagram of the modeled process is shown on the left. For more detailed description of places and transitions see Table 3. Shades of gray were used to mark steps of the process.

Table 3

The biological meaning of places and transitions for Fig 1, presenting AP site formation by the OGG1 glycosylase (subnet 2).

Place / Transition TagMolecules / reaction represented by places and transitions
Glycosylases
OGG1OGG1
DNA damage
grp8_8oxo_ACgroup 8: 8-oxoA paired with C
grp12_FapyGCgroup 12: FapyG paired with C
grp18_8oxoG_CGTgroup 18: 8oxoG paired with C/G/T; FaPy-7mG
grp17_FapyG_AGTgroup 17: FapyG paired with A/G/T
Intermediate complexes and repair steps
recognANDbind_grp17transition: OGG1 mediated recognition and binding of grp17 damage
recognANDbind_grp8transition: OGG1 mediated recognition and binding of grp8 damage
recognANDbind_grp12transition: OGG1 mediated recognition and binding of grp12 damage
recognANDbind_grp18transition: OGG1 mediated recognition and binding of grp18 damage
comDNA_OGG1_grp18place: complex: DNA with grp18 damage + OGG1
comDNA_OGG1_grp8place: complex: DNA with grp8 damage + OGG1
comDNA_OGG1_grp12place: complex: DNA with grp12 damage + OGG1
comDNA_OGG1_grp17place: complex: DNA with grp17 damage + OGG1
cleavage_grp17transition: grp17 damage cleavage by OGG1
cleavage_grp8transition: grp8 damage cleavage by OGG1
cleavage_grp12transition: grp12 damage cleavage by OGG1
cleavage_grp18transition: grp18 damage cleavage by OGG1
Products
comDNA_AP_OGG1place: complex: DNA with AP site + OGG1
8oxoAplace: 8oxoA
FapyGplace: Fapy G
8oxoGplace: 8oxoG
FapyGplace: FapyG

The part of the BER Petri net model presenting AP site formation by the OGG1 glycosylase.

Recognition of a specific error group, the cleavage and formation of the DNA repair complex is shown for different error molecules. A general, schematic diagram of the modeled process is shown on the left. For more detailed description of places and transitions see Table 3. Shades of gray were used to mark steps of the process.

Subnet 3: AP site processing

AP site (generated by damaged base removal) is incised either by AP-lyase activity of a bifunctional DNA glycosylase or by AP endonuclease (APE). AP-lyase activities of OGG1, NTH1, NEIL1, NEIL2 and NEIL3 are represented in the model by appropriate lyase activity transitions. In the case of OGG1, NTH1 and NEIL3, which perform β-elimination, the lyase activity results in the formation of DNA incision with 3ʹdRP (deoxyribose phosphate) end (Fig 2A). Full names of places and transitions for all three Fig 2 subnets are given in Table 4.
Fig 2

Three parts (A-C) of the BER Petri net model presenting AP site processing by different glycosylases. From left to right: part A—an example of the lyase activity generating 3ʹdRP end, part B—an example of the β,δ-elimination reaction, part C—an example of the DNA incision after monofunctional glycosylase activity, D—A general, schematic diagram of the modeled processes. For a more detailed description of places and transitions see Table 4. Shades of gray were used to mark different AP site processing paths.

Table 4

The biological meaning of places and transitions for Fig 2, presenting AP site processing by different glycosylaes (subnet 3).

Place / Transition TagMolecules / reaction represented by places and transitions
Lyase activity generating 3ʹdRP end
comDNA_AP_OGG1place: complex: DNA with AP site + OGG1
OGG1_lyase_acttransition: OGG1 performs β-elimination producing 3'dRP end
comDNA_3dRP_OGG1place: complex: DNA with 3'dRP end + OGG1
OGG1_displ_byNEIL1transition: NEIL1 stimulated displacement of OGG1
NEIL1place: NEIL1
comDNA_3dRP_NEIL1place: complex: DNA with 3'dRP end + NEIL1
OGG1place: OGG1
OGG1_displ_byAPEtransition: OGG1 displacement by APE
APEplace: APE
β,δ-elimination reaction
comDNA_AP_NEIL1place: complex: DNA with AP site + NEIL1
NEIL1_lyase_acttransition: NEIL1 performs β,δ-elimination producing 3ʹP end
comDNA_3p_NEIL1place: complex: DNA with 3ʹP end + NEIL1
DNA incision after monofunctional glycosylase activity
comDNA_AP_TDGplace: complex: DNA with AP site + TDG
TDG_displ_byAPEtransition: TDG displacement by APE
TDGplace: TDG
comDNA_AP_APEplace: complex: DNA with AP site + APE
APE_cleavagetransition: cleavage of AP site by APE
comDNA_br_APEplace: complex: DNA with single strand break + APE
Three parts (A-C) of the BER Petri net model presenting AP site processing by different glycosylases. From left to right: part A—an example of the lyase activity generating 3ʹdRP end, part B—an example of the β,δ-elimination reaction, part C—an example of the DNA incision after monofunctional glycosylase activity, D—A general, schematic diagram of the modeled processes. For a more detailed description of places and transitions see Table 4. Shades of gray were used to mark different AP site processing paths. Only NEIL1 and NEIL2 generate DNA incision with 3ʹP end through β,δ-elimination (Fig 2B). In contrast, AP sites generated by monofunctional glycosylases (SMUG, MBD4, MPG, MYH, TDG, UDG) need additional processing and involvement of APE. This reaction results in DNA incision with 5ʹdRP end (Fig 2C). The AP-lyase activity of OGG1 is weak ‒ after base cleavage, OGG1 can be substituted by APE, which catalyzes strand incision leading to 5ʹdRP end (Fig 2A). Similarly, APE can process AP sites generated by NTH1, and NEIL1 can catalyze β,δ-elimination at the AP sites generated by OGG1 (Fig 2A and 2C). The 3ʹdRP and 3ʹP DNA termini (dirty ends) are refractory to DNA polymerase repair synthesis and need to be “cleaned” to 3ʹOH for BER to proceed. In human cells, APE and PNKP are responsible for the removal of 3ʹdRP and 3ʹP, respectively. 5ʹdRP end prevents the ligation process and its removal is primarily carried by DNA Polβ or FEN1.

Subnet 4: DNA repair synthesis and ligation

The last step of BER is the replacement of the excised nucleoside by repair synthesis catalyzed by DNA polymerases (presented in Fig 3, full names of places and transitions are given in Table 5) and nick sealing by a DNA ligase. It can proceed either as SP- or LP-BER. Typically, SP-BER is performed by Polβ and LIG3-XRCC1 complex on the products of bifunctional glycosylases activity. Repair initiated by monofunctional glycosylases proceeds via LP-BER which requires activity and presence of Polβ, Polδ/Polε, PCNA, FEN1 and LIG1. However, the monofunctional glycosylase pathway can also proceed via SP-BER, after removal of 5ʹdRP by Polβ or FEN1. If Polβ removes 5ʹdRP than the final ligation step is performed by LIG3-XRCC1 complex, otherwise it is done by LIG1. DNA synthesis in LP-BER is usually performed either by Polδ or Polε, with Polδ being preferred in high concentrations of PCNA [66]. However, it was also shown that Polβ can conduct strand synthesis in LP-BER.
Fig 3

The part of the BER Petri net model presenting different repair synthesis pathways.

A general, schematic diagram of the modeled process is shown below the Petri net. For more detailed description of places and transitions see Table 5. Shades of gray were used to mark different paths.

Table 5

The biological meaning of places and transitions for Fig 3 presenting different repair synthesis pathways (subnet 4).

Place / transition TagMolecules / reaction represented by places and transitions
SP-BER after bifunctional glycosylase activity
comDNA_1nt_APEplace: complex: DNA with 1nt gap + APE
polBetaplace: Polβ
polBeta_recruitmtransition: Polβ recruitment
comDNA_1nt_APE_Betaplace: complex: DNA with 1nt gap + APE + Polβ
polBeta_inc_1nttransition: Polβ incorporates 1nt
dNTPplace: dNTP
dNTP_gentransition: generate dNTP
dNTP_dispostransition: disposal of dNTP
PPiplace: PPi
comDNA_unLig_APE_Betaplace: complex: DNA not ligated + APE + Polβ
lig_recruitmtransition: ligase recruitment
LIG3_XRCC1place: LIG3_XRCC1
comDNA_unlig_Beta_LXplace: complex: unligated DNA + Polβ + LIG3 + XRCC1
LIG3_ligtransition: ligation byLIG3
comDNA_lig_Beta_LXplace: complex: ligated DNA + Polβ + LIG3 + XRCC1
diss_SPcomtransition: dissociation of SP-BER complex
APEplace: APE
comDNA_3p_PKNPplace: complex: DNA with3’P and + PKNP
removal_3p_byPKNPtransition: removal of3’P by PNKP
3pplace:; 3’P ending
3p_dispostransition: disposal of 3’P
comDNA_1nt_PKNPplace: complex: DNA with 1nt gap + PNKP
polBeta_recruittransition: Polβis recruited to the repair complex which contains 1nt gap
comDNA_1nt_Beta_PKNPplace: complex: DNA with 1nt gap + Polβ + PKNP
polBeta_inc_1nttransition: Polβincorporates 1nt
comDNA_unlig_Beta_PKNPplace: complex: unligated DNA + Polβ + PNKP
ligationPNKPtransition: ligation by LIG3 in pathway with PNKP
comDNA_lig_Beta_others1place: complex: ligated DNA + Polβ + LIG3_XRCC1 + PNKP
diss_SPcom_PNKPpathtransition: dissociation of SP-BER complex in pathway with PNKP
PNKPplace: PNKP
LP-BER
comDNA_br_APEplace: complex: DNA with a break + APE
polBeta_recruitmDEtransition: Polβrecruitment in LP-BER
comDNA_br_APE_Betaplace: complex: DNA with a break + APE + Polβ
inc_1nttransition: incorporation 1nt
comDNA_1nt_5dRP_APE_Betaplace: complex: DNA 1nt 5dRP + APE + Polβ
polDeltaplace: Polδ
polEpsilonplace: Polε
polB_disp_byPolDtransition: Polβdisplacement by Polδ
polB_disp_byPolEtransition: Polβdisplacement by Polε
PCNA_highplace: PCNA at high levels
PCNA_lowplace: PCNA at low levels
PCNA_dectransition: PCNA decreasing
PCNA_inctransition: PCNA increasing
comDNA_1nt_APE_Delta_PCNAplace: complex: DNA with 1nt gap + APE + Polδ + PCNA
comDNA_1nt_APE_Epsilon_PCNAplace: complex: DNA with 1nt gap + APE + Polε + PCNA
DNA_synt_polDtransition: DNA synthesis (LP) by Polδ
DNA_synt_polEtransition: DNA synthesis (LP) by Polε
comDNA_flap_APE_Delta_PCNAplace: complex: DNA with a flap + APE + Polδ + PCNA
comDNA_flap_APE_Epsilon_PCNAplace: complex: DNA with a flap + APE + Polε + PCNA
FENplace: FEN1
flap_cleav_polDtransition: flap cleavage (LP) by Polδ
flap_cleav_polEtransition: flap cleavage (LP) by Polε
flapplace: flap
flap_outtransition: flap out
comDNA_unlig_APE_Delta_PCNAplace: complex: DNA with an unligated patch + APE + Polδ + PCNA
comDNA_unlig_APE_Epsilon_PCNAplace: complex: DNA with an unligated patch + APE + Polε + PCNA
lig_LP_polDtransition: ligation by LIG1 in LP-BER with Polδ
lig_LP_polEtransition: ligation by LIG1 in LP-BER with Polε
LIGplace: LIG1
comDNA_lig_APE_Delta_PCNA_LIGplace: complex: ligated DNA + APE + Polδ + PCNA + LIG1
comDNA_lig_APE_Epsilon_PCNA_LIGplace: complex: ligated DNA + APE + Polε + PCNA + LIG1
diss_LP_polDtransition: dissociation of LP-BER complex with Polδ
diss_LP_polEtransition: dissociation of LP-BER complex with Polε
PCNAplace: PCNA
PCNA_freetransition: PCNA freed from DNA complex after repair
LP-BER with polBeta and SP-BER with FEN
polBeta_rem_5dRPtransition: Polβremoves 5ʹdRP
5dRPplace: 5’dRP
PCNA_addedtransition: PCNA added to the complex
comDNA_1nt_gap_Beta_PCNAplace: complex: DNA with 1nt gap and 5ʹdRP + APE + Polβ + PCNA
DNA_synthesistransition: DNA synthesis by Polβ
comDNA_flap_APE_Betaplace: complex: DNA with a flap structure + APE + Polβ
flap_exctransition: flap excision
comDNA_unlig_APE_Beta_FENplace: complex: unligated DNA + APE + Polβ + FEN1
APE_removtransition: APE removal
FEN1_dRP_cleavtransition: FEN1 5ʹdRP cleavage
comDNA_1nt_FEN_Betaplace: complex: DNA with 1nt gap + FEN1 + Polβ
polBeta_gap_filltransition: Polβgap filling
comDNA_unlig_Beta_FENplace: complex: unligated DNA + Polβ + FEN1
LIG_ligationtransition: ligation by LIG1
comDNA_lig_Beta_FEN_LIGplace: complex: ligated DNA + Polβ + FEN1 + LIG1
diss_othertransition: dissociation of repair complex
DNA_repairedplace: repaired DNA after damage

The part of the BER Petri net model presenting different repair synthesis pathways.

A general, schematic diagram of the modeled process is shown below the Petri net. For more detailed description of places and transitions see Table 5. Shades of gray were used to mark different paths.

Mathematical analysis of the model

The presented Petri net model consists of 259 transitions and 179 places. Full names of places and transitions are given in Tables A and B in S1 Tables, respectively. In the first step of the analysis of the model, some of its structural properties have been identified. The net is pure and ordinary, i.e., there are no read arcs, and all the arcs have a weight equal to 1. Therefore, the net is homogenous–all outgoing arcs of a given place have the same weight. The network is connected, but not strongly connected, meaning that all its nodes have non-directed connection(s) with the others, but there are pairs of nodes without a directed path between them. The net is not conservative, i.e., many transitions consume a different number of tokens from their pre-places than they produce in the post-places. There are also static conflicts–some transitions share the same pre-places. Finally, there are input and output transitions in the model, which correspond to the connections of the analyzed system with other processes. An important part of the analysis of the net is based on the t-invariants. There are 245 t-invariants and the net is covered by them. In the presented model of DNA BER-repair process, there are 94 non-trivial MCT sets, i.e., containing more than one transition. Many of the MCT sets are responsible for almost the same functionality, but operating on different substrates. Description of the most important MCT sets is presented in Table 6.
Table 6

Maximum Common Transitions sets and their biological functions.

MCT setsContained transitionsBiological meaning
m1t90, t93, t94, t95, t97, t98Polβ in SP-BER using PNKP.
m2t0, t47, t49, t58, t60Polδ in LP-BER.
m3t1, t48, t50, t59, t61Polε in LP-BER.
m4-m7, m9-m12, m18, m21-m37, m41-m62-m94from 2 to 4 transitions at maximumDetection of DNA damage by a proper glycosylase connected with the disposal of by-products. Also processes creating crucial polypeptides of the system.
m8t40, t169, t170, t171DNA incoming and leaving the repair process.
m13t32, t41, t90Cleavage of AP site by APE connected with Polβ recruitment to the DNA-proteins complex.
m14t42, t62, t85Removal of 3ʹdRP by APE, Polβ recruitment leading to SP-BER only.
m15t46, t56, t57XRCC1 and LIG3 ligase recruitment, SP-BER with DNA-Polβ complex followed by complex dissociation.
m16t102, t108, t109PCNA added to DNA-Polβ-APE complex, ligation and dissociation of the complex.
m17t103, t105, t107LP-BER (sub-path parallel to m39 MCT)
m19t6, t87NEIL2 AP-lyase activity
m20t8, t9NEIL3 AP-lyase activity
m38t52, t53NTH1 AP-lyase activity
m39t99, t177NEIL1 AP-lyase activity on 3ʹdRP
m40t100, t106FEN1 cleavage and Polβ gap filling (sub-path parallel to m15 MCT).
35 trivial MCT setssingle transitionVarious auxiliary functions within the base excision repair DNA process not covered by MCT m1-m89.
The most important basic functions of the modeled system correspond to eight MCT sets: m1 (short patch repair with Polβ and PNKP), m14 followed by m15 (short patch repair for DNA-APE complex), m16, m17, m40 (short patch (SP) and long patch (LP) repair in which Polβ and FEN1 are involved), finally m2 and m3 responsible for long patch repair using Polδ and Polε, respectively. Two very important sets are m13 and m14. The latter is crucial for starting one of the short patch repairs with Polβ. Set m13 is responsible for the preparation of the DNA complex for four out of six repair paths, including three long patch repairs (with Polβ, Polδ, and Polε). Overall, the results of analysis of the MCT sets suggest that in most cases the chosen repair sub-pathway is determined by the damage type at the very beginning of the process. However, in some cases alternative pathways are available. For example repair processes in which generation of 5′dRP by APE activity takes place, the repair can either proceed via SP-BER (m15) or via LP-BER, if PCNA level is high (m17, m40). The next step of the model analysis concerns t-clusters computation. Such successfully obtained clusters can divide the Petri net into different subnets responsible for the similar or common functions in the whole DNA repair process. Such a division is based on the t-invariants set. The description of such subnets/clusters can use the previously computed <span class="Gene">MCT sets for simplification (by replacing some groups of transitions by such sets), yet one should have in mind that such a t-cluster analysis is based solely on the computed t-invariants sets. In order to find the optimal number of clusters, many similarity measures and grouping algorithms had to be used and their results compared. The theory is that for each cluster (group) its t-invariants must be very similar to each other according to the chosen similarity measure, and as much as possible dissimilar to t-invariants belonging to different groups [67]. Three important choices needed to be made: choosing the proper similarity measure, the clustering algorithms and the number of clusters. Thorough comparative tests of different choices in these three variables have been performed. In the tests eight similarity measures have been used: Binary, Canberra, Euclidean, Manhattan, Maximum, Minkowski, uncentered and centered Person (the latter often called Correlated Pearson metric). Further, seven clustering algorithms have been tested: Centroid, Complete, McQuitty, Median, Single, UPGMA, and Ward. The evaluation concerned the number of clusters ranging from 2 to 30. In order to evaluate the results two indices have been used, i.e., Mean Split Silhouette (MSS) [68] and Caliński-Harabasz index [69]. For the presented Petri net the best clustering has been obtained using the UPGMA algorithm and correlated Pearson similarity measure. The best number of clusters according to the used evaluation measures is 27. However, 20 of them are trivial single t-invariants responsible for the fluctuations of the system components, like APE, Polβ, Polδ, Polε and the glycosylases responsible for damage detection. The obtained clustering (i.e., set of clusters) is presented in Table 7.
Table 7

Clusters and their biological interpretations.

ClusterCluster size:Biological interpretation
1–201 t-inv. per clusterAPE, PCNA, other polymerases and glycosylases fluctuations in the model
2124Short patch repair using Polβ and PNKP for OGG1, NEIL1 and NEIL2
2216Short patch repair using Polβ, APE, LIG3 and XRCC1 for OGG1, NEIL3, NTH1
2341Long patch repair using Polβ, PCNA, with flap excision using FEN1 and ligation with LIG1 for all glycosylases except NEIL1 and NEIL2
2437Short patch repair after 5ʹdRP removal by Polβ for all glycosylases except NEIL1, NEIL2
2537Long patch repair using Polβ, then Polδ for all glycosylases except NEIL1 and NEIL2
2637Long patch repair using Polβ, then Polε for all glycosylases except NEIL1 and NEIL2
2733Short patch repair using Polβ, PCNA, FEN1 cleavage, Polβ gap filling and final ligation with LIG1 for all glycosylases except NEIL1 and NEIL2
Clusters 21–27 from Table 7 are all responsible for different repair paths within the presented net. Table 8 presents a more detailed description of each path, providing information about e.g., glycosylases involved in damage detection or main polymerases involved in the repair process.
Table 8

Clusters and repair paths.

ClusterRepairGlycosylaseCommentPolymeraseOther proteins
21SP(1)(PNKP)OGG1, NEIL1, NEIL2NEIL1 or NEIL2 displaced by PNKP, OGG1 first displaced by NEIL1 then NEIL1 by PNKPPolβPNKP, LIG3, XRCC1
22SP(2) (APE)OGG1, NEIL3, NTH1glycosylase displacement after lyase activityPolβAPE, LIG3, XRCC1
23LP (Polβ)(PCNA, path 1)all except NEIL1 and NEIL2all glycosylases displaced by APEPolβAPE, PCNA, LIG1, FEN1
24SP(3) (APE)all except NEIL1 and NEIL2all glycosylases displaced by APEPolβAPE, LIG3, XRCC1
25LP (Polδ)all except NEIL1 and NEIL2all glycosylases displaced by APEPolβ, PolδAPE, PCNA, LIG1, FEN1
26LP (Polε)all except NEIL1 and NEIL2all glycosylases displaced by APEPolβ, PolεAPE, PCNA, LIG1, FEN1
27SP(4)(PCNA, path 2)all except NEIL1 and NEIL2all glycosylases displaced by APEPolβAPE, FEN1, LIG1
Table 9 contains the names of the most important transitions corresponding to the elementary processes within short and long patch repair pathways.
Table 9

Repair pathways construction.

Repair pathwaySimplified sequence of transitions (reactions) after detection and glycosylase removal
SP(1)(PNKP, cluster 21)NEIL1(2)_displacement, removal_3P_by_PNKP, disposal_of_3P, Polβ_recruitment_and_1nt_incorp_with_PNKP, Polβ_incorporates_1nt_PKNP_path, ligation_for_PNKP, dissociation_SP_complex_PKNP_path
SP(2)(APE, cluster 22)removal_3dRP_by_APE, Polβ_recruitment, Polβ_incorporates_1nt, ligase_recruitment, ligation_LIG3, dissociation_SP_complex
LP(1)(APE, PCNA path 1, cluster 23)cleavage_of_APs_by_APE, Polβ_recruitment, incorporation_1nt, PCNA_added_to_the_complex, DNA_synthesis, flap_excision, APE_removal, ligation_LIG1, dissociation
SP(3)(APE, cluster 24)cleavage_of_APs_by_APE, Polβ_recruitment, incorporation_1nt, Polβ_removes_5dRP, ligase_recruitment, ligation_LIG3, dissociation_SP_complex
LP(2)(Polδ, cluster 25)cleavage_of_APs_by_APE, Polβ_recruitment, incorporation_1nt, Polβ_displacement_by_ Polδ, DNA_synthesis_LP_Polδ, flap_cleavage_LP_Polδ, ligation_in_LP_Polδ, dissociation_LP_Polδ_complex
LP(3)(Polε, cluster 26)cleavage_of_APs_by_APE, Polβ_recruitment, incorporation_1nt, Polβ_displacement_Polε, DNA_synthesis_LP_Polε, flap_cleavage_LP_Polε, ligation_in_LP_Polε, dissociation_LP_ Polε_complex
SP(4)(FEN1, cluster 27)cleavage_of_APs_by_APE, Polβ_recruitment, incorporation_1nt, PCNA_added_to_the_complex, FEN1_dRP_cleavage, Polβ_gap_filling, ligation_LIG1, dissociation

Analysis of the robustness of the BER pathway via systematic in silico knockout experiment

The impact of each modeled repair synthesis sub-paths: three short-patch (<span class="Chemical">SP) ones (where DNA Polβ is always present) and three long-patch (<span class="Chemical">LP) repairs (with polymerases δ, ε or β) were analyzed via systematic in silico knockout experiment. The six repair synthesis sub-paths defined for this analysis are as follows: Path 1: <span class="Chemical">SP(1), refers to t-invariants cluster 21, DNA synthesis performed by Polβ, includes <span class="Gene">PNKP, Path 2: <span class="Chemical">SP(2), refers to t-invariants cluster 22, DNA synthesis performed by Polβ, Path 3: <span class="Chemical">LP(1), refers to t-invariants cluster 23, DNA synthesis performed by Polβ, Path 4: <span class="Chemical">SP(4), refers to t-invariants cluster 27, DNA synthesis performed by Polβ, includes <span class="Gene">FEN1, Path 5: <span class="Chemical">LP(2), refers to t-invariants cluster 25, DNA synthesis performed by Polδ, Path 6: <span class="Chemical">LP(3), refers to t-invariants cluster 26, DNA synthesis performed by Polε. Net simulations were used to perform a series of knockout experiments. Each simulation contained 20,000 steps and was repeated 20 times. In a single step, any enabled transition had a 50% chance for firing. Transition t169 (DNA_back_to_pool) was treated as a marker of successful DNA repair. Its chance of firing represents the chance of successful completion of the repair process. The results of the simulations for the non-disturbed model are summarized in Table 10. The chance of successful repair, represented by the firing of transition t169, equals on average 23.3%. The biggest proportion of successful repairs is processed via the <span class="Chemical">SP(2) pathway.
Table 10

Involvement of the synthesis sub-paths in the successful DNA repair in the non-disturbed model.

Repair synthesis sub-pathTransitionChancesfor firing
Path 1 (SP)t95: dissociation_SP_complex_PKNP_path6.8% (0.068)
Path 2 (SP)t46: dissociation_SP_complex12.9% (0.129)
Path 3 (SP)t107: APE_removal0.3% (0.003)
Path 4 (SP)t106: Polβ_gap_filling1.2% (0.012)
Path 5 (LP)t47: dissociation_LP_Polδ_complex1.2% (0.012)
Path 6 (LP)t48: dissociation_LP_ Polε_complex1.3% (0.013)

For each repair synthesis sub-path, the chance of firing of the transition directly preceding transition t169 is shown. Chances of firing were averaged from 20 simulations.

For each repair synthesis sub-path, the chance of firing of the transition directly preceding transition t169 is shown. Chances of firing were averaged from 20 simulations. The results of the knockout experiments are presented in Table 11. Transitions critical for each repair synthesis sub-paths have been identified, and their impact on the overall chance of successful repair was checked. The most prominent change in the repair success rate was observed when Path 2 was disabled. In this setup Paths 1 and 3–6 exhibit a slightly higher activity but they cannot fully recapitulate the repair. At the same time the lack of activity in Paths 3–6 did not result in an observable change in DNA repair efficiency. The high importance of Polβ in DNA repair is a well-known phenomenon—the lower repair rate is thus expected (see Discussion for more details). To the best of our knowledge, Polδ and Polε knockouts influence on BER efficiency and higher activity of other BER sub-pathways after Polβ knockout have not been reported yet.
Table 11

Results of the in silico knockout experiments.

Path disabledTransitions disabledOther paths frequency impactChance of successful repair
Path 1t90: removal_3P_by_PNKPPath 1: t95: disabledPath 2: t46: 0.130 (+0.001)Path 3: t107: 0.004 (+0.001)Path 4: t106: 0.013 (+0.001)Path 5: t47: 0.012Path 6: t46: 0.0130.169(- 0.064)
Path 2 (1 of 2 starting points)t62: Polβ_recruitmentPath 1: t95: 0.071 (+0.003)Path 2: t46: 0.080 (-0.049)Path 3: t107: 0.004 (+0.001)Path 4: t106: 0.013 (+0.001)Path 5: t47: 0.012Path 6: t46: 0.0130.190(- 0.043)
Path 2 (completely)t62: Polβ_recruitmentt96: Polβ_removes_5dRPPath 1: t95: 0.068Path 2: t46: disabledPath 3: t107: 0.008 (+0.005)Path 4: t106: 0.029 (+0.017)Path 5: t47: 0.028 (+0.016)Path 6: t46: 0.014 (+0.001)0.141(- 0.092)
Paths 3 and 4t102: PCNA_added_to_the_complexPath 1: t95: 0.068Path 2: t46: 0.134 (+0.005)Path 3: t107: disabledPath 4: t106: disabledPath 5: t47: 0.017 (+0.005)Path 6: t46: 0.015 (+0.002)0.233(no impact)
Path 3t103: DNA_synthesisPath 1: t95: 0.068Path 2: t46: 0.128 (-0.001)Path 3: t107: disabledPath 4: t106: 0.012Path 5: t47: 0.012Path 6: t46: 0.0130.233(no impact)
Path 4t100: FEN1_dRP_cleavagePath 1: t95: 0.068Path 2: t46: 0.131 (+0.002)Path 3: t107: 0.011 (+0.008)Path 4: t106: disabledPath 5: t47: 0.011 (-0.001)Path 6: t46: 0.012 (-0.001)0.232(no impact)
Path 5 and 6t60: Polβ_displacement_by_Polδt61: Polβ_displacement_by_PolεPath 1: t95: 0.070 (+0.002)Path 2: t46: 0.142 (+0.013)Path 3: t107: 0.006 (+0.003)Path 4: t106: 0.023 (+0.011)Path 5: t47: disabledPath 6: t46: disabled0.234(no impact)
Path 5t60: Polβ_displacement_by_PolδPath 1: t95: 0.068Path 2: t46: 0.132 (+0.003)Path 3: t107: 0.005 (+0.002)Path 4: t106: 0.018 (+0.006)Path 5: t47: disabledPath 6: t46: 0.015 (+0.002)0.234(no impact)
Path 6t61: Polβ_displacement_by_PolεPath 1: t95: 0.068Path 2: t46: 0.135 (+0.006)Path 3: t107: 0.004 (+0.001)Path 4: t106: 0.015 (+0.002)Path 5: t47: 0.015 (+0.003)Path 6: t46: disabled0.234(no impact)

Transitions found to be critical for each repair synthesis sub-path are listed together with their impact on other synthesis sub-paths. Chances of successful repair were averaged from 20 experiments. The change compared to non-disturbed model is also given in parenthesis (if the change has been at least 0.001 or more).

Transitions found to be critical for each repair synthesis sub-path are listed together with their impact on other synthesis sub-paths. Chances of successful repair were averaged from 20 experiments. The change compared to non-disturbed model is also given in parenthesis (if the change has been at least 0.001 or more). We have also investigated influence of a glycosylase knockout on the modeled behavior of BER, taking the TDG glycosylase as an example. We found that TDG knockout, simulated by inactivation of transition t_206, which introduces TDG into the system, leads to: i) higher activity of glycosylases involved in processing of the same DNA damage substrates as TDG (namely MBD4, OGG1, NEIL1, NEIL2 and SMUG), ii) slight decrease in the overall DNA repair efficiency, iii) accumulation of DNA damage from group 11, which are recognized only by TDG and iv) decreased activity of the SP-BER pathway.

Discussion

The BER process can be described as a unique DNA repair pathway, where very precise enzymes are used to recognize and remove specific DNA damage. Thus far eleven DNA glycosylases have been found in human cells (Table 1). All DNA glycosylases are divided into four structurally different superfamilies and share a similar ability to recognize the DNA damage. However, there are also several mismatches (e.g., T-C) or DNA damage types (e.g., 3-mA, 7-mG) which are removed only by one human enzyme (Table 2) [9, 10, 36, 65, 70]. Although BER has already been a subject of several computational modeling attempts, we provide the most comprehensive model, which includes the widest range of protein factors involved in the DNA repair process and the biggest collection of DNA damage types (Table 1). The models available to date concentrate mainly on the repair initiated by the OGG1 glycosylase [12-14], while our computational model of human BER includes 68 different DNA damage types found in living cells and identified and repaired by 11 human DNA glycosylases. We have also included into our Petri net model several mechanisms that have not been considered in the available models, e.g., displacement of OGG1 at the AP site by AP endonuclease or NEIL1, displacement of NTH1 by AP endonuclease, Polβ-mediated LP-BER and the mechanism of the repair initiated by NEIL2 and NEIL3. However, we have included in our model mainly the major DNA damage types. As a consequence, some of the products of minor G base oxidative damage (e.g. 5-carboxamido-5-formamido-2-iminohydantoin, 2Ih) which can be removed from DNA by hNEIL1-3 glycosylases [71] are not present in the model. We have also omitted several bulky DNA damage products (from the minor fraction) which are used mostly for in vitro studies of DNA glycosylases activities. BER can proceed via different sub-pathways, depending on the damage and the type of DNA glycosylase that detected the problem (Table 2). In general, the repair DNA synthesis can be performed by DNA Polβ (Polβ, used for both the so-called short-patch (SP) and long-patch (LP) base excision repairs). Two other major DNA polymerases δ and ε (Polδ and Polε) can repair DNA instead of Polβ, and they are always involved in the LP-BER pathway with PCNA involvement. Other possibilities for repairing DNA by the BER mechanism have also been included in our model. The analysis performed on the base of net t-invariants allowed to divide a structure of the net into meaningful biological units (MCT sets) and more importantly to group t-invariants which represent basic subprocesses of the model into clusters (Table 6). The clusters allow for separating different repairing processes, showing the glycosylases, polymerases and other compounds involved in the repair processes, as presented in Tables 7 and 8 and Figs 1–3. Using in silico simulations, we showed that most efficient and successful DNA repair of the studied DNA damage was reached via SP-BER (Table 10). This result was consistent with biological data showing that the patch size in the nucleosomal BER was rather shorter (1 nt) than longer (2–12 nt) [72]. Our in silico knockout experiments showed that Polβ, the so-called repair polymerase (Table 11, Path 2) is necessary for BER. Lack of DNA Polβ in our model resulted in stopping the cleaning of 5’dRP moiety and gap filling (DNA synthesis) which was simultaneously manifested as a block of whole SP-BER. In living cells, the perturbation in BER can lead to genomic instability. Our results could be supported by the observation that somatic mutations in Polβ gene have been detected in various types of cancers (NCI Genomic Data Commons (GDC)). However, it is still not clear whether and how Polβ mutations and overexpression can be linked to cancer onset and its progression. Recently, it has been shown that human R152C Polβ mutant was impaired in BER activity and efficiency. Moreover, the mutant cells have displayed a high frequency of chromatid breakages [73]. Mice carrying a targeted disruption of the Polβ gene has shown growth retardation and died of a respiratory failure immediately after the birth [74]. Also, homozygous Polβ R137Q knock-in mice embryos were typically small in size and had a high mortality rate (21%). In this mutant the BER efficiency was impaired, which subsequently ended in double-strand breaks (DSBs) and chromosomal aberrations [75]. Recently it has been shown that a mouse model with decreased expression of Polβ (with Y265C mutation) developed systemic lupus erythematosus (SLE) [76]. It is not clear which of the two activities of Polβ is the most important in SP-BER. Both DNA synthesis and 5′-dRP-lyase activity can be replaced by other DNA polymerases or FEN1, respectively. If the 5′-dRP moiety is reduced or oxidized, the 5′-dRP-lyase of Polβ cannot remove the modified sugar residue, and LP-BER is initiated. Next, a flap (2–10 nucleotide long) is subsequently removed by FEN1. FEN1 mutations are very rare, suggesting that FEN1 is important for normal DNA metabolism [77, 78]. Targeted deletion of the Fen1 gene in mice causes early embryonic lethality [79]. However, several somatic FEN1 mutations have been detected in human cancer (GDC) but the relationship between FEN1 deficiency and cancer susceptibility remains unclear. Recently, it was reported that L209P FEN1 mutation is associated with colorectal cancer. Human L209P FEN1 mutant was lacking the exo- and endonuclease activities but retaining DNA-binding affinity. This was a dominant-negative mutation and mutated protein impaired LP-BER in vitro and in vivo [80]. In contrast, our computational Petri net BER-knockout model with eliminated FEN1 activity did not show any significant change in BER efficiency (Table 11, Path 4). This inconsistency can be explained by the fact that FEN1 functions not only in LP-BER and it is mostly recognized as a central component of cellular DNA metabolism (e.g. processing of Okazaki fragment maturation intermediates, telomere maintenance and rescue of stalled replication fork) [77]. It is estimated that each day as many as 10,000 abasic sites are formed in one human cell [81]. Removing AP-sites from DNA is a daily task for the DNA repair/tolerance system. Apurinic/apyrimidinic endonuclease 1 is responsible for the AP sites processing which is necessary for further steps of DNA repair pathways [82, 83]. In our computational BER model, the elimination of AP-sites endonuclease 1 (APE1) resulted in downstream blocking of DNA repair, both, SP-BER and LP-BER. Lack of AP-sites repair can be an explanation of the embryonic lethality of Ape1 null mice [84, 85] and increasing tumor susceptibility in heterozygous mice [86]. APE1 mutations in humans (APE1 variants: L104R, E126D, and R237A, exhibiting approximately 40–60% reduction in specific incision activity) have been associated with amyotrophic lateral sclerosis (ALS) [87, 88]. Also, somatic APE1 mutations (APEX P112L, W188X, and R237C) were found in endometrial cancers [89]. Preparation of the 3′OH ends is a key step for repair DNA synthesis. In our model, we observed a reduction of a chance for successful repair when polynucleotide kinase/phosphatase (PNKP) was removed from the pathway (Table 11, Path 1). PNKP involvement in BER is mostly due to APE-independent base excision repair pathway in human cells after NEIL1 and NEIL2 action on oxidized base lesions [27, 28]. However, PNKP (two activities: DNA 5'-kinase and DNA 3′-phosphatase) is mostly recognized as an enzyme which generates 5′-phosphate/3′-hydroxyl DNA termini that are critical for ligation by the non-homologous end joining (NHEJ) DNA ligase LigIV during double-strand break repair (DSBR). Microcephaly with early-onset, intractable seizures and developmental delay (MCSZ) is a hereditary disease caused by mutations in PNKP [90, 91]. Eleven human DNA glycosylases were in silico deleted one-by-one in our computational BER model. None of them was critical for BER function because almost each DNA damage can be removed by at least two distinct glycosylases (Table 2). Apart from DNA damage removal, various BER glycosylases (SMUG, MBD4, TDG, UNG2, NEIL1-3) are involved in nucleotide replacement during the active DNA demethylation process [57]. However, unlike other DNA glycosylases, TDG is essential for embryonic development; mice die at day 11.5 [92]. The lethal phenotype is associated with epigenetic aberrations affecting the expression of developmental genes. Mouse embryonic fibroblasts, (MEFs) derived from Tdg null embryos showed impaired gene regulation due to imbalance histone modification and CpG methylation [93]. Other DNA glycosylases are not essential, mouse knockouts are viable, however, showing increase in mutation frequency or some immune dysfunction [36]. Human mutant DNA glycosylases have also been widely described and correlated with a predisposition to various diseases [70]. Also, an interplay between DNA repair of the oxidatively damaged base, 8-oxoG (8-oxo-7,8-dihydroguanine) and transcriptional activation has been documented for mammalian genes: removal of 8-oxoG from the coding strand by OGG1-mediated BER resulted in upregulated transcription [94]. Since oxidation is ongoing and transition of C to U occurs spontaneously or at specific times during differentiation and development, there is a strong suggestion that BER substrates might be epigenetic and modulate transcription factor binding [71, 95]. We believe that our model can be a valuable tool that can he<span class="Chemical">lp to predict the influence of changes in protein activities or levels on the repair process as well as some regulatory functions. It can be used to predict the sensitivity of the cell with inactivated repair proteins to different types of DNA damage and it can he<span class="Chemical">lp to identify the by-passing pathways that may lead to lack of pronounced phenotypes associated with mutations in some of the proteins. Two supplementary Tables A–B listing all net places and transitions. (PDF) Click here for additional data file.

Appendix with more detailed description of Petri net elements and theory.

(PDF) Click here for additional data file.

The model in SPPED format.

(SPPED) Click here for additional data file.

The model in SMBL format.

(XML) Click here for additional data file.

The model as Holmes project.

(PROJECT) Click here for additional data file.

The model as a figure in PDF format.

(EPS) Click here for additional data file.
  83 in total

Review 1.  3-methyladenine DNA glycosylases: structure, function, and biological importance.

Authors:  M D Wyatt; J M Allan; A Y Lau; T E Ellenberger; L D Samson
Journal:  Bioessays       Date:  1999-08       Impact factor: 4.345

2.  Functional characterization of Ape1 variants identified in the human population.

Authors:  M Z Hadi; M A Coleman; K Fidelis; H W Mohrenweiser; D M Wilson
Journal:  Nucleic Acids Res       Date:  2000-10-15       Impact factor: 16.971

3.  Determinants in nuclease specificity of Ape1 and Ape2, human homologues of Escherichia coli exonuclease III.

Authors:  Masood Z Hadi; Krzysztof Ginalski; Lam H Nguyen; David M Wilson
Journal:  J Mol Biol       Date:  2002-02-22       Impact factor: 5.469

4.  Mechanism of stimulation of the DNA glycosylase activity of hOGG1 by the major human AP endonuclease: bypass of the AP lyase activity step.

Authors:  A E Vidal; I D Hickson; S Boiteux; J P Radicella
Journal:  Nucleic Acids Res       Date:  2001-03-15       Impact factor: 16.971

Review 5.  Base excision repair of DNA in mammalian cells.

Authors:  H E Krokan; H Nilsen; F Skorpen; M Otterlei; G Slupphaug
Journal:  FEBS Lett       Date:  2000-06-30       Impact factor: 4.124

6.  Common polymorphisms and somatic mutations in human base excision repair genes in ovarian and endometrial cancers.

Authors:  M Pieretti; N H Khattar; S A Smith
Journal:  Mutat Res       Date:  2001-01       Impact factor: 2.433

7.  Excision of oxidatively damaged DNA bases by the human alpha-hOgg1 protein and the polymorphic alpha-hOgg1(Ser326Cys) protein which is frequently found in human populations.

Authors:  C Dherin; J P Radicella; M Dizdaroglu; S Boiteux
Journal:  Nucleic Acids Res       Date:  1999-10-15       Impact factor: 16.971

8.  Stimulation of human 8-oxoguanine-DNA glycosylase by AP-endonuclease: potential coordination of the initial steps in base excision repair.

Authors:  J W Hill; T K Hazra; T Izumi; S Mitra
Journal:  Nucleic Acids Res       Date:  2001-01-15       Impact factor: 16.971

9.  Neonatal lethality with abnormal neurogenesis in mice deficient in DNA polymerase beta.

Authors:  N Sugo; Y Aratani; Y Nagashima; Y Kubota; H Koyama
Journal:  EMBO J       Date:  2000-03-15       Impact factor: 11.598

10.  Heterozygosity for the mouse Apex gene results in phenotypes associated with oxidative stress.

Authors:  L B Meira; S Devaraj; G E Kisby; D K Burns; R L Daniel; R E Hammer; S Grundy; I Jialal; E C Friedberg
Journal:  Cancer Res       Date:  2001-07-15       Impact factor: 12.701

View more
  1 in total

Review 1.  Oxidative Damage in Sporadic Colorectal Cancer: Molecular Mapping of Base Excision Repair Glycosylases in Colorectal Cancer Patients.

Authors:  Pavel Vodicka; Marketa Urbanova; Pavol Makovicky; Kristyna Tomasova; Michal Kroupa; Rudolf Stetina; Alena Opattova; Klara Kostovcikova; Anna Siskova; Michaela Schneiderova; Veronika Vymetalkova; Ludmila Vodickova
Journal:  Int J Mol Sci       Date:  2020-04-02       Impact factor: 5.923

  1 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.