| Literature DB >> 31975976 |
Christoph Czepa1, Amirali Amiri1, Evangelos Ntentos1, Uwe Zdun1.
Abstract
Mature verification and monitoring approaches, such as complex event processing and model checking, can be applied for checking compliance specifications at design time and runtime. Little is known about the understandability of the different formal and technical languages associated with these approaches. This uncertainty regarding understandability might be a major obstacle for the broad practical adoption of those techniques. This article reports a controlled experiment with 215 participants on the understandability of modeling compliance specifications in representative modeling languages, namely linear temporal logic (LTL), the complex event processing-based event processing language (EPL) and property specification patterns (PSP). The formalizations in PSP were overall more correct. That is, the pattern-based approach provides a higher level of understandability than EPL and LTL. More advanced users, however, seemingly are able to cope equally well with PSP and EPL in modeling compliance specifications.Entities:
Keywords: Complex event processing; Controlled experiment; Event processing language; Linear temporal logic; Property specification patterns; Understandability
Year: 2019 PMID: 31975976 PMCID: PMC6944266 DOI: 10.1007/s10270-019-00721-4
Source DB: PubMed Journal: Softw Syst Model ISSN: 1619-1366 Impact factor: 1.910
Questions based upon the goal
| Identifier | Question |
|---|---|
| Q1 | How understandable are the tested approaches for participants at the bachelor level (attending the Software Engineering 2 Lab course)? |
| Q2 | Are there differences in understandability between the tested approaches for participants at the bachelor level (attending the Software Engineering 2 Lab course)? |
| Q3 | How understandable are the tested approaches for participants at the master level (attending the Advanced Software Engineering Lab course)? |
| Q4 | Are there differences in understandability between the tested approaches for participants at the master level (attending the Advanced Software Engineering Lab course)? |
| Q5 | How understandable are the tested approaches for participants with industrial working experience? |
| Q6 | Are there differences in understandability between the tested approaches for participants with industrial working experience? |
Informal meanings of LTL operators
| Text notation | Symbol notation | Meaning |
|---|---|---|
|
|
| |
|
|
| |
|
| - | |
|
| - | |
|
|
|
Semantics of EPL operators
| Operator name | Representation | Semantics |
|---|---|---|
| and |
| Logical conjunction that is matched once both |
| or |
| Logical disjunction that is matched once either |
| not |
not
| Logical negation that is matched if the expression |
| every |
every
| Not just observe the first occurrence of the expression |
| leads-to |
| The first |
| until |
| Matches the expression |
Intents of occurrence patterns
| Pattern name | Representation | Intent |
|---|---|---|
| Absence |
| To describe a portion of a system’s execution that is free of certain events or states |
| Universality |
| To describe a portion of a system’s execution which contains only states that have a desired property |
| Existence |
| To describe a portion of a system’s execution that contains an instance of certain events or states |
| Bounded existence |
| To describe a portion of a system’s execution that contains at most a specified number of instances of a designated state transition or event |
Intents of order patterns
| Pattern name | Representation | Intent |
|---|---|---|
| Precedence |
| To describe a relationship between a pair of events/states where the occurrence of the first is a necessary precondition for an occurrence of the second |
| Response |
| To describe a cause–effect relationship between a pair of events/states. An occurrence of the first, the cause, must be followed by an occurrence of the second, the effect |
| 2 Cause–1 Effect Precedence Chain | ( | To describe a relationship between an event/state sequence ( |
| 1 Cause–2 Effect Precedence Chain | To describe a relationship between an event/state | |
| 2 Stimulus–1 Response Chain | ( | To describe a relationship between a stimulus sequence ( |
| 1 Stimulus–2 Response Chain | To describe a relationship between a stimulus event |
Fig. 1Available scopes for property specification patterns (shaded areas indicate the extent over which the pattern must hold)
Meaning of scopes
| Scope name | Representation | Meaning |
|---|---|---|
| global | This scope is implicitly assumed when no other scope is defined | Defines that a pattern must hold during the entire execution of a system |
| before |
before
| |
| after |
after
| |
| between |
between
| |
| after-until |
after
|
Experimental tasks
| Task No. | Context/Source | Compliance specification in natural language | Available elements for modeling |
|---|---|---|---|
| 1 | Request for a loan (cf. Elgammal et al. [ | The branch office manager has to evaluate the loan risk before signing the contract officially. No one else is allowed to evaluate the loan risk and to sign the contract | Tasks |
| Evaluate loan risk | |||
| Officially sign contract | |||
| Roles | |||
| Branch office manager | |||
| 2 | Request for a loan (cf. Elgammal et al. [ | The checking of the customer bank privilege is followed by checking of the credit worthiness. Both activities must take place before determining the risk level of the loan application | Tasks |
| Check customer privilege | |||
| Check credit worthiness | |||
| Evaluate loan risk | |||
| 3 | Medical treatment and surgery of malignant gastric diseases (cf. Rovani et al. [ | The preoperative screening is performed before any surgical treatment in order to assess whether the patient’s conditions are good enough for the surgery to be performed and to estimate potential risks. As far as the surgical technique is concerned, the gastric resection for malignant diseases can be performed by using either a laparoscopic surgery or a traditional open approach, but not both. Furthermore, in both cases a nursing period is needed to monitor the patient after the operation | Tasks |
| Preoperative screening | |||
| Laparoscopic gastrectomy | |||
| Open gastrectomy | |||
| Nursing | |||
| 4 | Renovation work and lead-based paint (cf. United States Environmental Protection Agency [ | Once a lead contamination has been identified, a certified renovator must be present all time while any cleaning activity is performed until the end of the renovation work | Tasks |
| Renovation | |||
| Cleaning | |||
| Presence of certified renovator | |||
| Events | |||
| Lead contamination identified | |||
| 5 | Renovation work and lead-based paint (cf. United States Environmental Protection Agency [ | Contractors, property managers and others who perform renovations for compensation in residential houses, apartments and child-occupied facilities built before 1978 are required to distribute a lead pamphlet before starting renovation work | Tasks |
| Renovation | |||
| Distribute lead pamphlet | |||
| Classify building | |||
| Enter building date | |||
| Data | |||
| Year of construction | |||
| Type of building |
Fig. 2Sample task as presented to the participants
Fig. 3Individual grading procedure
Fig. 4Overall grading procedure
Summary of dropped participants
| Group | Course | Reason |
|---|---|---|
| PSP | SE2 | The participant gave up after the first task |
| PSP | SE2 | The participant did not apply PSP, but used a language/formalism that was not part of the study |
| LTL | SE2 | The participant was assigned to LTL, but gave answers in PSP |
| LTL | SE2 | The participant gave positive perceived difficulty and correctness ratings for unsolved tasks |
| PSP | ASE | The participant did not apply PSP, but wrote basic Boolean formulas |
| PSP | ASE | The participant came unprepared |
| PSP | ASE | The participant did not apply PSP, but drew UML activity diagrams |
| LTL | ASE | The participant gave up after the first task |
Number of observations, central tendency and dispersion of the dependent variables semantic/syntactic correctness and response time per group and course
| LTL | PSP | EPL | |
|---|---|---|---|
| Total number of observations | 51 | 49 | 49 |
| Number of considered observations | 49 | 47 | 49 |
|
| |||
| Syntactic correctness | |||
| Arithmetic mean (%) | 56.52 | 68.64 | 61.82 |
| Standard deviation (SD) (%) | 16.40 | 16.99 | 16.85 |
| Median (%) | 57.84 | 72.55 | 61.76 |
| Median absolute deviation (MAD) (%) | 19.19 | 13.37 | 18.61 |
| Minimum (%) | 9.02 | 24.51 | 21.18 |
| Maximum (%) | 96.27 | 98.82 | 89.22 |
| Skew |
|
|
|
| Kurtosis | 0.01 |
|
|
| Semantic correctness | |||
| Arithmetic mean (%) | 28.49 | 50.19 | 38.20 |
| Standard deviation (SD) (%) | 14.48 | 15.74 | 14.73 |
| Median (%) | 27.06 | 49.61 | 36.08 |
| Median absolute deviation (MAD) (%) | 13.66 | 15.12 | 13.66 |
| Minimum (%) | 2.75 | 18.04 | 10 |
| Maximum (%) | 68.43 | 80.59 | 72.55 |
| Skew | 0.75 |
| 0.27 |
| Kurtosis | 0.24 |
|
|
| Response time | |||
| Arithmetic mean (min) | 43.49 | 48.68 | 44.87 |
| Standard deviation (SD) (min) | 13.10 | 14.39 | 14.07 |
| Median (min) | 40.50 | 45.67 | 47.22 |
| Median absolute deviation (MAD) (min) | 11.98 | 17.49 | 13.66 |
| Minimum (min) | 15.07 | 27.00 | 14.58 |
| Maximum (min) | 75.40 | 79.93 | 75.00 |
| Skew | 0.33 | 0.38 | 0.14 |
| Kurtosis |
|
|
|
| Total number of observations | 22 | 20 | 24 |
| Number of considered observations | 21 | 17 | 24 |
|
| |||
| Syntactic correctness | |||
| Arithmetic mean (%) | 57.01 | 65.13 | 71.91 |
| Standard deviation (SD) (%) | 15.62 | 21.02 | 13.78 |
| Median (%) | 56.67 | 67.84 | 72.06 |
| Median absolute deviation (MAD) (%) | 18.90 | 26.74 | 10.47 |
| Minimum (%) | 29.61 | 21.76 | 31.76 |
| Maximum (%) | 81.96 | 89.41 | 94.71 |
| Skew |
|
|
|
| Kurtosis | 1.22 |
| 1.05 |
| Semantic correctness | |||
| Arithmetic mean (%) | 30.85 | 46.93 | 49.71 |
| Standard deviation (SD) (%) | 12.96 | 17.14 | 13.46 |
| Median (%) | 29.61 | 47.84 | 51.57 |
| Median absolute deviation (MAD) (%) | 14.54 | 19.19 | 12.06 |
| Minimum (%) | 12.75 | 17.65 | 19.41 |
| Maximum (%) | 63.14 | 75.69 | 76.86 |
| Skew | 0.6 | 0.06 |
|
| Kurtosis |
|
|
|
| Response time | |||
| Arithmetic mean (min) | 52.32 | 55.99 | 58.82 |
| Standard deviation (SD) (min) | 15.36 | 13.64 | 14.15 |
| Median (min) | 49.00 | 62.00 | 58.00 |
| Median absolute deviation (MAD) (min) | 16.88 | 11.64 | 15.64 |
| Minimum (min) | 28.00 | 29.50 | 37.17 |
| Maximum (min) | 84.00 | 73.08 | 81.78 |
| Skew | 0.42 |
| 0.15 |
| Kurtosis |
|
|
|
Number of observations, central tendency and dispersion of the dependent variables semantic/syntactic correctness and response time per group of participants with working experience year
| LTL | PSP | EPL | |
|---|---|---|---|
| Number of observations | 20 | 17 | 22 |
| Syntactic correctness | |||
| Arithmetic mean (%) | 58.65 | 66.79 | 66.01 |
| Standard deviation (SD) (%) | 14.68 | 17.76 | 14.82 |
| Median (%) | 58.82 | 67.84 | 70.20 |
| Median absolute deviation (MAD) (%) | 16.42 | 13.08 | 12.50 |
| Minimum (%) | 31.18 | 21.76 | 26.67 |
| Maximum (%) | 81.96 | 89.41 | 89.22 |
| Skew | |||
| Kurtosis | 0.24 | 0.32 | |
| Semantic correctness | |||
| Arithmetic mean (%) | 30.51 | 48.58 | 44.46 |
| Standard deviation (SD) (%) | 16.04 | 16.93 | 15.20 |
| Median (%) | 28.73 | 49.22 | 45.78 |
| Median absolute deviation (MAD) (%) | 16.86 | 20.93 | 18.46 |
| Mnimum (%) | 8.24 | 17.65 | 15.69 |
| Maximum (%) | 63.33 | 75.69 | 72.55 |
| Skew | 0.55 | 0.2 | |
| Kurtosis | |||
| Response time | |||
| Arithmetic mean (min) | 49.31 | 49.19 | 48.64 |
| Standard deviation (SD) (min) | 16.81 | 13.34 | 14.03 |
| Median (min) | 47.94 | 48.85 | 48.13 |
| Median absolute deviation (MAD) (min) | 15.80 | 20.36 | 15.52 |
| Minimum (min) | 15.07 | 29.50 | 24.07 |
| Maximum (min) | 84.00 | 66.00 | 76.08 |
| Skew | 0.29 | 0.21 | 0.22 |
| Kurtosis | |||
Fig. 5Participants’ perceived correctness
Fig. 6Participants’ perceived ease of application
Cliff’s d of syntactic/semantic correctness and response time in SE2, one-tailed with confidence intervals calculated for (cf. Cliff [14] and Rogmann [70]), adjusted p-values (cf. Benjamini and Hochberg [8]) [level of significance: * for , ** for , *** for ] and effect size magnitudes (cf. Kitchenham et al. [49])
| PSP/LTL | PSP/EPL | EPL/LTL | |
|---|---|---|---|
| Syntactic correctness | |||
| | 0.7059 | 0.6071 | 0.6028 |
| | 0.0038 | 0.0014 | 0.0046 |
| | 0.2904 | 0.3916 | 0.3926 |
| |
|
|
|
| | 0.1054 | 0.1166 | 0.1148 |
| |
|
|
|
| CI low |
|
|
|
| CI high |
|
|
|
| |
| 0.0339 | 0.0351 |
| FDR adjusted | 0.0004 | 0.0658 | 0.0658 |
| Level of significance | *** | – | – |
| Effect size magnitude | Medium | – | – |
| Semantic correctness | |||
| | 0.8448 | 0.7153 | 0.6913 |
| | 0.1356 | 0.0032 | 0.0058 |
| | 0.1535 | 0.2816 | 0.3029 |
| |
|
|
|
| | 0.0794 | 0.1057 | 0.1066 |
| |
|
|
|
| CI low |
|
|
|
| CI high |
|
|
|
| |
|
| 0.0002 |
| FDR adjusted |
| 0.0003 | 0.0008 |
| Level of significance | *** | *** | *** |
| Effect size magnitude | Large | Large | Medium |
| Response time | |||
| | 0.5928 | 0.5632 | 0.5298 |
| | 0.0017 | 0.0023 | 0.0029 |
| | 0.4055 | 0.4346 | 0.4673 |
| | 0.1873 |
|
|
| | 0.1153 | 0.119 | 0.1185 |
| | 1.625 |
|
|
| CI low |
|
|
|
| CI high | 0.0076 | 0.0702 | 0.1329 |
| | 0.0537 | 0.1413 | 0.2993 |
| FDR adjusted | 0.0895 | 0.1766 | 0.2993 |
| Level of significance | – | – | – |
| Effect size magnitude | – | – | – |
Cliff’s d of syntactic/semantic correctness and response time in ASE, one-tailed with confidence intervals calculated for (cf. Cliff [14] and Rogmann [70]), adjusted p-values (cf. Benjamini and Hochberg [8]) [level of significance: * for , ** for , *** for ] and effect size magnitudes (cf. Kitchenham et al. [49])
| PSP/LTL | PSP/EPL | EPL/LTL | |
|---|---|---|---|
| Syntactic correctness | |||
| | 0.6303 | 0.4069 | 0.7718 |
| | 0.0058 | 0 | 0.006 |
| | 0.3697 | 0.5931 | 0.2222 |
| |
| 0.1863 |
|
| | 0.1923 | 0.1991 | 0.1404 |
| |
| 0.9354 |
|
| CI low |
|
|
|
| CI high | 0.0748 | 0.4849 |
|
| | 0.092 | 0.1777 | 0.0002 |
| FDR adjusted | 0.1971 | 0.2961 | 0.0012 |
| Level of significance | – | – | ** |
| Effect size magnitude | – | – | Large |
| Semantic correctness | |||
| | 0.7815 | 0.4461 | 0.8373 |
| | 0 | 0.0025 | 0.002 |
| | 0.2185 | 0.5515 | 0.1607 |
| |
| 0.1054 |
|
| | 0.1517 | 0.1938 | 0.1229 |
| |
| 0.5438 |
|
| CI low |
|
|
|
| CI high |
| 0.4055 |
|
| | 0.0003 | 0.2948 |
|
| FDR adjusted | 0.0017 | 0.3641 |
|
| Level of significance | ** | – | *** |
| Effect size magnitude | Large | – | Large |
| Response time | |||
| | 0.5686 | 0.4755 | 0.6349 |
| | 0.0112 | 0 | 0.002 |
| | 0.4202 | 0.5245 | 0.3631 |
| |
| 0.049 |
|
| | 0.194 | 0.1894 | 0.1697 |
| |
| 0.2588 |
|
| CI low |
| 0.2595 |
|
| CI high | 0.1775 | 0.3485 | 0.0246 |
| | 0.2246 | 0.3985 | 0.0583 |
| FDR adjusted | 0.3062 | 0.4703 | 0.1507 |
| Level of significance | – | – | – |
| Effect size magnitude | – | – | – |
Cliff’s d of syntactic/semantic correctness and response time for participants with industry experience year, one-tailed with confidence intervals calculated for (cf. Cliff [14] and Rogmann [70]), adjusted p-values (cf. Benjamini and Hochberg [8]) [level of significance: * for , ** for , *** for ] and effect size magnitudes (cf. Kitchenham et al. [49])
| PSP/LTL | PSP/EPL | EPL/LTL | |
|---|---|---|---|
| Syntactic correctness | |||
| | 0.6471 | 0.5321 | 0.6636 |
| | 0.0029 | 0 | 0.0023 |
| | 0.35 | 0.4679 | 0.3341 |
| |
|
|
|
| | 0.1875 | 0.1934 | 0.1702 |
| | 1.5845 |
|
|
| CI low |
|
|
|
| CI high | 0.0345 | 0.2523 |
|
| | 0.061 | 0.371 | 0.0299 |
| FDR adjusted | 0.1526 | 0.4752 | 0.1043 |
| Level of significance | – | – | – |
| Effect size magnitude | – | – | – |
| Semantic correctness | |||
| | 0.7824 | 0.5802 | 0.7295 |
| | 0 | 0 | 0.0023 |
| | 0.2176 | 0.4198 | 0.2682 |
| |
|
|
|
| | 0.1501 | 0.1907 | 0.1582 |
| |
|
|
|
| CI low |
|
|
|
| CI high |
| 0.1613 |
|
| | 0.0003 | 0.2028 | 0.0029 |
| FDR adjusted | 0.0023 | 0.3803 | 0.0145 |
| Level of significance | ** | – | * |
| Effect size magnitude | Large | – | Large |
| Response time | |||
| | 0.5059 | 0.5134 | 0.4909 |
| | 0.0029 | 0 | 0.0045 |
| | 0.4912 | 0.4866 | 0.5045 |
| |
|
| 0.0136 |
| | 0.1986 | 0.191 | 0.1822 |
| |
|
| 0.0749 |
| CI low |
|
|
|
| CI high | 0.305 | 0.2825 | 0.3057 |
| | 0.4707 | 0.4447 | 0.4704 |
| FDR adjusted p | 0.4752 | 0.4752 | 0.4752 |
| Level of significance | – | – | – |
| Effect size magnitude | – | – | – |
Cliff’s d of perceived correctness and ease of application in SE2 and ASE, one-tailed with confidence intervals calculated for (cf. Cliff [14] and Rogmann [70]), adjusted p-values (cf. Benjamini and Hochberg [8]) [level of significance: * for , ** for , *** for ] and effect size magnitudes (cf. Kitchenham et al. [49])
| PSP/LTL | PSP/EPL | EPL/LTL | |
|---|---|---|---|
|
| |||
| Perceived correctness | |||
| | 0.4336 | 0.4087 | 0.392 |
| | 0.2485 | 0.2589 | 0.259 |
| | 0.3179 | 0.3324 | 0.349 |
| |
|
|
|
| | 0.05 | 0.0511 | 0.0502 |
| |
|
|
|
| CI low |
|
|
|
| CI high |
| 0.0081 | 0.0398 |
| | 0.0105 | 0.0679 | 0.196 |
| FDR adjusted | 0.0316 | 0.1019 | 0.2262 |
| Level of significance | * | – | – |
| Effect size magnitude | Medium | – | – |
| Perceived ease of application | |||
| | 0.4213 | 0.4005 | 0.3881 |
| | 0.2518 | 0.2569 | 0, 2631 |
| | 0.3269 | 0.3426 | 0.3488 |
| |
|
|
|
| | 0.0502 | 0.0513 | 0.0501 |
| |
|
|
|
| CI low |
|
|
|
| CI high |
| 0.0268 | 0.0432 |
| | 0.0302 | 0.1301 | 0.2162 |
| FDR adjusted | 0.0658 | 0.1766 | 0.2317 |
| Level of significance | – | – | – |
| Effect size magnitude | – | – | – |
|
| |||
| Perceived correctness | |||
| | 0.3675 | 0.4013 | 0.3095 |
| | 0.3039 | 0.2914 | 0.324 |
| | 0.3286 | 0.3074 | 0.3664 |
| |
|
| 0.0569 |
| | 0.0808 | 0.0778 | 0.0722 |
| |
|
| 0.7882 |
| CI low |
|
|
|
| CI high | 0.0942 | 0.0352 | 0.1745 |
| | 0.3155 | 0.1145 | 0.2157 |
| FDR adjusted | 0.3641 | 0.2147 | 0.3062 |
| Level of significance | – | – | – |
| Effect size magnitude | – | – | – |
| Perceived ease of application | |||
| | 0.4338 | 0.3752 | 0.4233 |
| | 0.2613 | 0.2616 | 0.2891 |
| | 0.3049 | 0.3632 | 0.2876 |
| |
|
|
|
| | 0.0827 | 0.0807 | 0.0725 |
| |
|
|
|
| CI low |
|
|
|
| CI high | 0.009 | 0.1205 |
|
| | 0.0603 | 0.4412 | 0.0314 |
| FDR adjusted | 0.1507 | 0.4412 | 0.1178 |
| Level of significance | – | – | – |
| Effect size magnitude | – | – | – |
Cliff’s d of perceived correctness and ease of application for participants with industry experience, one-tailed with confidence intervals calculated for (cf. Cliff [14] and Rogmann [70]), adjusted p-values (cf. Benjamini and Hochberg [8]) [Level of significance: * for , ** for , *** for ], and effect size magnitudes (cf. Kitchenham et al. [49])
| PSP/LTL | PSP/EPL | EPL/LTL | |
|---|---|---|---|
| Perceived correctness | |||
| | 0.3586 | 0.344 | 0.37 |
| | 0.2778 | 0.2872 | 0.2745 |
| | 0.3636 | 0.3689 | 0.3555 |
| | 0.0051 | 0.0249 |
|
| | 0.0813 | 0.0793 | 0.0768 |
| | 0.0622 | 0.3142 |
|
| CI low |
|
|
|
| CI high | 0.1383 | 0.1545 | 0.1116 |
| | 0.4752 | 0.3769 | 0.425 |
| FDR adjusted | 0.4752 | 0.4752 | 0.4752 |
| Level of significance | – | – | – |
| Effect size magnitude | – | – | – |
| Perceived ease of application | |||
| | 0.4078 | 0.3006 | 0.5014 |
| | 0.2734 | 0.2524 | 0.2555 |
| | 0.3188 | 0.447 | 0.2432 |
| |
| 0.1463 |
|
| | 0.0826 | 0.0802 | 0.0733 |
| |
| 1.8252 |
|
| CI low |
| 0.0124 |
|
| CI high | 0.048 | 0.2751 |
|
| | 0.1416 | 0.0348 | 0.0003 |
| FDR adjusted | 0.3033 | 0.1042 | 0.0023 |
| Level of significance | – | – | ** |
| Effect size magnitude | – | – | Medium |
GQM summary
| ID | Question | Summary of results |
|---|---|---|
| Q1 | How understandable are the tested approaches for participants at the bachelor level (attending the Software Engineering 2 Lab course)? |
|
| Q2 | Are there differences in understandability between the tested approaches for participants at the bachelor level (attending the Software Engineering 2 Lab course)? | There are significant differences between all tested approaches in terms of semantic correctness, and between PSP and LTL in terms of syntactic correctness |
| Q3 | How understandable are the tested approaches for participants at the master level (attending the Advanced Software Engineering Lab course)? |
|
| Q4 | Are there differences in understandability between the tested approaches for participants at the master level (attending the Advanced Software Engineering Lab course)? | There are significant differences in terms of semantic and syntactic correctness between EPL and LTL, and between PSP and LTL in terms of semantic correctness |
| Q5 | How understandable are the tested approaches for participants with industrial working experience? |
|
| Q6 | Are there differences in understandability between the tested approaches for participants with industrial working experience? | There are significant differences in terms of semantic correctness between PSP and LTL as well as between EPL and LTL |
Sample solution of Task 1
| Group | Sample solution |
|---|---|
| EPL | init ==> TS |
| not ‘Evaluate Loan Risk’.completed until ‘Officially Sign Contract’.started ==> PV | |
| ‘Evaluate Loan Risk’.completed ==> PS | |
| init ==> TS | |
| ‘Evaluate Loan Risk’.role != ‘Branch Office Manager’ ==> PV | |
| init ==> TS | |
| ‘Officially Sign Contract’.role != ‘Branch Office Manager’ ==> PV | |
| LTL | ! ‘Officially Sign Contract’.started W ‘Evaluate Loan Risk’.completed |
| G! (‘Officially Sign Contract’.role != ‘Branch Office Manager’) | |
| G! (‘Evaluate Loan Risk’.role != ‘Branch Office Manager’) | |
| PSP | ‘Evaluate Loan Risk’.completed precedes ‘Officially Sign Contract’.started |
| ‘Officially Sign Contract’.role != ‘Branch Office Manager’ never occurs | |
| ‘Evaluate Loan Risk’.role != ‘Branch Office Manager’ never occurs |
Sample solution of Task 2
| Group | Sample solution |
|---|---|
| EPL | init ==> TS |
| every(‘Check Customer Privilege’.completed -> ‘Check Credit Worthiness’.started) ==> TS | |
| every ‘Check Customer Privilege’.started ==> TV | |
| init ==> TS | |
| not ‘Check Customer Privilege’.completed until ‘Evaluate Loan Risk’.started ==> PV | |
| ‘Check Customer Privilege’.completed ==> PS | |
| init ==> TS | |
| not ‘Check Credit Worthiness’.completed until ‘Evaluate Loan Risk’.started ==> PV | |
| ‘Check Credit Worthiness’.completed ==> PS | |
| LTL | G(‘Check Customer Privilege’.completed -> F ‘Check Credit Worthiness’.started) |
| ! ‘Evaluate Loan Risk’.started W ‘Check Customer Privilege’.completed | |
| ! ‘Evaluate Loan Risk’.started W ‘Check Credit Worthiness’.completed | |
| PSP | ‘Check Customer Privilege’.completed leads-to ‘Check Credit Worthiness’.started |
| ‘Check Customer Privilege’.completed precedes ‘Evaluate Loan Risk’.started | |
| ‘Check Credit Worthiness’.completed precedes ‘Evaluate Loan Risk’.started |
Sample solution of Task 3
| Group | Sample solution |
|---|---|
| EPL | init ==> TS |
| not ‘Preoperative Screening’.completed until ‘Laparoscopic Gastrectomy’.started ==> PV | |
| ‘Preoperative Screening’.completed ==> PS | |
| init ==> TS | |
| not ‘Preoperative Screening’.completed until ‘Open Gastrectomy’.started ==> PV | |
| ‘Preoperative Screening’.completed ==> PS | |
| init ==> TS | |
| ‘Open Gastrectomy’.started leads-to ‘Laparoscopic Gastrectomy’.started ==> PV | |
| ‘Laparoscopic Gastrectomy’.started leads-to ‘Open Gastrectomy’.started ==> PV | |
| init ==> TS | |
| every(‘Laparoscopic Gastrectomy’.completed leads-to ‘Nursing’.started) ==> TS | |
| every ‘Laparoscopic Gastrectomy’.completed ==> TV | |
| init ==> TS | |
| every(‘Open Gastrectomy’.completed leads-to ‘Nursing’.started) ==> TS | |
| every ‘Open Gastrectomy’.completed ==> TV | |
| LTL | ! ‘Laparoscopic Gastrectomy’.started W ‘Preoperative Screening’.completed |
| ! ‘Open Gastrectomy’.started W ‘Preoperative Screening’.completed | |
| (F ‘Open Gastrectomy’.started -> G! ‘Laparoscopic Gastrectomy’.started) & (F ‘Laparoscopic Gastrectomy’.started -> G! ‘Open Gastrectomy’.started) | |
| G(‘Laparoscopic Gastrectomy’.completed -> F ‘Nursing’.started) | |
| G(‘Open Gastrectomy’.completed -> F ‘Nursing’.started) | |
| PSP | ‘Preoperative Screening’.completed precedes ‘Laparoscopic Gastrectomy’.started |
| ‘Preoperative Screening’.completed precedes ‘Open Gastrectomy’.started | |
| after ‘Open Gastrectomy’.started [ ‘Laparoscopic Gastrectomy’.started never occurs ] | |
| after ‘Laparoscopic Gastrectomy’.started [ ‘Open Gastrectomy’.started never occurs ] | |
| ‘Laparoscopic Gastrectomy’.completed leads-to ‘Nursing’.started | |
| ‘Open Gastrectomy’.completed leads-to ‘Nursing’.started |
Sample solution of Task 4
| Group | Sample solution |
|---|---|
| EPL | init ==> TS |
| every(‘Lead Contamination identified’ leads-to not ‘Renovation’.completed until [‘Cleaning’.running and not ‘Presence of Certified Renovator’.running]) ==> PV | |
| LTL | G(‘Lead Contamination identified’ & ! ‘Renovation’.completed -> (! (‘Cleaning’.running & ! ‘Presence of Certified Renovator’.running) W ‘Renovation’.completed)) |
| PSP | after ‘Lead Contamination identified’ until ‘Renovation’.completed [‘Cleaning’.running and not ‘Presence of Certified Renovator’.running never occurs] |
Sample solution of Task 5
| Group | Sample solution |
|---|---|
| EPL | init ==> TS |
| not b.finished until r.started ==> PV | |
| not r.started until b.finished ==> PS | |
| init ==> TS | |
| not d.finished until r.started ==> PV | |
| not r.started until d.finished ==> PS | |
| init ==> TS | |
|
not p.started until [y
| |
| p.started ==> PS | |
| LTL | !r.started W (b.finished & !r.started) |
| !r.started W (d.finished & !r.started) | |
|
!(y
| |
| PSP | before r.started [ b.finished occurs ] |
| before r.started [ d.finished occurs ] | |
|
p.started precedes (y
|
Shapiro–Wilk test of multivariate normality (* for , ** for , * for )
| Group | SE2 | ASE |
|---|---|---|
| LTL |
|
|
|
| ||
| PSP |
|
|
|
| ||
| EPL |
|
|
|
|
|
Shapiro–Wilk test of univariate normality (* for , ** for , * for )
| Group | Dependent variable | SE2 | ASE |
|---|---|---|---|
| LTL | Syntactic |
|
|
| Correctness |
|
| |
| Semantic |
|
| |
| Correctness |
|
| |
| Response time |
|
| |
|
|
| ||
| PSP | Syntactic |
|
|
| Correctness |
|
| |
| Semantic |
|
| |
| Correctness |
|
| |
| Response time |
|
| |
|
|
| ||
| EPL | Syntactic |
|
|
| Correctness |
|
| |
| Semantic |
|
| |
| Correctness |
|
| |
| Response time |
|
| |
|
|
|
Welch’s t test of syntactic/semantic correctness and response time in SE2, one-tailed with confidence intervals calculated for (cf. Welch [84]) and adjusted p-values (cf. Benjamini and Hochberg [8]) [Level of significance: * for , ** for , *** for ]
| PSP/LTL | PSP/EPL | EPL/LTL | |
|---|---|---|---|
| Syntactic correctness | |||
| | 3.5867 | 1.9529 | 1.5761 |
| | 94.691 | 91.994 | 94.863 |
| CI low | 0.0651 | 0.0102 |
|
| CI high | – | – | – |
| Mean | 0.6864 | 0.6864 | 0.6182 |
| Mean | 0.5652 | 0.6182 | 0.5652 |
| | 0.0003 | 0.0269 | 0.0592 |
| FDR adjusted | 0.0013 | 0.0647 | 0.1109 |
| Level of significance | ** | – | – |
| Semantic correctness | |||
| | 7.0831 | 3.8143 | 3.2849 |
| | 93.444 | 91.596 | 95.061 |
| CI low | 0.1661 | 0.0679 | 0.048 |
| CI high | – | – | – |
| Mean | 0.5019 | 0.5019 | 0.382 |
| Mean | 0.2849 | 0.382 | 0.285 |
| |
| 0.0001 | 0.0007 |
| FDR adjusted |
| 0.0009 | 0.0027 |
| Level of significance | *** | *** | ** |
| Response time | |||
| | 1.861 | 1.2971 | 0.5009 |
| | 93.123 | 91.955 | 93.774 |
| CI low | – | – | – |
| CI high | 9.8170 | 8.6859 | 5.9519 |
| | 48.6769 | 48.6769 | 44.869 |
| | 43.4902 | 44.869 | 43.4902 |
| | 0.9671 | 0.9011 | 0.6912 |
| FDR adjusted | 0.9671 | 0.9655 | 0.7975 |
| Level of significance | – | – | – |
Welch’s t test of syntactic/semantic correctness and response time in ASE, one-tailed with confidence intervals calculated for (cf. Welch [84]) and adjusted p-values (cf. Benjamini and Hochberg [8]) [Level of significance: * for , ** for , *** for ]
| PSP/LTL | PSP/EPL | EPL/LTL | |
|---|---|---|---|
| Syntactic correctness | |||
| | 1.3239 |
| 3.371 |
| | 28.887 | 25.573 | 40.268 |
| CI low |
|
| 0.0746 |
| CI high | – | – | – |
| | 0.6513 | 0.6513 | 0.7191 |
| | 0.5701 | 0.7191 | 0.5701 |
| | 0.098 | 0.8724 | 0.0008 |
| FDR adjusted | 0.2449 | 0.9254 | 0.0062 |
| Level of significance | – | – | ** |
| Semantic correctness | |||
| | 3.1981 |
| 4.7839 |
| | 29.231 | 29.156 | 42.581 |
| CI low | 0.0754 |
| 0.1223 |
| CI high | – | – | – |
| | 0.4693 | 0.4693 | 0.4971 |
| | 0.3085 | 0.4971 | 0.3085 |
| | 0.0017 | 0.7095 |
|
| FDR adjusted | 0.0083 | 0.8869 | 0.0002 |
| Level of significance | ** | – | *** |
| Response time | |||
| | 0.7786 |
| 1.4701 |
| | 35.654 | 35.389 | 41.049 |
| CI lowCI low | – | – | – |
| CI highCI high | 11.6186 | 4.5789 | 13.9503 |
| | 55.9853 | 55.9853 | 58.8236 |
| | 52.3191 | 58.8236 | 52.3191 |
| | 0.7793 | 0.2611 | 0.9254 |
| FDR adjusted | 0.8992 | 0.4352 | 0.9254 |
| Level of significance | – | – | – |