| Literature DB >> 35035345 |
Abstract
Is it really better to print everything, including software models, or is it better to view them on screen? With the ever increasing complexity of software systems, software modeling is integral to software development. Software models facilitate and automate many activities during development, such as code and test case generation. However, a core goal of software modeling is to communicate and collaborate. Software models are presented to team members on many mediums and two of the most common mediums are paper and computer screens. Reading from paper or screen is ostensibly considered to have the same effect on model comprehension. However, the literature on text reading has indicated that the reading experiences can be very different which in turn effects various metrics related to reader performance. This paper reports on an experiment that was conducted to investigate the effect of reading software models on paper in comparison with reading them on a computer screen with respect to cognitive effectiveness. Cognitive effectiveness here refers to the ease by which a model reader can read a model. The experiment used a total of 74 software engineering students as subjects. The experiment results provide strong evidence that displaying diagrams on a screen allows subjects to read them quicker. There is also evidence that indicates that on screen viewing induces fewer reading errors.Entities:
Keywords: Controlled experiment; Feature diagrams; Model comprehension; Model representation; Paper-based reading; Screen-based reading use case diagrams; Student-based experiments
Year: 2022 PMID: 35035345 PMCID: PMC8743026 DOI: 10.1007/s10270-021-00966-y
Source DB: PubMed Journal: Softw Syst Model ISSN: 1619-1366 Impact factor: 2.211
The dependent variables and their corresponding hypotheses for the two experiment parts
| Dependent variable | Null hypothesis (Ho) | Alternative hypothesis (Ha) |
|---|---|---|
| Response time | (Ho1): T (Paper) = T (Screen) | (Ha1): T (Paper) ≠ T (Screen) |
| Errors committed | (Ho2): E (Paper) = E (Screen) | (Ha2): E (Paper) ≠ E (Screen) |
| Response time | (Ho3): T (Paper) = T (Screen) | (Ha3): T (Paper) ≠ T (Screen) |
| Errors committed | (Ho4): E (Paper) = E (Screen) | (Ha4): E (Paper) ≠ E (Screen) |
The experimental design of the use cases experiment
| Session 1 | Group A | Movie theatre (paper) | Group B | Movie theatre (screen) |
| Session 2 | Group A | UTube (screen) | Group B | UTube (paper) |
| Session 1 | Group C | PDF (paper) | Group C | PDF (screen) |
| Session 2 | Group D | ERP (screen) | Group D | ERP (paper) |
Use case diagram sizes in terms of the number of graphical constructs
| Movie theatre | UTube | |
|---|---|---|
| Use cases | 11 | 11 |
| Regular | 8 | 8 |
| As a rectangle | 1 | 1 |
| As a classifier | 1 | 1 |
| Abstract with extension points | 1 | 1 |
| Actors | 3 | 3 |
| Total Nodes | 14 | 14 |
| Communication | 5 | 4 |
| Extend | 5 | 8 |
| Include | 4 | 2 |
| Total relationships | 14 | 14 |
| Total elements | 28 | 28 |
Feature diagram sizes in terms of the number of graphical constructs
| PDF diagram | ERP diagram | |
|---|---|---|
| Mandatory features | 40 | 33 |
| Optional features | 31 | 38 |
| Dead features | 2 | 2 |
| Attributes | 2 | 2 |
| Refer | 1 | 1 |
| Feature cardinality | 7 | 7 |
| Group cardinality | 4 | 4 |
| Total | 87 | 87 |
| Require | 4 | 4 |
| Exclude | 4 | 4 |
| Generalization | 4 | 4 |
| Implementation | 4 | 4 |
| Alternative | 14 | 13 |
| Or | 1 | 2 |
| And | 12 | 15 |
| Total relationships | 42 | 46 |
| Total elements | 130 | 133 |
| 45 | 49 | |
Normality test results
| Experiment part | Medium | Variable | Skewness | Kurtosis | Shapiro–Wilk | Normal? | ||
|---|---|---|---|---|---|---|---|---|
| Use case diagrams | Paper | 36 | Time | 0.006 | − 0.51 | − 1.03 | 0.91 | ✖ |
| Screen | 36 | 0.002 | 0.47 | − 1.28 | 0.89 | ✖ | ||
| Paper | 36 | Errors | 0.057 | 0.12 | − 1.10 | 0.94 | ✔ | |
| Screen | 36 | 0.0001 | 1.47 | 0.86 | 0.74 | ✖ | ||
| Feature diagram | Paper | 38 | Time | 0.0001 | 1.80 | 4.26 | 0.83 | ✖ |
| Screen | 38 | 0.0001 | 3.87 | 18.20 | 0.59 | ✖ | ||
| Paper | 38 | Errors | 0.055 | 0.46 | − 0.74 | 0.94 | ✔ | |
| Screen | 38 | 0.001 | 0.48 | 0.02 | 0.89 | ✖ |
Correlation results
| Experiment part | Medium | r statistic | Correlation? | ||
|---|---|---|---|---|---|
| Use case diagrams | Paper | 0.219 | 36 | 0.2001 | ✖ |
| Screen | 0.051 | 36 | 0.7675 | ✖ | |
| Feature diagrams | Paper | 0.149 | 38 | 0.3705 | ✖ |
| Screen | 0.077 | 38 | 0.6441 | ✖ | |
Mann–Whitney test results
| Experiment part | Variable | Difference between medians | 95% CI | Mann–Whitney U statistic | 2-tailed p | 1-tailed p |
|---|---|---|---|---|---|---|
| Use case diagrams | Time | 242 | 124– 403 | 291 | ||
| Errors | 1 | − 1 to 3 | 549.5 | 0.2651 | ||
| Feature diagrams | Time | 1638 | 1499–1820 | 24.5 | ||
| Errors | 5 | 4–7 | 115.5 |
Bold values indicate statistically significant result observed
Cliff’s delta calculations
| Part | Variable | Cliff’s delta | Variance | Statistical significance? | Favoring | Confidence interval around delta | |
|---|---|---|---|---|---|---|---|
| Maximum | Minimum | ||||||
| Use case diagrams | Time | 0.5509 | 0.0127 | ✔ | Screen | ||
| Errors | 0.1520 | 0.0204 | ✖ | 0.4103 | − 0.1288 | ||
| Feature diagrams | Time | 0.9661 | 0.0011 | ✔ | Screen | ||
| Errors | 0.8534 | 0.0051 | ✔ | Screen | |||
Bold values indicate statistically significant result observed
Visual comparison of the group performances using the two presentation mediums
Subject responses to Q1 and Q2 in the post-experiment questionnaire
| Question | Experiment part | On paper | On screen | No difference |
|---|---|---|---|---|
| Do you prefer reading diagrams on screen or on paper? | Use case diagrams | 5 | 9 | 22 |
| Feature diagrams | 5 | 13 | 20 | |
| Do you find it easier reading diagrams on paper or on screen? | Use case diagrams | 3 | 30 | 3 |
| Feature diagrams | 3 | 30 | 5 |
Qualitative results for the Enhances Reading category
| Enhances reading | |||||||
|---|---|---|---|---|---|---|---|
| Good codes | Bad codes | Neutral codes | |||||
| Paper | Screen | Paper | Screen | ||||
Code-2g Graphical constructs popping | 0 | 1 | Code-6b More appropriate for big diagrams | 1 | 0 | Code-1n Exam reading | 2 |
Code-6g More appropriate for big diagrams | 0 | 2 | |||||
Code-8g Quicker to locate graphical constructs | 1 | 1 | |||||
Code-9g Easier to locate graphical constructs | 0 | 5 | |||||
Code-10g Better focus | 0 | 1 | |||||
Code-12g Improved contrast display | 0 | 1 | |||||
Code-13g More appropriate to view on medium | 0 | 1 | |||||
| Total | 1 | 12 | Total | 1 | 0 | Total | 2 |
Qualitative results for the physical attributes and effects category
| Physical attributes and effects | |||||
|---|---|---|---|---|---|
| Good codes | Bad codes | ||||
| Paper | Screen | Paper | Screen | ||
| Code-1g brighter | 0 | 2 | Code-1b dim | 2 | 0 |
| Code-3g looking up | 0 | 2 | Code-2b looking down | 1 | 0 |
| Code-4g less stress on eyes | 1 | 0 | Code-3b glare is tiring | 0 | 1 |
| Code-7g comfort | 4 | 0 | Code-4b higher workload and tiredness | 0 | 2 |
| Code-5b accessibility issue of no printer available | 1 | 0 | |||
| Total | 5 | 4 | Total | 4 | 3 |
Qualitative results for the general preferences and norms category
| General preferences and norms | ||||
|---|---|---|---|---|
| Good codes | Neutral codes | |||
| Paper | Screen | |||
| Code-5g general preference | 1 | 2 | Code-2n no difference | 4 |
| Code-11g convenient | 0 | 1 | Code-3n normally prints everything on paper | 1 |
| Code-4n expected norm as software engineers | 1 | |||
| Code-5n generally used to medium | 1 | |||
| Total | 1 | 3 | Total | 7 |
Codebook
| Code-1g Brighter | |
| Code-2g Graphical constructs popping | |
| Code-3g Looking up | |
| Code-4g Less stress on eyes | |
| Code-5g General preference | |
| Code-6g More appropriate for big diagrams | |
| Code-7g Comfort | |
| Code-8g Quicker to locate graphical constructs | |
| Code-9g Easier to locate graphical symbols | |
| Code-10g Better focus | |
| Code-11g Convenient | |
| Code-12g Improved contrast display | |
| Code-13g More appropriate to view on medium | |
| Code-1b Dim | |
| Code-2b Looking down | |
| Code-3b Glare is tiring | |
| Code-4b Higher workload and tiredness | |
| Code-5b Accessibility issue of no printer available | |
| Code-6b Harder to navigate | |
| Code-1n Exam reading | |
| Code-2n No difference | |
| Code-3n Normally prints everything on paper | |
| Code-4n Expected norm as software engineers | |
| Code-5n Generally used to medium |
| Medium |
| Min | 1st quartile | Median | 95% CI | 3rd quartile | Max | IQR | |
|---|---|---|---|---|---|---|---|---|---|
| Use case diagrams Time | Paper | 36 | 416 | 592.9 | 807.5 | 652–869 | 915.4 | 988 | 322.5 |
| Screen | 36 | 144 | 335.3 | 419.0 | 348–684 | 826.8 | 1041 | 491.4 | |
| Use case diagrams errors | Paper | 36 | 0 | 2.0 | 6.0 | 3–7 | 8.6 | 12 | 6.6 |
| Screen | 36 | 0 | 2.0 | 3.0 | 2–4 | 5.0 | 19 | 3.0 | |
| Feature diagrams time | Paper | 36 | 1215 | 2000.6 | 2242.5 | 2089–2497 | 2731.8 | 4807 | 731.3 |
| Screen | 36 | 328 | 505.5 | 617.5 | 561–700 | 735.2 | 2400 | 229.7 | |
| Feature diagrams errors | Paper | 36 | 0 | 4.0 | 6.5 | 5–10 | 12.1 | 18 | 8.1 |
| Screen | 36 | 0 | 0.0 | 2.0 | 1–2 | 2.0 | 5 | 2.0 |