Literature DB >> 27468391

A validation study regarding a generative approach in choosing appropriate colors for impaired users.

Luigi Troiano1, Cosimo Birtolo2, Roberto Armenise2.   

Abstract

In many circumstances, concepts, ideas and emotions are mainly conveyed by colors. Color vision disorders can heavily limit the user experience in accessing Information Society. Therefore, color vision impairments should be taken into account in order to make information and services accessible to a broader audience. The task is not easy for designers that generally are not affected by any color vision disorder. In any case, the design of accessible user interfaces should not lead to to boring color schemes. The selection of appealing and harmonic color combinations should be preserved. In past research we investigated a generative approach led by evolutionary computing in supporting interface designers to make colors accessible to impaired users. This approach has also been followed by other authors. The contribution of this paper is to provide an experimental validation to the claim that this approach is actually beneficial to designers and users.

Entities:  

Keywords:  Accessibility; Color perception; Color vision deficiency; Dichromacy; Interactive genetic algorithms; Search based software engineering; User interface design

Year:  2016        PMID: 27468391      PMCID: PMC4947084          DOI: 10.1186/s40064-016-2659-6

Source DB:  PubMed          Journal:  Springerplus        ISSN: 2193-1801


Background

Colors can be one of the main barriers to the Information Society for color vision impaired users.1 Colors convey concepts, ideas and emotions, according to cultural and individual traits. If there exist colors entailing joy and relax, others produce excitement or melancholy. Therefore, not perceiving the colors properly can invalidate the purpose of communication. In 2004, the UK Disability Rights Commission (DRC)2 reported their formal investigation regarding accessibility of websites in the UK (Disability Rights Commission 2004). Surprisingly, the 81 % of sample did not meet W3C basic guidelines for accessibility, although most of website designers and owners are aware of the importance of making accessible the Web. According to this study, misperceived colors are the second main barrier to impaired users. In order to break down this barrier, there is a shared understanding for the need of a set of the best practices and guidelines in designing user interfaces (e.g., see Wang et al. 2010; Nguyen et al. 2014). The W3C has been largely working on this topic and proposed a set of recommendations (known as Web Content Accessible Guidelines, WCAG) aimed at making the Web accessible to users with impairments, visual among them. WCAG 2.0 (WCAG 2008) has been published in December 2008. Guidelines state four principles that provide the foundations of Web accessibility:The WCAG defines a 3-level scale (Level A, Level AA and Level AAA) in assessing interface accessibility. WCAG underlines some important intents related to criteria above. The first (Level A) is that colors are an important asset in web pages, as chromatic differences may convey information and each color may have a meaning assigned to it. For instance, fields could be marked by red boxes when wrong input is given, or values in a table could be highlighted by green while others by red. The other two (Levels AA and AAA) take into account that hue and saturation generally do not affect legibility in able-bodied users (Knoblauch et al. 1991). Differently color deficiencies can affect how luminance contrast is perceived. Although ISO provides 3:1 as minimal ratio threshold for standard color vision, WCAG suggests to use respectively 4.5:1 and 7:1 for Level AA and Level AAA. Perceivable “Information and user interface components must be presentable to users in ways they can perceive.” Operable “User interface components and navigation must be operable.” Understandable “Information and the operation of user interface must be understandable.” Robust “Content must be robust enough that it can be interpreted reliably by a wide variety of user agents, including assistive technologies.” In accordance to WCAG and DRC recommendations, Stanca Act (approved by Parliament of Italy in 2004) also states that contrast between background and foreground is one of the key factor in making the Web accessible for all. The importance of taking into account color usability and accessibility guidelines has been reaffirmed by Webster (2014) in a recent article appeared in the ACM Interactions magazine. Accessible color schemes guarantee high luminance contrast between colors (e.g. foreground and background) not only when they are perceived by able-bodied users, but also by those with vision disorders. Accessibility apart, color selection still attains to artistic abilities and aesthetics. Therefore, preservation of original UI chromatic choices, some of them related to the meaning of colors, should be important as much as enabling UI to the broader audience made of users affected by color vision deficiency (CVD). However, attempting to fulfill both needs can often result into conflicts, solving them generally means to experiment several combinations, thus looking for an acceptable trade-off between different options. Searching for a satisfying color palette can be regarded as a problem of combinatorics. In order to test if a particular combination of colors is accessible, a useful technique, according to DRC suggestions, is to convert a web page to black and white and to verify if all information is properly conveyed. This method is unfeasible when exploring a large number of alternatives. A different way to test accessibility is to simulate how colors are perceived by users affected by some kind of color vision deficiency, and to check if a proper contrast ratio is reached according to WCAG definition. This approach makes possible to explore a wider number of possible alternatives by search algorithms. In particular genetic algorithms can offer a reliable means to search the color space, exploiting those solutions that exhibit a good fit to the set of requirements. In the past, we experimented the application of GA to support different tasks concerning the UI design (Armenise et al. 2010; Filipe and Cordeiro 2010; Birtolo et al. 2010; Troiano et al. 2009a, b; Russo et al. 2008; Troiano et al. 2008a, 2015). Among them we considered the problem of adapting color schemes to users affected by vision disorders by means of standard genetic algorithms (SGA) (Troiano et al. 2008b) and interactive genetic algorithms (IGA) (Birtolo et al. 2009) (more details are contained in Troiano and Birtolo 2014). In this paper we will finally attempt to answer to the question if there is evidence that UI design can benefit of support provided by interactive and non-interactive genetic algorithms. As case studies we will focus on Protanopia (incidence 1.3 % males, 0.02 % females) and Deuteranopia (incidence 1.2 % males, 0.01 % females) due to their severity, but the approach being presented can be easily extended to other disorders. The study is made of two experiments. The first is aimed at experimenting how effectively GA can support website and user interface designers in choosing a combination of colors (i.e.,color palettes) that moving from an initial choice is able to evolve towards alternatives that, if the one side are accessible to broader users, on the other side they still preserve the designer’s preferences, which can be made explicit or implicit. The second experiment is aimed at testing if a solution generated by this kind of support is competitive to human-generated color schemes in terms of usability for CVD users. The remainder of this paper is structured as follows: Section 2 overviews how color vision can be modeled for those readers that are not familiar with, and outlines related works and contributions to the problem of adapting colors to CVD users as search; Section 3 provides experimental settings and outcomes; Section 4 discusses conclusions and possible future directions; “Appendix” offers an overview of the field.

Color vision, deficiencies and adaptation

Human vision is generally trichromatic. Colors are perceived by three classes of cells, called cones, able to absorb photons by different sensitivity with respect to the light wavelength. When the peak sensitivity lies in the long-wavelength of the visible spectrum (560–580 nm) we have cones of type L, in the middle-wavelength (530–540 nm) we have M cones, and in the short-wavelength (420–440 nm) there are S cones. The standard spectral sensitivity is reported in Fig. 1, where cone sensitivity is plotted at different wavelengths according to Stockman and Sharpe (2000).3
Fig. 1

Spectral sensitivity curves of L, M and S cones

Spectral sensitivity curves of L, M and S cones Dysfunction of cones (e.g., resulting in spectral sensitivity peak shift or shape alteration) can heavily affect the way color are perceived. Color vision deficiency has place when cone cells exhibit a partial or complete loss of function. The main types of CVD are known as anomalous trichromatism, dichromatism and monochromatism. Anomalous trichromatism is due to a sensitivity peak shift of one of the fundamental cones. This deficiency is further classified in protanomaly, when malfunction is given by L cones, and deuteranomaly, when the disorder is associated to M cones. Depending on the extent the peak is shifted, the perception of anomalous trichromats can range from almost normal to dichromatic vision. Indeed, it has been estimated (DeMarco et al. 1992) that on average the peaks are respectively at 440, 543 and 566 nm for standard average vision, at 440, 543 and 553 for protanomalous trichromats, while deuteranomalous trichromats show peaks at 440, 560 and 566 nm. Therefore, vision deficiency is due to the lower distance between L and M sensitivity peaks (i.e.,respectively 10 and 6 nm). When distance is reset to zero, one of the fundamental cones is missing, resulting into a severe form of CVD known as dichromatism. Therfore, dichromats confuse colors only in the spectrum range of the missing photopigments, as they lack one class of cones. In particular, dichromatism is divided in as protanopia, deuteranopia and tritanopia, depending on which cones among L, M or S are missing. The most common forms of CVD are listed in Table 1, as reported by the MPEG-21 standard specification (Yang et al. 2004).
Table 1

Medical term and CVD descriptions in the MPEG-21 (Yang et al. 2004)

Medical termDeficiency typeDeficiency degree
ProtanomalySome reduction in discrimination of the reddish and greenish contents of colors, with reddish color appearing dimmer than normalMild
ProtanopiaSeverely reduced discrimination of the reddish and greenish contents of colors, with reddish color appearing dimmer than normalSevere
DeuteranomalySome reduction in discrimination of the reddish and greenish contents of colorsMild
DeuteranopiaSeverely reduced discrimination of the reddish and greenish contents of colorsSevere
TritanomalySome reduction in discrimination of the bluish and yellowish contents of colorsMild
TritanopySeverely reduced discrimination of the bluish and yellowish contents of colorsSevere
Incomplete achromatopsiaDeficiency in both L cone sensitivity and M cone sensitivity. No color discrimination, and there is approximatively normal brightness of colorsMild
Complete achromatopsiaDeficiency L cone sensitivity, M cone sensitivity and S cone sensitivity. No color discrimination, and brightness is typical of scotopic visionSevere
Medical term and CVD descriptions in the MPEG-21 (Yang et al. 2004) This leads CVD users to a partial and biased perception of colors, as depicted in Fig. 2, where different types of deficiencies are simulated.
Fig. 2

Color wheel and its simulation in 4 different profiles: non-disabled vision (top-left), deuteranopia (top-right), protanopia (bottom-left), and tritanopia (bottom-right)

Color wheel and its simulation in 4 different profiles: non-disabled vision (top-left), deuteranopia (top-right), protanopia (bottom-left), and tritanopia (bottom-right) Access to information and services can be impaired by CVD. The main problems arise in reading textual information when the user interface is developed without taking color accessibility into account (Gradisar et al. 2007). Indeed, legibility is strongly related to the capability of discerning foreground and background colors, and there is general agreement in recognizing that higher luminance contrast improves legibility (Knoblauch et al. 1991). Recent studies (Zuffi et al. 2007; Gradisar et al. 2007) have confirmed that legibility is significantly affected by how colors are combined together, as difference between colors becomes more important when luminance contrast is lower. However, the effect of chromatic contrast is still unclear: high chromatic contrast between colors having similar luminance can still make the text legible, but no advantage has been found for low-vision reading (Legge et al. 1990). Any attempt to address this issue should take into account that UI is generally designed by non-impaired users with few or no concern of CVD limitations. Therefore several methods have been developed in order to (1) support designers in the task of choosing colors or (2) to compensate deficiency by adapting the UI to CVD user needs. An overview of these methods is presented in “Appendix”. Among the several proposals, a promising approach is based on looking at adaptation in terms of search, so that a color palette is optimized with respect to CVD user needs, but preserving the chromatic choices operated initially by the UI designer. For instance Ichikawa et al. (2003) make use of a genetic algorithm in order to optimize the contrast as perceived by impaired users, and preserving the image chromatism when a Web page is rendered. Similarly, in previous work (Troiano et al. 2008b; Birtolo et al. 2009; Troiano and Birtolo 2014) we investigated and compared genetic algorithms in assisting the design of color accessible interfaces. UI designers choices and preferences (e.g., preserve the meaning of colors) can often conflict with the need of assuring high luminance contrast between colors. An algorithm can be employed to test different combinations of colors in attempt to best solve those conflicts. Genetic algorithms are able to face effectively this problem, as they are able to explore a large combinatorial search space, exploiting those solutions that reveal a good fitness to a set of (even conflicting) requirements. Indeed, using a genetic algorithm provides the following advantages:In particular, we prototyped two solutions based on (i) standard genetic algorithm (SGA) (Troiano et al. 2008b) and on interactive genetic algorithm (IGA) (Birtolo et al. 2009). More details can be found in “Appendix”. By analyzing both algorithms from a computational point of view (Troiano and Birtolo 2014), SGA and IGA showed the capability of converging towards highly fitted solutions. IGA proved to be feasible and advantageous when compared to non-interactive genetic algorithms due to its ability of capturing fitness attributes related to human perception, that cannot be caught by a mathematical model. However, (if possible) quantification of preferences still makes possible to reduce the subjectivity associated to them. In the attempt of performing an analysis that is more robust and independent from the human perception, we simulated the expected behavior by software. However, this was an experimental setting whose results attain to simulation, rather than real operating conditions. So we moved to an experimental setting aimed at testing solutions with real users, looking for a validation of expected benefits. A large number of alternatives can be explored and used to support human creativity and decision-making A trade-off between conflicting criteria can be obtained by considering different quality attributes and design guidelines at the same time Designers can focus on more value-adding tasks, letting the algorithms to fine-tune their choices Interfaces can automatically accommodate impaired user needs

Experimentation

We designed the experiments having in mind the question if it is possible to provide solutions comparable or better than those made by humans, in a shorter time. Although simple, such a question is not trivial due to combinatorial complexity of search and quality of solutions attaining human aesthetics and creativity. Experimentation was aimed at assessing the quality of adapted solutions from both a qualitative and an usability point of view, paying particular attention to color vision disorders, Deuteranopia in particular. Experiments were led at Research and Development Centre of Poste Italiane, Naples. They were organized in two stages: Experiment 1 and Experiment 2. Experiment 1 was aimed at verifying if the automatic tools based on SGA and IGA were effective in supporting the UI design and if it was possible to discern solutions produced by tools from those produced manually. For this purpose we engaged 6 participants in performing a color adaptation with each of the 3 methods (manual, IGA and SGA) and we recorded the time to complete the task. After, for each method, we selected solutions which scored best for each method and we proposed them to a panel of other 14 participants who were asked to respond to a questionnaire aimed at assessing the solutions under different qualitative criteria. Among the questions also the request of scoring each solution and selecting which of them was made manually in their opinion. The participants were not chosen with any specific color vision disorder. Experiment 2 was performed in order to verify if solutions produced along the previous experiment were actually improving accessibility and usability for CVD users. For this purpose, we considered the best automated solution selected by participants to Experiment 1 and compared to the manual solution in solving a memory and selection task by a panel of 20 CVD participants. Both experiments are within-subjects. Below we provide experimentation the details regarding participants, equipment materials, experiment procedures and results.

Participants

For the experimentation we enrolled 40 participants within employees of Poste Italiane. We enrolled the participants in two different phases. During the first phase (Experiment 1) we selected 20 participants. The average age of the participants of this first group was 37.7, ranging from 28 to 55 years old. Ishihara test was adopted for all participants in order to verify possible color vision deficiencies. Two participants confirmed to be color blind users.4 In particular both participants were affected by deuteranopia. The group was involved in assessing the quality of solutions provided algorithmically when compared to those produced manually. For the second phase (Experiment2), we selected a group of 20 participants, whose average age was 33.4 years old, ranging from 26 to 61 years old. This group was composed by deuteranopes. The CVD was confirmed by a preliminary Ishihara test. Due to difficulties to select and involve an appropriate group of users, this required time to complete the experimentation. The group resulted being geographically and socially more heterogeneous, but all the participants were habit to use computers, during both working and free time. This group was involved in usability tests in order to assess real benefits obtained by adapted color palettes.

Equipment and materials

We arranged a single-pc room in our Laboratory in conformance to the ISO 9241 standard. The ambient was neutral and light was sourced in the room by shielded lamps on the ceiling. Other light sources, such as those coming from windows, were curtained. The room illumination was below 300 lux. Attention was paid in order to avoid any glare or reflection on the monitor screen. The experimental observations were carried out on Intel Pentium IV machine with 2 GB of RAM running Windows XP Professional Edition SP2 equipped with BenQ T720 LCD monitor, standard keyboard and optical mouse. The display inclination was , with screen size of 17-inch, resolution of 1280*1024 pixels without interpolation and refresh rate of 75Hz. The monitor white point was set to D65 and the maximum luminance was 160 . The CIE XYZ of the LCD white point was (95.047, 100.000, 108.883). For experimentation, we implemented a tool by which the initial palette is specified (Fig. 3b) with relations between colors and the desired contrast ratio (Fig. 3a). The output is an optimized palette which preserves chromatic choices, but guarantees accessability at the same time (Fig. 3c).
Fig. 3

Tool used for experimentation. a Initialization: choosing contrast ratio required and initial colors. b Choosing foregrounds and backgrounds of the interface. c Results: Optimized colors

Tool used for experimentation. a Initialization: choosing contrast ratio required and initial colors. b Choosing foregrounds and backgrounds of the interface. c Results: Optimized colors The tools supports both IGA and SGA modes. In details, IGA mode entails an interactive selection of preferred solution so that the color preferences are expressed implicitly by awarding the best attractive color combinations, while SGA tool entails a totally automatic procedure where the preferences are explicitly defined and coded before the algorithm starts. Evaluation of results was performed by the following free software: (1) Colour Contrast Analyser (version 2.0, WAT-C 2007), a tool for checking if fore-background color combinations provide good color visibility; (2) ColorSelector (version 5.1, Fujitsu Lmt. 2008), a Java Application aimed at evaluating whether fore-background combinations make the text readable to color blind users; (3) ColorDoctor (version 2.1, Fujitsu Lmt.) and Color Tester (Idea Futura), simulators able to check color accessibility according to W3C Recommendations.

Preliminaries

All participants were briefly introduced to the aim of this study, they received an overview of the usability test procedure, and they gained access to equipment and software. The two groups received clear instructions about the task and how to use the available materials. In particular, participants to the first group involved in producing color schemes were trained on the design process in order to modify colors in a target web page and received an overview of the equipment and software, while the second group was trained on the usability test procedure. Both groups received a brief introduction to W3C color recommendations. Users were asked to assume a comfortable position such to guarantee a distance of about 50 cm from the display, in order to subtend a visual angle size of about , less than so the CIE1931 Standard Colorimetric Observer was used in calculations. The test stimuli pattern consisted of a full-screen desktop application. During the execution of both tasks the trainer assisted the participants and provided some technical help when needed.

Experiment 1

Procedure

Participants were divided in two focus groups: 6 participants, with an appropriate skill in designing web pages and without any kind of CVD, were assigned to the first focus group. The remaining 14 participants were enrolled in the second group. The average age of the two groups was respectively 32.7 and 40.3 years old. The task of the first group was to upgrade a target page towards an accessible web page regarding the use of colors. As target we chose the page available at url http://www.poste.it/privati and depicted in Fig. 4. In Fig. 5, we highlights 6 color combinations that convey information, thus they should be made accessible to CVD users.
Fig. 4

Initial web page. a Original web page. b Simulation of protanope vision. c Simulation of deuteranope vision

Fig. 5

An example of accessibility issues expressed as pairs of foreground/background colors

Initial web page. a Original web page. b Simulation of protanope vision. c Simulation of deuteranope vision An example of accessibility issues expressed as pairs of foreground/background colors The initial page presented some accessibility issues, as summarized in Fig. 5. In particular some color combinations were very critical due their low contrast ratio. The task assigned to the designers was to modify the color in order to achieve an improved version of the web page. This task was performed by 3 methods:Time-on-task records are reported in Table 2. Once the first group completed the task, we collected 6 solutions for each method, totally 18. In order to reduce the number of solutions proposed to the second focus group, for each method we chose the solution with the highest score (i.e., fitness value in the lexicon of evolutionary computing). Solutions provided are presented in Fig. 6. Then, the second group evaluated the proposed solutions. The method used to generate the solution was kept hidden to the group. The group evaluated individually the three different solutions by means of a questionnaire made of the following questions:User group had to give a level of agreement to each question with a score ranging from 1 (Strongly disagree) to 5 (Strongly agree), except of the last two questions.
Table 2

Execution time expressed in seconds

Designer No.ManualIGA toolSGA tool
1825370210
2720305231
3580325164
4610390192
549023382
6535246110
Fig. 6

Color schemes after the first group of Experiment 1 completed the task. a Initial scheme. b Solution 1: Manual Procedure. c Solution 2: SGA Tool. d Solution 3: IGA Tool

the designers modified manually the colors, assisted by Colour Contrast Analyser and/or ColorSelector in measuring the contrast ratio; solutions were automatically produced by SGA tool; the designers were supported by IGA tool in selecting the combinations of colors that best fit his/her preferences and letting the algorithm to evolve solutions and guarantee accessibility. Question 1: The selected elements satisfy aesthetics requirement. (Strongly disagree 1.2.3.4.5 Strongly agree) Question 2: Web page has a colors that make it especially appealing. (Strongly disagree 1.2.3.4.5 Strongly agree) Question 3: Contrast ratio between foreground and background is suitable. (Strongly disagree 1.2.3.4.5 Strongly agree) Question 4: Information is easy to read. (Very difficult 1.2.3.4.5 Very easy) Question 5: My overall impression of colors is (Very Negative 1.2.3.4.5 Very positive) Question 6: I think that the adaptation of color for ensuring accessibility is developed by (Human, Computer, I don’t know) Question 7: Solution Ordering from the worst to the best (Solution 1, Solution 2, Solution 3, Original Solution) Color schemes after the first group of Experiment 1 completed the task. a Initial scheme. b Solution 1: Manual Procedure. c Solution 2: SGA Tool. d Solution 3: IGA Tool Execution time expressed in seconds

Results

The first hypothesis we tested is if solutions provided by Humans (S1), IGA (S2) and SGA (S3) are perceived as equivalent. A Kruskal–Wallis test on answers given by the 14 respondents provided an affirmative conclusion, as reported in Table 3 (tabled chi-squared 5.9915).
Table 3

Kruskal–Wallis statistics

Question No.p valuesKW chi-squaredMean Score
10.97180.05713.238
20.79940.44773.048
30.960.08153.857
40.37221.97684.286
50.96740.06633.333
Kruskal–Wallis statistics Obviously, the same conclusion can be reached by a pairwise (two-sided) Wilcoxon rank-sum test to the first five questions, whose result is reported in Table 4, even when no correction due to multiple comparisons is performed (e.g., Bonferroni, Holm, Hommel, Benjamini-Hochberg, etc.).
Table 4

Wilcoxon rank-sum test: p values

Question No.S1 versus S2S1 versus S3S2 versus S3
10.24020.4840.5716
20.281510.5182
30.91680.82411
40.09690.78970.1581
50.8870.56540.4537
Wilcoxon rank-sum test: p values In addition, answers given to the Question 6 entail the group was not able to recognize S2 as generated by machine (57 % said by human). In general only 14.3 % was able to correctly distinguish this solution from the others. In the focus group we extracted 2 different sub-groups: CVD users (2, affected by Deuteranopia) and Seniors (2, over 50 years old). As described before, each of them evaluated quality of solutions by answering the questionnaire. The question arisen here is to test if these users perceived the quality of the three solutions as different or not. Due to exiguity of the two groups we performed a comparison by performing a Wilcoxon rank-sum test paired on answers to questions in a row given by the two CVD users and the two Seniors for the different solutions, in order to check if answers were able to support a statistical difference facing S1 versus S2, S2 versus S3 and S1 versus S3.5 Looking at p values (Table 5, assuming p values <0.05 to reject test null hypothesis , while p values >0.50 to accept ), we can state that (1) according to CVD users the 3 solutions presented some qualitative differences, preferring S3 as confirmed by answers to Question 7 which put S3 at the first position, while (2) answers provided by elder users did not entail significant differences. We interviewed the CVD users in order to understand why they preferred S3 over the others. They affirmed that solution was more appealing than others and colors appeared in general more pleasant.
Table 5

Pairwise Wilcoxon test: p values

UserS1 versus S2S1 versus S3S2 versus S3
Deuteranopes0.00350.071860.0046
Seniors0.76560.58770.2330
Pairwise Wilcoxon test: p values Looking at answers to Question 7, 42.9 % selected S3 as the best one, while nobody indicated it as the worst one, whilst 71.4 % indicated the original color combination as the worst. Figure 7 reports the page modified according to scheme provided by S3 (IGA) and how it is perceived by CVD users.
Fig. 7

Resulting Web Page. a Non-CVD vision. b Simulation of protanope vision. c Simulation of deuteranope vision

Resulting Web Page. a Non-CVD vision. b Simulation of protanope vision. c Simulation of deuteranope vision Searching for a solution comparable to those made by humans could be of lesser interest if not obtained in lesser time. However, time-on-task reported in Table 2 outlines designers in the first group consistently experienced a shorter time in building a solution. Indeed, the manual method entails in all cases a higher execution time, while the automatic method provide a significant time saving. Higher time is justified by additional selection time with IGA and checking the contrast ratio in the manual approach, both not required by SGA.

Experiment 2

The aim of Experiment 2 was to test solutions from a usability point of view. Two solutions were taken into account: specifically, the solution provided manually and the solution generated by means of IGA (see Fig. 6). Solutions are submitted to the participants by means of (1) Selection Test Tool, and (2) Memory Test Tool. The task implemented by Selection Test Tool consisted in finding a word (on the left) among a set of 20 words, displayed on the right side using a color combination. In order to focus the test on the effect of colors in recognizing the right word, the selection had place among a set of words with a similar typing. Selection had place by clicking on the corresponding word. Time elapsed from showing and selecting the word was recorded. In case of wrong selection, the test goes ahead to the next word. So that, time for both correct and wrong selections was recorded. Test is iterated along 5 words, per user and per color solution. In order to reduce user fatigue a break of 5 minutes was given between the two solutions being tested. Instead, the task implemented by Memory Test Tool consisted in reading a sequence of 3 words, randomly chosen among a dictionary of 20 words, and asking the participant to memorize the sequence, then to answer if a word appeared or not in the sequence. The test performed three sequences of words per user and per solution. Again, in order to make experiment more focused on how color combinations can affect short-term memory, the set of words was made of similar typing. For both the applications, we adopt time-on-task metrics and correctness metrics. We aimed at investigating from usability point of view, the difference between solutions provided by humans (solution H) and solution adapted by genetic algorithms (solution G), when submitted to deuteranopes. During the memory test, we assumed that a respondent was able to remember words appearing in sequence, so to recognize if specific word was displayed or not. We defined correct memorization rate (CMR) as the fraction of correct answers over the total number of questions. CMR for human (H) and algorithmic (G) solution was 0.867 and 0.933 respectively as shown in Table 6. Analyzing in depth the results, we note that the number of respondents giving at least a wrong answer, was 7 in the case of H and 4 in the case of G. This leads to the conclusion that words submitted with the color scheme provided by IGA were easier to memorize.
Table 6

Memorization task and CMR for the two solutions after 60 trials (20 users)

SolutionsAnswer time (s)CMR
AverageSD
H3.0161.7980.867
G2.5352.1560.933
Memorization task and CMR for the two solutions after 60 trials (20 users) Moreover, we compared the response time by means of a Wilcoxon paired test. Looking at p value (0.02482), we can observe a statistical difference between solutions (in particular a shorter answer time in case of G) with a confidence . There is no significant difference, instead, if we consider the selection test. Similarly to CMR, we define correct selection rate (CSR) as the number of correct selections over the overall number of selections. CSR is comparable in the case of both solutions. It is 0.98 for H and 0.99 for G, as reported in Table 7. Investigating the response time by Wilcoxon paired test and looking at p value (0.896), we can state that there is no statistical difference in response time between H and G (strong accept at p value 0.5). Thus participants took a similar time in recognizing and selecting the proposed word.
Table 7

Selection task and CSR for the two solutions after 100 trials (20 users)

SolutionsSelection time (s)CSR
AverageSD
H3.8971.8480.98
G3.8931.9240.99
Selection task and CSR for the two solutions after 100 trials (20 users) The latter findings can be justified by the way a word is recognized within a set. Indeed, the task of picking a word in a set is mostly influenced by the recognition of specific patterns of letters within a word, more than the whole word itself. This was confirmed by respondents interviewed after performing the test.

Limitations

The lack of evidence in Experiment 1 in rejecting the hypothesis that a statistical difference stands out from answers to the questionnaire can only marginally support the assumption that no difference will be reproduced in any further experiment. This makes difficult to draw conclusions in general terms. In addition, the standard deviation in answers varies between 0.663 and 1.342. Therefore, improving the statistical power would require to involve a larger group of participants. Similar considerations can be made for Experiment 2 in testing the response time.

Conclusions

Color accessibility represents a relevant barrier for CVD users in gaining access to the Information Society. Since user interface is generally conceived by non-CVD designers for non-CVD users, there is a need to support the choice of alternative color scheme able to better address color vision disorders. So far, the effort has been focused on building tools able to measure the luminance contrast ratio and to simulate CVD vision, leaving the designer to choose among alternatives. However the large number of combinations can make such a task time demanding. The space of color palettes can be explored searching for a solution able to provide a positive trade-off between aesthetics and accessibility. In particular, genetic algorithms driven by a mix of color metrics and designer preferences can be employed for such a search. The study presented in this paper attempted to provide an answer to the question if solutions that proved to work algorithmically, are also effective when translated to practice. Results from experimentation have been proving that GA search based approach is feasible, both when designer preferences are explicit and implicit. In the first case search is driven only by color contrast ratios and distance from the original color pattern. In the second case, the algorithm iteratively combines metrics and human subjective feedback, leaving the designer free to move towards a different color scheme along the evolution process. In both cases, time-to-task proved to be largely smaller than the manual approach, providing solutions comparable and sometimes preferable to those obtained with no support in searching color alternatives. Experimental results from this study make possible to outline the following conclusions:Experimentation might present some weakness. Due to time constraints, the experiments were performed on single page, so that its layout might have influenced the measurement. However, since the study is comparative, this should not have affected the conclusions. The initial page has a color scheme where white is predominant over the other colors. This could make difficult to capture differences in solutions, thus affecting the answers collected by the questionnaire. However this is a common situation in most web pages. We attempted to limit this effect by providing summary pictures as those depicted in Fig. 6. Another point regards the composition of the second focus group in Experiment 1. Among them we had only two known CVD users. However the page was chosen according to the deficiency. A posterior interview confirmed that user preferences were driven by usability issues more than a personal sense of aesthetics. In addition, the number of participants was not large enough to study if there is a correlation between preferred solutions and the age of respondents, as we did not have a significant representation for each age class. Indeed, age can affect how colors are perceived although Ishihara test is passed. Time-on-task could be influenced by the usability of UI, so that experiment outcomes should be consider only for comparing the three different approaches. Finally, colors attains to cultural background, and the focus groups were made of the same ethnic group. This represents a limitation to universally extend the experimental results. An additional weakness related to Experiment 2 is related to confidence with words. Although, we chose common words within the Italian dictionary, we did not measure this aspect. Finally, we did not measure the severity of the disease as this attains to medical practice and not allowed by internal policies. Automatic support offered an effective support by allowing a reduced time-on-task with respect to a purely manual color adaptation (Experiment 1) Solutions proposed by genetic algorithms is not perceived of lower quality (Experiment 1) Adaptation provided by genetic algorithms is beneficial in order to make the UI accessible to CVD users (Experiment 2) Besides these aspects, conclusions reached by this study offer several points of confirmation (although preliminary) to the validity of pursuing CVD adaptation by means of tools able to free the UI designers from considering accessibility constraints in choosing a color scheme, letting the tools to reach an optimal trade-off. We hope this will foster further research in this direction.
  7 in total

1.  The spectral sensitivities of the middle- and long-wavelength-sensitive cones derived from measurements in observers of known genotype.

Authors:  A Stockman; L T Sharpe
Journal:  Vision Res       Date:  2000       Impact factor: 1.886

2.  Spectral sensitivity of the foveal cone photopigments between 400 and 500 nm.

Authors:  V C Smith; J Pokorny
Journal:  Vision Res       Date:  1975-02       Impact factor: 1.886

3.  Full-spectrum cone sensitivity functions for X-chromosome-linked anomalous trichromats.

Authors:  P DeMarco; J Pokorny; V C Smith
Journal:  J Opt Soc Am A       Date:  1992-09       Impact factor: 2.129

4.  Psychophysics of reading. XI. Comparing color contrast and luminance contrast.

Authors:  G E Legge; D H Parish; A Luebker; L H Wurm
Journal:  J Opt Soc Am A       Date:  1990-10       Impact factor: 2.129

5.  Effects of chromatic and luminance contrast on reading.

Authors:  K Knoblauch; A Arditi; J Szlyk
Journal:  J Opt Soc Am A       Date:  1991-02       Impact factor: 2.129

6.  Detail preserving reproduction of color images for monochromats and dichromats.

Authors:  Karl Rasche; Robert Geist; James Westall
Journal:  IEEE Comput Graph Appl       Date:  2005 May-Jun       Impact factor: 2.088

7.  Computerized simulation of color appearance for dichromats.

Authors:  H Brettel; F Viénot; J D Mollon
Journal:  J Opt Soc Am A Opt Image Sci Vis       Date:  1997-10       Impact factor: 2.129

  7 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.