| Literature DB >> 25100962 |
Anders Hånell1, Niklas Marklund2.
Abstract
A large variety of rodent behavioral tests are currently being used to evaluate traits such as sensory-motor function, social interactions, anxiety-like and depressive-like behavior, substance dependence and various forms of cognitive function. Most behavioral tests have an inherent complexity, and their use requires consideration of several aspects such as the source of motivation in the test, the interaction between experimenter and animal, sources of variability, the sensory modality required by the animal to solve the task as well as costs and required work effort. Of particular importance is a test's validity because of its influence on the chance of successful translation of preclinical results to clinical settings. High validity may, however, have to be balanced against practical constraints and there are no behavioral tests with optimal characteristics. The design and development of new behavioral tests is therefore an ongoing effort and there are now well over one hundred tests described in the contemporary literature. Some of them are well established following extensive use, while others are novel and still unproven. The task of choosing a behavioral test for a particular project may therefore be daunting and the aim of the present review is to provide a structured way to evaluate rodent behavioral tests aimed at drug discovery research.Entities:
Keywords: animal behavior; mice; phenotyping; rats; translational medicine
Year: 2014 PMID: 25100962 PMCID: PMC4106406 DOI: 10.3389/fnbeh.2014.00252
Source DB: PubMed Journal: Front Behav Neurosci ISSN: 1662-5153 Impact factor: 3.558
Figure 1The total use of mice and rats in behavioral testing. The number of publications were determined in PubMed using the search terms (mouse or mice) and (rat or rats) combined with (behavior or behaviour), where data prior to 1960 is excluded since the absence of abstracts in the older literature makes search results unreliable. A sharp rise in mouse behavioral testing can be seen in the last decade and is now slightly more prevalent than rat behavioral testing.
Points to consider when evaluating a behavioral test.
| Motivation | How are the animals motivated to perform the task? Is the level of motivation high and stable? Can the source of the motivation interfere with the disease model? |
| Animal-experimenter interaction | Are the experimenter and the animal in close contact during testing? Can handling be performed prior to testing to allow the animals to adjust to human contact? |
| Dynamic range | Can the test make accurately measure the ability of both naïve and severely impaired animals? Is there a risk for flooring or ceiling effects? Can the test difficulty be adjusted within or between trials? |
| Repeatability | Can the test be repeated to assess changes in ability over time? |
| Interaction with other tests | Is there a risk that the test experience can affect the behavior in other test? |
| Data collection | How are the results collected? Is there a risk for subjective effects in the evaluation? Can the results be collected automatically? |
| Result evaluation | Which statistical tests can be used to evaluate the results? How are the results presented? |
| Result interpretation | Can the results be attributed to a single domain or can, for example, changes in general activity level interfere with the measurements? |
| Automation | How much of the test procedure is automated? Does the test have the potential to be fully automated? |
| Variability | Can the results be related to baseline performance to mitigate the effects of variability? Can the estrous cycle of female animals be measured to restrict testing to a single day in the cycle? |
| Experimental design | Can the test be performed in a blinded fashion? Can the test be used for both mice and rats of different strains and stocks to obtain more robust results? |
| Sensory modality | Which sensory modality does the animal use to solve the task? Is it possible to confirm intact sensory functions? |
| Predictive validity | Do drugs approved for human use improve test results for rodents? |
| Construct validity | Does the test rely on brain structures used for this psychological construct in humans? |
| Ethological validity | Does the test resemble natural rodent behavior? |
| Face validity | Is it immediately apparent what the test is intended to measure? |
| Intrinsic validity | Does the test give the same result when experiments are repeated? |
| Extrinsic validity | Does the test give the same result when performed in, for example, different strains, age group or species? |
| Housing | Does the test cause restrictions on how the animals can be housed? Can the social status be assessed for group housed animals to allow the creation of balanced experimental groups? |
| Throughput | How long does it take to run the test? Do the animals have to be trained before performing the test? Can several animals be tested in parallel? Is the collection and evaluation of the results time consuming? |
| Costs | How much does the equipment cost? How much lab space has to be devoted to the test? How much staff time does the test require? |
| Practical considerations | Can the equipment be stowed away when not in use? Is extensive training of the experimenter required to carry out the test? Does the test have to be performed on several consecutive days which may overlap with weekends, holidays and vacations? Can the test be run by a substitute in case of sick leave? Can the test equipment be easily cleaned and disinfected? |
The points are ordered as they appear in the text and the importance of each point has to be assessed based on the goals of individual projects.
Figure 2Recently introduced behavioral tests. (A) Multivariate concentric square field: the animal is allowed to freely explore a complex arena made up of several subdomains with different characteristics. The dark corner room in the top right of the image is for example likely perceived as safe unlike the brightly lit bridge on the left side which is probably considered risky. The result is analyzed using multivariate statistics to give a behavioral profile of the animal rather than attempting to measure individual traits. (B) Cylinder test: when the animal is placed in the cylinder it will spontaneously rear and use its forepaws for support. Unilateral injury to CNS motor control areas typically induces asymmetric forelimb use in this task. (C) Three-chamber social approach: the sociability of the test mouse is measured by its tendency to spend time in an empty chamber or a chamber containing another mouse. (D) The Dig Task: based on olfactory cues, the animal identifies the correct cup and digs to obtain the reward. (E) OptoMotry: the unrestrained animal is placed on a small, elevated, platform surrounded by four monitors displaying a grating pattern. If the animal has sufficient visual acuity, the lateral flow of the grating pattern induces reflexive head movements which are automatically detected by an overhead camera. (F) Whisker nuisance task: the experimenters hand is seen in the lower left corner holding the small stick which is used to stimulate the whiskers. Traumatic brain injury, for instance, causes allodynia which can be detected in this test. All images except the cylinder test were kindly provided by other scholars, see Acknowledgments for details.
Figure 3The level of difficulty varies between rodent behavioral tests which makes them suitable for different purposes. A non-demanding test (solid line) is, for example, suitable for detection of treatment effects in models of severe central nervous system lesions. Non-demanding tests may, however, display ceiling effects, i.e., even impaired animals receives close to optimal test results. A highly demanding test (dotted line), on the other hand, cause the risk of flooring effects where all animals fails the task, leaving any improvement undetected. A demanding test is thus mainly suitable for detection of minor insults, such as side effects of treatments or discrete effects of genetic manipulations. Test with a continuous increase in difficulty or stimulus intensity (mixed line) are useful over a wide range of ability/trait levels, i.e., have a wide dynamic range. See Range of reliable measurements in the text for examples.