Nikolaos Kokras1,2, Dimitrios Baltas1, Foivos Theocharis1, Christina Dalla1. 1. Department of Pharmacology, Medical School, National and Kapodistrian University of Athens, Athens, Greece. 2. First Department of Psychiatry, Medical School, National and Kapodistrian University of Athens, Athens, Greece.
Abstract
Behavioral analysis in preclinical neuropsychopharmacology relies on the accurate measurement of animal behavior. Several excellent solutions for computer-assisted behavioral analysis are available for specialized behavioral laboratories wishing to invest significant resources. Herein, we present a straightforward, open-source software solution that aims at rapid and easy integration into an experimental workflow and at improving the training of staff members toward better and more reproducible manual scoring of behavioral experiments with the use of visual aids (maps). Currently, the program readily supports the Forced Swim Test, the Novel Object Recognition test, and the Elevated Plus Maze test, but with minor modifications it can be used for scoring virtually any behavioral test. Additional modules, with predefined templates and scoring parameters, are continuously added. Importantly, the prominent use of visual maps has been shown to improve, in a student-engaging manner, the training and auditing of scoring in behavioral rodent experiments.
Introduction
Behavioral analysis in preclinical neuropsychopharmacology relies on the accurate measurement of animal behavior (Kokras and Dalla, 2014; Kokras et al., 2015). Appropriate operating procedures and intensive experimenter training may influence or determine behavioral performance (Chesler et al., 2002; Sousa et al., 2006). Advances in computer science have allowed the development of elaborate software that records animal behavior, often with a high degree of automation, taking advantage of intelligent algorithms and image-tracking technologies (Noldus, 1991; Noldus et al., 2000, 2001; Zimmerman et al., 2009). However, those commercially available solutions have a high purchasing cost. In addition, automated algorithms may provide better scoring than humans in some cases (Desland et al., 2014), but may also provide less accurate and detailed analysis than humans in others, as in the forced swim test (distinguishing fine transitions between swimming, climbing, and immobility behaviors) and novel object recognition (distinguishing active interest toward the object from mere proximity of the animal's head). Several attempts have been made over the last 20 years to develop open-source or freely available computer programs for scoring animal behavior (Moraes and Ferrarezi, 1997; Ottoni, 2000; Taiwanica, 2000; Patel et al., 2006; Poirrier et al., 2006; Aguiar et al., 2007; Blumstein and Daniel, 2007; Otero et al., 2010; Crispim Junior et al., 2012; de Chaumont et al., 2012; Telonis and Margarity, 2015; Friard et al., 2016). Some of those attempts resulted in programs that are now outdated and probably no longer under active development, some were focused on specific models that could not easily be modified for other settings, and some resulted in elaborate solutions that required a significant investment in human resources to develop, adapt, and operate. Large-scale behavioral laboratories routinely invest in high-cost commercially available solutions and are also willing to invest human resources in developing esoteric in-house approaches. However, it is often the case that a research team simply needs a straightforward computer aid to perform a widely used behavioral test for a specific project. Additionally, in those cases where human scoring is required or desired, it is difficult to train students and staff members to score animal behavior accurately and reproducibly. This is of paramount importance, as inaccurate scoring by improperly trained personnel may contribute to non-reproducible results. Few computer programs prioritize an interface that facilitates the correct training of students and staff. In this context, we have developed a versatile and expandable software package with the aim of providing a ready and easy-to-use platform for behavioral analysis scoring, and a platform through which training in behavioral pharmacology scoring can be facilitated and controlled.
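To make the template idea concrete, the following minimal sketch shows how a key-driven scoring template could turn timestamped keypresses into behavior intervals. The key bindings, behavior names, and data layout are illustrative assumptions, not Kinoscope's actual configuration format or code.

    # Hypothetical key-driven scoring template; an illustrative sketch,
    # not Kinoscope's actual configuration format.
    FST_TEMPLATE = {
        "i": ("immobility", "blue"),
        "s": ("swimming", "red"),
        "c": ("climbing", "black"),
        "h": ("head shake", "green"),
    }

    def score_session(keypresses, duration_s=300):
        # keypresses: sorted list of (timestamp_s, key) tuples. Each keypress
        # starts a behavior lasting until the next keypress or session end.
        events = []
        bounded = keypresses + [(duration_s, None)]
        for (t, key), (t_next, _) in zip(keypresses, bounded[1:]):
            behavior, _color = FST_TEMPLATE[key]
            events.append((behavior, t, t_next))
        return events

    # Example: immobility 0-40 s, swimming 40-55 s, immobility 55-300 s.
    print(score_session([(0, "i"), (40, "s"), (55, "i")]))

Under this scheme, a whole session reduces to a list of (behavior, start, end) intervals, which is exactly the information a visual map needs to render.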
Figure 1
Representative visual maps produced after scoring a male (top) and a female (bottom) rat during the Forced Swim Test. The total length of the visual bar corresponds to the test duration (5 min), and each scored behavior is depicted with a designated color at its time of appearance and for its duration. Note that both animals have almost identical total durations of immobility, swimming, and climbing; however, the temporal organization of the observed behaviors differs markedly between the male and the female rat. Also note the slight differences between the experienced scorer and a trainee: the latter scores satisfactorily if only the total scores are examined, but still commits some errors that become apparent when the visual maps are inspected. By comparing the produced visual maps and discussing the animal's performance, training can be facilitated in an engaging way and reproducibility can be enhanced.
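For readers who want to reproduce this kind of visual map from their own interval data, a minimal matplotlib sketch follows; the interval values and color scheme are assumptions mirroring the captions here, not Kinoscope's internal rendering.

    # Minimal sketch of an FST-style visual map (one colored bar per session),
    # assuming scored intervals of (behavior, start_s, end_s).
    import matplotlib.pyplot as plt

    COLORS = {"immobility": "blue", "swimming": "red",
              "climbing": "black", "head shake": "green"}

    def plot_visual_map(events, duration_s=300):
        fig, ax = plt.subplots(figsize=(8, 0.6))
        for behavior, start, end in events:
            # broken_barh draws each behavior as a colored segment on the bar.
            ax.broken_barh([(start, end - start)], (0, 1),
                           facecolors=COLORS[behavior])
        ax.set_xlim(0, duration_s)
        ax.set_yticks([])
        ax.set_xlabel("time (s)")
        plt.show()

    plot_visual_map([("climbing", 0, 30), ("swimming", 30, 90),
                     ("immobility", 90, 300)])

Stacking one such bar per animal or per rater reproduces the side-by-side comparison used in Figures 2 and 3.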
Figure 2
Validation data on Forced Swim Test (FST) scoring. Two experienced raters, after having trained with the Kinoscope program, blindly and independently scored male and female rats in the 5-min second session of the two-session FST. Each animal is represented in a separate row, and each column shows the scoring pattern of one blind, independent rater. Correlation indices were 0.85 for the number of recorded behavioral events, 0.98 for immobility behavior (blue) and 0.90 for immobility latency, 0.89 for swimming (red), 0.97 for climbing behavior (black), and 0.95 for head-shaking frequency (green). All correlations were highly significant (p < 0.001) by two-tailed Pearson's test. Full data are published in Kokras et al. (2015). Raw images from Kinoscope were ordered and collated using ImageJ/Fiji (Schindelin et al., 2012; Schneider et al., 2012).
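The agreement figures above are plain Pearson correlations of per-animal scores between the two raters. The short sketch below shows that computation using scipy's pearsonr (two-tailed by default), with made-up durations rather than the study's actual data.

    import numpy as np
    from scipy.stats import pearsonr

    # Hypothetical immobility durations (s) per animal from two raters;
    # the values are illustrative, not the study's data.
    rater_a = np.array([180.0, 210.5, 150.2, 240.1, 200.0])
    rater_b = np.array([175.3, 215.0, 148.9, 236.7, 205.4])

    r, p = pearsonr(rater_a, rater_b)  # two-tailed p-value by default
    print(f"Pearson r = {r:.2f}, p = {p:.3g}")

Repeating this per measure (immobility, swimming, climbing, latency, event counts) yields the full set of correlation indices reported in the caption.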
Figure 3
Validation data on Novel Object Recognition (NOR) scoring from an as yet unpublished experiment. Two novice student raters, after having trained with the Kinoscope program, blindly and independently scored male and female rats in the 5-min second trial of a two-trial NOR. Each animal is represented in a separate row, and each column shows the scoring pattern of one blind, independent rater. Correlation indices were 0.90 for the number of recorded behavioral events, 0.95 and 0.91 for Object A (red) time and frequency, respectively, and 0.94 and 0.87 for Object B (black) time and frequency, respectively. Time in the general area of the open field is depicted in blue. All correlations were highly significant (p < 0.001) by two-tailed Pearson's test. Raw images from Kinoscope were ordered and collated using ImageJ/Fiji (Schindelin et al., 2012; Schneider et al., 2012).
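NOR data such as these are commonly summarized with a discrimination index contrasting exploration of the novel versus the familiar object. The sketch below uses the field's usual formula with hypothetical times; the text does not specify which summary metrics Kinoscope itself exports.

    def discrimination_index(t_novel, t_familiar):
        # Common NOR convention: (novel - familiar) / (novel + familiar),
        # ranging from -1 to 1; values above 0 indicate novelty preference.
        # This is the field's usual formula, not a documented Kinoscope output.
        total = t_novel + t_familiar
        return (t_novel - t_familiar) / total if total else 0.0

    # Example with hypothetical exploration times (s):
    print(discrimination_index(t_novel=24.0, t_familiar=12.0))  # ~0.33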
Figure 4
Representative workflow of using Kinoscope, with references to the Supplemental Figures.
Conclusion
Accurate behavioral analysis remains of paramount importance in preclinical psychopharmacology (Sousa et al., 2006). Several excellent computer solutions are available for specialized behavioral laboratories wishing to invest in infrastructure or in customizing open-source algorithms that are already available. Kinoscope, an open-source, freely available program for behavioral pharmacologists, as well as other neuroscientists performing behavioral experiments, provides a basic but viable alternative. In our experience, this software tool can be adopted without imposing any burden on the day-to-day operations of a research team. Moreover, experienced staff members using Kinoscope can streamline and audit the training of new members, primarily by making use of the visual maps, thus improving the consistency and reproducibility of scoring by novice researchers. Recently, several concerns have been raised with regard to the validity of experimental data (Steckler, 2015; Bespalov et al., 2016). Many factors should be taken into account in improving the quality of experimental studies (Kilkenny et al., 2009; McNutt, 2014; Macleod et al., 2015), and perhaps another overlooked factor is the quality of manual scoring of behavioral experiments, which, when poor, results in poor inter-rater agreement and inevitably low reproducibility. In our experience (Kokras et al., 2014, 2015, 2017), using Kinoscope's visual maps as visual aids, either in real-time scoring or in later offline auditing, has greatly enhanced, in an efficient and engaging way, the training of new student members and the troubleshooting of poor reproducibility. Positive feedback has also been received from other departments that have used the beta version of this program, and several groups have already used the program for their research (Castelhano-Carlos et al., 2014; Papazoglou et al., 2015; Wiersielis et al., 2016; Lopes et al., 2017; Caetano et al., in press). The program remains under active development, with more behavioral templates (Y-maze, Light/Dark, Tail Suspension Test) scheduled for inclusion soon. Additionally, as data transparency and data sharing have been proposed as remedies for poor data reproducibility (Steckler et al., 2015), the ability to export, import, and exchange results and raw data produced by Kinoscope will be added. In the same context, the open-source code is available for inspection and possible modification (e.g., adding more behavioral templates by other research groups) at github.com. The authors also welcome any suggestions for future improvements. The latest version of the program is available through the Sourceforge repository at https://sourceforge.net/projects/kinoscope, where a training video and a manual are also available.
Author contributions
NK and CD conceptualized, designed and led the development of the program. DB and FT wrote the software code. All authors contributed to the writing of the manuscript and approved the final version.
Funding
This study has been funded by an IKY Fellowship of Excellence for Postgraduate Studies – Siemens Program to NK. The costs for this open-access publication are supported by the ECNP Network “Preclinical Data Forum” (https://www.ecnp.eu/projects-initiatives/ECNP-networks/List-ECNP-Networks/Preclinical-Data-Forum.aspx). The ECNP Network “Preclinical Data Forum” neither promotes nor endorses the use of the software tool reported in this publication.
Conflict of interest statement
NK has received honoraria and travel support from Janssen-Cilag, Lundbeck, Sanofi-Aventis, Medochemie Generics and Elpen S.A. CD has received honoraria from Janssen-Cilag and travel support from Boehringer Ingelheim. The other authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.