Skip to main content

ORIGINAL RESEARCH article

Front. Neurol. , 27 March 2025

Sec. Neuro-Otology

Volume 16 - 2025 | https://doi.org/10.3389/fneur.2025.1532927

This article is part of the Research Topic The Vestibular System: A tribute to Professor Dr. Hans Straka View all 13 articles

Effects of inter-individual variability in experimenters’ sensitivity and training on behavioral assessment of animal models of vestibular disorders

  • 1Vertidiag, Montpellier, France
  • 2Centre de Recherche en Psychologie et Neurosciences, Aix Marseille Université-CNRS UMR 7077, Marseille, France
  • 3Unité GDR 2074 CNRS, Marseille, France
  • 4UR Vertex 7480, Université de Caen Normandie, CHU Caen, Caen, France

Background: This study was designed to explore the correlation between animal behavioral assessment quality and rater’s individual sensitivity and training.

Methods: We selected different raters to form a panel to rate the severity of posturo-locomotor deficits in animals displaying excitotoxic or ototoxic lesions-induced vestibular syndrome. All raters, regardless of their scientific level, received brief training based on videos and tutorial files. They then had to score videos of rats with different types and stages of vestibular syndromes. All data were collected and analyzed.

Results: Inter-individual variability in raters significantly altered the results of behavioral assessment of posturo-locomotor deficits in vestibulo-lesioned animals. Neither gender nor scientific level had an impact on the results. In contrast, the sensitivity of the individual to animal welfare impacted the mean score in the ototoxic lesion model. Raters with high sensitivity tended to exaggerate the symptomatology.

Conclusion: The use of automated assessments of posturo-locomotor deficits in vestibulo-lesioned rodents, is the best solution to limit these assessment biases.

Highlights

• Identifying and reducing experimental bias is part of the scientific process to better control animal models predictability.

• Sensitivity of raters to animal welfare impacts behavioral assessment in ototoxic lesion model.

• Small raters team with similar training can avoid bias linked to inter-individual variability.

Introduction

Animal pathological models aim to reproduce the biological, physiological and behavioral correlates found in human diseases. The main added value is to offer the possibility to investigate the cellular effects, as well as to screen and test therapeutic targets. They also allow precise monitoring of the course of the pathology and prediction of its consequences for human health. In neurotology as in other medical fields, improvement of the predictability of sensory vestibular damage models is a challenge that researchers are trying to meet (1) by combining biological, physiological and behavioral markers that should be quantitatively scored in a homogeneous manner.

Identifying and reducing the various experimental biases is part of the scientific process to best control this predictability, but also to guaranty reproductivity of data between and within laboratories (24). The research community is becoming increasingly vigilant regarding the possible impact of current scientific practices on the reliability and validity of published research. Besides essential data such as animal sex or age, which may greatly affect biological measures, and are not always mentioned (5), it is now known that even subtle changes in housing (cage size, social isolation or enrichment, noise pollution) (6, 7), training condition (especially positive reinforcement training) and the sequence of experiments, can also significantly alter the expression of symptoms of induced pathology and its recovery processes (8, 9) These observations have led to a range of rules regarding housing and experimental conditions that together may help to ensure animal welfare (10, 11).

In addition to these conditions directly affecting the animals, parameters specific to the experimenters, referred to as experimenter effects (12), or experimenter bias (13), can also be a source of variation in obtained results in the behavioral monitoring of animals in experimentation. This problem has been reported in a few behavioral studies involving rodents that were not automated and had to be rated by different experimenters. In these studies, the scoring of animals in elevated plus maze significantly differed depending on the experimenter (14).

Some of this bias arises from the experimenters’ judgment regarding behaviors. Experience and training are therefore essential for conducting behavioral experiments, because failure to consider essential factors affecting the behavior of animals, the interaction of animals and experimenters (15), and scoring behavior, may strongly influence the reproducibility, validity and reliability of the experiments (1620). Other studies in rodents have reported bias related to the gender of experimenters (21) with for example a modulation of antidepressant-like effect of ketamine (22) or an impact of male-related odors that significantly altered ratings of pain exhibited by mice (23). In the case of animal models of vestibular pathologies with strong clinical disturbances, the experimenter bias in the assessment of behavioral biomarkers of the vestibulopathy has never been explored. It is precisely to address this issue that this study was designed.

We conducted a behavioral follow-up study of rats with excitotoxic or ototoxic vestibular lesions, under conditions similar to those used in our previous studies in terms of choice of animals, housing, handling sequences and observation methods (2426). Both models evoked a panel of posturo-locomotor symptoms including immobility, inability to stand up on the hind limb, impaired posture and walk, head tilt, retropulsion, circling, vertical repetitive movements of the head. Each of these individual symptom varies with their own kinetics (27).

We selected different experimenters to form a panel to rate the intensity of posturo-locomotor deficits in vestibulo-injured animals. Experimenters were themselves assessed regarding their sensitivity to animal welfare, which relies to the individual perception of the animal suffering upon their use or exploitation such as scientific experimentation, and their level of training in the behavioral assessment of posturo-locomotor symptoms.

A study was then performed to search for correlation between raters’ inter-individual characteristics and animal scoring. The results of this study and their consequences in term of potential experimental bias are discussed.

Materials and methods

General procedures

Male Long Evans rats (8–10 weeks, 275 ± 25 g, Janvier, France) were housed in cages in social groups (4 rats per cages) with enrichment of the environment under constant temperature (20 ± 1°C), humidity (60 ± 5%) and brightness (below 110 Lux) conditions. Rats were kept on a light / darkness cycle of 12 h / 12 h (lights on at 7.00 am) with food and water available ad libitum. The animals were acclimatized to the experimenter and the experiment room for 5 days before the beginning of the experiments. The pre-defined endpoints were monitored throughout the study.

All animal experiments complied with the ARRIVE guidelines and are carried out in accordance with the U.K. Animals (Scientific Procedures) Act, 1986 and associated guidelines, EU Directive 2010/63/EU for animal experiments, or the National Research Council’s Guide for the Care and Use of Laboratory Animals under the veterinary and National Ethical Committee supervision (French Agriculture Ministry Authorization nF34-172-05). All the applied procedures were previously approved by the ethics committee (MP-CEPAL n°22).

Assessment of vestibular syndrome

Vestibular lesion procedure

Animals were anesthetized with isoflurane®. Then, induction of the unilateral excitotoxic-type vestibular injury was carried out by a transtympanic administration of 100 μL of 25 mM Kainic acid (TTK) in 0.9% NaCl or of 25 mM sodium Arsanilate (TTA) in the left ear. A subcutaneous injection of 1 mL of sterile 0.9% NaCl prevented dehydration in rats. They were kept under anesthesia in a lateral position for additional 30 min (2830) to allow the administrated compound to bathe the middle ear windows. Animals remained isolated for 24 h after surgery and were then put back in a collective cage. The general condition of the rat, weight and pre-defined endpoints were monitored throughout the operative process and post-operative follow-up.

Experimental design

An assessment panel composed of 8 volunteers and 4 members of the team was set up to determine whether there was any impact of the characteristics of the experimenters on the scoring of animals and the results of a study. Each member of the panel received training through 1 tutorial video of 5 min and 1 descriptive sheet per lesion model (TTK and TTA). They then had to score the severity of vestibular syndrome through 26 videos of TTK rats and 25 videos of TTA rats covering all post-lesionnal time points before and after the TT injections. The test was performed a first time 3 days before a left trans-tympanic ear injection to obtain a pre-operative reference value, then at different times after injection (2 h, 4 h, 1, 2, 3, 7 and 14 days). All videos had an identical length of 2 min. Panel members and data analysts were only aware of the experimental model of the animal in videos, while the time point of the video recording remained unknown. We aimed here to evaluate the sensitivity of the experimenters to the different intensities of the vestibular syndrome expression. The syndrome presents a peak expression in the first hours after the lesion, then gradually decreases over time with the impact of the vestibular compensation. The videos evaluated by the raters presented acute and compensated phases of the syndrome. The aim was to assess up to what intensity an experimenter was able to detect a symptom and score it, whatever the observation delay. Finally, they answered a questionnaire on the characteristics and amplitudes of the posturo-locomotor symptoms (Appendix).

Vestibular syndrome rating

The vestibular syndrome was first activated by elevating the rats by the tail about 20 centimeters from the support to activate the vestibular syndrome and exacerbate the expression of its symptoms. It was assessed for 2 min and was composed of a typical set of posture locomotor symptoms that we were able to determine on the basis of our experience. Those symptoms were together present in the acute phase and disappeared sequentially after a specific time (31). A score of 0 (no symptom) to 3 (severe symptoms) was assigned to the following 10 criteria for the TTK model and 8 for the TTA model for a maximum total score of 30 or 24 respectively: prostration time (immobility), ability to stand up on the hind limb (rearing), quality of gait and general movements of the animal (quality of locomotion), body height and lift (body height), lateral tilt of the head (head tilt), reverse walk (retropulsion), concentric trajectories (circling), vertical repetitive movements of the head (bobbing) and general condition. For the TTK model only, we assessed torsional movement of the body when the rat is lifted by the tail over the support (tail hanging behavior), and landing reflex of the rat when dropped on the support (landing reflex).

Panel characteristics

Panel recruitment

Panel recruitment was done on a voluntary basis with persons accustomed or not to animal experimentation. The final panel was composed of 8 volunteers (lab students and staff who were not involved in animal experiment) and 4 members of the team, with 7 women and 5 men. Only the 4 members of the team were previously trained in vestibular syndrome scoring on animal model. The sensitivity to animal welfare can be defined as the way in which we collectively or individually view the way in which animals are regarded and treated in human societies.

Panel formation

Panel members were not used to scoring animals with vestibular lesion, so they had to be trained. Each member of the panel received training through one tutorial video of 5 min and one descriptive sheet per lesion model (TTK and TTA). They were also given a questionnaire to complete about their previous training, the quality of their training, and their ability to self-assess rat symptoms.

Behavioral tests

The impact of vestibular syndrome on the overall motor and exploratory behavior of the animal was assessed using the Open field test. The animals were accustomed to the device for 5 min per day for 3 days before the start of the experimental follow-up. The vestibular syndrome was first stimulated by elevating the rats by the tail about 20 centimeters from the support in order to exacerbate the expression of symptoms.

Rats with Kainic acid administration were tested on a white neutral and empty surface. Animal were placed in the center of this area and allowed to explore. Their behavior was recorded for 2 min with a webcam placed on the roof. The test was performed a first time 3 days before a left trans-tympanic ear injection to obtain a pre-operative reference value, then at different times after injection (2 h, 4 h, 1, 2, 3, 7 and 14 days).

Rat with Arsanilate injection were tested in an open field. The open field was a white neutral and empty surface of 80x80x40cm. Animals were placed in the center of the field considered more stressful compared to the periphery, and their behavior was recorded for 5 min using a digital camera with the Software Ethovision™ XT 15 (Noldus). A phase of habituation to the environment was allowed. The test was performed a first time 3 days before a left trans-tympanic ear injection to obtain a pre-operative reference value, then at different times after the injection (1, 2, 3, 7, 14 and 21 days). The light was fixed at 50 Lux in the center and 30 Lux in corners of the maze. Walls were thoroughly cleaned between animals with ethanol solution (20%) (20). A member of the team randomly chose videos at each time point, including pre-operative, which were submitted to the panel for assessment.

Statistical analysis

The primary objective of our manuscript is to evaluate the performance of raters in detecting differences (if any) between symptoms and animals. The study is designed around the crossing of three experimental factors: Raters, Animals, and Symptoms, with an additional factor, Sessions. This type of experimental design is relatively common in other disciplines, particularly sensory analysis, which focuses on evaluating assessors’ ability to discriminate between products using various descriptors or attributes (32). For our statistical analysis, we followed standard practices in sensory analysis, as outlined by Per Lea and colleagues (33). Specifically, we employed the following ANOVA model: responses ~ Raters * Animals * Symptoms * Sessions.

This model allows us to account for the effects of individual factors as well as their interactions. In this framework, the Raters and Animals factors are considered random effects, while Symptoms and Sessions are treated as fixed factors. This approach enables us to address several key questions: Are the raters effective in their evaluations? Can they reliably discriminate between different symptoms? Are their assessments consistent across repeated evaluations? Do their evaluations align with those of the panel as a whole? To perform this analysis, we first used the panelperf function from the SensoMineR package in the statistical software R (see https://cran.r-project.org/web/packages/SensoMineR/SensoMineR.pdf). Additionally, we utilized the JASP and JAMOVI software tools to perform complementary analyses. Beyond the ANOVA approach, we also applied the Rasch model (34), which models a rater’s response to a symptom as a function of the difference between the rater’s ability, and the characteristics of the symptom. This model provides a visual representation of the distribution of raters’ abilities in relation to the characteristics of symptoms (specifically, their “accessibility”). Such a representation offers valuable insights into the discriminatory power of raters and the inherent difficulty of the symptoms being assessed. Values were expressed as mean ± SEM. A significant statistical result is indicated by * if p value < 0.05, ** if p value < 0.01, and *** if p value < 0.001.

Results

A questionnaire on scientific experience, experience in animal study, animal scoring and sensitivity to animal welfare was sent to each member of the experimenters panel. We grouped the experimenters according to different parameters. The questionnaire provided an overview of the heterogeneity of the selected experimenters and their specificities (Table 1). The experimenter panel was composed 58% of women and 42% of men with an average age of 30 years (± 8 years). Having chosen a largely scientific panel with 25% experienced and 75% very experienced, but not accustomed to animal studies (42% non-experienced), we also questioned their sensitivity to animal welfare. The questionnaire revealed that 18% were non-sensitive, 50% were sensitive and 42% were very sensitive. More broadly, in our sample, people experienced in animal studies (58%) were not necessarily qualified (33% experienced) for scoring posturo-locomotor deficits in vestibulo-injured animals (Figure 1).

Table 1
www.frontiersin.org

Table 1. Panel overview based on the questionnaire given in Appendix.

Figure 1
www.frontiersin.org

Figure 1. Experimental design.

Using the Rasch model assessment of our questionnaire, we found that some items, such as rearing and retropulsion, were similarly perceived (easy or difficult to score, respectively) in both the in TTK- and TTA-induced vestibulopathy models. Other items were diversely appreciated in the two models: circling, head tilt and bobbing were easy to assess in the TTK model, whereas they were considered difficult to score in the TTA model. Conversely, immobility was easy to assess in the TTA model, whereas it was considered difficult to score in the TTK model (Table 2). These data indicate that in the present study, difficulties in scoring the considered symptom were model-dependent.

Table 2
www.frontiersin.org

Table 2. Assessment of the difficulties in scoring a symptom in vestibular loss model with the Rasch model.

Subjective scoring makes it possible to assess the severity of vestibular syndrome by scoring different symptoms using four grades from 0 to 3 (Figure 2). The 12 members of the panel assessed videos of rats that received a trans-tympanic administration of either kainic or arsenic acids and the average scores were reported. We found significant differences in the way the vestibular disorders symptoms were scored between raters in the two experimental models (p < 0.001). A one sample t test was performed to compare every rater with a hypothetical mean resulting from the mean of all raters (Figures 2A,B).

Figure 2
www.frontiersin.org

Figure 2. Impact of intra-individual variability on average rating of vestibular syndrome induced by an injection of Kainate (A) and Arsanilate (B). ANOVA 2 factors, Subject * Score, with repeated measures, p value <0.001 and a t test to compare sample with a hypothetical mean. Significant statistical result are indicated by * if p value < 0.05, ** if p value < 0.01, and *** if p value < 0.001.

The questionnaire enabled classification of the different members of the raters panel according to their characteristics. Thus, we could assess whether one of the parameters of individuals had a role in the variability in the average symptoms score (Figure 3). No gender difference of raters was observed between both TTK and TTA models (p = 0.59) (Figures 3A,B).

Figure 3
www.frontiersin.org

Figure 3. Differences in the average rating of vestibular syndrome according to gender of experimenters on the two experimental models (A) TTK and (B) TTA. ANOVA 2 factors, Gender * Score.

Panel members were regrouped in 3 different groups according to their experience (Group 1: non-experienced; Group 2: experienced; Group 3: very experienced; Table 1). We did not find statistically significant differences in the average rating between the 3 groups in both the TTK (p = 0.93) (Figure 4A) and TTA (p = 0.22) models (Figure 4B).

Figure 4
www.frontiersin.org

Figure 4. Differences in the average rating of vestibular syndrome according to the experience with animals of raters on two lesion models. There were 3 different groups: Group 1: Non-experienced (n = 5), Group 2: Experienced (n = 3), Group 3: Very experienced (n = 4). Two types of models were studied (A) TTK and (B) TTA. ANOVA 2 factors, Experience.

Another interesting characteristic is the sensitivity to animal welfare of the experimenters. There is a range of sensitivity in the panel from non-sensitive to very sensitive (Table 1). We found that the sensitivity has a significant impact on the average rating of vestibular syndrome in the TTK model (p = 0.02) (Figure 5A). With a posthoc test of Tukey, it appears that the non-sensitive group was significantly different from the sensitive group (p value <0.001) and the very sensitive group (p value <0.001). Likewise, the sensitive group was significantly different than the very sensitive group (p < 0.001). In contrast, the sensitivity had no significant impact on the average scoring of the TTA model (p = 0.84) (Figure 5B).

Figure 5
www.frontiersin.org

Figure 5. Differences in the average rating of vestibular syndrome according to the sensitivity to animal welfare of raters on two lesion models. Group 1: non sensitive (n = 1), Group 2: sensitive (n = 6), Group 3: very sensitive (n = 5). (A) TTK and (B) TTA. ANOVA 2 factors, significant statistical result are indicated by ** if p value < 0.01, and *** if p value < 0.001.

There was no significant evidence of a link between the sensitivity to animal welfare and the experience in animal study in the panel (p = 0.64) (Figure 6).

Figure 6
www.frontiersin.org

Figure 6. Differences in the sensitivity to animal welfare depending on their experience in animal study. ANOVA 2 factors, Sensitivity * Experience. Non-experienced (n = 5), Group 2: Experienced (n = 3), Group 3: Very experienced (n = 4).

Interestingly, no significant differences were found in the way vestibular disorders symptoms were scored between raters who are team members (p = 0.63 for TTK model and p = 0.13 for TTA model, Figures 7A,B), even if there are differences in age or gender (Figures 7C,D), and more surprisingly even if they show differences in sensitivity to animal welfare (Figure 7E).

Figure 7
www.frontiersin.org

Figure 7. Impact of equal and long-term training on average rating of vestibular syndrome induced by an injection of Kainate (A) and Arsanilate (B) considering gender, age and sensitivity difference. ANOVA 2 factors, Subject * Score, with repeated measures, p value = 0.63 for TTK model and p value = 0.13 for TTA model. Descriptive analysis of team members with their age (C), gender (D) and sensitivity to animal welfare (E).

Discussion

Identify biases that can occur in an animal study to avoid erroneous conclusions

Evidence of bias in animal studies was previously shown in published research (10, 11). Arguably, the degree to which environmental variables influence research outcomes may not be considered by many researchers (3537) when they focus on controlling direct experimental variables inherent to their own research. The ultimate research facility, with specialized housing rooms and caging systems, provides an environment where variables are minimized regardless of species, thus creating an optimal setting and preventing bias in the data. Animal facilities specificities can account for major differences between studies (4). However, the differences between scientists who are in contact with animals also subsist. Notably, when rating scales are used, some significant differences can occur. We have tried to identify key points to keep in mind when this type of scoring is used.

We found a significant experimenter effect on the average score of TTK and TTA models which means that one experimenter can probably overstate an effect while another can minimize or even miss it.

Gender

We did not find any evidence of an influence of gender on the ability to score the vestibular disorders symptoms. Bias depending on gender was previously reported in behavioral assessments of posturo-locomotor and cognitive abilities in mice models of ethanol effects (18), and assessments of pain in mice (23). In the first case, olfactory exposure to male experimenters’ odors significantly altered the ability of mice to maintain their balance on a rotarod and cognitive parameters assessed in elevated plus maze or open field. In the second case, olfactory exposure to human male odors induced stress and related analgesia in the rodents. These effects may be triggered by male odors in outfits worn by experimenters on the previous day, or specific chemicals known to be excreted by men such as 3-methyl-2-hexenoic acid, androsterone, or androstadienone. In the present study, this situation could not be studied as ratings were performed posteriori, without contact with animals.

Experience with animals

Surprisingly, experience of raters in scoring posturo-locomotor symptoms in both the TTK and TTA rat models did not significantly influence their ratings. One might have thought that experience would be a determining factor in the detection and quantification of such specific symptoms, as experience with laboratory animals has been demonstrated to significantly influence how behavioral markers of animal suffering are accounted for (38). It seems that the behavioral parameters measured are sufficiently salient for novice experimenters to obtain similar average scores to specialist raters. This observation further validates the predictability of the animal models of peripheral vestibular disorders used in present study (1).

Sensitivity to animal welfare

This has become a major societal issue in recent years, and is the basis for animal welfare vigilance in both research and industry. This issue raises major cultural, economic and ecological issues and has led to legislative developments in Western countries in particular, with legal recognition that animals are sentient beings, which requires them to be treated appropriately. The sensitivity to animal welfare varies greatly among individuals, depending on their own sensitivity, experience and contact with animals. Based on our assessments, the sensitivity to animal welfare appears to be the main parameter which supports the experimenter effect on the scoring of the TTK model. This parameter does not significantly affect the score in the TTA model. This can be tentatively explained by the fact that the less severe injury models (such as the TTK model) leave more room for interpretation than the more severe models (such as the TTA model). Less animal-sensitive subjects will thus tend to underestimate the vestibular syndrome compared to those who are more sensitive to animal welfare. Although the evaluation of animal experimentation videos is independent of the experimenter questionnaire, we cannot rule out the idea that having started the questionnaire with the question of sensitivity to the animal cause may have further sensitized the experimenter who consider themselves to be sensitive. Hence being a source of potential bias in the design.

Anticipation of the scoring scale and training of raters to smooth the results

The behavioral scales used in the TTK (24) and TTA (25, 26) models contain items that are not easy to assess for people who are not accustomed to animal experiments. Thus, this margin of error in the symptoms quantification may lead to a significant experimenter bias. We note that in the team the average rating is very close when comparing all raters. Thus, the team having been trained identically for 2 months with different sensitivity in animal welfare at the beginning, that appears in our study as a major parameter impacting the TTK model scoring, now show a reproducible rating with perfect homogeneity between raters. This means that working on the standardization (39) of a scoring scale with adaptive items when subjects are inexperienced or implementing long training with the team and animals makes it possible to smooth the results and arrive at exploitable and reproducible conclusions even if there is intra-individual variation at the beginning.

Automated behavioral assessment is undoubtedly a solution to avoid biases reported in this study. As an example, videotracking and dynamic weight bearing device have been demonstrated to be a suitable tools to monitor specific posturo-locomotor deficits following unilateral vestibular insult in rodents (4042). However, it is not always available in all research teams. Many of which still use subjective behavioral assessment. It’s even less the case in human vestibular rehabilitation practices. Present study aimed to emphasize the risks of bias in subjective assessment.

Conclusion

This study is to our knowledge the first to identify and assess potential experimenter bias in the behavioral assessment of postural-locomotor symptoms in rodent models of peripheral vestibulopathies. The results of our investigations show that while the gender of the experimenters and their experience with animals have little influence on the way the scoring is performed, sensitivity to animal welfare is a potential source of experimenter bias. Training of experimenters in animal behavioral assessment procedures can reduce the risk of experimenter bias. This should be considered in non-automated behavioral assessment studies.

Data availability statement

The raw data supporting the conclusions of this article will be made available by the authors, without undue reservation.

Ethics statement

All animal experiments complied with the ARRIVE guidelines and are carried out in accordance with the U.K. Animals (Scientific Procedures) Act, 1986 and associated guidelines, EU Directive 2010/63/EU for animal experiments, or the National Research Council’s Guide for the Care and Use of Laboratory Animals. The study was conducted in accordance with the local legislation and institutional requirements.

Author contributions

RB: Conceptualization, Data curation, Investigation, Methodology, Validation, Visualization, Writing – original draft, Writing – review & editing. BH: Data curation, Formal analysis, Investigation, Writing – review & editing. CB: Data curation, Formal analysis, Investigation, Writing – review & editing. NC: Funding acquisition, Investigation, Methodology, Project administration, Resources, Validation, Writing – review & editing. AA: Formal analysis, Methodology, Writing – review & editing. SB: Writing – review & editing. BT: Writing – review & editing. CC: Conceptualization, Investigation, Project administration, Supervision, Validation, Writing – original draft, Writing – review & editing.

Funding

The author(s) declare financial support was received for the research, authorship, and/or publication of this article. This research was funded by the company Vertidiag, SA (Montpellier, France), for the supply of animals and technical staff.

Conflict of interest

Authors RB, BH, CB, NC were employed by company VERTIDIAG.

The authors declare that this study received funding from VERTIDIAG. The funder had the following involvement in the study: study design, collection, analysis, interpretation of data, writing of this article, and decision to submit it for publication.

Generative AI statement

The authors declare that no Generative AI was used in the creation of this manuscript.

Publisher’s note

All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article, or claim that may be made by its manufacturer, is not guaranteed or endorsed by the publisher.

References

1. Tighilet, B, Trico, J, Xavier, F, and Chabbert, C. What predictability for animal models of peripheral vestibular disorders? Biomedicines. (2022) 10:3097. doi: 10.3390/biomedicines10123097

PubMed Abstract | Crossref Full Text | Google Scholar

2. Mogil, JS. Laboratory environmental factors and pain behavior: the relevance of unknown unknowns to reproducibility and translation. Lab Anim. (2017) 46:136–41. doi: 10.1038/laban.1223

Crossref Full Text | Google Scholar

3. Nigri, M, Åhlgren, J, Wolfer, DP, and Voikar, V. Role of environment and experimenter in reproducibility of behavioral studies with laboratory mice. Front Behav Neurosci. (2022) 16:835444. doi: 10.3389/fnbeh.2022.835444

PubMed Abstract | Crossref Full Text | Google Scholar

4. Siegel, V. Reproducibility in research. Dis Model Mech. (2011) 4:279–80. doi: 10.1242/dmm.008037

PubMed Abstract | Crossref Full Text | Google Scholar

5. Flórez-Vargas, O, Brass, A, Karystianis, G, Bramhall, M, Stevens, R, Cruickshank, S, et al. Bias in the reporting of sex and age in biomedical research on mouse models. eLife. (2016) 5:e13615. doi: 10.7554/eLife.13615

PubMed Abstract | Crossref Full Text | Google Scholar

6. Åhlgren, J, and Voikar, V. Housing mice in the individually ventilated or open cages-does it matter for behavioral phenotype? Genes Brain Behav. (2019) 18:e12564. doi: 10.1111/gbb.12564

PubMed Abstract | Crossref Full Text | Google Scholar

7. Höglund, AU, and Renström, A. Evaluation of individually ventilated cage systems for laboratory rodents: cage environment and animal health aspects. Lab Anim. (2001) 35:51–7. doi: 10.1258/0023677011911372

PubMed Abstract | Crossref Full Text | Google Scholar

8. Zhang, MQ, Ji, MH, Zhao, QS, Jia, M, Qiu, LL, Yang, JJ, et al. Neurobehavioural abnormalities induced by repeated exposure of neonatal rats to sevoflurane can be aggravated by social isolation and enrichment deprivation initiated after exposure to the anaesthetic. Br J Anaesth. (2015) 115:752–60. doi: 10.1093/bja/aev339

PubMed Abstract | Crossref Full Text | Google Scholar

9. Priya, V, Srikumar, BN, and Shankaranarayana Rao, BS. Contrasting effects of pre-training on acquisition of operant and radial arm maze tasks in rats. J Integr Neurosci. (2018) 12:1–10. doi: 10.3233/JIN-180077

PubMed Abstract | Crossref Full Text | Google Scholar

10. Bloomsmith, MA, Perlman, JE, Hutchinson, E, and Sharpless, M. Behavioral management programs to promote laboratory animal welfare In: RH Weichbrod and GA Thompson, editors. Management of Animal Care and use Programs in research, education, and testing. Boca Raton, FL: CRC Press/Taylor & Francis (2018)

Google Scholar

11. Hogan, MC, Norton, JN, and Reynolds, RP. Environmental Factors: Macroenvironment versus Microenvironment In: RH Weichbrod and GA Thompson, editors. Management of Animal Care and use Programs in research, education, and testing. Boca Raton, FL: CRC Press/Taylor & Francis (2018)

Google Scholar

12. Sousa, N, Almeida, OFX, and Wotjak, CT. A hitchhiker’s guide to behavioral analysis in laboratory rodents. Genes Brain Behav. (2006) 5:5–24. doi: 10.1111/j.1601-183X.2006.00228.x

PubMed Abstract | Crossref Full Text | Google Scholar

13. Eisenach, JC, and Lindner, MD. Did experimenter bias conceal the efficacy of spinal opioids in previous studies with the spinal nerve ligation model of neuropathic pain? Anesthesiology. (2004) 100:765–7. doi: 10.1097/00000542-200404000-00003

PubMed Abstract | Crossref Full Text | Google Scholar

14. Lopez, MF, and Laber, K. Impact of social isolation and enriched environment during adolescence on voluntary ethanol intake and anxiety in C57BL/6J mice. Physiol Behav. (2015) 148:151–6. doi: 10.1016/j.physbeh.2014.11.012

PubMed Abstract | Crossref Full Text | Google Scholar

15. van Driel, KS, and Talling, JC. Familiarity increases consistency in animal tests. Behav Brain Res. (2005) 159:243–5. doi: 10.1016/j.bbr.2004.11.005

PubMed Abstract | Crossref Full Text | Google Scholar

16. López-Aumatell, R, Martínez-Membrives, E, Vicens-Costa, E, Cañete, T, Blázquez, G, Mont-Cardona, C, et al. Effects of environmental and physiological covariates on sex differences in unconditioned and conditioned anxiety and fear in a large sample of genetically heterogeneous (N/Nih-HS) rats. Behav Brain Funct. (2011) 7:48. doi: 10.1186/1744-9081-7-48

PubMed Abstract | Crossref Full Text | Google Scholar

17. Lewejohann, L, Reinhard, C, Schrewe, A, Brandewiede, J, Haemisch, A, Görtz, N, et al. Environmental bias? Effects of housing conditions, laboratory environment and experimenter on behavioral tests. Genes Brain Behav. (2006) 5:64–72. doi: 10.1111/j.1601-183X.2005.00140.x

PubMed Abstract | Crossref Full Text | Google Scholar

18. Bohlen, M, Hayes, ER, Bohlen, B, Bailoo, JD, Crabbe, JC, and Wahlsten, D. Experimenter effects on behavioral test scores of eight inbred mouse strains under the influence of ethanol. Behav Brain Res. (2014) 272:46–54. doi: 10.1016/j.bbr.2014.06.017

PubMed Abstract | Crossref Full Text | Google Scholar

19. Voikar, V. Reproducibility of behavioral phenotypes in mouse models - a short history with critical and practical notes. J Reprod Neurosci. (2020) 1:1375. doi: 10.31885/jrn.1.2020.1375

Crossref Full Text | Google Scholar

20. Bigelow, LJ, Pope, EK, MacDonald, DS, Rock, JE, and Bernard, PB. Getting a handle on rat familiarization: the impact of handling protocols on classic tests of stress in Rattus norvegicus. Lab Anim. (2023) 57:259–69. doi: 10.1177/00236772221142687

PubMed Abstract | Crossref Full Text | Google Scholar

21. Faraji, J, Ambeskovic, M, Sauter, N, Toly, J, Whitten, K, Lopes, NA, et al. Sex-specific stress and biobehavioral responses to human experimenters in rats. Front Neurosci. (2022) 16:965500. doi: 10.3389/fnins.2022.965500

PubMed Abstract | Crossref Full Text | Google Scholar

22. Georgiou, P, Zanos, P, Mou, T-CM, An, X, Gerhard, DM, Dryanovski, DI, et al. Experimenters’ sex modulates mouse behaviors and neural responses to ketamine via corticotropin releasing factor. Nat Neurosci. (2022) 25:1191–200. doi: 10.1038/s41593-022-01146-x

PubMed Abstract | Crossref Full Text | Google Scholar

23. Sorge, RE, Martin, LJ, Isbester, KA, Sotocinal, SG, Rosen, S, Tuttle, AH, et al. Olfactory exposure to males, including men, causes stress and related analgesia in rodents. Nat Methods. (2014) 11:629–32. doi: 10.1038/nmeth.2935

PubMed Abstract | Crossref Full Text | Google Scholar

24. Bringuier, CM, Hatat, B, Boularand, R, Chabbert, C, and Tighilet, B. Characterization of thyroid hormones Antivertigo effects in a rat model of Excitotoxically-induced Vestibulopathy. Front Neurol. (2022) 13:877319. doi: 10.3389/fneur.2022.877319

PubMed Abstract | Crossref Full Text | Google Scholar

25. Hatat, B, Boularand, R, Bringuier, C, Chanut, N, Besnard, S, Mueller, AM, et al. Vertigoheel improves central vestibular compensation after unilateral peripheral vestibulopathy in rats. Front Neurol. (2022) 13:969047. doi: 10.3389/fneur.2022.969047

PubMed Abstract | Crossref Full Text | Google Scholar

26. Hatat, B, Boularand, R, Bringuier, C, Chanut, N, Chabbert, C, and Tighilet, B. Effect of fluoxetine and Acacetin on central vestibular compensation in an animal model of unilateral peripheral Vestibulopathy. Biomedicines. (2022) 10:2097. doi: 10.3390/biomedicines10092097

PubMed Abstract | Crossref Full Text | Google Scholar

27. Dyhrfjeld-Johnsen, J, Gaboyard-Niay, S, Broussy, A, Saleur, A, Brugeaud, A, and Chabbert, C. Ondansetron reduces lasting vestibular deficits in a model of severe peripheral excitotoxic injury. J Vestib Res. (2013) 23:177–86. doi: 10.3233/VES-130483

PubMed Abstract | Crossref Full Text | Google Scholar

28. Besnard, S, Machado, ML, Vignaux, G, Boulouard, M, Coquerel, A, Bouet, V, et al. Influence of vestibular input on spatial and nonspatial memory and on hippocampal NMDA receptors. Hippocampus. (2012) 22:814–26. doi: 10.1002/hipo.20942

PubMed Abstract | Crossref Full Text | Google Scholar

29. Machado, ML, Lelong-Boulouard, V, Smith, PF, Freret, T, Philoxene, B, Denise, P, et al. Influence of anxiety in spatial memory impairments related to the loss of vestibular function in rat. Neuroscience. (2012) 218:161–9. doi: 10.1016/j.neuroscience.2012.05.029

PubMed Abstract | Crossref Full Text | Google Scholar

30. Vignaux, G, Chabbert, C, Gaboyard-Niay, S, Travo, C, Machado, ML, Denise, P, et al. Evaluation of the chemical model of vestibular lesions induced by arsanilate in rats. Toxicol Appl Pharmacol. (2012) 258:61–71. doi: 10.1016/j.taap.2011.10.008

PubMed Abstract | Crossref Full Text | Google Scholar

31. Péricat, D, Farina, A, Agavnian-Couquiaud, E, Chabbert, C, and Tighilet, B. Complete and irreversible unilateral vestibular loss: a novel rat model of vestibular pathology. J Neurosci Methods. (2017) 283:83–91. doi: 10.1016/j.jneumeth.2017.04.001

PubMed Abstract | Crossref Full Text | Google Scholar

32. Meilgaard, MC, Civille, GV, Carr, BT, and Osdoba, KE. Sensory evaluation techniques. 6th ed. Boca Raton, FL: CRC Press (2024).

Google Scholar

33. Lea, P, Næs, T, and Rødbotten, M. Analysis of variance for sensory data. New York: Wiley (1997).

Google Scholar

34. Andrich, D. Rasch models for measurement. Beverly Hills: SAGE Publications, Inc. (1988).

Google Scholar

35. Clough, G. Environmental effects on animals used in biomedical research. Biol Rev Camb Philos Soc. (1982) 57:487–523. doi: 10.1111/j.1469-185x.1982.tb00705.x

PubMed Abstract | Crossref Full Text | Google Scholar

36. Vesell, ES, Lang, CM, White, WJ, Passananti, GT, Hill, RN, Clemens, TL, et al. Environmental and genetic factors affecting the response of laboratory animals to drugs. Fed Proc. (1976) 35:1125–32.

PubMed Abstract | Google Scholar

37. Kilkenny, C, Browne, WJ, Cuthill, IC, Emerson, M, and Altman, DG. Improving bioscience research reporting: the ARRIVE guidelines for reporting animal research. Osteoarthr Cartil. (2012) 20:256–60. doi: 10.1016/j.joca.2012.02.010

PubMed Abstract | Crossref Full Text | Google Scholar

38. Sandgren, EP, Streiffer, R, Dykema, J, Assad, N, and Moberg, J. Influence of animal pain and distress on judgments of animal research justifiability among university undergraduate students and faculty. PLoS One. (2022) 17:e0272306. doi: 10.1371/journal.pone.0272306

PubMed Abstract | Crossref Full Text | Google Scholar

39. Würbel, H. Behaviour and the standardization fallacy. Nat Genet. (2000) 26:263. doi: 10.1038/81541

Crossref Full Text | Google Scholar

40. Vandeputte, C, Taymans, J-M, Casteels, C, Coun, F, Ni, Y, Van Laere, K, et al. Automated quantitative gait analysis in animal models of movement disorders. BMC Neurosci. (2010) 11:92. doi: 10.1186/1471-2202-11-92

PubMed Abstract | Crossref Full Text | Google Scholar

41. Rastoldo, G, Marouane, E, El Mahmoudi, N, Péricat, D, Bourdet, A, Timon-David, E, et al. Quantitative evaluation of a new Posturo-locomotor phenotype in a rodent model of acute unilateral Vestibulopathy. Front Neurol. (2020) 11:505. doi: 10.3389/fneur.2020.00505

PubMed Abstract | Crossref Full Text | Google Scholar

42. Marouane, E, Rastoldo, G, El Mahmoudi, N, Péricat, D, Chabbert, C, Artzner, V, et al. Identification of new biomarkers of Posturo-locomotor instability in a rodent model of vestibular pathology. Front Neurol. (2020) 11:470. doi: 10.3389/fneur.2020.00470

PubMed Abstract | Crossref Full Text | Google Scholar

Appendix

Questionnaire

Name of the experimenter:

Gender:

Age:

Do you know one or more people with vestibular disorders in your entourage?:

Highlight the correct answer:

Sensitivity to the animal cause: 0 (non-sensitive) – 1 (sensitive) – 2 (very sensitive)

Experience in the scientific field: 0 (no experience) – 1 (some experience) – 2 (very experienced)

Experience in animal experimentation (rodents): 0 (no experience) – 1 (some experience) – 2 (very experienced)

Experience in subjective scoring: 0 (no experience) – 1 (some experience) – 2 (very experienced)

Experience in subjective TTA scoring: 0 (no experience) – 1 (some experience) – 2 (very experienced)

Experience in subjective TTK scoring: 0 (no experience) – 1 (some experience) – 2 (very experienced)

For subjective scoring to be reliable, estimate the level that the experimenter would need: 0 (Beginner level) – 1 (Amateur level) – 2 (Intermediate level) – 3 (Experienced level) – 2 (Very experienced level)

Would you have liked more complete training in order to feel more confident with your scoring?: Yes/No

If so, what did you lack?:

• A more detailed information sheet describing the experiment: Yes/No

• More tutorial videos (details on each item): Yes/No

• A Skype interview with an experienced trainer: Yes/No

• A brief training session with animals in the field: Yes/No

Keywords: animal experimentation, risk of bias, methods, vestibular pathologies, inter-individual variability, pathological models

Citation: Boularand R, Hatat B, Bringuier C, Chanut N, El Ahmadi A, Besnard S, Tighilet B and Chabbert C (2025) Effects of inter-individual variability in experimenters’ sensitivity and training on behavioral assessment of animal models of vestibular disorders. Front. Neurol. 16:1532927. doi: 10.3389/fneur.2025.1532927

Received: 22 November 2024; Accepted: 06 February 2025;
Published: 27 March 2025.

Edited by:

François M. Lambert, UMR 5287 Institut de Neurosciences Cognitives et Intégratives d’Aquitaine (INCIA), France

Reviewed by:

Andreas Wree, University of Rostock, Germany
Joanne Mathiasen, Drexel University, United States

Copyright © 2025 Boularand, Hatat, Bringuier, Chanut, El Ahmadi, Besnard, Tighilet and Chabbert. This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.

*Correspondence: Christian Chabbert, Y2hyaXN0aWFuLmNoYWJiZXJ0QHVuaXYtYW11LmZy

Disclaimer: All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article or claim that may be made by its manufacturer is not guaranteed or endorsed by the publisher.

Research integrity at Frontiers

Man ultramarathon runner in the mountains he trains at sunset

95% of researchers rate our articles as excellent or good

Learn more about the work of our research integrity team to safeguard the quality of each article we publish.


Find out more