- School of Veterinary Medicine, University of Surrey, Guildford, United Kingdom
Animal welfare monitoring is a vital part of veterinary medicine and can be challenging due to a range of factors that contribute to the perception of welfare. Tools can be used, however; there are few validated and objective methods available for veterinary and animal welfare professionals to assess and monitor the welfare of dogs over their lifetime. This study aimed to adapt a framework previously validated for other species, The Animal Welfare Assessment Grid (AWAG), for dogs and to host the tool on an accessible, easy to use online platform. Development of the AWAG for dogs involved using the scientific literature to decide which factors were relevant to score welfare in dogs and to also write the factor descriptors. The primary tool was trialed with veterinary professionals to refine and improve the AWAG. Content validity was assessed by subject matter experts by rating the validity of the factors for assessing dog welfare using the item-level content validity index (I-CVI) and scale-level content validity index based on the average method (S-CVI/Ave). Construct validity was evaluated by users of the tool scoring healthy and sick dogs, as well as healthy dogs undergoing neutering procedures. Mann Whitney tests demonstrate that the tool can differentiate between healthy and sick dogs, and healthy and healthy dogs post elective surgery. Test re-test reliability was tested by users conducting multiple assessments on individual dogs under non-changing conditions. Inter-rater reliability was assessed by two users scoring an individual dog at the same time in veterinary referral practice. Repeated measures ANOVA for test re-test and inter-rater reliability both show no statistical difference between scores and that the scores are highly correlated. This study provides evidence that the AWAG for dogs has good content and construct validity, alongside good test re-test and inter-rater reliability.
Introduction
There is currently no universally accepted method to assess animal welfare in any species; however multiple indicators that are evidenced to impact wellbeing both positively and negatively should be used to evaluate an animal's welfare state (1). Routine welfare assessment often needs to produce rapid results, be non-invasive, and should not require any special training for ease of use (2). Assessment tools that are used to evaluate welfare must be developed to be as objective as possible and this can be achieved by ensuring the tool is valid and reliable (3).
Producing reliable results means that similar results occur each time the same animal is tested under the same conditions. Inconsistent results may lead to an inaccurate assessment of welfare and weaken the strength of research findings. It is important to include multiple measures of reliability testing to ensure the tool is consistently reliable by using methods such as inter-rater reliability and test re-testing reliability (Table 1).
The validity of a tool means that it is accurately measuring the construct that the tool was designed to measure. Various tests of validity should be used to establish the tool measures an animal's welfare state and these tests may include construct and content validity (Table 2).
In order to get a true picture of an animal's quality of life, welfare should be routinely measured and not just taken at a snapshot in time. Most of the discussions relating to quality of life in the veterinary clinical setting use no objective tools to assess the animal's welfare. These discussions are more often prompted by owners than vets and may center mainly around euthanasia decisions rather than the proactive quality of life improvement initiatives at an earlier stage (4). Achieving patient-centric and welfare-based treatment goals throughout the animal's life often depends on the willingness of the client and there is an increasing amount of research on improving communication skills and how to implement change in pet owners (5). However, there are few tools for veterinary and animal welfare professionals to use that objectively and holistically assess dog welfare.
Tools that have been previously developed to assess quality of life largely relate to clinical health and illness. Health related quality of life assessment instruments typically take the form of structured questionnaires, which can be either generic or disease specific.
Disease specific instruments may be more responsive to clinical change, but generic instruments can be more valuable as they have the ability to assess a range of factors that impact welfare (6). Quality of life (QOL) tools that have been developed for various chronic diseases and illnesses include chronic pain (7), cardiac disease (8), spinal cord injury (9), osteoarthritis (10, 11), cancer (12, 13), and atopic dermatitis (14–16).
There have also been many tools developed to assess QOL that are unrelated to a specific disease. Mullan and Main (17) devised a four-part quality of life questionnaire for dog owners that assesses a variety of components that can impact a dog's welfare including pain, comfort, exercise, diet, mental stimulation, companionship with people and other dogs. The Mullan and Main (17) tool also assesses behavioral health which is often overlooked in other assessments. It also incorporates a simplified version of the health-related quality-of-life questionnaire for dogs with chronic pain (7). The strong merit of this tool is that it assesses the dog holistically and is patient-centered; it has good repeatability, internal consistency, and validity. However, the questionnaire was designed as a screening tool that would raise awareness of welfare considerations of dogs in veterinary practice, not to generate a quantifiable measure of welfare, and therefore cannot be used to compare the quality of life within or between dogs or measure change over time.
Schmutz et al. (18) developed a tool that is completed by dog owners and assesses eight parameters (energetic, mobile, relaxed, happy, sociable, relaxed, interested and satisfied) using a Likert-type scale. The instrument is demonstrated to have good content validity and reliability and poorer scores are shown in dogs that have chronic disease, demonstrating the tool's use to detect the negative impact of these conditions. The authors also state that the instrument can be completed quickly (3–5 min), which is an important aspect that users may consider when deciding whether to use a welfare assessment tool.
Reid et al. (19) previously designed and validated a similar instrument, the VetMetrica health-related quality of life (HRQL) assessment tool in which the dog owner completes 22 questions and the tool produces scores across four parameters of quality of life (energy, happiness, comfort and calmness). The user can then generate a summary score for physical wellbeing and emotional wellbeing. The user can also compare the scores of the four domains to the average healthy dog in the individuals age group. The scores can assess treatment success to measure clinically significant change over time. The VetMetrica HRQL tool, similar to the Mullan and Main questionnaire, takes a holistic approach to evaluate the dog's quality of life; however, Reid et al. additionally assess how the animal feels about its situation and compares this to what is the average healthy score for the dog's age and breed. However, there is likely to be variation between and within dog breeds and ages, so it may be difficult to compare a dog to another of the same breed or age. Rodger et al. (20) explored the variation of age, sex, and breed using the VetMetrica HRQL tool and found in three domains (energetic and enthusiastic, happy and content, and active and comfortable) there was variation with age, but HRQL declined as the dog aged. This finding is unsurprising as the prevalence of health problems in older animals increases with age. However, in general, there was found to be considerable variation in the HRQL scores, in particular, amongst breeds across all HRQL parameters.
Therefore, a QOL assessment tool should consider each dog as an individual with their own subjective emotions. Emotions can be defined as mental states that motivate behavior by facilitating adaptive physiological, cognitive and behavioral responses (21, 22). Although these cannot be measured directly, there are increasing studies investigating a range of emotional states in dogs (23–28), in particular negative affective states such as fear, anxiety, and frustration. Furthermore, brain anatomy associated with emotional processing in humans are similarly identified in dogs (29–33).
Belshaw (4) states that QOL assessment should encourage us to see each animal as an individual in how they are affected by illnesses and interventions, social interactions and changes in living conditions. Basing quality-of-life assessment on the individual patient and designing unique care and improvement has also been a recent development in human health care. Some studies in mental health have found many benefits to this approach as it allows the patients' needs and interventions to be planned around what they perceive to be important to their quality of life.
Although these instruments make a great effort to maintain objectivity, relying on owner reports alone may be problematic as their perception of their dog can influence their reporting. Owner interpretation of how well or poorly each dog is coping is going to be largely subjective, since it will depend on the individual, and potentially biased, perceptions and beliefs of whoever is making the judgment (34). For example, when assessing their dog's body condition, owners are reported to underestimate their dog's, despite using an objective body condition scale (35). Moreover, owner compliance in regularly completing a questionnaire may be difficult to obtain (36). Therefore, combining a structured owner report, clinical examination, observation of behavior, and discussion of the animal in question may give a better chance of an accurate insight into the dog's welfare. However, it is inevitable that a certain amount of subjectivity may remain in a QOL assessment, but using well-structured tools that aim to reduce bias should help to mitigate this.
It is rare that the instruments incorporate the past experiences of the dog, nor do they consider the actual impact that treatment may have, or predict prospective welfare. Most tools are made to capture the “in the moment” picture of the dog's welfare or look at trends of welfare scores over time.
A common approach for constructing QOL instruments is to identify various domains that independently impact welfare and this concept allows the consideration of the multiple components that reflect the multifactorial nature of quality of life (37). However, many factors that influence a dog's welfare are not independent; pain will affect the dog's behavior, and their ability to play and interact with people and other animals. A change in environment may alter their emotional state, and thus their ability to make choices or carry out behavior. Therefore, when assessing welfare, each factor is likely to influence other aspects of the dog's life, but it is still important to score each individual factor to ascertain where welfare can be improved.
More recently, the issue of novel or “heroic” treatments has raised concern over how QOL is assessed or considered when assessing treatment options. Therefore, objective, structured tools that help assess QOL and help make decisions are vital in veterinary practices and the wider animal welfare professions.
The Animal Welfare Assessment Grid (AWAG) is a tool that monitors the welfare of animals and has been validated across a range of species (38–42). The AWAG assesses physical health, psychological wellbeing, environmental comfort, and veterinary and managemental procedural events. The tool also monitors the cumulative lifetime experience of the animal by assessing the animal throughout its lifetime which can be done in real time or through the use of retrospective and prospective assessment This is important from a welfare perspective as the cumulative impact of positive and negative experiences determines an animal's quality of life (43) and these can shape cognitive bias and long-term emotional state. The duration of positive and negative experiences and the intervals between events must also be taken into account as these can also have a lasting impact on welfare (44). The AWAG has been tested using both retrospective and in-life data and has been found to give a clear indication of animals' welfare during their lifetime (26).
When an animal is scored, the four parameters are visualized with grid scores plotted on the four axes across on a radar chart. By having this, it provides a numeric and visual representation of the animal's welfare state, and if significant changes in welfare are seen, the tool can show which factors have contributed to these changes. Therefore, intervention can be specifically focused to improve the animal's wellbeing.
Each parameter (physical, psychological, environmental, and procedural) is subdivided into several factors that contribute to the overall score. For example, the physical score would encompass the patient's general condition, clinical assessment, pain control, inappetence, and activity level.
Within each parameter the various factors are scored between one and ten. Each factor score is defined using descriptors for each number to reduce scoring bias. A score of one indicates the best possible state (lowest possible impact on welfare), whilst a score of 10 would be the worst possible state (highest possible impact on welfare), for each respective factor. For each parameter, mean factor scores are then calculated and this allows the clinician to ascertain what parameters are impacting quality of life at that time point.
In addition to the ability to quantify quality of life at a given time point, the tool provides a visual representation of the animal's cumulative welfare state (Figure 1). The radar chart displays the four parameters. Each parameter score is calculated from the factors scores, and the resultant scores are marked on the x and y axis. These points are joined together to create a polygon and the total area covered is calculated to derive the cumulative welfare assessment score (CWAS) at that particular point in time (Figure 2).
If significant changes in welfare are seen, the tool can show which factors have contributed to these changes and intervention can be undertaken to improve the animal's wellbeing. This is particularly important in hospitalized patients as it allows veterinary staff to assess the welfare impact of clinical interventions, environment, physical and emotional health to determine what factors are influencing welfare. Veterinary staff can then enhance the factors that contribute to positive welfare and reduce or change elements that are negatively impacting welfare. Additionally, prospective assessments can be undertaken to examine how different treatments will impact the dog's welfare, allowing for improved decision-making.
The aims of this study were to develop an online, easy to use platform for the AWAG and to adapt the AWAG for dogs and conduct initial validity and reliability testing; thus creating an objective, evidence-based welfare assessment tool for anyone working with dogs to use.
Methods
Development of the online AWAG platform
The AWAG software was designed in partnership with Reuben Digital Ltd. ™, Wiltshire and the development of the site began in June 2020. The platform was designed to be functional on a range of devices such as desktops, laptops, tablets and mobiles. Data that were needed to be captured for functionality and analysis were established. The site was designed to easily capture demographic data such as dog name, age, breed, neutered status, and diagnosis using dropdown and free-text functions. The tool was also developed in a way that the assessor can enter events at a certain date and time such as a medical procedure or change in environment as these may provide insight into why a dog's welfare state has changed.
Factor descriptors
Design of the factors and their descriptors
In order to design and adapt the AWAG for dogs, the individual factors to be scored across each parameter (physical, psychological, environmental, and procedural) and the written descriptors needed to be determined. From reviewing the literature, several factors that can contribute to each welfare parameter were identified that influence a dog's quality of life or that may result in cumulative suffering. When deciding on which factors to score, it was important to assess if these would be feasible to score as well as give information about a dog's wellbeing.
The primary physical factors identified in the literature were mobility, conformation, body condition (overweight or underweight), food and water consumption, health state and comorbidities. The physical assessment that the clinician undertakes will encompass the presence of disease, illness, injury, and pain and its likely impact on welfare, which can be further affected by a dog's conformation. A dog's body condition is an objective measure of their physical health status and this is primarily affected by their intake of food. Reduced food intake is an important clinical sign that can result from a myriad of chronic diseases (45). Ill dogs often show reduced food intake or anorexia. Moreover, dogs that are anxious or fearful are often anorexic (46, 47). Therefore, assessing an individual's food intake may give insight into the individual's quality of life.
Psychological factors that can affect general welfare in a negative way include past experiences, fears, and anxieties; specifically, negative experiences in the veterinary practice, intolerance to being handled, separation distress, fear of people, fear of dogs, and fear of noises. Methods of coping such as reactivity and aggression toward stressors can be used to indicate poorer states of welfare.
There are many aspects of a dog's environment that the literature suggests are important to dogs and include the facilitation of a social environment that involves positive interaction with people and other dogs, the opportunity to play, and the ability to use choice and control their environment. As dogs' social needs are so individual, the factor scale describes dogs that have positive social interactions that match their emotional needs. For example, one dog may need frequent engagement with people and other dogs and/or other species throughout the day to meet to maintain emotional wellbeing, whereas another dog may prefer the company of one person and time alone to meet their welfare needs. Dogs that don't encounter other humans or dogs due to fear or dogs that have attachment issues (48) are also included alongside a lack of social environment, as both of these situations will result in poorer welfare scores. As the opportunity to play is intertwined with how enriched the dog's environment is, these were also combined into one scale.
Elements of procedures and management that can affect a dog is the likelihood of pain, handling, length of hospitalization or time in a restricted environment, and the impact on routine. All veterinary procedures will have some impact on welfare which may be short-term and minimally affect the dog, or longer-term involving sedation or anesthesia and affecting the dog's daily routine and ability to carry out normal functions and behaviors.
Peer-reviewed literature and data that report the presentation and severity of the aforementioned welfare factors were used to shape each scale. Descriptors were written and refined for each score. This enabled the respondent to answer on an objective scale of one (least severe) to ten (most severe), which was undertaken to reduce respondent scoring bias by defining each score comprehensively (Current factors—Table 3).
Definitions and examples for terms that may be ambiguous such as “signs of stress”, “enrichment”, and “predictability” were written and implemented into on the AWAG site for user clarification (Table 4).
Recruitment
A multi-pronged recruitment strategy was employed to access veterinary networks as widely as possible across the UK. Recruitment posters were sent to the University of Surrey partner practices and were placed in the Veterinary Times journal. Information about the project was also distributed to professional networks and recruitment posters were shared on social networking sites including Facebook, Twitter, and LinkedIn; followers of these pages were able to share the link if they wished to.
Consent
Prior to being provided with a login to the AWAG site, users were required to submit a signed consent form outlining the research. Clinicians also required owners to sign consent forms. If dogs were in a shelter or other environment where they were not “owned”, the main contact for the organization was required to sign a consent form for all the dogs in their care. Clinicians and owners were also provided with information sheets that provided further details about the research project and data storage and security.
Pilot studies
Pilot studies were undertaken to gain feedback on the functionality and factor scores of the AWAG tool. Twelve clinicians including veterinarians, veterinary behaviorists, and European specialists in behavior piloted the tool and provided both quantitative and qualitative feedback. Factor scores and the functionality of the site were refined on the basis of feedback from these clinicians.
Validation studies
The clinical usefulness of an instrument depends on its ease of use and on its validity. It is important to note that validity is not a dichotomous variable, but a continuous one. The more evidence that can be provided for an instrument, the more “valid” it becomes (49). Thus, following the pilot studies and refinement of the tool, further studies were carried out to ensure the tool was valid and reliable.
These studies tested the tool to ensure experts agreed the factors scored are suitable to assess dog welfare (content validity), that the tool can differentiate between dogs in good and poorer states of welfare (construct validity), and that it produces the same score under non-changing conditions (test re-test reliability). This is important in a welfare assessment tool such as the AWAG as it is necessary to establish that it can represent and quantify an animal's welfare state accurately and that the scores change appropriately where welfare improves or worsens. If the tool is not well validated, it may give a misrepresentation of a dog's wellbeing, resulting in suffering if necessary interventions are missed. Testing reliability is also vital to ensure that the AWAG gives consistent scores under non-changing conditions and that there is little to no variability in the scores between different users, otherwise changes in welfare may not be detected, or conversely, it may show changes in welfare where there are none.
Suitability of factor scores (content validity)
A standard method for assessing content validity involves judgments by subject matter experts (SMEs) with expertise in the content of the test. The Content Validity Index (CVI), a proportion agreement procedure, allows two or more raters to independently review and evaluate the relevance of a sample of items to the domain of content represented in an instrument. The recommended number of experts to review an instrument varies from two to 20 individuals and least five people are suggested to review the instrument to have sufficient control over chance agreement (50).
In order to ensure the factor scores and descriptors were suitable to assess dog welfare ‘Subject Matter Experts' (SMEs) (n = 7) reviewed each factor and rated whether they felt the factor was 1 = not relevant, 2=somewhat relevant, 3= quite relevant, or 4 = very relevant to the assessment of dog welfare. Ratings of 1 and 2 are considered “content invalid,” whereas ratings of 3 and 4 are considered to be “content valid” (51, 52). SMEs were deemed to be experts in dog welfare if they met one of the criteria as described in Table 5. Yusoff provide this table (Figure 3) that outlines acceptable content validity index (CVI) values (53).
Ensuring the AWAG measures different welfare states (construct validity)
Construct validation examines the extent to which a measure assesses the construct that it is intended or supposed to measure (54). This is fundamental in a welfare assessment tool as it is essential that the instrument can report welfare accurately and that the scores increase as welfare states worsen. This trial was conducted by veterinary and animal welfare professionals using the AWAG to score a cohort of dogs that they have evaluated through consultation and examination to be emotionally and clinically healthy to obtain baseline scores for healthy dogs. Assessors also scored healthy dogs undergoing neutering as this is a procedure that is well-known to impact welfare negatively in the short-term as a result of starvation, hospitalization, pain, restricted choice and exercise. Additionally, users scored a cohort of dogs with chronic disease as it is likely that dogs with chronic conditions will score poorer due to the impact these conditions can have on quality of life.
Testing scores under non-changing conditions (test re-test reliability)
Reliability estimates using a test–retest approach measure the degree to which the same testing instrument produces similar results when administered to the same individual in as similar a manner as possible over a period of time. Test–retest reliability is a popular form of reliability estimation for the development and validation of test instruments and is based on correlation (55). Polit (56) state that retest reliability coefficients that approach or exceed 0.80 in their field tests are recommended and (57).
In order to assess test re-test reliability, individual clinicians conducted multiple assessments of 19 emotionally and medically healthy dogs (some in the cohort of healthy dogs used in construct validity testing) in stable and non-changing environments over a two-week period. They performed a minimum of two and a maximum of five tests using the AWAG.
Assessing scores of users on the same patient (inter-rater reliability)
To evaluate the reliability of consistency of scores between users when assessing the same patient, dogs were assessed by two users using the AWAG at the same time in the veterinary clinic during the consultation or when they were hospitalized. In two assessments one user had missed scoring a factor that the other user had scored, so this data was discarded as it would have been unreliable.
Data analysis
Power analysis
Power analysis was undertaken to calculate an appropriate sample size to assess construct validity in the AWAG. An effect size was calculated in R Studio using the “effect size” package with previous data on healthy and chronically ill dogs. A sample size for the number of dogs was then calculated using G*Power 3.1.
Statistical analysis
Test re-test and inter-rater reliability
Highest and lowest scores over multiple assessments were used to assess the variation in scores. Pearson's correlation was used in R Studio to test if the scores were correlated. Analysis of Variance test (ANOVA) was also used to assess the variation between the repeated test scores for each individual dog. This was undertaken in R Studio using the “datarium” package.
Content validity
To calculate item-level content validity index (I-CVI), the relevance rating for each factor was coded to either zero as not relevant, or one as relevant. The number of ones was totaled and divided by the number of experts. The S-CVI/Ave was calculated using the total of the average of the I-CVI ratings divided by the number of factors.
Construct validity
Shapiro-Wilk tests were conducted to test the normality of the data from healthy dogs, sick dogs, and dogs undergoing neutering. Mann Whitney tests were used to compare scores of healthy dogs with sick dogs and healthy dogs with dogs having neutering procedures.
Results
Pilot studies
Results of the Likert-scale questions that obtained data on the factor scores and functionality can be seen in Figure 4.
Feedback from clinicians from the pilot study reported that users thought the tool was important and that the reporting elements were useful, but the tool could be more intuitive with clearer buttons and section headings.
Users of the AWAG felt that the factors were important for dog welfare. One user felt a factor that was missing was side effects of the treatment. However, more people believed some factors were unnecessary. Therefore, the factors were also refined down from 22 to 16. Factors that were removed include comorbidities as these would be picked up by the clinical assessment score, rendering comorbidities unnecessary. Sedation/anesthesia scores were also removed as the sedation or anesthetic element does not impact alone; the associated impacts on routine, eating and drinking, and pain are the primary welfare considerations, and these are being scored as separate factors. Travel to the veterinary practice, separation distress, and abnormal behaviors were also removed as these will be captured within other factors (Table 6).
Regarding the factor descriptors, the general feedback was that they were too detailed, and some were not mutually exclusive. The descriptors were amended in order to address this and were written in a more basic and simpler format.
Power analysis for construct validity
The power analysis t-tests report that a total sample size of 48 dogs is needed to differentiate between healthy and sick dogs for use in validation testing.
Content validation
All seven of the SMEs were in agreement that each factor was relevant to assessing dog welfare besides “aggression toward unfamiliar people” where six out of the seven SMEs believed this was relevant to dog wellbeing. The I-CVI (the proportion of content experts giving item a relevance rating of 3 or 4) = 0.99 (acceptable score 0.83) and S-CVI/Ave (sum of proportion relevance rating)/(number of expert) = 0.94 (Table 7), demonstrating that SMEs agree that the factors scored are representative of a dog's welfare and the AWAG is considered a valid tool to assess wellbeing.
Table 7. Relevance ratings and scores calculated by SMEs (U = user, I-CVI = proportion of content experts giving item a relevance rating of 3 or 4, UA = users in agreement, S-CVI/Ave = (sum of proportion relevance rating)/(number of experts).
Construct validation
“Healthy” vs. “sick” dogs
Mann Whitney tests assessed the difference of Cumulative Welfare Score by health status (mean in group Healthy = 4.78, mean in group Sick = 34.16) suggests that there is a statistically significant difference between the scores of healthy (n = 41) and sick dogs (n = 47) (W = 46, p-363 value = < 0.001) (Figure 5).
“Healthy” vs. “routine procedure” dogs
Mann Whitney tests assessed the difference of Cumulative Welfare Score by health status (mean in group Healthy = 4.78, mean in group procedure = 12.17) suggests that there is a statistically significant difference between the scores of healthy (n = 41) and dogs undergoing routine procedures (n = 8) (w = 45, p-value = < 0.001) (Figure 6).
Reliability
Test re-test reliability
The highest and lowest scores for individual dogs were shown to be strongly correlated (p = <0.001, r = 0.89) and repeated measures ANOVA report no significant difference between tests for each dog (F = 0.55, p = 0.71). These demonstrate that there is little variation in the scores of dogs repeatedly tested in non-changing conditions (Figure 7).
Inter-rater reliability
The two scores for each dog were shown to be highly correlated (p = <0.001, r = 0.97) and repeated measures ANOVA shows no statistical difference between scores (F = 0.39, p = 0.55) (Figure 8).
Discussion
This project involved the development of the AWAG for dogs into a novel, evidence-based, online platform for veterinary and animal welfare professionals to holistically assess canine welfare. Additionally, this study aimed to provide measures of validation and reliability to assess whether the AWAG could accurately measure canine welfare and to ensure the tool was reliable and consistent. The results of this research suggest that the AWAG for dogs is a valid, reliable, and easy-to-use tool for clinicians to score the welfare of the dogs in their care.
The result of the pilot trials provided valuable data on improving both the functionality of the tool and the factor scores. As a large proportion of clinicians believed that many of the initial factors on the system were unnecessary, this helped reduce the number of factors scored. It is important that tools for dog welfare professionals are quick and easy to use due to time constraints in animal welfare settings; therefore, having fewer factors to score made the tool more practical to use in practice. However, it was important to find balance between making the AWAG quick and easy to use whilst being comprehensive enough to accurately assess welfare. Conducting content validation of the tool demonstrated that experts in canine welfare scored each factor to be either quite relevant or very relevant to assess welfare in dogs under the respective parameters.
Pilot trials also revealed that the factor score descriptors could be clearer, with 17% scoring the factor score definitions as difficult. Quantitative feedback reported that some factor descriptors were not always mutually exclusive. The factor descriptors were refined to be simpler, clearer and mutually exclusive.
Following feedback from veterinarians on the factors from pilot trials and SMEs, a refined list of 16 factors across the four parameters (physical, psychological, environmental, and procedural) (Table 3) remained to holistically and objectively assess canine wellbeing. Veterinary consultations, by the nature, generally focus on physical health, which does not give a true representation of a dog's quality of life. Using the AWAG in practice means veterinary staff will discuss a dog's emotional state, its behavior and how it responds and copes with various aspects of its life. It will enable discussion of a dog's physical and social environment, and the predictability of its environment. The AWAG also encourages clinicians to consider the impact of the veterinary visit itself and the welfare concerns that may arise from treatment or procedures, or changes in housing/husbandry affecting the dog's environment which may not be considered in other welfare assessment tools. Owner decisions may be driven by emotion and focused on the potential outcome of treatment, they may not consider how various therapeutic methods may not be in the best interests of the dog (58), either in the long term or short term. Therefore, having the ability to quantify welfare and being able to show owners a visual representation of how their dog's welfare will score over time, may help as part of the decision-making process. Additionally, the AWAG is scored by the veterinary or animal welfare professional using the written descriptors, this helps mitigate any owner bias that may exist in other canine quality of life instruments.
Another utility of the AWAG is the ability to score and monitor the lifetime experience of a dog. Obtaining a baseline score as a puppy or when a dog is first seen in practice allows welfare to be tracked and gives insight into where interventions can be made to improve welfare. Regular monitoring provides clinicians and owners an objective overview of the key factors that influences dog wellbeing and encourages them to discuss psychological health, the environment, and veterinary and husbandry procedures that may be disregarded without use of the tool.
When designing the AWAG, there were also factors that the literature demonstrated could be indicators of welfare such as sleep and abnormal behaviors. Sleep has a significant relationship with mental and physical wellbeing in people. Humans who have depression experience changes in sleep; sleep continuity is affected as well as disinhibition of REM sleep (59–61). Quality and quantity of sleep and inactivity have recently been investigated as a measure of welfare in dogs. Inactivity is shown to be associated with anhedonia, depression-like states, and boredom (28). Conversely, inactivity may be indicative of relaxation or comfort, and it may be difficult to differentiate between inactivity as a result of distress vs. calmness, especially from owner reports. The ability to rest or sleep may also be inhibited by a stressful environment or may be extremely variable dependent on the dog's lifestyle and daily activity (62). Therefore, scoring sleep or inactivity on an ordinal scale from one to ten may also be unfeasible as the length of time a dog spends inactive, does not necessarily indicate a poorer emotional state (63) and quality of sleep is unlikely to be measured accurately without specialist equipment. The adaptability of the AWAG means in future research, wearables that monitor parameters such as activity, heart rate, sleep etc can be integrated into the AWAG site. These additional factors can give insight into a dog's wellbeing will provide additional data that allows animal welfare professionals to monitor of quality of life.
Abnormal behaviors in dogs can be indicative of negative affective states and poor welfare since they are commonly displayed in situations where an animal may be frustrated, stressed, fearful, or lacking stimulation and is often seen in environments where other indicators of poor welfare co-occur (51). These behaviors are out of context in terms of social or environmental stimuli or may be abnormal in duration, frequency and/or intensity, and they may be cognitively or emotionally damaging to the dog. Abnormal behavior can itself lead to welfare concerns if it causes physical injury such as self-mutilation. However, it also proved difficult to score abnormal behaviors on a scale. Denham et al. (64) found that abnormal behaviors can occur under a variety of conditions, not just in states of deprivation. They also found that stereotypic behaviors were reinforced by attention or another action the dog may find positive. Moreover, intensity of the behavior does not necessarily indicate poorer welfare and absence of these behaviors does not indicate good welfare. Therefore, it was decided not to include these factors as they would be problematic and give a potentially misleading assessment of dog wellbeing as in certain contexts or environments, abnormal behaviors may be helping the dog cope and may enhance welfare instead of decreasing it.
Having a valid and rapid tool for veterinary and animal welfare professionals to use is important to objectively assess welfare. Having confidence that an assessment tool has undergone a series of tests to evaluate if it accurately measures welfare and is reliable may reassure clinicians about their clinical judgement or help make treatment and management decisions.
One aim of this study was to provide initial validation of the AWAG for dogs and this was undertaken through both construct and content validity testing. This means that the tool would score a dog with a good quality of life with a low numerical score, and a dog in poor welfare would have a higher score. Veterinarians, veterinary nurses, canine welfare scientists, and clinical behaviorists are well-placed to judge if a dog is clinically and emotionally healthy in a stable environment. Therefore, these professionals were asked to score dogs that they deemed medically and emotionally healthy, dogs that were undergoing a procedure we know impacts welfare in the short-term (neutering), and dogs that have chronic medical or emotional problems. We found that healthy dogs scored low and dogs undergoing neutering scored significantly higher. Dogs with chronic conditions also scored significantly higher compared to healthy dogs. This demonstrates that the AWAG has the ability to accurately capture the welfare state of dogs and discriminate between varying stages of wellbeing. This is also seen in AWAG scores in other species (39, 41) where changes in both the CWAS and individual factor scores suggest subtle changes in welfare state and interventions can be made to improve welfare. Additionally, using the AWAG site, the user can see how their dog scores compared to the cohort of healthy dogs scored. If dogs score higher than the “average healthy dog”, this may encourage owners to make changes to improve their dog's wellbeing.
Another aim of this research was to assess the test re-rest reliability and the inter-rater reliability of the AWAG. The results indicate that when a healthy dog under non-changing conditions is assessed repeatedly over several weeks, there is very little variation between the scores. Several dogs had consistent scores throughout assessments, others had a variation of ~1, which is an expected finding as despite conditions remaining relatively stable, a dog's daily routine, environment and social interactions cannot be controlled completely, so their response and affective state will vary within and between each day.
Although there was no significant variation between scores when assessing inter-rater reliability, cumulative welfare assessment scores still showed some minimal variation, which could be because dogs may interact differently with different people, leading to different ratings. Moreover, veterinary clinics can be a stressful and rapidly changing environment for a dog, and their behavior and responses can change over a short period of time.
We believe the AWAG can be successfully utilized throughout the veterinary journey; in the consultation to discuss quality of life with the owner and identify where improvements can be made, during the hospitalization period to assess how the dog is coping in the environment, how they are impacted by procedural events, and during treatments to monitor if the dog's welfare is improving. Additionally, the AWAG can also be used to assess the welfare of dogs living in varying environments (rehoming shelters, assistance, and service dogs etc.) and allows the user to evaluate where interventions can be made to their environment and management events impact their quality of life.
Future research
To provide further evidence of validity to the AWAG for dogs, assessing concurrent criterion validity would be of value, which would typically involve comparing a measure to another “gold-standard” measure; however, without a gold-standard measure against which the AWAG can be compared, other validated tools could be used to assess if the AWAG produces similar results, providing additional validity.
Conclusions
This study reports the development of a novel canine welfare assessment tool that is highly accessible, produces instant results, easy to use, valid and reliable, and to be used by veterinary and dog welfare professionals with the aim of monitoring and improving dog quality of life. The AWAG for dogs provides a promising platform for clinicians to utilize to objectively quantify welfare to help measure the effects of interventions and to help make treatment and management decisions.
Data availability statement
The original contributions presented in the study are included in the article/supplementary material, further inquiries can be directed to the corresponding author/s.
Ethics statement
The animal study was reviewed and approved by University of Surrey NASPA, a Sub-committee of the Animal Welfare and Ethical Review Board (FHMS 20-21 182 EGA). Written informed consent was obtained from the owners for the participation of their animals in this study.
Author contributions
RM data collection, analysis, and writing. SP and SW supervising, reviewing, and editing. All authors contributed to the article and approved the submitted version.
Funding
The work was carried out with grants from Agria and SKK Research Fund.
Conflict of interest
The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.
Publisher's note
All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article, or claim that may be made by its manufacturer, is not guaranteed or endorsed by the publisher.
References
1. Mellor DJ, Beausoleil NJ. Extending the “Five Domains” model for animal welfare assessment to incorporate positive welfare states. Anim Welf. (2015) 24:241–53. doi: 10.7120/09627286.24.3.241
2. Yon L, Williams E, Harvey ND, Asher L. Development of a behavioural welfare assessment tool for routine use with captive elephants. PLoS ONE. (2019) 14:e0210783. doi: 10.1371/journal.pone.0210783
3. Scott M, Nolan AM, Fitzpatrick JL. Conceptual and methodological issues related to welfare assessment: a framework for measurement. ACTA Agric Acta Agric Scand, Sect A, Anim Sci. (2001) 30:5–10. doi: 10.1080/090647001316922983
4. Belshaw Z. Quality of life assessment in companion animals: what, why, who, when and how. Companion Anim. (2018) 23:264–8. doi: 10.12968/coan.2018.23.5.264
5. Yeates JW. Maximising canine welfare in veterinary practice and research: a review. Vet. J. (2012). 192:272–8. doi: 10.1016/j.tvjl.2011.10.024
6. Reid J, Nolan AM, Scott EM. Measuring pain in dogs and cats using structured behavioural observation. Vet J. (2018) 236:72–9. doi: 10.1016/j.tvjl.2018.04.013
7. Wiseman-Orr ML, Nolan AM, Reid J, Scott EM. Development of a questionnaire to measure the effects of chronic pain on health-related quality of life in dogs. Am J Vet Res. (2004 A) 65:1077–84. doi: 10.2460/ajvr.2004.65.1077
8. Freeman LM, Rush JE, Farabaugh AE, Must A. Development and evaluation of a questionnaire for assessing health-related quality of life in dogs with cardiac disease. J Am Vet Med Assoc. (2005) 226:1864–8. doi: 10.2460/javma.2005.226.1864
9. Budke CM, Levine JM, Kerwin SC, Levine GJ, Hettlich BF, Slater MR. Evaluation of a questionnaire for obtaining owner-perceived, weighted quality-of-life assessments for dogs with spinal cord injuries. J Am Vet Med Assoc. (2008) 233:925–30. doi: 10.2460/javma.233.6.925
10. Brown DC, Boston RC, Coyne JC, Farrar JT. Development and psychometric testing of an instrument designed to measure chronic pain in dogs with osteoarthritis. Am J Vet Res. (2007) 68:631–7. doi: 10.2460/ajvr.68.6.631
11. Hielm-Björkman AK, Kapatkin AS, Rita HJ. Reliability and validity of a visual analogue scale used by owners to measure chronic pain attributable to osteoarthritis in their dogs. Am J Vet Res. (2011) 72:601–7. doi: 10.2460/ajvr.72.5.601
12. Yazbek KVB, Fantoni DT. Validity of a health-related quality-of-life scale for dogs with signs of pain secondary to cancer. J Am Vet Med Assoc. (2005) 226:1354–8. doi: 10.2460/javma.2005.226.1354
13. Lynch S, Savary-Bataille K, Leeuw B, Argyle DJ. Development of a questionnaire assessing health-related quality-of-life in dogs and cats with cancer. Vet Comp Oncol. (2011) 9:172–82. doi: 10.1111/j.1476-5829.2010.00244.x
14. Favrot C, Linek M, Mueller R, Zini E. Development of a questionnaire to assess the impact of atopic dermatitis on health-related quality of life of affected dogs and their owners. Vet Dermatol. (2010) 21:64–70. doi: 10.1111/j.1365-3164.2009.00781.x
15. Noli C, Borio S, Varina A, Schievano C. Development and validation of a questionnaire to evaluate the Quality of Life of cats with skin disease and their owners, and its use in 185 cats with skin disease. Vet Dermatol. (2016) 27:247–e58. doi: 10.1111/vde.12341
16. Noli C, Minafò G, Galzerano M. Quality of life of dogs with skin diseases and their owners. Part 1: development and validation of a questionnaire. Vet Dermatol. (2011) 22:335–43. doi: 10.1111/j.1365-3164.2010.00954.x
17. Mullan S, Main D. Preliminary evaluation of a quality-of-life screening programme for pet dogs. J Small Anim Pract. (2007) 48:314–22. doi: 10.1111/j.1748-5827.2007.00322.x
18. Schmutz A, Spofford N, Burghardt W, De Meyer G. Development and initial validation of a dog quality of life instrument. Sci Rep. (2022) 121:12:1–8. doi: 10.1038/s41598-022-16315-y
19. Reid J, Wiseman-Orr ML, Scott EM, Nolan AM. Development, validation and reliability of a web-based questionnaire to measure health-related quality of life in dogs. J Small Anim Pract. (2013) 54:227–33. doi: 10.1111/jsap.12059
20. Rodger S, Scott EM, Nolan A, Wright AK, Reid J. Effect of age, breed, and sex on the health-related quality of life of owner assessed healthy dogs. Front Vet Sci. (2021) 8:13. doi: 10.3389/fvets.2021.603139
21. Panksepp J. Affective consciousness: Core emotional feelings in animals and humans. Conscious Cogn. (2005) 14:30–80. doi: 10.1016/j.concog.2004.10.004
22. Faragó T, Andics A, Devecseri V, Kis A, Gácsi M, Miklósi Á. Humans rely on the same rules to assess emotional valence and intensity in conspecific and dog vocalizations. Biol Lett. (2014) 10:20130926. doi: 10.1098/rsbl.2013.0926
23. McPeake KJ, Collins LM, Zulch H, Mills DS. The canine frustration questionnaire—development of a new psychometric tool for measuring frustration in domestic dogs (canis familiaris). Front Vet Sci. (2019) 6:152. doi: 10.3389/fvets.2019.00152
24. Starling M, Spurrett A, McGreevy P. A pilot study of methods for evaluating the effects of arousal and emotional valence on performance of racing greyhounds. Animals. (2020) 10:1–20. doi: 10.3390/ani10061037
25. Part CE, Kiddie JL, Hayes WA, Mills DS, Neville RF, Morton DB, et al. Physiological, physical and behavioural changes in dogs (Canis familiaris) when kennelled: testing the validity of stress parameters. Physiol Behav. (2014) 133:260–71. doi: 10.1016/j.physbeh.2014.05.018
26. Jakovcevic A, Elgier AM, Mustaca AE, Bentosela M. Frustration Behaviors in Domestic Dogs. J Appl Anim Welf Sci. (2013 J) 16:19–34. doi: 10.1080/10888705.2013.740974
27. Bradshaw JWS, Pullen AJ, Rooney NJ. Why do adult dogs “play”? Behav Processes. (2015) 110:82–7. doi: 10.1016/j.beproc.2014.09.023
28. Harvey ND, Moesta A, Kappel S, Wongsaengchan C, Harris H, Craigon PJ, et al. Could greater time spent displaying waking inactivity in the home environment be a marker for a depression-like state in the domestic dog? Animals. (2019) 9:420. doi: 10.3390/ani9070420
29. Head E, Rofina J, Zicker S. Oxidative stress, aging, and central nervous system disease in the canine model of human brain aging. Vet Clin North Am Small Anim Pract. (2008) 38:167–78. doi: 10.1016/j.cvsm.2007.10.002
30. Berns G. Decoding the canine mind. Cerebrum. (2020). Available from: http://www.ncbi.nlm.nih.gov/pubmed/32395197 (accessed on June 22, 2021).
31. Berns GS, Brooks AM, Spivak M. Scent of the familiar: An fMRI study of canine brain responses to familiar and unfamiliar human and dog odors. Behav Processes. (2015) 110:37–46. doi: 10.1016/j.beproc.2014.02.011
32. Kujala M V. Canine emotions as seen through human social cognition. Anim Sentience. (2017) 2:1. doi: 10.51291/2377-7478.1114
33. Andics A, Gácsi M, Faragó T, Kis A, Miklósi Á. Voice-sensitive regions in the dog and human brain are revealed by comparative fMRI. Curr Biol. (2014) 24:574–8. doi: 10.1016/j.cub.2014.01.058
34. Serpell J. How happy is your pet? The problem of subjectivity in the assessment of companion animal welfare. Anim Welf . (2019) 57–66. doi: 10.7120/09627286.28.1.057
35. Eastland-Jones RC, German AJ, Holden SL, Biourge V, Pickavance LC. Owner misperception of canine body condition persists despite use of a body condition score chart*. J Nutr Sci. (2014) 3:1–5. doi: 10.1017/jns.2014.25
36. Wareham KJ, Brennan ML, Dean RS. Systematic review of the factors affecting cat and dog owner compliance with pharmaceutical treatment recommendations. Vet Rec. (2019) 184:154–154. doi: 10.1136/vr.104793
37. Lavan RP. Development and validation of a survey for quality of life assessment by owners of healthy dogs. Vet J. (2013) 197:578–82. doi: 10.1016/j.tvjl.2013.03.021
38. Wolfensohn S, Honess P. Laboratory animal, pet animal, farm animal, wild animal: Which gets the best deal? Anim Welf . (2007)
39. Justice WSM, O'Brien MF, Szyszka O, Shotton J, Gilmour JEM, Riordan P, et al. Adaptation of the animal welfare assessment grid (AWAG) for monitoring animal welfare in zoological collections. Vet Rec. (2017) 181:143. doi: 10.1136/vr.104309
40. Wolfensohn S. Too Cute to Kill? The Need for Objective Measurements of Quality of Life. Animals. (2020) 10:1054. doi: 10.3390/ani10061054
41. Wolfensohn S, et al. Assessment of welfare in zoo animals: towards optimum quality of life. Animals. (2018) 8:110. doi: 10.3390/ani8070110
42. Dunn RA. The sooner the better: the arguments for the use of extended welfare assessment grids in animal welfare cases. Liverp Law Rev. (2020) 41:107–27. doi: 10.1007/s10991-020-09243-2
43. Bateson M, Poirier C. Can biomarkers of biological age be used to assess cumulative lifetime experience? Animal Welfare. (2019) doi: 10.7120/09627286.28.1.041
44. Wolfensohn S, Sharpe S, Hall I, Lawrence S, Kitchen S, Dennis M. Refinement of welfare through development of a quantitative system for assessment of lifetime experience. Anim Welf. (2015) 24:139–49. doi: 10.7120/09627286.24.2.139
45. Johnson LN, Freeman LM. Recognizing, describing, and managing reduced food intake in dogs and cats. J Am Vet Med Assoc. (2017) 251:1260–6. doi: 10.2460/javma.251.11.1260
46. Frank D. Recognizing behavioral signs of pain and disease: a guide for practitioners. Vet Clin N Am Small Animal Pract. (2014). 44:507–24. doi: 10.1016/j.cvsm.2014.01.002
47. Ballantyne KC. Separation, Confinement, or Noises: What Is Scaring That Dog? Vet Clin N Am Small Animal Pract. (2018). 48:367–86. doi: 10.1016/j.cvsm.2017.12.005
48. Payne E, Bennett PC, McGreevy PD. Current perspectives on attachment and bonding in the dog-human dyad. Psychol Res Behav Manag. (2015) 8:71–9. doi: 10.2147/PRBM.S74972
49. Walton MB, Cowderoy E, Lascelles D, Innes JF. Evaluation of construct and criterion validity for the ‘Liverpool Osteoarthritis in Dogs' (LOAD) clinical metrology instrument and comparison to two other instruments. PLoS ONE. (2013) 8:e58125. doi: 10.1371/journal.pone.0058125
50. Zamanzadeh Z, Ghahramanian A, Rassouli M, Abbaszadeh A, Alavi-Majd H, Nikanfar A. Design and implementation content validity study: development of an instrument for measuring patient-centered communication. J caring Sci. (2015) 4:165–78. doi: 10.15171/jcs.2015.017
51. Wynd CA, Schmidt B, Schaefer MA. Two quantitative approaches for estimating content validity. West J Nurs Res. (2016) 25:508–18. doi: 10.1177/0193945903252998
52. Almanasreh E, Moles R, Chen TF. Evaluation of methods used for estimating content validity. Res Soc Adm Pharm. (2019) 15:214–21. doi: 10.1016/j.sapharm.2018.03.066
53. Yusoff MSB. ABC of content validation and content validity index calculation. Educ Med J. (2019) 11:49–54. doi: 10.21315/eimj2019.11.2.6
55. Frey BB,. The SAGE Encyclopedia of Educational Research, Measurement, Evaluation. https://www.google.com/search?q=Thousand+Oaks&stick=H4sIAAAAAAAAAOPgE-LUz9U3sEw2MC9R4gAxiyySLbS0spOt9POL0hPzMqsSSzLz81A4VhmpiSmFpYlFJalFxYtYeUMy8kuLE_NSFPwTs4t3sDLuYmfiYAAAEArCGVsAAAA&sa=X&ved=2ahUKEwie9Krw7fr5AhWUGVkFHYO4AxEQmxMoAXoECGUQAw Thousand Oaks, CA: Sage (2018).
56. Polit DF. Getting serious about test–retest reliability: a critique of retest research and some recommendations. Qual Life Res. (2014) 23:1713–20. doi: 10.1007/s11136-014-0632-9
57. Deyo R, Diehr P, Patrick D. Reproducibility and responsiveness of health status measures. Statistics and strategies for evaluation. Control Clin Trials. (1991) 12 doi: 10.1016/S0197-2456(05)80019-4
58. Springer S, Sandøe P, Lund TB, Grimm H. “Patients' interests first, but … ”–Austrian Veterinarians' Attitudes to Moral Challenges in Modern Small Animal Practice. Anim. (2019) 9:241. doi: 10.3390/ani9050241
59. Vandeputte M, de Weerd A. Sleep disorders and depressive feelings: a global survey with the Beck depression scale. Sleep Med. (2003) 4:343–5. doi: 10.1016/S1389-9457(03)00059-5
60. Meerlo P, Sgoifo A, Suchecki D. Restricted and disrupted sleep: Effects on autonomic function, neuroendocrine stress systems and stress responsivity. Sleep Med Rev. (2008) 12:197–210. doi: 10.1016/j.smrv.2007.07.007
61. Riemann D, Krone LB, Wulff K, Nissen C. Sleep, insomnia, and depression. Neuropsychopharmacology, Vol. 45. Berlin, Germany: Springer Nature (2020). p. 74–89. Available from: doi: 10.1038/s41386-019-0411-y (cited May 24, 2021).
62. Hoffman CL, Ladha C, Wilcox S. An actigraphy-based comparison of shelter dog and owned dog activity patterns. J Vet Behav. (2019) 34:30–6. doi: 10.1016/j.jveb.2019.08.001
63. Harvey ND, Moesta A, Wongsaengchan C, Harris H, Craigon PJ, Fureix C. Investigating putative depression-like states in the domestic dog: Does greater time spent displaying waking inactivity in the home kennel co-vary with negative judgment of ambiguity? Appl Anim Behav Sci. (2020) 230:105025. doi: 10.1016/j.applanim.2020.105025
Keywords: dog, welfare assessment, quality of life, validation, veterinary medicine
Citation: Malkani R, Paramasivam S and Wolfensohn S (2022) Preliminary validation of a novel tool to assess dog welfare: The Animal Welfare Assessment Grid. Front. Vet. Sci. 9:940017. doi: 10.3389/fvets.2022.940017
Received: 09 May 2022; Accepted: 26 August 2022;
Published: 16 September 2022.
Edited by:
Edward Narayan, The University of Queensland, AustraliaReviewed by:
Gabrielle Christine Musk, University of Western Australia, AustraliaMarcelo Ghezzi, Universidad Nacional del Centro de Buenos Aires, Argentina
Anna Zamansky, University of Haifa, Israel
Copyright © 2022 Malkani, Paramasivam and Wolfensohn. This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.
*Correspondence: Rachel Malkani, r.malkani@surrey.ac.uk