ORIGINAL RESEARCH article

Front. Psychiatry, 14 November 2023
Sec. ADHD

Utilizing artificial intelligence-based eye tracking technology for screening ADHD symptoms in children

Xiaolu Chen1, Sihan Wang2, Xiaowen Yang1, Chunmei Yu1, Fang Ni1, Jie Yang1, Yu Tian1, Jiucai Ye1, Hao Liu1 and Rong Luo1*
  • 1Key Laboratory of Development and Maternal and Child Diseases of Sichuan Province, Department of Pediatrics, Sichuan University, Chengdu, China
  • 2NeuroWeave, Co., Ltd., Shanghai, China

Objective: To explore the potential of using artificial intelligence (AI)-based eye tracking technology on a tablet for screening Attention-deficit/hyperactivity disorder (ADHD) symptoms in children.

Methods: We recruited 112 children diagnosed with ADHD (ADHD group; mean age: 9.40 ± 1.70 years old) and 325 typically developing children (TD group; mean age: 9.45 ± 1.59 years old). We designed a data-driven end-to-end convolutional neural network appearance-based model to predict eye gaze to permit eye-tracking under low resolution and sampling rates. The participants then completed the eye tracking task on a tablet, which consisted of a simple fixation task as well as 14 prosaccade (looking toward target) and 14 antisaccade (looking away from target) trials, measuring attention and inhibition, respectively.

Results: Two-way MANOVA analyses demonstrated that diagnosis and age had significant effects on performance on the fixation task [diagnosis: F(2, 432) = 8.231, ***p < 0.001; Wilks’ Λ = 0.963; age: F(2, 432) = 3.999, *p = 0.019; Wilks’ Λ = 0.982], prosaccade task [age: F(16, 418) = 3.847, ***p < 0.001; Wilks’ Λ = 0.872], and antisaccade task [diagnosis: F(16, 418) = 1.738, *p = 0.038; Wilks’ Λ = 0.938; age: F(16, 418) = 4.508, ***p < 0.001; Wilks’ Λ = 0.853]. Correlational analyses revealed that participants with higher SNAP-IV scores were more likely to have shorter fixation duration and more fixation intervals (r = −0.160, 95% CI [−0.250, −0.067], ***p < 0.001), poorer scores on adjusted prosaccade accuracy, and poorer scores on antisaccade accuracy (Accuracy: r = −0.105, 95% CI [−0.197, −0.011], *p = 0.029; Adjusted accuracy: r = −0.108, 95% CI [−0.200, −0.015], *p = 0.024).

Conclusion: Our AI-based eye tracking technology implemented on a tablet could reliably discriminate eye movements of the TD group and the ADHD group, providing a potential solution for ADHD screening outside of clinical settings.

Introduction

Attention-deficit/hyperactivity disorder (ADHD) is a neurodevelopmental disorder characterized by persistent and age-inappropriate inattention, hyperactivity, and/or impulsivity. The prevalence of ADHD is increasing, with rates of 7.2% globally and 6.26% in China (1, 2). The prevalence in China is likely an underestimation due to many unreported cases as there is a shortage of specialized pediatric psychiatrists, especially in rural areas. As ADHD is a chronic disorder that has a significant impact on the individual, their family, and society, it is critical to screen and diagnose ADHD accurately in order to provide early intervention for children with ADHD (3).

The current diagnostic process for ADHD within the clinical setting primarily relies on a subjective interview and standardized rating scales (4, 5). Although P300 event-related potentials are promising electroencephalography signatures that can objectively discriminate individuals with ADHD, they are not formally used for screening or diagnostic purposes because they require sophisticated equipment (6, 7). Other physiological differences have been reported in markers within the heart–brain and gut–brain axes as well as in motor cortex physiology; however, these remain correlative measures and are unable to provide diagnostic confirmation (8, 9). Given the complex etiology of ADHD and the common occurrence of multiple comorbidities, it is therefore challenging to identify children with ADHD using the present procedures, which are based on subjective assessments. This highlights the need for objective and quantifiable neurobehavioral tests and measures that can support faster and more accurate diagnostic screening for earlier interventions (4, 5, 10).

With the advancement in eye tracking technology, voluntary and involuntary eye movements can be registered to assess ocular dysfunction and neural mechanisms underlying attention, emotions, and intentions (11). A commonly tracked parameter is saccades, which are eye movements involving the sequential alignment of the fovea toward objects of interest in the periphery, through swift and discrete step-like movements. Eye tracking experiments have been shown to be useful for the diagnosis of neuropsychiatric and neurological disorders, including bipolar disorder, mild cognitive impairment, Alzheimer’s disease, Parkinson’s disease, autism, and ADHD (12). Particularly, children with ADHD have a greater number of intrusive saccades during the eye fixation task, with a shorter antisaccade latency compared to children without ADHD (13, 14). Children with ADHD are also more likely to make directional errors and have slower saccade reaction times during an antisaccade task, during which the eye movement should be opposite to the visual stimuli.

Although eye tracking technology can be a useful clinical tool to guide diagnosis, it typically requires the use of specialized equipment, including a chin and head rest, a high-speed camera, and an additional computer screen (15, 16). This limits its use outside of a research lab and thus is not ideal as a clinical tool. As such, the present study aimed to determine whether our artificial intelligence (AI)-based eye tracking technology, which only requires a tablet, can be used as a tool to screen for ADHD symptoms in children aged 6–12 years old.

Materials and methods

Participants

We recruited 639 children in grades 1–6 at the Second Primary School in Wanyuan City, and the 437 participants who fulfilled the inclusion criteria were enrolled in this study. Although there were no dropouts, 43 children were omitted as they did not complete the eye-tracking task successfully. The inclusion criteria were as follows: aged 6–12 years old, consented to undergo assessment using the criteria described in the Diagnostic and Statistical Manual of Mental Disorders, Fifth Edition (DSM-5), and completed at least three prosaccade and three antisaccade trials.

The exclusion criteria were as follows: current, controlled (on medication) or uncontrolled, comorbid psychiatric diagnosis; currently diagnosed with mental retardation [IQ < 80 determined using the Wechsler Intelligence Scale for Children (WISC-IV)]; eye movement disorder; history of seizures, or significant motor or vocal tics, including but not limited to Tourette’s disorder; uncorrected refractive error and distance visual acuity; diagnosis of or parent-reported color blindness, which would prevent independent completion of the eye tracking test. The enrolled participants were then broadly divided into the ADHD and typically developing (TD) groups based on the DSM-5 diagnostic criteria, which are further elaborated in the data analysis section below.

Apparatus

The task does not require the screen resolution to be specifically high, and most consumer-grade portable devices will be sufficient to carry out the task. We used a Lenovo® Yoga Tab 13 Tablet without additional sensors or devices to record the eye movements, eye gaze, and head positions of the participants at 30 Hz. We designed a data-driven end-to-end convolutional neural network appearance-based model that uses the information from serial images to robustly predict eye gaze, with an accuracy of 2 cm based on the average Euclidean distance error from the true fixation location.

To ensure the AI model was robust in predicting eye gaze, we used the Gaze Capture Dataset to train our AI model (17). This dataset features a sizable compilation of eye-tracking data derived from over 1450 individuals and encompasses close to 2.5 million images. Due to the diversity of the dataset, the AI model is adapted to faces of most ethnicities. We performed an initial experiment collecting data from 258 participants, who were requested to sequentially shift their focus between focal points displayed on a screen. Over 232,200 frames were collected and used to analyze the corresponding relation between images of faces, eyes, and gaze location. The accuracy was determined using the average Euclidean distance error, which was found to be less than 2 cm. This indicates that the predicted gaze point could potentially lie up to 2 cm away from the actual fixation location.
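The accuracy measure described above can be illustrated with a minimal sketch. It assumes that predicted and true gaze points are expressed as (x, y) screen coordinates in centimetres; the function name and the sample points are hypothetical and are not study data.

```python
import math


def mean_euclidean_error_cm(predicted, actual):
    """Average Euclidean distance (in cm) between predicted and true gaze points."""
    if len(predicted) != len(actual) or not predicted:
        raise ValueError("need two equally sized, non-empty point lists")
    distances = [math.dist(p, a) for p, a in zip(predicted, actual)]
    return sum(distances) / len(distances)


# Hypothetical on-screen coordinates in cm (illustrative values only).
predicted = [(10.0, 5.0), (12.0, 9.0), (3.0, 4.0)]
actual = [(10.0, 6.0), (12.0, 9.0), (0.0, 0.0)]

# Per-point distances are 1.0, 0.0 and 5.0 cm, so the average is 2.0 cm.
error = mean_euclidean_error_cm(predicted, actual)
```

An average error of 2 cm computed in this way is a mean over frames, so individual predictions may fall closer to or farther from the true fixation location.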

Using a distributed processing model, data were collected and uploaded to the cloud server for processing and subsequently returned (Supplementary Figure 1). This procedure allows the tablet to be the sole data collection terminal, thus reducing the resources required for the eye tracking test.

Data processing

The recordings of eye movements were sent to the cloud, where the server acts as a data warehouse and a center for data processing. Parameters including latency, velocity, gain, amplitude, fixation duration, and error rate for cognitive tasks were detected via AI algorithmic calculations.

We have previously implemented the use of an appearance-based model that requires a large amount of data but generalizes well to novel faces. It uses frames with both the face and eyes for detection as crucial inputs, such that eye tracking can only operate if the whole face is visible. The coordinates of the left eye, right eye, and face are then entered as a 1 × 6 vector (Supplementary Figure 2). As long as the face can be captured in the center of the screen without any obstruction, the model would adjust to fit to the size of the face.

By using an appearance-based model, we could achieve end-to-end matching by constructing a sample based on feature recognition, which can tolerate a relatively low resolution and the sample rate of the front-facing camera of the tablet. To calibrate the eye tracker, the subject will be required to stare at the center fixation point on the screen for 500 ms before each trial, minimizing the need for a tedious calibration process that is required for traditional eye tracking technologies.

Stimuli used in the eye tracking test

The stimulus used in the fixation task was a dot 25 pixels in diameter, matching the dot size described by Bucci et al. (18), with an RGB value of [75, 35, 35], presented on a light-yellow screen with an RGB value of [252, 248, 230]. We used the arc-length formula to calculate the size of the stimulus dot:

L = ω × r
= (0.5° × 2π / 360°) × 60 cm
= 5.236 mm

where ω is the angle in radians and r is the radius, taken as 60 cm, the average distance from the eye to the screen based on repeated trials. Based on the dimensions of the Yoga 13 tablet, which are 293.4 mm × 204 mm, the size of the dot will be:

= 1440 × 5.236 / 293.4 = 25.698 pixels

As such, we have selected a dot size of 25 pixels for convenient calculation and programming, which will be located at the center of the screen for a duration of 30 s (Supplementary Figure 3).

In the saccade task, the stimulus will appear on either the right or the left side of the screen with a point-size of 0.5 degrees (18). The distance between the stimulus and the initial instructional dot was calculated as follows:

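( Number of pixels along width × arc length / screen length ) / 2
= ( 1440 × 232.5 mm / 293.4 mm ) / 2
≈ 570 pixels

where the arc length, 22.2° × 2π / 360° × 60 cm ≈ 232.5 mm, follows from the same arc-length formula.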
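The two conversions from visual angle to pixels can be checked with a short sketch. It assumes that the fixation dot subtends 0.5° and that the left and right targets together span 22.2° at a viewing distance of 60 cm; this reading is an interpretation that reproduces the 5.236 mm, 25.7-pixel and 570-pixel figures. The function names are hypothetical.

```python
import math

# Values taken from the text; the pairing of each angle with each result is an assumption.
VIEWING_DISTANCE_MM = 600.0  # 60 cm eye-to-screen distance
SCREEN_WIDTH_MM = 293.4      # longer side of the tablet display
SCREEN_WIDTH_PX = 1440       # pixels along that side


def arc_length_mm(angle_deg, distance_mm=VIEWING_DISTANCE_MM):
    """Arc length L = omega * r, with omega converted from degrees to radians."""
    return math.radians(angle_deg) * distance_mm


def mm_to_pixels(length_mm):
    """Convert a physical on-screen length to pixels along the screen width."""
    return length_mm * SCREEN_WIDTH_PX / SCREEN_WIDTH_MM


dot_mm = arc_length_mm(0.5)                        # about 5.236 mm
dot_px = mm_to_pixels(dot_mm)                      # about 25.7 pixels
offset_px = mm_to_pixels(arc_length_mm(22.2)) / 2  # about 570 pixels
```

Under these assumptions the dot size rounds down to the 25 pixels used in the task, and each target sits roughly 570 pixels from the central point.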

To minimize the contrast between the background and the stimuli (dots and crosses), we opted against using a pure black RGB [0, 0, 0] and instead used a yellowish hue [252, 248, 230] to provide a soft contrast and a more user-friendly viewing experience over an extended period, if necessary. To minimize potential confounding variables, we utilized the most prominent colors (19) that were used in previous studies—red [255, 0, 0] and green [0, 255, 0]—to indicate antisaccade and prosaccade trials, respectively.

Eye tracking test (18)

The participants were instructed to not consume any medications for ADHD treatment (if any) prior to the eye tracking test in order to minimize confounding effects. This was confirmed by verifying with the child’s guardian prior to the task. During the eye tracking test, the participants were seated 60–80 cm away from the screen in a well-lit room (between 800 and 1200 lux) (20) and instructed by the research staff to keep their head and body as still as possible. Visual stimuli were delivered via the tablet with a 13-inch display and a resolution of 1440 × 900. The viewing distance was adjusted by the participant until both eyebrows and the chin were clearly visible on the screen. The participants then completed an eye fixation task, 14 prosaccade trials, and 14 antisaccade trials during the test.

Calibration was performed at the beginning of each trial, during which the participants were required to fixate on the center cross for a minimum of 500 ms. During the eye fixation task, the participants must maintain their eye gaze on the target, which was 30 s long. The participants then had to complete 14 prosaccade and 14 antisaccade trials (presented randomly), which took about 5 min. The whole process should be completed in about 6 min. Only participants who completed at least three prosaccade and three antisaccade trials were included in this study.

During the prosaccade and antisaccade trials, the participants were presented first with the current trial number and the number of subsequent trials to be followed in the center of the screen for 1000 ms (Supplementary Figure 2). Following that, a central cross appeared at the center of the screen for a period of 1500–2500 ms. This was followed by a 200-ms gap, when the central cross disappeared. A central fixation point then reappeared with a target on either the left or right of the fixation point for 3000 ms. A green central fixation point indicated a prosaccade trial, and the participants needed to quickly shift their gaze to the target. A red central fixation point indicated an antisaccade trial, and the participants needed to shift their gaze opposite of the target. After the central fixation point and target disappeared, a blank screen appeared for 1000 ms before the next trial was presented with the next trial number. The presentation of the prosaccade and antisaccade trials as well as the lateral side of the target were randomized.
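The trial sequence described above can be sketched as a simple schedule generator. This is an illustrative reconstruction from the stated timings, not the software used in the study; the function name, the field names, and the choice to draw the target side independently for each trial are assumptions.

```python
import random


def build_trial_schedule(n_per_type=14, seed=None):
    """Return a randomized list of prosaccade and antisaccade trials with phase timings."""
    rng = random.Random(seed)
    kinds = ["prosaccade"] * n_per_type + ["antisaccade"] * n_per_type
    rng.shuffle(kinds)  # trial types are presented in random order
    schedule = []
    for number, kind in enumerate(kinds, start=1):
        schedule.append({
            "trial": number,
            "kind": kind,
            # Green fixation point: look toward the target; red: look away from it.
            "cue_color": "green" if kind == "prosaccade" else "red",
            "target_side": rng.choice(["left", "right"]),
            "phases_ms": [
                ("trial_number", 1000),
                ("central_cross", rng.randint(1500, 2500)),
                ("gap", 200),
                ("fixation_and_target", 3000),
                ("blank", 1000),
            ],
        })
    return schedule


schedule = build_trial_schedule(seed=0)
```

Each trial in this sketch lasts between 6.7 and 7.7 s, depending on the randomly drawn duration of the central cross.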

A successful trial was characterized by the collection of reliable and valid data by the camera, which could then be subjected to AI-based analysis. Trials could fail for various reasons. For instance, the whole face may not have been completely captured by the camera, so the model could not register the person’s facial features. If a participant failed to keep their gaze on the fixation point during the initial calibration period, the model was likewise unable to calculate eye-tracking points. Similarly, when the tablet was moved without the teacher’s consent and exposed to bright outdoor lighting, the face in the foreground became too dark to be recognized by the detection algorithm. As such, we established a criterion of having to complete three trials of each type to be included in the study, which corresponds to approximately 20% of the 14 trials of each type.
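The inclusion rule can be written as a small check. The function name is hypothetical, and the counts passed to it would be the numbers of successful trials of each type for one participant.

```python
TRIALS_PER_TYPE = 14
MIN_VALID_TRIALS = 3


def meets_trial_criterion(valid_prosaccade, valid_antisaccade, minimum=MIN_VALID_TRIALS):
    """True when a participant has enough successful trials of both types."""
    return valid_prosaccade >= minimum and valid_antisaccade >= minimum


# Three of fourteen trials is about 21%, in line with the "approximately 20%" in the text.
minimum_fraction = MIN_VALID_TRIALS / TRIALS_PER_TYPE
```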

Definitions of parameters measured

The parameters measured during the eye tracking test and their definitions are presented in Table 1. Briefly, these parameters measure how fast the participants respond as well as their attention, accuracy of performance, and ability to correct erroneous trials.

Table 1. The parameters measured during the eye tracking test and their definitions.

Data analysis

The participants were first stratified into two groups: TD group and children with ADHD (ADHD group). The participants were sorted into the TD group if they did not meet the diagnostic criteria for ADHD based on DSM-5.

We used GraphPad Prism® software (V8.0.0 for Mac, GraphPad Software, San Diego, CA, USA) and JASP to perform the statistical analyses. Two-way multivariate analyses of variance (MANOVA) were performed, whereby the participants were grouped based on two independent variables [diagnosis: TD or ADHD; and age: <10 (“young” group) or ≥10 years old (“older” group)]. We used central tendency measures to analyze the data. Dependent variables including the saccade latency and velocity were then analyzed for both the antisaccade and prosaccade trials according to the independent variables. The fixation period and the time spent in the center of the screen were also analyzed. If a main effect on a dependent variable was significant, we performed post-hoc analyses using the Tukey-Kramer test. If the interaction was significant, we followed up with a simple effect analysis. A p-value < 0.05 was considered statistically significant.

Results

Participant demographics

The 437 participants enrolled into this study were divided into two groups: the TD group, who did not meet the diagnostic criteria for ADHD based on DSM-5 (n = 325), and the ADHD group (n = 112) (Table 2). All parameters measured in the fixation, prosaccade and antisaccade tasks are summarized in Table 3.

Table 2. Demographic characteristics of the participants.

Table 3. Parameters measured during the eye tracking task, stratified according to TD or ADHD.

Fixation task

To understand how diagnosis and age groups affected performance on the fixation task, we performed two-way MANOVA on the parameters measured during the task. The two-way MANOVA showed a significant multivariate group effect of both diagnosis [F(2, 432) = 8.231, ***p < 0.001; Wilks’ Λ = 0.963] and age [F(2, 432) = 3.999, *p = 0.019; Wilks’ Λ = 0.982] on the overall performance and the ability to maintain attention during the task. However, the interaction between diagnosis and age groups was not significant [F(2, 432) = 0.691, p = 0.501], suggesting that age might contribute to performance independently of diagnosis group, and vice versa.
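The relationship between the reported Wilks’ Λ values and F statistics can be illustrated with a short sketch. It relies on the standard result that, for an effect with a single hypothesis degree of freedom (as with a two-level factor), Wilks’ Λ converts exactly to an F statistic. The error degrees of freedom of 433 (437 participants minus four diagnosis-by-age cells) is an assumption, and the function name is hypothetical.

```python
def wilks_to_f(wilks_lambda, n_dependent, error_df):
    """Exact F conversion for an effect with one hypothesis degree of freedom."""
    df1 = n_dependent
    df2 = error_df - n_dependent + 1
    f_value = (1.0 - wilks_lambda) / wilks_lambda * df2 / df1
    return f_value, df1, df2


ERROR_DF = 437 - 4  # assumed: participants minus the four diagnosis-by-age cells

# Fixation task: two dependent variables, Wilks' lambda as reported in the text.
f_diagnosis, df1, df2 = wilks_to_f(0.963, n_dependent=2, error_df=ERROR_DF)
f_age, _, _ = wilks_to_f(0.982, n_dependent=2, error_df=ERROR_DF)
```

With these inputs the degrees of freedom come out as (2, 432), and the F values are about 8.3 and 4.0, which agree with the reported 8.231 and 3.999 to within the rounding of Λ to three decimals.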

Univariate analyses revealed that diagnosis group had a significant effect on parameters measuring visual steadiness during the fixation task [number of fixation intervals: F(1, 433) = 7.822, **p = 0.005, ω2 = 0.015; central area fixation duration: F(1, 433) = 10.82, **p = 0.001, ω2 = 0.022]. However, age group only had a significant effect on the parameters measuring fixation duration on the central area [the number of fixation intervals: p = 0.244; central area fixation duration: F(1, 433) = 4.565, *p = 0.033, ω2 = 0.008]. This indicated that the ability to remain on task was dependent on diagnosis group, while the ability to maintain attention was dependent on both diagnosis and age groups.

Post-hoc Tukey’s test on the number of fixation intervals showed that the ADHD group exhibited a greater occurrence of saccades compared to the TD group during the fixation task (Cohen’s d = 0.306, 95% CI [0.090, 0.523], **p = 0.005, Figure 1A). The TD group also had a longer central area fixation duration than the ADHD group (Cohen’s d = 0.361, 95% CI [0.144, 0.577], ***p < 0.001, Figure 1B), while younger participants also had shorter central area fixation duration (Cohen’s d = −0.234, 95% CI [−0.450, −0.018], *p = 0.033, Figure 1B).

Figure 1. Number of intervals and central area fixation duration during simple eye fixation tasks, stratified according to the diagnosis and age. Data shown are expressed as the mean ± SD, n = 437. (A) Two-way ANOVA showed the main effect of the diagnosis on the number of intervals compared to the TD group; the ADHD group performed significantly more saccades in the older participants, *p = 0.017. (B) Two-way ANOVA showed the main effect of diagnosis and age on the central area fixation duration; the younger participants with ADHD spent significantly less time on the central area than the TD group, *p = 0.031.

To further understand the relationship between diagnosis and age groups on fixation task performance, we performed correlation analysis and found that participants with higher SNAP-IV scores were more likely to have a shorter central fixation duration and a higher number of fixation intervals (r = −0.160, 95% CI [−0.250, −0.067], ***p < 0.001). We then constructed regression models with diagnosis group as a factor, age as a covariate, and central area fixation duration or the number of fixation intervals as the dependent variable. TD participants were predicted to have an average of 2766.8 ms (adjusted r2 = 0.033, 95% CI [1095.3, 4438], **p = 0.001) longer duration on the central area compared to participants with ADHD. Similarly, every one-year increase in age predicted an average increase of 562.6 ms (95% CI [109.5, 1016], *p = 0.015) in central area fixation duration.
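The confidence interval for a correlation coefficient can be sketched as follows. The text does not state how the intervals were obtained; this example assumes the common Fisher z transformation with n = 437, and the function name is hypothetical.

```python
import math


def pearson_ci(r, n, z_crit=1.959964):
    """Approximate 95% confidence interval for Pearson's r via the Fisher z transformation."""
    z = math.atanh(r)                # Fisher z transform of r
    se = 1.0 / math.sqrt(n - 3)      # standard error on the z scale
    return math.tanh(z - z_crit * se), math.tanh(z + z_crit * se)


# Correlation between SNAP-IV score and fixation performance, as reported in the text.
low, high = pearson_ci(-0.160, 437)
```

Under this assumption the interval for r = −0.160 runs from about −0.250 to −0.067, so both bounds are negative, as expected for a significant negative correlation.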

Prosaccade task

Two-way MANOVA showed a significant multivariate effect of age group on the performance on the prosaccade task, which mainly measured the responses to reflexive eye movements [age group: F(16, 418) = 3.847, ***p < 0.001; Wilks’ Λ = 0.872]. The multivariate effect of diagnosis group was not significant and, similar to the fixation task, there was no significant interaction between diagnosis and age groups [diagnosis group: F(16, 418) = 1.103, p = 0.350, Wilks’ Λ = 0.959; interaction: F(16, 418) = 1.393, p = 0.141, Wilks’ Λ = 0.949].

Univariate analyses revealed that age had a significant effect on the parameters measured in the prosaccade task [mean of saccade latency: F(1, 433) = 19.01, ***p < 0.001, ω2 = 0.040; median of saccade latency: F(1, 433) = 18.56, ***p < 0.001, ω2 = 0.039; adjusted accuracy: F(1, 433) = 7.488, **p = 0.006, ω2 = 0.015; median of peak velocity: F(1, 433) = 6.770, **p = 0.010, ω2 = 0.013]. Diagnosis group, however, only had a marginally significant effect on the adjusted prosaccade accuracy [adjusted accuracy: F(1, 433) = 3.551, p = 0.060, ω2 = 0.006]. This suggested that performance on the prosaccade task was dependent on age.

Post-hoc Tukey’s test showed that younger participants had longer saccade latencies than the older participants, both for the mean (mean of prosaccade latency: Cohen’s d = 0.478, 95% CI [0.260, 0.696], ***p < 0.001; Figure 2A) and for the median (median of prosaccade latency: Cohen’s d = 0.472, 95% CI [0.254, 0.690], ***p < 0.001, Figure 2B). Post-hoc Tukey’s test on the mean prosaccade correction latency also demonstrated that the older participants in the TD group had a shorter mean correction latency compared to the younger participants (Cohen’s d = −0.362, 95% CI [−0.658, −0.066], **p = 0.007). While older participants had a longer mean correction latency compared to the younger participants within the ADHD group, this difference was non-significant (Cohen’s d = 0.092, 95% CI [−0.410, 0.593]; p = 0.963, Figure 2C).

Figure 2. The parameters measured during the prosaccade trials during the eye tracking test and stratified according to the diagnosis and age. Data are represented as the mean ± SD, n = 437. (A) Two-way ANOVA showed a longer mean (*p = 0.019) and (B) median (*p = 0.019) latency in the younger participants than in the older participants in the ADHD group. (C) The mean (**p = 0.007) correction latency was longer in the younger participants compared to the older participants in the TD group. (D) The adjusted accuracy was lower in the ADHD participants in the younger group compared to the older group (*p = 0.028), while among the TD participants, the younger group had a lower adjusted accuracy than the older group (*p = 0.039). (E) The median peak velocity was higher in the younger participants compared to the older participants in the TD group (***p < 0.001).

Adjusted accuracy was higher in the older group compared to the younger group (Cohen’s d = 0.300, 95% CI [0.084, 0.516], **p = 0.006, Figure 2D), and was lower in the ADHD group compared to the TD group, although this difference did not reach significance (Cohen’s d = −0.206, 95% CI [−0.422, 0.009], p = 0.060, Figure 2D). When comparing peak velocity on the task, the younger group had faster saccades compared to the older group (median: Cohen’s d = 0.285, 95% CI [0.069, 0.501], *p = 0.010, Figure 2E).

We then performed correlation analysis using the parameters measured in the prosaccade task and SNAP-IV score, and found that participants with a higher SNAP-IV score had poorer scores on the adjusted prosaccade accuracy (r = −0.120, 95% CI [−0.211, −0.026], *p = 0.012), while the other parameters did not have significant correlations. Using diagnosis group as a factor, age as a covariate and adjusted prosaccade accuracy as the dependent variable, TD subjects had a predicted accuracy that was on average 1.5% higher (adjusted r2 = 0.030, 95% CI [−0.1, 3%]) on the prosaccade task compared to the ADHD group (p = 0.068). Every one-year increase in age predicted an average increase of 0.8% (95% CI [0.3, 1.2%], ***p < 0.001) in adjusted accuracy. This indicated that adjusted accuracy on the prosaccade task is more dependent on age.

Antisaccade task

Two-way MANOVA revealed a significant multivariate effect of both diagnosis and age groups on the performance on the antisaccade task, which tests voluntary eye movement responses and inhibition abilities [age group: F(16, 418) = 4.508, ***p < 0.001; Wilks’ Λ = 0.853; diagnosis group: F(16, 418) = 1.738, *p = 0.038; Wilks’ Λ = 0.938]. The interaction between diagnosis and age group was also significant [F(16, 418) = 4.508, *p = 0.027; Wilks’ Λ = 0.935].

Univariate analysis showed that there was a significant effect of age on the parameters measured on the antisaccade task [adjusted accuracy: F(1, 433) = 3.926, *p = 0.048, ω2 = 0.007; mean of correction latency: F(1, 433) = 15.50, ***p < 0.001, ω2 = 0.031; median of correction latency: F(1, 433) = 11.71, ***p < 0.001, ω2 = 0.023]. Diagnosis group similarly had a significant effect on several parameters [median of initial amplitude: F(1, 433) = 4.758, *p = 0.030, ω2 = 0.009; median of peak velocity: F(1, 433) = 4.186, *p = 0.041, ω2 = 0.007; mean of correction latency: F(1, 433) = 6.089, *p = 0.014, ω2 = 0.011; median of correction latency: F(1, 433) = 7.138, **p = 0.008, ω2 = 0.013].

Post-hoc Tukey’s test showed that the adjusted accuracy on the antisaccade task was higher in the older group compared to the younger group (Cohen’s d = 0.217, 95% CI [0.001, 0.433], *p = 0.048, Figure 3A). Initial amplitude median was higher in the ADHD group compared to the TD group, in the younger participants (Cohen’s d = 0.239, 95% CI [0.023, 0.455], *p = 0.030, Figure 3B). Median peak velocity, on the other hand, was higher in the ADHD group compared to the TD group within the older group (Cohen’s d = 0.224, 95% CI [0.008, 0.440], *p = 0.041, Figure 3C). However, median correction latency was higher in the ADHD group compared to the TD group (median: Cohen’s d = 0.293, 95% CI [0.077, 0.509], **p = 0.008, Figure 3D), indicating that the ADHD group required more time to detect and correct errors. Younger participants also required more time to correct errors compared to the older participants (median: Cohen’s d = 0.375, 95% CI [0.158, 0.592], ***p < 0.001, Figure 3E).

Figure 3. Parameters measured during the antisaccade trials during the eye tracking test and stratified according to the diagnosis and age. Data are represented as the mean ± SD, n = 437. (A) The adjusted accuracy was lower in the younger participants in the TD group compared to the older group (*p = 0.020). (B) The median initial amplitude was higher in the ADHD group compared to the TD group among the younger participants (*p = 0.010). (C) The median peak velocity was higher in the ADHD participants compared to the TD participants in the older group (*p = 0.406). (D) The median correction latency was higher in the ADHD group compared to the TD group (**p = 0.003). The median correction latency was also higher in the younger participants compared to the older participants (***p < 0.001). (E) The mean correction latency was higher in the ADHD group compared to the TD group (*p = 0.014). The mean correction latency was also higher in the younger participants compared to the older participants (***p < 0.001).

We then performed correlation analysis using the parameters from the antisaccade task and SNAP-IV score, and found that the initial amplitude and correction latency were positively correlated with SNAP-IV scores (median of initial amplitude: r = 0.105, 95% CI [0.011, 0.197], *p = 0.028; mean of initial amplitude: r = 0.094, 95% CI [0.000, 0.186], *p = 0.049; median of correction latency: r = 0.181, 95% CI [0.089, 0.270], ***p < 0.001; mean of correction latency: r = 0.146, 95% CI [0.053, 0.147], **p = 0.002). Accuracy on the task, however, was negatively correlated with the SNAP-IV score, indicating that participants with higher SNAP-IV scores were likely to have lower accuracy (accuracy: r = −0.105, 95% CI [−0.197, −0.011], *p = 0.029; adjusted accuracy: r = −0.108, 95% CI [−0.200, −0.015], *p = 0.024).

Using each measure that correlated with SNAP-IV score as the dependent variable, we constructed regression models with diagnosis group as a factor and age as a covariate. We observed a reduction of 1.676 in the median initial amplitude for the TD group compared to the ADHD group, and a reduction of 3.059 in the mean initial amplitude for the TD group compared to the ADHD group (median of initial amplitude: 95% CI [0.140, 3.213], *p = 0.033; mean of initial amplitude: 95% CI [0.312, 6.430], *p = 0.075). These models, however, were not significant (median of initial amplitude: p = 0.509; mean of initial amplitude: p = 0.497).

Correction latency decreased with every one-year increase in age, by an average of 28.115 for the median and 28.844 for the mean (median of correction latency: 95% CI [19.299, 36.930], ***p < 0.001; mean of correction latency: 95% CI [20.470, 37.217], ***p < 0.001). Correction latency was also longer in the ADHD group compared to the TD group, by an average of 43.841 for the median and 38.359 for the mean (median of correction latency: 95% CI [11.322, 76.361], **p = 0.008; mean of correction latency: 95% CI [7.471, 69.248], *p = 0.015). These results suggested that correction latency depended strongly on both age and diagnosis group.

With every year increase in age, there was also an increase of accuracy by 1.7%, and increase of adjusted accuracy of 1.8% (accuracy: 95% CI [0.8%, 2.7%], ***p < 0.001; adjusted accuracy: 95% CI [1.1, 2.6%], ***p < 0.001). This was, however, not significant for diagnosis groups (accuracy: p = 0.132; adjusted accuracy: p = 0.538), indicating that accuracy was more dependent on age.

Discussion

In this study, TD children and ADHD children completed an eye tracking task comprising a simple fixation task, prosaccade trials, and antisaccade trials. This was delivered via a tablet with AI software that we had developed, allowing the measurements of attention, reflexive gaze toward a target, and voluntary gaze away from a target that requires inhibitory control, respectively (13, 21). We found that during the simple fixation task, the ADHD participants exhibited more saccades compared to the TD participants, resulting in a lower amount of time spent fixating on the central area. The younger participants in general also had a shorter fixation duration compared to the older participants. During the prosaccade trials of the eye tracking task, age was a contributing factor to the differences in latency and peak velocity. The younger participants had a slower reaction time, although their velocity was faster. The lack of a significant main effect of diagnosis and of an interaction between age and diagnosis for prosaccade trials was unexpected; however, detecting differences in prosaccade performance across diagnosis and age groups may require a higher level of sensitivity.

During the antisaccade trials, the participants with ADHD had higher amplitudes and a slower correction latency, requiring more time to correct errors and redirect saccades to the correct direction. Age similarly contributed to a slower saccade latency in the younger participants. While the correction latency during the prosaccade trials was dependent on the age of the participants, the number of errors made during the prosaccade trials was higher in the younger ADHD group. In addition, we observed that the ADHD group required a longer duration than the TD group to correct an erroneous saccade during the antisaccade task, and this again was more prominent in the younger cohort. Our present study also found a significant interaction between age and diagnosis in antisaccade trials, suggesting that antisaccade trials might be robust at discriminating between ADHD and TD groups, while also detecting developmental changes across age. Antisaccade performance is thus a promising metric for screening ADHD symptoms among children across different age ranges.

Our findings from this study initially suggested that age was a major contributor to differences in the performance on the eye tracking tasks, with the younger participants generally having a shorter fixation duration, more intrusive saccades, a lower adjusted accuracy, and a longer correction latency. The simple fixation task and antisaccade trials rely on focusing attention and on inhibitory control over saccades, both of which require the frontal cortex (22, 23). Given that older children have a more mature frontal cortex, it was expected that the older children in this study would perform better on these measures. This is also consistent with previous reports demonstrating that the number of intrusive saccades during simple fixation is reduced as children age and that their performance on antisaccade tasks improves after the age of 10 years (24, 25).

Diagnosis was also a factor that influenced performance on the eye tracking tasks. In particular, the ADHD group, with inattention, hyperactivity, and impulsivity tendencies, had a shorter fixation duration and more intrusive saccades, made more errors, and required more time to correct erroneous saccades. These findings have also been observed in other studies in which participants with ADHD incurred more errors during antisaccade trials, a pattern that can persist into adulthood. The poorer performance of the ADHD participants during the antisaccade trials compared to the prosaccade trials also suggests that frontal cortex inhibitory control in these participants was less developed (26). Altogether, this result supports the notion that it might be possible to utilize the simple fixation and antisaccade eye tracking tasks to identify ADHD-like symptoms by measuring the fixation duration, the number of saccades during simple fixation, the number of errors, and the latency to correct erroneous saccades.

Finally, this study also demonstrated the robustness of the AI model that we developed for the test, which was able to successfully discriminate the eye movements of the TD and ADHD groups. As the robustness of an AI model is well known to depend heavily on the data it is trained on, there is still room to improve our current AI model (27). To increase the generalizability of the model and to enhance its tolerance of within-group variance among children, the model should be trained on a dataset comprising children across different age groups. Such an approach will ensure that the model is not biased toward a specific age group and will enable it to perform better on a wider range of data. By using a diverse dataset, the model will also be able to recognize patterns and features that are common across different age groups and will be less prone to overfitting. Ultimately, this will lead to more accurate and reliable gaze estimation.

Nevertheless, there are some limitations to our study that must be addressed. For example, our results might have been affected by the short fixation time (30 s) and the limited number of trials (14 prosaccade and 14 antisaccade trials), which might not have been sufficient to detect significant changes. Additionally, we included participants who had completed at least three prosaccade and three antisaccade trials, which might not have been sufficient for analyses. An achievable solution would be to sample at least 25 successful trials, as demonstrated by other researchers (28). In addition, a future experiment with more participants should be performed to determine whether the same parameters can differentiate TD children from those with ADHD subtypes such as combined, hyperactive/impulsive, and inattentive ADHD.
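The inclusion rule discussed above can be sketched as a simple filter. This is a minimal illustration rather than the study's actual pipeline: the record layout `(participant_id, trial_type, completed)` and the function name `eligible_participants` are assumptions made here for the example.

```python
from collections import Counter


def eligible_participants(trials, min_per_type=3):
    """Return sorted IDs of participants with enough completed trials of each type.

    `trials` is an iterable of (participant_id, trial_type, completed) tuples,
    where trial_type is "prosaccade" or "antisaccade" (hypothetical layout).
    """
    completed_counts = Counter()
    participant_ids = set()
    for participant_id, trial_type, completed in trials:
        participant_ids.add(participant_id)
        if completed:
            completed_counts[(participant_id, trial_type)] += 1
    return sorted(
        pid
        for pid in participant_ids
        if completed_counts[(pid, "prosaccade")] >= min_per_type
        and completed_counts[(pid, "antisaccade")] >= min_per_type
    )


if __name__ == "__main__":
    demo = (
        [("a", "prosaccade", True)] * 3
        + [("a", "antisaccade", True)] * 3
        + [("b", "prosaccade", True)] * 3
        + [("b", "antisaccade", True)] * 2
        + [("b", "antisaccade", False)]
    )
    print(eligible_participants(demo))  # participant "b" has only 2 completed antisaccade trials
```

Raising `min_per_type` to 25 expresses the stricter criterion suggested above. Because the task described here contains only 14 trials of each type, that threshold could only be met with a longer protocol.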

As this task required both attention and judgment, performance may vary with the time of day at which the test is taken. These differences could stem from physical factors, such as fatigue after exercise during the day, or mental factors, such as mental exhaustion after a long day of class. Hunger as lunch time approaches may also affect performance. Although the actual test only lasts 6 min, administrative and logistic factors may contribute to a longer test time. As such, performing tests within a stipulated time window of the day may be useful for future studies. Finally, as the parameters of the stimuli used in the eye tracking test have been optimized for a 13-inch tablet, we plan to refine the test algorithm to allow it to be administered on screens of other dimensions.

Here, we successfully developed AI-based eye tracking technology that can be operated on a tablet, without the need for sophisticated equipment, to detect ADHD symptoms in individuals. In China, where accessibility to professional pediatric psychiatric services is limited, this technology can be easily implemented in schools to identify students at risk of ADHD. Importantly, the results provided are objectively quantifiable, thus allowing students to receive intervention earlier and additional help to cope in school. Moreover, it would allow parents to understand their child better and improve their quality of life. Besides its utility in screening children, it would be worthwhile to explore in future studies whether it is a useful tool for objectively assessing if a child’s ADHD symptoms have improved after introducing either behavioral or pharmacological interventions.

Data availability statement

The raw data supporting the conclusions of this article will be made available by the authors, without undue reservation.

Ethics statement

The studies involving humans were approved by the Medical Ethics Committee of West China Second University Hospital of Sichuan University. The studies were conducted in accordance with the local legislation and institutional requirements. The participants provided their written informed consent to participate in this study.

Author contributions

XC: Conceptualization, Data curation, Funding acquisition, Writing – original draft. SW: Data curation, Formal analysis, Methodology, Writing – original draft. XY: Data curation, Formal analysis, Methodology, Writing – original draft. CY: Data curation, Formal analysis, Methodology, Writing – original draft. FN: Data curation, Formal analysis, Methodology, Writing – original draft. JYa: Data curation, Formal analysis, Methodology, Writing – original draft. YT: Data curation, Formal analysis, Methodology, Writing – original draft. JYe: Formal analysis, Methodology, Writing – review and editing. HL: Formal analysis, Methodology, Writing – review and editing. RL: Conceptualization, Funding acquisition, Project administration, Supervision, Writing – review and editing.

Funding

The author(s) declare financial support was received for the research, authorship, and/or publication of this article. This work was supported by the Popularization and Application Project of Sichuan Provincial Health Commission (21PJ049) and Sichuan Provincial Department of Science and Technology Regional Innovation Cooperation Project (No. 2020YFQ0021).

Acknowledgments

We thank Xuejing Chen for her technical support.

Conflict of interest

SW was employed by NeuroWeave Co., Ltd., Shanghai.

The remaining authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.

Publisher’s note

All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article, or claim that may be made by its manufacturer, is not guaranteed or endorsed by the publisher.

Supplementary material

The Supplementary Material for this article can be found online at: https://www.frontiersin.org/articles/10.3389/fpsyt.2023.1260031/full#supplementary-material

Supplementary Figure 1 | The two distinct tasks are illustrated here: (A) the simple fixation task, and (B) the prosaccade/antisaccade task. A hybrid approach was implemented by combining the Gap paradigm and the Overlap paradigm. The task consisted of 28 trials, counterbalanced between 14 prosaccade (seven with target on the left) and 14 antisaccade (seven with target on the left) conditions. Participants were presented the trials in a randomized order. The Gap denoted the 200 ms period following the initial fixation, while the Overlap involved a central fixation point appearing alongside the lateral target. The difference between the prosaccade and antisaccade tasks lay in the eye movement response relative to the lateral target, which was cued by the color of the central point. A green dot indicated a prosaccade task, and a red dot indicated an antisaccade task. This is a royalty-free image obtained from https://www.freeimages.com/photo/handsome-male-traveler-1637362.
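The counterbalanced design described in this caption can be sketched as follows. This is an illustrative reconstruction, not the software used in the study: the function name `build_trials` and the dictionary fields are hypothetical, and the sketch does not model how the Gap and Overlap timings were assigned to trials, since the caption does not specify it.

```python
import random


def build_trials(seed=None):
    """Build 28 trials: 14 prosaccade and 14 antisaccade, 7 per target side, shuffled."""
    rng = random.Random(seed)
    opposite = {"left": "right", "right": "left"}
    trials = []
    # Central dot color cues the task: green = prosaccade, red = antisaccade.
    for task, cue_color in (("prosaccade", "green"), ("antisaccade", "red")):
        for target_side in ("left", "right"):
            for _ in range(7):
                # Correct gaze is toward the target (prosaccade) or away from it (antisaccade).
                correct_gaze = target_side if task == "prosaccade" else opposite[target_side]
                trials.append(
                    {
                        "task": task,
                        "cue_color": cue_color,
                        "target_side": target_side,
                        "correct_gaze": correct_gaze,
                    }
                )
    rng.shuffle(trials)  # randomized presentation order
    return trials


if __name__ == "__main__":
    for trial in build_trials(seed=0)[:5]:
        print(trial)
```

Passing a fixed `seed` makes the order reproducible, which can help when comparing sessions; leaving it as `None` gives a fresh random order each time.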

Supplementary Figure 2 | Overview of our eye tracking CNN. Inputs include the right eye, the left eye, and a face image clipped from the original frame. A 1 × 6 vector represents the coordinates of the locations of the eyes and face. The output is the predicted gaze location in pixels. ImageNet: the ImageNet Large-Scale Visual Recognition Challenge (ILSVRC) is an annual event to showcase and challenge computer vision models. VGG16: VGG16 is a type of CNN with 16 weight layers (approximately 138 million trainable parameters). The VGG16 structure contains fully connected layers, but here we only used the convolution layers of VGG16.
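To give a sense of scale for using only the convolution layers, the sketch below counts the parameters of the standard VGG16 configuration (3 × 3 kernels with biases, a 224 × 224 RGB input, and a 1000-class ImageNet head). These are properties of the published VGG16 architecture, assumed here for illustration; the sketch does not describe the exact network trained in this study, and the helper names are hypothetical.

```python
# Standard VGG16 layout: numbers are output channels, "M" marks a 2x2 max-pool.
VGG16_CONV = [64, 64, "M", 128, 128, "M", 256, 256, 256, "M",
              512, 512, 512, "M", 512, 512, 512, "M"]


def conv_params(cfg, in_channels=3, kernel=3):
    """Count weights and biases of the convolution layers in `cfg`."""
    total = 0
    for out_channels in cfg:
        if out_channels == "M":
            continue  # pooling layers have no trainable parameters
        total += in_channels * kernel * kernel * out_channels + out_channels
        in_channels = out_channels
    return total


def fc_params(sizes):
    """Count weights and biases of a chain of fully connected layers."""
    return sum(n_in * n_out + n_out for n_in, n_out in zip(sizes, sizes[1:]))


if __name__ == "__main__":
    conv = conv_params(VGG16_CONV)
    # Five poolings reduce 224x224 to 7x7 with 512 channels before the classifier.
    fc = fc_params([512 * 7 * 7, 4096, 4096, 1000])
    print(conv)       # 14714688
    print(fc)         # 123642856
    print(conv + fc)  # 138357544
```

Under these assumptions, the convolution layers account for about 14.7 million of the roughly 138 million parameters, so dropping the fully connected layers removes most of the parameter count.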

Supplementary Figure 3 | The AI model used was implemented based on the application program on the tablet. The Relay Centre in the figure represents the eye movement assessment software installed on the Lenovo Yoga 13-inch tablet, serving as a relay station for both efferent (blue lines) information, including presenting visual stimuli information to the subject and transmitting video and task parameter information to the cloud, and afferent (red lines) information, including receiving facial feature video information of the subject’s gaze point and eye movement parameter result information analyzed by the AI model.

References

1. Thomas R, Sanders S, Doust J, Beller E, Glasziou P. Prevalence of attention-deficit/hyperactivity disorder: a systematic review and meta-analysis. Pediatrics. (2015) 135:e994–1001. doi: 10.1542/peds.2014-3482

2. Wang T, Liu K, Li Z, Xu Y, Liu Y, Shi W, et al. Prevalence of attention deficit/hyperactivity disorder among children and adolescents in China: a systematic review and meta-analysis. BMC Psychiatry. (2017) 17:32. doi: 10.1186/s12888-016-1187-9

3. Harpin V. The effect of ADHD on the life of an individual, their family, and community from preschool to adult life. Arch Dis Child. (2005) 90:i2–7. doi: 10.1136/adc.2004.059006

4. Barkley R. Issues in the diagnosis of attention-deficit/hyperactivity disorder in children. Brain Dev. (2003) 25:77–83. doi: 10.1016/s0387-7604(02)00152-3

5. Gualtieri C, Johnson LG. ADHD: Is Objective Diagnosis Possible? Psychiatry. (2005) 2:44–53.

6. Seçen Yazıcı M, Serdengeçti N, Dikmen M, Koyuncu Z, Sandıkçı B, Arslan B, et al. Evaluation of p300 and spectral resolution in children with attention deficit hyperactivity disorder and specific learning disorder. Psychiatry Res Neuroimaging. (2023) 334:111688. doi: 10.1016/j.pscychresns.2023.111688

7. Huster R, Messel M, Thunberg C, Raud L. The P300 as marker of inhibitory control - Fact or fiction? Cortex. (2020) 132:334–48. doi: 10.1016/j.cortex.2020.05.021

8. Payen A, Chen M, Carter T, Kilmer R, Bennett J. Childhood ADHD, Going Beyond the Brain: A Meta-Analysis on Peripheral Physiological Markers of the Heart and the Gut. Front Endocrinol. (2022) 13:738065. doi: 10.3389/fendo.2022.738065

9. Gilbert D, Huddleston D, Wu S, Pedapati E, Horn P, Hirabayashi K, et al. Motor cortex inhibition and modulation in children with ADHD. Neurology. (2019) 93:e599–610. doi: 10.1212/wnl.0000000000007899

10. Emser T, Johnston B, Steele J, Kooij S, Thorell L, Christiansen H. Assessing ADHD symptoms in children and adults: evaluating the role of objective measures. Behav Brain Funct. (2018) 14:11. doi: 10.1186/s12993-018-0143-x

11. Harezlak K, Kasprowski P. Application of eye tracking in medicine: A survey, research issues and challenges. Comput Med Imaging Graph. (2018) 65:176–90. doi: 10.1016/j.compmedimag.2017.04.006

12. Liu Z, Yang Z, Gu Y, Liu H, Wang P. The effectiveness of eye tracking in the diagnosis of cognitive disorders: A systematic review and meta-analysis. PLoS One. (2021) 16:e0254059. doi: 10.1371/journal.pone.0254059

13. Bucci M, Seassau M, Larger S, Bui-Quoc E, Gerard C. Effect of visual attention on postural control in children with attention-deficit/hyperactivity disorder. Res Dev Disabil. (2014) 35:1292–300. doi: 10.1016/j.ridd.2014.03.029

14. Caldani S, Bucci M, Lamy J, Seassau M, Bendjemaa N, Gadel R, et al. Saccadic eye movements as markers of schizophrenia spectrum: Exploration in at-risk mental states. Schizophr Res. (2017) 181:30–7. doi: 10.1016/j.schres.2016.09.003

15. Umesh S, Pant S, Padma S, Jana S, Vasudevan V, Murthy A, et al. A novel fiber Bragg grating system for eye tracking. J Adv Res. (2019) 16:25–34. doi: 10.1016/j.jare.2018.12.007

16. Weigle C, Banks D. Analysis of eye-tracking experiments performed on a Tobii T60. Proceedings of the SPIE. Washington, DC: (2008).

17. Krafka K, Khosla A, Kellnhofer P, Kannan H, Bhandarkar S, Matusik W, et al. Eye Tracking for Everyone. Proceedings of the 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR). New York, NY: (2016). doi: 10.1109/cvpr.2016.239

18. Bucci M, Stordeur C, Septier M, Acquaviva E, Peyre H, Delorme R. Oculomotor Abnormalities in Children with Attention-Deficit/Hyperactivity Disorder Are Improved by Methylphenidate. J Child Adolesc Psychopharmacol. (2017) 27:274–80. doi: 10.1089/cap.2016.0162

19. Balakrishnan G, Uppinakudru G, Girwar Singh G, Bangera S, Dutt Raghavendra A, Thangavel D. A comparative study on visual choice reaction time for different colors in females. Neurol Res Int. (2014) 2014:301473. doi: 10.1155/2014/301473

20. Juslén H, Wouters M, Tenner A. Lighting level and productivity: a field study in the electronics industry. Ergonomics. (2007) 50:615–24. doi: 10.1080/00140130601155001

21. Levantini V, Muratori P, Inguaggiato E, Masi G, Milone A, Valente E, et al. EYES Are The Window to the Mind: Eye-Tracking Technology as a Novel Approach to Study Clinical Characteristics of ADHD. Psychiatry Res. (2020) 290:113135. doi: 10.1016/j.psychres.2020.113135

22. Goto Y, Hatakeyama K, Kitama T, Sato Y, Kanemura H, Aoyagi K, et al. Saccade eye movements as a quantitative measure of frontostriatal network in children with ADHD. Brain Dev. (2010) 32:347–55. doi: 10.1016/j.braindev.2009.04.017

23. Ross R, Hommer D, Breiger D, Varley C, Radant A. Eye movement task related to frontal lobe functioning in children with attention deficit disorder. J Am Acad Child Adolesc Psychiatry. (1994) 33:869–74. doi: 10.1097/00004583-199407000-00013

24. Everling S, Fischer B. The antisaccade: a review of basic research and clinical studies. Neuropsychologia. (1998) 36:885–99. doi: 10.1016/s0028-3932(98)00020-7

25. Aring E, Grönlund M, Hellström A, Ygge J. Visual fixation development in children. Graefes Arch Clin Exp Ophthalmol. (2007) 245:1659–65. doi: 10.1007/s00417-007-0585-6

26. Munoz D, Armstrong I, Hampton K, Moore K. Altered control of visual fixation and saccadic eye movements in attention-deficit hyperactivity disorder. J Neurophysiol. (2003) 90:503–14. doi: 10.1152/jn.00192.2003

27. Weber M, Engert M, Schaffer N, Weking J, Krcmar H. Organizational capabilities for ai implementation—coping with inscrutability and data dependency in ai. Information Systems Frontiers. (2023) 25:1549–69.

28. Fernandez-Ruiz J, Hakvoort Schwerdtfeger R, Alahyane N, Brien D, Coe B, Munoz D. Dorsolateral prefrontal cortex hyperactivity during inhibitory control in children with ADHD in the antisaccade task. Brain Imaging Behav. (2020) 14:2450–63. doi: 10.1007/s11682-019-00196-3

Keywords: ADHD, AI eye-tracking technology, intrusive saccades, prosaccade, antisaccade

Citation: Chen X, Wang S, Yang X, Yu C, Ni F, Yang J, Tian Y, Ye J, Liu H and Luo R (2023) Utilizing artificial intelligence-based eye tracking technology for screening ADHD symptoms in children. Front. Psychiatry 14:1260031. doi: 10.3389/fpsyt.2023.1260031

Received: 17 July 2023; Accepted: 26 October 2023;
Published: 14 November 2023.

Edited by:

Li Yang, Peking University Sixth Hospital, China

Reviewed by:

Thiago P. Fernandes, Federal University of Paraíba, Brazil
Raquel Quimas Molina Da Costa, University of São Paulo, Brazil

Copyright © 2023 Chen, Wang, Yang, Yu, Ni, Yang, Tian, Ye, Liu and Luo. This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.

*Correspondence: Rong Luo, lrscu@scu.edu.cn
