- 1Mensia Technologies SA, Paris, France
- 2Child and Adolescent Psychiatry Department, Robert Debré Hospital, Paris, France
- 3GIPSA-Lab, Université Grenoble Alpes, CNRS, Grenoble-INP, Grenoble, France
Meta-analyses have been extensively used to evaluate the efficacy of neurofeedback (NFB) treatment for Attention Deficit/Hyperactivity Disorder (ADHD) in children and adolescents. However, each meta-analysis published in the past decade has contradicted the methods and results from the previous one, thus making it difficult to determine a consensus of opinion on the effectiveness of NFB. This works brings continuity to the field by extending and discussing the last and much controversial meta-analysis by Cortese et al. (1). The extension comprises an update of that work including the latest control trials, which have since been published and, most importantly, offers a novel methodology. Specifically, NFB literature is characterized by a high technical and methodological heterogeneity, which partly explains the current lack of consensus on the efficacy of NFB. This work takes advantage of this by performing a Systematic Analysis of Biases (SAOB) in studies included in the previous meta-analysis. Our extended meta-analysis (k = 16 studies) confirmed the previously obtained results of effect sizes in favor of NFB efficacy as being significant when clinical scales of ADHD are rated by parents (non-blind, p-value = 0.0014), but not when they are rated by teachers (probably blind, p-value = 0.27). The effect size is significant according to both raters for the subset of studies meeting the definition of “standard NFB protocols” (parents' p-value = 0.0054; teachers' p-value = 0.043, k = 4). Following this, the SAOB performed on k = 33 trials identified three main factors that have an impact on NFB efficacy: first, a more intensive treatment, but not treatment duration, is associated with higher efficacy; second, teachers report a lower improvement compared to parents; third, using high-quality EEG equipment improves the effectiveness of the NFB treatment. The identification of biases relating to an appropriate technical implementation of NFB certainly supports the efficacy of NFB as an intervention. The data presented also suggest that the probably blind assessment of teachers may not be considered a good proxy for blind assessments, therefore stressing the need for studies with placebo-controlled intervention as well as carefully reported neuromarker changes in relation to clinical response.
1. Introduction
Attention Deficit/Hyperactivity Disorder (ADHD) is a common childhood psychiatric disorder characterized by impaired attention and/or hyperactivity/impulsivity. Symptoms may persist in adulthood with clinical significance, which makes ADHD a life-long problem for many patients (2). The prevalence of ADHD is around 5% in school-aged children, thus affecting an estimated 2.5 million children in Europe (3). ADHD negatively impacts children's well-being, with many suffering from low self-esteem (4) and underachievement in school (5). Parents are equally affected, since the child's behavior is frequently attributed to bad parenting (6). From a societal point of view, ADHD also has a high financial impact: a 2013 survey in Europe estimated costs related to ADHD between 9, 860 and 14, 483 Euros per patient/year (7).
The diagnosis of ADHD primarily relies on questionnaire-based clinical evaluation (3), which can be supported by objective assessment metrics of executive function such as the Test of Variables of Attention (TOVA) (8), the Continuous Performance Test (CPT) (9), and the Sustained Attention to Response Task (SART) (10). Objective markers of brain function using electroencephalogram (EEG), functional Magnetic Resonance Imaging (fMRI), or Positron Emission Tomography (PET) are not considered to be useful for improving diagnosis at the individual level, but can help in differentiating groups of patients (11). In particular, different phenotypes of ADHD patients present with an increase in the EEG theta wave power (4–8Hz) and/or a decrease of EEG beta wave power (12–32Hz) in frontal areas, or a decrease in the EEG Sensorimotor Rhythm (SMR) power (13–15Hz) in the central area (12–15). A device using EEG to help clinicians more accurately diagnosis ADHD was cleared by the Food and Drug Administration (FDA) (16).
Psychostimulants are the most common treatment currently in use, and have proven to be efficacious (17, 18). However, their long-term effectiveness and side effects are still debated and form an active area of research (19–23). Moreover, ADHD children under medication commonly suffer from mild side effects such as loss of appetite and sleep disturbance, although serious adverse events are rare (18, 24). These drawbacks make some parents and clinicians reluctant to opt for such treatment, instead turning to non-pharmaceutical alternatives such as dietary changes (25) and behavioral therapy, which have been proven to be less efficacious (26).
Neurofeedback (NFB) is another non-pharmaceutical and non-invasive approach aiming at the reduction of ADHD symptoms (27–29). Shortly after the discovery of the brain's electric activity by Berger (30) and Durup and Fessard (31) demonstrated it could be voluntarily modulated, leading to a series of findings on the self-regulation of brain activity. The first indication of the therapeutic potential of brain activity operant-conditioning came 40 years later when Sterman et al. (32) found that training the SMR activity reduces the incidence of epileptic crisis in kerosene-exposed cats. The technique, then known as NFB, rapidly became the subject of investigation in various fields of neuropsychiatry including, most notably, ADHD (33–36).
NFB is a self-paced brain neuromodulation technique that represents brain activity in real-time using auditory or visual modulations, on which learning paradigms, such as operant conditioning (37) or voluntary control, can be applied. To deliver this intervention, neurophysiological time series are analyzed online in order to drive feedback applications such as serious games (38). The signal of interest should represent the activity of a population of neurons involved in attentional networks, which is translated into visual or auditory cues. The sensory feedback constitutes the rewards mechanism, promoting learning using, for instance, operant conditioning protocols (39). Operant conditioning enables neural plasticity, thus supporting the child in the task repetition (40), which is believed to result in long-lasting neuronal reorganization (41).
Several NFB protocols have been proposed and investigated for decreasing the symptoms of ADHD:
• protocols based on neural oscillations, using frequency-band power training: enhancing SMR (42), reducing theta (29) or enhancing beta (43), or a composite protocol such as enhancing beta while suppressing theta, also known as the Theta Beta Ratio (TBR) protocol (33, 44);
• protocols based on Slow Cortical Potentials (SCPs) training consisting of the regulation of cortical excitation thresholds by focusing on activity generated by external cues (45, 46);
• protocols to enhance Event-Related Potentials (ERPs): in particular, the amplitude of the P300 ERP can be considered as a specific neurophysiological marker of selective attention (47).
Moreover, NFB protocols can be personalized: some studies did not use the usual definitions of EEG band ranges but determined them thanks to the individual Alpha Peak Frequency (iAPF) (48), giving individualized NFB protocols (49–51).
NFB efficacy on the core symptoms of ADHD (inattention, hyperactivity, and impulsivity) has been the subject of several meta-analytic studies (26, 52–55). To date, studies have not reached a consensus on the efficacy of NFB; while Arns et al. (54) and Micoulaud-Franchi et al. (55) claim results in favor of its efficacy, especially on the inattention component highlighted by Micoulaud-Franchi et al. (55), other authors, such as Loo and Barkley (52), Lofthouse et al. (53), and Sonuga-Barke et al. (26) express their reservations, asking for further evidence from blind assessment.
The most recent meta-analysis addressing the efficacy of NFB was published by Cortese et al. (1), including a total of 13 Randomized Controlled Trials (RCTs). The results of this analysis are mixed: when based on parent assessments, which are not blind to treatment, they are significantly in favor of NFB, whereas when the evolution of symptoms is rated by teachers (considered as probably blind), the results are no longer significant. The authors concluded that further evidence from blind assessments is needed in order to support NFB as a treatment for ADHD symptoms. However, some of the choices made in this meta-analysis, which may have had an impact on the results, have since been debated by the community. Specifically, Micoulaud-Franchi et al. (56) criticized the use of an uncommon behavioral scale provided by Steiner et al. (57) for the teachers' assessments and the inclusion of a pilot study carried out by Arnold et al. (58).
As a result of these criticisms and the concurrent publication of new RCTs meeting Cortese et al.'s (1) inclusion criteria, we decided to update this meta-analysis and take the opportunity to investigate the impact of its controversial choices. While performing our investigation, we observed two shortcomings: the assumption that the difference between teacher and parent assessments can solely be explained by the placebo effect, and pooling together heterogeneous studies in terms of methodology and technical implementation. An interesting approach, albeit not commonly performed, to assess the NFB efficacy would be to analyze the specificity of the EEG changes with respect to trained neuromarkers (36). In our case, based on the data at our disposal, we used the technical and methodological heterogeneity of the NFB trials to our advantage rather than disadvantage by extending the previous work with a novel method, the Systematic Analysis of Biases (SAOB). Indeed, the NFB domain is characterized by clinical literature that is extremely heterogeneous: studies differ methodologically (for instance, random assignment and presence of a blind assessment), in the NFB implementation (for instance, number of sessions, session and treatment length, and type of protocol) as well as on the acquisition and processing of the EEG signal. Description and analysis of different types of NFB implementation was subject to several studies (59–62). However to the best of our knowledge, none of these studies has used statistical tools to quantify their influence on clinical endpoints.
Since methodological and technical implementations of studies are highly likely to influence their outcomes (63), we suggest identifying which of the factors independently influence the clinical efficacy with the use of appropriate statistical tools. In addition, we have made available all the raw RCT data we have used and a complete Python library for performing meta-analysis (64). Through doing so, we hope to foster the replication of our and previous studies and to facilitate similar future projects.
2. Materials and Methods
2.1. Inclusion Criteria
Search terms were directly taken from Cortese et al. (1) with the exception of the need for a control arm, which is detailed in Supplementary Material (65). The requirements included:
• studies have to assess NFB efficacy;
• subjects must have received a diagnosis of ADHD based on DSM-IV (66), DSM-5 (3), ICD-10 (67) criteria, or by a qualified psychiatrist;
• studies have to be written in English, German, Spanish, or French;
• studies have to include at least eight subjects in each group;
• patients must be younger than 25 years old;
• the publication has to disclose sufficient details about the data to compute required metrics for the ensuing analysis.
The studies satisfying all these criteria were included in the SAOB. In order to replicate and update Cortese et al.'s (1) meta-analysis, we applied the original inclusion criteria of their meta-analysis to our search (the main difference being the presence of a control group).
2.2. Outcome Definition
In the included studies, the severity of ADHD symptoms have been assessed by parents and, whenever available, by teachers. Cortese et al. (1) and Micoulaud-Franchi et al. (55) defined parents as Most Proximal (MPROX) raters who were not blind to the treatment, as opposed to teachers, who were considered as Probably Blind (PBLIND) raters. This distinction is intended to assess the amplitude of the placebo effect, where it is hypothesized that teachers, who are presumed to be more blind to the intervention, are less influenced in their assessment. Efficacy of NFB was measured using clinical scales, such as the ADHD-RS (68), on the following outcomes: inattention, hyperactivity/impulsivity, and total scores. The factor analysis was performed using the total score.
2.3. Meta-Analysis
The goal of a meta-analysis is to aggregate results from different clinical investigations and offer a consolidated body of evidence. To achieve this, it is necessary to assume some level of homogeneity in the design of the studies: inclusion criteria, and the presence and type of control (active, semi-active, or non-active). Because studies occasionally use slight variations of a clinical scale and because of the clinical heterogeneity of patients and control, the scores are standardized before being pooled into a Summary Effect (SE). The between-Effect Size (ES) is one such standardized metric, which we have implemented in this paper [see Supplementary Material (65)].
The meta-analysis was performed with a Python package developed for this work. The package offers a transparent approach for the choice of parameters in an effort to ease replicability. We have benchmarked it against RevMan version 5.1 [(69), UK, London] by replicating Cortese et al.'s (1) work. The code is made fully available on a GitHub repository (64), together with all the RCTs raw data we have used in the present study.
Before updating the Cortese et al. (1) work with recently published studies (70, 71), we decided to run a sensitivity analysis investigating the choices that later proved controversial (56). The investigated changes included:
• the between-ES of Arnold et al.'s (58) study was computed from the post-test clinical values taken after the completion of the 40 sessions, in contrast to Cortese et al.'s (1) report which used the results after only 12 sessions because the endpoint values were not available at the time of his study;
• the between-ES computed from the teachers' assessment reported by Steiner et al. (57) relied on the BOSS Classroom Observation (72). This is an atypical scale to quantify ADHD symptoms since the Conners Rating Scale Revised (73– 75), a well-defined (76, 77) and broadly used metric, was available in this study. Thus, we decided to compute the between-ES based on the Conners-3 already used in this study to compute the parents' between-ES.
As initially suggested by Cortese et al. (1) the analysis was run on two subgroups of studies with the two choices described above: one gathering studies following the standard protocol defined by Arns et al. (59) and a second including only participants not taking medications during the clinical trial.
2.4. Identify Factors Influencing the Neurofeedback
While revisiting the existing meta-analyses, it became apparent that the studies pooled together were highly heterogeneous in terms of methodological and practical implementation. For instance, all NFB interventions were pooled together regardless the quality of acquisition, the quality of EEG data, and the trained neuromarker. Equally, the methodological implementations varied significantly, requiring the “subgroup” analysis (for instance, gathering studies following standard protocols) that are somewhat arbitrary. To circumvent these limitations, we implemented a novel approach: the SAOB. With this method, the within-ES of each intervention was considered as a dependent variable to be explained by methodological and technical factors. Such analysis aims at identifying known methodological biases (e.g., blind assessments negatively associated with within-ES) and possible technical factors (e.g. a good control on real time data quality positively influences the treatment outcome).
2.4.1. Identify and Pre-process Factors
We classified the factors influencing the efficacy of NFB into five categories: methodological, technical, demographics, and quality of the signal and acquisition. Factors were chosen based on that reported in the literature as presumed to influence ES, and categorized as follows:
• the methodological biases: the presence of a control group, the blindness of assessors, the randomization of subjects in controlled trials, and the approval of the study by an Institutional Review Board (IRB);
• the population: intake of psychostimulants during NFB treatment, the age range of children included, the severity of ADHD symptoms at baseline1, and the degree of engagement with NFB intervention;
• the NFB implementation: the protocol used (SCP, SMR, theta up, beta up in central or frontal areas, theta down), the presence of a transfer phase during NFB training, the possibility to train at home or at school with a transfer card, the type of thresholding for discrete reward, the number of NFB sessions, the length and frequency of the sessions, the length of the treatment, the individualization of the frequency bands based on the iAPF, and coupling NFB with Electromyogram (EMG)-Biofeedback;
• the acquisition quality: the presence of one or more active electrodes and the EEG data quality. The latter was coded as an indicator between 1 and 3, using the following criteria:
the type of electrodes used: Silver Chloride (AgCl)/Gel or Gold (Au)/Gel;
the use of impedance mode: a quality check of electrode contacts ensuring an inter-electrode impedance smaller than 40kΩ;
the level of hardware certificate: compliance with ISO-60601-2-26 (78).
A quality score equal to 3 was assigned if all the above criteria were satisfied. If at least one was satisfied the quality score was set to 2, otherwise the score was set to 1.
• the signal quality: online rejection (epoch rejected, feedback not computed) or correction (feedback computed on the denoised epoch) of Electro-Oculogram (EOG) artifacts, and online rejection of generic artifacts using an amplitude-based detection.
To prevent any bias in the analysis, the names of the factors were hidden during the entire analysis so that the data scientists (AB, QB, DO, and LM) were fully blind to them. The names were revealed only when the data analysis and results were accepted as valid: this included choice of variable normalization and validation of model hypothesis, as detailed below.
The pre-processing of factors for the analysis included the following steps: factors for which there were too many missing observations arbitrarily set to more than 20% of the total observations, were removed from the analysis. Furthermore, if a factor had more than 80% similar observations it was also removed. A study did not systematically correspond to an observation: when several clinical scales and/or raters were available in a study, each couple clinical scale–rater was considered as an observation. Categorical variables were coded as dummies, i.e., the presence of the factor was represented with 1 and its absence with 0. All variables were standardized by subtracting the mean and then dividing by the standard deviation (not applied before the decision tree described below).
2.4.2. Explaining Within Effect Sizes With Factors
To compute the within-ES, the means of total ADHD scores given by parents and teachers were used. Moreover, in case studies providing results for more than one behavioral scale the within-ES scores were computed for each one as:
where Mt,T is the mean of clinical scale, for treatment T, taken at time t (pre-test or post-test) and σt,T represents its standard deviation. With this definition, we focus on the effect of the treatment within a group (79) as commonly reported in the literature (36, 54, 70). This within-ES enables us to quantify the efficacy of NFB inside the treatment group. Finally, to avoid to break analysis methods assumptions, an outlier rejection was applied defining thresholds of acceptance as [μ − 3σ, μ + 3σ], with μ and σ being respectively the mean and the standard deviation of all within-ES computed (80).
The within-ES was then considered as a dependent variable to be explained by the factors (the independent variables). The following three methods, implemented with the Scikit-Learn Python (81, version 0.18.1) and the Statsmodels Python (82, version 0.8.0) libraries, were used to perform the regression:
• weighted multiple linear regression with Weighted Least Squares (WLS) (83);
• sparsity-regularized linear regression with Least Absolute Shrinkage and Selection Operator (LASSO) (84);
• decision tree regression (85).
The aim of the linear regression is to estimate the regression coefficients linking the factors to the within-ES. A significant coefficient (here and hereafter meaning significantly different from zero) indicates that the associated factor has an influence on NFB efficacy and its sign the direction of the effect. The WLS differs from a traditional linear regression estimated with Ordinary Least Squares (OLS) in that a weight is assigned to each observation in order to account for the multiplicity of reported clinical endpoints in some studies. In addition, the weight was also set as a function of the sample size to account for variations in sample sizes. Specifically, the weight of each study was taken as the ratio between the experimental group's sample size and the number of behavioral scales available. We also ran the analysis using the OLS method to assess the impact of the weights on the results.
The second linear method applied was the LASSO, which naturally incorporates variable selection into the linear model thanks to ℓ1-norm applied on the coefficients. A coefficient not set to zero means that the associated factor has an influence on NFB efficacy and its sign indicates the direction of the effect.
The last method used to determine factors influencing NFB was the decision tree (85), a hierarchical and non-linear method. This breaks down a dataset into smaller and smaller subsets using, at each iteration, a variable and a threshold chosen to optimize a simple Mean Square Error (MSE) criterion (86). A tree is composed of several nodes and leafs, the importance of which decreases from the top node, called the root node, downward.
Given that these methods are intrinsically different we compared their results. For instance, the decision tree captures variable interactions and can relate factors to within-ES in a non-linear fashion. On the other hand, the LASSO offers an elegant mathematical framework for variable selection. Further details are provided in the Supplementary Material (65).
3. Results
3.1. Selected Studies
Search terms entered in Pubmed returned 155 results during the last check on February 12, 2018, including 22 articles used in previous meta-analyses on NFB. Following the selection process illustrated in Figure 1, 33 studies were included in the SAOB and 16 in the meta-analysis, as summarized in Table 1. The 33 studies selected for the SAOB followed Cortese et al.'s (1) criteria, with the exception of the requirement for a control group. Indeed, since within-ES were considered in this analysis, a control group was not required.
Figure 1. Flow diagram of selection of studies (last searched on February 12, 2018). The subset (a) corresponds to the Cortese et al.'s (1) inclusion criteria without the requirement of the presence of a control group. The subset (b) precisely corresponds to the studies included in Cortese et al. (1) and more recent works meeting the same criteria.
Table 1. List of all studies included in the three different analyses: (a) studies originally included in Cortese et al. (1) (last searched on August 30, 2015); (b) studies satisfying Cortese et al.'s (1) criteria (last searched on February 12, 2018); (c) studies satisfying Cortese et al.'s (1) criteria to the exception of the part relative to the control group (last searched on February 12, 2018).
3.2. Meta-Analysis
The replication of Cortese et al.'s (1) results obtained are presented in Table 2:
• when computing the between-ES for Arnold et al. (58) with the values after 40 sessions of NFB, smaller between-ES were found as compared to those found by Cortese et al. (1), which was unexpected since the clinical efficacy is supposed to increase with the number of NFB sessions. These lower between-ES impacted the SE: they were slightly lower when computed with this choice although they nonetheless remained significant (see the three first lines of Table 2);
• when relying on the teachers' ratings from the Conners-3 to compute the between-ES of Steiner et al. (57), higher SEs were found in attention but not for total and hyperactivity score. However, this different choice of scale did not affect the statistical significance of the SEs (see the three last lines of Table 2).
Table 2. Comparison between Cortese et al. (1) results obtained with RevMan (69) and those obtained with the meta-analysis package with our choicesa applied.
The meta-analysis was then extended by adding three new articles (51, 70, 71). Bazanova et al. (51) gave parents' assessments for all outcomes, Baumeister et al. (71) provided results solely for parents' total outcome, whereas Strehl et al. (70) gave both teachers' and parents' assessments for all outcomes. To be consistent with the SAOB, only the standard NFB group of Bazanova et al. (51) was included in this update. Despite favorable results for NFB, particularly on parents' assessments, adding these three new studies did not change either the magnitude or the significance of the SE, for any outcome regardless of the raters, as illustrated in Figure 2.
Figure 2. Forest plots showing the between-ES. A negative ES is in favor of NFB. The blue squares correspond to the ES, the blue diamond to the SE and the green line to the 95% confidence interval.
Regarding the “standard protocol” subgroup, Cortese et al. (1) found all the outcomes significant except for the hyperactivity symptoms rated by teachers, which showed only a statistical trend (p-value = 0.11). Similar results were obtained when adding the most recent studies meeting this definition (70, 71) (p-value = 0.11). The SE for the total outcome assessed by teachers remained significant with the addition of the new RCT (p-value = 0.043), thus giving more strength to this result since it is now based on four studies including 283 patients in total.
As for the no-drug subgroup, SEs were found significant for the inattention symptoms assessed by parents (p-value = 0.017). In addition, the differences in Arnold et al. (58) values and the inclusion of Bazanova et al. (51) caused a loss of significance in hyperactivity outcome for parents (p-value = 0.062) compared to Cortese et al. (1) (p-value = 0.016). Only Bazanova et al. (51) was included in this subgroup: in the two other studies the subjects were taking psychostimulants during the trial.
All the clinical scales used to compute the between-ES following our choices are summarized in the Supplementary Material.
3.3. Factors Influencing Neurofeedback
This analysis was performed on 33 trials (corresponding to 67 observations) assessing the efficacy of NFB, as presented in Table 1. The outlier rejection removed two training groups of Bazanova et al. (51) from the analysis because their within-ES were out of the bounds. Among the 28 factors selected, nine were removed because there were too many missing observations or because they were too homogeneous: beta up in frontal areas, the use of a transfer card, the type of threshold for the discrete rewards (incremental or fixed), the EEG quality equal of 3, the presence of a control group, the individualization of the frequency bands based on the iAPF, coupling NFB with EMG-Biofeedback, the severity of ADHD symptoms at baseline, and the degree of engagement with NF intervention.
All results are presented in Table 3. These results, require careful interpretation since each technique provided slightly different results. These differences may depend on the different assumptions of the model and several other factors. Nonetheless, we are inclined to trust the findings that are consistent across methods.
The WLS technique identified nine significant factors for an adjusted R-squared of 0.62 (see second column of Table 3). When applying the OLS, the same factors were significant (except the EEG quality equal of 2 and the presence of more than one active electrode) with a lower adjusted R-squared (0.35). The LASSO regression selected six significant factors (see third column of Table 3). With these methods, a negative coefficient means that the factor is in favor of the efficacy of NFB. The decision tree is presented in Figure 3: the best predictor in our case was the PBLIND (see last column of Table 3). Four other factors also split the subsets; however, increasingly fewer samples are available the lower we get into the tree, making the interpretation increasingly doubtful.
Figure 3. Decision Tree obtained: ES corresponds to the within subject effect size and k to the number of studies, the importance of variables decreases from the root node. Session length is measured in minutes, treatment length in weeks, and age in years.
Several factors were common to the three methods used. These included, in particular: the assessment by a blind rater, the treatment length, and an EEG quality score equal to 2 (see lines 1, 10, and 20 of Table 3). The methods also agreed on the direction of the effect for these factors: a shorter treatment and recording of the EEG with a good-quality system appears preferable, whereas teachers' assessment appears less favorable compared to parents' assessment.
It is more doubtful the influence of the factors returned by only one or two methods (see lines 2, 3, 7, 9, 12, 17, 18, and 21 of Table 3). In particular:
• both WLS and LASSO found that using more than one active electrode during NFB appears to lead to an higher efficacy;
• both WLS and the decision tree found that performing a higher number of sessions seems to be preferable;
• both LASSO and the decision tree found that a higher number of sessions per week appears to positively influence the efficacy of the NFB treatment.
Five factors were returned by only one of the methods: randomizing the groups, the IRB approval, the session length, the presence of a transfer phase, and the correction or rejection of ocular artifacts.
Eight factors were never selected by the three methods (see lines 4, 5, 6, 13, 14, 15, 16, and 23 of Table 3): the children's minimum and maximum age, being on drugs during NFB treatment, the protocols SMR, beta up in central areas, theta down, and SCP, and the artifact correction based on amplitude.
Thus, these factors overwhelmingly appear not to have an influence on NFB efficacy.
In the next section we discuss only the factors that were selected by at least two of the three methods.
4. Discussion
4.1. Meta-Analysis
This replication and update of a meta-analysis did not meet all PRISMA recommendations (107). In particular, the risk of bias in individual and across studies was not assessed.
In the meta-analysis performed here, we challenged some choices made by Cortese et al. (1) which proved controversial: the computation of between-ES based on an unusual scale (57) and the inclusion of a pilot study (58) whose endpoint values were not available at the time Cortese et al. (1) conducted their meta-analysis. We here review the list of changes, their justification, and their impact on the analysis.
First, relying on the Conners-3 (108) instead of the BOSS Classroom Observation (72) for teachers' ratings seems preferable because this scale is more commonly used (74, 75) and is a revision of the Conners Rating Scale Revised (73) whose reliability has been studied (76). However, relying on one or the other scale did not change the significance of the between-ES, regardless of outcome.
Second, to compute the between-ES of Arnold et al. (58) the clinical scores taken when all sessions were completed were used instead of looking at interim results as with Cortese et al. (1). Some studies suggested that the number of sessions correlates positively with the changes observed in the EEG (61) so that a lower number of sessions would lead to artificially smaller between-ES. Here, the between-ES computed with the values at post test of Arnold et al. (58) were smaller than those obtained after 12 sessions; however, these differences did not lead to a change of significance of the SE.
To conclude on this meta-analysis, although some points were controversial, the impact on the meta-analysis was minimal and did not change the statistical significance of any outcome. The addition of the three new studies (51, 70, 71) further confirmed the original results. Indeed, the significance did not change for any outcome: the SE remained significant for MPROX raters and non-significant for PBLIND. Adding three more studies increased the significance of the sensitivity analysis run by Cortese et al. (1) most notably the SE of studies corresponding to NFB “standard protocols” (59). While Cortese et al. (1) found that this subset tends to perform better, particularly on the PBLIND outcome, adding two studies confirmed this result on the total clinical score (p-value < 0.05). Despite the obvious heterogeneity of the studies included in this subset (particularly in terms of protocol used), these results suggest a positive relation between the features of this standard design and NFB performance. This result is a breakthrough in the demonstration of standard NFB protocol efficacy for the treatment of ADHD. Nonetheless, the studies included in this subset are still highly heterogeneous (particularly in terms of protocol used), a factor which should be accounted for.
4.2. Factors Influencing Neurofeedback
Description and analysis of different types of NFB implementation were subject to several studies (1, 54, 59–62). However, to the best of our knowledge, none of these studies has implemented a systematic multivariate approach to associate factors to clinical endpoints therefore exposing their univariate analysis to a greater extend to the presence of a confounding factor.
Two observations were detected as outliers and so removed from the dataset before performing the SAOB: Bazanova et al.'s (51) individualized NFB and individualized NFB coupled with EMG-Biofeedback groups. Indeed, these two groups presented very large within-ES (–3.41 and –3.95), even bigger than those reported in the literature on medication treating ADHD (109). These large values broke our working hypothesis, so in order to be able to conclude on the results obtained by the SAOB, an outlier rejection was implemented.
As expected, the number of sessions was found to be significant, even if it was by only two methods, which was in compliance with existing literature. For instance, using several univariate regressions without correction for multiple testing (54), Arns et al. (59) stated that performing less than 20 NFB sessions leads to smaller effects. Similarly, Vernon et al. (61) observed that positive changes in the EEG and behavioral performance occurred after a minimum of 20 sessions. However, Enriquez-Geppert et al. (60) insisted that the number of sessions should be carefully chosen in order to avoid “overtraining.” The fact that the number of sessions was not identified by the LASSO as a positively contributing factor might be explained by the presence of only two data points with 20 sessions or less. Conceivably, the temporal threshold of efficacy was passed for all included studies, making the identification of this factor by the three methods unlikely on this dataset. However, the two methods that identified this factor both agreed that as expected, the more sessions performed, the more efficient the NFB tends to be.
Interestingly, Minder et al. (110) suggests that the subject location of the NFB training may also be an important contributory factor to clinical effectiveness. However, this has been challenged by a recent study (110) showing that performing NFB at school or at the clinic has no significant impact on treatment response.
The type of NFB protocol was not identified by any method, and did not appear to influence the NFB results. This minimal importance granted by the methods to the NFB protocols is counter-intuitive given the centrality of the protocols in the neurophysiological mode of action and subsequent expected impact on therapeutic effectiveness (61). A possible explanation for this result is that these protocols were equally efficacious for the populations to whom they were offered and thereby did not constitute a significant explanatory factor. This result, however, does not preclude a combined and personalized strategy (offering personalized protocols based on phenotypes) to further improve performance, as previously suggested by Alkoby et al. (111).
Several factors were selected by all three methods with the same direction of influence: the EEG quality, the treatment length, and the rater's probably blindness to the treatment. First, our analysis highlighted that recording EEG in good conditions leads to better results. This can be explained by the fact that better signal quality enables more accurate extraction of EEG patterns linked to ADHD and hence leads to better learning and therapeutic efficacy (63). However, it remains difficult to assess the quality of EEG hardware (such as the amplifier used) because little information is provided in these studies. This calls for greater care in future studies, which should strive to assess and report the quality of the data.
Next, it appears that the longer the course of treatment, the less efficient it becomes. This may be explained by the degree of engagement with NFB intervention: it may be harder to be engaged with a long course of treatment. However, it is difficult to quantify because either no questionnaires assessing engagement were submitted to children or this information was not provided. It is an interesting point to investigate, so we invite future studies to share it if possible.
Arguably, the treatment length is a proxy for treatment intensity, suggesting that a shorter period of treatment is more likely to succeed because the frequency of the sessions is higher. This hypothesis is supported by the fact that the variable session pace (number of sessions per week) is also associated with larger within-ES according to the LASSO and the decision tree. The impact of the intensity of treatment has been investigated by Rogala et al. (112) on healthy adults: it was observed that studies with at least four training sessions completed on consecutive days were all beneficial. Overall, these results suggest adopting a high session pace, which is not common knowledge in the field.
Some other factors' influence would have been interesting to investigate, such as using personalized NFB protocols based on the iAPF (49), which seems promising according to Bazanova et al. (51) and Escolano et al. (50). However, it could not be included in the SAOB because only two studies used personalized NFB protocols. This lack of studies is also the reason why the impact of coupling EMG-Biofeedback with NFB could not be included in the SAOB. Another interesting factor, which could have helped explain the result on the treatment duration, was excluded from the analysis: the severity at baseline. Although pre-test scores were available for each study, they could not be compared because they were measured on different scales. To address this problem, the scores were normalized using the maximum score to be obtained on each scale. However, this value was not found for several clinical scales, which led to missing observations. When more studies including these features will be available, it would be interesting to run the SAOB to determine the influence of these excluded factors.
In general our results strongly support the effectiveness of NFB for the treatment of ADHD. However, as expected, the assessment of symptoms by non-blind raters leads to far more favorable results than by PBLIND raters, a result widely expected and in close compliance with the existing meta-analysis (1, 55). This observation would certainly be contradictory should teachers' assessments reflect a placebo effect, which has long been documented in the literature (110, 113, 114). This point is investigated in greater detail in the following section.
4.3. Analysis on the Probably Blind Raters
Teachers were considered as PBLIND raters by Cortese et al. (1) and Micoulaud-Franchi et al. (55). Unexpectedly, the data provided did not exactly match the widely accepted hypothesis stating that the difference between MPROX and PBLIND can solely be explained by the placebo effect. Nonetheless, the emphasis put on 'probably' indicates that teachers may be aware of the treatment followed. An element that corroborates this hypothesis is the fact that, for all the studies included in this work, the amplitude of the clinical scale at baseline suggests that teachers did not capture the full extent of the symptoms or, put differently, that they were blind more to the symptoms than to the intervention, as illustrated in Figure 4. Indeed, before the intervention, teachers rated the symptoms less severely compared to parents and observed less improvement at post-test: this tends to correspond more to case A with no placebo effect than case B. The expected differences of ratings between teachers and parents have been extensively studied (110, 113, 114), observing that teachers are more likely to underrate a child's symptom severity, especially for younger children. As a consequence, teachers might simply be less likely to observe a clinical change over the course of the treatment (110, 113, 114). Moreover, it is also clear that there is more variability in teachers' scores compared to parents', which could partly explain the lower ES obtained for PBLIND raters, since the variability deflates the ES. In conclusion, using PBLIND as an estimate for correcting the placebo effect does not appear an appropriate choice.
Figure 4. Pre-test and post-test scores (± standard error) given by parents (MPROX) in blue and teachers (PBLIND) in green, dashed line. Data hypothesized under two different hypotheses: (A) no placebo effect, teachers see fewer symptoms altogether so that difference pre-post is low and (B) placebo effect, teachers see as many baseline symptoms as parents but do not see as much improvement. (C) Real data: evolution of parents' and teachers' scores between pre- and post-test on studies that satisfy Cortese et al.'s (1) inclusion criteria and that provide teachers and parents' scores on the same scale.
Another way to highlight a possible placebo effect is to focus on the decision tree illustrated in Figure 3. The top node splits: on the one hand 46 observations corresponding to MPROX raters and, on the other, 21 observations corresponding to PBLIND. If the differences observed between PBLIND and MPROX raters were due to the placebo effect, one would expect to find in the MPROX sub-tree some factors linked to the perception of the implication in the treatment. This was indeed the case: treatment length was found to be significant but not in the direction corroborating the hypothesis that they are a part of a placebo effect. Indeed, one would expect that the longer the treatment, the higher the placebo effect and the greater the within-ES. Instead, the opposite was found, somewhat invalidating the hypothesis.
Overall, these results suggest that PBLIND assessments could hardly be used to assess placebo effect as they seem to be blinder to symptoms than to intervention. In the absence of an ethically (115) and technically (116) feasible sham for NFB protocols (117), it is necessary to fall back on an acceptable methodological alternative for the demonstration of clinical effectiveness. Among those are the analysis of neuromarkers collected during NFB treatment demonstrating that patients do control the trained neuromarkers; that they learn (reinforce control over time), and that these possibly lead to lasting brain reorganization (e.g., changes in their baseline resting state activity). The specificity of these changes, in relation to which neuromarkers were trained and to the clinical improvement, will be an essential component of this demonstration.
5. Conclusion
In this work we provide additional elements in favor of the effectiveness of NFB for the treatment of ADHD. First, we confirm that a subgroup of standard NFB studies shows a statistically significant improvement on PBLIND assessments (k = 4 studies instead of 3, n = 283 patients instead of 158 (1)).
Second, we identify technical factors as positive contributors to clinical effectiveness, which strongly suggests that it is mediated by a real mechanism of action based on EEG conditioning. Equally, treatment intensity was also found to contribute, corroborating what is known from learning theory (memory consolidation) (118); that is to say, a more intense treatment leads to an increased clinical efficacy.
While these findings certainly contribute to the debate, this work also suggests that the ultimate demonstration of evidence remains out of reach, as teachers' assessments were partly invalidated as a proxy for the quantification of the placebo effect. As a consequence, using PBLIND endpoints to address the specificity of the clinical efficacy is not recommended and we instead advise a reliance on other available methodological tools. These tools include sham NFB and neuromarker analysis investigating the specificity of the EEG changes with respect to trained neuromarkers as well as changes in clinical endpoints.
This work also offers an open-source toolbox for running meta-analysis and SAOB: the code and data used are available, thus ensuring the transparency and replicability of these analysis, as well as fostering future ones. Regarding perspectives, this two-fold methodological framework applied to NFB for ADHD could be suitable for other NFB applications (29).
Data Availability
The datasets analyzed for this study can be found in the repository (64) https://github.com/AuroreBussalb/meta-analysis-statistical-tools.
Author Contributions
AB extracted all data from articles and performed the analysis. MC provided advice and expertise concerning the design of the study and the methods used. RD and EA provided clinical interpretation of the results. QB and DO supported the toolbox implementation and validation. LM supervised the entirety of this work.
Funding
This research was conducted as part of a PhD thesis funded by Mensia Technologies, the Association Nationale Recherche et Technologie (ANRT), and the EU H2020 NEWROFEED grant number 684809.
Conflict of Interest Statement
AB, QB, DO, and LM work for Mensia Technologies. MC served as scientific advisor for Mensia Technologies when this work was conducted.
The remaining authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.
Acknowledgments
We would like to thank Jean-Arthur Micoulaud-Franchi, MD, PhD for his helpful comments and ideas stimulating this work. We also want to thank Samuele Cortese, MD, PhD for his help while replicating his meta-analysis. Finally, we also would like to thank the Association Nationale Recherche et Technologie (ANRT) for partially funding the PhD thesis of the first author as well as the EU H2020 NEWROFEED grant number 684809.
Supplementary Material
The Supplementary Material for this article can be found online at: https://www.frontiersin.org/articles/10.3389/fpsyt.2019.00035/full#supplementary-material
Abbreviations
ADHD, Attention Deficit/Hyperactivity Disorder; AgCl, Silver Chloride; Au, Gold; CPT, Continuous Performance Test; EEG, electroencephalogram; EMG, Electromyogram; EOG, Electro-Oculogram; ERP, Event-Related Potential; ES, Effect Size; FDA, Food and Drug Administration; fMRI, functional Magnetic Resonance Imaging; iAPF, individual Alpha Peak Frequency; IRB, Institutional Review Board; LASSO, Least Absolute Shrinkage and Selection Operator; MPROX, Most Proximal; MSE, Mean Square Error; NFB, neurofeedback; OLS, Ordinary Least Squares; PBLIND, Probably Blind; PET, Positron Emission Tomography; RCT, Randomized Controlled Trial; SAOB, Systematic Analysis of Biases; SART, Sustained Attention to Response Task; SCP, Slow Cortical Potential; SE, Summary Effect; SMR, Sensorimotor Rhythm; TBR, Theta Beta Ratio; TOVA, Test of Variables of Attention; WLS, Weighted Least Squares.
Footnotes
1. ^Baseline severity is given by the pre-treatment score. But, to have comparable severity scores between the different clinical scales, pre-treatment score is normalized, i.e., divided by the maximal score which can be obtained on the clinical scale.
References
1. Cortese S, Ferrin M, Brandeis D, Holtmann M, Aggensteiner P, Daley D, et al. Neurofeedback for attention-deficit/hyperactivity disorder: meta-analysis of clinical and neuropsychological outcomes from randomized controlled trials. J Am Acad Child Adolesc Psychiatry (2016) 55:444–55. doi: 10.1016/j.jaac.2016.03.007
2. Faraone SV, Biederman J, Mick E. The age-dependent decline of attention deficit hyperactivity disorder: a meta-analysis of follow-up studies. Psychol Med. (2006) 36:159–65. doi: 10.1017/S003329170500471X
3. The American Psychiatric Association. Diagnostic and Statistical Manual of Mental Disorders 5th Edition (DSM-5) (2013).
4. Shaw-Zirt B, Popali-Lehane L, Chaplin W, Bergman A. Adjustment, social skills, and self-esteem in college students with symptoms of ADHD. J Atten Disord. (2005) 8:109–20. doi: 10.1177/1087054705277775
5. Barry TD, Lyman RD, Klinger LG. Academic underachievement and attention-deficit/hyperactivity disorder: the negative impact of symptom severity on school performance. J Sch Psychol. (2002) 40:259–83. doi: 10.1016/S0022-4405(02)00100-0
6. Harpin VA. The effect of ADHD on the life of an individual, their family, and community from preschool to adult life. Arch Dis Childhood (2005) 90(Suppl. 1):i2–7. doi: 10.1136/adc.2004.059006
7. Le HH, Hodgkins P, Postma MJ, Kahle J, Sikirica V, Setyawan J, et al. Economic impact of childhood/adolescent ADHD in a European setting: the Netherlands as a reference case. Eur Child Adolesc Psychiatry (2014) 23:587–98. doi: 10.1007/s00787-013-0477-8
8. Forbes GB. Clinical utility of the test of variables of attention (TOVA) in the diagnosis of attention-deficit/hyperactivity disorder. J Clin Psychol. (1998) 54:461–76.
9. Barkley RA. The ecological validity of laboratory and analogue assessment methods of ADHD symptoms. J Abnormal Child Psychol. (1991) 19:149–78.
10. Robertson IH, Manly T, Andrade J, Baddeley BT, Yiend J. Oops!': performance correlates of everyday attentional failures in traumatic brain injured and normal subjects. Neuropsychologia (1997) 35:747–58.
11. Johnstone J, Gunkelman J, Lunt J. Clinical database development: characterization of EEG phenotypes. Clin EEG Neurosci. (2005) 36:99–107. doi: 10.1177/155005940503600209
12. Monastra VJ. Electroencephalographic biofeedback (neurotherapy) as a treatment for attention deficit hyperactivity disorder: rationale and empirical foundation. Child Adolesc Psychiatr Clin North Am. (2005) 14:55–82. doi: 10.1016/j.chc.2004.07.004
13. Matoušek M, Rasmussen P, Gillberg C. EEG frequency analysis in children with so-called minimal brain dysfunction and related disorders. In: Neurophysiological Correlates of Mental Disorders. vol. 15. Vienna: Karger Publishers (1984). p. 102–8.
14. Janzen T, Graap K, Stephanson S, Marshall W, Fitzsimmons G. Differences in baseline EEG measures for ADD and normally achieving preadolescent males. Biofeedback Self Regul. (1995) 20:65–82.
15. Loo SK, McGough JJ, McCracken JT, Smalley SL. Parsing heterogeneity in attention-deficit hyperactivity disorder using EEG-based subgroups. J Child Psychol Psychiatry (2017).59:223–31. doi: 10.1111/jcpp.12814
16. Neba Health. Neba (2015). Available online at: https://shop.nebahealth.com/products/neba-fda-cleared-brainwave-adhd-assessment-aid
17. Taylor E. Uses and misuses of treatments for ADHD. The second Birgit Olsson lecture. Nordic J Psychiatry (2014) 68:236–42. doi: 10.3109/08039488.2013.804118
18. Storebo OJ, Krogh HB, Ramstad E, Moreira-Maia CR, Holmskov M, Skoog M, et al. Methylphenidate for attention-deficit/hyperactivity disorder in children and adolescents: cochrane systematic review with meta-analyses and trial sequential analyses of randomised clinical trials. BMJ (2015) 2015:351. doi: 10.1136/bmj.h5203
19. DuPaul GJ, Barkley RA, Connor DF. Stimulants. Attention deficit hyperactivity disorder: A handbook for diagnosis and treatment, 2nd Edn. New York, NY: Guilford Press (1998) p. 510–551.
20. Swanson JM, Kraemer HC, Hinshaw SP, Arnold LE, Conners CK, Abikoff HB, et al. Clinical relevance of the primary findings of the MTA: success rates based on severity of ADHD and ODD symptoms at the end of treatment. J Am Acad Child Adolesc Psychiatry (2001) 40:168–79. doi: 10.1097/00004583-200102000-00011
21. Jensen PS. A 14-month randomized clinical trial of treatment strategies for attention-deficit/hyperactivity disorder. Arch Gen Psychiatry (1999) 56:1073–86.
22. Su Y, Yang L, Stein MA, Cao Q, Wang Y. Osmotic release oral system methylphenidate versus atomoxetine for the treatment of attention-deficit/hyperactivity disorder in Chinese youth: 8-week comparative efficacy and 1-year follow-up. J Child Adolesc Psychopharmacol. (2016) 26:362–71. doi: 10.1089/cap.2015.0031
23. Becker SP, Froehlich TE, Epstein JN. Effects of methylphenidate on sleep functioning in children with attention-deficit/hyperactivity disorder. J Dev Behav Pediatr. (2016) 37:395. doi: 10.1097/DBP.0000000000000285
24. Cooper WO, Habel LA, Sox CM, Chan KA, Arbogast PG, Cheetham TC, et al. ADHD drugs and serious cardiovascular events in children and young adults. N Engl J Med. (2011) 365:1896–904. doi: 10.1056/NEJMoa1110212
25. Bélanger SA, Vanasse M, Spahis S, Sylvestre MP, Lippé S, l'Heureux F, et al. Omega-3 fatty acid treatment of children with attention-deficit hyperactivity disorder: a randomized, double-blind, placebo-controlled study. Paediatr Child Health (2009) 14:89–98. doi: 10.1093/pch/14.2.89
26. Sonuga-Barke EJS, Brandeis D, Cortese S, Daley D, Ferrin M, Holtmann M, et al. Nonpharmacological interventions for ADHD: systematic review and meta-analyses of randomized controlled trials of dietary and psychological treatments. Am J Psychiatry (2013) 170:275–89. doi: 10.1176/appi.ajp.2012.12070991
27. Arns M, Heinrich H, Ros T, Rothenberger A, Strehl U. Neurofeedback in ADHD. Front Hum Neurosci. (2015) 9:602. doi: 10.3389/fnhum.2015.00602
29. Marzbani H, Marateb HR, Mansourian M. Neurofeedback: a comprehensive review on system design, methodology and clinical applications. Basic Clin Neurosci. (2016) 7:143. doi: 10.15412/J.BCN.03070208
30. Berger H. Über das elektrenkephalogramm des menschen. Arch Psychiatr Nervenkrankheiten (1929) 87:527–70.
31. Durup G, Fessard A. I. L'électrencéphalogramme de l'homme. Observations psycho-physiologiques relatives à l'action des stimuli visuels et auditifs. L'année Psychol. (1935) 36:1–32.
32. Sterman M, MacDonald L, Stone RK. Biofeedback training of the sensorimotor electroencephalogram rhythm in man: effects on epilepsy. Epilepsia (1974) 15:395–416.
33. Lubar JF, Shouse MN. EEG and behavioral changes in a hyperkinetic child concurrent with training of the sensorimotor rhythm (SMR). Biofeedback Self Regul. (1976) 1:293–306.
34. Rossiter DTR, La Vaque TJ. A comparison of EEG biofeedback and psychostimulants in treating attention deficit/hyperactivity disorders. J Neurother. (1995) 1:48–59.
35. Linden M, Habib T, Radojevic V. A controlled study of the effects of EEG biofeedback on cognition and behavior of children with attention deficit disorder and learning disabilities. Appl Psychophysiol Biofeedback (1996) 21:35–49.
36. Maurizio S, Liechti MD, Heinrich H, Jäncke L, Steinhausen HC, Walitza S, et al. Comparing tomographic EEG neurofeedback and EMG biofeedback in children with attention-deficit/hyperactivity disorder. Biol Psychol. (2014) 95:31–44. doi: 10.1016/j.biopsycho.2013.10.008
38. Wang Q, Sourina O, Nguyen MK. Eeg-based “serious” games design for medical applications. In: Cyberworlds (cw), 2010 International Conference on. Singapore: IEEE (2010). p. 270–6.
39. Sherlin LH, Arns M, Lubar J, Heinrich H, Kerson C, Strehl U, et al. Neurofeedback and basic learning theory: implications for research and practice. J Neurother. (2011) 15:292–304. doi: 10.1080/10874208.2011.623089
41. Van Doren J, Heinrich H, Bezold M, Reuter N, Kratz O, Horndasch S, et al. Theta/beta neurofeedback in children with ADHD: feasibility of a short-term setting and plasticity effects. Int J Psychophysiol. (2017) 112:80–8. doi: 10.1016/j.ijpsycho.2016.11.004
42. Beauregard M, Levesque J. Functional magnetic resonance imaging investigation of the effects of neurofeedback training on the neural bases of selective attention and response inhibition in children with attention-deficit/hyperactivity disorder. Appl Psychophysiol Biofeedback (2006) 31:3–20. doi: 10.1007/s10484-006-9001-y
43. Kropotov JD, Grin-Yatsenko VA, Ponomarev VA, Chutko LS, Yakovenko EA, Nikishena IS. ERPs correlates of EEG relative beta training in ADHD children. Int J Psychophysiol. (2005) 55:23–34. doi: 10.1016/j.ijpsycho.2004.05.011
44. Arns M, Conners CK, Kraemer HC. A decade of EEG theta/beta ratio research in ADHD: a meta-analysis. J Atten Disord. (2013) 17:374–83. doi: 10.1177/1087054712460087
45. Heinrich H, Gevensleben H, Freisleder FJ, Moll GH, Rothenberger A. Training of slow cortical potentials in attention-deficit/hyperactivity disorder: evidence for positive behavioral and neurophysiological effects. Biol Psychiatry (2004) 55:772–5. doi: 10.1016/j.biopsych.2003.11.013
46. Banaschewski T, Brandeis D. Annotation: what electrical brain activity tells us about brain function that other techniques cannot tell us–a child psychiatric perspective. J Child Psychol Psychiatry (2007) 48:415–35. doi: 10.1111/j.1469-7610.2006.01681.x
47. Fouillen M, Maby E, Le Career L, Herbillon V, Mattout J. “ERP-based BCI for children with ADHD: motivations and trial design,” In: From Vision to Reality - Proceedings of the 7th Graz Brain-Computer Interface Conference Graz: GBCIC (2017). doi: 10.3217/978-3-85125-533-1-26
48. Klimesch W. EEG alpha and theta oscillations reflect cognitive and memory performance: a review and analysis. Brain Res Rev. (1999) 29:169–95.
49. Liu Y, Hou X, Sourina O, Bazanova O. Individual theta/beta based algorithm for neurofeedback games to improve cognitive abilities. In: Transactions on Computational Science XXVI. Berlin: Springer (2016). p. 57–73.
50. Escolano C, Navarro-Gil M, Garcia-Campayo J, Congedo M, Minguez J. The effects of individual upper alpha neurofeedback in ADHD: an open-label pilot study. Appl Psychophysiol Biofeedback (2014) 39:193–202. doi: 10.1007/s10484-014-9257-6
51. Bazanova OM, Auer T, Sapina EA. On the efficiency of individualized theta/beta ratio neurofeedback combined with forehead EMG training in ADHD children. Front Hum Neurosci. (2018) 12:3. doi: 10.3389/fnhum.2018.00003
52. Loo SK, Barkley RA. Clinical utility of EEG in Attention Deficit Hyperactivity Disorder. Appl. Neuropsychol. (2005) 12:64–76. doi: 10.1007/s13311-012-0131-z
53. Lofthouse N, Arnold LE, Hersch S, Hurt E, DeBeus R. A review of Neurofeedback treatment for pediatric ADHD. J Atten Disord. (2012) 16:351–72. doi: 10.1177/1087054711427530
54. Arns M, de Ridder S, Strehl U, Breteler M, Coenen A. Efficacy of neurofeedback treatment in ADHD: the effects on inattention, impulsivity and hyperactivity: a meta-analysis. Clin EEG Neurosci. (2009) 40:180–9. doi: 10.1177/155005940904000311
55. Micoulaud-Franchi JA, Geoffroy PA, Fond G, Lopez R, Bioulac S, Philip P. EEG neurofeedback treatments in children with ADHD: an updated meta-analysis of randomized controlled trials. Front Hum Neurosci. (2014) 8:906. doi: 10.3389/fnhum.2014.00906
56. Micoulaud-Franchi JA, Salvo F, Bioulac S, Fovet T. Neurofeedback in attention-deficit/hyperactivity disorder: efficacy. J Am Acad Child Adolesc Psychiatry (2016)55:1091–2. doi: 10.1016/j.jaac.2016.09.493
57. Steiner NJ, Frenette EC, Rene KM, Brennan RT, Perrin EC. Neurofeedback and cognitive attention training for children with attention-deficit hyperactivity disorder in schools. J Dev Behav Pediatr. (2014) 35:18–27. doi: 10.1097/DBP.0000000000000009
58. Arnold LE, Lofthouse N, Hersch S, Pan X, Hurt E, Bates B, et al. EEG neurofeedback for attention-deficit/hyperactivity disorder: double-blind sham-controlled randomized pilot feasibility trial. J Atten Disord. (2014)17:410–9. doi: 10.1177/1087054712446173
59. Arns M, Heinrich H, Strehl U. Evaluation of neurofeedback in ADHD: the long and winding road. Biol Psychol. (2014) 95:108–15. doi: 10.1016/j.biopsycho.2013.11.013
60. Enriquez-Geppert S, Huster RJ, Herrmann CS. EEG-neurofeedback as a tool to modulate cognition and behavior: a review tutorial. Front Hum Neurosci. (2017) 11:51. doi: 10.3389/fnhum.2017.00051
61. Vernon D, Frick A, Gruzelier J. Neurofeedback as a treatment for ADHD: a methodological review with implications for future research. J Neurother. (2004) 8:53–82. doi: 10.1300/J184v08n02_04
62. Jeunet C, Lotte F, Batail JM, Philip P, Franchi JAM. Using recent BCI literature to deepen our understanding of clinical neurofeedback: a short review. Neuroscience (2018)378:225–33. doi: 10.1016/j.neuroscience.2018.03.013
63. Congedo M, Lubar JF, Joffe D. Low-resolution electromagnetic tomography neurofeedback. IEEE Trans Neural Syst Rehabil Eng. (2004) 12:387–97.
64. Bussalb A. Meta-analysis Statistical Tools. GitHub (2019). Available online at: https://github.com/AuroreBussalb/meta-analysis-statistical-tools
65. Bussalb A. Supplemental Material. (2019). Available online at: https://github.com/AuroreBussalb/meta-analysis-statistical-tools/blob/master/documentation/article-latex-source/supplemental_material.pdf
66. The American Psychiatric Association. Diagnostic and Statistical Manual of Mental Disorders, Fourth Edition (DSM-IV) Washington, DC (2000).
67. World Health Organization. The ICD-10 Classification of Mental and Behavioural Disorders: Diagnostic Criteria for Research. vol. 2. World Health Organization (1993).
68. Pappas D. ADHD Rating Scale-IV: Checklists, norms, and clinical interpretation. J Psychoeduc Assess. (2006) 24:172–8. doi: 10.1177/0734282905285792
69. Cochrane Collaboration. RevMan 5.1 (2011). Available online at: https://community.cochrane.org/help/tools-and-software/revman-5
70. Strehl U, Aggensteiner P, Wachtlin D, Brandeis D, Albrecht B, Arana M, et al. Neurofeedback of slow cortical potentials in children with Attention-Deficit/Hyperactivity disorder: a multicenter randomized trial controlling for unspecific effects. Front Hum Neurosci. (2017) 11:135. doi: 10.3389/fnhum.2017.00135
71. Baumeister S, Wolf I, Holz N, Boecker-Schlier R, Adamo N, Holtmann M, et al. Neurofeedback training effects on inhibitory brain activation in ADHD: A matter of learning? Neuroscience (2018) 378:890–9. doi: 10.1016/j.neuroscience.2016.09.025
72. Shapiro ES. Academic Skills Problems Fourth Edition Workbook. New York, NY: Guilford Press (2010).
73. Conners CK, Sitarenios G, Parker JD, Epstein JN. The revised Conners' Parent Rating Scale (CPRS-R): factor structure, reliability, and criterion validity. J Abnormal Child Psychol. (1998) 26:257–68.
74. Christiansen H, Reh V, Schmidt MH, Rief W. Slow cortical potential neurofeedback and self-management training in outpatient care for children with ADHD: study protocol and first preliminary results of a randomized controlled trial. Front Hum Neurosci. (2014)8:943. doi: 10.3389/fnhum.2014.00943
75. Bluschke A, Broschwitz F, Kohl S, Roessner V, Beste C. The neuronal mechanisms underlying improvement of impulsivity in ADHD by theta/beta neurofeedback. Sci Rep. (2016) 6:31178. doi: 10.1038/srep31178
76. Collett BR, Ohan JL, Myers KM. Ten-year review of rating scales. V: scales assessing attention-deficit/hyperactivity disorder. J Am Acad Child Adolesc Psychiatry (2003) 42:1015–37. doi: 10.1097/01.CHI.0000070245.24125.B6
77. Epstein JN, Weiss MD. Assessing treatment outcomes in attention-deficit/hyperactivity disorder: a narrative review. Prim Care Companion CNS Disord. (2012) 14:PCC.11r01336. doi: 10.4088/PCC.11r01336
78. International Electrotechnical Commission. 60601-2-26:2012 (2012). Available online at:https://webstore.iec.ch/publication/2637#additionalinfo
79. Cohen J. Statistical Power Analysis for the Behavioral Sciences. 2nd ed. Hillsdale, NJ: Erlbaum Associates (1988).
80. Shewhart WA. Economic Control of Quality of Manufactured Product. New York, NY: ASQ Quality Press (1931).
81. Pedregosa F, Varoquaux G, Gramfort A, Michel V, Thirion B, Grisel O, et al. Scikit-learn: machine learning in Python. J Machine Learn Res. (2011) 12:2825–30.
82. Seabold S, Perktold J. Statsmodels: Econometric and statistical modeling with python. In: Proceedings of the 9th Python in Science Conference. vol. 57. Austin, TX:SciPy Society (2010). p. 61.
83. Montgomery DC, Peck EA, Vining GG. Introduction to Linear Regression Analysis. vol. 821. Hoboken, NJ: John Wiley & Sons (2012). p. 190–1.
84. Tibshirani R. Regression shrinkage and selection via the lasso. J R Stat Soc Ser B Methodol. (1996) 58:267–88.
86. James G, Witten D, Hastie T, Tibshirani R. An Introduction to Statistical Learning. vol. 112. New York, NY: Springer (2013). p. 303–32.
87. Bakhshayesh AR, Hänsch S, Wyschkon A, Rezai MJ, Esser G. Neurofeedback in ADHD: a single-blind randomized controlled trial. Eur Child Adolesc Psychiatry (2011) 20:481. doi: 10.1007/s00787-011-0208-y
88. Bink M, van Nieuwenhuizen C, Popma A, Bongers IL, van Boxtel GJM. Behavioral effects of neurofeedback in adolescents with ADHD: a randomized controlled trial. Eur Child Adolesc Psychiatry (2014) 24:1035–48. doi: 10.1007/s00787-014-0655-3
89. Gevensleben H, Holl B, Albrecht B, Vogel C, Schlamp D, Kratz O, et al. Is neurofeedback an efficacious treatment for ADHD? A randomised controlled clinical trial. J Child Psychol Psychiatry (2009) 50:780–9. doi: 10.1111/j.1469-7610.2008.02033.x
90. Holtmann M, Grasmann D, Cionek-Szpak E, Hager V, Panzner N, Beyer A, et al. Spezifische Wirksamkeit von Neurofeedback auf die Impulsivität bei ADHS. Kindheit Entwicklung (2009) 18:95–104. doi: 10.1026/0942-5403.18.2.95
91. Steiner NJ, Sheldrick RC, Gotthelf D, Perrin EC. Computer-based attention training in the schools for children with attention deficit/hyperactivity disorder: a preliminary trial. Clin Pediatr. (2011) 50:615–22. doi: 10.1177/0009922810397887
92. van Dongen-Boomsma M, Vollebregt MA, Slaats-Willemse D, Buitelaar JK. A randomized placebo-controlled trial of electroencephalographic (EEG) neurofeedback in children with attention-deficit/hyperactivity disorder. J Clin Psychiatry (2013) 74:821–7. doi: 10.4088/JCP.12m08321
93. Deilami M, Jahandideh A, Kazemnejad Y, Fakour Y, Alipoor S, Rabiee F, et al. The effect of neurofeedback therapy on reducing symptoms associated with attention deficit hyperactivity disorder: a case series study. Basic Clin Neurosci. (2016) 7:167. doi: 10.15412/J.BCN.03070211
94. Drechsler R, Straub M, Doehnert M, Heinrich H, Steinhausen HC, Brandeis D. Controlled evaluation of a neurofeedback training of slow cortical potentials in children with attention deficit/hyperactivity disorder (ADHD). Behav Brain Funct. (2007) 3:35. doi: 10.1186/1744-9081-3-35
95. Duric NS, Assmus J, Gundersen D, Elgen IB. Neurofeedback for the treatment of children and adolescents with ADHD: a randomized and controlled clinical trial using parental reports. BMC Psychiatry (2012) 12:107. doi: 10.1186/1471-244X-12-107
96. Fuchs T, Birbaumer N, Lutzenberger W, Gruzelier JH, Kaiser J. Neurofeedback treatment for attention-deficit/hyperactivity disorder in children: a comparison with methylphenidate. Appl Psychophysiol. Biofeedback (2003) 28:1–12. doi: 10.1023/A:1022353731579
97. Geladé K, Janssen T, Bink M, Maras A, Oosterlaan J, et al. Behavioral effects of neurofeedback compared to stimulants and physical activity in attention-deficit/hyperactivity disorder: a randomized controlled trial. J Clin Psychiatry (2016) 77:e1270–7. doi: 10.4088/JCP.15m10149
98. Lee EJ, Jung CH. Additive effects of neurofeedback on the treatment of ADHD: A randomized controlled study. Asian J Psychiatry (2017) 25:16–21. doi: 10.1016/j.ajp.2016.09.002
99. Leins U, Goth G, Hinterberger T, Klinger C, Rumpf N, Strehl U. Neurofeedback for children with ADHD: a comparison of SCP and Theta/Beta protocols. Appl Psychophysiol Biofeedback (2007) 32:73–88. doi: 10.1007/s10484-007-9031-0
100. Li L, Yang L, Zhuo C, Wang YF. A randomised controlled trial of combined EEG feedback and methylphenidate therapy for the treatment of ADHD. Swiss Med Wkly. (2013) 143:w13838. doi: 10.4414/smw.2013.13838
101. Meisel V, Servera M, Garcia-Banda G, Cardo E, Moreno I. Reprint of “Neurofeedback and standard pharmacological intervention in ADHD: a randomized controlled trial with six-month follow-up”. Biol Psychol. (2014) 95:116–25. doi: 10.1016/j.biopsycho.2013.09.009
102. Mohagheghi A, Amiri S, Moghaddasi Bonab N, Chalabianloo G, Noorazar SG, Tabatabaei SM, et al. A randomized trial of comparing the efficacy of two neurofeedback protocols for treatment of clinical and cognitive symptoms of ADHD: theta suppression/beta enhancement and theta suppression/alpha enhancement. Biomed Res Int. (2017) 2017:3513281. doi: 10.1155/2017/3513281
103. Mohammadi MR, Malmir N, Khaleghi A, Aminiorani M. Comparison of sensorimotor rhythm (SMR) and beta training on selective attention and symptoms in children with attention deficit/hyperactivity disorder (ADHD): A trend report. Iran J Psychiatry (2015) 10:165.
104. Monastra VJ, Monastra DM, George S. The effects of stimulant therapy, EEG biofeedback, and parenting style on the primary symptoms of attention-deficit/hyperactivity disorder. Appl Psychophysiol Biofeedback (2002) 27:231–49. doi: 10.1023/A:1021018700609
105. Ogrim G, Hestad KA. Effects of neurofeedback versus stimulant medication in attention-deficit/hyperactivity disorder: a randomized pilot study. J Child Adolesc Psychopharmacol. (2013) 23:448–57. doi: 10.1089/cap.2012.0090
106. Strehl U, Leins U, Goth G, Klinger C, Hinterberger T, Birbaumer N. Self-regulation of slow cortical potentials: a new treatment for children with attention-deficit/hyperactivity disorder. Pediatrics (2006) 118:e1530–40. doi: 10.1542/peds.2005-2478
107. Moher D, Liberati A, Tetzlaff J, Altman DG. Preferred reporting items for systematic reviews and meta-analyses: the PRISMA statement. Ann Intern Med. (2009) 151:264–9. doi: 10.7326/0003-4819-151-4-200908180-00135
108. Conners CK, Pitkanen J, Rzepa SR. Conners 3rd Edition (Conners 3; Conners 2008). In: Encyclopedia of Clinical Neuropsychology. New York, NY: Springer 2011. p. 675–8.
109. Luan R, Mu Z, Yue F, He S. Efficacy and tolerability of different interventions in children and adolescents with attention Deficit hyperactivity Disorder. Front Psychiatry (2017) 8:229. doi: 10.3389/fpsyt.2017.00229
110. Minder F, Zuberer A, Brandeis D, Drechsler R. Informant-related effects of neurofeedback and cognitive training in children with ADHD including a waiting control phase: a randomized-controlled trial. Eur Child Adolesc Psychiatry (2018) 27:1055–66. doi: 10.1007/s00787-018-1116-1
111. Alkoby O, Abu-Rmileh A, Shriki O, Todder D. Can we predict who will respond to neurofeedback? A review of the inefficacy problem and existing predictors for successful EEG neurofeedback learning. Neuroscience (2017) 378:155–64. doi: 10.1016/j.neuroscience.2016.12.050
112. Rogala J, Jurewicz K, Paluch K, Kublik E, Cetnarski R, Wróbel A. The Do's and Don'ts of neurofeedback training: a review of the controlled studies using healthy adults. Front Hum Neurosci. (2016) 10:301. doi: 10.3389/fnhum.2016.00301
113. Sollie H, Larsson B, Mørch WT. Comparison of mother, father, and teacher reports of ADHD core symptoms in a sample of child psychiatric outpatients. J Atten Disord. (2013) 17:699–710. doi: 10.1177/1087054711436010
114. Narad ME, Garner AA, Peugh JL, Tamm L, Antonini TN, Kingery KM, et al. Parent–teacher agreement on ADHD symptoms across development. Psychol Assess. (2015) 27:239. doi: 10.1037/a0037864
115. Holtmann M, Pniewski B, Wachtlin D, Wörz S, Strehl U. Neurofeedback in children with attention-deficit/hyperactivity disorder (ADHD)–a controlled multicenter study of a non-pharmacological treatment approach. BMC Pediatr. (2014) 14:202. doi: 10.1186/1471-2431-14-202
116. Birbaumer N. Clinical-psychological treatment of epileptic seizures: a controlled study. Perspect Promis Clin Psychol. (1991).
Keywords: ADHD, neurofeedback, meta-analysis, analysis of bias, EEG
Citation: Bussalb A, Congedo M, Barthélemy Q, Ojeda D, Acquaviva E, Delorme R and Mayaud L (2019) Clinical and Experimental Factors Influencing the Efficacy of Neurofeedback in ADHD: A Meta-Analysis. Front. Psychiatry 10:35. doi: 10.3389/fpsyt.2019.00035
Received: 19 September 2018; Accepted: 21 January 2019;
Published: 18 February 2019.
Edited by:
Jean Marc Guile, University of Picardie Jules Verne, FranceReviewed by:
Nishant Goyal, Central Institute of Psychiatry, IndiaOlga M. Bazanova, State Scientific-Research Institute of Physiology & Basic Medicine, Russia
Poppy L. A. Schoenberg, Vanderbilt University Medical Center, United States
Copyright © 2019 Bussalb, Congedo, Barthélemy, Ojeda, Acquaviva, Delorme and Mayaud. This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.
*Correspondence: Aurore Bussalb, YXVyb3JlLmJ1c3NhbGJAbWVuc2lhdGVjaC5jb20=