Effectiveness of EEG Biofeedback as Compared with Methylphenidate in the Treatment of Attention-Deficit/Hyperactivity Disorder: A Cinical Out-Come Study ()
1. Introduction
Attention-deficit/hyperactivity disorder (ADHD) requires ongoing management and monitoring. Although robust evidence indicates the efficacy of stimulant medications in helping to manage the symptoms of school-aged ADHD children [1], typically improvements are noted in some functional domains but not in others [2]. Furthermore, only 60% to 75% of DSM-IV—diagnosed ADHD subjects respond to methylphenidate [3-9]. As concluded by Pelham [10], “other interventions are needed for non-responders or incomplete responders to medication”.
Neurofeedback, which has also been called electroencephalography (EEG) biofeedback, is reportedly used by more than 1500 practitioners [11]. The theoretical basis of neurofeedback is based on a biological model of ADHD, considering that ADHD is a disorder of neural regulation and assuming under this approach that these neural deficiencies are amenable to change using behavioral methods (e.g. an operant conditioning procedure whereby an individual learns to self-regulate the electrical activity) [11]. Since the work of Lubar and Shouse [12], several studies have used neurofeedback approaches for treating ADHD and concluded that despite some limitations, neurofeedback may be worthy of further consideration as a viable treatment approach for ADHD [13-28]; for a meta-analysis see Arns et al. [29]. However, despite these promising results, neurofeedback has not been considered a standard therapy for ADHD [28] and has been considered an inconsistent and problematic method [30]. More experimental investigations need to evaluate the cognitive and behavioral outcome following neurofeedback [31,32].
The main issue of interest was whether neurofeedback achieves an equally effective outcome as compared with stimulant medication (as a standard method). In addition, we investigated whether the two types of treatment had dissimilar effects on different domains of behavioral and cognitive functioning of children with ADHD as measured by clinical, neuropsychological and experimental tests.
2. Methodology
2.1. Design
Considering the ethical problem of including untreated patient or patient undergoing placebo, La Vaque and Rossiter [33] pointed out that, rather than comparing a new treatment (e.g., neurofeedback) to a no-treatment placebo, it should be compared to a protocol of “known efficacy” to determine whether such an intervention would result in an equivalent effect. This type of design is often referred to equivalent study [34]. Regarding the well established efficacy of methylphenidate [1], we compared the effects of neurofeedback and methylphenidate. A control group made up of healthy children to control test-retest effect.
2.2. Participants and Selection Procedure
The ADHD children were all recruited from the pediatric neurology department of Amiens University Hospital. These children had never been treated with methylphenidate. Diagnosis was based on Diagnostic and Statistical Manual of Mental Disorders, Fourth Edition (DSM-IV) criteria and inclusion was dependent on meeting the full diagnostic criteria for ADHD. For all participants, the Child Behavior Checklist (CBCL) [35] was completed by parents, and the SNAP-IV questionnaire [36] was filled out by parents and teachers. Diagnosis was then established after semi-structured interview, a clinical neurological examination, and a set of ADHD-oriented neuropsychological and behavioral tests including the ADHD Rating Scale-IV [37] full version of the WISC-III [38], the Stroop Color-Word test [39,40]. Then they were administered the Continuous Performance Test (CPT-II) [41] and an attentional capture test (ACT) [42,43]. These cases were reviewed independently by a pediatric neurologist and a psychologist blinded to each other’s findings and included in the ADHD group only if both clinicians concurred on this diagnosis. This protocol was approved by the local ethics committee. No monetary compensation was given.
Participants were selected according to the following criteria: age between 7 and 12 years; performance and verbal-scales of IQ > 80; and no additional neurologic disorders.
Originally, 40 children with ADHD participated in the study; afterwards 12 children were excluded as follows: 11 children had performance and/or verbal-scale IQ < 80; and one child had burst of diffuse irregular slow wave activities in her EEG. Hence, 28 children with ADHD participated in two treatment groups: neurofeedback (NFT) (n = 13) or methylphenidate (MPH) (n = 15) group. Assignment to the treatment groups was based on the parents’ informed choice. NFT participants did not receive any psychoactive medication during the entire study period. All participants of the NFT group completed the treatment schedule, whereas two participants of the MPH group dropped out; one of them because of excessive side effects and another one did not like to continue to take methylphenidate. This left a total of 26 children with ADHD for analysis.
As mentioned, 13 age-matched healthy children participated as the no-treating control group (CON). Therefore, our study population consisted of 39 children as follows:
• 13 ADHD children (12 males, mean age = (9.1 ± 1) SD years) as NFT group;
• 13 ADHD children (all male, mean age = (8.8 ± 1.8) SD years) as MPH group;
• 13 healthy children (10 males, mean age = (9.1 ± 1.3) SD years) as CON group.
2.3. Preand Post-Test Measures
Following measures were used for statistical propose:
• Behavioral test: the SNAP-IV which were filled out by parents and teachers. The SNAP-IV is a rating scale based on symptoms listed in the DSM-IV and includes the two subsets of symptoms: inattention and hyperactivity/impulsivity.
• Neuropsychological tests: full version of the WISC-III was performed to assess the intellectual ability. The Stroop was used to evaluate the interference score, which represents response inhibition [44,45].
• Experimental tests: the CPT-II and the ACT. The CPT is a task evaluating capacity to maintain attention during time and capacities to inhibit a prepotent motor response [41,46]. Four main scores of the CPT-II were used: reaction time (RT), standard error of RT as a variability, omission, and commission errors. The ACT is another objective test based on attentional capture paradigm. The dependent measures in this task were RT, variability, and error response.
For the NFT and MPH groups, all instruments were performed after treatment, whereas healthy children (CON) were reassessed via only the CPT-II and the ACT. All MPH participants were “wash-on” when post-test was administered.
2.4. Neurofeedback Training
Training was administered using the ProComp2® encoder (Thought Technology Ltd.) and the NeuroCARE Pro®, version 1.9 (Zengar Institute Inc.). EEG was recorded from two electrodes placed on C3 and C4 according to the International 10 - 20 system [47], referenced to linked earlobes (sampling rate: 256 Hz).
Training protocol was similar to the paradigm described by Lubar et al. [15]. The ongoing EEG with frequency range 1 - 60 Hz was band-pass filtered in two frequency ranges: theta (4 - 8 Hz) and beta1 (15 - 18 Hz). The aim of neurofeedback training was to increase the power in the beta1 band (“reward band”) and simultaneously to decrease the power in the theta band (“inhibit band”). The trainer, seated behind participant, observed information about the power in each of these frequency bands online on a monitor, and fed back audio-visually to the participant via a second monitor. Feedback consisted of negative feedback and reinforcement: whenever spectral amplitudes were below or above thresholds for inhibit or reward bands, the software briefly interrupts ongoing movie.
Twenty four neurofeedback training sessions were conducted during 10 to 12 weeks with 2 - 3 sessions per week on afternoons, weekdays or vacations. Each session consisted of 35 - 40 min, lasted ~1 hour including the time for preparation. Participants sat in a comfortable chair in front of the monitor on which the movie was shown. Our instructions were “let the movie play” and “be calm and attentive to make the movie play better”. Children were asked to pay attention to history of the movie. They were explicated that the amount of pauses were not sign of improvement or regression. However, it was informed that the pauses in the movie were giving them information, as an indication of attention, and there was no other technical error.
2.5. Methylphenidate Administration
Participants of the MPH group were administered 20 mg/day (average dose (0.71 ± 0.13) mg/kg) long action methylphenidate (Ritalin LP®) every day on morning during the entire treatment period.
2.6. Statistical Analysis
Conventional Tests: Analyses of variance (ANOVA) for repeated measurements with the factors Treatment as the within-subjects factors and Group as the between-subjects factor were separately calculated for each dependent measure. Effect sizes (ESs) were assessed with Cohen’s d [48].
Equivalence Analysis: for data reduction purpose and investigating the “equally effect of two types of treatment”, we performed the following steps:
• ESs of each dependent measure were calculated for each group, separately;
• Calculated effect sizes were averaged for NFT and MPH group, separately;
• To test the conventional “no difference” null hypothesis between mean ESs of two groups, independent t test was used.
• A statistically significant difference between two treatments may or may not be equivalent for practical purposes [49]. Equivalence analysis [50-51] was conducted to test whether the treatment-related changes in both groups could be regarded as statistically equivalent. Using equivalent analysis, we investigated “equally effect of two types of treatment” to determine whether possible differences between neurofeedback and methylphenidate were significant for clinical purposes.
3. Results
Means and standard deviations of the Pre-test and posttest of all dependent measures with results of paired t tests and ESs for the NFT and MPH groups are presented in Table 1. The effects of both types of treatment on the each dependent measure, as indicated by the ESs, are compared in Figure 1.
3.1. Behavioral Effect
There were no significant differences between MPH and NFT groups on any of the SNAP-IV scales at pre-test. According to parent’s opinion, inattention (F1, 24 = 36.55; P = 0.000; ES = 0.60) and hyperactivity (F1, 24 = 41.09; P = 0.000; ES = 0.63) were improved by both types of treatment. An interaction Treatment × Group effect was found for inattention (F1, 24 = 9.75; P =0.005; ES = 0.29) and hyperactivity (F1, 24 = 7.82; P = 0.01; ES = 0.25). As depicted in Table 1 and Figure 1, treatment effect was more pronounced in the MPH group.
3.2. Neuropsychological Effect
• Stroop Color-Word Test: seventeen children (9 in NFT and 8 in MPH group) were included for the analyses of the Stroop Test. Three children were younger than 8 years and six children had below 2 zscore in the word reading score.
At pre-test, the NFT group had more problems in interference control as compared with MPH group but did

Table 1. Means and standard deviations of the Pre-test [M1 (SD)] and the post-test [M2 (SD)] of dependent measures with results of paired t tests and effect sizes (ES) for the Neurofeedback and Methylphenidate groups.

Figure 1. Effect sizes of each dependent measures and mean (+1 SD) of these effect sizes for NFT and MPH groups. Note: ACT = Attentional Capture test; ESs = mean effect sizes of all dependent measures; Hyp = Hyperactivity; In = Inattention; Int = Interference.
not reach significant (P = 0.06). There were a significant main effect of Treatment (F1, 15 = 7.14; P = 0.01; ES = 0.32) and Treatment × Group interaction (F1, 15 = 7.6; P = 0.01; ES = 0.33). Paired t test revealed that the interference score was significantly improved by only NFT.
• WISC-III: there were no significant differences between groups on any of the WISC-III scores at pretest. Main effects of Treatment were identified for the full IQ scale (F1, 24 = 11.65; P = 0.002; ES = 0.38). Analysis of the subscores demonstrated that this effect was accounted for mainly by the performance score (F1, 24 = 22.24; P = 0.000; ES = 0.48), but not by the verbal score (F1, 24 = 0.17; P = 0.67; ES = 0.007). Further analysis confirmed that all subtests of the performance score, except Block Design, increased significantly from preto post-test. Main effect of Treatment was significant for Picture Completion (F1, 24 = 13.74; P = 0.001; ES = 0.36), Coding (F1, 24 = 41.09; P =0.000; ES = 0.63), Picture Arrangement (F1, 24 = 4.84; P = 0.03; ES = 0.17), and Object Assembly (F1, 24 = 10.25; P = 0.004; ES = 0.29) for MPH group only. Regarding the verbal subtests, except Digit Span (F1, 24 = 10.96; P = 0.003; ES = 0.31), none of the subtests were significantly changed. There were no further significant main effects or interactions for any of the three scores or subtests.
3.3. Experimental Effect
• CPT-II: there were pre-test differences between groups on the omission errors (F2, 36 = 5.74; P = 0.007) and variability of RT (F2, 36 = 4.59; P = 0.01). Post hoc LST test yielded significant between ADHD (NFT and MPH groups) and control children (P < 0.008 for omission errors and P < 0.01 for variability), but not between NFT and MPH groups. No significant differences between groups (NTF versus MPH versus CON) were found for the commission errors RT at pre-test.
For omission errors, a significant main effect of Treatment (F1, 36 = 17.46; P = 0.000; ES = 0.32) and Treatment × Group interaction (F2, 36 = 9.24; P = 0.001; ES = 0.33) was identified. Here, improvements were observed from preto post-test in both NFT and MPH groups, but not in control children. Three groups did not differ significantly at post-test. Similarly, there were a significant main effect of Treatment (F1, 36 = 6.43; P = 0.01; ES = 0.15) and Treatment × Group interaction (F2, 36 = 4.06; P = 0.02; ES = 0.18) for the variability which improved by both treatments. Changes in control children were not significant. Three groups did not differ significantly at post-test. Although main effect of Treatment was significant (F1, 36 = 17.74; P = 0.000; ES = 0.33) for commission errors, there was no significant Treatment × Group interaction. Commission errors were significantly reduced from preto post-test in control children as well (t12 = 2.36; p = 0.03; ES = 0.51). Furthermore, since this measure was not statistically different between the ADHD and control groups it could not be a viable measure in this study, hence was excluded from further analysis. For RT, there was no significant main effect (Treatment, Group) or interaction.
• Attentional Capture Test: for RT, no significant differences between groups (NTF versus MPH versus CON) were found at pre-test. However, we observed a significant main effect of Treatment (F1, 36 = 5.43; P = 0.02; ES = 0.13), but no significant Treatment × Group interaction. Paired t test revealed that response speed was significantly improved by only methylpenidate. Changes in NFT group and control children were not significant. Three groups did not differ significantly at post-test. Although the effect of distractors was significant on RT for distractor trails (507 ± 35) compared to control trails (493 ± 130; t38 = 2.36; P = 0.02), there was no significant main effect (Treatment, Group) or interaction.
Pre-test differences between groups on the variability of RT (F2, 36 = 6.04; P = 0.005) were found. These differences were between ADHD and controls (P < 0.01), but not between NFT and MPH groups. Main effect of Treatment (F1, 36 = 12.07; P = 0.001; ES = 0.25) was significant. Effect of Treatment × Group interaction was marginally significant (F2, 36 = 3.17; P = 0.054; ES = 0.15). While changes in control children were not significant, variability was reduced by both treatments. Three groups did not differ significantly at post-test.
At pre-test, control children had less response error, but was not significant. Significant main effect (Treatment, Group) or interaction was not also observed.
3.4. All Dependent Variables
As shown in Table 2 and Figure 1, while ESs by MPH (M = 0.791, SD = 0.631) is greater than by NFT (M = 0.558, SD = 0.344) treatment, this difference was not significant. The equivalent analysis indicated that NFT and MPH did not also equivalent.
4. Discussion
Based on the SNAP-IV, parents reported reduced ADHDrelated behaviors in both groups; however, MPH was more effective than NFT. Although two groups were not significantly different at pre-test, larger MPH effect might obtain from degraded inattention and hyperactivity in the participants of MPH group.
In terms of response inhibition, experimental data demonstrated that the effect of MPH on the interference score of the Stroop test is inconclusive [52]. While some studies [42,53] have not found significant effect, some others [52,54,55] have reported a positive effect. To our knowledge, no study has compared NFT with MPH effect by using the Stroop test. Nevertheless, in an fMRI study using Counting Stroop test, not only increased accuracy in interference trails, but also activation in the right anterior cingulate cortex was observed in NFT compared with no-treatment ADHD group [26]. In the present study, positive treatment effect was found only for NFT. One explanation could be that NFT group benefit more than MPH group because of degraded interference control in pre-treatment. Future research is

Table 2. Conventional and equivalence test results of the means and standard deviations of the effect size of all dependent measures for the neurofeedback (NFT) and methylphenidate (MPH) groups.
recommended to investigate the effect of NFT and MPH on interference control, as a form of response inhibition.
In accordance with other studies [15,17,18,21,28], intellectual ability as assessed with the WISC-III improved by both NFT and MPH. It is likely that the observed changes were attributable to practice effects [21]. However, the improvement on the intelligence performance scores, but not on the verbal one, allow us to suggest that this improvement is related to attention enhancement. In this line, improvement on the digit span subtest, which is related to the attention/executive function [56] seems to confirm our conclusion.
Cognitive improvement by both types of treatment was also observed using two experimental tests, confirming that improvements were not limited to parent’s opinion. Treatment effect size was larger in the RT-variability and omissions, especially in the CPT-II. These behavioral, neuropsychological and experimental findings accord with those of previous studies demonstrating that NFT can significantly improve several behavioral and cognitive functions in ADHD children [12-28].
Calculated NFT and MPH effect sizes for each dependent measure were different. It would suggest that the impact of the two types of treatment on different domains of behavioral and cognitive functioning of children with ADHD did not similar.
With respect to the performed calculations on all measures, we found that although averaged effect size for MPH was greater than for NFT, both were in medium range and the difference was not significant. The equivalent analysis indicated that NFT and MPH did not also equivalent. These results (not different obtained from conventional tests, but not equivalent obtained from equivalence analysis) demonstrate that the variability is too great relative to the effect size to interpret. The small sample size limits the interpretability of the clinical outcome comparison. Proving the equivalence of both treatments would require a much larger sample.
The present study overcame some aforementioned methodological shortcoming [30] by comparing NFT with a standard method (MPH) and by applying both objective assessments and external ratings by parents. Nevertheless, our study had a number of limitations. Lack of random assignment may result in possible differences in treatment motivation and effects of expectancies. However, it was not feasible to administer either treatment without the parents’ consent. Although it is tempting to conclude that specific training in changing brainwave activity was responsible for the treatment effects, this conclusion cannot be made with certainty. The study design did not enable this possibility to conclusively rule out because of ethical issues.
Overall, our findings might provide further support to the view that neurofeedback can be considered an effective treatment for children with ADHD, at least an appropriate adjunctive treatment for non-responders or incomplete responders to medication as well as for those their parents favor a non-pharmacological treatment.
5. Acknowledgements
We would like to acknowledge all parents of participants and Mme Pataille for the recruitment of the control children. We gratefully thank Dr John Vessy, Wheaton College, for providing advice and feedback regarding the equivalence analysis.