Main

Relapse rates following treatment for substance dependence are remarkably high, and efforts to address this unfortunate reality are becoming an integral part of treatment (Daley and Marlatt 1997). In outcome studies of alcoholics, for example, approximately 65–70% of patients have been found to relapse within one year of treatment, with the majority of these patients relapsing within less than three months (Hunt et al. 1971; Emrick 1974; Miller and Hester 1986). In outcome studies of drug- or polysubstance-dependent patients, relapse rates following treatment are similar to, if not greater than, those found for patients solely dependent on alcohol (Emrick 1974; McKay et al. 1999).

A number of investigators have focused on analyzing the multiple variables associated with relapse in an attempt to develop screening tools that might identify subpopulations most in need of relapse-prevention treatment. A number of risk factors have been identified. More severe levels of alcohol or drug dependence have, for example, been associated with an increased likelihood of relapse (Booth et al. 1991; Kampman et al. 1998; Langenbucher et al. 1996; McLellan et al. 1994; Winterer et al. 1998). In such studies, severity of dependence has been quantified using scores on either alcohol or drug use problem scales, DSM symptom counts, or the number of previous hospitalizations.

The presence of an additional, comorbid psychiatric disorder is likewise associated with an increased risk for relapse. One such disorder is the presence of severe conduct problems both before and after age 15, i.e., a conjoint diagnosis of Conduct Disorder (CD) and Antisocial Personality Disorder (ASPD) (Hesselbrock 1991; Leal et al. 1994; Woody et al. 1984, 1985). Comorbid depression also appears to promote relapse. However, the association between comorbid depression and treatment outcome is complex, and appears to depend upon the nature of the treatment, the gender and age composition of the patient sample, and the preferred drug of abuse (Hesselbrock 1991; Bobo et al. 1998; Brown et al. 1998; Hodgins et al. 1999; Kranzler et al. 1996; Rounsaville et al. 1986; Sellman and Joyce 1996; Ziedonis and Kosten 1991).

Although the above-mentioned clinical studies have demonstrated an association between greater illness severity, psychiatric comorbidity, and relapse, the association has not been replicated in other studies. Furthermore, when these same variables have been entered into more rigorous analyses (e.g., logistic regression or discriminant function) designed to predict the outcome of individual cases, the resulting sensitivity and specificity in prediction have been less than ideal. In part, the limited power of CD/ASPD, depression, and illness severity for specifying outcome might be attributed to inaccuracy in the patient's recall of his/her history and severity of substance abuse as well as imprecision in the diagnosis of comorbid disorders. Accordingly, the addition of an objective measure, which is less influenced by the reliabilities and validities of the patient's report and the clinical diagnosis, might markedly improve our ability to forecast outcome and plan appropriate preventive measures.

Several recent studies employing objective and quantitative electroencephalographic (EEG) techniques have already demonstrated an association between the amount of β (i.e. > 13 Hz) activity in the spontaneous electroencephalogram and relapse to alcohol or cocaine abuse. Our initial report on this topic (Bauer 1994) described a study of 17 alcohol-dependent inpatients and 14 healthy, nonalcoholic volunteers from the community. The patients were tested after 7 and 14 days of verified abstinence. The control group was also repeatedly tested to control for familiarization effects. All of the participants were medication-free at the time of testing and were screened to exclude individuals with other drug dependence, seizures, and similar factors that would complicate their EEGs. Patients were followed for a 3-month period after leaving treatment. Via detection procedures which included self-reports, weekly breathalyzer tests, and collateral informants, the patient group was divided into subgroups defined by the presence versus absence of any alcohol use during the 3-month post-treatment period. Differences in baseline EEG spectral power, and in autonomic, demographic, and other pre-treatment characteristics were tested between the Relapse-prone and Abstinence-prone patient groups, and the nonalcoholic control group.

The results showed that patients with unfavorable versus favorable outcomes were similar in demographic and clinical characteristics. In the face of these similarities, EEG activity alone significantly differentiated the groups: Relapse-prone patients exhibited more high frequency β activity in their EEGs than Abstinence-prone patients. This suggestion of heightened central nervous system arousal associated with future relapse was convergently validated by the demonstration of corresponding elevations in cardiac output and rate within the Relapse-prone group.

In 1998, a group from Germany (Winterer et al. 1998) replicated our 1994 findings using 38 alcohol-dependent patients studied after seven days of abstinence. Their analyses of demographic and substance use characteristics also revealed relatively few differences between the Abstinence-prone and Relapse-prone groups. When measures of EEG activity were entered into a discriminant function analysis, the sensitivity for predicting relapse was about 88%. Abstinent outcomes were correctly identified for 92% of the cases. The overall classification accuracy of approximately 90% was greater than the 59% accuracy associated with only clinical measures.

In 1999, a research group from New York University (Prichep et al. 1999) extended the idea of relating baseline EEG activity to outcome in a group of cocaine-dependent patients enrolled in a residential treatment program. Their approach to the analysis was somewhat different. For example, in the NYU study, retention in treatment was the criterion variable, not relapse per se. These investigators also assigned their 35 patients to groups not on the basis of treatment outcome, but on the basis of a cluster analysis of several EEG variables. The relationships between the empirically-derived EEG clusters and several demographic and substance use variables were then examined. There were no significant differences between the EEG clusters except for the membership's average length of stay in residential treatment. One cluster of patients, which was characterized by a shorter length of stay in treatment in comparison to the other cluster, was also characterized by a relative abundance of EEG β activity. When viewed in context, the NYU study is interesting because it suggests that the association between enhanced EEG β activity and negative outcome generalizes to another substance-abusing population (viz. cocaine abusers) and to an alternate operational definition of a negative outcome (viz. a shorter length of stay in treatment).

The present study was designed to extend previous EEG studies of relapse prediction in five key areas: (1) It utilized a larger number of patients (N = 107) than previous EEG studies and therefore provided adequate statistical power and the opportunity for subgroup analyses. (2) It formally determined whether the value of EEG activity as a predictor of relapse exceeds that of other variables which are more easily and less expensively measured. (3) The present study formally tested the generality of EEG as a relapse predictor across four different types of substance dependence. (4) The present study utilized collateral informants and frequent (1–2x/week), unannounced urine and breath assays as the method for detecting relapse. (5) And, finally, the present study examined whether EEG methods are informative in a residential treatment population that has already demonstrated at least one month of successful abstinence.

METHODS

Patient Recruitment and Screening

Data were obtained from 107 patients carefully selected from residential substance abuse treatment programs in the Hartford, Connecticut, area. As a control, an additional 22 subjects were recruited. The latter group of subjects had no history of substance dependence.

All of the subjects were healthy. None was taking prescribed or non-prescribed medications. Candidates were also excluded if they had a history of seizures, head injury, schizophrenia, major affective disorder, mental retardation, or other significant medical disorders, including cardiovascular, hepatic, immunologic, or renal disease. Medical and psychiatric histories were evaluated using structured interviews, viz., the SSAGA (Bucholz et al. 1994) for DSM-IIIR and the Addiction Severity Index (McLellan et al. 1980), administered by a trained research assistant. The interviews were audio-taped and monitored by the author to verify their consistency and validity. Additional demographic, medical, psychological, and drug use information was garnered from medical records and questionnaires, viz., Michigan Alcoholism Screening Test (MAST; Selzer 1971), Drug Abuse Screening Test (DAST-10; Skinner 1982), Spielberger State-Trait Anxiety Inventory (STAI; Spielberger 1983), Beck Depression Inventory (BDI; Beck et al. 1961), and the Shipley Institute of Living Scale (SILS; Shipley 1940).

At the time of the EEG assessment, patients had been abstinent for minimum of one month and maximum of five months. Following the EEG assessment and for a subsequent 6-month period, patients were visited frequently (i.e. 1–2x/week) and unpredictably for the purpose of detecting relapse or continued abstinence via urine (EZ Screen, Editek Inc.) and breath screening. Interviews with patients and informants were also conducted at these times. Patients were assigned to the Relapse-prone group if they used alcohol, cocaine, opiates, or benzodiazepines on one or more occassions during a 6-month tracking period. Patients were assigned to the Abstinence-prone group if they demonstrably maintained abstinence from these substances over the 6-month tracking period. Prior to their formal enrollment in the study, patients and controls signed an approved informed consent agreement describing the research procedures and the study's risks and benefits.

Acquisition and Reduction of EEG Data

EEG activity was amplified (gain = 10K, bandpass = 0.1–40 Hz) and digitized at 100 Hz over a 5-minute period under eyes-closed conditions. Fifteen electrode sites, arranged in the International 10/20 System, were sampled. EEG epochs of 1.28 sec duration were detrended, baseline-corrected, and cosine tapered. The epochs were edited for EMG/movement (peak-to-peak deflection > 50 microvolts), lead sway and A-D converter overflow artifacts, and mathematically corrected for eye movement artifact. The accepted EEG epochs were then submitted to a Fast Fourier Transform which computed the power (in μV2) of the EEG in 0.78 Hz steps over a total range of 1.5–40 Hz. The power spectra were averaged over epochs. Estimates of power within the δ (1.5–3.2 Hz), θ (3.8–7.02 Hz), α (7.8–12.5 Hz), and slow (13.2–18.75 Hz) and fast (19.5–39.8 Hz) β frequency bands were expressed as a proportion of total power across the entire frequency range. For the purpose of maximizing reliability and minimizing noise, these relative power estimates for each frequency band were averaged across the 15 electrode sites in all but one of the analyses (see below).

Data Analysis

Demographic, psychological, and drug use differences between the Abstinence-prone and Relapse-prone patient groups and the Non-Dependent Control group were evaluated using one-way ANOVA for continuous variables and Pearson's Chi-Square test for categorical variables.

The strategy for evaluating the relationship between EEG activity and outcome proceeded in several stages:

The first stage in the analysis was a set of five 1-way ANOVAs comparing the three subject groups (Abstinence-prone, Relapse-prone, Control). To eliminate the potential complication of subtle and confounded group differences in illness severity, alcohol (MAST), drug use (DAST-10), and depression (BDI) severity scores were entered as covariates. Separate ANCOVAs were performed for relative power in each of the five preselected, nonoverlapping frequency bands. Tukey post hoc tests were used to determine the source of significant differences suggested by the ANCOVA.

Another ANCOVA was performed focusing only on the EEG frequency that revealed group differences in the prior analysis. This analysis was designed to examine the legitimacy of combining data across electrode sites. The ANCOVA included electrode site and outcome as within- and between-subjects factors, respectively. The effects of illness severity were again controlled by utilizing MAST, DAST-10, and BDI scores as covariates. The control group was omitted from this and all subsequent analyses.

The third analysis involved an assignment of the 107 patients to one of four groups defined by their DSM-IIIR substance dependence diagnosis (alcohol, cocaine, alcohol and cocaine, or opioid/polydrug dependence) during the 12 months preceding admission to residential treatment. This analysis did not utilize covariates. It was structured as a 4 (diagnostic group) by 2 (outcome) ANOVA, with EEG fast β power as the dependent variable.

The fourth stage in the analysis included childhood Conduct Disorder (CD) and paternal alcoholism, along with outcome, as independent variables in a single 2 × 2 × 2 ANCOVA.

The final two analyses utilized stepwise logistic regression for the purpose of evaluating the predictive value of clinical measures alone versus the combination of clinical measures and EEG fast β power. Cohen's Kappa (k) statistic was used to formally evaluate the agreement between actual and predicted outcomes. F-to-enter statistics and the respective significance levels provided an estimate of the relative value of various predictors. Predictive accuracy was summarized using standard descriptors:

• Sensitivity = Proportion of true positives (relapse) who test positive [# true positive/(# true positive + # false negative)].

• Specificity = Proportion of true negatives (abstinent) who test negative [# true negative/(# false positive + # true negative)].

• Positive Predictive Value = Proportion of test positives who are in fact positive [# true positive/(# true positive + # false positive)].

• Negative Predictive Value = Proportion of test negatives who are in fact negative [# true negative/(# false negative + # true negative)].

RESULTS

The demographic, psychological, and substance use characteristics of the Abstinence-prone, Relapse-prone, and Non-Dependent Control groups are shown in Table 1 . On average, subjects were 34.9 years of age. Approximately 67% of the patients were male. Approximately 45% were Caucasian.

Table 1 Demographic and Baseline Psychological and Substance Use Characteristics of Subject Groups

The two patient groups reported higher scores than the Non-Dependent Control group on depression, trait anxiety, and alcohol and drug problem scales. However, the patient groups did not differ from one another on these scales. Both patient groups had been abstinent for a similar period (approx. three months) at the time of baseline EEG data collection.

Only two background measures differentiated between the Relapse-prone and Abstinence-prone groups. The Relapse-prone group was comprised of a higher percentage of members meeting DSM-IIIR diagnostic criteria for childhood Conduct Disorder. In addition, Relapse-prone patients exhibited a significantly lower IQ score on the SILS than the other group.

The overall effect of subject group on relative EEG power within the five preselected frequency bands was evaluated using five separate one-way ANCOVAs. Figure 1 shows that power in the high frequency β band significantly differentiated the groups (F = 3.6, p = .03). Tukey post-hoc tests showed that Relapse-prone patients exhibited significantly more high frequency β activity than the other two groups.

Figure 1
figure 1

Relative power (+ 1 SE) in the δ, θ, α, and slow and fast β EEG frequency bands as a function of group (Relapse-prone, Abstinence-prone, or Non-dependent Control).

A topographic analysis (Electrode Site × Outcome Group ANCOVA) of fast β power for the Abstinence-prone and Relapse-prone groups revealed that power was maximal (Electrode Site: F = 5.26, p = .009) in regions along and anterior to the central sulcus. The scalp distribution was similar in both groups (Electrode Site × Outcome Group: F = 0.73, p = .46).

To further localize the anatomical source of EEG fast β power, we submitted group-averaged fast β power for each electrode site to Current Source Density analysis and constrained the search for sources via the Boundary Element Method. CSD analysis is a state-of-the-art method which attempts to find a distributed solution for source localization. CSD differs from dipole modeling algorithms which can make unrealistic assumptions about the number of likely generators and their size or orientation. The Boundary Element Method represents a significant enhancement in localization algorithms because it constrains the search for sources according to actual skin, skull, and cortical depths and shapes. It is, in fact, based on a structural MRI as the model, as opposed to non-BEM methods which assume that the head and brain form concentric spheres. As one can see in Figure 2 , the Minimum Norm Least Squares CSD/BEM algorithm identified the most anterior regions of the frontal brain as the likely source of fast β power.

Figure 2
figure 2

Topographic map of current source density distribution of fast β power for the Relapse-prone and Abstinence-prone groups. The yellow region defines an area characterized by maximal CSD.

A separate analysis of the data asked whether the abnormal elevation in EEG fast β power associated with future relapse varied with the preferred substance of abuse. For this analysis, patients were assigned to one of four mutually exclusive groups, viz., alcohol-dependent (n = 21), cocaine-dependent (n = 28), alcohol- and cocaine-dependent (n = 34), or opioid/polydrug-dependent (n = 24), using DSM-IIIR criteria. The analysis revealed significantly (F = 3.2, p = .02) greater levels of fast β activity in the cocaine- and alcohol-dependent, alcohol-dependent and cocaine-dependent groups relative to the opioid/polydrug dependent group. Yet, the interaction of outcome and type of substance dependence was not significant (F = 0.29, p = .83). Furthermore, tests of the effects of outcome were significant at p < .05 for all four categories of dependence.

Since the abnormal elevation in β power in Relapse-prone patients is not related to their preferred drug of abuse, might it be related to a premorbid factor that is common across alcohol, cocaine, and opioid dependence? A 3-way ANOVA was used to test for interactions between two known risk factors, namely a family history of alcoholism (Cotton 1979) and DSM-IIIR Conduct Disorder (Robins 1966), and outcome. The interaction of these three variables was significant (F = 8.02, p = .006; Figure 3 ), suggesting that the neurophysiological abnormality associated with relapse is related to the same neurophysiological abnormality that accompanies increased risk for the initial onset of alcoholism and drug abuse.

Figure 3
figure 3

Relative power (+1 SE) in the fast β frequency band as a function of outcome and two premorbid risk factors for substance dependence, viz., childhood Conduct Disorder and paternal alcohol dependence

We then asked the more important question regarding the relative value of clinical, drug use, and EEG predictors of relapse. The first logistic regression (Table 2 ) included the number of alcohol and drug abuse problems, and the four combinations of the paternal alcoholism by Conduct Disorder factorial. The resulting accuracy in predicting outcome using these characteristics was no better than chance. The average classification accuracy was only 56.5%. The associated test for agreement between actual and predicted outcomes was not significant (Cohen's κ = 0.1, p = n.s.).

Table 2 Logistic Regression Analysis Results

The second logistic regression included these same variables along with fast β power. With this change in input parameters, the accuracy in predicting outcome improved from 56.5% to 74.3%—a better than chance level. The statistics evaluating the level of agreement between actual and predicted outcomes also improved (Cohen's κ = 0.45, p < .0001; Sensitivity = 0.61, Specificity = 0.85, Positive Predictive Value = 0.75; Negative Predictive Value = 0.73). Most importantly, though, fast β power far outweighed the clinical and demographic variables as a predictor of relapse (see Table 2). A comparison of the associated F-to-enter statistics indicates that fast β power explained more variance in predicting relapse than any of the remaining putative predictors. Relative fast β power was, in fact, the only predictor that attained statistical significance within the step-wise model. The best-fitting logistic regression equation was: Outcome{0 = abstinent, 1 = relapse, cutpoint = 0.5} = (13.51 * Fast β Relative Power) – 1.06.

DISCUSSION

In summary, the present results, and those reported by others, indicate that the value of EEG fast β power for predicting relapse can be generalized across patients with histories of either alcohol, cocaine, cocaine and alcohol, or opioid dependence. Furthermore, the time at which the EEG is recorded does not compromise its predictive value. In both the Winterer et al. (1998) and Bauer (1994) studies, the EEG was recorded from hospitalized patients after one week of abstinence. In the present study, the EEG was recorded from residential treatment program patients after an average of three months of abstinence.

Some readers might question the neural origin of fast β power and whether it genuinely reflects brain activity or is merely an artifact of movement and associated electromyographic activity. The careful methods by which the EEG data were screened (see Methods) for the absence of electromyographic artifact argue against the latter alternative. Furthermore, fast β power exhibits high heritability (Eischen et al. 1995; Propping et al. 1981) and excellent test-retest reliability (Lund et al. 1995; Nagakubo et al. 1991). Heritability and reliability are not characteristics of an artifact.

A critical reader might likewise be curious about neuropathological mechanisms which mediate the association between fast β power and relapse risk. In this regard, it is important to note that current source density analysis techniques employed in the present study (see Figure 2) localized the source of fast β activity to deep, anterior regions of the frontal brain. A frontal source for the underlying neuropathology was also suggested by the results of a previous P300 event-related potential study (Bauer 1997) of cocaine abusers at lower versus higher risk for relapse. In that study, relapse-prone patients exhibited a specific deficit in the amplitude of the P3a subcomponent. The P3a subcomponent appears to be generated within the orbitofrontal cortex (Clark et al. 2000). Thus, the results of two analyses now suggest that relapse-prone patients exhibit a deficit in the function of a brain region that normally serves to dampen impulsivity and emotionality.

An unresolved question in understanding the neuropathology of relapse risk relates to the frequency band which discriminated the groups. Within the confines of non-quantitative EEG evaluations performed by clinical electroencephalographers, β activity is often discounted because it offers no value for localizing specific sources of seizure activity. Furthermore, enhanced β activity is rarely detected among patients with diffuse cerebral atrophy secondary to probable Alzheimer's Disease and other forms of dementia (Niedermeyer 1999). It is worth noting, however, that enhanced low voltage, high frequency activity will sometimes presage the larger voltage, low frequency spike and/or wave complexes that define an epileptic event. No one should infer from this statement that the enhanced EEG β activity found in relapse-prone patients is equivalent to epileptic activity. Rather, it may only reflect a subtle and persistent dysregulation of neural coherence that superficially resembles the abnormal EEG activity detected before seizure onset.

The association demonstrated presently between fast β power and two premorbid risk factors—namely, childhood Conduct Disorder and a family history of alcoholism—is not unprecedented in the literature. In 1993, we (Bauer and Hesselbrock 1993) reported that young adult men who possessed both of these risk factors also exhibited a relative enhancement in EEG fast β power. Importantly, the young men enrolled in our 1993 study did not meet DSM-IIIR criteria for psychoactive substance dependence. It therefore appears that the abnormal elevation in fast β power among patients who later relapse is determined by the conjoint effects of these premorbid risk factors.

The importance and ultimate clinical applicability of the present findings for the management or treatment of substance abuse is tempered by the need to validate our regression equation in an independent and more heterogeneous sample of substance abusers. Yet, the present findings do suggest that EEG assessments could now be useful in treatment matching research or other controlled research endeavors. Future studies should evaluate the question of whether fast β power retains its value for predicting relapse among medically-complicated substance abusers, or among substance-abusing patients receiving medications, e.g., benzodiazepines (Bauer 2000), which alter the frequency distribution of the EEG. If specific medications or medical disorders are found to reduce the prognostic significance of fast β power, then researchers could either develop a different EEG threshold for detecting future relapse in these specific subpopulations or choose to not apply the predictive power of the EEG in these patients.

One should also consider the importance of the present findings in the context of current restrictions on research and health care budgets, and the efficiencies that may derive from directing prevention and treatment efforts toward individuals at highest risk for relapse. With regard to budget restrictions, for example, it is noteworthy that EEG technology is more practical and affordable than the other neuroimaging technologies, including fMRI, PET, or SPECT. Accordingly, EEG technology can be realistically implemented in large or small treatment settings for either research or clinical indications. Its superiority, relative to clinical or demographic measures, in predicting relapse suggests that it may have value in guiding patients toward more versus less intensive treatments. The inferior spatial resolution of the EEG, relative to the other neuroimaging modalities, can be remedied in part through the use of high density electrode arrays and modern statistical methods of current source localization. Such methods, which were employed in the present study, and dipole localization methods, which were employed in a previous study (Michel et al. 1992), implicate deep, anterior regions of the frontal lobe as the likely generator of EEG fast β activity. It is therefore reasonable to hypothesize that the normal function of these regions is altered among patients at high risk for relapse to substance abuse.