Abstract
Group analysis of brain magnetic resonance imaging (MRI) metrics frequently employs generalized additive models (GAM) to remove contributions of confounding factors before identifying cohort specific characteristics. For example, the National Consortium on Alcohol and NeuroDevelopment in Adolescence (NCANDA) used such an approach to identify effects of alcohol misuse on the developing brain. Here, we hypothesized that considering confounding factors before group analysis removes information relevant for distinguishing adolescents with drinking history from those without. To test this hypothesis, we introduce a machine-learning model that identifies cohort-specific, neuromorphometric patterns by simultaneously training a GAM and generic classifier on macrostructural MRI and microstructural diffusion tensor imaging (DTI) metrics and compare it to more traditional group analysis and machine-learning approaches. Using a baseline NCANDA MR dataset (Nā=ā705), the proposed machine learning approach identified a pattern of eight brain regions unique to adolescents who misuse alcohol. Classifying high-drinking adolescents was more accurate with that pattern than using regions identified with alternative approaches. The findings of the joint model approach thus were (1) impartial to confounding factors; (2) relevant to drinking behaviors; and (3) in concurrence with the alcohol literature.
Similar content being viewed by others
Introduction
After birth, the human brain undergoes profound change that continues throughout adolescence and into young adulthood1. A consensus of cross-sectional and longitudinal magnetic resonance imaging (MRI) studies suggests that cortical gray matter volume declines and the cortical mantle thins2,3, but white matter volume, microstructural organization, and myelination of fiber tracts increase4,5, during healthy adolescent development. In this developmentally critical second decade of life, young people commonly engage in risky behaviors, including consumption of alcohol. A recent U.S. survey estimates that 66% of 18-year-olds have drunk alcohol and about 25% report getting drunk6. A rising incidence of binge drinking may put developing youth at particularly high risk for deviations from the normal trajectory of brain development7. Longitudinal studies of heavy relative to minimal drinking during adolescence report acceleration of gray matter volume shrinkage, attenuation of white matter growth8, and decreased fiber integrity9. Similar but subtler developmental changes have been detected in youth who drink regularly, if not heavily10. Despite such reports of quantifiable effects of drinking on normal neurodevelopmental trajectories, weak effects may be difficult to extricate using traditional, hypothesis-driven methods11 and may be enhanced by the use of machine-learning approaches to determine group separating characteristics12.
In neuroimaging studies, identifying group differences using classification approaches can be straight forward if the groups are of equal sample size and matched with respect to demographic factors such as age, sex, and ethnicity13,14,15. However, a challenge of many neuroimaging studies is statistical power, particularly given the number of potentially confounding factors16. For example, the National Consortium on Alcohol and Neurodevelopment in Adolescence (NCANDA)17, a landmark longitudinal study supported by the National Institute on Alcohol Abuse and Alcoholism and the National Institutes of Health Big Data to Knowledge initiative, has been collecting MRI and neuropsychological data in adolescents and young adults to (1) expand knowledge about normal brain maturation; (2) document changes following initiation of moderate-to-heavy alcohol consumption; and (3) identify imaging markers that predict early-onset alcohol use disorder (AUD). The number of youth with a notable history of alcohol consumption at baseline was small17. To power this investigation adequately, however, the study also recruited youth with minimal alcohol exposure at baseline that had a high risk for transitioning to the AUD phenotype during the course of the 10-year study.
One popular approach for analyzing unbalanced data sets is to include only subsets of the collected sample matched with respect to basic demographic variables. For example (in support of the first aim of the NCANDA study), age-matched samples selected from another large cohort study, the āPediatric Imaging, Neurocognition, and Geneticsā data set confirmed the longitudinal brain developmental patterns identified in the minimal drinking adolescents of the NCANDA cohort3. Specific to the NCANDA cohort and its second aim, the study also reported smaller and thinner frontal and temporal cortices for the group initiating moderate-to-heavy alcohol consumption relative to the minimally-drinking group. Matching cohorts, however, is not always successful in revealing significant group differences. For example, analyses of diffusion tensor image (DTI) data from demographically-matched subsets of the NCANDA study did not reveal effects of moderate-to-high drinking on DTI metrics (i.e., regional fractional anisotropy, mean diffusivity, axial diffusivity, and radial diffusivity)4. This was surprising given evidence that excessive alcohol consumption in adults disrupts white matter microstructure of select fiber systems18,19,20,21.
An alternative approach to analyzing unbalanced data is to include the entire sample, but to remove the effects of confounding variables before performing group analysis12,22,23,24,25,26,27. Regression approaches, such as the āordinaryā generalized additive model (GAM), remove the effects of confounding factors by first modeling the relationship between the dependent variable (e.g., volume of the corpus callosum) and confounding factors (e.g., age) on a subset of the sample (e.g., minimal alcohol-consuming healthy controls)3, then using that model to remove the effect of confounding factors from each dependent variable so that residuals of the raw metric are used in group analyses. However, GAM often suffers from sensitivity to noise, as demonstrated, for example, by the variance in age associated with peak white matter microstructural maturation4. Robust regression claims to address the sensitivity issue by separately modeling the effects of confound and noise in MRI metrics28. While robust regression has often been used in large neuroimaging studies29, the noise model requires a-priori specification, which can reduce the power of the analysis. For example, a cautious threshold for accounting for noise generally results in a robust GAM but the effects of confounding variables are then estimated on a notably reduced sample size. A small sample generally fails to capture comprehensive effects of confounding factors and the resulting GAM is thus likely inaccurate. We hypothesized that typical sequential use of the GAM to isolate the effects of confounding variables on MR metrics would minimize information relevant for distinguishing groups (e.g., adolescents with a drinking history relative to those without a significant drinking history).
To test the hypothesis, we apply a machine learning approach to the baseline NCANDA neuroimaging data set. Our proposed approach, referred to as Joi-GAM-Class (for joint GAM classification) simultaneously determines optimal parameters (1) of a GAM (for removing the influence of confounding factors) and (2) a logistic classifier (for cohort classification). The classifier identifies a subset of variables (i.e., residual scores of imaging metrics) that is most informative for differentiating minimal from regular drinking youth. We refer to this subset of brain measurements as pattern. To identify a pattern, the classifierās search for informative brain metrics is constrained to subsets of a certain size, enforced by embedding āsparsity constraintsā into the classification model12. To help with an initial understanding of the method used herein, Fig.Ā 1 presents the output of three approaches analyzing a synthetic data set. FigureĀ 1(a) plots an arbitrary image metric (y-axis) relative to age (x-axis). The green dots represent the imaging metrics of the minimal drinkers and the black ones of the regular drinkers. For both cohorts, the metric is clearly effected by age, a confounding factor also in the NCANDA data. The effects of age outweigh the effects of group when the classifier is applied directly to raw imaging metrics (i.e., not residuals) as the two cohorts are not separated accurately (Fig.Ā 1(b)). FigureĀ 1(c) shows a few samples that are mislabeled by classification based on residual scores of raw imaging metrics, i.e., after the confounding effects of age are removed via robust GAM. The GAM was parameterized based on the imaging metrics of the control group, i.e., the minimal drinkers. As is true with real data, however, the noise associated with raw imaging metrics made it highly unlikely to recover the ātrueā age effect. Instead the data allow for a variety of plausible solutions shown schematically in the gray region outlined in Fig.Ā 1(a). Within this set of possible solutions, the robust regression chose the solution that best fits a-priori assumptions. The assumptions were defined through specific settings of the underlying optimization algorithm, which were not specific to the classification task. By contrast, our joint optimization approach selected the GAM model so that the classifier perfectly separated the two cohorts (Fig.Ā 1(d)).
To complete hypothesis testing, we cross-validate our joint algorithm approach (i.e., Joi-GAM-Class) against alternative implementations using the baseline NCANDA imaging data set. The data set consists of structural MRI and microstructural DTI metrics collected in 705 adolescents: 671 that are minimal (no-to-low) drinking and 34 that are regular drinkers. The GAM is defined with respect to age and socioeconomic status, because these two variables are not matched across the two cohorts (and are therefore confounding variables). To apply cross-validation, a popular method to measure accuracy of machine learning approaches, the total data set is divided into subsets (i.e., folds) in which the cohorts in each subset are matched with respect to demographic factors other than age and socioeconomic status. Each implementation uses one subset for training. The accuracy of patterns identified during training is then evaluated on the second subset to ensure that solutions are not specific to the first subset. This process is repeated with the second subset used for training and first for testing. The test accuracy of classifiers is summarized with accuracy scores, which include measures for testing the resistance of implementations to confounding factors. Furthermore, we compute p-values representing the statistical significance of accuracy scores and patterns identified by each implementation. Here, we are the first to report progress on the third aim of NCANDA (i.e., identify imaging markers that predict early-onset AUD) by presenting patterns of neuromaturation that are impartial to confounding effects (such as age) and correctly classify adolescents who drink regularly.
In a conference paper30, we first discussed the idea of jointly parameterizing GAM and classification to analyze two independently collected structural MRI data sets of participants ranging in age from 60 to 72 years (Nā=ā74). The first data set contained participants infected with the Human Immunodeficiency Virus (HIV) and effected by HIV-Associated Neurocognitive Disorder as well as demographically matched controls. The second data set, which was matched to the first one, contained individuals diagnosed with Mild Cognitive Impairment and a control cohort. In the conference submission, the GAM was used to remove the effect of acquisition differences between the two data sets and the classifier to identify differences between HIV-Associated Neurocognitive Disorder and Mild Cognitive Impairment. The experiment revealed that our joint approach is more accurate than sequential methods in identifying group differences based on data not ideally constructed for classification. Here, we confirm this finding on the NCANDA data set.
Results
Comparison of Sequential and Joint Approaches
Our experiments on the NCANDA data set revealed that our joint approach Joi-GAM-Class (based on MRI and DTI metrics) was indifferent to confounding factors (i.e., age and socioeconomic status) and more accurate than alternative implementations, listed here:
-
No-GAM-Class: performed sparsity constrained classification on raw image scores (i.e., omitting GAM); the benchmark for analysis without removing the effects of confounding factors.
-
Seq-GAM-Class: popular sequential approach first parameterized an ordinary GAM and then performed sparsity-constrained classification.
-
Seq-GAM Rob -Class: sequentially executed robust regression and sparse classification; the alternative to Seq-GAM-Class that accounted for image noise.
-
Joi STR -GAM-Class: the proposed joint model confined to the structural (STR) MRI metrics; the only other implementation indifferent to the confounding factors.
-
Joi DTI -GAM-Class: the proposed joint model confined to microstructural DTI metrics; as with Joi STR -GAM-Class, this method provided a benchmark for single-image modality analysis.
-
Joi OPT -GAM-Class: a simplified version of our proposed joint model suitable for optimizing group separation, but not indifferent to the effects of confounding factors.
Note, TableĀ S2 of the supplement lists these and all other acronyms used throughout the article.
We measured the accuracy of each implementation using two-fold cross-validation. After training each implementation on the training data to classify minimal and regular drinkers, we measured their accuracies on the testing data by reporting sensitivity, specificity, Area Under the receiver operating characteristic Curve (AUC), and ānormalized-accuracyā. āNormalized-accuracyā computed the accuracy of an implementation in correctly labeling samples while accounting for differences in sample size between the two cohorts. To ensure the indifference of an implementation to the effects of confounding variables, we also reported āmatched-accuracyā. To compute āmatched-accuracyā, we first defined a subset of the test data in which the cohorts were matched with respect to all known demographic scores including age, socioeconomic status, and cohort size and then re-computed the normalized-accuracy with respect to this subset. We set a threshold for labeling an accuracy score as significant at pāā¤ā0.002 based on a two-tailed Fisherās exact test31 (i.e., the probability of a classifierās output to be generated by randomly assigning samples to cohorts) or the DeLongās test32 i.e., (the probability of the output of one implementation to be generated by another implementation). This significant threshold was considered conservative as the number of implementations compared herein was small4. Unless otherwise stated, significant findings refer to the outcome of the Fisherās exact test.
Desirable implementations were those with significant normalized-accuracy and significant matched-accuracy. For each implementation, indifference to the effects of the confounding variable āageā was calculated using the two-tailed Fisherās exact test to measure the ability of the relevant classifier to cleanly separate minimal (no-to-low) alcohol-exposed adolescents into an older (i.e., above the age of 15.4; Nā=ā335) and younger cohort (i.e., below the age of 15.5; Nā=ā336). The two cohorts were matched with respect to all demographic factors (i.e., socioeconomic status, supratentorial volume, sex, ethnicity, scanner) except age. Implementations with p > 0.01 passed the age-test as the effect of age was non-existent or magnitudes weaker than the effects of regular drinking. Thus, desirable implementations that also passed the age-test were considered informative with respect to distinguishing regular drinkers from minimal alcohol exposed adolescents. Critically, all implementations passed the socioeconomic status test, i.e., a replication of the age-test applied to this variable. We thus omit discussion of this test.
TableĀ 1 summarizes results. The classifier without data harmonization (No-GAM-Class) was the only implementation, whose ānormalized-accuracyā score was significantly lower than chance. The sequential implementations (Seq-GAM-Class and Seq-GAM Rob -Class) had significant ānormalized-accuracyā scores but non-significant āmatched-accuracyā scores. Compared to those implementations, the joint methods reported higher AUC, normalized-accuracy, and matched accuracy scores. Although specificity was higher than sensitivity for all implementations, the difference between these was substantially smaller for the joint approaches. The smallest difference was observed for Joi STR -GAM-Class (sensitivity: 70.6%; specificity: 76.9%). Joi STR -GAM-Class was also informative as it passed the age-test and had significant normalized-accuracy and matched-accuracy scores. Among the joint approaches, the accuracy score was diminished when only DTI metrics were used (i.e., Joi DTI -GAM-Class): this implementation also failed the age-test and did not have a significant matched-accuracy score. Joi Opt -GAM-Class failed the age-test and had the largest difference between normalized-accuracy and matched-accuracy scores (dropped by 12.6%), but it achieved the highest accuracy score (80.8%). Joi-GAM-Class passed the age-test, had a high accuracy score, and the smallest difference between normalized-accuracy (75.9%) and matched-accuracy (77.1%) scores. These accuracy scores were higher than those of the only other informative implementation (i.e., Joi STR -GAM-Class). Joi-GAM-Class was also the only implementation that was significantly better than No-GAM-Class and Seq-GAM-Class. On a trend level (p < 0.0003), it was also better than Seq-GAM Rob -Class.
Pattern Analysis
As part of cross-validating an implementation, training consisted of parameter exploration, i.e., recording the identified pattern and corresponding accuracy for different parameter settings of the implementation. A pattern consists of a small number of MR-derived metrics that the implementation deemed informative for distinguishing the two cohorts. FigureĀ 2 plots the normalized frequency of unique patterns identified by each implementation across all training runs. The following lists each implementation by the number of unique regions identified: Joi STR -GAM-Class (53 patterns), No-GAM-Class (72 patterns), Seq-GAM-Class (72 patterns), Joi OPT -GAM-Class (72 patterns), Joi DTI -GAM-Class (225 patterns), Joi-GAM-Class (381 patterns), and Seq-GAM Rob -Class (853 patterns). Interestingly, Joi-GAM-Class recorded four informative (and dominant) patterns each appearing in at least 50% of the training runs.
The four informative patterns of Joi-GAM-Class consist of the MR metrics listed in TableĀ 2. The most frequently selected pattern (97.8%) consisted of the volumes of lateral ventricles and mid posterior corpus callosum. The second pattern (80.5%) included the first two MR metrics and two additional structural MRI metrics (i.e., volumes of centrum semiovale and central corpus callosum). The third pattern (54.7%) added DTI metrics fractional anisotropy of anterior corona radiata and posterior thalamic radiation) and the fourth (52.9%) included also axial diffusivity of fornix and volume of cingulate gyrus (Fig.Ā 3). Thus, this implementation provided consistency in the identified patterns.
Alternative implementations also frequently selected the previously mentioned regions. The only MRI metrics not used by Joi-GAM-Class were the mean diffusivity of the corticospinal tract selected by Seq-GAM-Class, and the axial diffusivity of the medial lemniscus selected by Seq-GAM Rob -Class.
TableĀ 2 also lists the normalized- and matched-accuracy scores for the logistic classifier confined to the residual scores of the four patterns selected by Joi-GAM-Class. The fourth pattern, which included the MRI metrics of the other three patterns, had equivalent normalized-accuracy and matched-accuracy scores (79.4%). The classifier based solely on a single MRI metric achieved accuracy scores below 70% for most regions. The classifier based on the fractional anisotropy of the anterior corona radiata (normalized-accuracy: 80%) and the posterior thalamic radiation (normalized-accuracy: 75.4%) were exceptions, but their matched-accuracy scores were below 70%.
Regarding group differences (see FigsĀ 4 and 5), the volume of the mid posterior corpus callosum was significantly smaller (\(p=0.0002\)) in regular drinkers relative to minimal alcohol-drinking adolescents. The axial diffusivity of the fornix (\(p=0.0005\)) and the fractional anisotropy of the anterior corona radiata (\(p < 0.0001\)) and posterior thalamic radiation (\(p < 0.0001\)) were significantly higher in the regular drinking adolescents relative to those with minimal alcohol-exposure.
Discussion
Joi STR -GAM-Class and Joi-GAM-Class were the only successful approaches for identifying regular drinking on a subject level. This finding supports our central hypothesis that typical sequential use of the GAM to isolate the effects of confounding variables on MR metrics would minimize information relevant for distinguishing groups (e.g., adolescents with a drinking history relative to those without a significant drinking history). Joi-GAM-Class (i.e., the more accurate of these methods) selected patterns that included structural MRI volumes of the lateral ventricles, centrum semiovale, corpus callosum, and cingulate gyrus and microstructural DTI measures of the fornix, corona radiata, and thalamic radiations. The integrity of each of these regions has been reported to be affected by alcohol misuse in studies using more traditional, hypothesis-driven, morphometric group analysis. When this type of analysis was applied to those eight MRI metrics, only four of them showed significant group differences on the NCANDA data set. We thus conclude that the outcome of machine learning models, such as the one proposed here, requires analyzing MRI metrics as a whole to gain knowledge about the effect of alcohol on individuals.
Strong agreement existed among the regions included in the four informative patterns identified by Joi-GAM-Class. While inter-dependencies between the repeated training runs with varying parameter settings of Joi-GAM-Class could account for this finding, this explanation fails to explain the consistency between the informative patterns identified by Joi-GAM-Class and alternative implementations. A more likely explanation for this consistency is the significant impact of regular drinking on the regions identified by Joi-GAM-Class.
The brain regions identified by Joi-GAM-Class are relevant with respect to the Alcohol Use Disorder (AUD) literature. For example, the centrum semiovale, the most frequently appearing region across all patterns, was modestly smaller in the regular than in the minimal drinking group. This finding is consistent with in vivo neuroimaging26 and postmortem studies33 reporting smaller centrum semiovale volume in heavy alcohol drinking relative to healthy control adults. Smaller than normal white matter volumes could indicate a disruption in adolescent brain development given that white matter continues to grow throughout early adulthood34,35,36.
A number of studies report that the corpus callosum is sensitive to alcohol use disorder26,37,38. The corpus callosum integrates information and mediates complex behaviors39 and is larger and thicker in adolescents with higher intelligence40,41 and better problem solving abilities42. The cingulate cortex has been associated with selective attention43, conflict monitoring and decision making in controls44 and alcoholics45,46. The lateral ventricles are generally enlarged in heavy alcohol consuming adults and serve as a sensitive marker of alcoholic-level drinking13,47,48.
Joi-GAM-Class also identified regions with altered DTI metrics in the regular drinkers relative to the minimal drinking adolescents. Although low fractional anisotropy and high mean radial diffusivity are often reported in heavy drinking youth15, the current study reports that axial diffusivity of the fornix, fractional anisotropy of the anterior corona radiata and posterior thalamic radiation were high in the regular drinking group. These findings were also reported previously4, albeit at a statistically insignificant level. A recent longitudinal study of detoxified alcohol-dependent male adolescents found evidence of low white matter integrity in the body of the anterior corona radiata15. Microstructural compromise of the fornix, a major fiber bundle connecting limbic structures, has been reported in adult alcoholics49.
We complete the review of our morphological findings by noting that the importance of single-region metrics (i.e., its frequency of appearance in the training runs as specified in TableĀ 2) was unrelated to its significance in discriminating the two cohorts, i.e., only half the scores were significantly different between groups. The importance of a single-region metric was also unrelated to its accuracy in classification, i.e., all single-regional metrics reported low matched-accuracies. These observations were further supported by repeating two-fold cross-validation of the sequential procedure with the classifier (without sparsity constraints) being trained on the 29 regional measurements. These 29 MRI metrics were identified by applying a two-tailed t-test to residual scores of the training dataset and retaining those with \(p\le 0.01\) (i.e., the significance threshold that led to the highest classification accuracy). The resulting normalized-accuracy of the classifier based on these 29 metrics at 67.3% was significantly lower than that of Joi-GAM-Class. Thus, the type of machine learning applied here analyzed all potentially informative metrics as a whole12 to determine those regions impacted by regular alcohol use on the developing adolescent brain. In support of this statement, Joi-GAM-Class received lower accuracy scores than those listed in TableĀ 2 for āPattern 4ā, the informative pattern of Joi-GAM-Class consisting of all eight regional scores. This pattern is the first known imaging marker with respect to the NCANDA cohort that predicts (i.e., classified with significant accuracy) individuals with regular drinking habits at baseline.
For readers interested in the technical aspects of our proposed machine learning approach, the remainder of the discussion focuses on differences in the implementations and their impact on accuracy scores. We first note, that No-GAM-Class, i.e., performing classification without the GAM model, failed the age-test and resulted in low accuracy scores, thereby supporting the need for properly modeling the effects of confounding factors. One way of modeling the effect is to perform the analysis on a subset of the data with the cohorts being carefully matched with respect to confounding factors. However, the sample size of a matched data set is often much smaller than the original dataset, thereby reducing statistical power. Alternatively, the effects of confounding factors can be removed via GAM.
When parameterizing a GAM independently from classification (i.e., sequential approaches), the residual effects of confounding factors can significantly effect the final classification, as observed here, since the sequential approaches (Seq-GAM-Class and Seq-GAM Rob -Class) failed the age-test. That the joint implementations Joi DTI -GAM-Class and Joi OPT -GAM-Class also failed the age test is a caution to check the output of regression-based approaches for the effects of confounding factors. The series of stringent statistical tests performed post hoc in this study identified those outputs that were not selected because of contributions of confounding factors. Based on those tests, the only informative patterns were determined by the joint implementations Joi STR -GAM-Class and Joi-GAM-Class.
When confining the joint analysis to just one modality, classification achieved higher accuracy when based on structural metrics (i.e., Joi STR -GAM-Class) than when based on DTI metrics (i.e., Joi DTI -GAM-Class). The higher accuracy scores of Joi-GAM-Class over the single-modality implementations (i.e., Joi STR -GAM-Class and Joi DTI -GAM-Class) further highlight the importance of analyzing multiple modalities together.
In conclusion, only the joint approaches Joi STR -GAM-Class and Joi-GAM-Class passed the age-test, showed significant normalized- and matched-accuracy scores, and succeeded in identifying informative patterns on a data set not ideally constructed for classification. Thus, our experiments support the hypothesis of this study.
Methods
Participants
At baseline4, NCANDA recruited 831 adolescents, of whom 28 were excluded for the current analysis due to brain abnormalities or missing MRI data. Of the remaining 803 youth, 671 (333 male and 338 female adolescents, ages 12 to 21 years) met the criteria for minimal (no-to-low) alcohol consumption17 and comprised the control group. The remaining 132 adolescents reported initiating moderate-to-heavy alcohol consumption: female participants consumed four or more drinks (beer, wine, or hard liquor) and male participants consumed five or more drinks (beer, wine, or hard liquor) on at least one occasion in their lifetime. Of these, 34 subjects met criteria for regular drinking (i.e., they drank a minimum of two alcoholic drinks at least once per week). The total data set on 705 youth (671 minimal and 34 regular drinkers) used in this study included demographic information and MRI scans acquired across the five NCANDA collection sites17, two of which used Siemens 3āT Tim Trio scanners (Siemens) and three of which used General Electric 3āT Discovery MR750 scanners (GE). Each participant was described by age, sex, self-reported ethnicity, socio-economic status (based on the highest education achieved by either parent)50, MRI scanner type (GE or Siemens), and supratentorial volume (determined from MR images) (see TableĀ 3).
The two cohorts were matched (pā>ā0.1) on ethnicity (multinomial Chi-Square test51), and sex, and MRI scanner type (binomial Chi-Square test52). Age, socio-economic status, and supratentorial volume (i.e., the only other confounding factors3) were compared using unpaired, two-tailed t-tests53. The two cohorts matched with respect to supratentorial volume but not age and socio-economic status. Most of the regular drinkers were older (18 or older) and had higher socio-economic status than the control group.
Brain imaging metrics used for each individual included 32 MRI derived structural volume scores extracted from the T1- and T2-weighted MRIs3, and 112 DTI derived microstructural scores4. All scores were provided as data releases (Demographic Score Release: NCANDA DATA 00010 V5, Structural Score Release: NCANDA DATA 00011, DTI Score Release: NCANDA DATA 00012 V2) by the software platform Scalable Informatics for Biomedical Imaging Studies (SIBIS; sibis.sri.com)54. The Section āData Pre-processingā of the supplement summarizes the pre-processing steps performed on these data as described by3,4.
Implementations
All implementations used here were based on the sparse-logistic classification model12. This method is trained to accurately classify samples by minimizing an energy function that encodes the underlying classification task as finding informative patterns (of MRI metrics) of a certain size. No-GAM-Class directly trained the classifier on the 144 raw imaging metrics of each subject. Training of the sequential approaches Seq-GAM-Class and Seq-GAM Rob -Class consisted of first parameterizing a GAM for regressing out the effects of confounding factors (i.e., age and socio-economic status) before optimizing the classifier on the residual scores. The GAM used a linear model for capturing the relationship between the image metrics and socio-economic status and a quadratic model for capturing the relationship between the image metrics and age3,4. Seq-GAM-Class used the least square estimation and Seq-GAM Rob -Class used the robust regression (i.e., bisquare estimation, the default of ārobustfitā in Matlab2013b)28 to determine the optimal setting of GAM on the minimal drinkers of the training data set. The joint approaches (Joi STR -GAM-Class, Joi DTI -GAM-Class, Joi OPT -GAM-Class, and Joi-GAM-Class) removed the effects of confounding factors while concurrently optimizing classification accuracy by embedding the GAM model into the energy function of the classifier. While Joi OPT -GAM-Class reported the result with respect to minimizing the energy function, Joi STR -GAM-Class, Joi DTI -GAM-Class, and Joi-GAM-Class went one step further and extended the energy function so that it accounted for accuracy of the GAM in removing the effects of the confounding factors. Joi-GAM-Class (as well as Joi OPT -GAM-Class) considered all 144 imaging metrics, while Joi STR -GAM-Class was confined to the 32 macro-structural MRI metrics and Joi DTI -GAM-Class to the 112 micro-structural DTI metrics. The accuracy of each implementation was measured via 2-fold cross-validation described in further detail in the supplemental section on āCross-Validationā.
For the technically inclined reader, the following subsections describe in detail the optimization algorithms used for training the sequential and joint implementations.
Training of the Sequential Approaches
The training of a sequential approach consisted of two steps: (1) determine the optimal setting of the GAM with respect to the ācontrol groupā (i.e., minimal drinking cohort) and (2) identify the pattern, i.e., the subset of residual imaging scores most informative for group separation. The pattern was identified by computing the āweightsā of a sparse, logistic regression classifier12 that resulted in the highest normalized-accuracy based on the training data.
To determine the optimal setting \({\alpha }_{i}\,:\,=({\alpha }_{i\mathrm{,0}},\ldots ,{\alpha }_{i\mathrm{,3}})\) of the GAM with respect to each image measurement type āiā, let āage s ā be that age and āses s ā the socio-economic status of subject āsā. Then the GAM defined the relationship of the confounding factors to the corresponding image score i s as
Assuming that the image scores were Gaussian distributed, then determining the optimal Ī± i was equivalent to maximizing a likelihood function parameterized by the mean of a Gaussian distribution. To define the likelihood function, we now introduce the mathematical notation summarized in TableĀ S1 of the supplement. Specifically, the training data (i.e., one of the folds) consisted of two cohorts totaling Nā=ā352 subjects with \({\mathbb{C}}\) representing the set of indices of the minimal drinkers. Each subject āsā was described by the factor vector \({{\bf{d}}}_{s}\mathrm{=[1,}\,ag{e}_{s},ag{e}_{s}^{2},se{s}_{s}]\) (consisting of N D ā=ā3 subject specific demographic values) and up to \({N}_{F}=144\) image scores i s . Training the GAM with respect to the data of the non-drinking cohort was then equivalent to fitting a matrix Ī¦ so that the factor vector of each control subject was a predictor of the corresponding image scores i.e., \({{\bf{i}}}_{s} \sim {\rm{\Phi }}\cdot {{\bf{d}}}_{s}^{{\rm{{\rm T}}}}\). Assuming that \({{\bf{i}}}_{s} \sim {\mathscr{N}}({\rm{\Phi }}\cdot {{\bf{d}}}_{s}^{{\rm{{\rm T}}}},\,{\sqrt[{N}_{D}]{{\sigma }_{s}}}^{2})\) was normally distributed with \({\sigma }_{s}^{2}\in {\mathbb{R}}\) and referring to \({\Vert \cdot \Vert }_{2}\) as the l2-norm, the optimal fitted matrix \(\hat{{\rm{\Phi }}}\) was obtained by solving the following maximum likelihood problem
where D is the set of factor vectors and I is the corresponding set of image scores across all samples.
Interpreting \(\mathrm{1/}{\sigma }_{s}^{2}\) as the āweightā of each sample, the above minimization problem defined a robust regression of Ī¦ that was solved via bi-square estimation. With respect to the ordinary GAM, \({\sigma }_{s}=\sigma \) was assumed to be uniform across all subjects so that computing \(\hat{{\rm{\Phi }}}\) simplified to the least-square solution. Regardless of the specific computation of \(\hat{{\rm{\Phi }}}\), the corresponding residual (desensitized) scores of all N subjects were determined via
Training of the sparse, logistic regression classifier consisted of minimizing a log probability with respect to the weights selecting the subset of informative residual image scores best separating both cohorts. In order to define the log probability, the association of each subjects ā\(s\)ā to a cohort was encoded by label \({z}_{s}\). If the subject was a regular drinker then \({z}_{s}=1\), and \({z}_{s}=-\,1\) if it was a minimal exposed individual. \(Z\,:\,=[{z}_{1},{z}_{2},\mathrm{..}.,{z}_{N}]\) was the vector of label assignments of all subjects in the training fold. The logistic function was defined as \(\theta (a)\,:\,=\,\mathrm{log}\,\mathrm{(1}+\exp (\,-\,a))\), the weight vector āĻā encoded the importance of each residual score of r s in distinguishing the two cohorts, and \(v\in {\mathbb{R}}\) was the ālabel offsetā. Assuming that all samples were independently and identically distributed according to the binomial distribution
determining the optimal parameters was equivalent to minimizing the following logistic cost function
with respect to a sparse search space defined according to the l0-ānormā \({\Vert \cdot \Vert }_{0}\) and a predefined number N K ā<āN F of non-zero elements, i.e., the sparsity constraint
In other words, parameterizing the sparse, logistic classifier summarizes to determining the optimal parameters \(\hat{\nu }\) and \(\hat{\omega }\) for the following minimization problem
We determined its solution via penalty decomposition12. The image scores associated with non-zero entries in \(\hat{\omega }\) then defined the group separating pattern.
Training of Joint Methods
Alternative to the sequential approach, the training of the joint approach consisted of simultaneously determining the optimal values for the variables \(\hat{{\rm{\Phi }}}\) of the GAM and \((\hat{\nu },\hat{\omega })\) of the sparse, logistic regression. Specifically, the joint approaches were parameterized by maximizing the following joint probability
\(P({z}_{s}|{{\bf{i}}}_{s},{{\bf{d}}}_{s},{\rm{\Phi }},\nu ,\omega )\) was defined according to Eq. (3) and \(P({{\bf{i}}}_{s}|{{\bf{d}}}_{s},{\rm{\Phi }},\sigma )\) according to the normal distribution of Eq. (1). Computing the log of that joint probability resulted in
The previous section parameterized the GAM with respect to the minimal drinkers (controls) so that any significant deviation in the image scores of the second cohort could be directly related to the existing clinical literature. To comply with that model, we confined the second sum of Eq. (8) to the controls and model the āinputā of the regular drinkers in parameterizing the GAM through the uninformative, uniform distribution represented by the constant \(c\in {\mathbb{R}}\). Thus, the log of the joint probability is redefined as
and its minimization as
where \(\gamma \,:\,=\frac{1}{2{\sigma }^{2}+1}\) weighted the importance between the logistic cost function and the GAM, \(\sigma \) was fixed beforehand via parameter exploration, L(Ā·) was defined according to Eq. (4), and G(Ā·) according to Eq. (1). Note, if Ī³ā=ā0 then the above equation simplifies to Joi OPT -GAM-Class, the commonly used logistic regressor with the training of the GAM solely driven by group separation.
Given that finding the optimal solution of Eq. (10) was prone to a local minimum due to the non-convex energy function, the parameters Ī¦ā² were initialized by the output of the GAM of Eq. (1). The local minimum for Equation (10) was then determined through an algorithm inspired by penalty decomposition12. Specifically, we introduced \(\hat{\psi }\in {\mathbb{S}}={{\mathbb{R}}}^{{N}_{F}}\), the non-sparse approximation of the weights \(\hat{\omega }\). Eq. (10) was then equivalent to
Introducing the weighting parameter \(\rho > 0\), the solution of Eq. (11) was iteratively estimated by
As pointed out in Algorithm 1, the parameters of the logistic regression model were initialized as \(\omega =0\), \(\psi =0\), \(\nu =0\)12 and then, together with Ī¦, updated via block coordinate descent. If the parameters converged, \(\rho \) was increased and the procedures was repeated until the maximum of the absolute difference between the elements of the sparse weights Ļ Ļ and the non-sparse weights Ļ Ļ was below a fixed threshold \({\varepsilon }_{P}\)12, i.e., let \({\Vert \cdot \Vert }_{{\rm{\max }}}\) denote the maximum element of a vector or matrix then
At each of these iterations, block coordinate descent improved the current estimate Ī¦ā², Ī½ā², Ļā², and Ļā² of (Ī¦ Ļ , Ī½ Ļ , Ļ Ļ , Ļ Ļ ) by minimizing Eq. (12) fixing all variables but one and repeating this process until all variables converged. Keeping Ī½ā², Ļā², and Ļā² fixed, then the minimization problem of Eq. (12) simplified to
Since the penalty function was smooth and convex, Eq. (14) was solved via gradient descent. Interpreting the above minimization problem as desensitizing the image scores from the influence of demographic factors, Ī¦ parameterized the GAM specified by G(Ī¦) and was regularized by L (Ā·, Ī½ā², Ļā²) to account for the noise in the image measurements i s .
Next, block coordinate descent updated Ī½ā² and Ļā² by keeping Ī¦ā² and Ļā² fixed so that Eq. (12) simplified to
Again, gradient descent was employed to determine the minimum of that equation as the penalty function was smooth and convex. Finally, Ļā² was updated by solving Eq. (12) with fixed Ī¦ā², Ī½ā², and Ļā², i.e., using the closed form solution to determine
Following the suggestion of Zhang et al.12, block coordinate descent was repeated (i.e., Equations (14ā16) until the relative changes of Ī¦ā², Ī½ā², Ļā², and Ļā², between iterations were smaller than a fixed threshold \({\varepsilon }_{B}\), i.e.
with \({\rm{\Delta }}(a)=\frac{\parallel a^{\prime} -a^{\prime\prime} {\parallel }_{{\rm{\max }}}}{{\rm{\max }}\,\{\parallel \,a^{\prime} \,{\parallel }_{{\rm{\max }}}\mathrm{,1\}}}\). Once converged, Ī¦ Ļ , Ī½ Ļ , Ļ Ļ , and Ļ Ļ were updated according to Ī¦ā², Ī½ā², Ļā², and Ļā², Ļ was increased, and another block coordinate descent was initiated until Ļ Ļ and Ļ Ļ converged according to Eq. (13), which was the case in all of our experiments. Additional comments about the joint optimization are provided by the supplement.
Data availability
In compliance with NIH policy, the data release NCANDA DATA 00010 V5, NCANDA DATA 00011, and NCANDA DATA 00012 V2 that supports the finding of this study is released to the public according to the NCANDA Data Distribution agreement (see https://www.niaaa.nih.gov/research/major-initiatives/national-consortium-alcohol-and-neurodevelopment-adolescence for more detail).
Code availability
Our Matlab implementation of the proposed algorithm (GAM-Sparsity Constraint Logistic Regression V1) is available via https://www.nitrc.org/projects/gam_sparityreg.
Informed Consent
All procedures performed in this study were in accordance with the Declaration of Helsinki. All participants underwent informed consent processes at the visit with a research associate trained in human subject research protocols. Adult participants or the parents of minor participants provided written informed consent before participation in the study. Minor participants provided assent before participation. The Institutional Review Boards of each NCANDA site approved this study, and each site followed this procedure to obtain voluntary informed consent or assent, depending on the age of the participant.
References
Stiles, J. & Jernigan, T. L. The basics of brain development. Neuropsychology Review 20, 327ā348 (2010).
Shaw, P. et al. Neurodevelopmental trajectories of the human cerebral cortex. Journal of Neuroscience 28, 3586ā3594 (2008).
Pfefferbaum, D. et al. Adolescent development of cortical and white matter structure in the NCANDA sample: Role of sex, ethnicity, puberty, and alcohol drinking. Cerebral Cortex 26, 4101ā4121 (2016).
Pohl, K. M. et al. Harmonizing DTI measurements across scanners to examine the development of white matter microstructure in 803 adolescents of the NCANDA study. NeuroImage 130, 194ā213 (2016).
Lebel, C. et al. Diffusion tensor imaging of white matter tract evolution over the lifespan. NeuroImage 60, 340ā352 (2012).
Johnston, L. D., OāMalley, P. M., Miech, R. A., Bachman, J. G. & Schulenberg, J. E. 2016 Overview Key Findings on Adolescent Drug Use (Monierting the Future, 2017).
Cservenka, A. & Brumback, T. The Burden of Binge and Heavy Drinking on the Brain: Effects on Adolescent and Young Adult Neural Structure and Function. Frontiers in Psychology 8, 1111 (2017).
Squeglia, L. M. et al. Brain development in heavy drinking adolescents. American Journal of Psychiatry 172, 531ā542 (2015).
Jacobus, J., Squeglia, L. M., Bava, S. & Tapert, S. F. White matter characterization of adolescent binge drinking with and without co-occurring marijuana use: a 3-year investigation. Psychiatry Research: Neuroimaging 214, 374ā381 (2013).
Pfefferbaum, A. et al. Altered Brain Developmental Trajectories in Adolescents After Initiating Drinking. The American Journal of Psychiatry (2017).
Squeglia, L. M. et al. Neural Predictors of Initiating Alcohol Use During Adolescence. The American Journal of Psychiatry 174, 172ā185 (2017).
Zhang, Y. et al. Extracting patterns of morphometry distinguishing HIV associated neurodegeneration from mild cognitive impairment via group cardinality constrained classification. Human Brain Mapping 37, 4523ā4538 (2016).
Pfefferbaum, A., Rosenbloom, M., Deshmukh, A. & Sullivan, E. Sex differences in the effects of alcohol on brain structure. American Journal of Psychiatry 158, 188ā197 (2001).
Ahmadi, A. et al. Influence of alcohol use on neural response to Go/No-Go task in college drinkers. Neuropsychopharmacology 38, 2197ā2208 (2013).
Luciana, M., Collins, P. F., Muetzel, R. L. & Lim, K. O. Effects of alcohol use initiation on brain structure in typically developing adolescents. The American Journal of Drug and Alcohol Abuse 39, 345ā355 (2013).
OāHalloran, L., Nymberg, C., Jollans, L., Garavan, H. & Whelan, R. The potential of neuroimaging for identifying predictors of adolescent alcohol use initiation and misuse. Addiction 112, 719ā726 (2016).
Brown, S. A. et al. The National Consortium on Alcohol and NeuroDevelopment in Adolescence (NCANDA): A multi-site-study of adolescent development and substance use. Journal of Studies on Alcohol and Drugs 76, 895ā908 (2015).
Karas, M. et al. Brain connectivity-informed regularization methods for regression. bioRxiv (2017).
Kuceyeski, A., Meyerhoff, D. J., Durazzo, T. C. & Raj, A. Loss in connectivity among regions of the brain reward system in alcohol dependence. Human Brain Mapping 34, 3129ā3142 (2013).
Schulte, T. et al. How acute and chronic alcohol consumption affects brain networks: Insights from multimodal neuroimaging. Alcoholism: Clinical and Experimental Research 36, 2017ā2027 (2012).
Zahr, N. M. Chapter 17 - Structural and microstructral imaging of the brain in alcohol use disorders. Handbook of Clinical Neurology 125, (275ā290 (2014).
van Erp, T. G. et al. Subcortical brain volume abnormalities in 2028 individuals with schizophrenia and 2540 healthy controls via the ENIGMA consortium. Molecular Psychiatry 21, 547ā553 (2015).
Worsley, K. J. et al. A general statistical analysis for fMRI data. NeuroImage 15, 1ā15 (2002).
Wang, X.-F., Jiang, Z., Daly, J. J. & Yue, G. H. A generalized regression model for region of interest analysis of fMRI data. NeuroImage 59, 502ā510 (2012).
Friston, K. J. et al. Statistical parametric maps in functional imaging: A general linear approach. Human Brain Mapping 2, 189ā210 (1995).
Le Berre, A. P. et al. Sensitive biomarkers of alcoholismās effect on brain macrostructure: similarities and differences between France and the United States. Frontier in Human Neuroscience 9, 354ā1ā354ā13 (2015).
Kochunov, P. et al. Heritability of fractional anisotropy in human white matter: A comparison of Human Connectome Project and ENIGMA-DTI data. NeuroImage 111, 300ā311 (2015).
Holland, P. W. & Welsch, R. E. Robust regression using iteratively reweighted least-squares. Theory and Methods 6, 813ā827 (1977).
Fritsch, V. et al. Robust regression for large-scale neuroimaging studies. Neuroimage 111, 431ā441 (2015).
Zhang, Y., Park, S. H. & Pohl, K. M. Joint data harmonization and group cardinality constrained classification. In Medical Image Computing and Computer Assisted Interventions, vol. 9900 of Lecture Notes in Computer Science, 282ā290 (Springer-Verlag, 2016).
Fisher, R. A. On the interpretation of Ļ2 from contingency tables, and the calculation of P. Journal of the Royal Statistical Society 85, 87ā94 (1922).
DeLong, E. R., DeLong, D. M. & Clarke-Pearson, D. L. Comparing the areas under two or more correlated receiver operating characteristic curves: a nonparametric approach. Biometrics 44, 837ā845 (1988).
Kril, J. J., Halliday, G. M., Svoboda, M. D. & Cartwright, H. The cerebral cortex is damaged in chronic alcoholics. Neuroscience 79, 983ā998 (1997).
Pfefferbaum, A. et al. Variation in longitudinal trajectories of regional brain volumes of healthy men and women (ages 10 to 85 years) measured with atlas-based parcellation of MRI. NeuroImage 65, 176ā193 (2013).
Giedd, J. N. et al. Child psychiatry branch of the national institute of mental health longitudinal structural magnetic resonance imaging study of human brain development. Neuropsychopharmacology Reviews 40, 43ā49 (2015).
Raznahan, A. et al. Longitudinal four-dimensional mapping of subcortical anatomy in human development. Proceedings of the National Academy of Sciences 111, 1592ā1597 (2014).
Kapogiannis, D., Kisser, J., Davatzikos, C., abd Jeffrey Metter, L. F. & Resnick, S. Alcohol consumption and premotor corpus callosum in older adults. European Neuropsychopharmacology 22, 704ā710 (2012).
Pfefferbaum, A. et al. Regional brain structural dysmorphology in human immunodeficiency virus infection: Effects of acquired immune deficiency syndrome, alcoholism, and age. Biological Psychology 72, 361ā370 (2012).
Hinkley, L. B. N. et al. The role of corpus callosum development in functional connectivity and cognitive processing. Plos One 7, e39804 (2012).
Hutchinson, A. D. et al. Relationship between intelligence and the size and composition of the corpus callosum. Neuroscience 192, 455ā464 (2009).
Luders, E. et al. The link between callosal thickness and intelligence in healthy children and adolescents. Neuroimage 54, 1823ā1830 (2011).
van Eimeren, L., Niogi, S. N., McCandliss, B. D., Holloway, I. D. & Ansari, D. White matter microstructures underlying mathematical abilities in children. Neuroreport 19, 1117ā1121 (2009).
Posner, M. I. & Rothbart, M. K. Attention, self-regulation and consciousness. Philosophical Transactions of the Royal Society of London. Series B, Biological Sciences 353, 1915ā1927 (1998).
Botvinick, M. M. Conflict monitoring and decision making: reconciling two perspectives on anterior cingulate function. Cognitive, Affective, and Behavioral Neuroscience 7, 356ā366 (2007).
Schulte, T., Muller-Oehring, E. M., Sullivan, E. V. & Pfefferbaum, A. Synchrony of corticostriatal-midbrain activation enables normal inhibitory control and conflict processing in recovering alcoholic men. Biological Psychology 71, 269ā278 (2012).
Le Berre, A.-P. et al. Impaired decision-making and brain shrinkage in alcoholism. European Psychiatry 29, 125ā133 (2014).
Wobrock, T. et al. Effects of abstinence on brain morphology in alcoholism. European Archives of Psychiatry and Clinical Neurosciences 259, 143ā150 (2009).
Pitel, A.-L., Chanraud, S., Sullivan, E. V. & Pfefferbaum, A. Callosal microstructural abnormalities in Alzheimerās disease and alcoholism: same phenotype, different mechanisms. Psychiatry Research 184, 49ā56 (2010).
Pfefferbaum, A., Rosenbloom, M., Rohlfing, T. & Sullivan, E. V. Degradation of association and projection white matter systems in alcoholism detected with quantitative fiber tracking. Biological Psychology 65, 680ā690 (2009).
Akshoomoff, N. et al. The NIH toolbox cognition battery: results from a large normative developmental sample (PING). Neuropsychology 28, 1ā10 (2014).
McHugh, M. L. The Chi-square test of independence. Biochemia Medica 23, 143ā149 (2013).
Yates, F. Contingency tables involving small numbers and the Ļ 2 test. Journal of the Royal Statistical Society 1, 217ā235 (1934).
Zimmerman, D. A note on interpretation of the paired-samples t test. Journal of Educational and Behavioral Statistics 22, 349ā360 (1997).
Nichols, B. N. & Pohl, K. M. Neuroinformatics software applications supporting electronic data capture, management, and sharing for the neuroimaging community. Neuropsychology Review 25, 356ā368 (2015).
Acknowledgements
This work was supported by the U.S. National Institute on Alcohol Abuse and Alcoholism (AA021697, AA012388, AA017168, AA017347, AA010723, MH113406), the National Institute of Health Office of Directors (AA021697-04S1). KMP was also supported by the Creative and Novel Ideas in HIV Research (CNIHR) Program through a supplement to the University of Alabama at Birmingham (UAB) Center For AIDS Research funding (NIH P30 AI027767). This funding was made possible by collaborative efforts of the Office of AIDS Research, the National Institute of Allergy and Infectious Diseases, and the International AIDS Society. EVS received support from the Moldow Womenās Hope and Healing Fund.
Author information
Authors and Affiliations
Contributions
S.P., Y.Z., N.M.Z. prepared the figures and helped in writing the manuscript. Q.Z., S.P., Y.Z. conducted the experiments. A.P., D.K., and E.V.S. collected and processed the data. K.M.P. managed the project and was writing the manuscript. All authors reviewed the manuscript.
Corresponding author
Ethics declarations
Competing Interests
The authors declare no competing interests.
Additional information
Publisher's note: Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Electronic supplementary material
Rights and permissions
Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The images or other third party material in this article are included in the articleās Creative Commons license, unless indicated otherwise in a credit line to the material. If material is not included in the articleās Creative Commons license and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/.
About this article
Cite this article
Park, S.H., Zhang, Y., Kwon, D. et al. Alcohol use effects on adolescent brain development revealed by simultaneously removing confounding factors, identifying morphometric patterns, and classifying individuals. Sci Rep 8, 8297 (2018). https://doi.org/10.1038/s41598-018-26627-7
Received:
Accepted:
Published:
DOI: https://doi.org/10.1038/s41598-018-26627-7
Comments
By submitting a comment you agree to abide by our Terms and Community Guidelines. If you find something abusive or that does not comply with our terms or guidelines please flag it as inappropriate.