Abstract
Autism Spectrum Disorder (ASD) is a complex neurodevelopmental disorder influenced by both genetic and environmental factors. Recently, gut dysbiosis has emerged as a powerful contributor to ASD symptoms. In this study, we recruited over 100 age-matched sibling pairs (between 2 and 8 years old) where one had an Autism ASD diagnosis and the other was developing typically (TD) (432 samples total). We collected stool samples over four weeks, tracked over 100 lifestyle and dietary variables, and surveyed behavior measures related to ASD symptoms. We identified 117 amplicon sequencing variants (ASVs) that were significantly different in abundance between sibling pairs across all three timepoints, 11 of which were supported by at least two contrast methods. We additionally identified dietary and lifestyle variables that differ significantly between cohorts, and further linked those variables to the ASVs they statistically relate to. Overall, dietary and lifestyle features were explanatory of ASD phenotype using logistic regression, however, global compositional microbiome features were not. Leveraging our longitudinal behavior questionnaires, we additionally identified 11 ASVs associated with changes in reported anxiety over time within and across all individuals. Lastly, we find that overall microbiome composition (beta-diversity) is associated with specific ASD-related behavioral characteristics.
Similar content being viewed by others
Introduction
Autism Spectrum Disorder (ASD) is a complex neurodevelopmental disorder that occurs in 1 out of every 54 children in the United States [1]. ASD is characterized by a set of social and cognitive impairments and can be influenced by a growing number of both genetic and environmental factors [2, 3]. Environmental factors such as maternal prenatal medication use, maternal health factors, and prenatal infection have been associated with ASD development [4]. ASD is also associated with increased prevalence of gastrointestinal (GI) issues [5, 6]. These GI issues can include symptoms such as chronic constipation, diarrhea, abdominal pain, and potential signs of GI inflammation such as vomiting and bloody stools [7]. Growing evidence suggests that ASD is associated with gut dysbiosis. In animal studies, food-based exposure to Lactobacillus reuteri or Bacteroides fragilis reduced ASD-like social deficits in mice [8, 9]. Evidence of dysbiosis of the gut microbial community has also been observed in humans with ASD, specifically altered levels of Bifidobacterium, Lactobacillus, and Clostridium species [10, 11]. In addition, fecal microbiota transplants in children with ASD demonstrated improvements in GI and ASD symptoms [12, 13]. Reductions of core-ASD symptoms with probiotic usage have also been observed [14].
Studies show that individuals with ASD have increased intestinal permeability and systemic levels of bacterial metabolites that may contribute to ASD pathogenesis by affecting the nervous system [15, 16]. Furthermore, ASD’s comorbid disorders, such as anxiety and Hypothalamus-Pituitary-Adrenal dysfunction, have been associated with gut dysbiosis [17, 18]. Recent studies have also identified microbial markers associated with the ASD phenotype and/or the diet and lifestyle qualities associated with ASD [10, 19,20,21,22]. These studies constitute an important initial step towards understanding the complex role the gut plays in neurodevelopmental disorders, but studies to date have often been limited by small sample sizes, limited data on dietary practices, and, most importantly, a lack of longitudinal data points that account for the high variability of gut microbiome structure over time. Notably, most of these studies also do not include a control cohort of participants matched for environmental variables, which have a tremendous impact on the gut microbiome [23, 24]. These are limitations we attempt to address in this paper.
The present study aims to characterize the gut microbiome associations in ASD while minimizing issues stemming from limited or single timepoint data. We analyzed 432 stool samples from 72 pairs of siblings diagnosed with ASD and their typically developing (TD) siblings between 2 and 8 years old, and within 2 years of each other, over the course of a month (three timepoints, each separated by two weeks). Each stool sample is accompanied by detailed lifestyle and dietary information (over 100 variables were reported), allowing for a more holistic understanding of ASD and its relationship with the gut microbiome. The present study aims to identify taxa that are differentially abundant between the two cohorts over time, to quantify the predictive value of those microbial features, and to place those microbial associations in the larger context of dietary, lifestyle, and behavioral variables. Figure 1 shows an overview of the study design.
Results
Eleven ASVs are significantly associated with the ASD phenotype, as determined by the union of at least two differential analysis methods
Out of 834 total ASVs (Amplicon Sequence Variants, assigned using DADA2), 117 were identified to be significantly different between the ASD and TD cohorts (Supplementary File 1) by at least one of the contrast analysis methods used after normalization and filtration (DESeq2, MetagenomeSeq, and ANCOM, see methods). Out of the 117 ASVs found to be significant across timepoints (Supplementary File 1), 37 belonged to the Lachnospiraceae family. Oscillospiraceae and Bacteroidaceae were the second most represented families with 10 ASVs belonging to each of these families. 93 of the 117 ASVs were detected as significant by DESEQ2, 28 by MetagenomeSeq, and 4 by ANCOM. 45 ASVs were not associated with any lifestyle or dietary variables extracted from the questionnaires. Most notably, 11 ASVs were identified by at least 2 differential analysis methods.
Table 1 summarizes the 11 ASVs with overlapping detection between two contrast methods independently, and their lifestyle/dietary associations if applicable. Two of these were solely associated with the ASD cohort, and no other dietary or metadata co-variate: one from the genus Holdemania and one from the family Lachnospiraceae. Interestingly, the Blautia genus was represented in 3 of the 11 ASVs.
Figure 2 shows total sum scaled abundance bar plots for the ASVs identified as significant by two methods (CLR normalized counts in Supplementary File 2). The ASV with the highest abundance among the 11 belonged to Blautia wexlerae with a relative abundance of almost 4% within the NT and around 2.5–3% within the ASD cohort as shown in Fig. 2. ASVs from Bacteroides thetaiotaomicron and a different Blautia ASV were among the next highest abundances with values around 0.005–0.01%.
We also performed differential abundance testing using all three contrast methods on ASV counts aggregated by annotated genus (Supplementary File 3). We found many of the differentially abundant ASVs from Table 1 to be members of differentially abundant genera, namely the genera Bacteroides, Borkfali, Haemophilus, Streptococcus. Interestingly, the genus Veillonella was identified as increased in TD participants by all three analysis methods.
Demographics, diet, and lifestyle differences between cohorts
We recorded 331 diet and lifestyle variables for each individual participating (Supplementary File 4).
Unsurprisingly, as ASD has been found at a higher prevalence in males [25], 84.7% of the ASD cohort were male as compared to 52.7% of the TD cohort. There were no demographic differences between ASD and TD cohorts as siblings were exclusively documented as the same ethnicity.
A total of 14 of the 331 variables were significantly different between the ASD and TD cohorts. Categorical variables significant in chi-squared tests between cohorts are shown in Table 2 as well as cohort age and C-section birth status. Notably, bowel function and GI symptoms were observed significantly more often in ASD participants, as were special dietary regimes and dietary supplementation (adjusted p < 0.05 in Wilcoxon rank-sum tests or two-way repeated anova). Six of these variables were also associated with microbial community dissimilarities using the Bray–Curtis distance metric tested by PERMANOVA. These variables were “dietary restriction”, “dietary supplementation”, “GI symptoms within 3 months”, “GI issues this week”, “habitual fruit consumption”, and “last 2 weeks dairy consumption”. Other variables significantly associated with microbial structure but not significantly different between cohorts can be found in Supplementary File 5 and PCoAs using Bray–Curtis distance and constrained by phenotype and each of the permanova variables with R2 values above 0.01 are contained in Supplementary File 6. Similar plots using unweighted and weighted UniFrac distance are in Supplementary Files 7 and 8. Extreme consumption rates such as ‘Daily’ or ‘Never’ were, as expected, most easily distinguishable from the microbiome signature.
Dietary/lifestyle, but not global microbiome compositional features, explain ASD phenotype
To assess the overall associations between lifestyle, microbial factors, and ASD phenotype, we employed logistic regression using different feature sets as follows: (1) Basic (age + sex), (2) Basic + lifestyle/dietary variables, (3) Basic + microbiome features, (4) Basic + lifestyle/diet variables + microbiome features. Microbiome features were calculated as scores along a principal coordinate ordination using Bray–Curtis distance. Additionally, null models were created by replacing features with uniformly randomly distributed noise.
We found that inclusion of lifestyle/dietary variables, but not of microbiome features, significantly improved the explanatory power of a model over basic features (Fig. 3A). Microbiome features did explain phenotype significantly more accurately than random noise variables. A combination of lifestyle and microbiome features did not significantly improve performance over lifestyle features.
Because highly correlated variables are difficult to distinguish using regression models, we calculated the Pearson correlation matrix between all significant lifestyle variables and other lifestyle variables (Fig. 3B). We find that high bread, multivitamin, fermented vegetable, and olive oil consumption, along with GI distress and non-celiac sensitivity, are significant predictors of ASD. Home-prepared meals, as opposed to ready-to-eat meals, were inversely correlated with both ASD phenotype and GI distress.
While the major axes of variation within the gut microbiome did not present additional explanatory power on top of age and sex, some of the axes were statistically significantly related to phenotype. Correlations between axes coordinates and lifestyle variables are found in Fig. 3C. Most notably, a sample’s position along the axes of highest variation (axis1) was associated with the TD phenotype, and scores along this axis correlated with vegetable, fruit, and fat/oil consumption, in addition to meats, seafood, and in general eating home-prepared meals.
Some principal component axes, while not obviously correlated with any lifestyle characteristic, were enriched for the 8 biomarkers associated with ASD or the 3 biomarkers associated with TD (Fig. 2, Table 1). A modified gene set enrichment analysis where a set was considered the eight ASD or three TD biomarkers revealed that scores for biomarkers along particular significant axes were more skewed than would be expected by random chance (gsea p < 0.05) (Fig. 3D).
11 Taxa correlate with anxiety scores within and across individuals
Anxiety in the last 2 weeks before each sample collection was reported by caretakers on a scale of “No elevated anxiety”, “Somewhat elevated”, and “elevated” (0, 1, 2). This metric was used to measure changes in anxiety within the same individual across time, allowing us to fully leverage the longitudinal nature of the data to identify specific ASVs associated with reported anxiety. We found 10 ASVs significantly negatively correlated and 1 ASV positively correlated with increasing anxiety (Fig. 4A, B). Two ASVs from the species A. butyriciproducens, well-known for it’s butyrate production, both negatively correlated with anxiety. 6 of the 10 ASVs negatively correlated with anxiety were members of the Lachnospiraceae family. Three of the ASVs found correlated with anxiety in the full cohort were similarly correlated with anxiety when considering only the ASD samples (Fig. 4B).
We also found diversity metrics (Chao1, Shannon, FaithPD) to be correlated with ASD severity score (MARA) and age, however, diversity was not significantly different between ASD and TD cohorts (Supplementary File 9).
Ten behavioral variables are associated with the microbial structure within the ASD cohort
Out of the 14 behavioral questions within the Mobile Autism Risk Assessment (MARA, “Methods” for details) collected in the ASD cohort, 10 were significantly associated with gut microbiome composition (Table 3). Constrained PCOAs using Bray–Curtis distances of DESEQ2 normalized counts were created for each of the significant behavioral variables (Supplementary File 10).
Discussion
The present study provides novel insights into the relationship between the gut microbiota and ASD. By leveraging siblings as controls and recording longitudinal gut microbiome, lifestyle, and dietary factors, this work identified differentially abundant taxa between the two cohorts, and identified environmental variables possibly influencing these abundances. We also measure the explanatory power of lifestyle and dietary variables as compared to compositional microbiome features to distinguish between ASD and TD, and identified taxa associated with anxiety changes, a condition comorbid to ASD. Lastly, we find that overall microbiome composition (beta-diversity) is associated with specific ASD-related behavioral characteristics.
Associating microbial biomarkers with ASD
The set of 117 ASVs identified by any differential analysis method represents the most inclusive analysis and is meant to capture all potentially relevant hits, at the cost of increased expected false positives. In contrast, the 11 ASV set represents the intersection of multiple methods and is expected to contain more robust hits that are reproducible across multiple methods. Among these 11 ASVs, the observed increase in Bacteroides annotated-ASVs and the decrease of the Streptococcus annotated-ASVs in ASD individuals are consistent with findings from previous studies on the genus level [26, 27]. The enrichment of a Haemophilus genus member and Blautia wexlarae within the TD cohort coincides with findings of increase of Haemophilus parainfluenzae and the Blautia genus in TD individuals compared to ASD cohorts [27, 28]. In contrast, we also found two other ASVs from the Blautia genus were significantly increased in the ASD cohort. Notably, B. wexlarae has also been found to be associated with non-celiac gluten sensitivity, a common comorbidity of ASD (we also observed a trend towards gluten sensitivity in our cohort (q val 0.01, Supplementary File 11)) [29]. As 16S amplicon studies have limited taxonomic identification, these sub-genus level discrepancies call for deeper phylogenetic characterization, which could be achieved by metagenomic sequencing. We also noted that a member of the Holdemania genus was enriched in the ASD cohort, which another study found was associated with ASD children who were classified as “picky eaters” [30]. Association of Borkfalki ceftriaxensis and the Anaerovovracaceae family with ASD do not seem to have been reported yet in the literature. Among the 117 ASVs detected to be significant in at least one of our methods, Faecalibacterium, Anaerofustis, and Erysipelatoclostridium ASVs were associated with ASD, and one ASV from Dialister with the TD cohort, which is consistent with previous studies, but may represent false-positive hits [27, 31,32,33,34].
In addition to ASV level analyses, we also performed differential analysis after aggregating ASV counts into genera. We found many of the differentially abundant ASVs from Table 1 to be members of differentially abundant genera, namely the genera Bacteroides, Borkfali, Haemophilus, Streptococcus. Interestingly, the genus Veillonella was identified as increased in TD participants by all three analysis methods. This genus is best known for fermenting lactate to propionate and has been observed to be increased in TD controls in other ASD studies as well [19, 22, 35].
Differentiating between phenotype-based signals and environmental proxies
Dietary selectivity and food sensitivity amongst children with ASD have been widely reported in the literature [19, 36]. Dietary habits and gut microbiome compositions are integrally linked; diet influences microbiome composition and bacteria may influence dietary choices [37]. This study attempted to pinpoint the confounding factors that both impact the microbial structure and are significantly associated with one of the two cohorts within our 4-week-long assessment. Six variables listed in Table 2 were both different between cohorts and associated with gut microbiome composition. As such, these are highly relevant confounding factors that should be considered carefully when untangling the complex relationship between the gut microbiome and ASD.
Overall effect size of diet/lifestyle vs. microbial signature
While differential analysis allows us to identify specific ASVs of interest, it does not reveal the overall signal strength of the set of variables identified. We used logistic regression models to estimate the explanatory power of environmental variable signatures, and found that lifestyle and dietary features are able to differentiate individuals with ASD from their siblings significantly above a baseline model of sex and age, but that global microbiome compositional features cannot. At the same time, actual microbiome features do provide more explanatory power than random noise, implying that compositional microbiome features may capture some of the influences of diet and lifestyle that are predictive of phenotype. Some axes of gut microbial variation that are not obviously reflective of diet/lifestyle but are still statistically associated with phenotype may be driven by abundance of the reported biomarkers, however, this effect is small compared to the effects of diet.
We emphasize that many of the predictive diet/lifestyle variables are likely informed by a child’s ASD status and therefore cannot be interpreted as directly related to biology. For instance, fermented vegetable consumption and vitamin D supplementation are predictive of the ASD phenotype possibly because parents interested in the intestinal health of their ASD children provide these interventions intentionally.
The overall microbial structure is associated with self-reported behavioral measures
This study is not the first to report that the microbiota may reflect complex behavioral traits. Multiple animal-animal or human-animal stool microbiome transplant studies have shown that some behavioral traits seem to be mediated by the gut content [38, 39]. In humans, Flannery et al. determined that taxonomic and functional composition of the gut microbiome is associated with behavior and early development in school-aged children [40]. Other studies have also associated microbiome structure with a toddler’s temperament [41]. In ASD specifically, an open-label study showed that microbiota transfer therapy from a neurotypical donor to a recipient with an ASD diagnosis improved GI and behavioral symptoms [12, 13].
Reduction of alpha diversity between children with ASD and a control cohort has been reported in several studies [10, 20, 22, 34, 42] but remains inconsistent across cohorts, especially in sibling studies [43, 44]. Notably in this study, younger participants were more likely to report more severe ASD symptoms and to have a less diverse microbiome, suggesting that discrepancies in age or failure to utilize severity of ASD symptoms in analysis may both be confounding factors and explain some of the discrepancies in alpha diversity metric results across studies.
Detecting stable microbial signatures over time
We found 33 ASVs that were significant in a single timepoint, but were not consistently different across timepoints. For example, single timepoints implicated ASVs annotated to the species Faecalibacterium prausnitzii, Bacteroides intestinalis, and Bifidobacterium bifidum, all species that have previously been observed as differentially abundant in ASD [31, 34]. Without longitudinal data to account for intra-individual gut microbiome variability, differential analyses are likely to detect false positives. As such, the lack of longitudinal data likely accounts for at least a portion of the inconsistencies observed between gut microbiome association studies [45].
Anxiety changes over time correlate with changing taxa abundances
Among the ASVs linked to changes in reported anxiety, six ASVs annotated as Lachnospiraceae were negatively correlated with anxiety. As mentioned above, the association of the Lachnospiraceae family with ASD is widely discussed in the literature; but this family has been reported negatively correlated with ASD comorbid and psychological distress disorders frequently [33,46,47,48].
Two ASVs from the same species, Agathobaculum butyriciproducens, were negatively correlated with anxiety in the full cohort and within only the ASD cohort. This species is known for heavy butyrate production, which has been demonstrated to decrease gut inflammation and enhance gut epithelial barrier integrity [49, 50]. Given the prevalence of GI issues and leaky gut in ASD [51], the inverse relationship between these butyrate producers and anxiety may be related to some of the GI and core symptoms we observe in ASD. Furthermore, the same three ASVs were positively correlated with the alpha diversity indices in the ASD cohort, suggesting that these taxa are among those lost in lower diversity guts.
While a non-traditional three-point scale was used to measure anxiety, our results are consistent with association scales identified using the more widely used Beck Anxiety Inventory. One study using single timepoint data from 73 individuals reports significant Spearman correlation coefficients of around 0.3 when correlation individual taxa to anxiety metrics across individuals, as compared to our reported coefficients of around 0.16 [52].
Further considerations and future directions
Despite the number of longitudinal samples and wealth of dietary information, 16S rRNA amplicon analysis can only provide a superficial identification of stool-associated microbial structure. Discrepancies in the species and strain identification could be reduced using shotgun metagenomic sequencing. Paired with multi-omics analysis, such studies would help pinpoint specific mechanisms beyond microbial structure associations.
This paper presents one of the largest longitudinal studies implemented in ASD. But with each sampling separated by two weeks, our sampling only represents a small snapshot of time within an individual’s gut microbiome (one month). Gut microbiomes can fluctuate based on the season due to differences in dietary habits during these times and this effect can be seen to some degree in our study. We attempted to mitigate this issue by sampling sibling control at the same time, but observing a longer collection period would help to maximize the signal/noise ratio.
Previous literature suggests that TD siblings of children with ASD may not constitute a perfect control group, as they tend to exhibit microbiome compositions between their own siblings and unrelated neurotypical controls [22, 53]. Therefore, the ASVs identified in this study are likely to be biased towards precision (confidence that observed difference is related directly to phenotype) rather than sensitivity (confidence that all possible differences have been detected). This study design is likely to have limited the potential biasing effects of enterotypes (distinct community types that stratify individuals) in differential analysis as well [54]. Because sibling pairs are likely to display the same enterotype, a brief investigation into the Bacteroides to Prevotella as well as Firmicutes to Bacteroidetes ratios amongst participants suggests an even spread of this variable between phenotypes (Supplementary File 12).
In addition, ANCOM authors have released ANCOMBC, a newer version of the ANCOM than the ANCOM2.1 used in this study [55]. While there may be slight benefit from using ANCOMBC, we do not expect a drastic change to our conclusions by using ANCOMBC as only 4 of the 117 taxa were significant through ANCOM2.1.
Finally, the lifestyle and dietary information collected from participants was self-reported. While we had more objective measures of verifying the ASD phenotype, the validity of the dietary and behavioral measures collected in this study can vary between families, and even within families if the caregiver changed between samplings. Again, we believe that potential reporting inaccuracies were partially mitigated by the fact that the caregiver answered questions for both the ASD child and their TD sibling for each timepoint.
Conclusion
In the present study, we report 11 microbial taxa consistently associated with ASD in one of the largest longitudinal gut microbiome datasets collected on paired siblings to date. This large sample size was coupled with comprehensive metadata composed of over 100 dietary, lifestyle, and ASD-related variables, and allowed us to identify 6 potential confounding variables within this dataset: habitual fruit and dairy consumption, dietary supplementation or restrictions, and GI symptoms within the past week or last 3 months. In this study, we pinpointed which of the ASVs that significantly differ between the two cohorts are also associated with these factors, providing additional context into what might be driving these associations. We determined that compositional microbiome features do not add extra information above a baseline of age and sex, but diet and lifestyle variables significantly increase explanatory power. We also identified taxa, including A. butyriciproducens, that were negatively correlated with anxiety changes across both cohorts. These ASVs are strong candidates for further investigation into the mechanisms behind these findings and the relationship between ASD, the gut-brain axis, and the gut microbiome. Lastly, we find the overall microbiome composition (beta-diversity) did associate with specific ASD-related behaviors, implying that the nature of ASD as a spectrum must be fully considered in future gut microbiome studies.
Methods
Recruitment and data collection
We recruited families with two siblings, one previously diagnosed with ASD by a health care provider and one TD, via the website microbiome.stanford.edu. In total, 1432 families visited the website. We recruited children between 23 months to 8 years old, and siblings had to be within 2 years of each other. Dietary, lifestyle, demographic, and host health information were collected via an initial and bi-weekly questionnaires (at each collection time) for each individual (see Supplementary File 4). General dietary habits as well as recent dietary intake during the week prior was collected (labeled as longitudinal within Supplementary File 4).
Each sibling provided three stool samples, spaced two weeks apart. While we received 701 samples total, sibling pairs with individuals that were younger than 23 months, were currently being breast-fed or had ASD children that did not meet ASD criteria (See ASD Diagnosis Verification) were removed, leaving a total of 72 sibling pairs consisting of 432 samples from 144 different participants.
Autism spectrum disorder diagnosis verification
The MARA, a parent-reported behavioral questionnaire designed to screen children who are at high risk for ASD, was collected electronically from ASD participants [56].
Additionally, parents submitted a short video of their child with and child without ASD via encrypted file share to be rated for ASD symptoms by Stanford Institutional Review Board approved raters on a set of 30 behavioral features [57]. Scores across multiple raters were fed to previously published Machine Learning classifiers to predict ASD risk scores [58, 59]. By combining these risk scores with the parent-report screening tool (MARA), as well as parent-reported physician diagnosis, we confirmed diagnosis using majority rules consensus. We excluded (3) children and their TD siblings for whom the consensus did not agree with original parent diagnosis. Please see Supplementary File 13 for expanded methods.
Stool collection and storage
Every 2 weeks, caretakers of participants collected a sample using a provided toilet collection kit, and shipped it back at room temperature in preservation buffer (Norgen Biotek, ON, Canada). At the initial timepoint, caretakers also collected a second sample that was immediately frozen at home at −20, then shipped back overnight with two ice packs provided to the participants. Once received, stool samples were stored at −80C until processing. Please see Supplementary File 13 for expanded methods.
DNA extraction, amplification, and sequencing
Before DNA extraction, stool samples were thawed, pelleted, and supernatant was removed. DNA was extracted from the pelleted stool samples using the MagAttract PowerMicrobiome DNA/RNA Kit (Qiagen) on the KingFisher Flex 96 (ThermoFisher), following manufacturer’s instructions. If DNA did not meet quality standards, an additional DNA clean-up procedure was performed with the Zymo ZR-96 DNA Clean-up kit. All samples were quantified via the Quant-iT PicoGreen dsDNA Assay Kit. The 16S rRNA V4 region was amplified with degenerate primers designed against conserved regions of the 16S rRNA V4 gene region, fused with Illumina adapters and indexing barcodes (slight modifications from [60]). The following primer sequences with adapters, pads, and linkers were used:
Forward primer: AATGATACGGCGACCACCGAGATCTACAC TATGGTAATT GT GTGYCAGCMGCCGCGGTAA
Reverse primer: CAAGCAGAAGACGGCATACGAGAT XXXXXXXXXXXX AGTCAGTCAG CC GGACTACNVGGGTWTCTAAT (Where “XXXXXXXXXXXX” represents the Barcode)
PCR products were cleaned-up using AMPure XP beads (Beckman Coulter) and then quantified via the Quant-iT PicoGreen dsDNA Assay Kit (Invitrogen)
Libraries were pooled, and paired-end sequencing (2 × 250 bps) was performed on an Illumina MiSeq using the MiSeq Reagent Kit v2 (500-cycles) and custom sequencing primers. We obtained an average of 157,103 reads per sample (with a min of 23,321 and max is 996,530).
Sequence processing, filtering, and taxonomic annotation
Raw sequence reads were processed with DADA2 applying default settings for filtering, learning errors, dereplication, ASV inference, and chimera removal [61]. Truncation quality (truncQ) was set to 2. Ten nucleotides were trimmed from each terminus of each read. An average of 156, 246 reads per sample library remained after processing the raw reads. For strain level ASV assignment, ASVs were mapped to an in-house strain database (StrainSelect, https://www.secondgenome.com/platform/data-analysis-tools/strainselect, version 2019 (SS19)) using USEARCH (usearch_global) in the same manner as a recent study by Shah [62]. Please see Supplementary File 13 for expanded methods.
Statistical analysis
Statistical analysis was performed using R version 3.6.2 using RStudio Server Pro 1.2.5033-1. The following packages were used: shiny 1.5.0, tibble 3.0.2, data.table 1.13.0, devtools 2.3.1, knitr 1.29, tidyr 1.1.0, reshape2 1.4.4, dplyr 1.0.0, ggplot2 3.3.2, pander 0.6.3, DT 0.14, gridExtra 2.3, adegraphics1.0–15, stats, smart 3.4–8, caret 6.0–86, randomforest 4.6–14, ROCR 1.0–11, exactRankTests 0.8–31, nlme 3.1–148, compositions 2.0–0, ggpubr 0.4.0, vegan 2.5–6, MetagenomeSeq 1.28.2, DESeq2 1.26.0, biomformat 1.14.0, phyloseq 1.30.0 and sourced ANCOM2.1 from https://github.com/FrederickHuangLin/ANCOM.git. Full code used for analysis can be found at https://github.com/MaudeDavidLab/M3_Phyloseq_Analysis/tree/manuscript.
Normalization and taxa filtration
Before filtration, we attained a minimum read depth of 2.3 × 104 reads and a maximum depth of 9.9 × 105 reads. Taxa not present in at least 3% of the samples were removed. Taxa abundances were normalized using DESeq2 or Cumulative Sum Scaling (CSS) depending on the contrast analysis performed [63, 64]. Due to how DESeq2 normalized minimized intra-group variance within families more so than CSS (Supplementary File 14), DESeq2 was used as the primary normalization in our gut microbial community analysis.
In addition, taxa that significantly vary over time within the same individual were removed to increase the chance of identifying taxa directly related to core phenotype characteristics, rather than changes due to diet or season. A Friedman test was used to model ASV abundance as dependent on timepoint for each individual, and ASVs that were significantly related to timepoint (p < 0.1) were removed. 64 ASVs were removed from DESeq normalized data, 78 ASVs were removed from CSS normalized data, and 72 ASVs were removed from unnormalized data.
See Supplementary File 13 for complete methods description.
References
Maenner MJ, Shaw KA, Baio J, Washington A, Patrick M, DiRienzo M, et al. Prevalence of autism spectrum disorder among children aged 8 years - autism and developmental disabilities monitoring network, 11 sites, United States, 2016. MMWR Surveill Summ. 2020;69:1–12.
Bai D, Yip B, Windham GC, Sourander A, Francis R, Yoffe R, et al. Association of genetic and environmental factors with autism in a 5-country cohort. JAMA Psychiatry. 2019;76:1035–43.
David MM. The role of the microbiome in autism: all that we know about all that we don’t know. mSystems 2021;6. https://doi.org/10.1128/mSystems.00234-21.
Karimi P, Kamali E, Mousavi SM, Karahmadi M. Environmental factors influencing the risk of autism. J Res Med Sci. 2017;22:27.
Kohane IS, McMurry A, Weber G, MacFadden D, Rappaport L, Kunkel L, et al. The co-morbidity burden of children and young adults with autism spectrum disorders. PLoS ONE. 2012;7:e33224.
Wasilewska J, Klukowski M. Gastrointestinal symptoms and autism spectrum disorder: links and risks - a possible new overlap syndrome. Pediatric Health Med Ther. 2015;6:153–66.
Hsiao EY. Gastrointestinal issues in autism spectrum disorder. Harv Rev Psychiatry. 2014;22:104–11. https://doi.org/10.1097/hrp.0000000000000029.
Sgritta M, Dooling SW, Buffington SA, Momin EN, Francis MB, Britton RA, et al. Mechanisms underlying microbial-mediated changes in social behavior in mouse models of autism spectrum disorder. Neuron. 2019;101:246–59.e6.
Hsiao EY, McBride SW, Hsien S, Sharon G, Hyde ER, McCue T, et al. Microbiota modulate behavioral and physiological abnormalities associated with neurodevelopmental disorders. Cell. 2013;155:1451–63.
Adams JB, Johansen LJ, Powell LD, Quig D, Rubin RA. Gastrointestinal flora and gastrointestinal status in children with autism-comparisons to typical children and correlation with autism severity. BMC Gastroenterol. 2011;11:22.
Song Y, Liu C, Finegold SM. Real-time PCR quantitation of clostridia in feces of autistic children. Appl Environ Microbiol. 2004;70:6459–65.
Kang D-W, Adams JB, Coleman DM, Pollard EL, Maldonado J, McDonough-Means S, et al. Long-term benefit of microbiota Transfer therapy on autism symptoms and gut microbiota. Sci Rep. 2019;9:5821.
Kang D-W, Adams JB, Gregory AC, Borody T, Chittick L, Fasano A, et al. Microbiota Transfer Therapy alters gut ecosystem and improves gastrointestinal and autism symptoms: an open-label study. Microbiome 2017;5. https://doi.org/10.1186/s40168-016-0225-7.
Santocchi E, Guiducci L, Prosperi M, Calderoni S, Gaggini M, Apicella F, et al. Effects of probiotic supplementation on gastrointestinal, sensory and core symptoms in autism spectrum disorders: a randomized controlled trial. Front Psychiatry. 2020;11:550593.
Fowlie G, Cohen N, Ming X. The perturbance of microbiome and gut-brain axis in autism spectrum disorders. Int J Mol Sci. 2018;19. https://doi.org/10.3390/ijms19082251.
Emanuele E, Orsi P, Boso M, Broglia D, Brondino N, Barale F, et al. Low-grade endotoxemia in patients with severe autism. Neurosci Lett. 2010;471:162–5.
Hollocks MJ, Howlin P, Papadopoulos AS, Khondoker M, Simonoff E. Differences in HPA-axis and heart rate responsiveness to psychosocial stress in children with autism spectrum disorders with and without co-morbid anxiety. Psychoneuroendocrinology. 2014;46:32–45.
Zaboski BA, Storch EA. Comorbid autism spectrum disorder and anxiety disorders: a brief review. Future Neurol. 2018;13:31–7.
Kang D-W, Park JG, Ilhan ZE, Wallstrom G, Labaer J, Adams JB, et al. Reduced incidence of Prevotella and other fermenters in intestinal microflora of autistic children. PLoS ONE. 2013;8:e68322.
Finegold SM, Molitoris D, Song Y, Liu C, Vaisanen ML, Bolte E, et al. Gastrointestinal microflora studies in late-onset autism. Clin Infect Dis. 2002;35:S6–16.
Parracho HM, Bingham MO, Gibson GR, McCartney AL. Differences between the gut microflora of children with autistic spectrum disorders and that of healthy children. J Med Microbiol. 2005;54:987–91.
Finegold SM, Dowd SE, Gontcharova V, Liu C, Henley KE, Wolcott RD, et al. Pyrosequencing study of fecal microflora of autistic and control children. Anaerobe. 2010;16:444–53.
David LA, Maurice CF, Carmody RN, Gootenberg DB, Button JE, Wolfe BE, et al. Diet rapidly and reproducibly alters the human gut microbiome. Nature. 2014;505:559–63.
Xie H, Guo R, Zhong H, Feng Q, Lan Z, Qin B, et al. Shotgun metagenomics of 250 adult twins reveals genetic and environmental impacts on the gut microbiome. Cell Syst. 2016;3:572–84.e3.
Werling DM, Geschwind DH. Sex differences in autism spectrum disorders. Curr Opin Neurol. 2013;26:146–53.
Zhang M, Ma W, Zhang J, He Y, Wang J. Analysis of gut microbiota profiles and microbe-disease associations in children with autism spectrum disorders in China. Sci Rep. 2018;8:13981.
Liu F, Li J, Wu F, Zheng H, Peng Q, Zhou H. Altered composition and function of intestinal microbiota in autism spectrum disorders: a systematic review. Transl Psychiatry. 2019;9:43.
Kang D-W, Ilhan ZE, Isern NG, Hoyt DW, Howsmon DP, Shaffer M, et al. Differences in fecal microbial metabolites and microbiota of children with autism spectrum disorders. Anaerobe. 2018;49:121–31.
Lau NM, Green PHR, Taylor AK, Hellberg D. Markers of celiac disease and gluten sensitivity in children with autism. PLoS ONE. 2013;8:e66155.
Berding K, Donovan SM. Diet can impact microbiota composition in children with autism spectrum disorder. Front Neurosci. 2018;12:515.
Coretti L, Paparo L, Riccio MP, Amato F, Cuomo M, Natale A, et al. Gut microbiota features in young children with autism spectrum disorders. Front Microbiol. 2018;9. https://doi.org/10.3389/fmicb.2018.03146.
Liu F, Horton-Sparks K, Hull V, Li RW, Martínez-Cerdeño V. The valproic acid rat model of autism presents with gut bacterial dysbiosis similar to that in human autism. Mol Autism. 2018;9:61.
Ma B, Liang J, Dai M, Wang J, Luo J, Zhang Z, et al. Altered gut microbiota in Chinese children with autism spectrum disorders. Front Cell Infect Microbiol. 2019;9:40.
De Angelis M, Francavilla R, Piccolo M, De Giacomo A, Gobbetti M. Fecal microbiota and metabolome of children with autism and pervasive developmental disorder not otherwise specified. PLoS ONE. 2013;8:e76993–13.
Strati F, Cavalieri D, Albanese D, De Felice C, Donati C, Hayek J, et al. New evidences on the altered gut microbiota in autism spectrum disorders. Microbiome. 2017;5. https://doi.org/10.1186/s40168-017-0242-1.
Cermak SA, Curtin C, Bandini LG. Food selectivity and sensory sensitivity in children with autism spectrum disorders. J Am Diet Assoc. 2010;110:238–46.
Alcock J, Maley CC, Aktipis CA. Is eating behavior manipulated by the gastrointestinal microbiota? Evolutionary pressures and potential mechanisms. Bioessays. 2014;36:940–9.
Sharon G, Cruz NJ, Kang DW, Gandal MJ, Wang B, Kim YM, et al. Human gut microbiota from autism spectrum disorder promote behavioral symptoms in mice. Cell. 2019;177:1600–18.e17.
Bercik P, Denou E, Collins J, Jackson W, Lu J, Jury J, et al. The intestinal microbiota affect central levels of brain-derived neurotropic factor and behavior in mice. Gastroenterology. 2011;141:599–609. 609.e1–3.
Flannery JE, Stagaman K, Burns AR, Hickey RJ, Roos LE, Giuliano RJ, et al. Gut feelings begin in childhood: the gut metagenome correlates with early environment, caregiving, and behavior. mBio 2020;11. https://doi.org/10.1128/mbio.02780-19.
Christian LM, Galley JD, Hade EM, Schoppe-Sullivan S, Dush CK, Bailey MT. Gut microbiome composition is associated with temperament during early childhood. Brain Behav Immun. 2015;45:118–27. https://doi.org/10.1016/j.bbi.2014.10.018.
Finegold SM. Desulfovibrio species are potentially important in regressive autism. Med Hypotheses. 2011;77:270–4.
Ahmed SA, Elhefnawy AM, Azouz HG, Roshdy YS, Ashry MH, Ibrahim AE, et al. Study of the gut microbiome profile in children with autism spectrum disorder: a single tertiary hospital experience. J Mol Neurosci. 2020;70:887–96.
David MM, Tataru C, Daniels J, Schwartz J, Keating J, Hampton-Marcell J, et al. Crowdsourced study of children with autism and their typically developing siblings identifies differences in taxonomic and predicted function for stool-associated microbes using exact sequence variant analysis. 2018; https://www.biorxiv.org/content/10.1101/319236v2.full.pdf.
Fouquier J, Huizar NM, Donnelly J, Glickman C, Kang D-W, Maldonado J, et al. The gut microbiome in autism: study-site effects and longitudinal analysis of behavior change. mSystems. 2021;6. https://doi.org/10.1128/mSystems.00848-20.
Ding X, Xu Y, Zhang X, Zhang L, Duan G, Song C, et al. Gut microbiota changes in patients with autism spectrum disorders. J Psychiatr Res. 2020;129:149–59.
Peter J, Fournier C, Durdevic M, Knoblich L, Keip B, Dejaco C, et al. A microbial signature of psychological distress in irritable bowel syndrome. Psychosom Med. 2018;80:698–709.
Järbrink-Sehgal E, Andreasson A. The gut microbiota and mental health in adults. Curr Opin Neurobiol. 2020;62:102–14.
Ahn S, Jin TE, Chang DH, Rhee MS, Kim HJ, Lee SJ, et al. Agathobaculum butyriciproducens gen. nov. sp. nov., a strict anaerobic, butyrate-producing gut bacterium isolated from human faeces and reclassification of Eubacterium desmolans as Agathobaculum desmolans comb. nov. Int J Syst Evol Microbiol. 2016;66:3656–61.
Silva JPB, Navegantes-Lima KC, Oliveira A, Rodrigues D, Gaspar S, Monteiro V, et al. Protective mechanisms of butyrate on inflammatory bowel disease. Curr Pharm Des. 2018;24:4154–66.
Wang LW, Tancredi DJ, Thomas DW. The prevalence of gastrointestinal problems in children across the United States with autism spectrum disorders from families with multiple affected members. J Dev Behav Pediatr. 2011;32:351–60.
Chung Y-CE, Chen HC, Chou HL, Chen IM, Lee MS, Chuang LC, et al. Exploration of microbiota targets for major depressive disorder and mood related traits. J Psychiatr Res. 2019;111:74–82.
Son JS, Zheng LJ, Rowehl LM, Tian X, Zhang Y, Zhu W, et al. Comparison of fecal microbiota in children with autism spectrum disorders and neurotypical siblings in the simons simplex collection. PLoS ONE. 2015;10:e0137725.
Costea PI, Hildebrand F, Arumugam M, Bäckhed F, Blaser MJ, Bushman FD, et al. Enterotypes in the landscape of gut microbial community composition. Nat Microbiol. 2018;3:8–16.
Lin H, Peddada SD. Analysis of compositions of microbiomes with bias correction. Nat Commun. 2020;11:3514.
Duda M, Daniels J, Wall DP. Clinical evaluation of a novel and mobile autism risk assessment. J Autism Dev Disord. 2016;46:1953–61.
Tariq Q, Daniels J, Schwartz JN, Washington P, Kalantarian H, Wall DP. Mobile detection of autism through machine learning on home video: a development and prospective validation study. PLoS Med. 2018;15:e1002705.
Kosmicki JA, Sochat V, Duda M, Wall DP. Searching for a minimal set of behaviors for autism detection through feature selection-based machine learning. Transl Psychiatry. 2015;5:e514.
Levy S, Duda M, Haber N, Wall DP. Sparsifying machine learning models identify stable subsets of predictive features for behavioral detection of autism. Mol Autism. 2017;8:65.
Caporaso JG, Lauber CL, Walters WA, Berg-Lyons D, Huntley J, Fierer N, et al. Ultra-high-throughput microbial community analysis on the Illumina HiSeq and MiSeq platforms. ISME J. 2012;6:1621–4.
Callahan BJ, McMurdie PJ, Rosen MJ, Han AW, Johnson AJ, Holmes SP. DADA2: high resolution sample inference from amplicon data. bioRxiv. 2015. https://doi.org/10.1101/024034.
Shah MS, DeSantis TZ, Weinmaier T, McMurdie PJ, Cope JL, Altrichter A, et al. Leveraging sequence-based faecal microbial community survey data to identify a composite biomarker for colorectal cancer. Gut. 2018;67:882–91. https://doi.org/10.1136/gutjnl-2016-313189.
Love MI, Huber W, Anders S. Moderated estimation of fold change and dispersion for RNA-seq data with DESeq2. Genome Biol. 2014;15:550.
Paulson JN, Colin Stine O, Bravo HC, Pop M. Differential abundance analysis for microbial marker-gene surveys. Nat Methods. 2013;10:1200–2. https://doi.org/10.1038/nmeth.2658.
Acknowledgements
This work was supported by the NIH (NIDA R44DA04395402) (investigators: SI, DPW, MMD), the NSF (GRFP) (GED, BSC). The funding bodies did not play any role in the design, collection, analysis, interpretation or writing of the study.
Author information
Authors and Affiliations
Contributions
AM, CT, and MMD performed the primary statistical analysis on the 16S data and created the manuscript. SI, TZD, MD, and DPW conceptualized the project. BSC and KD performed participant recruitment, phenotype verification, and sample collection. XY, ER, RLH, WH, ML, ER, KS collected, curated, and generated the data. YW developed the software. MP and XY performed quality control and data processing.
Corresponding authors
Ethics declarations
Competing interests
The authors declare a conflict of interest. The authors affiliated with Second Genome, Inc. have the following competing interests: Second Genome Inc. employs and provides stock options to all authors affiliated with Second Genome Inc. MMD has a financial interest in Second Genome Inc. Second Genome Inc. is an independent therapeutics company with products in development to treat Inflammatory Bowel Diseases and Cancer, and could potentially benefit from the outcomes of this research. MMD is co-owner of Microbiome Engineering Inc., a company specialized in developing biosensors. DPW is cofounder of Cognoa, a company focused on digital methods for healthy child development.
Additional information
Publisher’s note Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Supplementary information
Rights and permissions
Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons license and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/.
About this article
Cite this article
Tataru, C., Martin, A., Dunlap, K. et al. Longitudinal study of stool-associated microbial taxa in sibling pairs with and without autism spectrum disorder. ISME COMMUN. 1, 80 (2021). https://doi.org/10.1038/s43705-021-00080-6
Received:
Revised:
Accepted:
Published:
DOI: https://doi.org/10.1038/s43705-021-00080-6
This article is cited by
-
A robust microbiome signature for autism spectrum disorder across different studies using machine learning
Scientific Reports (2024)
-
Topic modeling for multi-omic integration in the human gut microbiome and implications for Autism
Scientific Reports (2023)