Dimension reduction of microbiome data linked Bifidobacterium and Prevotella to allergic rhinitis

Komaki, Shohei; Sahoyama, Yukari; Hachiya, Tsuyoshi; Koseki, Keita; Ogata, Yusuke; Hamazato, Fumiaki; Shiozawa, Manabu; Nakagawa, Tohru; Suda, Wataru; Hattori, Masahira; Kawakami, Eiryo

doi:10.1038/s41598-024-57934-x

Download PDF

Article
Open access
Published: 05 April 2024

Dimension reduction of microbiome data linked Bifidobacterium and Prevotella to allergic rhinitis

Shohei Komaki¹,
Yukari Sahoyama²,
Tsuyoshi Hachiya¹,
Keita Koseki³,
Yusuke Ogata⁴,
Fumiaki Hamazato²,
Manabu Shiozawa²,
Tohru Nakagawa⁵,
Wataru Suda⁴,
Masahira Hattori^4,6 &
…
Eiryo Kawakami^3,7,8

Scientific Reports volume 14, Article number: 7983 (2024) Cite this article

703 Accesses
1 Altmetric
Metrics details

Subjects

Abstract

Dimension reduction has been used to visualise the distribution of multidimensional microbiome data, but the composite variables calculated by the dimension reduction methods have not been widely used to investigate the relationship of the human gut microbiome with lifestyle and disease. In the present study, we applied several dimension reduction methods, including principal component analysis, principal coordinate analysis (PCoA), non-metric multidimensional scaling (NMDS), and non-negative matrix factorization, to a microbiome dataset from 186 subjects with symptoms of allergic rhinitis (AR) and 106 controls. All the dimension reduction methods supported that the distribution of microbial data points appeared to be continuous rather than discrete. Comparison of the composite variables calculated from the different dimension reduction methods showed that the characteristics of the composite variables differed depending on the distance matrices and the dimension reduction methods. The first composite variables calculated from PCoA and NMDS with the UniFrac distance were strongly associated with AR (FDR adjusted P = 2.4 × 10^–4 for PCoA and P = 2.8 × 10^–4 for NMDS), and also with the relative abundance of Bifidobacterium and Prevotella. The abundance of Bifidobacterium was also linked to intake of several nutrients, including carbohydrate, saturated fat, and alcohol via composite variables. Notably, the association between the composite variables and AR was much stronger than the association between the relative abundance of individual genera and AR. Our results highlight the usefulness of the dimension reduction methods for investigating the association of microbial composition with lifestyle and disease in clinical research.

Context-aware dimensionality reduction deconvolutes gut microbial community dynamics

Article 31 August 2020

EMBED: Essential MicroBiomE Dynamics, a dimensionality reduction approach for longitudinal microbiome studies

Article Open access 20 June 2023

Comparison of the effectiveness of different normalization methods for metagenomic cross-study phenotype prediction under heterogeneity

Article Open access 25 March 2024

Introduction

Allergic rhinitis (AR), a condition affecting over half a billion people worldwide¹, is triggered by immunoglobulin E (IgE)-mediated responses to airborne allergens, resulting in symptoms such as nasal itching, sneezing, and congestion². This condition not only reduces quality of life, but also contributes to reduced cognitive function and increased irritability³. AR is also associated with an increased risk of developing asthma². The etiology of AR is still not fully understood, despite its recognition as a major health problem. Emerging research points to the gut microbiota as a key player in the development of allergies^4,5,6. The human gut harbours a wide variety of microorganisms and the composition of the human microbiome varies from person to person. Dietary components can influence the gut microbiome and modulate allergic responses through the production of bioactive metabolites and their interactions with immune cells^4,5,6,7.

In clinical research, enterotyping has been used as a method to characterise the microbial composition of the gut^7,8. In a seminal paper defining enterotypes for the first time, the human gut microbiota was classified into three enterotypes: P-type (Prevotella-rich), B-type (Bacteroides-rich), and R-type (Ruminococcus-rich)⁸. Subsequent studies have shown that the composition of the human microbial community is not discretely distributed as enterotypes, but rather continuously distributed in typical populations^9,10.

Dimension reduction methods such as principal component analysis (PCA) and principal coordinate analysis (PCoA) are commonly used to visualise the distribution of the human microbiome community^11,12. Dimension reduction methods illustrate the distributions of microbial samples by mapping multidimensional data of microbial composition onto a few composite dimensions based on the distances (or dissimilarities) between samples. The composite variables calculated by the dimension reduction methods are robust to technical and biological noise. However, in clinical research, the composite variables calculated by dimension reduction methods have not been widely used to study the association of the human gut microbiome with lifestyle and disease.

There are several methods for calculating the dissimilarity between samples and obtaining composite variables from the distance matrix¹². Here, we investigated the association of the human gut microbial composition with dietary intake of 42 nutrients and the symptom of AR, focusing on the application and comparison of several dimension reduction methods, including PCA, PCoA, non-metric multidimensional scaling (NMDS), and non-negative matrix factorization (NMF).

Materials and methods

Samples and datasets

In the present study, we re-analysed a microbiome dataset from 186 participants with symptoms of AR and 106 controls without symptoms of AR at the Hitachi Health Care Centre in Japan. The dataset was used in our previous study to identify up- and down-regulated microbial genera in AR patients compared to controls¹³. No dimension reduction method was used in the previous study. The present study investigated the association of the human gut microbial composition with dietary intake of 42 nutrients and the symptom of AR, focusing on the application and comparison of dimension reduction methods.

In brief, food consumption data for the study participants were obtained using the Brief Self-Administered Diet History Questionnaire (BDHQ) and adjusted by energy using the density method^14,15,16. Bacterial DNA was isolated from faecal samples, followed by amplification of the 16S V1-V2 region by polymerase chain reaction (PCR)¹⁷. An equal amount of each PCR amplicon was mixed and subjected to multiplex amplicon sequencing using MiSeq (2 × 300 paired-end). Filtered reads with BLAST match lengths < 90% to the representative sequence in the 16S databases, including the Ribosomal Database Project (RDP) (Release 11, Update 5), CORE (updated 13 October 2017; http://microbiome.osu.edu/), and a reference genome sequence database obtained from the NCBI FTP site (ftp://ftp.ncbi.nih.gov/genbank/, April 2013), were considered chimeras and removed. From the filtered reads, 10,000 high quality reads per sample were randomly selected. The total reads were then sorted by the frequency of redundant sequences and grouped into operational taxonomic units (OTUs) using UCLUST with a sequence identity threshold of 97%. The representative sequences of the generated OTUs were subjected to a homology search against the above databases using the GLSEARCH program for taxonomic assignments. Phylum, genus and species level assignments were made using sequence similarity thresholds of 70%, 94% and 97%, respectively.

This study was approved by the Hitachi Hospital Group Ethics Committee (Approved No. 2018-5, 2019-10, and 2020-88), the Institutional Review Board of the Hitachi Ltd. (Approved No. 220-1 and 238-1), and the Research Ethics Committee (Approved No. H30-5). Written informed consent was obtained from all participants. This study was conducted in accordance with the principles of the Declaration of Helsinki.

Dimension reduction of microbial composition

Genus-level abundance was expressed as a percentage. Genera with a mean relative abundance of ≥ 0.1% were included, resulting in a genus-level abundance matrix with 292 rows (samples) and 50 columns (genera).

We first performed enterotyping to classify the microbiome of 292 individuals as in the landmark study⁸. Following the enterotyping R tutorial (https://enterotype.embl.de/), the Jensen-Shannon divergence (square root of the Jensen-Shannon distance; JSD) between all pairs of 292 samples was calculated and a 292 × 292 pairwise distance matrix was generated. The JSD is a symmetrized and smoothed version of the Kullback–Leibler divergence which measures the similarity of the probability distributions of two samples¹⁸. Based on the distance matrix, 292 samples were then clustered into the discrete enterotypes by the partitioning around medoids algorithm using the clusterSim R package (version 0.50.1)¹⁹. The number of clusters was set at 3 as in the seminal study⁸.

We also used several dimension reduction methods, including PCA, PCoA, NMDS, and NMF, to obtain the continuous composite variables from the above-mentioned genus-level abundance matrix of 292 samples and 50 genera. The top 3 composite variables for each dimension reduction method were obtained for subsequent analyses.

To perform the PCA, we used the prcomp function implemented in the stats R package (version 4.2.1)²⁰ with the scale = TRUE option, which normalises the genus-level abundance matrix so that each column has a mean of 0 and a standard deviation of 1. In addition, PCA was performed on the centered log-ratio (CLR) transformed abundance matrix. For the CLR transformation, the clr function implemented in the compositions R package (version 2.0.6)²¹ was used.

To perform the PCoA, we applied the cmdscale function in the stats R package (version 4.2.1) to the above-mentioned JSD to obtain the top 3 composite variables. In addition, we calculated the Bray–Curtis dissimilarity (BCD) matrix²² using the beta.pair.abund function implemented in the betapart R package, which measures the compositional difference between two ecological communities (version 1.5.6)²³ with “bray” specified as index.family, and the weighted UniFrac distance matrix²⁴, which takes into account the phylogenetic distance between genera, using USEARCH (version 10.0.240_i86)²⁵. The cmdscale function was applied to the BCD and UniFrac matrices to obtain the top 3 composite variables.

For the NMDS analysis, the metaMDS function from the vegan R package was used to obtain the top 3 composite variables from JSD, BCD, and UniFrac distance matrices. Prior to generating the JSD and BCD matrices, the microbiome abundance matrix was standardized and multiplied by the total sample size using the decostand function from the vegan R package (version 2.6.2)²⁶.

Non-negative matrix factorization is a dimension reduction method that decomposes a non-negative matrix V into two non-negative matrices W and H, such that V is approximately equal to W multiplied by H²⁷. The matrix W represents the composite variables, while H represents the coefficients when the original data are expressed as a linear combination of the composite variables. The number of composite variables is set as 3. The reconstruction error between V and WH is minimised by iteratively updating W and H according to some loss function. We used the NNLM R package (version 0.4.4)²⁸ to apply NMF to the genus-level abundance matrix V. We used the Kullback–Leibler divergence as the loss function to minimise the reconstruction error.

For comparison, we also calculated the ratio of Prevotella to Bacteroides (P/B ratio)^29,30.

Statistical analysis

To examine the similarity between composite variables, and to examine the association between composite variables and genus-level abundance, we applied the pairwise Spearman's rank correlation test using the cor function from the stats R package with the “spearman” method. We evaluated the association of composite variables with the intake levels of 42 nutrients using the Spearman's rank correlation test. The association of composite variables with AR was tested by the Wilcoxon-Mann–Whitney test using the wilcox_test function from the coin R package (version 1.4.2)³¹, with the distribution set to “exact”. For each dimension reduction method, we calculated the top three composite variables and compared them with 50 genera (3 × 50 = 150 tests), 42 nutrients (3 × 42 = 126 tests) and allergic rhinitis (3 × 1 = 3 tests). We applied the Benjamini and Hochberg false discovery rate (FDR)³² correction for multiple testing to the sets of 150, 126 and 3 P values, respectively, using the p.adjust function in the stats R package, specifying “BH” as method. To investigate the potential impact of confounders, we also assessed the association between each composite variable (an explanatory variable) and AR (an outcome variable) using logistic regression adjusted for age, sex, and BMI. The outcome variable of AR was further regressed on the microbiome distance matrix by the microbiome regression-based kernel association test (MiRKAT) using the MiRKAT R package (version 1.2.3)³³. We used three 292 × 292 pairwise distance matrices (JSD, BCD, and UniFrac) for the MiRKAT analysis.

Results

Continuous distribution of gut microbial community

There were 50 genera with mean relative abundance ≥ 0.1%. The JSD matrix was calculated from the 50-dimensional genus-level data of the 292 individuals. Enterotypes were calculated from the JSD matrix (Fig. 1a). Bacteroides and Prevotella were abundant in the enterotype 1 (corresponding to B-type) and 3 (P-type), respectively (Fig. 1b; Table 1). The abundance of Ruminococcus was low across all enterotypes, whereas Bifidobacterium was abundant in the enterotype 2 (Fig. 1b; Table 1). The distributions of enterotypes 1 and 2 overlapped in the PCoA plots, emphasising that the microbial composition was continuous rather than discrete. The continuous distribution of gut microbial composition was further supported by other dimension reduction methods, including PCA (Fig. 2a,b), PCoA with the BCD and UniFrac (Fig. 2c,d), NMF (Fig. 2e), and NMDS with the JSD, BCD, and UniFrac (Fig. 2f–h).

Table 1 Characteristics of the study participants.

Full size table

Comparison of composite variables calculated using different dimension reduction methods

To compare the composite variables obtained from the different dimension reduction methods, we calculated the Spearman's rank correlation between pairs of composite variables (Fig. 3a). The results showed that the top 3 composite variables calculated from the PCoA and NMDS using the JSD were highly correlated with those calculated from the PCoA and NMDS using the BCD. The top 3 composite variables calculated from the PCoA using the UniFrac distance were highly correlated with those calculated from NMDS using the UniFrac distance. The top 1 composite variable calculated from PCA was highly correlated with those from PCoA and NMDS, while the second and third composite variables from PCA were not remarkably correlated with those calculated from the other methods. The first composite variable calculated by NMF was highly correlated with the P/B ratio, second composite variable was correlated with those of PCoA and NMDS using JSD and BCD, and the third composite variable was highly correlated with the first principal component and the second composite variables of PCoA and NMDS using the UniFrac distance.

The first composite variable calculated using PCoA and NMDS with JSD and BCD was positively correlated with the relative abundance of Prevotella and negatively correlated with Bacteroides (Fig. 3b). The second composite variable calculated using PCoA and NMDS was positively correlated with Bifidobacterium, while the third composite variable was negatively correlated with Bifidobacterium. The first composite variable calculated using PCoA and NMDS with the UniFrac distance was positively correlated with the relative abundance of Bifidobacterium and negatively correlated with Prevotella. The second composite variable was negatively correlated with Bacteroides, while the third composite variable was negatively correlated with Bifidobacterium. The first composite variable calculated using NMF was positively correlated with Prevotella and negatively correlated with Bacteroides, as was the P/B ratio.

Nutrients and AR were related to microbial composition via composite variables

We examined the association of composite variables calculated using different dimension reduction methods with the consumption levels of 42 nutrients and AR. The MiRKAT analysis, which tested the association of distance matrices rather than composite variables with AR, showed that the UniFrac distance was significantly associated with AR (P = 3.7 × 10^–4), whereas the JSD and BCD distances were not (P > 0.05). No significant association was observed between the microbiome distance matrices and nutrient intakes.

According to the Wilcoxon-Mann–Whitney test, AR was associated with the first composite variable calculated from the UniFrac distance (FDR adjusted P = 2.4 × 10^–4 for PCoA and P = 2.8 × 10^–4 for NMDS), the first and second composite variables calculated from the NMF (FDR adjusted P = 0.0048 and 0.029, respectively), and the P/B ratio (P = 0.013; Fig. 4j), while the other composite variables were not significantly associated with AR. Logistic regression analysis adjusted for age, sex, and BMI also confirmed that the composite variables significantly associated with AR in the Wilcoxon-Mann–Whitney test were significantly associated with AR (Supplementary Table S1). The first composite variables calculated from the UniFrac distance (PCoA and NMDS), which showed the strongest positive association with AR, were significantly associated with increased abundance of Bifidobacterium and decreased abundance of Prevotella (Fig. 4c,h). Notably, the association between these composite variables and AR was stronger than the association between the relative abundance of individual genera and AR (FDR adjusted P = 0.45 for Bifidobacterium and P = 0.37 for Prevotella).

The second composite variable calculated from NMF, which was positively correlated with the relative abundance of Bifidobacterium, was also associated with the intake of several nutrients, including carbohydrate [CHO], saturated fat [SFA], and alcohol [ALC], and the latter two nutrients were also associated with the third composite variable (Fig. 4i). CHO, SFA, and ALC were also associated with the second composite variables calculated using the JSD (Fig. 4a,f) and BCD (Fig. 4b,g). The third composite variable calculated by PCoA with JSD was positively associated with nutrient intakes including dietary fibre (total dietary fibre [TDF], insoluble dietary fibre [NDF], and soluble dietary fibre [WDF]), magnesium [MG], potassium [K], niacin [NAC], thiamin [VB1], and vitamin B6 [VB6] (Fig. 4a). The composite variables calculated by PCA were not associated with any nutrient and AR (FDR adjusted P value > 0.05) (Fig. 4d), while the second composite variable calculated by PCA of CLR-transformed abundance data was negatively associated with ALC (Fig. 4e).

Discussion

Dimension reduction has been used to visualise the distribution of multidimensional microbiome data^11,12, but the composite variables calculated by the dimension reduction methods have not been widely used to investigate the relationship of the human gut microbiome with lifestyle and disease. In the present study, we applied several dimension reduction methods, including PCA, PCoA, NMDS, and NMF, to a microbiome dataset from 186 subjects with symptoms of AR and 106 controls. All the dimension reduction methods supported that the microbial composition appeared to be continuous rather than discrete. The top 3 composite variables obtained from JSD and BCD were highly correlated with each other, and those obtained from the UniFrac distance were also highly correlated with each other, whereas the top 3 composite variables from other methods did not correspond, suggesting that the characteristics of the composite variables differed depending on the distance matrices and the dimension reduction methods. We found that the first composite variables calculated from PCoA and NMDS with the UniFrac distance were strongly associated with AR, and also with the relative abundance of Bifidobacterium and Prevotella. Unlike JSD and BCD, which focus primarily on differences in abundance or composition without considering phylogeny, UniFrac incorporates both the relative abundance of bacteria and their phylogenetic relationships²⁴. Another difference is that JSD and BCD were calculated from the genus-level abundance data in our study, whereas the weighted UniFrac distance used classification at the operational taxonomic unit (OTU) level. The variability in the strength of the association between AR and composite variables across different distances and dimension reduction methods underscores the importance of using a variety of analytical approaches to uncover the complex interplay between the microbial community and human health.

The genus Bifidobacterium has been recognized to confer health benefits to the host also through its interaction with the host’s immune system^34,35. These benefits comprise both local effects, which result from the contribution of Bifidobacterium to the intestinal barrier function—which ultimately translates into systemic health—and systemic effects, which stem from the microorganism’s impact on specific pathways through extracellular structures and metabolites. An example of this is the anti-inflammatory response that is elicited by acetate produced by Bifidobacterium³⁶. Bifidobacterium can digest complex carbohydrates, such as glucans, into acetate which is further digested into butyrate by other gut microorganisms. Butyrate is known to possess anti-inflammatory properties that include the production of TGF-β, IL-18, and IL-10 cytokines by antigen-presenting cells and IECs, which together stimulate the differentiation of naïve T cells into Treg cells. Additionally, Bifidobacterium produce two types of pili, hair-like structures found on the surface of bacteria, and pili produced in certain bifidobacterial strains have been shown to stimulate TNF-α levels in macrophages while suppressing other pro-inflammatory cytokines that are associated with systemic immune responses³⁷.

The association between Bifidobacterium and allergic symptoms has indeed been reported. For example, an observational microbiome study reported that patients with atopy and asthma tended to have a lower abundance of Bifidobacterium³⁸. Although studies on the relationship between allergic rhinitis and the intestinal Bifidobacterium abundance are limited, one study reported that the symptom of allergic rhinitis was reduced by oral administration of probiotic B. lactis³⁹. However, the increase in intestinal Bifidobacterium was not confirmed. In addition, it is debated whether Bifidobacterium is beneficial or not, and at least, the genus Bifidobacterium is not uniformly beneficial⁴⁰.

The abundance of Bifidobacterium was further linked to increased intakes of carbohydrate and decreased alcohol consumption by several dimension reduction methods, including PCoA and NMDS using JSD and BCD, and NMF. Consumption of non-digestible carbohydrates has been shown to promote the growth of Bifidobacterium⁴¹. Excessive alcohol consumption has been shown to lead to an imbalance in the gut environment and a reduction in Bifidobacterium populations⁴².

There are several limitations to this study. First, we did not evaluate the performance of the different dimension reduction methods. We applied several dimension reduction methods to a case–control dataset and examined the association of the composite variables calculated using different dimension reduction methods with nutrient intake and AR. Further studies are needed to investigate whether dimension reduction methods can contribute to an increased ability to distinguish patients from controls on the basis of discriminatory power. Second, we only analysed one microbiome dataset. Further research using a variety of microbiome datasets is warranted to determine the advantages and disadvantages of dimension reduction methods. For example, our data showed that the first composite variable calculated by NMF was positively correlated with the P/B ratio, suggesting that NMF may be able to extract a meaningful dimension from microbiome data alone without prior knowledge.

In conclusion, our results highlight the usefulness of the dimension reduction methods for investigating the association of microbial composition with lifestyle and disease in clinical research.

Data availability

The data are not available for public access because of participant privacy concerns, but are available from the corresponding author on reasonable request.

References

Bousquet, J. et al. Allergic Rhinitis and its Impact on Asthma (ARIA) 2008*. Allergy 63, 8–160 (2008).
Article PubMed Google Scholar
Brożek, J. L. et al. Allergic Rhinitis and its Impact on Asthma (ARIA) guidelines—2016 revision. J. Allergy Clin. Immunol. 140, 950–958 (2017).
Article PubMed Google Scholar
Bjermer, L., Westman, M., Holmström, M. & Wickman, M. C. The complex pathophysiology of allergic rhinitis: Scientific rationale for the development of an alternative treatment option. Allergy Asthma Clin. Immunol. 15, 24 (2019).
Article PubMed PubMed Central Google Scholar
Pascal, M. et al. Microbiome and allergic diseases. Front. Immunol. 9, 1584 (2018).
Article PubMed PubMed Central Google Scholar
Hirata, S. & Kunisawa, J. Gut microbiome, metabolome, and allergic diseases. Allergol. Int. 66, 523–528 (2017).
Article CAS PubMed Google Scholar
McKenzie, C., Tan, J., Macia, L. & Mackay, C. R. The nutrition-gut microbiome-physiology axis and allergic diseases. Immunol. Rev. 278, 277–295 (2017).
Article CAS PubMed Google Scholar
Wu, G. D. et al. Linking long-term dietary patterns with gut microbial enterotypes. Science 334, 105–108 (2011).
Article ADS CAS PubMed PubMed Central Google Scholar
MetaHIT Consortium (additional members) et al. Enterotypes of the human gut microbiome. Nature 473, 174–180 (2011).
Mobeen, F., Sharma, V. & Prakash, T. Enterotype variations of the healthy human gut microbiome in different geographical regions. Bioinformation 14, 560–573 (2018).
Article PubMed PubMed Central Google Scholar
Knights, D. et al. Rethinking “enterotypes”. Cell Host Microbe 16, 433–437 (2014).
Article CAS PubMed PubMed Central Google Scholar
Yang, S. et al. The gut microbiome and antibiotic resistome of chronic diarrhea rhesus macaques (Macaca mulatta) and its similarity to the human gut microbiome. Microbiome 10, 29 (2022).
Article CAS PubMed PubMed Central Google Scholar
Zhu, L. et al. Gut microbial characteristics of adult patients with allergy rhinitis. Microb Cell Fact 19, 171 (2020).
Article CAS PubMed PubMed Central Google Scholar
Sahoyama, Y. et al. Multiple nutritional and gut microbial factors associated with allergic rhinitis: The Hitachi Health Study. Sci. Rep. 12, 3359 (2022).
Article ADS CAS PubMed PubMed Central Google Scholar
Kobayashi, S. et al. Comparison of relative validity of food group intakes estimated by comprehensive and brief-type self-administered diet history questionnaires against 16 d dietary records in Japanese adults. Public Health Nutr. 14, 1200–1211 (2011).
Article PubMed Google Scholar
Kobayashi, S. et al. Both comprehensive and brief self-administered diet history questionnaires satisfactorily rank nutrient intakes in Japanese adults. J. Epidemiol. 22, 151–159 (2012).
Article PubMed PubMed Central Google Scholar
Willett, W. & Stampfer, M. J. Total energy intake: Implications for epidemiologic analyses. Am. J. Epidemiol. 124, 17–27 (1986).
Article CAS PubMed Google Scholar
Kim, S.-W. et al. Robustness of gut microbiota of healthy adults in response to probiotic intervention revealed by high-throughput pyrosequencing. DNA Res. 20, 241–253 (2013).
Article MathSciNet CAS PubMed PubMed Central Google Scholar
Endres, D. M. & Schindelin, J. E. A new metric for probability distributions. IEEE Trans. Inform. Theory 49, 1858–1860 (2003).
Article MathSciNet Google Scholar
Walesiak, M. & Dudek, A. The choice of variable normalization method in cluster analysis. Proceedings of the 35th International Business Information Management Association Conference, 325–340 (2020).
R Core Team. R: A language and environment for statistical computing. (R Foundation for Statistical Computing, Vienna, 2022). https://www.R-project.org.
van den Boogaart, K. G. & Tolosana-Delgado, R. “compositions”: A unified R package to analyze compositional data. Comput. Geosci. 34, 320–338 (2008).
Article ADS Google Scholar
Baselga, A. Separating the two components of abundance-based dissimilarity: balanced changes in abundance vs. abundance gradients. Methods Ecol. Evol. 4, 552–557 (2013).
Article Google Scholar
Baselga, A. & Orme, C. D. L. betapart: an R package for the study of beta diversity. Methods Ecol. Evol. 3, 808–812 (2012).
Article Google Scholar
Lozupone, C. & Knight, R. UniFrac: a new phylogenetic method for comparing microbial communities. Appl. Environ. Microbiol. 71, 8228–8235 (2005).
Article ADS CAS PubMed PubMed Central Google Scholar
Edgar, R. C. Search and clustering orders of magnitude faster than BLAST. Bioinformatics 26, 2460–2461 (2010).
Article CAS PubMed Google Scholar
Oksanen, J. et al. vegan: Community Ecology Package.
Lee, D. D. & Seung, H. S. Learning the parts of objects by non-negative matrix factorization. Nature 401, 788–791 (1999).
Article ADS CAS PubMed Google Scholar
Lin, X. & Boutros, P. C. NNLM: Fast and versatile non-negative matrix factorization.
Levy, R. et al. Longitudinal analysis reveals transition barriers between dominant ecological states in the gut microbiome. Proc. Natl. Acad. Sci. USA 117, 13839–13845 (2020).
Article ADS CAS PubMed PubMed Central Google Scholar
Hjorth, M. F. et al. Prevotella-to-Bacteroides ratio predicts body weight and fat loss success on 24-week diets varying in macronutrient composition and dietary fiber: Results from a post-hoc analysis. Int. J. Obes. 43, 149–157 (2019).
Article CAS Google Scholar
Hothorn, T., Hornik, K., van de Wiel, M. A. & Zeileis, A. A Lego system for conditional inference. Am. Stat. 60, 257–263 (2006).
Article MathSciNet Google Scholar
Benjamini, Y. & Hochberg, Y. Controlling the false discovery rate: A practical and powerful approach to multiple testing. J. R. Stat. Soc. Ser. B (Methodol.) 57, 289–300 (1995).
MathSciNet Google Scholar
Plantinga, A. et al. MiRKAT: Microbiome regression-based kernel association tests.
Ruiz, L., Delgado, S., Ruas-Madiedo, P., Sánchez, B. & Margolles, A. Bifidobacteria and their molecular communication with the immune system. Front. Microbiol. 8, 2345 (2017).
Article PubMed PubMed Central Google Scholar
Alessandri, G., Ossiprandi, M. C., MacSharry, J., van Sinderen, D. & Ventura, M. Bifidobacterial dialogue with its human host and consequent modulation of the immune system. Front. Immunol. 10, 2348 (2019).
Article CAS PubMed PubMed Central Google Scholar
Fukuda, S. et al. Bifidobacteria can protect from enteropathogenic infection through production of acetate. Nature 469, 543–547 (2011).
Article ADS CAS PubMed Google Scholar
Duranti, S. et al. Elucidating the gut microbiome of ulcerative colitis: bifidobacteria as novel microbial biomarkers. FEMS Microbiol. Ecol. 92, fiw191 (2016).
Article PubMed Google Scholar
Fujimura, K. E. et al. Neonatal gut microbiota associates with childhood multisensitized atopy and T cell differentiation. Nat. Med. 22, 1187–1191 (2016).
Article CAS PubMed PubMed Central Google Scholar
Singh, A. et al. Immune-modulatory effect of probiotic Bifidobacterium lactis NCC2818 in individuals suffering from seasonal allergic rhinitis to grass pollen: an exploratory, randomized, placebo-controlled clinical trial. Eur. J. Clin. Nutr. 67, 161–167 (2013).
Article CAS PubMed Google Scholar
Hidalgo-Cantabrana, C. et al. Bifidobacteria and their health-promoting effects. Microbiol. Spectr. 5, 5.3.21 (2017).
Pokusaeva, K., Fitzgerald, G. F. & Van Sinderen, D. Carbohydrate metabolism in Bifidobacteria. Genes Nutr. 6, 285–306 (2011).
Article CAS PubMed PubMed Central Google Scholar
Hizo, G. H. & Rampelotto, P. H. The role of bifidobacterium in liver diseases: A systematic review of next-generation sequencing studies. Microorganisms 11, 2999 (2023).
Article CAS PubMed PubMed Central Google Scholar

Download references

Acknowledgements

This research did not receive any specific grant from funding agencies in either the public, commercial, or not-for-profit sectors.

Author information

Authors and Affiliations

Genome Analytics Japan Inc., Tokyo, Japan
Shohei Komaki & Tsuyoshi Hachiya
Technology Strategy Div., Hitachi High-Tech Corporation, Business Tower, Toranomon Hills, 1-17-1 Minato-ku, Toranomon, Tokyo, 105-6409, Japan
Yukari Sahoyama, Fumiaki Hamazato & Manabu Shiozawa
Advanced Data Science Project (ADSP), RIKEN Information R&D and Strategy Headquarters, RIKEN, Yokohama City, Kanagawa, 230-0045, Japan
Keita Koseki & Eiryo Kawakami
Laboratory for Microbiome Sciences, RIKEN Center for Integrative Medical Sciences, Yokohama, Japan
Yusuke Ogata, Wataru Suda & Masahira Hattori
Hitachi Health Care Center, Hitachi Ltd., Ibaraki, Japan
Tohru Nakagawa
Graduate School of Advanced Science and Engineering, Waseda University, Tokyo, Japan
Masahira Hattori
Artificial Intelligence Medicine, Graduate School of Medicine, Chiba University, Chiba City, Chiba, 260-8670, Japan
Eiryo Kawakami
Institute for Advanced Academic Research (IAAR), Chiba University, Chiba City, Chiba, 260-8670, Japan
Eiryo Kawakami

Authors

Shohei Komaki
View author publications
You can also search for this author in PubMed Google Scholar
Yukari Sahoyama
View author publications
You can also search for this author in PubMed Google Scholar
Tsuyoshi Hachiya
View author publications
You can also search for this author in PubMed Google Scholar
Keita Koseki
View author publications
You can also search for this author in PubMed Google Scholar
Yusuke Ogata
View author publications
You can also search for this author in PubMed Google Scholar
Fumiaki Hamazato
View author publications
You can also search for this author in PubMed Google Scholar
Manabu Shiozawa
View author publications
You can also search for this author in PubMed Google Scholar
Tohru Nakagawa
View author publications
You can also search for this author in PubMed Google Scholar
Wataru Suda
View author publications
You can also search for this author in PubMed Google Scholar
Masahira Hattori
View author publications
You can also search for this author in PubMed Google Scholar
Eiryo Kawakami
View author publications
You can also search for this author in PubMed Google Scholar

Contributions

S.K., T.H., K.K., and E.K. wrote the manuscript. T.N. oversaw clinical data management. Y.S., T.H., F.H., and M.S. managed the nutritional data. Y.O., W.S., and M.H. managed the microbiome data. S.K., Y.S., T.H., K.K., and E.K. performed the statistical analysis. F.H., M.S., M.H., and E.K. supervised the work. Y.S., F.H., M.S., T.N., M.H., and E.K. designed and coordinated the project. All authors commented on and approved the manuscript.

Corresponding author

Correspondence to Yukari Sahoyama.

Ethics declarations

Competing interests

S.K. and T.H. are an employee and a board member of Genome Analytics Japan Inc., respectively. Y.S., F.H., and M.S. are employees of Hitachi High-Tech Co. The other authors declare that they have no conflicts of interest.

Additional information

Publisher's note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.

Reprints and permissions

About this article

Cite this article

Komaki, S., Sahoyama, Y., Hachiya, T. et al. Dimension reduction of microbiome data linked Bifidobacterium and Prevotella to allergic rhinitis. Sci Rep 14, 7983 (2024). https://doi.org/10.1038/s41598-024-57934-x

Download citation

Received: 21 July 2023
Accepted: 22 March 2024
Published: 05 April 2024
DOI: https://doi.org/10.1038/s41598-024-57934-x

Comments

By submitting a comment you agree to abide by our Terms and Community Guidelines. If you find something abusive or that does not comply with our terms or guidelines please flag it as inappropriate.