Host genetic factors related to innate immunity, environmental sensing and cellular functions are associated with human skin microbiota

Moitinho-Silva, Lucas; Degenhardt, Frauke; Rodriguez, Elke; Emmert, Hila; Juzenas, Simonas; Möbus, Lena; Uellendahl-Werth, Florian; Sander, Nicole; Baurecht, Hansjörg; Tittmann, Lukas; Lieb, Wolfgang; Gieger, Christian; Peters, Annette; Ellinghaus, David; Bang, Corinna; Franke, Andre; Weidinger, Stephan; Rühlemann, Malte Christoph

doi:10.1038/s41467-022-33906-5

Download PDF

Article
Open access
Published: 19 October 2022

Host genetic factors related to innate immunity, environmental sensing and cellular functions are associated with human skin microbiota

Nature Communications volume 13, Article number: 6204 (2022) Cite this article

5718 Accesses
8 Citations
12 Altmetric
Metrics details

Subjects

Abstract

Despite the increasing knowledge about factors shaping the human microbiome, the host genetic factors that modulate the skin-microbiome interactions are still largely understudied. This contrasts with recent efforts to characterize host genes that influence the gut microbiota. Here, we investigated the effect of genetics on skin microbiota across three different skin microenvironments through meta-analyses of genome-wide association studies (GWAS) of two population-based German cohorts. We identified 23 genome-wide significant loci harboring 30 candidate genes involved in innate immune signaling, environmental sensing, cell differentiation, proliferation and fibroblast activity. However, no locus passed the strict threshold for study-wide significance (P < 6.3 × 10⁻¹⁰ for 80 features included in the analysis). Mendelian randomization (MR) analysis indicated the influence of staphylococci on eczema/dermatitis and suggested modulating effects of the microbiota on other skin diseases. Finally, transcriptional profiles of keratinocytes significantly changed after in vitro co-culturing with Staphylococcus epidermidis, chosen as a representative of skin commensals. Seven candidate genes from the GWAS were found overlapping with differential expression in the co-culturing experiments, warranting further research of the skin commensal and host genetic makeup interaction.

Microbe-host interplay in atopic dermatitis and psoriasis

Article Open access 16 October 2019

Nanna Fyhrquist, Gareth Muirhead, … Harri Alenius

A systematic review of skin ageing genes: gene pleiotropy and genes on the chromosomal band 16q24.3 may drive skin ageing

Article Open access 30 July 2022

Jun Yan Ng & Fook Tim Chew

European and multi-ancestry genome-wide association meta-analysis of atopic dermatitis highlights importance of systemic immune regulation

Article Open access 04 October 2023

Ashley Budu-Aggrey, Anna Kilanowski, … Lavinia Paternoster

Introduction

Human-associated microbial communities show individual-specific variation shaped by a multitude of factors^1,2. For skin in particular, the bacterial community composition is strongly influenced by host characteristics, such as skin microenvironment, sex, age and body mass index (BMI), and to a lesser extent by lifestyle and environmental expositions³. The genetic influence of the host on skin microbiome composition and diversity was suggested by findings indicating heritability of up to 56.4% for single taxonomic branches of skin commensals in twins⁴. Furthermore, host genetics and skin microbiota interactions haven been suggested by studies including targeted genes⁴ and in the context of inflammatory diseases, such as atopic dermatitis⁵. Nevertheless, the influence of host genetics on the skin microbiome is largely understudied and no dedicated genome-wide association study (GWAS) of host genetics and the bacterial community inhabiting the skin has been performed so far. This strongly contrasts with what is known about the human gut microbiota, where a variety of associated genomic loci and pathways has been identified by large GWAS^6,7. Together, these gut microbiome-based GWAS have not only suggested how human molecular mechanisms modulate the microbiome but also indicated the consequences of such modulation to the host health and disease.

Therefore, we aimed to study the effects of genetics on skin microbiota across skin microenvironments through meta-analyses of GWAS of two German cohorts. To investigate the putative influence of the skin microbiota in skin diseases we applied Mendelian randomization (MR) analysis. Finally, putative effects of the skin microbiome members on the expression of candidate genes identified by GWAS were tested using normal human epidermal keratinocytes cultured with the common skin bacterium Staphylococcus epidermidis.

Results and discussion

A total of 1656 skin samples from participants of two cross-sectional, population-based German cohorts, KORA FF4 (n_Individuals = 324) and PopGen (n_Individuals = 273)^8,9 were analyzed. Skin samples were taken from dry [dorsal and volar forearm (PopGen)], moist [antecubital fossa (KORA FF4 and PopGen)] and sebaceous [retroauricular fold (KORA FF4) and forehead (PopGen)] skin microenvironments (Fig. 1a–c, Supplementary Table 1). Microbial community profiles were obtained from sequencing of the V1-V2 regions from the 16 S ribosomal RNA (rRNA) gene (see Methods). Genome-wide association analyses were conducted on univariate relative abundances of individual bacteria (amplicon sequence variants; ASVs) and non-redundant taxonomic groups ranging from genus to phylum levels (79 in total; see Methods). Additionally, multivariate community composition (i.e., beta diversity as captured by Bray-Curtis dissimilarity) was analyzed for association with host genetic variation. The umbrella term “microbial feature” will henceforth be used in this article for all 80 analyzed input data.

**Fig. 1: Characteristics of KORA FF4 and PopGen cohorts.**

We tested the association of microbial features with variation in 4,685,714 human autosomal single nucleotide polymorphisms (SNP), accounting for main confounders of the skin microbiota (age, sex and BMI) and genetic background of study participants (see Methods)^3,7. Cohort-wise association results were combined in a meta-analysis framework according to skin microenvironment, justified by the observed similarity of the microbiota profiles of samples from the same microenvironment (Fig. 1d). To assure robustness of association results, only loci with genome-wide significance (P_Meta < 5 × 10⁻⁸) and with nominal significance in both cohorts (P < 0.05) were further considered (see Methods for details).

A total of 23 loci showed a genome-wide significant association with skin microbial features, of which 22 were linked to univariate features (Table 1 and Fig. 2a). However, none of these passed the strict threshold for study-wide significance (P < 6.3 × 10⁻¹⁰ for 80 features included in the analysis, see Methods). Most of the associations were found in moist skin microenvironment (n = 11), followed by dry (n = 7) and sebaceous (n = 5) (Fig. 2b). There was a tendency for a higher number of associations found in deeper taxonomic levels: the highest number of significant associations were found at the ASV level (n = 8), followed by genus level (n = 6; Fig. 2c). Of all microbial features deeper than family level, features within the genus Staphylococcus were associated with most loci (n = 5; Fig. 2d). Bayesian fine-mapping or linkage disequilibrium (LD) structure prioritized 462 genetic variants as potentially causal (Supplementary Data 1). A total of 30 genes were found of interest for containing potentially causal variants and/or because these variants were significantly associated with the gene expression in skin tissue from the GTEx portal¹⁰ (Table 1). Of these, 27 were protein coding genes, one an rRNA pseudogene and two long-non coding RNA (lncRNA) genes (Supplementary Data 2). Most of the protein coding genes were expressed in skin tissue and found expressed in different cell types in skin in datasets from previous studies (see Methods for details^11,12) (Fig. 3). In the next section, we will explore the genes of interest with functional roles related to the host-microbiome interface.

Table 1 Results summary of GWAS for microbial features

Full size table

**Fig. 3: Expression of human genes associated with the skin microbiome in public databases.**

Host functions associated with the human skin microbiota

Genetic variants associated with the skin bacteria were localized in genes related to pathogen sensing and regulation of response to pathogens. C1QBP (locus id: 22, lead variant rs2472614, P_Meta = 4.7 × 10⁻⁸, associated with ASV086 [Acinetobacter johnsonii]), for instance, encodes the complement component 1, q subcomponent binding protein (C1qBP, a.k.a. gC1q-R/p33) and is abundantly expressed in keratinocytes (Fig. 3¹²). C1qBP is an ubiquitous, multi-ligand, multifunctional and multicompartmental protein, which also acts as endothelial receptor to plasma proteins from the complement and kinin/kallikrein systems and is a marker for epithelial cell proliferation^13,14. C1qBP binds to microbial proteins¹⁵, including Staphylococcus aureus protein A¹⁶, and therefore, is suggested to play a role in both the response to and pathogenesis of microbes¹⁷. Additionally, DHX33, (locus id: 22, same locus containing gene C1QBP), and CARD8 (locus id: 23, rs6509364, P_Meta = 8.5 × 10⁻⁰⁹, associated with the Rhodobacteraceae family) encode proteins which regulate inflammasome activity, which in turn regulate innate immunity caspase 1 activation¹⁸. DHX33 activates the NLRP3 inflammasome after sensing cytosolic RNA derived from viruses, bacteria or achaea^19,20. CARD8 is structurally related to NLRP1, a sensor component of the NLRP1 inflammasome, and has been shown to activate caspase 1 activity in resting T cells and is a negative regulator of NLRP3 inflamasome^21,22. Together, these results suggest that innate immune components carrying out sensing and regulatory activities may be involved in shaping the human skin microbiota.

Associated genetic variants were also localized at genes HTT (locus id: 5, rs2159173, P_Meta = 1.8 × 10⁻⁰⁹, associated with ASV093 [Staphylococcus (uncl.)]) and CFAP54 (locus id: 16, rs12423627, P_Meta = 6.9 × 10⁻⁰⁹, associated with ASV002 [Staphylococcus (uncl.)]), which encode proteins required for cilia formation in mammalian cells^23,24 and expressed in different cell types in skin (Fig. 3). Further, we found SNPs that were associated with the expression of the transcript ENSG00000269886 (locus id: 3, rs2664121, P_Meta = 4.3 × 10⁻⁰⁹, associated with the genus Micrococcus). Interestingly, the effector alleles of all of these (rs2664121, rs2075337, rs2543492, rs1300250) were associated with the decrease in both tissue expression of ENSG00000269886 and relative abundance of the genus Micrococcus (the GTEx portal¹⁰ and Supplementary Data 1). ENSG00000269886 is an lncRNA antisense to the gene TTLL3, which regulates cilia assembly across eukaryotes^25,26. Skin cells do not have motile cilia. Thus, it is likely that these genes are related to primary cillium, an organelle at the cell surface that senses extracellular signals, such as chemo-mechanical signals, osmolarity, pH, oxygen and light²⁷. Primary cillium is found in various skin cells such as keratinocytes, fibroblasts, melanocytes and Langerhans cells²⁸. Its formation is influenced by the dynamics of the actin cytoskeleton²⁹, which is regulated by SRGAP3 encoded protein (locus id: 3)³⁰. Together, these results suggest that extracellular sensing through primary cilium may be involved in the regulation of the skin microbiota.

Additional associations were observed with SNPs located in genes involved in cellular differentiation and proliferation. These were RAF1 (locus id: 4, rs709165, P_Meta = 4.0 × 10⁻⁰⁸, associated with ASV006 [Staphylococcus hominis])^31,32,33 and RGS12 (locus id: 5, rs2159173)^34,35,36, the latter found abundantly expressed in keratinocytes¹² (Fig. 3). Furthermore, SNPS in PDGFRA (locus id: 6, rs55702239, P_Meta = 3.5 × 10⁻⁰⁸) were associated with order Bacteroidales and genus Bacteroides. PDGFRA is abundantly expressed in fibroblasts¹² (Fig. 3) and participates in cellular maintenance³⁷ and extracellular matrix production³⁸. Keratinocyte proliferation, differentiation and function as well as innate immune signaling are major forces contributing to the complex function of the skin barrier. Therefore, it is conceivable that the discovered GWAS associations may represent links between the skin barrier and members of the skin microbiota.

Expression of candidate genes by keratinocytes co-cultured with Staphylococcus epidermidis

To gain insights in the putative participation of the identified candidate genes in the molecular interaction with the skin bacteria, we analyzed the in vitro transcriptional profile of normal human epidermal keratinocytes co-cultured with S. epidermidis, an abundant commensal in human skin³⁹. Transcriptional profiles (six replicates) of keratinocytes from the foreskin of a 0-year-old male donor co-cultured with the S. epidermidis ATCC 14990 strain clearly differed from the profiles of controls, keratinocytes that were not co-cultured with bacteria (Fig. 4a). The S. epidermidis ATCC 14990 strain is a well characterized laboratory strain which is close to the strains found in the skin of the participants of the two cohorts studied. This proximity is suggested by the observation of 100% overlap and identity with the full length of ASV002 [Staphylococcus (uncl.)] amplicon sequence (307 base pairs), the second most abundant ASV in the whole database (~10% of rarefied sequences) and the most abundant ASV assigned to Staphylococcus genus.

**Fig. 4: GWAS genes expressed by keratinocytes co-cultured with *Staphylococcus epidermidis*.**

A total of 4134 genes were differentially regulated (Supplementary Data 3), suggesting a strong transcriptional response of human keratinocytes to S. epidermidis ATCC 14990 strain in vitro. According to pathway enrichment analysis (Supplementary Data 4), the most significant biological processes upregulated were related to immune response, including cytokine-mediated and innate immune responses, as well as response to virus and symbionts (Fig. 4b). On the other hand, ribosomal biogenesis, processing of ribosomal RNA and non-coding RNA were among the most significantly down regulated biological processes. In this scenario, a quarter of the candidate genes (n = 7) were differentially expressed (q < 0.05 and absolute log2 fold change >1) when comparing cultures with and without S. epidermidis ATCC 14990 (Fig. 4c).

Based on knockout mouse macrophage cells, the deficiency of C1QBP protein increases the DNA sensor cyclic GMP-AMP (cGAMP) synthase-induced innate immune response⁴⁰. Here, we observed the downregulation of C1QBP transcription associated with the upregulation of genes belonging to innate immune response (Fig. 4b, c), which sides with our GWAS suggestion that this gene may play a role in the regulation of skin bacteria via innate immunity. On the other hand, DHX33 was downregulated, contrasting to its role in innate immunity via activation of NLRP3, which transcript was upregulated (Fig. 4c and see Supplementary Data 3). It is thus likely that the reduced expression of DHX33 in our assays may be associated with the role of DHX33 in rRNA synthesis via positive regulation of transcription by RNA polymerase I⁴¹, being both pathways downregulated (Fig. 4b, c, Supplementary Data 4).

Genes coding for SRGAP3 and TTLL3, of which the lncRNA antisense gene was implied by GWAS, were upregulated (Fig. 4c and Supplementary Data 3). These observations support our discovered association of primary cilium and skin bacteria. However, it is important to bear in mind that the encoded proteins are not exclusively related to primary cilium, and their expression in our assays may also be related to other structures, e.g., cytoskeleton in the case of SRGAP3³⁰, and processes, e.g., proliferation in the case of TTLL3²⁶. Finally, the know role of PDGFRA in fibroblast activity are not directly translated to keratinocytes^37,38. Therefore, the consequences of S. epidermidis-induced in vitro upregulation of PDGFRA in keratinocytes remain to be investigated.

Our in vitro experiment is explorative in nature and is limited to its reductionist approach: it consists of two-dimensional co-cultures of isolated keratinocytes and a single S. epidermidis laboratory strain. It is well known that the immunomodulatory effects of S. epidermidis depend on the specific strain, and that there is a large S. epidermidis strain level variation. Thus, it is not possible to directly extrapolate our preliminary functional results to an eventual keratinocyte response to skin commensals in vivo. A panel of commensal strains as well as in vitro models closer to the skin physiology, such as three-dimensional human skin models⁴², are necessary to uncover the functional dynamics of the host-commensal cellular interactions. Nevertheless, our assays allowed for the observation of the transcriptional regulation of several GWAS selected genes, being a starting point for functional investigations of the roles of these genes in the interaction with the skin microbiota.

Influence of skin microbiota on non-infectious skin diseases

Summary statistics of univariate microbial features were used as exposures in 2-sample mendelian randomization (MR; see Methods) to assess their influence on non-infectious skin diseases. A total of eight comparisons passed the per-trait suggestive threshold (q_(trait) value <0.05, Fig. 5), although no comparison passed the global threshold (q_(global) value <0.05; Supplementary Data 5). MR results indicated the influence of staphylococci in dermatitis/eczema (Staphylococcus genus, β = 1.5 × 10⁻⁰³), and further, modulating roles of Flavobacteriaceae in two allergy-related traits with microenvironment-specific effect direction (β_Moist = 8.8 × 10⁻⁰⁴; β_Dry = 1.1 ×10⁻⁰³). Additional results suggested involvement of Staphylococcus ASVs in psoriasis (ASV012 [Staphylococcus hominis], β = 4.0 × 10⁻⁰⁴), seborrhoeic keratosis (ASV010 [Staphylococcus (uncl.)], β = 7.2 × 10⁻⁰⁴) and vitiligo (ASV012 [Staphylococcus hominis], β = 1.2 × 10⁻⁰⁴). Potential protective effects of staphylococci in allergic rhinitis were also suggested (ASV012 [Staphylococcus hominis], β = −1.1 × 10⁻⁰³). It is noteworthy that these are likely coagulase-negative staphylococci, which are typical members of the skin microbiota⁴³. However, the ASV-level signals from the MR were only weakly or inconclusively supported by the sensitivity analysis (see Methods, Supplementary Data 5). Together, our findings suggest that members of the microbiota may modulate the health-disease balance in skin.

**Fig. 5: Results from Mendelian Randomization analysis.**

In summary, we conducted the first genome-wide association analysis dedicated to the human skin microbiota and identified 23 genome-wide significant loci. The combination of samples from different skin microenvironments of participants from two independent German cohorts allowed for robust results, despite the rather small number of included participants. The candidate genes have functions related to innate immune signaling, environmental sensing, cell differentiation, proliferation and fibroblast activity. Keratinocyte cultures challenged with a laboratory strain of S. epidermidis indicated regulation of seven candidate genes identified by GWAS, providing preliminary evidence that GWAS selected genes may be transcriptionally regulated by skin commensals. MR analysis further supported that specific skin microbiota features might have causal roles in the development of atopic dermatitis, but also suggested modulation of other non-infections skin diseases.

It needs to be considered that, despite our efforts to integrate information from different molecular levels and databases to understand the exact mechanisms by which the variants influence candidate gene function(s) and or expression and how this influences the skin microbiome, further and advanced experiments are needed. Likewise, it would be important to systematically establish differences in cutaneous gene expression with skin type, skin physiology and across age groups. Nevertheless, our results suggest a close interaction of the host genetic makeup and associated skin microbiomes. Furthermore, our findings point to the skin microbiota as a target for disease prevention and management, with potential for the development of personalized treatments for non-infectious, inflammatory skin conditions.

Methods

Cohorts’ description, genotyping, imputation and harmonization

PopGen cohort participants were randomly recruited via the local population registry in Kiel, Germany, and as blood donors of the University Hospital Schleswig-Holstein, Campus Kiel⁹. Genotypes derived from the Affymetrix Genome-Wide Human single nucleotide polymorphism (SNP) Array 6.0 were quality controlled following a previously established protocol⁴⁴ and using the IKMB GWAS Quality Control Pipeline (https://github.com/ikmb/gwas-qc). Briefly, variants with excess missing data (>2%) and/or that deviated from Hardy-Weinberg Equilibrium [HWE, False Discovery Rate (FDR⁴⁵) P value <10⁻⁵] were excluded. Samples with high missing data (>2%), high overall increased/decreased heterozygosity rates (i.e., ±5 standard deviation from the sample mean) and related individuals with a PLINK⁴⁶ PI_HAT score >0.1875 were removed. To assess population structure, we performed a principal components analysis (PCA) including individuals of the 1000 Genomes Phase3 ref. 47 and removed outlier individuals not matching a European ancestry. Imputation was performed with the Michigan Imputation Server⁴⁸ (Reference Panel: HRCr1.1 2016 (GRCh37/hg19); Array Build: GRCh37/hg19; rsq filter: off; Phasing Eagle 2.4 (phased output); Population: EUR; Mode: Quality Control & Imputation) and was followed by removal of monomorphic variants. These steps were performed following the miQTL cookbook instructions (https://github.com/alexa-kur/miQTL_cookbook#chapter-2-genotype-imputation).

KORA FF4 cohort participants from the youngest age group (39-48 years) that were previously genotyped as part of KORA S4 Survey were recruited from the southern German city of Augsburg and its two surrounding counties⁸. Genotyping and genotyping imputation were performed by the KORA Study Center. Briefly, genotypes were derived from the Affymetrix Genome-Wide Human SNP Array 6.0 (KORA F4). Samples with missing data (>3%), mismatch with phenotypic and genetic gender and high heterozygosity rates (i.e., ±5 standard deviation from the sample mean) were removed. Samples were also checked for European ancestry, population outliers and compared with other existing genotype data of the same individual within the KORA cohort. Variants with excess missing data (>2%), deviating from HWE (P value <5 × 10⁻¹⁰) and Minor allele frequency (MAF) (<2%) were removed. Prephasing was done with SHAPEIT v2⁴⁹ and imputation with IMPUTE v2.3⁵⁰ (reference panel: 1000 Genomes Phase 3 integrated variant set release in NCBI build 37).

To harmonize both genotype datasets, resulting VCF (PopGen) and IMPUTE output (KORA FF4) files were converted to PLINK format using PLINK v1.9⁴⁶. Participants that had their skin microbiota profiled (see section below) were selected and variants with MAF < 5% were removed. Genotype Harmonizer v 1.4.23 was used to update the KORA FF4 allele reference based on the PopGen data. SNPs with missingness >10% and non-biallelic SNPs were removed from PopGen data using PLINK v2.0-alpha-avx2-20200217. PopGen SNPs were references to set alleles in KORA F4 data, which also underwent removal of variants with missingness <10% and non-biallelic SNPs. Data sets were merged into PLINK files using PLINK v1.9 which contained SNPs available in both cohorts. Lastly, a principal component analysis (PCA) was produced with PLINK v1.9 to summarize the genetic population structure.

Written informed consent was obtained from all study participants. All protocols were approved by the ethics committees of the Medical Faculty of Kiel University (PopGen) and of the Bavarian Medical Association (KORA). We have complied with all relevant ethical regulations.

Sampling collection and microbial profiling

Skin microbiota was sampled as described previously³. Briefly, skin swabs were taken with Catch-All Sample Collection Swab (Epicentre Biotechnologies, Madison, WIS) soaked in specimen collection fluid (SCF-1) from 4 cm² area of the skin site. Skin sites were selected to represent moist skin (antecubital fossa in both PopGen and KORA FF4), sebaceous skin (retroauricular fold in KORA FF4 and forehead in PopGen), and dry skin (volar and dorsal forearm in PopGen). Skin swabs were stored at -80 °C until DNA extraction using the QIAamp UCP Pathogen Mini Kit on an automated QIAcube system (QIAGEN GmbH, Hilden, Germany) for PopGen and the PowerSoil DNA Isolation Kit (MoBio Laboratories, Carlsbad, CA) for KORA FF4.

Bacterial profiles were based on the V1 and V2 variable regions of the gene coding for 16 S ribosomal RNA (rRNA). Briefly, V1-V2 regions were amplified with PCR performed with the primer pair 27F-338R. Pooled amplicon libraries were sequenced with MiSeq Reagent Kit v3 on the Illumina MiSeq (Illumina Inc., San Diego, CA). Sequencing reads were processed with DADA2 v1.10⁵¹, resulting in an amplicon sequence variant (ASV) table, which records the number of times each exact ASV was observed in each sample⁵². ASV is a finer scale analogue of the operational taxonomic unit (OTU), which resolves the sequenced region variant down to a single-nucleotide difference level. ASVs were taxonomically classified down to genus level using RDP classifier algorithm based on Ribosomal Database Project (RDP) version 16 release with 50% confidence^53,54. Species-level annotations were added to ASV sequences based on exact matches to the RDP database, using the function addSpecies() from DADA2 R package. Species-level abundances were not considered in the GWAS, as these are likely incomplete and possibly inaccurate⁵⁵, however annotations can still serve as proxies for sub-genus level placement of ASVs. Therefore, their species-level annotations were carried as part of the ASV annotation throughout the manuscript using square brackets, i.e. ASV001 [Propionibacterium acnes] or ASV001 [P. acnes]. Finally, sequences were filtered to remove chloroplasts, mitochondria and low abundant ASVs (less than 0.1% of total sequence counts of a given skin site). Samples were removed if taken from a site with apparent skin abnormality or in which corticosteroids or antibiotics were applied in the last seven days before collection. Microbiota data was manipulated in R 3.6.2 using the Phyloseq package v1.34.0^56,57. Details on sequencing, read processing and ASV filtering are provided in our previous study with the same dataset³. Finally, only samples from participants with genotype data were kept for downstream analysis.

Association of microbial features with host SNPs

The association of SNPs was tested for multivariate (for inferences on the bacterial community; beta diversity), and univariate microbial features (for inferences on individual bacterial clades). Beta diversity was inferred from Bray-Curtis dissimilarities of rarefied amplicon variant (ASV) table (5,000 sequences per sample), calculated in R version 3.6.2 using the Vegan package v2.5-5⁵⁸. Bacterial clades included ASVs and taxonomic groups ranging from genus to phylum. Taxonomic groups were obtained by merging the ASV sequence counts that had the same taxonomy at a certain rank, using the Phyloseq function tax_glom(). For each skin site, univariate features with a median sequence count higher than 50 and that were present in more than 100 participants were kept. In addition, univariate features were kept only if present in both sites of the same skin microenvironment, i.e., moist, sebaceous or dry. This effort resulted in 103 bacterial clades. To avoid redundancy, these clades were clustered together based on a 0.985 Spearman correlation cut-off. Clustering of clades were performed in each skin microenvironment separately because skin microenvironments have distinct bacterial profiles³. This effort resulted in a total of 79 bacterial features to be tested: 3 phyla, 4 classes, 7 order, 7 families, 15 genera and 43 ASVs.

Statistical tests were conducted for each microbial feature in each skin site from a single cohort following the framework established previously⁷. Because this process generates subsets of the whole data, additional variant inclusion criteria were implemented when necessary prior association tests. Accordingly, genetic variants were filtered (MAF > 5%) and coded into numeric features (0 = homozygous for reference allele; 1 = heterozygous; and 2 = homozygous for alternative allele). Only non-monomorphic variants were considered for testing. All tests were performed on the alternative allele as effect allele.

For tests with multivariate microbial features, distance-based redundancy analysis was performed with the vegan function capscale() with age, sex, BMI and the first ten genetic principal components (PCs) as covariates. The variables were selected because they were found as main confounders of the skin microbiota³ and to account for the influence of the genetic background. The variance left unexplained by these covariates was extracted using the R residuals() function. The effect of genetic variants was estimated from the residual matrix with a distance-based F-test using moment matching⁵⁹. For tests with univariate microbial features, zero-truncated non-rarefied count abundances were used. Outliers were filtered based on rarefied counts to account for uneven sequencing depths between samples. Samples were considered outliers when they deviated more than 5× the interquartile range (IQR) from the median abundance. Finally, count abundances (non-rarified) were fit with the Mvabund v4.1.6⁶⁰ function manyglm() in generalized linear models with negative binomial distributions and the covariates above mentioned as predictors. The logarithm of the total sequence counts of each sample was used as offset. Unexplained variance was extracted using residuals() function, which extracts from manyglm() models residuals that are normal⁶¹. The effect of genetic variants was estimated from the residuals using linear model. P value was calculated using the R summary function, which performs a two-sided t-test.

Microenvironment-wise meta-analysis

Genomic inflation (λ_GC) was calculated for all tests using the regression method as implemented in GenABEL v1.8-0 R package⁶². All values were below 1.02, indicating no genomic inflation. Because skin microbiota profiles are distinctive between microenvironments³, meta-analyses were performed combining data sets that originated from skin sites of the same microenvironment. Therefore, results from moist skin sites were merged into one meta-analysis and results from sebaceous skin sites into another, because skin sites from these microenvironments are from different cohorts. Because the distance-based F-test applied to multivariate features do not produce beta values, a fixed effect meta-analysis was performed with METAL release 2011-03-25⁶³, with meta-analysis P values (P_Meta) and sample size based weighting. For univariate features, an inverse-variance weighted fixed effect meta-analysis was performed with METASOFT v2⁶⁴ on beta values and their standard errors. Meta-analysis results were reported significant if genome-wide significance (P_Meta < 5 × 10⁻⁸) was achieved and the association was found nominally significant in the two skin sites (P < 0.05). Because samples from the two dry skin sites came from a single cohort (PopGen), results were combined and considered significant if the P value of at least one skin site was genome-wide significant (P < 5 × 10⁻⁸) with at least nominal significance at the other skin site (P < 0.05). In this case, the lowest P value was reported as the P_Meta value. The study-wide significance threshold was calculated considering the number of microbiota features tested (P_Meta < 5 × 10⁻⁸/80 = 6.3 × 10⁻¹⁰).

Fine-mapping and gene prioritization

Genes were considered of interest when containing potentially causal variants and/or these variants were significantly associated with the gene expression in skin tissue. Fine-mapping was performed to explore the most likely causal set of variants using shotgun stochastic search algorithm implemented in FINEMAP v1.4⁶⁵. For moist and sebaceous microenvironments, fine-mapping was performed with summary statistics (beta values and their standard errors) from meta-analysis. For dry microenvironment, beta values and their standard errors from volar forearm were used for fine-mapping. Genes were reported when intersecting with the range of the 95% posterior credible SNP set assuming one causal variant as input parameter for the algorithm. If fine-mapping did not find a credible set (<50 variants), or for beta-diversity results, genes with variants with LD > 0.6 to the lead SNP were reported. SNPs and genes were annotated using the R package biomaRt v2.48.0⁶⁶.

To investigate whether genetic variants could affect gene expression in skin tissues, prioritized variants selected by fine-mapping or in LD > 0.6 were mapped to Genotype-Tissue Expression (GTEx) Project¹⁰ database v8 (lower leg and suprapubic skin tissues). Briefly, chromosomal positions in genome assembly hg19 (GRCh37) were converted to hg38 (GRCh38) using LiftOver from the human genome browser at the University of California Santa Cruz (UCSC)⁶⁷. These positions were then mapped to single-tissue cis-quantitative trait locus (QTL) data downloaded (11/06/2021) from the GTEx portal¹⁰, specifically the file GTEx_Analysis_v8_eQTL.tar, which contains genes of which expression are significantly associated with genetic variants based on permutations. Only data from skin tissues (suprapubic non-sun-exposed and lower leg sun-exposed) were used in this analysis.

Expression of genes in skin tissues and cell types

Consensus transcriptional expression of genes in skin tissue were retrieved from the Human Protein Atlas version 20.1¹¹, which additionally includes data sets from the GTEx¹⁰ and the Functional annotation of the mammalian genome (FANTOM5)⁶⁸ projects. Single-cell RNA-Seq data of skin from healthy individuals (n = 5) were retrieved from a recent study by Solé-Boldo et al.¹².

Mendelian randomization

Mendelian randomization (MR) was performed using summary statistics of univariate association analyses as ‘exposures’ and six selected skin-related traits as ‘outcomes’ (allergy/hypersensitivity/anaphylaxis, seborrheic keratosis, eczema/dermatitis, hay fever/allergic rhinitis, psoriasis, vitiligo). Outcome summary statistics were retrieved from UK Biobank using the R package TwoSampleMR v0.5.5⁶⁹. UK Biobank originated from the IEU Open GWAS Project database⁷⁰. All variants in the microbial ‘exposures’ with an association P value <10^-5 were included in the analyses. After harmonization with the exposure data, only independent variants were retained using the clump_data() function with default parameters. Additionally, variants with an F statistic <10 were excluded from the analysis to avoid weak instrument bias⁷¹. In case of more than two independent retained variants, inverse variance weighted (IVW) MR analysis was performed as primary analysis, otherwise Wald-ratio was calculated. For exposures with more than two instrument variables, weighted mean, and MR Egger regression were performed for sensitivity analysis. MR Egger regression with non-significant beta values (P > 0.05) and weighted median MR results with significant (P < 0.1) and concordant effect direction to the IVW MR analysis were regarded as supporting. P values of the primary MR analysis (IVW or Wald-ratio) were corrected for multiple testing using per-trait and global FDR correction³⁶. All MR analyses were conducted in R v3.6.1.

Keratinocytes co-culture with Staphylococcus epidermidis

Normal human epidermal keratinocytes (NHEKs) (foreskin of a 0-year-old male Caucasian donor; Promocell, Heidelberg, Germany, Lot number 407Z001) were cultured in Keratinocyte Growth Medium (KGM; Lonza Biosciences, Walkersville, USA) + supplements + CaCl₂ + penicillin/streptomycin at 37 °C and 5% CO₂. Cells were used at passages 4-6. Keratinocytes were seeded into 6-well plates and grown until confluency. Staphylococcus epidermidis (Winslow and Winslow) Evans (ATCC 14990) was cultured in Tryptic Soy Broth (TSB; Thermo Fischer, Waltham, USA) medium at 37 °C. For co-cultivation with keratinocytes, bacteria were centrifuged, resuspended in KGM + CaCl₂, and added to the keratinocytes at an optic density (OD) of 0.1 in KGM + CaCl₂. A total of six replicates of each condition, with and without the addition of S. epidermidis, were performed, two replicates per weekly batch. Plates were centrifuged at 350 x g for 5 min to allow bacteria to settle on the bottom. After 3 h incubation, plates were washed, and KGM + CaCl₂ with gentamycin was added for a further incubation of 23 h at 37 °C and 5% CO₂. Plates were washed twice before RNA isolation using Trizol (Thermo Fischer, Waltham, USA) as per manufacturer’s instruction.

Sequencing libraries were prepared using TrueSeq Stranded mRNA kit (Illumina Inc., San Diego, USA). Sequencing was performed on the Illumina NovaSeq 6000 platform (Illumina Inc., San Diego, USA) with 2 × 50 base pairs length. Raw sequences were processed using the nf-core/rnaseq pipeline v3.0^72,73, which includes adapter quality trimming with Trim Galore (https://github.com/FelixKrueger/TrimGalore), removal of ribosomal RNA with SortMeRNA⁷⁴, alignment with STAR⁷⁵ and transcript quantification with Salmon⁷⁶. The human genome assembly hg19 was used as reference. Differentially expressed genes were detected with the R package DESeq2 v.1.30.0⁷⁷. Wald test was performed with negative binomial generalized linear models, which included the weekly batch and whether S. epidermidis was added to the culture or not (~batch + condition). P were corrected for multiple testing using the FDR method⁴⁵. Approximate posterior estimation for generalized linear model (apeglm⁷⁸) shrinkage was applied to logarithmic (log2) fold change (LFC). Results were considered significant based on q values (<0.05) and LFC (absolute LFC > 1). Enrichment of expressed genes up and down regulated were performed using the R package enrichR v3.0 and the GO_Biological_Process_2021 database⁷⁹. Enriched pathways were considered significant based on q values (<0.05; Fisher exact test). To get an overview of the effect of the S. epidermidis addition to keratinocyte cultures, transcriptional profiles were visualized through principal component analysis (PCA). First, variance stabilizing transformation (VST) from the R package DESeq2 v.1.30.0⁷⁷ was applied to the transcriptional data. PCA was performed as implemented in the R package PCAtools v. 2.4.0⁸⁰.

Reporting summary

Further information on research design is available in the Nature Research Reporting Summary linked to this article.

Data availability

Raw 16 S rRNA gene amplicon sequences of PopGen participants were deposited at the European nucleotide archive (ENA) under accession code PRJEB41215. GWAS summary statistics generated in this study are available at GWAS catalogue under accession codes GCST90133164-GCST90133313. Phenotype data from PopGen individuals can be accessed through the Material Data Access Form from the PopGen Biobank (Schleswig-Holstein, Germany). Information about the Material Data Access Form and how to apply can be found at http://www.uksh.de/p2n/Information+for+Researchers.html. KORA data are available at https://www.helmholtz-munich.de/en/kora/for-scientists/cooperation-with-kora/index.html upon request by means of a project agreement. In addition, the following public database and resources were used: 1000 Genomes Phase3 ref. 47, Ribosomal Database Project (RDP) version 16⁵⁴, Genotype-Tissue Expression (GTEx) Project database v8¹⁰, Functional annotation of the mammalian genome (FANTOM5)⁶⁸, Skin single-cell data from by Solé-Boldo et al.¹², UK Biobank and the IEU Open GWAS Project database⁷⁰.

Code availability

Code used in the analysis are available at https://github.com/LucasMS/skin.mgwas.pub⁸¹.

References

Zhernakova, A. et al. Population-based metagenomics analysis reveals markers for gut microbiome composition and diversity. Science 352, 565–569 (2016).
Article ADS CAS PubMed PubMed Central Google Scholar
Falony, G. et al. Population-level analysis of gut microbiome variation. Science 352, 560–564 (2016).
Article ADS CAS PubMed Google Scholar
Moitinho-Silva, L. et al. Host traits, lifestyle and environment are associated with the human skin bacteria. Br J Dermatol, https://doi.org/10.1111/bjd.20072 (2021).
Si, J., Lee, S., Park, J. M., Sung, J. & Ko, G. Genetic associations and shared environmental effects on the skin microbiome of Korean twins. BMC Genomics 16, 992 (2015).
Article PubMed PubMed Central Google Scholar
Baurecht, H. et al. Epidermal lipid composition, barrier integrity, and eczematous inflammation are associated with skin microbiome configuration. J. Allergy Clin. Immunol. 141, 1668–1676.e1616 (2018).
Article CAS PubMed Google Scholar
Kurilshikov, A. et al. Large-scale association analyses identify host factors influencing human gut microbiome composition. Nat. Genet 53, 156–165 (2021).
Article CAS PubMed PubMed Central Google Scholar
Rühlemann, M. C. et al. Genome-wide association study in 8,956 German individuals identifies influence of ABO histo-blood groups on gut microbiome. Nat. Genet 53, 147–155 (2021).
Article PubMed Google Scholar
Holle, R., Happich, M., Löwel, H., Wichmann, H. & Group, M. K. S. KORA - a research platform for population based health research. Das. Gesundheitswesen 67, 19–25 (2005).
Article Google Scholar
Nöthlings, U. & Krawczak, M. PopGen. Bundesgesundheitsblatt - Gesundheitsforschung - Gesundheitsschutz 55, 831–835 (2012).
Article PubMed Google Scholar
GTEx Consortium. The Genotype-Tissue Expression (GTEx) project. Nat. Genet 45, 580–585 (2013).
Article Google Scholar
Uhlen, M. et al. Proteomics. Tissue-based map of the human proteome. Science 347, 1260419 (2015).
Article PubMed Google Scholar
Solé-Boldo, L. et al. Single-cell transcriptomes of the human skin reveal age-related loss of fibroblast priming. Commun. Biol. 3, 188 (2020).
Article PubMed PubMed Central Google Scholar
Dembitzer, F. R. et al. gC1qR expression in normal and pathologic human tissues: differential expression in tissues of epithelial and mesenchymal origin. J. Histochem Cytochem 60, 467–474 (2012).
Article CAS PubMed PubMed Central Google Scholar
Ghebrehiwet, B. & Peerschke, E. I. cC1q-R (calreticulin) and gC1q-R/p33: ubiquitously expressed multi-ligand binding cellular proteins involved in inflammation and infection. Mol. Immunol. 41, 173–183 (2004).
Article CAS PubMed Google Scholar
Braun, L., Ghebrehiwet, B. & Cossart, P. gC1q-R/p32, a C1q-binding protein, is a receptor for the InlB invasion protein of Listeria monocytogenes. EMBO J. 19, 1458–1466 (2000).
Article CAS PubMed PubMed Central Google Scholar
Nguyen, T., Ghebrehiwet, B. & Peerschke, E. I. Staphylococcus aureus protein A recognizes platelet gC1qR/p33: a novel mechanism for staphylococcal interactions with platelets. Infect. Immun. 68, 2061–2068 (2000).
Article CAS PubMed PubMed Central Google Scholar
Peerschke, E. I. & Ghebrehiwet, B. The contribution of gC1qR/p33 in infection and inflammation. Immunobiology 212, 333–342 (2007).
Article CAS PubMed PubMed Central Google Scholar
Guo, H., Callaway, J. B. & Ting, J. P. Inflammasomes: mechanism of action, role in disease, and therapeutics. Nat. Med. 21, 677–687 (2015).
Article PubMed PubMed Central Google Scholar
Mitoma, H. et al. The DHX33 RNA helicase senses cytosolic RNA and activates the NLRP3 inflammasome. Immunity 39, 123–135 (2013).
Article CAS PubMed PubMed Central Google Scholar
Vierbuchen, T., Bang, C., Rosigkeit, H., Schmitz, R. A. & Heine, H. The human-associated archaeon methanosphaera stadtmanae is recognized through its RNA and Induces TLR8-dependent NLRP3 inflammasome activation. Front Immunol. 8, 1535 (2017).
Article PubMed PubMed Central Google Scholar
Linder, A. et al. CARD8 inflammasome activation triggers pyroptosis in human T cells. EMBO J. 39, e105071 (2020).
Article CAS PubMed PubMed Central Google Scholar
Ito, S., Hara, Y. & Kubota, T. CARD8 is a negative regulator for NLRP3 inflammasome, but mutant NLRP3 in cryopyrin-associated periodic syndromes escapes the restriction. Arthritis Res Ther. 16, R52 (2014).
Article PubMed PubMed Central Google Scholar
McKenzie, C. W. et al. CFAP54 is required for proper ciliary motility and assembly of the central pair apparatus in mice. Mol. Biol. Cell 26, 3140–3149 (2015).
Article CAS PubMed PubMed Central Google Scholar
Keryer, G. et al. Ciliogenesis is regulated by a huntingtin-HAP1-PCM1 pathway and is altered in Huntington disease. J. Clin. Invest 121, 4372–4382 (2011).
Article CAS PubMed PubMed Central Google Scholar
Wloga, D. et al. TTLL3 Is a tubulin glycine ligase that regulates the assembly of cilia. Dev. Cell 16, 867–876 (2009).
Article CAS PubMed Google Scholar
Rocha, C. et al. Tubulin glycylases are required for primary cilia, control of cell proliferation and tumor development in colon. EMBO J. 33, 2247–2260 (2014).
Article CAS PubMed PubMed Central Google Scholar
Anvarian, Z., Mykytyn, K., Mukhopadhyay, S., Pedersen, L. B. & Christensen, S. T. Cellular signalling by primary cilia in development, organ function and disease. Nat. Rev. Nephrol. 15, 199–219 (2019).
Article PubMed PubMed Central Google Scholar
Toriyama, M. & Ishii, K. J. Primary cilia in the skin: functions in immunity and therapeutic potential. Front. in Cell and Developmental Biol. 9, https://doi.org/10.3389/fcell.2021.621318 (2021).
Smith, C. E. L., Lake, A. V. R. & Johnson, C. A. Primary cilia, ciliogenesis and the actin cytoskeleton: a little less resorption, a little more actin please. Front Cell Dev. Biol. 8, 622822 (2020).
Article PubMed PubMed Central Google Scholar
Bacon, C., Endris, V. & Rappold, G. A. The cellular function of srGAP3 and its role in neuronal morphogenesis. Mech. Dev. 130, 391–395 (2013).
Article CAS PubMed Google Scholar
Chen, J., Fujii, K., Zhang, L., Roberts, T. & Fu, H. Raf-1 promotes cell survival by antagonizing apoptosis signal-regulating kinase 1 through a MEK-ERK independent mechanism. Proc. Natl Acad. Sci. USA 98, 7783–7788 (2001).
Article ADS CAS PubMed PubMed Central Google Scholar
Samuel, D. S. et al. Raf-1 activation stimulates proliferation and inhibits IGF-stimulated differentiation in L6A1 myoblasts. Horm. Metab. Res 31, 55–64 (1999).
Article CAS PubMed Google Scholar
Rubiolo, C. et al. A balance between Raf-1 and Fas expression sets the pace of erythroid differentiation. Blood 108, 152–159 (2006).
Article CAS PubMed Google Scholar
Schroer, A. B. et al. A role for Regulator of G protein Signaling-12 (RGS12) in the balance between myoblast proliferation and differentiation. PLoS One 14, e0216167 (2019).
Article CAS PubMed PubMed Central Google Scholar
Willard, M. D. et al. Selective role for RGS12 as a Ras/Raf/MEK scaffold in nerve growth factor-mediated differentiation. EMBO J. 26, 2029–2040 (2007).
Article CAS PubMed PubMed Central Google Scholar
Li, Z. et al. Regulator of G protein signaling protein 12 (Rgs12) controls mouse osteoblast differentiation via calcium channel/oscillation and galphai-ERK signaling. J. Bone Min. Res 34, 752–764 (2019).
Article CAS Google Scholar
Ivey, M. J., Kuwabara, J. T., Riggsbee, K. L. & Tallquist, M. D. Platelet-derived growth factor receptor-alpha is essential for cardiac fibroblast survival. Am. J. Physiol. Heart Circ. Physiol. 317, H330–H344 (2019).
Article CAS PubMed PubMed Central Google Scholar
Horikawa, S. et al. PDGFRalpha plays a crucial role in connective tissue remodeling. Sci. Rep. 5, 17948 (2015).
Article ADS CAS PubMed PubMed Central Google Scholar
Oh, J. et al. Biogeography and individuality shape function in the human skin metagenome. Nature 514, 59–64 (2014).
Article CAS PubMed PubMed Central Google Scholar
Song, K. et al. Leaked mitochondrial C1QBP inhibits activation of the DNA sensor cGAS. J. Immunol. 207, 2155–2166 (2021).
Article CAS PubMed Google Scholar
Zhang, Y., Forys, J. T., Miceli, A. P., Gwinn, A. S. & Weber, J. D. Identification of DHX33 as a mediator of rRNA synthesis and cell growth. Mol. Cell Biol. 31, 4676–4691 (2011).
Article CAS PubMed PubMed Central Google Scholar
Emmert, H., Rademacher, F., Glaser, R. & Harder, J. Skin microbiota analysis in human 3D skin models-“Free your mice”. Exp. Dermatol 29, 1133–1139 (2020).
Article PubMed Google Scholar
Becker, K., Heilmann, C. & Peters, G. Coagulase-negative staphylococci. Clin. Microbiol Rev. 27, 870–926 (2014).
Article PubMed PubMed Central Google Scholar
Severe Covid, G. G. et al. Genomewide association study of severe Covid-19 with respiratory failure. N. Engl. J. Med 383, 1522–1534 (2020).
Article Google Scholar
Benjamini, Y. & Hochberg, Y. Controlling the false discovery rate: a practical and powerful approach to multiple testing. J. R. Stat. Soc. Ser. B (Methodol.) 57, 289–300 (1995).
MathSciNet MATH Google Scholar
Purcell, S. et al. PLINK: a tool set for whole-genome association and population-based linkage analyses. Am. J. Hum. Genet 81, 559–575 (2007).
Article CAS PubMed PubMed Central Google Scholar
Genomes Project, C. et al. A global reference for human genetic variation. Nature 526, 68–74 (2015).
Article ADS Google Scholar
Das, S. et al. Next-generation genotype imputation service and methods. Nat. Genet 48, 1284–1287 (2016).
Article CAS PubMed PubMed Central Google Scholar
Delaneau, O., Marchini, J. & Zagury, J. F. A linear complexity phasing method for thousands of genomes. Nat. Methods 9, 179–181 (2011).
Article PubMed Google Scholar
Howie, B. N., Donnelly, P. & Marchini, J. A flexible and accurate genotype imputation method for the next generation of genome-wide association studies. PLoS Genet 5, e1000529 (2009).
Article PubMed PubMed Central Google Scholar
Callahan, B. J. et al. DADA2: High-resolution sample inference from Illumina amplicon data. Nat. Methods 13, 581 https://www.nature.com/articles/nmeth.3869#supplementary-information (2016).
Article CAS PubMed PubMed Central Google Scholar
Callahan, B. DADA2 Pipeline Tutorial (1.16), https://benjjneb.github.io/dada2/tutorial.html (2021).
Wang, Q., Garrity, G. M., Tiedje, J. M. & Cole, J. R. Naive Bayesian classifier for rapid assignment of rRNA sequences into the new bacterial taxonomy. Appl Environ. Microbiol 73, 5261–5267 (2007).
Article ADS CAS PubMed PubMed Central Google Scholar
Cole, J. R. et al. Ribosomal Database Project: data and tools for high throughput rRNA analysis. Nucleic Acids Res. 42, D633–642 (2014).
Article CAS PubMed Google Scholar
Johnson, J. S. et al. Evaluation of 16S rRNA gene sequencing for species and strain-level microbiome analysis. Nat. Commun. 10, 5029 (2019).
Article ADS PubMed PubMed Central Google Scholar
McMurdie, P. J. & Holmes, S. phyloseq: An R Package for Reproducible Interactive Analysis and Graphics of Microbiome Census Data. PLOS ONE 8, e61217 (2013).
Article ADS CAS PubMed PubMed Central Google Scholar
R Core Team. R: A Language and Environment for Statistical Computing. https://www.R-project.org/ (2020).
Oksanen, J. et al. vegan: Community Ecology Package. https://CRAN.R-project.org/package=vegan (2019).
Rühlemann, M. C. et al. Application of the distance-based F test in an mGWAS investigating β diversity of intestinal microbiota identifies variants in SLC9A8 (NHE8) and 3 other loci. Gut microbes 9, 68–75 (2018).
Article PubMed Google Scholar
Wang, Y., Naumann, U., Wright, S. T. & Warton, D. I. mvabund– an R package for model-based analysis of multivariate abundance data. Methods Ecol. Evolution 3, 471–474 (2012).
Article Google Scholar
Dunn, P. K. & Smyth, G. K. Randomized Quantile Residuals. J. Computational Graph. Stat. 5, 236–244 (1996).
Google Scholar
Aulchenko, Y. S., Ripke, S., Isaacs, A. & van Duijn, C. M. GenABEL: an R library for genome-wide association analysis. Bioinformatics 23, 1294–1296 (2007).
Article CAS PubMed Google Scholar
Willer, C. J., Li, Y. & Abecasis, G. R. METAL: fast and efficient meta-analysis of genomewide association scans. Bioinformatics 26, 2190–2191 (2010).
Article CAS PubMed PubMed Central Google Scholar
Han, B. & Eskin, E. Random-effects model aimed at discovering associations in meta-analysis of genome-wide association studies. Am. J. Hum. Genet 88, 586–598 (2011).
Article CAS PubMed PubMed Central Google Scholar
Benner, C. et al. FINEMAP: efficient variable selection using summary data from genome-wide association studies. Bioinformatics 32, 1493–1501 (2016).
Article CAS PubMed PubMed Central Google Scholar
Durinck, S. et al. BioMart and Bioconductor: a powerful link between biological databases and microarray data analysis. Bioinformatics 21, 3439–3440 (2005).
Article CAS PubMed Google Scholar
Kent, W. J. et al. The human genome browser at UCSC. Genome Res 12, 996–1006 (2002).
Article CAS PubMed PubMed Central Google Scholar
Lizio, M. et al. Gateways to the FANTOM5 promoter level mammalian expression atlas. Genome Biol. 16, 22 (2015).
Article CAS PubMed PubMed Central Google Scholar
Hemani, G. et al. The MR-Base platform supports systematic causal inference across the human phenome. Elife 7, https://doi.org/10.7554/eLife.34408 (2018).
Elsworth, B. et al. The MRC IEU OpenGWAS data infrastructure. bioRxiv, 2020.2008.2010.244293, https://doi.org/10.1101/2020.08.10.244293 (2020).
Stock, J. H., Wright, J. H. & Yogo, M. A Survey of Weak Instruments and Weak Identification in Generalized Method of Moments. J. Bus. Economic Stat. 20, 518–529 (2002).
Article MathSciNet Google Scholar
nf-core/rnaseq: nf-core/rnaseq v3.0 - Silver Shark v. 3.0 (Zenodo, 2020).
Ewels, P. A. et al. The nf-core framework for community-curated bioinformatics pipelines. Nat. Biotechnol. 38, 276–278 (2020).
Article CAS PubMed Google Scholar
Kopylova, E., Noe, L. & Touzet, H. SortMeRNA: fast and accurate filtering of ribosomal RNAs in metatranscriptomic data. Bioinformatics 28, 3211–3217 (2012).
Article CAS PubMed Google Scholar
Dobin, A. et al. STAR: ultrafast universal RNA-seq aligner. Bioinformatics 29, 15–21 (2013).
Article CAS PubMed Google Scholar
Patro, R., Duggal, G., Love, M. I., Irizarry, R. A. & Kingsford, C. Salmon provides fast and bias-aware quantification of transcript expression. Nat. Methods 14, 417–419 (2017).
Article CAS PubMed PubMed Central Google Scholar
Love, M. I., Huber, W. & Anders, S. Moderated estimation of fold change and dispersion for RNA-seq data with DESeq2. Genome Biol. 15, 550 (2014).
Article PubMed PubMed Central Google Scholar
Zhu, A., Ibrahim, J. G. & Love, M. I. Heavy-tailed prior distributions for sequence count data: removing the noise and preserving large differences. Bioinformatics 35, 2084–2092 (2018).
Article PubMed Central Google Scholar
Kuleshov, M. V. et al. Enrichr: a comprehensive gene set enrichment analysis web server 2016 update. Nucleic Acids Res 44, W90–97 (2016).
Article CAS PubMed PubMed Central Google Scholar
Blighe, K. & Lun A. PCAtools: Everything Principal Components Analysis. R package version 2.8.0. https://github.com/kevinblighe/PCAtools (2022).
Moitinho-Silva, L. Host genetic factors related to innate immunity, environmental sensing and cellular functions influence human skin microbiota, https://github.com/LucasMS/skin.mgwas.pub, https://doi.org/10.5281/zenodo.7047733. (2022).

Download references

Acknowledgements

We are grateful to all participants and study staff from the Biobank PopGen (Dr Gunar Jacobs and team) and the KORA Studienzentrum (Dr Margit Heier and team). We thank the staff from UKSH Dermatology laboratory (particularly, Anke Rose), the IKMB microbiome laboratory, the IKMB DNA laboratory and the IKMB sequencing laboratory for their excellent support. We are grateful to Dr Sören Franzenburg and Eike Matthias Wacker for assistance and troubleshooting. We thank Martin Schulzky for the design of skin site icons. The project leading to this application has received funding from the Deutsche Forschungsgemeinschaft (DFG) Grant no. WE2678/14-1 (granted to S.W.) and the Innovative Medicines Initiative 2 Joint Undertaking under Grant Agreement no. 821511 (BIOMAP, granted to S.W.). This Joint Undertaking receives support from the European Union’s Horizon 2020 research and innovation programme and EFPIA. This publication reflects only the author’s view and the JU is not responsible for any use that may be made of the information it contains. This work was also supported by the Deutsche Forschungsgemeinschaft (DFG) Collaborative Research Center 1182 ‘Origin and Function of Metaorganisms’ (grant no. SFB1182, Project A2 granted to A.F.). The study received infrastructure support from the DFG research unit “miTarget” (Projektnummer 426660215; EL 831/5-1 granted to A.F.). The KORA study was initiated and financed by the Helmholtz Zentrum München – German Research Center for Environmental Health, which is funded by the German Federal Ministry of Education and Research (BMBF) and by the State of Bavaria.

Funding

Open Access funding enabled and organized by Projekt DEAL.

Author information

These authors contributed equally: Andre Franke, Stephan Weidinger, Malte Christoph Rühlemann.

Authors and Affiliations

Institute of Clinical Molecular Biology, Kiel University, Kiel, Germany
Lucas Moitinho-Silva, Frauke Degenhardt, Simonas Juzenas, Florian Uellendahl-Werth, David Ellinghaus, Corinna Bang, Andre Franke & Malte Christoph Rühlemann
Department of Dermatology and Allergy, University Hospital Schleswig-Holstein, Kiel, Germany
Lucas Moitinho-Silva, Elke Rodriguez, Hila Emmert, Lena Möbus, Nicole Sander & Stephan Weidinger
Institute of Biotechnology, Life Science Centre, Vilnius University, Vilnius, Lithuania
Simonas Juzenas
Department for Epidemiology and Preventive Medicine, University of Regensburg, Regensburg, Germany
Hansjörg Baurecht
Biobank PopGen and Institute of Epidemiology, Kiel University, Kiel, Germany
Lukas Tittmann
Institute of Epidemiology, Kiel University, Kiel, Germany
Wolfgang Lieb
Institute of Epidemiology, Helmholtz Zentrum München – German Research Center for Environmental Health, Neuherberg, Germany
Christian Gieger & Annette Peters
Research Unit of Molecular Epidemiology, Helmholtz Zentrum München – German Research Center for Environmental Health, Neuherberg, Germany
Christian Gieger

Authors

Lucas Moitinho-Silva
View author publications
You can also search for this author in PubMed Google Scholar
Frauke Degenhardt
View author publications
You can also search for this author in PubMed Google Scholar
Elke Rodriguez
View author publications
You can also search for this author in PubMed Google Scholar
Hila Emmert
View author publications
You can also search for this author in PubMed Google Scholar
Simonas Juzenas
View author publications
You can also search for this author in PubMed Google Scholar
Lena Möbus
View author publications
You can also search for this author in PubMed Google Scholar
Florian Uellendahl-Werth
View author publications
You can also search for this author in PubMed Google Scholar
Nicole Sander
View author publications
You can also search for this author in PubMed Google Scholar
Hansjörg Baurecht
View author publications
You can also search for this author in PubMed Google Scholar
Lukas Tittmann
View author publications
You can also search for this author in PubMed Google Scholar
Wolfgang Lieb
View author publications
You can also search for this author in PubMed Google Scholar
Christian Gieger
View author publications
You can also search for this author in PubMed Google Scholar
Annette Peters
View author publications
You can also search for this author in PubMed Google Scholar
David Ellinghaus
View author publications
You can also search for this author in PubMed Google Scholar
Corinna Bang
View author publications
You can also search for this author in PubMed Google Scholar
Andre Franke
View author publications
You can also search for this author in PubMed Google Scholar
Stephan Weidinger
View author publications
You can also search for this author in PubMed Google Scholar
Malte Christoph Rühlemann
View author publications
You can also search for this author in PubMed Google Scholar

Contributions

Study was designed by L.M.S., F.K., E.R., H.E., F.U.W., D.E., A.F., S.W. and M.C.R. Sample, data collection and processing were done by E.R., H.E., L.T., W.L., C.G., A.P., C.B. Analysis was performed by L.M.S., F.D., H.E., S.J., L.M., F.U.W., D.E., N.S., H.B. and M.C.R. Manuscript was written by L.M.S., F.D., E.R., H.E., F.U.W., A.F., S.W. and M.C.R. All authors revised the manuscript.

Corresponding authors

Correspondence to Andre Franke or Stephan Weidinger.

Ethics declarations

Competing interests

The authors declare no competing interests.

Peer review

Peer review information

Nature Communications thanks the anonymous reviewers for their contribution to the peer review of this work. Peer reviewer reports are available.

Additional information

Publisher’s note Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Supplementary information

Supplementary Information

Peer Review File

Description of Additional Supplementary Files

Supplementary Data 1

Supplementary Data 2

Supplementary Data 3

Supplementary Data 4

Supplementary Data 5

Reporting Summary

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons license and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/.

Reprints and permissions

About this article

Cite this article

Moitinho-Silva, L., Degenhardt, F., Rodriguez, E. et al. Host genetic factors related to innate immunity, environmental sensing and cellular functions are associated with human skin microbiota. Nat Commun 13, 6204 (2022). https://doi.org/10.1038/s41467-022-33906-5

Download citation

Received: 10 November 2021
Accepted: 06 October 2022
Published: 19 October 2022
DOI: https://doi.org/10.1038/s41467-022-33906-5

This article is cited by

A genome-wide association study reveals the relationship between human genetic variation and the nasal microbiome
- Xiaomin Liu
- Xin Tong
- Tao Zhang
Communications Biology (2024)
Disease modification in inflammatory skin disorders: opportunities and challenges
- Thomas Bieber
Nature Reviews Drug Discovery (2023)

Comments

By submitting a comment you agree to abide by our Terms and Community Guidelines. If you find something abusive or that does not comply with our terms or guidelines please flag it as inappropriate.