Introduction

Human skin is a complex and dynamic ecosystem that is colonised by diverse bacteria, fungi, and viruses that together form the skin microbiota1. The composition and diversity of skin microbial communities is affected by multiple host and environmental factors, such as the topography and occlusion of sites, sex, age, and lifestyle2,3. Despite variations in the microbial communities across different niches, the microbiota contributes functional roles collectively with resident skin cells towards maintaining skin homeostasis. In particular, the resident microbiota promotes colonisation resistance that inhibits pathogenic organisms and regulates the immune system4,5,6,7. Bacterial communities of the host can undergo changes in species composition between health and disease states. For example, Staphylococcus aureus increases in atopic dermatitis (AD) reducing microbiome diversity8,9,10.

Metagenomic studies have confirmed the abundance of viruses on human skin, which can impact its health11,12. This insight has increased focus upon the viruses of bacteria and the contribution to the skin virome of these bacteriophages (phages). Since phages infect and replicate inside bacterial hosts, they were proposed to have important roles in modulating the skin bacterial communities and their population dynamics11,13, with a recent study indicating the potential contribution of skin phages in AD dysbiosis that proposes both temperate and lytic phages drive host population changes14. Phages with a lytic life cycle can infect skin colonising bacteria, potentially modulating the composition of the bacterial community; this is exploitable for therapeutic applications15,16. In contrast, temperate phages can integrate their viral genome into the bacterial host chromosome, and the resulting prophages can encode niche promoting activity, such as Sa3int phages of Staphylococcus aureus that are proposed to increase host colonisation, limiting their therapeutic usage17. Temperate phages comprise the major population of dsDNA phages sampled on skin and were found to be persistent11. Moreover, horizontal gene transfer by phage transduction promotes the evolution of a bacterial community through the acquisition of host accessory genes into genomes18.

Coagulase-negative staphylococci (CoNS) colonise human skin with differing abundances across the body depending on multiple factors, including sebaceous, moist and dry skin properties2. Staphylococcus epidermidis is the most frequently isolated CoNS and is considered a ubiquitous human skin resident19. Other frequently isolated commensal CoNS of skin include S. hominis, S. haemolyticus, S. lugdunensis, and S. capitis, with the latter abundant on the scalp11,20,21,22. Although CoNS form part of the healthy skin microbiota, these staphylococci have the potential to cause opportunistic invasive disease, ranging from indwelling device infections to neonatal septicaemia23,24,25.

Differences in the relative abundance of the commensal CoNS species within the skin microbiota could reflect their relative competitive fitness concomitant with ageing-related changes of the skin26,27. The range of bacterially produced factors that influence the abundances of staphylococci on skin are partially understood with studies showing the importance of antimicrobials and secreted antagonist proteins and peptides7,28,29. Beyond these antagonisms, improved understanding of microbiome dynamics and stability requires investigation of the effects of bacteria-phage interactions.

Consistent with the relative abundance of their hosts across different microenvironments, Staphylococcus phages are among the most abundant on skin30. Previous identification of skin phages used direct swabbing and purification of phages, and DNA extraction for metagenomic analysis11,31. Most staphylococcal phages identified from skin have a linear dsDNA genome and belong to the class Caudoviricetes (previously order Caudovirales comprising families: Siphoviridae, Podoviridae and Myoviridae)11,16. The majority of studies with staphylococcal phages have focused on S. aureus, particularly with respect to virulence genes and horizontal gene transfer32,33,34. The study of CoNS and their phages has received less attention despite their interaction likely being a critical contribution to formation and temporal stability of skin communities and their microenvironment. Recent studies of staphylococcal phages include the isolation of lytic phage Andhra35,36 and multiple phages infecting S. epidermidis37, hinting at the potential for unexplored diversity on skin.

This study focused on isolating and characterising CoNS species-specific phages from the skin of healthy volunteers. A total of 40 phages infecting major CoNS species were purified from skin swabs. The phages were assigned to six genetic clusters revealing two clusters are previously undescribed phages, of which we name one as Alsa phages. We characterised the genetics and biochemical properties of four isolated Alsa phages. The collection of 40 phages were studied for their host range across 140 strains of 8 staphylococci revealing infection differences. Our aim was to establish a basis for future investigations of how staphylococcal phages contribute to the temporal dynamics of skin communities in health and disease. The identification of an increased diversity of staphylococcal phages infecting different species colonising skin supports studies of skin diseases, such as atopic dermatitis and eczema, and their therapy.

Materials and methods

Ethics approval

The study was approved by Central University Research Ethics Committee C at University of Liverpool, United Kingdom (project reference number: 9895).

Bacterial culture

A collection of Staphylococcus species and strains collected by the Horsburgh lab group at the University of Liverpool was used in this study (Supplementary Table S1). A single colony of each strain was inoculated in Brain Heart Infusion (BHI) broth (Neogen) and incubated overnight at 37 °C whilst shaking.

Skin sample collection and treatment

Healthy volunteers (n = 80) were recruited to the study, and they used swabs to collect phages from their outer skin layer of different body sites. A total of 4 swabs were obtained from each participant. Swabbing was performed by soaking a sterile cotton swab (Copan) in sodium and magnesium salts (SM) buffer (100 mM NaCl, 4.8 mM MgSO4, 1 M Tris–HCl pH 7.5, 0.01% (w/v) gelatine) and rubbing the swab gently on the chosen skin site (~ 5 cm × 5 cm) for a period of one minute. The skin swab samples were incubated overnight in BHI medium at 37 °C to enrich the phage yield of the collected samples. Cultured samples were then centrifuged at 5000 rpm for 2 min to pellet bacterial cells and debris, and the supernatant was filtered through a syringe filter (0.45 μm, Star Lab) to obtain phage filtrate.

Bacteriophage isolation

An agar-based spot assay was performed using the cultured skin sample supernatants to detect the presence of phages. Following the overnight incubation of Staphylococcus indicator host strains, 200 μl of mid-log phase cultured host cells were mixed with 4 ml of soft agar (0.4% w/v), then poured onto 1.5% (w/v) BHI agar plates. Once the soft agar lawn solidified, a volume of 10 μl of phage filtrate was pipetted onto the bacterial lawn followed by overnight incubation of the plates at 37 °C. The presence of cleared growth zones in the indicator species lawn was used to pick plaques using sterile pipette tips to suspend in 500 μl SM buffer followed by filtering (0.45 μm).

Bacteriophage purification and propagation

Plaque assays were performed to purify the isolated phages. Ten-fold serial dilutions of each isolated phage were prepared in BHI media supplemented with 10 mM MgSO4 (BHI-MG). Diluted phages (100 μl) were mixed with an equal volume of mid-log phase cells of susceptible bacterial strain and incubated at 37 °C with shaking for 1 h. The mixture was then added to 4 ml soft agar (0.4% w/v), poured onto BHI agar plates (1.5% w/v), and left to set prior to incubating overnight at 37 °C. A single plaque was picked into 500 μl SM buffer and filtered (0.45 μm) to obtain phage lysate. Plaque assays were repeated three times with filtrate to ensure purified phages.

To propagate phages, mid-log phase cells of susceptible bacterial hosts (S. capitis 104, S. epidermidis BEN150, S. hominis LIV1218 and LIV1220, S. haemolyticus BEN145) were infected with the purified phage lysates in BHI-MG and incubated at 30 °C for 18 h. After incubation, each phage propagation was centrifuged at 5000 × g for 5 min, the phage lysate supernatant was filtered (0.45 μm) and stored at 4 °C. Plaque assays were used to determine titres of the propagated phages.

One-step growth curve

Bacteria for phage propagation were cultured in 10 ml BHI-MG to reach OD600 of 0.5 prior to addition of phage at a multiplicity of infection (MOI) of 0.1 followed by an incubation at 37 °C for 20 min to allow adsorption. Phage-adsorbed bacteria were collected by centrifugation at 8000 × g at 4 °C for 5 min. The supernatant was discarded, and cell pellets were re-suspended in 10 ml BHI broth and incubated at 37 °C for 90 min with agitation at 200 rpm. Aliquots of 200 μl were collected every 10 min and immediately serially diluted to measure phage titres using the soft agar overlay technique. The phage burst size was calculated by dividing the mean final titres by the mean initial phage titres.

Planktonic culture growth of hosts with phages

To assay the impact of phages on host growth, an overnight culture of host bacteria was sub-cultured in BHI medium to OD600 of 0.2. Phages were added at a MOI of 0.01, 0.1, and 1. Bacteria with no phage added were used as a negative control. The mixture was incubated at 37 °C for 22 h, and OD600 was measured hourly. The OD600 measurements of all samples were plotted using GraphPad Prism (v9.3.1).

Thermal and pH stability of phages

The stability of skin-isolated phages was assayed across a temperature range of -80 to + 70 °C. One millilitre of phage solution (~ 109 PFU mL−1) was aliquoted in 1 ml of Phosphate Buffer Saline (PBS) and incubated at selected temperatures for 1 h, with control samples incubated at 4 °C. PBS adjusted to a range of pH 4–9 was similarly used to incubate phage solution at 28 °C for 1 h, with samples incubated at pH 7 as controls. Phage titres were determined using the soft agar overlay technique with ten-fold dilutions spotted on a lawn of its susceptible host bacteria. Statistical analysis was performed using analysis of variance (ANOVA).

Host range determination

The host range of skin-isolated phages was assessed by testing each purified phage for its ability to infect 140 different strains belonging to 8 staphylococcal species (Supplementary Table S1). This assay was conducted using the spot assay after overnight incubation, when the presence of clearing zones were visually categorised according to their clarity into three groups: complete, turbid, or zero.

Efficiency of plating (EOP)

To validate the phage-host range data, an EOP assay was used with representative bacterial strains that showed complete or turbid clearing by spot assay (Supplementary Table S1) against all infecting skin-isolated phages. To perform the assay, phage lysates were serially diluted to 10–8 dilution, and 10 μl of each dilution was spotted on the prepared bacterial lawns of each of the bacterial strains tested. The plates were then incubated overnight at 37 °C, the number of PFUs produced by the susceptible bacteria was counted, and phage titre was determined. EOP values were determined by dividing mean phage titres of a tested strains by the phage titre of the original propagation host.

Phage adsorption assay

Exponential growth phase bacterial cells (~ 108 CFU mL-1) were infected with a volume of phage lysate to achieve MOI of 0.1 in a 10 ml BHI-MG. One millilitre of phage-host mixture was immediately collected upon phage inoculation to determine the initial phage titre. Phages were allowed to adsorb at 37 °C for 20 min and the same volume of the phage-host suspension was collected and centrifuged at 10,000 × g for 3 min. Phages mixed with BHI only were used as a negative control. The titres of the free phages in the supernatants were determined by plaque assay. The proportion of unadsorbed phages was calculated by following the formula: [residual titre/ initial titre] X 100%.

Sodium periodate treatment

To determine the contribution of bacterial surface receptors in phage adsorption, treatment of host cells with sodium acetate (CH3COONa) (VWR International) and sodium periodate (NaIO4) (Sigma) was performed following previously published protocols38, with minor modifications. Following overnight culture of each strain, 500 μl of culture was centrifuged at 5000 × g for 5 min. Bacterial pellets were washed with PBS, re-suspended in 50 mM of NaIO4 in 50 mM of CH3COONa (pH 5.2), and incubated at room temperature in the dark for 1 h. Treated cells were centrifuged (10,000 × g, 5 min) and washed with 1 ml PBS. Pellets were re-suspended in 1 ml BHI broth and OD600 adjusted to 0.5. A phage adsorption assay with phages added at MOI 0.1 was performed as described above in the absence or presence of periodate treatment. Unpaired t-test was used for statistical analysis.

Transmission electron microscopy

Washed phages from lysates were visualised by negative-staining using carbon film copper grids (Agar Scientific, UK; AGS160-3) charged to positive using a 2 min glow discharge in a Quorum Q150TS sputter coater with a turbo orifice plate. The phage samples were left to adhere to grids for 60 s in clamping forceps then grids washed with double distilled water and rinsed using 2% uranyl acetate, ensuring excess drops were wicked away with complete drying prior to imaging at 120 kV using a FEI120kV Tecnai G2 Spirit BioTWIN TEM with a Gatan Rio 16 camera plus digital micrograph software.

Phage DNA extraction

Phage DNA was extracted following previously published protocols with some modifications39,40. To extract sufficient phage genomic DNA, phage particles were precipitated using precipitation solution of 10% (w/v) polyethylene glycol (PEG) 8000 in 1 M NaCl. Samples were mixed gently by inversion and kept on ice for 2 h then incubated overnight at 4 °C. The precipitated lysates were centrifuged at 10,000 × g for 15 min. Supernatant was discarded and pellets re-suspended in 500 μl SM buffer. For phage DNA extraction, the phage mixture was treated with 20 U of DNase in DNase reaction buffer (Invitrogen), and 10 μl of RNase A (Invitrogen), mixed gently, and incubated at 37 °C for 1 h whilst shaking. This was followed by heat inactivation of the enzymes at 75 °C for 20 min. The purification of phage DNA was subsequently carried out using Norgen phage DNA isolation kit according to the manufacturer’s instructions. DNA concentration was measured fluorometrically using Qubit™ dsDNA HS Assay Kit (Thermo Fisher Scientific). The integrity of each of the purified DNA was checked using 1% Tris–Acetate EDTA (TAE) agarose gel electrophoresis.

DNA sequencing and bioinformatic analysis

Bacterial sequencing and MLST

DNA extraction of bacteria and Illumina sequencing analysis was performed by MicrobesNG (Birmingham, UK) using their established MicrobesNG Genome Sequencing Method (v20230314). The multi-locus sequence types (MLST) of strains were determined using MLST (v2.10) from genome sequences based on the MLST scheme of S. hominis and S. epidermidis. A core genome alignment was built from the S. hominis isolates and annotated using Prokka (v1.14.5) with default parameters. Pan-genome analysis of the isolates was done using Panaroo (v1.2.3) with default parameters producing the core genome alignment. The phylogeny of strains used for host range studies was constructed from a maximum-likelihood (ML) tree, with the substitution model TIM + F + I + R3 using 100 bootstrap replicates, generated by IQ-TREE (v2.2.2.6)41 based on core genome alignment. The core genome maximum likelihood tree was visualised with iTOL (v6.8.1)42. The phylogeny of a larger set of S. hominis genomes (n = 243) obtained from the NCBI database used the substitution model GTR + F + I + R10). Plasmidfinder (v2.0.1) (https://cge.food.dtu.dk/services/PlasmidFinder)43,44 was used to identify plasmids present in 26 S. hominis, 19 S. capitis and 25 S. epidermidis isolates with identity threshold of 85% and 80% coverage.

Phage genome sequencing

MinION DNA library preparation of phage DNA was done prior to sequencing using Oxford Nanopore Technology (ONT) with the MinION sequencer. MinION sequencing libraries were prepared using Native Barcoding Expansion 1–12 (EXP-NBD 104) and 13–24 (EXP-NBD114) with the Ligation Sequencing Kit following the manufacturer’s instructions. Samples of the barcoded DNA library were mixed with sequencing buffer and loading beads then loaded onto a MinION SpotCN flow cell, version R9.4.1 using a MinION Mk1B device for sequencing using the MinKNOW (v21.06.0) software.

Bioinformatic analysis of phage genomes

The generated raw dataset from the ONT was demultiplexed and base-called onto fastq files using Guppy (v4.5.4). NanoPlot (v1.32.1) and Seqstats (https://github.com/clwgg/seqstats) were used to check the quality of the sequencing reads. Reads were then de novo assembled using Flye (v2.9.2) pipelines. Quast (v5.0.2) was run to assess the assembly quality based on number of contigs, largest contig, total length, GC%, and N50. Assembly quality was also checked with Bandage (v0.9.0) and CheckV (v0.7.0). The genome ends (termini) and packaging mechanism were predicted and reordered using PhageTerm (v1.0.12) and tRNAs were searched using tRNAscan-SE (v2.0). PhageAI webserver platform (https://app.phage.ai/) was used to determine the predicted phage lifestyle. For each assembled phage genome, Pharokka (v1.0.0) was used for open reading frame (ORF) calling with annotation using default parameters combined with Prokka (v1.14.5) to ensure accurate determination of coding sequences (CDS). HHpred (https://toolkit.tuebingen.mpg.de/tools/hhpred)45 was used to interrogate functional category determination by Prokka.

Family-level classification of phages was performed using ViPTree (https://www.genome.jp/viptree) using whole-genome derived amino acid sequences. In addition, the current inphared database (v1.2) was used to compare phage genomes using Mash (v2.3) and subsequently with whole-genome alignment using MUMmer (v4.0.0). For comparative analyses in this study, unverified phage genomes were excluded. To determine the relationships within each family, smaller phage genome datasets of each family were analysed by (i) calculating intergenomic similarities based on nucleic acid sequences using VIRIDIC (http://rhea.icbm.uni-oldenburg.de/VIRIDIC/) and (ii) constructing a phylogeny based on core proteins using intergenomic similarity thresholds for species (95%) and genus (70%). Core protein analysis of core genes was calculated with VirClust (http://rhea.icbm.uni-oldenburg.de/virclust/) and a tree cut at a distance of 0.9. Clinker (v0.0.28)46 was used to perform the genome comparisons and describe relationships. Proksee (v6.0.3) (https://proksee.ca) was used to generate circular phage genome maps and visualise ORFs annotated using Pharokka. To gain further insight of staphylococcal phage relationships with taxonomic context, vConTACT2 (v0.9.19)47 was used with the Diamond, MCL, and ClusterONE modules (with -db Prokaryotic Viral RefSeq v201, using ICTV and NCBI taxonomies as RefSeq database). Network analysis was visualised using Cytoscape (v3.9.1)48.

Detection of anti-viral defence systems within bacterial genomes

Genes of phage defence system were detected with the PADLOC webserver (https://padloc.otago.ac.nz/padloc/) (v1.2.0) and padlocdb (v1.5.0) using genomes from strains of the host range analyses or S. hominis genomes retrieved from NCBI database. Defence systems categorised as DMS_other were excluded. In addition, the DefenseFinder webserver (https://defense-finder.mdmparis-lab.com/)49 was used to detect anti-viral defence system genes in CoNS strains. PHASTER (PHAge Search Tool Enhanced Release) webserver (https://phaster.ca/) was used to predict prophages of S. hominis, S. capitis and S. epidermidis strains as either intact, incomplete, or questionable. Geneious Prime (v2023.1.2), and both Batch CD-Search and InterPro webserver (https://www.ebi.ac.uk/interpro/) (v98.0) were used to compare protein sequences from host or phage.

Statistics and reproducibility

For statistics and reproducibility in this study, each of the experiments were performed independently a minimum of three times.

Results

Isolation and identification of CoNS infecting phages from human skin

Skin was swabbed by healthy volunteers (n = 80) to generate 320 skin samples that were processed and screened for the presence of phages infecting CoN Staphylococcus species. Isolation of phages involved the use of multiple strains of the staphylococcal species: S. epidermidis, S. hominis, S. capitis, and S. haemolyticus, with successive rounds of plaque purification. A total of 40 phages were isolated from the 320 skin swab samples and stored for study (Supplementary Table S2). Purified phage genome DNA was sequenced using ONT to provide insights into phage diversity, and a phylogenetic tree was constructed based on the genome-wide sequence similarities compared against 1332 phage reference genome sequences (Fig. 1a). Comparative genomic analysis assigned the collected phages to 6 genetic clusters, of which two are novel. Based on recent International Committee on Taxonomy of Viruses (ICTV) nomenclature50, each of the phages belong to an undefined order of the Caudoviricetes class. The novel phage genomes were assigned as unclassified members of the Siphoviridae family (cluster 4: genome length ~ 45 kb; GC% 31.47) and an undefined Herelleviridae family (cluster 1: genome length ~ 147 kb; GC% 29.62–29.67) (Fig. 1a). The previously identified phages included: undefined Rountreeviridae (genus Andhravirus) (cluster 2: genome length ~ 18 kb; GC% 28.14–31.20); unclassified Siphoviridae (cluster 3 and cluster 5: genome lengths ~ 46 kb and ~ 40 kb; GC% 34.70 and GC% 33.27–33.49, respectively); and unclassified Sextaecvirus (cluster 6: genome length 88–104 kb; GC% 28.34–29.68) (Supplementary Table S2). The previously unreported Herelleviridae family (formerly part of Myoviridae) phage genomes are divergent from other Staphylococcus phage families identified from the skin sampling with maximal protein distance based on ViPTree analysis (Fig. 1a). The relationships between the isolated phages and their closest relatives were supported based on sequence homology, which determined variation across ascribed genera (Supplementary Table S2). To further explore relationships, vConTACT was used to generate a network based on interrelations of shared phage proteins from an all-versus-all pairwise comparison that assigned them to protein clusters. The relationships output using the RefSeq database determined that the six clusters from comparative genome analysis were connected to the RefSeq phage network (Fig. 1b). Cluster 2 phages were grouped as Andhra phages, which were connected to the network via a cluster 6 phage that grouped with Staphylococcus phage 6ec. Cluster1 Alsa phages formed their own subcluster that connected via both cluster 6 and diverse Herelleviridae. Clusters 3, 4 and 5 were connected to each other and the RefSeq phage network associated with Siphoviridae of multiple genera (Fig. 1b, Supplementary Table S3).

Figure 1
figure 1

Phylogenetic tree and relationship of CoNS phages. (a) Staphylococcal phage genomes from this study were assigned to 6 genetic clusters (1–6) using ViPTree 3.1 that included 1332 related phage taxa. Virus families are identified on the inner ring as Herelleviridae (mint green), Schitoviridae (blue), and other unknown families (grey). Phyla of their bacterial host is indicated in the outer ring with ascribed colours shown in the key. Cluster 4 represents a novel phage genome of this study (red line) and is distinct from øIME1354_01 representing the adjacent cluster of the same node (black line). (b) Network diagram of phage genomes represented by each node, and edges are genome relatedness inferred by vConTACT. RefSeq phages (small circles) that are neighbours in the core network only are shown for clarity. Individual isolated staphylococcal phages are represented as coloured nodes and named clusters 1–6; expanded in the inset.

The staphylococcal phage genomes were compared using intergenomic similarities analysis (Supplementary Fig. S1) to further discriminate them based on their genetic properties by including the sequences of all 40 phages isolated in this study together with reference staphylococcal phage genomes (cluster 2: NC_047813 øAndhra; cluster 3: ON325435 øCUB-EPI_14; cluster 4: NC_070727 øIME1354_01; cluster 5: NC_028821 øStB2-like; cluster 6: NC_023582 øvB_SepS-SEP9). In addition, to validate the presence of distinct genetic clusters, phage genomes from ViPTree and inphared databases were included in the analysis based on the top k-mer match. Unverified phage genomes in GenBank were excluded from the analysis. From these genome comparisons, 6 identified clusters were supported from the phylogenetic tree generated using ViPTree as being discrete, of which both clusters 1 and 4 represent new phage genomes (Fig. 1a, Supplementary Fig. S1). The four cluster 1 Herelleviridae phage genomes are novel and named here as Alsa phages (type genome Alsa_1) based on their distinct clustering in the ViPTree output, which was supported from intergenomic similarities analysis (Supplementary Fig. S1)51.

Further discrimination of the 40 skin phage genomes and reference phage genomes was done based on intergenomic distances calculated by using VirClust hierarchical clustering of protein presence/absence. VirClust analysis highlighted the distinct protein content with 4 clades in the hierarchical tree showing limited overlap of proteins between the ascribed phage clusters (Fig. 2). In addition to the Alsa phages discrimination, the singleton dsDNA phage (S-CoN_Ph26) genome of cluster 4 from intergenomic similarities analysis contained unique encoded protein clusters supporting it as a novel phage, and distinct from staphylococcal phage IME1354_0152 representing the adjacent cluster of the same node as S-CoN_Ph26 (Fig. 2, Supplementary Table S2), but was not investigated further in this study.

Figure 2
figure 2

Hierarchical clustering of CoNS phage genomes. (1) Intergenomic distances of protein clusters were calculated using VirClust (2) and their relationship by silhouette width was examined using a hierarchical tree. The constituent viral genome clusters (VGCs) were determined based upon a 0.9 distance threshold. Similarity within each VGC is indicated by colour, with values closer to 1 being lime green. (3) Viral genome protein cluster (PCs) distribution is represented by a heatmap with rows representing individual genomes and columns as individual protein clusters together. A range of nine colours are used to indicate the number of protein clusters. Data to the right provide viral genome-specific statistics (4–9): (4) genome length (bp); (5) fraction of shared proteins (dark grey) from total proteins (light grey); (6) fraction of shared proteins within a VGC; (7) fraction of proteins specific to its own VGC; (8) proportion of proteins shared outwith a VGC; (9) the proportion of proteins shared exclusively outwith a VGC. (10) CoNS phage names from this study, provided as either their identifier or GenBank accession number, are included with reference phage genomes. (11) Electron micrographs of 6 phages, one from each cluster identified in this study [scale bar, 200 nm]. (12) Plaque morphology of each of the pictured phage and (13) their ViPTree cluster number and (14) their phage name.

The morphology of a representative phage of each of the 6 clusters defined by ViPTree analysis was determined by transmission electron microscopy (Fig. 2, Supplementary Fig. S2). Phage S-CoN_Ph26 (cluster 4) has an icosahedral head (41 ± 0.4 nm) and a short, non-contractile tail (7.35 ± 0.4 nm) while the remaining phage cluster representatives are all long-tailed phages. Analysis of the genome content using Phage AI predicted lytic lifecycles for 30 phages (clusters 1,2, and 6; % confidence 94–95, 87–99 and 78–92, respectively) and lysogenic lifecycle for the remaining 10 phages (clusters 3,4, and 5; % confidence 100, 100, and 99–100, respectively) (Supplementary Table S2).

Bioinformatic analysis of the phage genome annotations (Supplementary Table S4), determined a notable feature of only the cluster 6 genomes is a tRNA-Trp, that could contribute to growth in the host or the capacity to infect a broader host range, as proposed for 53. Neither virulence-associated genes nor CRISPR-Cas were identified across the set of phages based on Pharokka annotations. A flavodoxin-encoding gene together with genes for ribonucleotide reductase in phages of cluster 1 and 6 were identified; these activities were associated with both viral replication and host metabolism in a study of phage 0305φ8-36 of Bacillius thuringiensis54.

Novel Alsa phages

Phage AI analysis was used to make predictions about the lifecycle of the Alsa phages identified in this study, which proposed they have a lytic lifecycle based on the absence of lambdoid phage integrase and repressor protein functions (Supplementary Table S4). Further phylogenetic analysis of staphylococcal phages identified that Alsa phages share a common ancestor with a broad group of lytic staphylococcal phages, including S. aureus øTwort and øK (Supplementary Fig. S3). The linear dsDNA Alsa phage genomes range in size from 146,115 to 148,384 bp, harbour 256–270 coding sequence (CDS), and feature a narrow G + C content ranging from 29.62 to 9.67% (Supplementary Tables S2 and S4). The architecture of the four Alsa genomes shows synteny with collocated functional and structural proteins (Fig. 3, Supplementary Fig. S4). No tRNA coding sequence was identified using tRNAscan-SE and phageTerm identified a long Direct Terminal Repeat (DTR) sized from 8250 to 9667 bp. The lysis genes of Alsa phages annotated with endolysin activity separately encode three proteins with cell-wall lysis function: CHAP domain protein, peptidoglycan recognition protein (PGRP) and β-N-acetylglucosaminidase (Supplementary Table S5). In addition to these lytic activities, the phage Alsa genomes encode a putative lytic transglycosylase domain protein with its gene located distally to the tail fibre genes (Supplementary Table S5). Lytic transglycosylase domains were not identified in any of the other phage genomes of this study. Also exclusive to the Alsa phages is an Ig domain containing protein with unspecified role yjdB domain. An additional tail gene was identified in Alsa_3 and _4 compared with Alsa_1 and _2 phages (Fig. 3).

Figure 3
figure 3

Comparison of Alsa phage coding regions based on functional annotation. Genomes of øAlsa1-4 were annotated using Pharokka and HHpred and visualised with Clinker with comparison of ORF functional categories across phages indicated by colour. Grey ORFs indicate hypothetical proteins. Scale, 2.5 kb.

Characterisation of novel Alsa phages

One step growth curve assays were used to determine Alsa phage growth curves and their latent period. During replication, the latent period of both øAlsa_1 in S. hominis LIV1218 and øAlsa_2 in S. hominis LIV1220 was 15 min, while the latent periods of øAlsa_3 and øAlsa_4 in S. capitis 104 were longer at 20 and 30 min, respectively (Fig. 4a). Burst sizes of the Alsa phages varied, ranging from 9 to 590 PFU per cell after 60 min, at least under the conditions studied. The doubling times of S. capitis 104, S. hominis LIV1218 and S. hominis LIV1220 were 1.17, 1.52 and 2.13 h, respectively. The thermal stability of the four phages across – 80 °C to 70 °C was determined and revealed that øAlsa_1 and øAlsa_2 were stable at 50 °C while both øAlsa_3 and øAlsa_4 had low stability at this temperature. Only øAlsa_1 and øAlsa_4 had stability at − 20 °C (Fig. 4b). Each of the four phages were stable across a pH range from 4 to 9 (Fig. 4c).

Figure 4
figure 4

General characteristics of Alsa phages. One step growth curve of Alsa phages using original isolation hosts (a). Stability of phages across a range of temperatures (b) and pH (c) measured by PFU values. Statistically significant differences relative to control are indicated by asterisk: one-way ANOVA values *P < 0.05, ** P < 0.01, ***P < 0.001.

The effect of Alsa phages on host culture growth was determined in their original host strains used for isolation with different MOI (0.01, 0.1 and 1) over 20 h in liquid culture. Relative to the absence of phage control, an MOI of 1 resulted in near complete lysis after 20 h. Distinct from the other Alsa phages, at MOI 0.1, øAlsa_4 did not cause complete lysis, while an MOI of 0.01 resulted in variable levels of lysis across the four phages in their respective hosts (Fig. 5).

Figure 5
figure 5

Growth kinetics of hosts with Alsa phages. Phages were cultured with their respective isolation hosts at different MOI using the absence of phage as a control. Absorbance of cells was measured over 20 h in the absence (black, closed circle) or the presence of phages at different MOI (0.01, pink square; 0.1 green triangle; 1, purple diamond) with hosts LIV1218 (øAlsa_1), LIV1220 (øAlsa_2), 104 (øAlsa_3, øAlsa_4).

Adsorption of the four Alsa phages to their respective isolation strain hosts was determined after multiple optimisations for MOI and time. Using an MOI 0.1 over a 20 min time period, between 70 and 100% of the population across the four phages adsorbed to the host cells, indicating high adsorption efficiency for each combination (Fig. 6a). To further characterise the binding of Alsa phages to their respective host, sodium periodate (NaIO4) treatment of the host cells, which would impact cell surface carbohydrate, resulted in significantly reduced adsorption efficiency of each phage compared to those incubated without NaIO4 (P = 0.001) (Fig. 6b). These data support Alsa phages binding to a host cell carbohydrate as a receptor with S. capitis and S. hominis differing in the extent of the reduction observed, which could reflect surface differences of these species.

Figure 6
figure 6

Adsorption of Alsa phages and effects of periodate treatment. (a) Alsa phages were incubated with their respective isolation hosts at MOI 0.1 and free phage were titred at the indicated time periods by plaque assay. (b) The effect of sodium periodate treatment for 60 min on the proportion of phages adsorbed (MOI 0.1) over 20 min was determined with hosts LIV1218 (øAlsa_1), LIV1220 (øAlsa_2), 104 (øAlsa_3, øAlsa_4).

Host range determination of skin-isolated phages

To compare the host range of the 40 skin-isolated phages, a collection of 140 bacterial strains of 8 Staphylococcus species was used, including S. lugdunensis, S. haemolyticus, S. epidermidis, S. capitis, S. hominis, S. warneri, S. saprophyticus, and S. aureus (Supplementary Table S1, Fig. 7). Using a spot assay, the lytic activity of phages (titres 106–109 pfu ml−1) was assessed after overnight incubation based on the extent of indicator strain lysis zones. S. epidermidis, S. capitis, and S. warneri were broadly susceptible to infection by the set of phages with individual strain variation (Fig. 7). In contrast, S. hominis strains showed a distinct pattern of more limited infection by the set of phages, when compared to other Staphylococcus species tested (S. epidermidis, capitis, warneri and lugdunensis) (Fig. 7, Supplementary Fig. S5). Within the set, the Alsa phages showed a capability for complete lysis of S. hominis. Specifically, øAlsa_1, øAlsa_2, øAlsa_3 and øAlsa_4 infected 9%, 8%, 5% and 3% of the 140 Staphylococcus strains tested while 50%, 42%, 15% and 27% of S. hominis strains (n = 26) were infected, respectively (Supplementary Fig. S5).

Figure 7
figure 7

Host range of isolated CoNS phages. The 40 CoNS phages of this study were tested for their ability to infect 140 strains of staphylococcal species using a spot assay. Host lysis was determined visually by zone clarity with complete (dark green), turbid (light green), or no lysis (purple). The experiment was repeated a minimum of three times.

Further analysis of the included genome sequences revealed diversity of the S. hominis strains with 15 different sequence types (ST) identified with ST94 and ST96 (Fig. 7, Supplementary Table S6) having the greatest sensitivity to phages. Several of the phages in the identified set were more capable of lysis of individual CoNS species, specifically S-CoN_Ph30 caused complete lysis of 80% of the tested S. epidermidis strains (n = 20) while the phages S-CoN_Ph5, 11, 17, 20, 24, 28, 34 caused complete lysis of 79% of the tested S. capitis strains (n = 15). Distinct from the CoNS species, S. aureus was uniformly resistant to infection by all the tested phages. Using the spot assay, a single phage (S-CoN_Ph7) was capable of lysis from without, however no infection occurred as confirmed by the absence of plaques on S. aureus BEN184 upon serial dilution at high phage titres, and the inability to propagate S-CoN_Ph7 on S. aureus BEN184 or use the culture supernatant to infect the original host strain, S. capitis 104 (Fig. 7).

EOP assays were used to confirm phage amplification and plaque formation and validate the host restriction patterns observed with spot assays. Phages that caused complete or turbid lysis were selected to perform EOP assays with 7 representative host strains of both S. epidermidis and S. capitis species, together with 13 S. hominis strains (Supplementary Fig. S6). Broadly, the EOP data confirmed the distinctly reduced infection of S. hominis strains that contrasted with the greater infection of both S. capitis and S. epidermidis observed in the host range data, based on plaque formation (Supplementary Fig. S6, Supplementary Table S7).

Phage defence identification

To gain insights of potential defence mechanisms that could explain the limited phage infection observed here with S. hominis, the genomes of all the strains tested (n = 26) were interrogated using both PADLOC (Fig. 8) and DefenseFinder (Supplementary Fig. S7). Multiple abortive infection (Abi) defence and RM systems are widely present in the strains used for host range analysis (Supplementary Table S8). Notably, RM Type IV was identified in every S. hominis genome and adjacent to each was a gene encoding a 5-methylcytosine specific restriction endonuclease system conserved domain protein (McrC; sequence identity 65–100%). To further investigate RM genes, the host range study S. capitis (n = 19) and S. epidermidis (n = 25) strain genomes were similarly interrogated with PADLOC and DefenseFinder. This analysis identified a high frequency of RM Type I in S. capitis and both RM Type I and II in S. epidermidis genomes (Supplementary Figs. S8, S9, Supplementary Table S8), however the McrC gene was absent from both species. To determine if the presence of RM Type IV is widespread in S. hominis, 243 genomes available in the NCBI database were analysed to look for defence systems more broadly at species level (Supplementary Fig. S10). This revealed the presence of Abi and Gabija systems plus high frequency of RM Type I and IV defence systems.

Figure 8
figure 8

Phage defence systems and plasmid content of S. hominis. The presence of phage defence system genes in genomes of the S. hominis strains used in the host range study were detected with PADLOC. Number of defence system genes present is scored (0–6) and the total number of prophages (0–6) are indicated as either intact, incomplete, or questionable. Phylogenetic tree of S. hominis strains was generated using IQ-TREE. Sequence type (ST) of strains is indicated by vertical-coloured bars. Plasmid content (0–7 plasmids) is shown by different sized and coloured circle symbols, determined by PlasmidFinder.

To investigate if the host range profile for phage infection corresponded with prophage presence and observed phage resistance, comparative genomic analysis of the S. hominis isolates tested with phages here identified that the number of intact prophages ranged from 0 to 1, with additional incomplete prophage regions (Fig. 8) showing evidence of temperate phage infections. Further, when both S. capitis and S. epidermidis genomes of strains from the host range analysis were compared there was no correlation between number of prophages and resistance to infection (Fig. 7, Supplementary Table S9). This indicates that among possible explanations, the phage infection barrier observed here with S. hominis could be a superinfection mechanism determined by a species-specific incomplete prophage, but this was not studied further. Plasmid content was also determined in each host strain and found to vary (0–8), and no clear link was identified between the numbers of plasmids present and phage infection susceptibility (Figs. 7, 8, Supplementary Fig. S11, Supplementary Table S10).

Discussion

Recent analyses have demonstrated the spatio-temporal distribution of bacterial species on human skin over the life course2,4,21. While other studies sought to determine the host and bacterial factors that drive microbial community changes, here we investigated the virome with respect to phages that infect CoNS. The contributions of both phages and their hosts is needed to improve understanding of the biotic and abiotic drivers of population changes on skin and the ability to externally alter these for health and personal care.

For this study, we focused on isolating phages present on human skin that infect the multiple coagulase-negative species of Staphylococcus that are abundant residents, where most efforts have focused previously on the skin pathogen S. aureus. We identified 40 phages and comparative genomic analysis revealed that the phages formed six distinct genetic clusters that belong to an undefined order of the Caudoviricetes class50. Within this undefined order, two of the genetic clusters were previously reported and here we identified an additional two novel clusters, which extends our understanding of skin phage diversity with respect to CoNS species. Specifically, the novel phages belong to an unclassified Siphoviridae family (cluster 4: genome length ~ 45 kb) and an undefined Herelleviridae family (cluster 1: genome length ~ 147 kb). The latter of these phages named Alsa represents a clade that shares a common ancestor with known lytic Herelleviridae phages of staphylococci, based upon a phylogenetic tree produced by ViPTree (Supplementary Fig. S3). We propose that Alsa are lytic phages based on their relationship with reported Staphylococcus lytic phages together with the absence of integration-related genes supported by phage AI. Further study is needed to investigate the proposed novel temperate phage S-CoN_Ph26 of cluster 4 that was not investigated further here. vConTACT network analysis determined that the 40 identified phages were connected to genomes of a RefSeq database derived from ICTV and NCBI databases. In particular, the lytic Alsa cluster 1 and cluster 3 phages formed subclusters of their own that integrated with the large, interconnected network (Fig. 1b, Supplementary Table S3). While the lytic phages (clusters 1,2,6) formed connected but discrete groups, the temperate Siphoviridae (clusters 3,4,5) were closely connected with first neighbours to multiple Staphylococcus Siphoviridae. The successful isolation of this large collection of phages using simple and common swab sampling methods highlights the potential to uncover yet more diversity of the cutaneous virome via phages that infect staphylococci. Such studies should combine greater numbers of volunteers from distinct global populations with additional CoNS species and strains for phage propagation to limit propagation bias.

While skin swab approaches can enable phage isolation for experimental study, metagenomic studies have greatly extended our understanding of the diversity of viruses from communities of key cutaneous fungal and bacterial genera14,55. Recently, evidence was gathered that determined differences between the phage repertoire of healthy and atopic dermatitis skin that might reflect an important role for phages in structuring skin communities14. In their study, 111 viral metagenome-assembled genomes were not conclusively mapped to a host demonstrating the need for metagenomic studies to be performed in combination with phage isolation as undertaken here. Purified phages can define host ranges needed to extend microbiome knowledge and aid modelling of skin community dynamics of healthy and inflamed skin.

The broadening of phage diversity from skin here builds on several other studies that examined phages capable of infecting CoNS hosts. A recent study of 26 volunteers found half harboured phages that infect S. epidermidis and revealed 7 unique phage sequences37. An extensive screen of wastewater samples in Switzerland56 uncovered 94 novel phages infecting 29 Staphylococcus species comprising 117 strains. Notably, the wastewater screen identified many phages infecting species S. vitulinus, S. sciuri, S. xylosus, S. succinus, S. epidermidis and S. aureus. Further analysis of the skin phages sampled in our study is needed to determine if their host range extends to those species in the wastewater study that are uncommonly found on skin. Hannigan et al. reported that from cutaneous sample metagenomic data a relative proportion of > 85% temperate phages11, similar to that identified in the human gut virome57,58. In contrast, our sampling size of 40 phage genomes identified a proportion of 25% temperate phage, which might reflect differences in the sampling procedure and chosen hosts for propagation.

The majority of the 10 CoNS temperate phages isolated here includes eight StB20-like phages (cluster 5) that were similar to those identified by van Zyl et al.31 and connected as first neighbours by vConTACT (Supplementary Table S3). The remaining two temperate phages include cluster 3, which has similarity to phage IME1354_01 isolated from S. cohnii52 and the novel cluster 4 phage. Further study is needed to investigate their prophage biology and potential morons. The eight StB20-like phages have 0–1 gene annotated as a moron by Pharokka (category: auxiliary metabolic gene and host takeover); this moron encodes a putative small, membrane protein that warrants further study.

The host range here with the 40 phages determined that the species S. epidermidis, S. capitis, S. lugdunensis and S. warneri were broadly infected, with a limited number of individual strains of these staphylococci showing an absence of infection that likely reflects their phage resistance (Supplementary Fig. S5). Within the collection of phages isolated here, we determined the host range was restricted to CoNS species with no evidence of infection leading to propagation in S. aureus. Further study is required to understand the barriers to these phages infecting S. aureus that might reflect the absence of phage receptors and could extend to phage defence mechanisms. Most phages of Staphylococcus adsorb to wall teichoic acid glycoploymers and recent work by Beck et al. identified that resistance to øE72 infection in S. epidermidis is mediated by WTA modification with glucose59. The authors propose that glucose-modified WTA alters susceptibility to phage infection in certain CoNS species.

A clear pattern of reduced infection was evident with S. hominis using the isolated collection of phages, whereby few individual bacterial strains were infected (Supplementary Fig. S5). This reduced infection pattern of S. hominis compared with other tested CoNS species indicates there might be a widespread mechanism to limit phage infection in S. hominis. First, we investigated the potential mechanism of S. hominis phage resistance by curating possible phage defence mechanisms using PADLOC and DefenseFinder with our strain set (n = 26) used for the host range study. This indicated that RM Type IV and McrC represented distinct differences between the RM systems of S. hominis compared with both S. capitis and S. epidermidis. RM systems and restriction endonucleases have a key role in genome maintenance by controlling ingress of DNA based on methylation status and contribute to limiting bacteriophage infection and plasmids60. Further, RM systems can contribute to speciation61. To extend investigation of S. hominis RM, 243 genomes available in public databases were analysed using PADLOC and DefenseFinder, which supported that RM Type IV was present across the genomes with high frequency, as was Type I. There was found to be no clear link between presence and absence of RM Type IV and McrC with reduced host infection of S. hominis by the tested phages, however its contribution to phage defence in the species needs to be experimentally determined together with the RM Type I system present at high frequency in both S. epidermidis and S. capitis genomes. Lee et al. identified major diversity of RM Type I genes of S. epidermidis with either truncation or in 38% of strains gene absence, supporting the need to determine the functional activity of phage defence systems62.

Greater insights of Staphylococcus phage defence could come from broadening the range of species to identify if other examples of limited phage infection occur. S. lugdunensis has a closed genome with RM type I and II but also CRISPR/Cas defence systems63 that might contribute to limited phage infection seen with the S. lugdunensis isolates tested here. A recent experimental study of Escherichia coli identified 21 previously undescribed phage defence systems with prophages and mobile genetic elements providing unassigned barriers to infection64 and highlighting alternative approaches to uncover a species’ full repertoire of defence mechanisms. The complement of phages identified here will support such experimental searches of Staphylococcus defence systems.

Further experimental study of S. hominis to determine the exact molecular basis of its reduced phage infection should include a focus on surface adsorption studies. In this study, periodate treatment supports that an extracellular carbohydrate could serve as the phage receptor for adsorption, consequently insights of teichoic acid and its glycosylation modifications would define phage adsorption and infection and build on the study of Refs.59,65. Future studies could include Alsa phages with their greater capacity to direct the complete lysis of a greater proportion of S. hominis strains than other tested phages that should include the study of its discrete, putative transglycosylase activity. Such interrogation will help to unravel aspects of the virome and the factors that could structure skin communities, and their contribution to health and disease.

The capability to alter the frequency of key species within the skin microbiome using phages and their lysins has clear health and personal care applications. For example, wounds and ulcers have seen development of topical phage products, and Kifelew et al. used a phage cocktail that reduced S. aureus load and ulcer wound size66. Kielholz et al. generated a stable and biocompatible phage-loaded wound dressing that released soluble fibres with S. aureus and Pseudomonas aeruginosa lytic phages67. A phase I clinical trial of a topical 3-phage cocktail hydroxyethlycellulose gel targeting Cutibacterium acnes demonstrated safety, tolerance and its effectiveness at reducing burden of the species on the face68.

Specific to our study, S. hominis is associated with human axilla body odour69,70 and future studies could investigate whether there is a correlation between reduced presence of Alsa phages in high body odour individuals. Potentially, increasing infection by resident Alsa phages could be promoted to limit odour. For example, an inhibitor of the phage infection barrier could be used in personal care that would promote the resident skin phages to infect and reduce the skin population of S. hominis.

In conclusion, the extended collection of phages infecting Staphylococcus in this study adds to the potential to enhance the molecular toolkit needed to improve recombinant DNA studies of the CoNS, which has lagged behind that of S. aureus. Additionally, studies investigating use of the phages in reducing staphylococcal biofilms and in therapeutic and personal care applications can be addressed in future work.