## Introduction

Despite decades of focus on the local scale, ecology now recognizes the influence of regional processes of evolution and dispersal on community diversity and composition [1,2,3]. During all stages of community assembly, colonization can influence community diversity and composition by contributing new species and adding individuals to established populations [4,5,6]. Determining how many and which colonists influence local communities is important to understand community structure because the number and identity of species of colonists can regulate richness [7,8], influence species composition [9], trait distribution [9], and ecosystem function [10] at the local scale. In plant and animal ecology, the traditional concept of the regional species pool hypothesizes that species available for colonization depend on large-scale evolutionary and biogeographical processes [11,12]. Due to the large spatial and long temporal scales of these processes, the regional species pool is often assumed to be stable at ecological time scales [11,13,14] and this view has permeated much of the understanding of local and regional processes [15], interspecific interactions [16], and neutral theory [17]. However, the pool of colonists interacting with local scale habitats is often a subset of this large-regional pool, subject to more proximate habitat factors, particularly in microbial systems [18,19,20].

The evolutionary regional species pool can be divided into subsets of colonists that share similar dispersal mechanisms and ecological traits. For example, of all bacteria in alpine glaciers, Rime and collaborators [21] found that microorganisms from glacier sediment, and not from precipitation or streams, colonized newly deglaciated soil. For clarity, we refer to each of these subsets of colonists as a colonist pool, the group of species that arrives to local patches and interacts with local habitats. Previous studies have identified the most important colonist pool among a series of potential pools in some systems (e.g. [20,22,23,24]), but have rarely directly considered the possibility that multiple colonist pools can coexist with several local habitats (but see [25]). For example, although plant communities are often exposed to wind dispersed colonists (seed rain) and colonists from the soil (seed bank), which colonist pool is actually the most influential can depend on soil properties of local habitats [26,27,28]. If a particular local habitat receives colonists from only one colonist pool, comparing the colonist pool with local community composition is informative of the habitat filter [29]. These filters occur when habitat conditions are harsh or stressful for non-tolerant species and local communities become less diverse, less heterogeneous and less similar to the colonist pool [30]. However, there are other possible scenarios when multiple colonist pools influence different habitats (Fig. 1). If neighboring local sites with different habitats receive colonists from the same colonist pool (Fig. 1b), this provides an ideal opportunity to compare their habitat filters. If local sites receive colonists from multiple colonist pools (Fig. 1c), then comparing the local habitat to all contributing colonist pools is necessary to assess the nature of the habitat filters.

Here, we examined the influence of multiple colonist pools of sediment bacteria on fiddler crab-associated bacterial communities and ask whether considering multiple colonist pool influences improves our understanding of this system. When fiddler crabs molt, newly formed chitin surfaces of the carapace and the gut are sterile and receive colonists from sources of bacteria in the surrounding environments [31,32]. Colonists are likely to come from the sediment because fiddler crabs feed on organic matter, microalgae and bacteria that they scrape off the surface sediment [33] and burrow to mate, escape predation and avoid extreme temperatures [34]. Sediment bacteria can be subdivided in three dominant habitats in salt marshes based on physicochemical conditions: sediment surface, subsurface, and crab burrows [35,36,37]. Bacteria in the surface sediment experience high oxygen and light availability [36,38] and should be dominated by aerobic heterotrophs, autotrophs, and nitrogen fixers [39,40,41]. Subsurface bacteria experience low oxygen levels and utilize NO, Fe and SO4 as electron acceptors [35,42] and should thus be dominated by taxa capable of anaerobic metabolism [36]. The sediment on the burrow walls is aerated and mixed as fiddler crabs burrow, developing a diverse bacterial assemblages that differ markedly in composition from surface and subsurface bacteria [43,44,45,46,47]. Although microbial communities in these sediments have been studied separately, their combined influences on the infauna have not been assessed yet. Host-associated microbial communities exposed to colonization from sediment bacteria could reveal alternative multiple colonist pool scenarios (Fig. 1).

Regardless of how many colonist pools influence a site, all propagules must pass through local habitat filters before recruiting to local communities. In the crab’s carapace, bacteria able to attach and compete for space should be able to maintain membership in the community [48,49]. In contrast, to colonize the gut, bacteria must tolerate the physical scrapping and sediment selection of the crab’s oral appendages [50] and the chemical stress of hepatopancreas secretions [51,52]. In addition to local habitat characteristics, host factors like genotype [53,54,55] or species can act as a filter and determine the variation in the composition of associated microbial communities [56,57]. Empirical studies have however found contradictory evidence about these relationships [58,59] vs. [60], which could result from our lack of understanding of the mechanisms through which host genotype or species regulate its habitats. Fiddler crab species have been found to differ in some physiological factors [61] but not others [62] and whether these factors have an effect on their associated microbial communities is unknown. In contrast, the crab’s sex, has been linked with increased richness in male gut bacteria (Cuellar-Gempeler and Munguia [63]). This diversity effect of sex can be mediated by physiological [64,65,66], or behavioral [67,68] differences between fiddler crab males and females. Importantly, to decipher the role of host factors and habitat filters on bacterial community diveristy and composition, we must first investigate the influence of colonization.

The major goal of this study is to investigate the influence of multiple colonist pools on fiddler crab associated bacteria and to compare its effect on composition to that of host-associated factors. Specifically, we asked three questions. First, we ask how different are the colonist pools. We hypothesize that surface, subsurface and burrow bacteria differ based the physicochemical characteristics of each sediment type. Second, we questioned how many and which sediment colonist pool influences the carapace’s and the gut’s bacterial communities. Based on the crab’s interaction with the sediment, we hypothesize that bacteria colonize the gut when the crab feeds on surface sediment and burrow sediment bacteria colonize the carapace as the crab takes refuge. In contrast, we do not expect the subsurface sediment to be an important contributor of colonists. Third, we ask whether host factors like crab sex or species generate variation in community diversity and composition. We hypothesize that sex is more important in determining variation in bacterial composition than host species, based on previous findings that suggest bacterial communities associated with males are more diverse.

## Methods

We conducted our study in a 150 × 150 m2 marsh area located on Stedman Island, near Aransas Pass, Texas (27° 53’ 13.56” N, −97° 7’ 0.07 W). The sediment is influenced by inflow from high salinity seawater from Redfish Bay (between 25 and 35 ppt during the summer [69]) and vegetated by scarce black mangrove (Avicenia germinans), saltmarsh cordgrass (Spartina alterniflora) and woody glasswort (Salicornia bigelovii). This salt marsh was chosen because it is co-inhabited by two fiddler crab species (Uca panacea and Uca rapax) that exhibit similar behavior, habitat, and diet [70]. We focused on the bacterial community associated with these crab’s carapace and gut, which represent different habitats for bacteria. While the crab’s carapace is open to colonization [48,49], colonizing the gut requires surpassing physical [50] and chemical filters [51,52].

### Sampling procedure

Sample collection was divided evenly in four sampling dates (July 21, 29 and 31, and August 5, 2014). Adult U. panacea and U. rapax (18 males and 18 females) were collected by hand and stored in individual sterile containers. Using sterile spatulas, we collected a total of 10 samples of each sediment type. Approximately 20 g of surface, subsurface and burrow sediment were collected on sterile 50 mL falcon tubes. Replicates of surface, subsurface and burrows were collected approximately 10 m apart from each other at sites with active burrows. Active burrows were identified based on crab activity (leaving and entering) and signs of recent burrowing (freshly disturbed sediment and crab tracks). Surface sediment was scrapped from the top layer (0 to 2 cm deep). For subsurface samples, we mixed sediment from 7 to 15 cm in depth. Although microbial community composition can vary significantly with depth [71], crabs are exposed to all bacteria over this depth range when burrowing. Likewise, burrow samples were obtained from this depth range but along the surface of fiddler crab burrows. We selected only burrows that extended beyond 15 cm in depth and were wider than 2 cm. Crab and sediment samples were transported to the Marine Science Institute from the University of Texas at Austin for further processing.

### Sample preparation and DNA extraction

Upon arrival to the lab, samples were prepared for DNA extraction. Each sample from the sediment was homogenized by vortexing at high speed for 30 s and 2 g were separated, in sterile conditions, for DNA extraction. Although this procedure may not completely homogenize a heterogeneous sediment sample, it allowed mixing of the sediment obtained from different depths in subsurface and burrow samples. Surface samples were treated the same for consistency. Crabs were rinsed with sterile deionized water to remove debris and unattached microorganisms. Then, carapaces were swabbed and scraped to profile the surface community. Crabs were sacrificed by freezing and dissected to obtain gut samples. Then, guts were rinsed with sterile deionized water to avoid food bolus interference and to remove unattached bacteria. All samples were kept in MoBio PowerSoil bead tubes at −80 °C and were processed within 2 months of collection. DNA was extracted using the PowerSoil DNA extraction kit (MoBio) as instructed by the manufacturer. We included samples taken of all sterile materials to control for potential contamination. DNA concentration was quantified with Qubit fluorometer (Qubit 2.0, Invitrogen) using high sensitivity assay reagents. Only samples with more than 0.1 ng/μl DNA yield were used for the study to avoid sample bias.

### 16S rRNA gene library preparation and sequence analysis

Individual samples were prepared for Illumina sequencing using a two-step, gene-specific PCR. To avoid host DNA amplification, we targeted the V4 hypervariable region of bacterial 16S using the 515F/806R primer pair (Ong et al [72], Wang and Qian [73]). We used the MiSeq Illumina platform to obtain pair-end 250 bp nucleotide reads. Library preparation and sequencing was done at the University of Texas Genome Sequencing and Analysis Facility (GSAF). The resulting sequences were processed using custom bash scripts and QIIME [74] with Greengenes as reference databank [75]. OTUs (Operationa Taxonomical Units) were defined at the 97% sequence similarity and were picked with an open frame. Please refer to our Supplementary material for details on library preparation and sequence processing.

### Data analysis

The following analyses were conducted in the R environment (version 3.3.2, [76]). We removed OTUs assigned to Archaea or unassigned and those found in less than 3 times in less than 1% of the samples.

First, we evaluated differences in diversity and community composition among sediment types. To determine whether these results reflect similar sampling effort across samples, we calculated species accumulation curves. Since the species accumulation curves from sediment reached asymptotes (Fig S1), we considered only raw richness for further analyses. We assessed the difference in richness across sediment type by using a GLM fitted with a negative binomial distribution. Then, we calculated Pielou’s evenness as J = H’/lnS [77] and used an ANOVA to determine differences across sediment types. We assessed normality and homoceidacity of residuals using the Shapiro-Wilks test and the Bartlett test, respectively [78,79]. Evenness was inverse square root transformed to meet parametric assumptions. We calculated distance-to-centroid as a measure of community similarity and evaluated differences in similarity across sediment types. We used an analysis of multivariate homogeneity of group dispersions with 999 permutations. This method is multivariate analogue of Levene’s test for homogeneity of variances where non-euclidean distances between objects and group centroids are reduced to principal coordinates to assess beta diversity [80]. We assessed the role of sampling date by adding it as a random factor in these tests (Table S1) but, since we did not find any effects, we removed it from the statistical models presented here.

We used a Cannonical Correspondance Analysis (CCA) with standardization by row and column weights to assess compositional differences between the three types of sediment bacteria. CCA is a direct ordination technique that can identify correlations between multivariate data on OTU abundances and the sediment type coded as dummy variables. Significance of the differences in composition was assessed with a permutational MANOVA on the CCA scores [81]. We compare the results from the CCA with non metric multidimensional scaling (NMDS) based on Bray-Curtis and a weighted UniFrac (Fig S4). Since we obtained similar results, we felt confident to proceed with the CCA ordination. We chose this specific ordination tool because it can incorporate the function predict, which allows us to find corresponding scores for crab samples in an unbiased way.

Second, we determined the contributions of each sediment pool to each crab-associated community by comparing community composition in each crab sample and the centroid of the various colonist pools. To incorporate the crab sample scores to the sediment CCA, we used the function predict from the package ‘vegan’ (version 2.4.4, [82]). Then, we assigned each crab-associated community to a species pool based on the distance to nearest pool centroid in multivariate CCA space. This approach is useful to estimate similarity between standard and test data without the bias of including all samples in the calculation of the original ordination axes [83]. We assessed the sensitivity of this method by comparing the results from all OTUs with the results from the 100 most abundant taxa (Fig S2) and to the outcome of Source Tracker ([84], Fig S3).

Third, we wanted to determine the effect of host-factors on community diversity and composition. We assessed the effect of host-factors on richness and evenness using a three-way ANOVA including habitat (carapace or gut), sex and species. We tested for normality and homoceidacity as described above for sediment samples. Evenness data was square root transformed to meet parametric assumtions. Then, we used a perMANOVA on CCA scores obtained from predict to determine how host factors, such as the type of habitat, or the crab’s sex or species influence bacterial community diversity and composition. We were also interested in assessing the significance of these groupings independently of the assignment methodology from the predict function. Thus, we used cluster analysis to identify groups of samples based on community composition and asked whether these clusters correspond to host-factors or pool assignments. We used a hierarchical clustering analysis with the “ward” method (hclust function, [85]) and estimated the significant number of clusters using the gap statistic [86]. Lastly, we calculated distance-to-centroid as a measure of community similarity and evaluated whether similarity was determined by the host’s sex or species within each habitat. We used an analysis of multivariate homogeneity of group dispersions as described above and tested the role of habitat type, host sex and species.

We looked at three taxonomical levels to identify the bacteria that contribute to the above diversity patterns. First, we illustrated bacterial phyla that characterize each sediment type and crab habitat these patterns. We built a heatmap detailing the abundance and distribution of bacterial Phyla using heatmap.2 from the package ‘gplots’ (version 3.0.1, [87]). Then, we focused on the family taxonomic level to identify bacterial abundance patterns that underlie colonist pool relationships. To identify specific families contributing to differences and similarities between sediment and crab substrates, we used SIMPER analysis [88]. SIMPER analysis compares normalized relative abundance between samples measuring differences as Bray-Curtis dissimilarity. The contribution of individual OTU i to compositional differences is estimated with the formula

$${\mathrm{d}}_{ijk}{\mathrm{ = }}|\frac{{{\mathrm{x}}_{ij} - {\mathrm{x}}_{ik}}}{{{\mathrm{\Sigma x}}_{ij} - {\mathrm{x}}_{ik}}}|$$

where x represents the abundance of OTU iin simple j and k. Although this method can confound the overall abundance with variation between samples [89], we used it to identify bacterial families explaining 80% of the variation in community composition between carapace, gut, surface, and burrow bacteria. We tested for significant differences in abundance of bacterial families identified by SIMPER across sediment and crab substrates using a permutation test (999 permutations). We selected informative families to further tested differences in abundances among sediment types and crabs. We used ANOVAs unless parametric assumptions were not met after transformations. In these cases Kruskal-Wallis test was used instead. To identify specific differences in abundance across substrates, we used post hoc Tukey or Nemeyeni tests, depending on whether data was parametric or not. To correct for false discoveries over tests for each taxa, we used Bonferroni corrections [90]. We used additional heatmaps to illustrate OTU abundance and occupancy patterns of selected families.

## Results

Of the original samples, 65 samples were suitable for analysis. After quality-filtering, we had 21 carapace, 15 gut, and 30 sediment samples with a median of 6605 reads per sample and a range between 3000 and 14346 reads. We identified 517 individual bacterial OTUs distributed across 17 phyla and 80 families.

### Colonist pools in the sediment

Sediments contained a total of 459 OTUs. Surface and subsurface had comparable richness levels, while burrow sediments had slightly lower richness, albeit not significantly different (Table 1, χ22, 27 = 4.380, p = 0.996). Evenness followed rather the opposite trend, with burrow having a higher value than subsurface and surface having the lowest value (Table 1, F2, 27=16.83, p < 0.001). Surprisingly, the surface did not contain any unique OTUs, while subsurface and burrow sediments had 14 and 36 unique OTUs, respectively (Table 1). These unique OTUs were rare within each sediment type and did not represent more than 2% of the community (Table 1). The CCA revealed significant differences in bacterial community composition among sediment types (Fig. 2, perMANOVA, F2, 27 = 108.73, R2 = 0.889, p < 0.001). While there were clear differences in composition, sediment types also differed in community similarity, with burrow sediment containing the least similar assemblages (permutation test, F value = 45.898, p < 0.001, Fig. 2)

In the surface sediment, we found high abundance of Firmicutes, and Spirochaetes (Fig. 3). Particularly, surface sediment bacteria were dominated by taxa from Halanaerobacteriaceae and Balneolaceae (relative contribution ranging from 9.2 to 24.8%, Fig. 4, Table S2). Cyanobacteria were consistently found in burrow and surface sediment but never reached high relative abundances (average relative abundance of 0.8% in surface and burrow sediment), and it was extremely rare in subsurface sediment. Subsurface sediments were instead represented by high abundances of OTUs from Caldithrix, and SAR406 (also known as MGA, [91]), while burrow sediments contained high abundance of Acidobacteria and Nitrospirae (Fig. 3). Subsurface and burrow sediments shared high abundances of Rhodobacteriaceae and Desulfobacteriaceae (Fig. 4). Importantly, sediment types differed in OTUs composition within bacterial families such as Desulfobacteriaceae (Fig. 5, Table S3).

### Colonist pool influences on crab-associated bacteria

Based on community similarity, most crab-associated communities were assigned to burrow, few to surface sediment, and none to subsurface sediment pool. While all carapace bacterial communities were (100%) assigned to the burrow, 10% gut communities were assigned to the surface and 90% to the burrow (Fig. 2). Crab gut bacterial communities assigned to the surface sediment corresponded to males from both species. A perMANOVA based on predicted CCA scores of crab samples revealed significant differences in bacterial community composition among carapaces and guts, but also showed an effect due to the interaction between host species and sex (Table 2). Crab bacterial communities were characterized by high abundance of Proteobacteria, Bacteroidetes and Actinobacteria and Verrucomicrobia. While most phyla were more abundant in the carapace, Tenericutes and Planctomycetes were more abundant in the gut.

The influences of sediment bacteria were better understood by looking at similarities at the family and OTU taxonomical level (Figs. 4, 5). Carapace and gut bacterial communities held high abundances of Rhodobacteraceae (mostly Paracoccus sp, Rhodobacter sp), Flavobacteriaceae and Saprospiraceae, which are also found, at lower abundance, in burrow and subsurface sediment (Fig. 4). Dominant taxa from the surface (Halanaerobiaceae and Balneolaceae) were absent or present at very low abundances within crab habitats (Fig. 4). Vibrionaceae was an important representative of crab bacterial communities and was found, at lower abundance, in the burrow sediment. At the OTU level, we found that taxa within Desulfobacteraceae found in crab samples reflect those found in burrow sediment (Fig. 5).

### Host factors

Habitat within the host was the main explanatory factors for the bacterial communities’ richness and evenness. Carapace bacterial communities had higher richness than gut bacterial communities, with a marginal effect of species (Tables 3, 4). This pattern was probably driven by U. panacea, which had the lowest values of richness in the gut (Table 3). Gut bacterial communities had higher evenness than carapace bacterial communities (Tables 3, 4). There were no significant differences in community similarity between gut and carapace bacterial communities (permutation test, F value = 3.050, p = 0.081, Fig. 2), crab species (F value = 0.616, p = 0.46) or male and female communities (F value = 0.133, p = 0.87).

The cluster analysis showed a contrasting effect of host factors on carapace and gut bacterial communities. Gap analysis indicated that carapace bacterial communities clustered in 6 significant groups (Fig. 6a, b). These groups loosely correspond to host factors (Fig. 6a). For example, the first cluster on the left contains only U. rapax samples and the two clusters on the far right are composed of U. panacea. In contrast, gap analysis indicated only one cluster comprising all gut bacterial communities (Fig. 6c, d).

## Discussion

Whether a unique regional species pool or multiple sources of colonists influence local communities could have consequences for local community dynamics. Here, we identified three potential colonist pools of bacteria in the salt marsh sediments and their influences in crab-associated bacterial communities. Patterns of similarity in community composition suggested that carapace bacterial communities receive most colonists from burrow sediment (Fig. 2). In contrast, guts received colonists from burrow and, at least in part, from surface bacteria (Fig. 2). This result suggests that multiple colonist pools can influence local patches (Fig. 1b) and individual colonist pool can influence distinct local habitats (Fig. 1c).

We found that the three potential colonist pools within the salt marsh sediment differed in OTU composition (Fig. 2). Our findings coincide with previous studies showing strong physicochemical differences driven sediment depth [47] and bioturbation [37,71,92] and suggest that sediment bacteria are under distinct ecological selection pressures. For example, surface sediment was strongly dominated by salt tolerant taxa (Halanaerobiaceae, Balneolaceae) while other OTUs were rare (Fig. 4, Table S2, [93,94,95]. Even with this strong environmental stress, surface bacteria were more diverse than other sediment types, possibly because oxygen, light availability or immigration from other habitats sustain a long tail of rare OTUs (Fig. 3, Table S2). These results coincide with studies highlighting the strong effects of salinity on autotrophic [40] and nitrogen fixers [41] commonly found in the surface. Interestingly, many of these halophilic taxa are also anaerobic and their abundance in the surface sediment suggests fine scale changes in oxygen availability. Subsurface and burrow communities were dominated by taxa consistent with an environment limited in oxygen but abundant in sulfates and nitrates (Figs. 3, 4, [39,42,96]), but the differences between these two types of sediment were less clear. By looking at the OTU taxonomical level, we found groups of OTU from selected families like Desulfobacteraceae that clearly characterize sediment types (Fig. 5, Table S3). This finding suggests that each group of OTU within this family specialize on each sediment type, partitioning the coastal marsh habitat and reducing competition [97]. Physicochemical differences among sediment type appear to be a stronger factor in driving bacterial composition than physical separation between the sediments driving ecological drift.

Based on similarities in OTU relative abundance and occupancy between crab-associated habitats and the sediment pools, burrow and surface sediment bacteria colonized fiddler crabs’ guts and carapaces (Fig. 2). Evidence for influence of the burrow on both crab habitats is better seen at two taxonomic levels. At the family level, Rhodobacteraceae and Flavobacteriaceae were important components of burrow sediment and crab-associated bacterial communities (Fig. 4). These taxa are common to the marine environment and have been previously found in association with crustacean exoskeleton and gut [98,99,100]. At the genus level, different taxa within Desulfobacteriaceae characterized each sediment type, but only burrow taxa matched taxa found associated with crabs (Fig. 5, Table S3). These findings support the idea that both gut and carapace habitats receive colonists from the burrow sediment (Fig. 1b).

In this study, we emphasize the role of the burrow sediment as a source of colonists, yet these results also suggest a two-way colonization feedback between burrow sediments and crab-associated bacteria. There are two reasons why we suggest that burrow to crab colonization is predominant. First, dominant taxa are shared between the subsurface and the burrow, suggesting that these are sediment and not crustacean adapted taxa. Second, Desulfobacteriaceae are known environmental bacteria requiring resources like sulfates and acetates to fulfill metabolic requirements [101]. Desulfobacteriaceae associated with crabs are likely to undergo constant colonization from the burrow, suggesting a source-sink relationship, at least for this group of taxa (also known as mass effects in metacommunity ecology, [102]). However, we want to highlight that the crab’s activity can also have an important effect on the diversity and composition of burrow sediment bacteria. Previous work showed that areas of increased bioturbation are enriched in Vibrio spp. [103,104] and other Proteobacteria [105]. In our study, Vibrionaceae from crab bacterial communities is likely to colonize the burrow sediment. We propose a complex feedback mechanism between the effects of the crab’s activity on burrow sediment physicochemistry, bacterial migration from the crab to the sediment and colonization of crab habitats from burrow sediment bacteria.

We found that crab-associated bacterial communities significantly cluster in groups based on community composition, but only in the carapace (Fig. 6). Contrary to our expectations, the clusters were associated with host species. There may be physiological or behavioral differences between these species that we have not considered. For example, fiddler crab reproductive behavior and physiology varies across species and can influence the associated bacterial communities [65]. In contrast to the carapace bacterial community, there were no significant clusters in gut bacteria (Fig. 6). One possible explanation is that strong filtering in the gut homogenizes these bacterial communities. If this is true, then their composition should remain unchanged and similarity in community composition should be higher [30], yet we failed to find differences in community similarity between gut and carapace bacterial communities. Alternatively, lack of clustering can be explained by considering a gradient in the contribution of multiple colonist pools. On one extreme, guts receive most or all colonists from surface sediment; while towards the other extreme, surface contribution diminishes and burrow contribution increases. Although there are no records of fiddler crab feeding from the burrows, they may be using burrow water when stressed by high temperatures and low water availability [106,107]. In addition, these patterns may be associated with molting stage, or other defining events in the life of each individual crab.

Our results suggest that surface sediment bacteria can influence some gut bacterial communities. While this concurs with our expectations based on crab feeding behavior [67], we found that dominant taxa from the surface sediments were not representative of gut bacterial communities (Figs. 3, 4). Similarly, dominant taxa in the gut were not represented in the surface (Figs. 3, 4). A possible explanation is that rare OTUs from the surface sediment influence the distribution of rare OTUs in the crab’s gut. However, when analyzed the patterns from the 100 most abundant species, we found the same relationship between the gut and the surface sediment bacteria (Fig S2). Influential taxa could thus be intermediate OTUs that are rare within the 100 most abundant taxa. A possible explanation for these patterns is that bacteria able to colonize the crab’s gut survive by remaining inactive and dormant within surface sediment.

The method we use to assign colonist pools to local communities relies on assignments based on similarity to determine species pool influences. This method is limited by the absence of a direct measure of dispersal, which results in two main issues. First, other sources of colonists may influence fiddler crab-associated communities. For example, bacteria in tidal seawater, plants and other crabs could influence fiddler crab associated bacterial communities. However, marsh sediment contains richer and more abundant bacterial assemblages [108] and has a stronger relationship with fiddler crabs based on their activity budgets [67,68,109], when compared to other sources. Second, our supposition that similarity indicates major contribution of a colonist pool assumes that we have identified all the species pools. Even with these limitations, the method used here allows us to identify likely sources of colonists from the host’s surrounding environment with no previous knowledge and it provides results that are comparable to other methods ([84], Fig S3).

Our study highlights colonization from distinct species pools and its consequences for local diversity. Importantly, it suggests that local communities may be influenced by more than one colonist pool and colonist pools can influence more than one local habitat. Grouping species according to their dispersal regimes and large-scale ecological filters (such as sediment type) may result in a more realistic approach to the influence of multiple colonist pools on local communities. This view could aid at bridging our understanding of systems that operate at distinct scales but depend on this balance between colonization and local habitats, such as host-associated bacteria and plant communities. Integrating multiple colonist pools with the traditional evolutionary and biogeographical concept of the species pool may lead to a better understanding of the relationship between colonization, local filters and patterns of diversity across multiple scales.