Introduction

Anthropogenic disturbance has severely affected most terrestrial ecosystems. Habitat loss and fragmentation has now become a common scenario in tropical and subtropical regions (Laurance, 2010), where plant populations are expected to suffer several genetic consequences (Ellstrand and Elam, 1993; Young et al., 1996; Lowe et al., 2005; Aguilar et al., 2008). Furthermore, human-induced changes in animal abundance, composition and movement patterns may alter pollen- and seed-dispersal mutualisms (Dirzo et al., 2014; Neuschulz et al., 2016) that are critical to maintaining plant gene flow and genetic diversity (Sork and Smouse, 2006). Therefore, plants are not only threatened by fragmentation and demographic reduction, but also by the loss of dispersal and gene flow capacities that may ultimately limit their adaptive potential and ability to cope with the fast-paced environmental change (Jump and Peñuelas, 2005).

A population size reduction is expected to be followed by the loss of rare alleles and, subsequently, by a progressive loss of genetic variation through increased genetic drift (Loveless and Hamrick, 1984; Young et al., 1996). Mating among relatives and self-fertilization may also become more frequent in smaller populations that consequently promotes inbreeding and inbreeding depression (Young et al., 1996; Lowe et al., 2005). Limited gene flow should increase the spatial genetic structure (‘SGS’) within continuous populations while promoting genetic differentiation among separate (sub)populations (Loveless and Hamrick, 1984; Lowe et al., 2005).

Evidence for most of these predictions, however, is controversial as indicated by dissimilar, and often contrasting, results (Lowe et al., 2005; Sork and Smouse, 2006; Kramer et al., 2008). In part, such discrepancies may be because disturbance is a very recent phenomenon with respect to the longevity of trees, so that the surveyed adult individuals represented genetic variation in predisturbance times (Lowe et al., 2005; Aguilar et al., 2008). In addition, contrary to initial expectations, some fragmented populations may not suffer a decline in gene flow levels (Kramer et al., 2008; see below). Thus, the detection of the genetic impact will depend on the age of sampled individuals and the time elapsed since the habitat disturbance began, as well as the sensitivity of genetic variation measures to show changes after disturbance. In general, genetic effects have been more easily detected in early-stage (progenies) versus adult individuals, and by changes in allelic richness and SGS versus gene diversity and inbreeding coefficients (Lowe et al., 2005; Aguilar et al., 2008; Wang et al., 2011; Browne et al., 2015; Carvalho et al., 2015).

Elucidating the underlying mechanisms of those genetic consequences poses an additional challenge because multiple related factors may operate simultaneously. For instance, both forest fragmentation and logging, which commonly act in concert in anthropogenically disturbed habitats, may reduce the effective sizes and the genetic connectivity within and among plant populations (Ellstrand and Elam, 1993; Young et al., 1996; Lowe et al., 2005). The former may determine landscape barriers to gene flow among forest remnants and the latter could increase the degree of isolation among reproductive trees. The genetic effects may be difficult to disentangle because both processes are expected to erode genetic diversity and increase inbreeding and spatial genetic structure (Ellstrand and Elam, 1993; Young et al., 1996; Lowe et al., 2005).

Disturbed areas may also be subject to the restriction of contemporary gene flow levels through the loss, or movement limitation, of pollinators and seed dispersers (Sork and Smouse, 2006). For instance, a higher degree of spatial isolation of reproductive individuals and/or restriction in pollinator displacements might limit the ability of plants to exchange pollen. This may reduce the outcrossing rates and the number of pollen donors that fertilize each maternal tree, potentially increasing population inbreeding levels (Lowe et al., 2005; Sork and Smouse, 2006). In particular, several studies have shown that selfing rates tend to increase when adult tree density decreases, as is the case in disturbed areas (Ward et al., 2005). Despite detrimental effects, derived from changes in pollination and mating patterns, have been occasionally documented, there is also evidence that the amount of pollen-mediated gene flow is not necessarily reduced (reviewed in Lowe et al., 2005; Ward et al., 2005; Sork and Smouse, 2006; Aguilar et al., 2008; Kramer et al., 2008). Interestingly, the movement distances of pollinating insects (either native or exotic) are often extensive, and even higher, in fragmented populations, allowing the maintenance of genetic diversity and connectivity (see, for example, White et al., 2002; Dick et al., 2003; Hanson et al., 2008).

Seed dispersal and plant regeneration may be substantially affected by defaunation (Neuschulz et al., 2016). Recruitment patterns may be altered by the extirpation of both seed dispersers and seed predators (Kurten, 2013). The outcome will depend on which of these assemblages are more severely impacted, and the interplay with density-dependent processes that influence mortality rates, such as Janzen–Connell effects (Nathan and Muller-Landau, 2000; Kurten, 2013; Galetti et al., 2015). For instance, given that seed predation mostly occurs near maternal plants, a relative increase of recruitment around adult trees may result from the loss of dispersers and/or seed predators. Instead, overall recruitment levels would be reduced or enhanced, respectively.

So far, little is known about the impact of defaunation on seed-mediated gene flow and plant genetic structure. Frugivore-mediated dispersal shapes the primary distribution of maternal progenies and genotypes across the landscape (García and Grivet; 2011). Each dispersal agent may contribute differentially to the emerging spatial patterns of genetic variation (Hamrick et al., 1993; García and Grivet, 2011). Some of them may perform clumped dispersal of seeds containing highly structured gene pools (see, for example, Bialozyt et al., 2014; Karubian et al., 2015), whereas others promote an admixture of genotypes and a decrease of fine-scale SGS (see,for example, Karubian et al., 2010; Giombini et al., 2016). In general, the loss of dispersers should increase progeny aggregation around maternal plants because of lower seed removal (Hamrick et al., 1993; Kurten, 2013). In addition, an overall restriction of gene dispersal via seeds should increase the strength of SGS (Loveless and Hamrick, 1984; Hamrick et al., 1993; Gehring and Delph, 1999).

Tropical forest remnants are typically deprived of medium to large vertebrates (Peres, 2001; Stoner et al., 2007) upon which many large-seeded plants rely for accomplishing effective and long-distance seed dispersal (Kurten, 2013). Our main goal was to examine the potential early genetic consequences of the loss of large vertebrates for large-seeded vertebrate-dispersed plants. To this end, we compared the genetic variation in early-stage individuals of the large-seeded palm Syagrus romanzoffiana between continuous protected forest and nearby partially defaunated fragments in the Atlantic Forest of South America. This palm is mainly dispersed by the Amazonian tapir (Tapirus terrestris), which is the largest frugivore in the region and highly sensitive to habitat disturbance, and by other medium–large mammals and birds (Olmos et al., 1999; Guix and Ruiz, 2000; Giombini et al., 2009; Sica et al., 2014). Specifically, we addressed whether palm recruits in partially defaunated fragments present lower genetic diversity, higher inbreeding and stronger fine-scale SGS than in protected forest, where the main disperser is still abundant. We also evaluated whether recruitment around adult palms in the fragments was composed of a lower proportion of dispersed recruits (that is, recruits not assigned as the progeny of the focal adult). Finally, we assessed differences in mating patterns (that is, selfing rate and diversity of pollen donors) between areas.

Materials and methods

Study species and region

Syagrus romanzoffiana (Cham.) Glassman is widely distributed in the southeast of South America, and one of the most common palms in the Atlantic Forest. It is a monoecious species mainly pollinated by bees and beetles. Its mature infructescences produce 500–1000 single-seeded drupes of 2–3 cm, with an orange fleshy pericarp. The fruiting period extends most of the year, thus offering a keystone resource for numerous frugivores during periods of fruit scarcity (Keuroghlian and Eaton, 2008; Genini et al., 2009). Vertebrates that consume these fruits (Guix and Ruiz, 2000) are far more numerous than those capable of ingesting and effectively dispersing the seeds (Giombini et al., 2016). Besides tapirs, other potentially important dispersers are crab-eating foxes, coatis and large birds (for example, cracids and toucans). Some smaller frugivores (for example, crab-eating racoons, several rodents) may occasionally contribute to seed dispersal. In continuous protected areas, the density of early-stage recruits is typically highest in tapir latrines, and is higher in the vicinity of adult palms than in random sites (Giombini et al., 2009; Sica et al., 2014). Only few vertebrates predate S. romanzoffiana seeds, including some terrestrial rodents and, mainly, squirrels. The larva of the weevil Revena rubiginosa kills the seed embryo in early developmental, predispersal stages and is probably the main seed predator (Olmos et al., 1999; Mendes et al., 2016).

This study was conducted in the north of Misiones Province, northeastern Argentina (Supplementary Figure S1) that represents the southern limit of the Upper Paraná Atlantic Forest of South America. This ecoregion has been globally reduced to 8% of its original extension during the second half of the twentieth century, and currently presents a high degree of fragmentation (Di Bitetti et al., 2003). The study region contains both large areas of protected continuous forest and disturbed forest fragments that were originated by human settlement and local demographic growth since five decades ago. Sampling was conducted in the Iguazú National Park (henceforth ‘protected area’), which is a 58 600 ha natural reserve, and five forest remnants (‘disturbed area’). Both areas have been connected by continuous forest until fragmentation began (Supplementary Figure S1). The forest fragments range from 20 to 150 ha and originated <40 years ago. They are located in the surroundings (2.5–5.5 km distant) of the rural locality of Comandante Andresito that is 10 km east of Iguazú National Park eastern limit (Supplementary Figure S1). The protected area harbours a considerable richness and abundance of large vertebrates, including tapirs and the remaining potential dispersers of S. romanzoffiana. The disturbed area includes forest fragments that are partially defaunated (Iezzi et al., unpublished) while preserving similar vegetation structure and composition with respect to the protected area. In particular, the tapir is locally extinct. Coatis and large birds (that is, cracids and toucans) are still present with roughly similar abundances in the two areas, whereas foxes seem to be more abundant in the disturbed area. Agoutis and pacas, two medium–large rodents that probably play a minor role in effective dispersal of S. romanzoffiana, seem to be scarcer in the disturbed area (Iezzi et al., unpublished).

In the study region, S. romanzoffiana adults are normally found at densities of 10–20 individuals per ha, with a fairly random spatial distribution (Olmos et al., 1999). Notably, disturbed areas, such as logged fragments and even crop fields, often retain similar adult densities. This is probably because this palm has a well-recognized cultural and ornamental value among native people. In addition, S. romanzoffiana is not considered a valuable timber species and local inhabitants usually avoid cutting its stem because it damages saws and chainsaws. Therefore, the spatial distribution of adults does not strictly resemble the pattern of landscape fragmentation but exhibits a higher degree of continuity. The similarity in densities and spatial distribution between protected and disturbed areas should ameliorate the effect of some potentially confounding factors on genetic variation (for example, pollination distances, mating patterns and number of nearby maternal seed sources; see Discussion).

Sample collection and sampling schemes

Leaf tissue was collected from early-stage individuals (hereafter ‘recruits’) and adult palms representing candidate maternal sources (see below). Recruits corresponded either to seedlings presumably <1 year old, or saplings (early juveniles) >1 year old, with wider (but still not pinnate) leaves.

Consequences of seed-dispersal limitation on genetic variation and spatial genetic structure

The first sampling scheme (Table 1) was carried out to detect the potential short-term consequences of seed-dispersal limitation, with a particular focus on tapir extinction, on genetic variation and fine-scale SGS of recruits. We sampled recruits from plots of 100 m × 10 m (‘100 × 10 m plots’) established in four forest fragments and four sites in the protected area (Table 1; Supplementary Figure S1) in order to estimate the genetic diversity, inbreeding levels and fine-scale SGS. In two sites (D4 and P1), extremely high-density vegetation precluded sampling from some portions of the plot, and hence the sampling was extended beyond 100 m (up to 120 m) in order to compensate for the unsampled portion (Supplementary Figure S2). All recruits within each plot were counted and mapped by defining an X–Y coordinate system and then a subset of 51–52 recruits were sampled at random, seeking to encompass the entire plot (Supplementary Figure S2). Within the protected area, recruits tend to be spatially associated with tapir latrines (Giombini et al., 2009; Sica et al., 2014). In addition to its prominent contribution to S. romanzoffiana recruitment, this large frugivore shows the most contrasting abundances between the protected area and forest fragments (that is, abundant and absent, respectively). Therefore, plots in the protected area were located so that they included tapir latrines. The number of plots and number of individuals per plot were chosen in accordance to the compromise between each other for permitting an affordable experimental effort and cost. Whereas sample sizes of 50 individuals may be somewhat borderline for detecting SGS (Epperson, 2005), sampling from four sites is preferable than from a single site, as patterns of SGS often show considerable heterogeneity (Smouse et al., 2008). In addition, the use of highly variable markers (that is, microsatellites) may also aid in estimating SGS accurately (Epperson, 2005). Because the local density of nearby adults (potential contributing mothers) may affect the strength of the spatial genetic structure of recruits (Hamrick et al., 1993; Vekemans and Hardy, 2004), we also counted and mapped the adults within plots extended (symmetrically) to 140 m × 40 m (Supplementary Figure S2).

Table 1 Summary of sample collection of Syagrus romanzoffiana recruits in a disturbed and a protected area of the Atlantic Forest in the northeast of Argentina

Comparison of parental sources of progenies recruited around adult palms

The second sampling design was conducted both to evaluate whether the disturbed area showed a stronger aggregation of progenies around maternal trees and whether mating patterns pointed to lower levels of pollen-mediated gene flow. In each area, eight focal adults and a subset of recruits established around them were sampled to estimate the proportion of dispersed recruits, the diversity of pollen donors and the selfing rate. At each focal site, a 3 m radius circular plot centred on the adult stem was delimited and 10–16 recruits (≈15 on average) were randomly sampled. Leaf tissue from the focal adult (candidate mother) was also collected. Focal adults in the protected area were located along a 4 km road section. In the disturbed area, focal adults were situated in four forest fragments, three of which were the same as those used for estimating genetic diversity and SGS (Supplementary Figure S1). In all cases, focal adults were separated by at least 350 m.

DNA extraction and genotyping

We extracted DNA from 5–35 mg leaf tissue with the NucleoSpin 96 Plant II kit (Macherey-Nagel, Düren, Germany) and genotyped all individuals using a set of nine, previously characterized microsatellites (http://tomato.bio.trinity.edu/manuscripts/12-6/mer-12-0111.pdf). The nine microsatellites, genotyping procedures and the estimation of genotyping errors and null alleles were described in detail in Giombini et al. (2016). Briefly, the frequency of null alleles was <0.05 for all loci and the allele typing error rate per locus was <0.01. Despite their very low occurrences, null alleles and genotyping errors were considered in maternity analyses as described by Giombini et al. (2016). The whole data set contained <0.5% of missing data. The genotypes of all individuals (N=674), as well as geographic coordinates of sampling locations, were deposited at Dryad (doi:10.5061/dryad.s0vf6).

Data analyses

Genetic diversity

Allelic richness (that is, the number of different alleles) and gene diversity (that is, the expected heterozygosity under random mating) were estimated using SPAGeDi 1.4 (Hardy and Vekemans, 2002). Allelic richness was corrected to standardized (rarefacted) sample sizes (that is, equal number of individuals and genes) (Kalinowski, 2004). Gene diversity was calculated according to the unbiased estimator of Nei (1978). Both genetic diversity estimates were compared between the protected and disturbed area by the Wilcoxon matched-pairs (one-sided) test, using STATISTICA 6.0 (StatSoft, Inc., 2001). This test corresponds to a nonparametric randomized blocks design for paired comparisons (Sokal and Rohlf, 1995). For our purposes, the two areas (protected and disturbed area) were the treatments of interest, whereas loci represented the blocks. Thus, variation in genetic diversity levels among loci, which may arise from different mutation rates, potential selective constraints and so on, was controlled by pairing treatments by locus (Kalinowski, 2004).

Inbreeding

The inbreeding coefficient (Fis) was computed as the kinship coefficient (estimator defined in Loiselle et al., 1995, see further details below) between homologous genes (alleles) within individuals (Hardy, 2003) using SPAGeDi 1.4. This coefficient estimates the probability of identity between two alleles drawn from the same individual with respect to the probability of identity among two alleles randomly sampled from the ‘reference population’, and provides a measure of the departure from random mating (Hardy, 2003; Vekemans and Hardy, 2004). The ‘reference allele frequencies’ (Hardy, 2003), which are required for this and other estimations (see below), were obtained by pooling the genotypes of 819 individuals (including those analysed herein) sampled across a broad area in the north of Misiones province. These reference allele frequencies are provided in Giombini et al. (2016). A one-sided test was applied to assess whether inbreeding coefficients were significantly higher than expected under random mating, after generating the null distribution based on 1000 permutations of alleles among individuals (see SPAGeDi 1.4 user’s manual).

Fine-scale SGS

We characterized the fine-scale SGS of recruits within the 100 × 10 m plots according to Vekemans and Hardy (2004). This approach is based on the analysis of spatial genetic autocorrelation (that is, the relation between pairwise kinship coefficients and spatial distance among individuals) and was conducted with SPAGeDi 1.4 (Hardy and Vekemans, 2002). We used the kinship estimator (ρij) proposed in Loiselle et al. (1995) that is generally the most accurate and does not suffer biases in the presence of low frequency alleles (Vekemans and Hardy, 2004; see SPAGeDi 1.4 user’s manual). Kinship-distance curves were characterized by the slope of the regression (b) of pairwise kinship values on the natural logarithm of physical distance (Vekemans and Hardy, 2004). The overall strength of SGS was quantified by the Sp statistic, which is defined as Sp=−b/(1-ρ1), where ρ1 is the average kinship between nearest-neighbour individuals (that is, individuals in the first distance class) (Vekemans and Hardy, 2004). Sp provides a single summary statistic that synthesizes the strength of SGS and is fairly insensitive to the sampling scheme. Moreover, as is the case for b, it does not depend on the set of distance intervals arbitrarily chosen for constructing autocorrelograms. This statistic is thus suitable for measuring the strength of SGS on fine spatial scales and for comparing values among populations, sites or samples under different conditions (Vekemans and Hardy, 2004; Wang et al., 2011). The SGS patterns were visualized by constructing autocorrelograms with six distance intervals (I1: 0–5 m, I2: >5–10 m, I3: >10–20 m, I4: >20–40 m, I5: >40–80 m and I6: >80—maximum distance), so that the estimates of average kinship coefficients were supported by more than 100 pairwise comparisons for most distance intervals. Standard errors for the estimated b and ρ1 values were obtained by a jackknife procedure over loci. Statistical significances were assessed after obtaining the 95% confidence limits of the null distribution by performing 1000 random permutations of the spatial locations of individuals within plots and evaluating whether the observed b or ρ1 values were lower or higher than the 5% lower or upper limits of the null distribution (one-sided test), respectively. The P-values for the eight plots were then adjusted for multiple testing by the sequential Bonferroni procedure, declaring significance at an experiment-wise level of α=0.05. The statistical comparison of ρ1, b and Sp between the disturbed and protected area was conducted by using per-locus estimates averaged across sites in each area and applying a one-sided Wilcoxon matched-pairs test, analogously to the comparison of genetic diversity. In addition, for descriptive purposes, average multilocus values of ρ1, b and Sp were computed from the four sites in both areas.

We complemented the above-mentioned analysis by evaluating the heterogeneity in fine-scale SGS patterns within and among sampling sites and areas. For this purpose, we applied the method developed by Smouse et al. (2008), as implemented in GenAlEx 6.5 (Peakall and Smouse, 2012). This method is based on a genetic correlation coefficient (r) that is closely related to the kinship coefficient because it depends on the proportion of alleles shared between individuals. However, r does not include information on external allele frequencies of a reference population (Smouse et al., 2008). Using the multipopulation testing approach (see Smouse et al. (2008), and GenAlEx user’s guide), we specifically evaluated: (1) whether the combined patterns of genetic autocorrelation in each area indicated a nonrandom distribution of genotypes across space (by testing the null hypothesis that r=0 for all distance intervals of the resulting, multisite correlogram by the ωD statistic in each area), (2) whether the four autocorrelograms in each area were homogeneous (by means of the ωE statistic in each area), (3) whether the overall patterns of genetic autocorrelation were significantly different between the disturbed and protected area (by comparing the resulting correlograms between areas by the ωE statistic) and (4) which distance intervals showed more divergent r values between the disturbed and protected area (by comparing r values at each distance interval between the resulting correlograms by the t2 statistic). Statistical significances were determined based on 1000 random permutations of individual spatial positions and a significance level of α=0.01.

Proportion of dispersed recruits around adults

The proportion of recruits not mothered by the focal adult (candidate mother) in the 3 m radius circular plots was estimated by three maternity exclusion methods: (1) full-pedigree maximum likelihood (hereafter ‘ML’) reconstruction (Wang and Santure, 2009, 2) ML estimation of pairwise relationships for recruit–candidate mother dyads (Wagner et al., 2006) and (3) crude exclusion analysis based on at least one and two mismatching loci (Wang, 2010). They were implemented with COLONY v2.0.5.0 (Jones and Wang, 2010), ML-RELATE (Kalinowski et al., 2006) and both MicroErrorAnalyzer (Wang, 2010) and WHICHPARENTS 1.0 (Eichert, 1999), respectively. These three alternative approaches were used to obtain a more robust comparison of the relative amounts of dispersed recruits, although the former generally outperforms the others, because it exploits more information from the data (Wang, 2012). The accommodation of genotyping errors and potential occurrence of null alleles, as well as further details of the implementation of the three methods, are explained in Giombini et al. (2016). We tested whether the relative amount of dispersed recruits, as inferred from full-pedigree ML reconstruction, was lower in the disturbed than in the protected area by the nonparametric (one-sided) Mann–Whitney test implemented in STATISTICA 6.0 (StatSoft, Inc., 2001).

Selfing rate and diversity of pollen donors

The selfing rate (s) and diversity of pollen donors were estimated based on the pedigree relationships and parental genotypes of the recruits sampled around adult palms, as inferred by full-pedigree reconstruction with COLONY (Wang et al., 2012). Those recruits not assigned as the progeny of the focal adults (that is, dispersed recruits) were discarded in these estimations, because the inferred parental genotypes cannot be assigned unambiguously to paternal (pollen) and maternal (seed) sources. We thus obtained a data set analogous to those based on progeny arrays in which the maternal (focal adult) genotype is known and the inference of paternal genotypes is straightforward. We computed s as the proportion of recruits for which the focal adult was assigned as both the mother and the father (Wang et al., 2012). Provided the method developed by Wang et al. (2012) is based on pedigree reconstruction without requiring a priori information on any parental genotype, we also obtained an alternative estimation of selfing rate (s') from the group of recruits sampled in the 100 × 10 m plots.

The diversity of pollen donors was quantified both in terms of the absolute number of different contributing fathers or ‘paternal richness’ (Np) and the effective number of contributing fathers or ‘paternal diversity’ (Nep) that is informative about the number and evenness of paternal contributions. Owing to the unequal number of nondispersed recruits among focal sites, Np was estimated as a function of the number of nondispersed recruits using the rarefaction/extrapolation methods (‘'individual-based abundance data’) implemented in EstimateS 9.0 (Colwell, 2013). The effective number of fathers was calculated as Nep=1/PPI, where PPI is the unbiased estimator of the probability of paternal identity (Smouse and Robledo-Arnuncio, 2005), computed with an analogous equation to that described in Scofield et al. (2011) for the probability of maternal identity (q*gg). We applied the nonparametric (two-sided) Mann–Whitney test for comparing s, Np for standardized samples of 7 and 16 recruits (Np(7) and Np (16), respectively), and Nep between the protected and disturbed area using STATISTICA 6.0.

Results

Genetic diversity and inbreeding

A total of 414 recruits sampled from the 100 × 10 m plots in the disturbed (N=208) and protected (N=206) area were genotyped (Table 1). Allelic richness was significantly higher in the protected area, as revealed by a higher number of alleles across all but two loci (Table 2, Twilcoxon=2, P=0.013). EE54 was uninformative in this comparison as it only presented two alleles (Table 2) that occurred at least once in all plots. Gene diversity was identical in both areas (Table 2, Twilcoxon=19, P=0.339).

Table 2 Summary statistics of genetic diversity of Syagrus romanzoffiana recruits sampled in a disturbed and a protected area of the Atlantic Forest in the northeast of Argentina

Inbreeding coefficients were consistent with random mating patterns for all plots (Table 3) and showed a lower average value in the disturbed vs protected area (Fis=0.015 vs Fis=0.038, respectively).

Table 3 Inbreeding levels and spatial genetic autocorrelation of Syagrus romanzoffiana recruits sampled in a disturbed and a protected area of the Atlantic Forest in the northeast of Argentina

Fine-scale spatial genetic structure

Two disturbed and one protected sites showed a significant decay of kinship with distance among recruits (Table 3). Nearest-neighbour recruits (0–5 m distance interval) exhibited higher kinship values than expected under no spatial structuring in three out of four sites in both areas (Table 3 and Figure 1). A more negative average regression slope along with a higher average kinship among nearest neighbours resulted in an almost twofold average Sp statistic in the disturbed area (Sp=0.019 vs Sp=0.010; Table 3). Seven out of nine loci yielded higher ρ1, more negative b and higher Sp values in the disturbed area. These differences were significant or marginally significant (Twilcoxon=7, P=0.033 for comparison of ρ1, and Twilcoxon=11, P=0.087 for comparison of both b and Sp).

Figure 1
figure 1

Spatial genetic autocorrelograms for Syagrus romanzoffiana recruits sampled in a disturbed and a protected area of the Atlantic Forest in the northeast of Argentina. The autocorrelograms for the four disturbed (D1–D4) and protected (P1–P4) sites are shown in left and right columns, respectively. All x axes represent distance in m (logarithmic scale). Average kinship values for each distance interval (black markers connected by lines) are indicated at the upper limit of the interval. Distance intervals are as follows: 0–5 m, >5–10 m, >10–20 m, >20–40 m, >40–80 m and >80–120 m. Also shown are the average expected values under random spatial distribution of genotypes (grey continuous line) and their 95% confidence limits (grey dotted lines). Single-site (grey lines and markers) and multisite average (black lines and markers) autocorrelograms are plotted together at the bottom panels.

As suggested by the visual inspection of the correlograms (Figure 1), the results of the heterogeneity analyses (Supplementary Table S1) indicated that: (1) the genotypes of the sampled recruits were nonrandomly distributed across space in both the disturbed (ωD=68.87, P=0.001) and protected areas (ωD=66.56, P=0.001), (2) the genetic autocorrelation patterns were quite heterogeneous among sites in both the disturbed (ωE=51.78, P=0.001) and protected areas (ωE=47.15, P=0.001), (3) the overall patterns of genetic autocorrelation differed significantly between areas (ωE=35.22, P=0.001) and (4) the larger differences in the overall patterns of genetic autocorrelation between areas occurred for the last distance intervals; the r value was higher in the disturbed area at the 40–80 m interval (r=−0.014 vs r=−0.038, t2=8.31, P=0.006) and lower at distances of >80 m (r=−0.045 vs r=0.027, t2=11.77, P=0.001). The r value for the first distance interval was slightly but not significantly higher in the disturbed area (r=0.086 vs r=0.067, t2=1.88, P=0.173).

Proportion of dispersed recruits around adults

A total of 244 recruits sampled around 16 focal adults in the disturbed (N=128) and protected (N=116) areas were genotyped (Table 4). All maternity exclusion methods indicated a lower proportion of dispersed recruits around adults in the disturbed area (Figure 2). The median percentages (first–third quartiles) of dispersed recruits inferred by full-pedigree ML reconstruction, which should provide the most accurate estimates, were 0.0 (0.0–15.6) and 14.4 (0.0–39.3) in the disturbed and protected area, respectively (Figure 2). The difference, however, was not significant (UMann–Whitney=20.5, P=0.117).

Table 4 Mating pattern parameters (selfing rate and diversity of pollen donors) in Syagrus romanzoffiana estimated from recruits sampled around focal adults in a disturbed and a protected area of the Atlantic Forest in the northeast of Argentina
Figure 2
figure 2

Percentages of Syagrus romanzoffiana dispersed recruits sampled around conspecific adults in a disturbed and a protected area of the Atlantic Forest in the northeast of Argentina. Shown are the values of median (black thick lines with markers), first and third quartiles (bottom and top of the boxes) and entire range (whiskers) estimated from the eight focal sites in each area. Values are presented for the four maternity exclusion methods. Crude-1 locus and Crude-2 loci, crude exclusion methods based on at least one and two mismatching loci, respectively (implemented with WHICHPARENTS); Full ML, full-pedigree maximum likelihood method (implemented with COLONY); Pair ML, pairwise maximum likelihood method (implemented with ML-RELATE).

Selfing rate and diversity of pollen donors

Average selfing rates estimated from the focal sites (n=8, s=0.01±0.01 s.e. and s=0.04±0.04 s.e. for disturbed and protected area, respectively; Table 4) were different from those obtained from recruits sampled in the 100 × 10 m plots (n=4, s'=0.12±0.01 s.e. and s'=0.11±0.01 s.e. for disturbed and protected area, respectively; Supplementary Table S2). In any case, and regarding our purposes, the estimated values did not differ significantly between areas, as indicated by the Mann–Whitney test applied for the comparison of s (U=27, P=0.955; Table 4) or by the largely overlapping distributions of s' estimates (Supplementary Table S2).

The richness and diversity of pollen donors were also similar between areas, with not (or marginally) significantly lower values in the disturbed vs protected area (Table 4; Mann–Whitney test for comparisons of Np (7), Np (16) and Nep, respectively: U=16, P=0.189; U=24.5, P=0.694 and U=13, P=0.094).

Discussion

In contemporary times, plants are probably suffering from various short-term genetic consequences because of anthropogenic disturbance (Aguilar et al., 2008). However, empirical evidence indicates that plant populations may respond in a complex or subtle way and hence the genetic impact may not be easily predicted (Lowe et al., 2005; Kramer et al., 2008). Elucidating the causes and consequences of the changes in the patterns of plant gene flow is crucial for defining conservation actions. Overall, our findings are strongly suggestive of some early genetic consequences resulting from the limitation in contemporary gene flow via seeds, but not pollen, in defaunated areas.

First, allelic richness, but not gene diversity, was significantly lower in the disturbed area. Such pattern may be expected from the rapid loss of low frequency (rare) alleles soon after population shrinkages (Young et al., 1996), as has been detected in some small or fragmented populations (see, for example, Lowe et al., 2005; Jump and Peñuelas, 2006). However, a reduction of allelic richness is unlikely to occur as a consequence of a genetic bottleneck for long-lived trees when <50 years have elapsed since fragmentation (Aguilar et al., 2008; Wang et al., 2011; Browne et al., 2015), as it is the case for the present study. Moreover, as mentioned above, the fragmentation and potential decline of the effective population size in S. romanzoffiana was probably not very severe. Thus, although not surveyed, we do not expect that the parental populations (S. romanzoffiana adults) in the study areas showed different allelic richness to account for the difference observed in recruits. Therefore, in the context of our study, the difference in allelic richness may be interpreted more plausibly as a spatial restriction for the occurrence of certain alleles in defaunated sites, as a consequence of gene dispersal limitation, rather than the loss of those alleles by a population bottleneck.

Second, the disturbed area showed an overall trend toward a stronger fine-scale SGS. The differences of b and Sp among areas, however, were marginally significant and two disturbed sites (D2 and D4) showed no significant decrease of kinship with distance (Table 3). The moderate difference between areas is consistent with the fact that the disperser assemblage in the disturbed area is only partially depleted. Thus, the remaining dispersers probably contribute, in some degree, to the admixture of genotypes over the spatial scale studied here. In addition, SGS patterns were fairly heterogeneous, suggesting that several factors, beside seed dispersal levels, were also involved in shaping the SGS at each site. Such heterogeneity seems to be the norm in empirical studies evaluating SGS from different sites, populations or conditions (see, for example, Gehring and Delph, 1999; Smouse et al., 2008; De-Lucas et al., 2009; Zeng et al., 2012).

The differences in allelic richness and SGS between areas may be partly accounted for by the contribution of tapir-mediated dispersal, as sampling sites in the protected area included tapir latrines. This frugivore has been shown to promote the recruitment of unrelated neighbouring seedlings and increase the number of alleles and maternal genotypes as more defecations are accumulated (Giombini et al., 2016). Therefore, the observed patterns would be somewhat biased towards a genetic footprint left by tapirs. As our sampling scheme does not allow us to tease apart the relative contribution of tapirs and other dispersers, we cannot determine the extent to which the former could explain the differences among the protected and disturbed area. Given that tapir latrines (and single defecations) containing plant recruits are quite widespread across well-protected areas (Fragoso and Huffman, 2000; Giombini et al., 2009; Barcelos et al., 2013), the effect on the emerging patterns of genetic variations might be significant. Addressing this issue would be valuable because tapirs are among the most sensitive vertebrates to habitat disturbance and their extirpation is becoming increasingly common in South America (Peres, 2001; Stoner et al., 2007; Cruz et al., 2014).

Other factors potentially influencing SGS patterns and allelic richness did not show significant differences between the disturbed and protected area. For instance, a higher local density of adult palms may reduce SGS because of a larger overlap of maternal seed shadows (Hamrick et al., 1993; Vekemans and Hardy, 2004). Nonetheless, the natural density of S. romanzoffiana adults is fairly uniform across the entire study region and is not expected to be currently reduced in the disturbed area. In fact, the number of nearby adults was not higher in the protected sites (4.25 vs 5.25 on average; Table 1) and we found no correlation between Sp values and the number of nearby adults (Rspearman=0.22, P=0.600).

An increase in inbreeding levels is a potential short-term consequence of habitat disturbance (see, for example, Lowe et al., 2005; Jump and Peñuelas, 2006; Aguilar et al., 2008) and may affect and be affected by SGS patterns (Loveless and Hamrick, 1984; Ellstrand and Elam, 1993). Inbreeding is largely determined by selfing rates and mating among closely related individuals (that is, biparental inbreeding). Remarkably, neither selfing rates nor overall inbreeding were higher in the disturbed area. In addition, the richness and diversity of pollen donors were also similar in both areas. The diversity of pollen donors, however, showed a marginally significant higher value in the protected area, and thus cannot be completely discarded as a factor contributing to the higher allelic richness and weaker SGS in that area. Overall, our results suggest that patterns of pollen-mediated gene flow are not severely affected in the disturbed area.

No previous assessment of S. romanzoffiana mating system has been reported so far. Measured selfing rates varied depending on the method used for the estimation. This is not surprising because each method involved a different type of sample. Whereas the recruits sampled from the focal sites represented the progeny of a single, known maternal source, those sampled from the 100 × 10 m plots most probably originated from multiple, unknown parents. Regardless of the estimation method, global averages were lower than 0.20 (s=0.02±0.08 s.d., n=16 and s'=0.11±0.02 s.d., n=8, respectively). Therefore, based on these rough estimates, and as previously suggested by Giombini et al. (2016), S. romanzoffiana may be considered a predominantly outcrossing species (Goodwillie et al., 2005). Such mating pattern seems to be pervasive among palms and other tree families in the Neotropics (Ward et al., 2005; Nazareno and Dos Reis, 2012; and references therein). However, selfing rates could have been underestimated by our procedure if some selfed progeny underwent embryo abortion and/or differential mortality at the seed or early seedling stages (Husband and Schemske, 1996).

Another suggestive signal of restricted seed dispersal was revealed by the comparison of the proportion of dispersed recruits around adults. A stronger spatial aggregation of seeds or seedlings around conspecific (putative maternal) adults in defaunated areas seems to be a widespread pattern for animal-dispersed plants (Kurten, 2013). The number of S. romanzoffiana recruits around adults was indeed larger in the disturbed area (Sica et al., 2014; see also Table 4). It is unlikely that a decrease in seed predation is responsible for this pattern, although we cannot rule it out completely. Seed predators of S. romanzoffiana should be affected less drastically than dispersers, because the former involve smaller vertebrates (for example, squirrels and small rodents) that are more resilient to disturbance than the latter. Remarkably, vertebrate seed predation of another palm has been shown to increase in defaunated areas in the Atlantic Forest because of higher abundance or activity of small rodents, including squirrels (Galetti et al., 2015). Furthermore, a potential decrease in vertebrate seed predation is likely to be compensated, at least partially, by an increase in invertebrate seed predation (Kurten, 2013; Mendes et al., 2016). In particular, a negative correlation has been found between seed predation by squirrels and invertebrates in S. romanzoffiana, and this is as expected because the former prey not only on seeds but also on the invertebrate larvae (Mendes et al., 2016).

In addition to being more numerous, recruits around adults in the disturbed area consisted of a lower proportion of dispersed individuals (that is, those originated from a maternal source other than the focal adult). This latter result cannot be explained by a lower rate of seed predation per se. Instead, this probably reflects the lower abundance of dispersal agents. Few studies have so far documented this pattern using genetic analysis (Pacheco and Simonetti, 2000; Wang et al., 2007), although it might be a widespread signature in the so-called ‘empty’ or ‘half-empty’ forests (Kurten, 2013). However, we cannot be conclusive on this respect as the observed differences, although consistent across several maternity exclusion methods, were not significant. A limited statistical power might be, at least partially, responsible for such result.

Overall, our results pointed consistently to a disturbance on the same underlying process, that is, the restriction of gene dispersal via seeds. The contrasting genetic patterns revealed in this study may be signalling potentially deleterious effects in the short and long term that would result from the loss of genetic variation and an increase in SGS. However, because of selective and demographic effects occurring throughout the life cycle, patterns of genetic structure in early cohorts may or may not be imprinted in subsequent stages and generations (Loveless and Hamrick, 1984; García and Grivet, 2011; Bialozyt et al., 2014). Further research is therefore needed to examine the extent to which these incipient changes in genetic patterns are ultimately detrimental to the viability and adaptive potential of plant populations.

As already mentioned, many plant populations are capable of maintaining, and even increasing, levels of pollen-mediated gene flow in disturbed areas. Remarkably, such capability has been verified in both wind- and insect-pollinated species, likely induced by opening vegetation structure and increasing pollinator foraging distances, respectively (see Lowe et al., 2005; Kramer et al., 2008; and references therein). In contrast, an increment of seed-mediated gene flow in deforested landscapes has been only reported for wind-dispersed species (Bacles et al., 2006), whereas opposite trends have been revealed in animal-dispersed species (Wang et al., 2011; Browne et al., 2015; this study). For instance, a significant increment of the SGS in recruits, but not adults, occurred in recently (30–40 years) fragmented areas in the neotropical, vertebrate-dispersed palm Oenocarpus bataua (Browne et al., 2015). Likewise, Wang et al. (2011) detected an increase of SGS in cohorts of the tropical Asian tree Castanopsis sclerophylla recruited in post-fragmentation times (<50 years ago). The results from both studies suggested that the stronger SGS in the fragmented area was more likely a consequence of a restriction in animal-mediated seed dispersal than in pollen flow. Based on such evidence along with our results, we may propose that an increment in SGS should be occurring quite frequently in post-disturbance generations of tropical trees dispersed by frugivorous vertebrates sensitive to anthropogenic disturbance. Moreover, as our findings concern the consequences of partial (modest) defaunation levels, a stronger genetic impact should be expected for more severely defaunated areas. Although the protection of plant-pollinators mutualisms, and associated gene flow, is undoubtedly desirable to sustain genetically viable plant populations, preservation of plant-disperser mutualisms, largely threatened in tropical systems, should also be imperative.

Data archiving

Data available from the Dryad Digital Repository: http://dx.doi.org/10.5061/dryad.s0vf6.