Introduction

Island biogeography is concerned with colonization and diversification of organisms on islands, including empirical tests of evolutionary hypotheses pertaining to gene flow in dispersal-limited organisms1,2. Islands are geographically widespread and diverse, and vary in shapes and sizes, age and geologic origins, and show different degrees of isolation3. Darwin already recognized that this combination of attributes makes islands appealing objects of scientific study4. Modern biogeography recognizes the interplay among island histories, the specifics of their geography, and various attributes of organisms that inhabit them5.

Amongst island systems some of the best studied in terms of biogeographic research are Hawaii6,7,8, Galapagos9,10,11, Azores12,13,14, Canary15,16,17 and Solomon18,19,20 islands, as well as large continental fragments such as Madagascar21,22,23 and New Zealand24,25,26. However, the Caribbean island system27,28,29,30 is the single most ‘published’ island system in biogeography literature (Google Scholar title hits 237 compared with 195 for the second, New Zealand). The Caribbean Basin, also known as West Indies, lies in the tropical zone between South and North American continents, and to the east of the Gulf of Mexico. Combining over 700 islands, the Caribbean is considered among the world’s biodiversity hotspots31,32,33. In its most broad categorization, the Caribbean comprises three regions: (1) Greater Antilles with the largest islands of Cuba, Hispaniola (Dominican Republic and Haiti), Puerto Rico and Jamaica representing 90% of all land in Caribbean Sea; (2) Lesser Antilles with numerous smaller, mostly volcanic, islands and (3) the Lucayan platform archipelago (the Bahamas).

The Greater Antillean islands of Cuba, Hispaniola, and Puerto Rico, but not Jamaica, are parts of the old proto-Antillean arc that began its formation over 130 million years ago (MYA). Through Caribbean plate tectonics, the proto-Antillean arc drifted eastward until settling at its current location around 58 MYA34,35. Researchers disagree on the timing of the proto-Antillean arc connection with South or North America in the Cretaceous or even on the existence of such a connection. However, that distant past may have had little biological relevance for current biotas due to a catastrophic effect of the bolide that crashed into Yucatan around 65 MYA which arthropods would likely not have survived36,37,38. The emergence of the Greater Antilles as relevant biogeographic units is therefore more recent. Various studies estimate that earliest contiguous permanent dry land on the Greater Antilles has existed since the middle Eocene, approximately 40 MYA38,39,40,41,42.

Although it may be possible that the Greater Antilles have remained isolated from continental landmasses since the early Cenozoic, a hypothesized land bridge potentially existed around 35–33 MYA38,43. This land bridge, known as GAARlandia (Greater Antilles – Aves Ridge), is hypothesized to have connected the Greater Antilles with the South American continent for about 2 million years, due to a sea level drop and subsequent exposure of land at Aves Ridge. As a means of biotic evolution on the Greater Antilles, the GAARlandia hypothesis allows for a combination of overland dispersal and subsequent vicariance and can be tested with the help of time calibrated phylogenies and fossils. While patterns of relationships that are consistent with predictions based on GAARlandia have been found in some lineages44,45,46,47,48 it is not a good model for explaining the biogeographical history of others27,29,49,50.

Among the islands forming the Greater Antilles, Jamaica is a geological special case since it was originally a part of the Central American tectonic plate. Jamaica emerged as an island around 40 MYA but remained partially or fully submerged until its reemergence in mid-Miocene around 15 MYA30,51,52,53, and was never part of the hypothetical GAARlandia landbridge. Consequently, Jamaica’s biota is distinct from other regions of Greater Antilles54.

The Lesser Antilles formed more recently. Northward of Guadalupe they split into two arches of distinct origins. The older, outer arc formed volcanically in Eocene-Oligocene, but its islands are largely composed of limestone signifying that they were submerged and have undergone orogenic uplift since the Miocene. The Lesser Antilles’ inner arc is of more recent volcanic origin (<10 MYA) and its islands continue to be formed55,56,57,58,59. With no history of continental connection, most of the Lesser Antilles have been completely isolated for at least a few million years, and thus their biotas must have originated via overwater dispersal30,60,61.

Spiders and other arachnids are emerging as model organisms for researching biogeography of the Caribbean27,44,48,62,63,64. Spiders are globally distributed and hyperdiverse (~47,000 described of roughly 100,000 estimated species65,66) organisms that vary greatly in size, morphology and behavior, habitat specificity, and importantly, in their dispersal biology67,68,69. While some spiders show good active dispersal70, others are limited in their cursorial activities but exhibit varying passive dispersal potential. Many species are able to passively drift on air currents with behavior called ballooning67,71 to colonize new areas. Some genera of spiders, like Tetragnatha or Nephila are known to easily cross geographic barriers, disperse large distances, and are one of the first colonizers of newly formed islands72,73,74. These are considered to be excellent aerial dispersers, while other lineages are not as successful. For example, the primitively segmented spiders, family Liphistiidae and the mygalomorph trapdoor spiders, likely do not balloon and have highly sedentary lifestyle imposing strict limits on their dispersal potential. As a consequence, bodies of sea water or even rivers represent barriers that limit their gene flow, which leads to micro-allopatric speciation75,76,77,78,79,80. Unlike the above clear-cut examples, the dispersal biology of most spider lineages is unknown, and their biogeographic patterns poorly understood.

This research focuses on the tetragnathid spider genus Cyrtognatha and its biogeography in the Caribbean. Cyrtognatha is distributed from Argentina to southern Mexico and the Caribbean65,81. A recent revision recognized 21 species of Cyrtognatha but cautioned that only a fraction of its diversity is known82. Its biology is poorly understood as these spiders are rarely collected and studied (a single Google scholar title hit vs 187 title hits for Tetragnatha). Considering their phylogenetic proximity to Tetragnatha, as well as its described web architecture, it seems likely that Cyrtognatha species disperse by ballooning83,84,85. Through an intensive inventory of Caribbean arachnids, we obtained a rich original sample of Cyrtognatha that allows for the first reconstruction of their biogeographic history in the Caribbean. We use molecular phylogenies to reconstruct Cyrtognatha evolutionary history with particular reference to the Caribbean, and compare estimates of clade ages with geological history of the islands. We use this combined evidence to test the vicariant versus dispersal explanations of Caribbean colonization, and to look for a broad agreement of Cyrtognatha biogeographic patterns with the GAARlandia landbridge hypothesis. We also greatly expand our understanding of Cyrtognatha diversity in the Caribbean region.

Materials and Methods

Field collection and identification

Material for our research was collected as a part of a large-scale Caribbean Biogeography (CarBio) project. Extensive sampling was conducted across Caribbean islands and in Mexico, using visual aerial search (day and night), and beating66,86. Collected material was fixed in 96% ethanol and stored at −20/−80 °C until DNA extraction. Species identification was often impossible due to juvenile individuals or lack of match with the described species (Table 1).

Table 1 Detailed information on Cyrtognatha specimens and outgroups.

Molecular procedures

DNA isolation took place at University of Vermont (Vermont, USA; UVM) using QIAGEN DNeasy Tissue Kit (Qiagen, Inc., Valencia, CA), at the Smithsonian Institute in Washington, DC using an Autogenprep965 for an automated phenol chloroform extraction, and at EZ Lab (Ljubljana, Slovenia). The latter protocol involved robotic DNA extraction using Mag MAX™ Express magnetic particle processor Type 700 with DNA Multisample kit (Applied Biosystems, Foster City, CA, USA) and following modified protocols87 (Vidergar, Toplak & Kuntner, 2014).

We targeted two genetic markers: (1) the standard Cytochrome C oxidase subunit 1 (COI) barcoding region, which has repeatedly been shown to be taxonomically informative in species delimitation88,89; and (2) the nuclear 28S gene for a subset of terminals representing all sampled species. We used the forward LCO1490 (GGTCAACAAATCATAAAGATATTGG)90 and the reverse C1-N-2776 (GGATAATCAGAATATCGTCGAGG)91 for COI amplification. The standard reaction volume was 25 µL containing 5 µL of Promega’s GoTaq Flexi Buffer and 0.15 µL of GoTaq Flexi Polymerase, 0.5 µL dNTP’s (2 mM each, Biotools), 2.3 µL MgCl2 (25 mM, Promega), 0.5 µL of each primer (20 µM), 0.15 µL BSA (10 mg/mL; Promega), 2 µL DNA template and the rest was sterile distilled water. We used the following PCR cycling protocol: an initial denaturation step of 5 min at 94 °C followed by 20 touch-up method cycles of 60 s at 94 °C, 90 s at 44°→ 54 °C, 1 min at 72 °C, followed by 15 cycles of 90 s at 94 °C, 90 s at 53.5 °C, 60 s at 72 °C and the final extension period of 7 min at 72 °C.

The primer pair for 28S were the forward 28Sa (also known as 28S-D3A; GACCCGTCTTGAAACACGGA)92 and the reverse 28S-rD5b (CCACAGCGCCAGTTCTGCTTAC)93. The standard reaction volume was 35 µL containing 7.1 µL of Promega’s GoTaq Flexi Buffer and 0.2 µL of GoTaq Flexi Polymerase, 2.9 µL dNTP’s (2 mM each, Biotools), 3.2 µL MgCl2 (25 mM, Promega), 0.7 µL of each primer (20 µM), 0.2 µL BSA (10 mg/mL; Promega), 1 µL DNA template and the rest was sterile distilled water. We used the following PCR cycling protocol: an initial denaturation step of 7 min at 96 °C followed by 20 touch-down method cycles of 45 s at 96 °C, 45 s at 62 °C → 52 °C, 60 s at 72 °C, followed by 15 cycles of 45 s at 96 °C, 45 s at 52 °C, 60 s at 72 °C and the final extension period of 10 min at 72 °C. The PCR products were purified and sequenced at Macrogen (Amsterdam, NL).

We used Geneious v. 5.6.794 for sequence assembly, editing and proofreading. For alignment, we used the default settings and the automatic optimization option in the online version of MAFFT95. We concatenated the COI and 28S matrices in Mesquite96.

We obtained 103 original Cyrtognatha COI sequences and mined four additional Cyrtognatha COI sequences from GenBank (Table 1). We excluded a single sequence, representing Argentinian C. atopica (GU129638), from most analysis, due to its poor quality, as already discussed by Dimitrov and Hormiga97. Moreover, we added three COI sequences from GenBank (Arkys cornutus, Metellina mengei, Pachygnatha degeeri) and three original COI sequences (Leucauge argyra, Chrysometa linguiformis, Tetragnatha elongata) to be used as outgroups. We obtained 22 original sequences of 28S gene fragment representing all putative species of Cyrtognatha and included one from GenBank. Additionally, we incorporated three original 28S sequences (Leucauge argyra, Chrysometa linguiformis, Tetragnatha elongata) and a single one from GenBank (Arkys cornutus) to be used as outgroups (Table 1). The concatenated matrix contained 1244 nucleotides (663 for COI and 581 for 28S).

Species delimitation

Because the current taxonomy of Cyrtognatha based on morphology is highly incomplete82, we undertook species delimitation using COI data. To estimate molecular taxonomic operational units (MOTUs), we used four different species delimitation methods, each with its online application: PTP (Poisson tree process)98, mPTP (multi-rate Poisson tree process)99, GMYC (generalized mixed yule coalescent)100 and ABGD (automatic barcode gap discovery)101. We ran these species delimitation analyses using the default settings, with the input tree for GYMC from BEAST2102 and the input trees for PTP and mPTP from MEGA 6.0103.

Phylogenetic analyses

We used MrBayes104 to reconstruct an all-terminal phylogeny for a complete set of our original Cyrtognatha material and outgroups using COI (Table 1). For Bayesian analysis we used the Generalised time-reversible model with gamma distribution and invariant sites (GTR + G + I) as suggested by AIC and BIC criterion in jModelTest2105. We ran two independent runs, each with four MCMC chains, for 100 million generations, with a sampling frequency of 1000 and relative burn-in set to 25%. The starting tree was random.

For a species level phylogeny, we then selected two individuals per MOTU and added 28S sequence data for two partitions and analyzed this concatenated dataset under a Bayesian framework. As above, jModelTest2 suggested GTR + G + I as the appropriate model, this time for both partitions. These analyses had 28 terminals including outgroups (Table 1). The settings in MrBayes were as above, but the number of MCMC generations was set to 30 million. Due to high mutation rates in noncoding parts of nuclear genes like 28S, insertions and deletions accumulate through evolution106,107, resulting in numerous gaps in a sequence alignment. We treated gaps as missing data but also ran additional analyses applying simple gap coding with FastGap108.

Molecular dating analyses

We used BEAST2102 for time calibrated phylogeny reconstruction (chronogram) constrained based on the results from the above described species level phylogeny. We used a single COI sequence per MOTU and trimmed the sequences to approximately equal lengths. We then modified the xml file in BEAUti102 to run three different analyses. The first analysis was run using GTR + G + I as suggested by jModelTest2. The second analysis employed the package and model bModelTest109. The third analysis used the package and RBS model110. All parameters were set to be estimated by BEAST. We used a Stepping-Stone Sampling (SS) approach, implemented as Model_Selection 1.4.1 extension in BEAST2, to calculate marginal likelihoods of models employing either strict or relaxed molecular clock (see Supplementary Note S2 for details and Baele et al.111,112 for justification). We then performed likelihood ratio test (LRT) and Bayes Factor test (BF), using calculated marginal likelihood scores and discovered that a relaxed log normal clock model better fits our data (LTR: p < 0.001; logBF = 27.1). Following Bidegaray-Batista and Arnedo113 we set ucld.mean prior as normally distributed with mean value of 0.0112 and standard deviation of 0.001, and the ucld.stdev as exponentially distributed with the mean of 0.666. We ran an additional analysis using a fossil calibration point on the basal node of Caribbean Cyrtognatha clade. Cyrtognatha weitschati, known from Dominican amber of Hispaniola, is hypothesized to be 13.65–20.41 million years old. We used an exponential prior with 95% confidence interval spanning from a hard lower bound at 13.65 MYA to the soft upper bound at 41 MYA. This upper bound corresponds with the time of Hispaniola appearance. We used SS sampling approach to calculate marginal likelihood scores for models with either a Yule or a Birth-Death tree prior (Supplementary Note S2). As suggested by the results of LRT (p < 0.001) and BF (logBF = 21.4) tests on those two models, we opted for a Yule process as a tree prior. The trees were summarized with TreeAnnotator102, with 20% burn-in based on a Tracer114 analysis, target tree set as Maximum clade credibility tree and node heights as median heights.

All metafiles from BEAST and MrBayes were evaluated in Tracer to determine burn-in, to examine ESS values and to check for chain convergence. For visualization of trees we used FigTree115. All MrBayes and BEAST analyses were run on CIPRES portal116.

Ancestral area estimation

We used BioGeoBEARS117 in R version 3.5.0118 to estimate ancestral range of Cyrtognatha in the Caribbean. We used a BEAST produced ultrametric tree from the above described molecular dating analysis as an input. We removed the outgroup Tetragnatha elongata and conducted the analyses with the 13 Cyrtognatha MOTUs from six areas (Hispaniola, Jamaica, Puerto Rico, Cuba, Lesser Antilles and Panama). We estimated the ancestral range of species with all models implemented in BioGeoBEARS: DEC (+J), DIVALIKE (+J) and BAYAREALIKE (+J). We used log-likelihoods (LnL) with Akaike information criterion (AIC) and sample-size corrected AIC (AICc) scores to test each model’s suitability for our data. All of our Cyrtognatha’s MOTUs are single island endemics, therefore we were able to reduce the parameter “max_range_size” to two29,119,120.

Results

We collected 103 Cyrtognatha individuals from Cuba, Jamaica, Dominican Republic/Hispaniola, Puerto Rico and Lesser Antilles (Fig. 1, Table 1). We confirmed that all individuals are morphologically Cyrtognatha, although we were not able to identify most species. However, we did identifiy two known species: C. espanola (Bryant, 1945) and C. elyunquensis (Petrunkevitch, 1930), the latter, clearly a Cyrtognatha, was previously placed in Tetragnatha and not transferred to Cyrtognatha in the recent revision of the genus65. The CarBio collections from Mexico yielded no Cyrtognatha specimens.

Figure 1
figure 1

(A) Map of the Caribbean with indicated sampling localities. (B) The all-terminal mitochondrial Bayesian phylogeny of Cyrtognatha. Branch colors match those of the islands in A. Notice that all putative species form exclusively single island endemic pattern.

We obtained COI sequences for all Cyrtognatha individuals. Using computational methods for species delimitation our Cyrtognatha dataset is estimated to contain from 11 to 14 MOTUs (Supplementary Note S1). The results from PTP, mPTP and ABGD were mostly consistent, disagreeing only on the status of three putative species. To these species that are supported by some but not all analyses, we added the label B or C after the species name: Cyrtognatha SP10B, Cyrtognatha SP2B and Cyrtognatha SP2C. On the other hand, we dismiss the results from GMYC analyses using either a single versus multiple threshold option, which failed to recover reliable MOTUs. The composition of our dataset is most likely not compatible with GMYC method as it cannot detect switches between inter- and intraspecific branching patterns, offering us from 1 to 39 MOTUs.

The two gene and the all-terminal, COI, phylogenies yielded nearly identical networks both supporting the monophyly of the Caribbean taxa. However, the root placement in the mtDNA phylogeny is different such that the phylogenetic trees appear to be in strong conflict even though the phylogenetic networks are mostly congruent (Fig. 2, Supplementary Fig. S1). Given that stronger evidence for root placement is expected to come from the two gene phylogeny, we ran an additional analysis constraining the root of the mtDNA phylogeny to reflect that, with otherwise the same settings (Fig. 1, Supplementary Fig. S2). Our all-terminal Bayesian phylogeny supports Caribbean Cyrtognatha monophyly, albeit with only three non-Caribbean samples. Most terminal clades were well supported with lower supports for some deeper nodes (Supplementary Fig. S1). This phylogeny strongly recovers all putative species groups as single island endemics. Furthermore, all geographic areas harbor monophyletic lineages, with the exception of Hispaniola that supports two independent clades. The unconstrained all-terminal COI phylogeny recovers the Lesser Antillean clade as sister to all other Caribbean taxa. However, this relationship is not recapitulated in the concatenated, species level, phylogeny (Fig. 2, Supplementary Fig. S3). The concatenated phylogeny also supports monophyly of the Caribbean taxa but recovers the clade of C. espanola and C. SP12 from Hispaniola as sister to all other Caribbean Cyrtognatha. The species level phylogeny is generally better supported, with the exception of a clade uniting species from Lesser Antilles, Cuba, Hispaniola and Puerto Rico. In both Bayesian analyses the chains successfully converged and ESS as well as PRSF values of summarized MCMC runs parameters were appropriate104.

Figure 2
figure 2

Species level Bayesian phylogeny of Cyrtognatha based on COI and 28S. Relationships agree with Cyrtognatha and Caribbean Cyrtognatha monophyly.

Chronograms produced by BEAST, using either exclusively COI mutation rate or incorporating the additional fossil for time calibration, exhibited very similar time estimates (Fig. 3, Supplementary Fig. S5). We decided to proceed with the mutation rate-only calibrated phylogeny for further analyses because it is less likely to contain known potential biases when calibrating with scarcely available fossils and geological information121,122. The molecular dating analyses based on the three different models in BEAST largely agreed on node ages with less than 1 million years variation. However, the log files from the chronogram based on GTR + G + I model consistently exhibited low ESS values (<50), even with MCMC number of generations having been increased to 200 million. The analyses using the remaining models, RBS and bModelTest, were more appropriate since MCMC chains successfully converged, and the lowest ESS values were 981 and 2214 respectively, thus far exceeding the suggested 200. Additional examination of the log files produced by bModelTest phylogeny with bModelAnalyzer from AppStore.exe109 revealed that MCMC chains spent most time in modified TN93 model with the code 123143 which contributed for 49.56% of posterior probability (for details on bModelTest method of model selection see109). The BEAST chronogram using bModelTest (Fig. 3, Supplementary Fig. S4) yielded the best supported results, amongst the above mentioned approaches, based on ESS values, and was therefore used in subsequent biogeographical analyses. This chronogram (Fig. 3) supports a scenario in which Cyrtognatha diverged from the closely related genus Tetragnatha at 18.7 MYA (95% HDP: 12.8–26.7 MYA). The Caribbean clade is estimated to have split from the mainland Cyrtognatha (represented here by a species from Panama) 15.0 MYA (95% HDP: 10.5–20.7 MYA). The clade with lineages represented on Lesser Antilles diverged from those on Greater Antilles at 11.5 MYA (95% HDP: 8.3–15.6 MYA).

Figure 3
figure 3

Time-calibrated BEAST phylogeny of Cyrtognatha. This chronogram suggests Cyrtognatha colonized the Caribbean in mid-Miocene and refutes ancient vicariant scenarios. The lack of any land bridge connection of the Caribbean with mainland at least since early Oligocene (cca. 33 MYA; GAARlandia) suggests that colonization happened by overwater dispersal. Confidence intervals of clade ages agree with geological history of Caribbean islands.

The comparison of all six models of ancestral area estimation with BioGeoBEARS recovered DIVALIKE + J as most suitable for our data due to highest LnL scores in all tests (Supplementary Table S1). The estimation of ancestral states suggests that the most recent common ancestor of all Caribbean Cyrtognatha in our dataset most likely (62%) resided on Hispaniola (Fig. 4). Moreover, all the Greater Antillean island clades as well as the Lesser Antillean clade most likely originated from Hispaniola with the following probability: Jamaican clade (47%), Lesser Antillean clade (40%), Cuban clade (63%) and Puerto Rican clade (89%) (Fig. 4, Supplementary Table S1).

Figure 4
figure 4

Ancestral area estimation of Cyrtognatha with BioGeoBEARS. The biogeographical analysis, using the most suitable model for our data (DIVALIKE + J, max_range_size = 2), revealed that Hispaniola was most likely colonized first. Following colonization, Cyrtognatha diversified within Hispaniola and subsequently dispersed from there to all other islands of the Caribbean (stars).

Recently, Ree and Sanmartin123 identified certain biases in the selection of those models that employ the founder event (the +J variants of the models in BioGeoBEARS). We followed their concerns and also analyzed the data using the DIVALIKE model that omits the founder event117. These alternative results (Supplementary Fig. S6) differ from those above in detecting an exclusively vicariant cladogenetic set of events. As we discuss below these alternative results are less credible in the context of Caribbean geological history, as the putative vicariant events are too recent.

Discussion

We reconstruct the first Cyrtognatha phylogeny using molecular data from over 100 individuals of this rarely collected group. Our results support Cyrtognatha as a relatively young clade, having diverged from a common ancestor with its possible sister genus Tetragnatha, in early- to mid-Miocene, and colonized the Caribbean in mid-Miocene. As we discuss below, these estimated ages, combined with the phylogenetic patterns, refute ancient vicariant explanations of their Caribbean origin, including the GAARlandia hypothesis. Instead, the patterns suggest colonization of Hispaniola, and subsequent dispersal to other islands.

The all-terminal phylogeny (Fig. 1) reveals clear patterns of exclusively single island endemic (SIE) species. This holds true even for the three MOTUs on the Lesser Antilles island group, as they appear on Guadelupe, St. Lucia and Grenada (Table 1). Even in the absence of the oceanic barriers, i.e. within the larger islands, we find evidence of short range endemism124. While we do not claim to have thorough regional sampling, we find patterns of local endemism in regions where our sampling is particularly dense, providing the strongest test with available data. Many Caribbean spiders such as Spintharus44, Micrathena125 Selenops126 and Nops127, as well as other arachnid lineages such as Amblypygi64 and Pseudoscorpiones63, demonstrate a similar pattern. The distribution and quantity of SIEs depends on island properties such as maximum elevation, size, isolation and geological age128,129,130,131,132. While our focus was not on the effect of physical properties of islands on SIEs, the patterns seem to point towards a higher number of SIEs on the islands with a higher maximum elevation: Hispaniola (3098 m) is occupied by 4 or 6 MOTUs (depending on the delimitation method), Jamaica (2256 m) by three MOTUs and all other islands (<2000 m) by a single MOTU. The Caribbean islands, with the exception of Hispaniola, also harbor exclusively monophyletic Cyrtognatha lineages. The most rigorous tests of island monophyly would require thorough sampling within each island. However, if the patterns we observe represent biogeographic reality, we might explain this observed pattern with a combination of the niche preemption concept and organisms’ dispersal ability133,134,135. A combination of the first colonizer’s advantageous position to occupy empty niches and rare overwater dispersal events of their closely related species leads to competitive exclusion and lower probability for newcomers to establish viable populations on already occupied islands136,137,138. While niche preemption is better studied in plants, it is also applicable to animals, including spiders1,139.

Inferred dates indicate that Cyrtognatha most likely colonized the Caribbean through long distance overwater dispersal after the last hypothesized land connections. An ancient vicariant hypothesis would predict that the early proto-Antilles were connected to the continental America and were colonized in the distant past, possibly over 70 MYA140 and the GAARlandia landbridge putatively existed around 35–33 MYA. These hypothetical scenarios are not consistent with the dates reflected in our BEAST chronogram (Fig. 3) in which we estimate that the Caribbean Cyrtognatha split from its continental population as late as 15 MYA. This suggests that the genus Cyrtognatha is much younger than the most reasonable possible vicariant timeframe. While the estimated most recent common ancestral node is anchored by a single Central American representative, this inference is reasonable if our dating estimates are sound. If more extensive sampling on the continent broke up Caribbean monophyly, this would be evidence of more frequent dispersal between the islands and the continents, but would not likely change our estimations of the age of this ancestral node. More extensive sampling on the Caribbean could feasibly uncover Cyrtognatha taxa that share an older ancestor than that inferred from the Panamanian specimen, thus pushing back estimates of the timing of original colonization, however, given our relatively dense sampling on the Caribbean we find this unlikely.

Although, as explained above, overwater dispersal is the likely scenario, the reconstructed biogeographic patterns do not directly refute vicariance. Indeed, the biogeographic reconstruction (Fig. 4) of a combined ancestral area Panama + Hispaniola at the Cyrtognatha root leaves the possibility of a vicariant interpretation. However, this reconstruction is unlikely to reflect reality, and may be an artifact of our sparse continental taxon sampling.

Likewise, the alternative biogeographic reconstruction that omits the founder event (+J) (Supplementary Fig. S6) is consistent with vicariant origins of all Caribbean subclades. However, in the context of known Caribbean geological events, vicariance is extremely unlikely, thus questioning the validity of this alternative biogeographic history. For example, biotas on Lesser Antilles could not have originated vicariantly with those from Hispaniola given the geological knowledge that Lesser Antilles are de novo islands of volcanic origin, and thus had to be colonized.

There is further evidence that supports overwater dispersal in Cyrtognatha. First, the Jamaican lineage split from the one on Hispaniola soon after colonization of the Caribbean even though Jamaica was never a part of the proto-Antilles, and was thus never physically connected to Hispaniola. Secondly, Puerto Rico was a part of the proto-Antilles but was colonized only recently (4.8 MYA). The results of Jamaican and Puerto Rican colonization from Hispaniola thus are most consistent with a scenario of colonization by overwater dispersal.

The mid-Miocene (ca. 15 MYA) is considered as the start of the modern Earth141 in that the climate began to stabilize and the ocean currents started to take their current form. This combination of events enabled the colonization of the Caribbean islands from eastern-northern parts of South America for example via vegetation rafts passively drifting with water currents142. That also meant that the wind directions and the hurricane paths most likely resembled those of today143, from East to West direction144. In fact, hurricanes may create numerous dispersal/colonization opportunities, especially for the organisms with poor active dispersal abilities140,145. Wind directions and tropical storms are relevant for tetragnathid spiders like Cyrtognatha that disperse by ballooning and could facilitate their colonization of the Caribbean islands in a stepping stone146 or leap-frog147 manner.

With the examination of the relationships in the time calibrated phylogeny (Fig. 3), a colonization of the Caribbean from the continental America may have occurred sometime between 10.5 and 20.7 MYA. The most likely scenario indicates the original colonization of the Greater Antilles (Hispaniola; Fig. 4). Such patterns of colonization of Greater Antilles in Miocene are also evident in many other lineages including vertebrates, invertebrates and plants29,41,148,149,150,151,152,153,154,155. More rigorous tests of Cyrtognatha monophyly, as well as the number and directionality of colonization pathways onto the Caribbean, would require more thorough sampling across potential source populations on the mainland.

Our inference of ancestral ranges proposes an early within island diversification of Cyrtognatha ancestors occupying Hispaniola and predict that Hispaniola is the ancestral area for all Caribbean clades (Fig. 4, Supplementary Fig. S5). The path of colonization does not resemble a straightforward pattern such as the stepping-stone pattern. The colonization sequence seems more random or resembles a “leap-frog” pattern. In our case the clear example of island being “leap frogged” is Puerto Rico. A leap frog pattern could indicate a role of hurricanes in movement among Caribbean islands140,156.

Conclusions

Our phylogenetic analysis of the tetragnathid spider genus Cyrtognatha facilitates reconstruction of its biogeographic history in the Caribbean. The ancestor of this relatively young lineage appears to have colonized the Caribbean overwater in the Miocene and further diversified into an exclusively single island endemic biogeographic pattern seen today. Further sampling of these rarely collected spiders in continental America is needed to confirm the timing, number and source of colonization of the Caribbean and to contrast those from other Caribbean spider clades. For example, Spintharus44,48 and Deinopis157 patterns clearly support ancient vicariance, but Argiope27 readily disperses among the islands. Tetragnatha, the sister lineage of Cyrtognatha, may prove to be of particular interest in comparison, because its biogeographic history on the islands may mirror their global tendency towards repeated colonization of even most remote islands.