Species delimitation and mitonuclear discordance within a species complex of biting midges

Shults, Phillip; Hopken, Matthew; Eyer, Pierre-Andre; Blumenfeld, Alexander; Mateos, Mariana; Cohnstaedt, Lee W.; Vargo, Edward L.

doi:10.1038/s41598-022-05856-x

Download PDF

Article
Open access
Published: 02 February 2022

Species delimitation and mitonuclear discordance within a species complex of biting midges

Phillip Shults^1,4,
Matthew Hopken²,
Pierre-Andre Eyer¹,
Alexander Blumenfeld¹,
Mariana Mateos³,
Lee W. Cohnstaedt⁴ &
…
Edward L. Vargo¹

Scientific Reports volume 12, Article number: 1730 (2022) Cite this article

4761 Accesses
13 Citations
9 Altmetric
Metrics details

Subjects

Abstract

The inability to distinguish between species can be a serious problem in groups responsible for pathogen transmission. Culicoides biting midges transmit many pathogenic agents infecting wildlife and livestock. In North America, the C. variipennis species complex contains three currently recognized species, only one of which is a known vector, but limited species-specific characters have hindered vector surveillance. Here, genomic data were used to investigate population structure and genetic differentiation within this species complex. Single nucleotide polymorphism data were generated for 206 individuals originating from 17 locations throughout the United States and Canada. Clustering analyses suggest the occurrence of two additional cryptic species within this complex. All five species were significantly differentiated in both sympatry and allopatry. Evidence of hybridization was detected in three different species pairings indicating incomplete reproductive isolation. Additionally, COI sequences were used to identify the hybrid parentage of these individuals, which illuminated discordance between the divergence of the mitochondrial and nuclear datasets.

Genetic and geographical delineation of zoonotic vector-borne helminths of canids

Article Open access 24 April 2022

Multiple evolutionary lineages for the main vector of Leishmania guyanensis, Lutzomyia umbratilis (Diptera: Psychodidae), in the Brazilian Amazon

Article Open access 28 July 2021

Genomic differentiation and intercontinental population structure of mosquito vectors Culex pipiens pipiens and Culex pipiens molestus

Article Open access 05 May 2020

Introduction

Speciation is a dynamic evolutionary process through which populations segregate into independently evolving lineages over time¹. When gene flow is restricted either through geographic, behavioral, or ecological isolation, the accumulation of genetic changes, through selection or local genetic drift, may lead to divergence and potentially reproductive isolation^2,3,4,5,6. Thus, the amount of genetic differentiation and level of gene flow between closely related lineages can be used to evaluate the strength of this isolation and determine species status⁷. Depending on the completeness of the speciation process between lineages, it can be challenging to unambiguously identify species⁸. Shallow divergence and hybridization can mask both morphological and genetic differences. While the most accurate assumptions about species delimitation are derived from a multifaceted approach^9,10, genomic data has become a powerful tool to investigate species boundaries^11,12. Both substantial and fine-scale genetic divergence is being uncovered across many study systems, even in the absence of morphological variation^13,14,15,16. Species delimitation is especially important when working with organisms responsible for pathogen transmission, as misidentifications will lead to inaccurate vector surveillance data. Culicoides Latreille (Diptera: Ceratopogonidae) biting midges are responsible for the transmission of many pathogens worldwide^17,18, including bluetongue virus (BTV) and epizootic hemorrhagic disease virus (EHDV). These viruses can cause severe symptoms and death in wild and domestic ungulates and are responsible for substantial economic losses globally^19,20.

In North America, one of the main BTV and EHDV vectors is Culicoides sonorensis Wirth and Jones²⁰, which belongs to the C. variipennis species complex. When originally described, this group consisted of five subspecies²¹; though presently, three distinct species are recognized (C. occidentalis Wirth and Jones, C. sonorensis, and C. variipennis (Coquillet)) with C. albertensis Wirth and Jones and C. australis Wirth and Jones designated as synonyms of C. sonorensis²². Despite the current taxonomic arrangement, species identification remains difficult due to very subtle morphological differences and overall genetic similarities^23,24. Additionally, the presence of cryptic species or hybridization within the complex could be further complicating proper species identification. Under laboratory conditions, C. sonorensis and C. occidentalis have been shown to hybridize²⁵, and both C. occidentalis and C. variipennis occur sympatrically with C. sonorensis²¹. Hybridization with C. sonorensis in nature would represent a pathway for introgression, potentially for genes controlling vector competency²⁶.

Geographic isolation limits gene flow between populations, and thus, life-history traits influencing dispersal ability can drastically influence the level of gene flow among populations (e.g.^27,28). Species with low dispersal ability are particularly likely to exhibit highly differentiated populations resulting in the evolution of cryptic species over a limited spatial scale²⁹. In contrast, species with high dispersal abilities are likely to maintain a high level of gene flow between populations. Studies of Culicoides species in Europe, Africa, and Australia have consistently revealed frequent gene flow between populations, even at continental scales^30,31,32,33. Culicoides species have been shown to randomly disperse away from their larval habitats, up to 2 km daily^34,35, and are also known to disperse via prevailing winds for hundreds of kilometers^36,37,38. The high dispersal ability of biting midges decreases the likelihood of geographic isolation between populations, and as a consequence, may not have played a major role in the diversification of this group. Instead, ecological or behavioral isolation can allow closely related species to occur sympatrically in distinct ecological niches, and may help explain species divergence within Culicoides^39,40. The high rate of dispersal, potential for hybridization, and numerous sympatric populations make the C. variipennis complex an intriguing system in which to study species delimitation and may also provide insights into the mechanisms responsible for speciation within this group.

Here, we evaluated the genetic structure of the C. variipennis complex from broad-ranging sampling locations to test the current taxonomic hypothesis of three distinct species. We used ddRadSeq to analyze 206 individuals collected from 17 sites throughout the United States and Canada. We first estimated the overall genetic similarity and population structure among these samples to delimit distinct lineages within the species complex. We then estimated the level of gene flow within and between the inferred species. As previous attempts to separate these species using common barcoding genes have been inconclusive, we sequenced a region of the COI gene to compare to the putative SNP species identifications. Additionally, we discuss the potential mechanisms controlling reproductive isolation within this species complex.

Results

SNP calling and clustering analyses

In total, 271 individuals were subjected to the ddRADseq procedure and yielded an average of 2.08 million reads per individual. During the initial filtering, 36 individuals were found to have low-quality sequences (phred score of less than 25) and were removed from the dataset. Additionally, 29 individuals were found to have more than 75% missing data and were also removed. The final dataset included 206 individuals from 17 sites and contained 3612 SNPs. The population structure inferred by fastSTRUCTURE that best explains the data was K = 5. Structure plots showing K = 3–6 can be found in Figure S1. At K = 5, most individuals (86%) were unambiguously assigned to one group (98–100% assignment score; Fig. 1). Consistent with these results, the principal component analysis (PCA) and discriminant analysis of principal components (DAPC) grouped these individuals into five main clusters (Figs. 2a & S2). The main difference being that the PCA further segregated one cluster (blue, Fig. 2a) into two separate groups; east and west of the Sierra Nevada mountain range. Further support for the same five clusters was found in the maximum likelihood trees, with a high level of support from each approximation method (Figs. 2b & S3).

Inter- and intra-species population genetics

The geographic distributions of these clusters closely align with the distributions of the species (then subspecies) described in Wirth & Jones (1957) (Fig. 1), and recent morphological analyses of individuals from this study supports the species level designation of these clusters⁴¹. For the remainder of the manuscript, we will refer to each cluster by its corresponding species name. Culicoides occidentalis was located in Western North America, C. sonorensis in the Western and Southern United States (U.S.), C. albertensis in the Midwest U.S. and Ontario, C. variipennis in the Eastern U.S. and Ontario, and a fifth genetic group suggesting the occurrence of an additional, undescribed cryptic species in San Diego, CA. Notably, eight of the 17 sites had more than one species in sympatry, and one site had three species. At four sites, seven individuals were assigned to two genetic groups with an assignment score of ~ 50% for three individuals (scores = 45, 47 and 41%) and of ~ 25% for four individuals (scores = 34, 31, 25 and 24%), which suggests the occurrence of putative F1 or other types of hybrids (e.g., F2 or backcrosses). Interestingly, these hybrids were from three different species pairings (C. sonorensis X C. occidentalis; C. sonorensis X C. variipennis; and C. albertensis X C. variipennis). These hybrid individuals also stood out in the PCA, as they segregated between their parental clusters (Fig. 2a), as well as at the base of each parental branch in the phylogenetic tree (Fig. S3). In addition to these seven hybrids, 20 individuals had a secondary assignment score between 3 and 21%, signifying potential introgression between those pairings. However, we are less confident in STRUCTURE’s ability to identify this level of ancestry as other factors can also lead to mixed assignments.

Table 1 Collection site information and numbers of individuals retained for the SNP analyses.

Full size table

The seven putative hybrids were excluded from the dataset used to calculate the intraspecies summary statistics (rearranged by cluster), which resulted in the isolation of 566 SNPs after more stringent filtering was applied. The mean F_ST between species was 0.7147 (0.6541–0.7470), roughly 9 times higher than the mean F_ST between the populations (i.e., localities) within each species (see below; Tables 2 & S1). Similarly, both the aR and LKC values of intra-individual genetic distance show a low level of divergence/high level of similarity within each species, including the San Diego population (Table S2). The four species-specific datasets were used to calculate the interspecies summary statistics as well as test for isolation by distance (IBD). These datasets contained 22 individuals of C. albertensis from four sites (3423 SNPs), 36 individuals of C. occidentalis from four sites (2714 SNPs), 97 individuals of C. sonorensis from seven sites (2357 SNPs), and 29 individuals of C. variipennis from four sites (2960 SNPs). The expected and observed heterozygosity, F_IS, and number of private alleles for each species are reported in Table S2. No species-level dataset was created for the San Diego species, as this species was uncovered in a single locality.

Table 2 Mean pairwise F_ST within and between species.

Full size table

When examining each species individually, C. albertensis, had no evidence of population structure (K = 1), and had low genetic differentiation among locations (mean F_ST = 0.054) (Fig. 3a; Table 2). Although there does seem to be a pattern of IBD, this was found to not be significant in this species (Mantel test, P = 0.238; Partial Mantel test, P = 0.714). The low number of locations sampled potentially limits the statistical power of these correlations. The results obtained for C. occidentalis showed much more divergence compared to the other species, with populations being strongly differentiated from each other (mean F_ST = 0.411) (Table 2). Additionally, the fastSTRUCTURE analysis suggested that each location of C. occidentalis sampled is distinct (K = 4) (Fig. 3b). While no IBD was found (Mantel test, P = 0.489; Partial Mantel test, P = 0.770), there seems to be a considerable amount of geographic isolation among populations of this species, with pairwise F_ST values ranging from 0.14 to 0.70 (Table S4). Additionally, significant levels of dissimilarity were found between individuals from California and those from the other three populations (Fig. S4). In contrast, low genetic differentiation among locations were found for C. sonorensis (mean F_ST = 0.029), with varying levels of support for IBD in this species (Mantel test, P = 0.039; Partial Mantel test, P = 0.082) (Fig. 3c; Table 2). For this reason, the individuals from Colorado were combined into a single location, as were the individuals from Kansas. The fastSTRUCTURE analysis suggested the occurrence of population structure in C. sonorensis (K = 2), with some individuals from Kansas belonging to a distinct group, though these were not highly divergent from any other C. sonorensis location (Table S4). Individuals of C. variipennis exhibited no evidence of population structure (K = 1) or of IBD (Mantel test, P = 0.587; Partial Mantel test, P = 0.125) (Fig. 3d). Consistently, almost no genetic differentiation was found among locations of this species (mean F_ST = 0.026) (Table 2).

Haplotype network

In total, 285 midges were included in the analysis of a 546 bp region of the COI gene. Four distinct haplogroups were identified with substantial genetic divergence between groups (p-distance = 2.99–3.30%) and little divergence within groups (p-distance = 0.25–0.86%; Fig. 4; Table 3). Consistent with the SNP datasets, C. occidentalis formed a divergent haplogroup, separated from the rest of its range. The mean percent divergence between the two C. occidentalis groups (2.99%) was similar to its divergence from the other species (3.01–3.30%). The San Diego population also clustered as a distinct group, with a similar level of divergence from the other species (3.01–3.03%). Interestingly, C. albertensis, C. sonorensis, and C. variipennis were not separated from each other, and in some cases, C. albertensis and C. variipennis shared identical haplotypes (Fig. 4, S5). Furthermore, these three species exhibit a mean percent divergence between individuals (0.80%) similar to the divergence observed among individuals within C. occidentalis (Table 3). Other than the grouping of C. occidentalis in California, there was no geographic clustering observed.

Table 3 Mean percent divergence (p-distance) within and between species clusters based on the COI gene (ranges listed in parentheses).

Full size table

Discussion

Our study provides valuable insights into the population genetics of the C. variipennis species complex and highlights the presence of potential cryptic species. For most of the species examined, minimal genetic divergence was observed across locations, suggesting the maintenance of gene flow even over large geographic distances. The only exception was C. occidentalis, which showed a high level of geographic isolation, as well as two distinct COI haplogroups. We confirmed that mitochondrial data is not reliable to differentiate three out of the five species, due to the lack of segregation between the mitochondrial haplotypes of C. albertensis, C. sonorensis, and C. variipennis. This stands in stark contrast to their clear differentiation and high level of divergence inferred from the SNP data. Though a substantial amount of divergence exists between all five species, low levels of hybridization, and potentially introgression, are present in sympatric populations. While we do not know the fitness of these hybrids, but this could suggest that strong post-zygotic isolation barriers may have yet to evolve in this group. Thus, pre-zygotic isolation through either ecological or behavioral segregation is a possible mechanism maintaining divergence within this complex. With a considerable amount of geographic overlap between some species (Fig. 1), each sympatric population is potentially experiencing a set of unique selective pressures to maintain species boundaries.

The high degree of genetic differentiation between clusters inferred by the SNP data supports the current species groupings of the C. variipennis complex (C. occidentalis, C. sonorensis, and C. variipennis), as well as raising C. albertensis and a cryptic species in San Diego, California to species status (Fig. S5). While this putative new species was only collected in San Diego, its true distribution could extend well into Mexico. While clear divergence was observed in the SNP data, the mitochondrial data showed a different pattern of divergence. Culicoides albertensis, C. sonorensis, and C. variipennis have a considerable amount of genome-wide differentiation (Fig. 1); however, there was no clear differentiation of the COI gene (Fig. 4). In fact, several individuals of C. albertensis and C. variipennis shared identical haplotypes. Multiple studies have shown a high degree of genetic similarity in mtDNA between C. sonorensis and C. variipennis^23,24,42, though it was proposed that this was due to misidentifications. As all of the individuals included in our mitochondrial haplotype analysis from the current study were identified to species using the SNP data, this lack of mitochondrial separation must have an underlying biological cause. This finding can result from historical introgression with “leaky” pre-zygotic isolation, or semipermeable species boundaries, which have been shown to produce mitochondrial introgression without detectable nuclear DNA introgression in some taxa^43,44. This is likely due to the fact that the mitochondrial genome is independent of the nuclear genome and thus unlinked to the genes contributing to reproductive isolation⁴⁵. However, we cannot rule out that other possibilities could have caused this discordance, such as recent speciation and incomplete lineage sorting or selection^13,46. Regardless, it appears that the evolution of the mitochondrial genome is not congruent with the species tree of the C. variipennis complex. Notably, the SNP phylogenetic tree shows that C. occidentalis, and not C. sonorensis, is the sister taxa of C. albertensis and C. variipennis (Fig. S5). This suggests that the mtDNA similarity between C. albertensis, C. sonorensis and C. variipennis could stem from ongoing hybridization and introgression, rather than incomplete lineage sorting.

Little to no IBD or structure was found within C. albertensis, C. sonorensis, and C. variipennis indicating a substantial amount of connectivity among localities of these species (Fig. 3a,c,d). The number of populations inferred by fastSTRUCTURE for C. sonorensis was K = 2; however, a mean pairwise F_ST of 0.0287 suggests that a high amount of gene flow exists between all locations. This could be an artifact of the propensity of delta K inferring two populations⁴⁷ or from a high level of relatedness among individuals from KS (Fig. S6). Interestingly, although no IBD was found in C. occidentalis, each location of this species clustered as a distinct population (Fig. 3b). The lack of IBD is therefore not indicative of a single, genetically homogeneous population, but rather stems from high levels of divergence between populations regardless of their geographic distances. Focusing sampling efforts on each of these species will surely permit robust landscape genetic approaches to gain understanding of the evolutionary forces driving population structure in this group. This will allow studying whether Culicoides population structure is characterized by uniform or discontinuous isolation by distance, as well as, isolation by adaptation (IBA) or by environment (IBE)⁴⁸.

The strong genetic divergence between the C. occidentalis from California and the other populations was observed in both the SNP and mtDNA data (Tables 2, 3, Fig. 4). It is possible that this may represent a further cryptic species with a dispersal barrier created by the Sierra-Nevada mountain range (Fig. S4). This high level of differentiation within C. occidentalis could be due to geographic isolation alone; however, endosymbionts have also been shown to significantly increase mitochondrial diversity in the presence of geographic structure^49,50. Naturally occurring endosymbionts have been found in Culicoides midges, including C. sonorensis^51,52, and recently, a Cardinium sp. was linked to mitochondrial divergence in C. imicola⁵³. Further screening is needed to determine the diversity and abundance of endosymbionts infecting Culicoides midges, though the possibility remains that they could be playing a role in the phylogeographical structure of C. occidentalis if they are causing incompatibility between populations. Additionally, patchiness of the specialized larval habitat of C. occidentalis, not present in the other members of the C. variipennis complex, could create isolation between populations, as well as reduce the number of individuals within each population. A small effective population size with little to no immigration would allow for a strong effect from drift⁵⁴. While the populations of C. occidentalis outside of California were less diverged from one another, the lowest pairwise F_ST values between these populations were still greater than the highest pairwise values observed within any other species, consistent with the findings of Holbrook et al. (2000) (Table 2).

Similar to other species of Culicoides^30,32,33,55, high values of the inbreeding coefficient (F_IS) were observed in all species investigated in this study (Table S2). Although these previous studies have suggested that the observed high F_IS are an artifact from a large number of null alleles, the consistent reporting of these findings across various species using several types of molecular markers lends support to the hypothesis that high inbreeding has a biological origin in this genus. High levels of inbreeding and heterozygote deficiencies are common among mosquitoes^56,57,58, even when using markers with a low level of null alleles^59,60. Goubert et al. (2016) considered the typical Aedes albopictus population as “a network of interconnected breeding sites, each with a high level of inbreeding”. Although we cannot rule out all other possibilities, our results strongly suggest that some aspects of the reproductive biology of Culicoides induce inbreeding within populations. High F_IS and low F_ST between populations can stem from high levels of migration between populations (i.e., homogenizing allele frequencies at large scale), followed by matings with close relatives within populations (i.e., increasing homozygosity without altering allelic frequencies). It is also possible that our sampling approaches (single night trapping) led to capturing cohorts of closely related individuals.

Low levels of hybridization were found in some sympatric populations involving several different species pairings. Under laboratory conditions, mating between C. sonorensis and C. occidentalis can produce viable offspring for at least six generations, though the hatch rate of the progeny is dependent on the species of the mother²⁵. A cross of female C. sonorensis and male C. occidentalis only yields a 7% hatch rate whereas the reciprocal cross yields a 75% hatch rate. This asymmetrical hybrid viability is likely caused by cytonuclear incompatibility^61,62, though endosymbionts have also been shown to cause reproductive incompatibility⁶³. Upon secondary contact of closely related species, and in the absence of post-zygotic reproductive isolation, the production of unfit hybrids can induce the rapid evolution of premating barriers^2,64,65,66. In most populations however, C. sonorensis females are unlikely to come across C. occidentalis males due to differences in mating behavior. Conversely, C. occidentalis females do come into contact with C. sonorensis males, who do not appear to have mate discrimination⁶⁷, and will likely attempt to mate with these heterospecific females. As there are demographic disparities (population size and structure) between these two species, as well as viable offspring produced from this cross, rampant hybridization and asymmetric introgression would be detrimental to C. occidentalis⁶⁸. Strong selection against hybridization can maintain species boundaries, but as two of the ten C. occidentalis collected from Borax Lake in California (CABL) appeared to be F1 hybrids (Fig. 1), another mechanism, potentially differences in the larval habitat or mating behavior, appears to be limiting directional introgression from C. sonorensis.

Culicoides occidentalis females lay their eggs in highly saline environments (up to 88.0 parts per thousand (ppt))⁶⁹, whereas C. sonorensis eggs will not hatch in water with salinity over 20.0 ppt⁷⁰. However, ecological exclusion via the larval habitat would only limit introgression if the hybrids were inviable in highly saline environments, which does not appear to be the case. The difference in mating behavior between these two species may be a more likely mechanism by which the detrimental effects of hybridization are diminished. Culicoides occidentalis mates at the larval habitat while C. sonorensis mates at or near a host^22,71. Even if a C. occidentalis female mated with a C. sonorensis male, she would return to the high saline pools to lay her eggs and these hybrid offspring would have a high chance of only backcrossing within the C. occidentalis lineage. While only two C. occidentalis x C. sonorensis hybrids were tested in this study, both had C. occidentalis mothers (Fig. 4), providing evidence that this scenario takes place in nature. However, this type of isolation would not explain how C. sonorensis and C. variipennis maintain species boundaries in sympatry as they share a larval habitat. Further studies are needed to determine the mechanisms behind reproductive isolation within this group.

The C. variipennis complex is one of many vector groups in which species delimitation can be challenging^{46,72,73,74,75,76}; however, species identification is an integral part of vector surveillance. The species status of these group members has implications for vector surveillance, as any ambiguity in identification will lead to unreliable data. For example, while C. albertensis and C. sonorensis occur in sympatry, only C. sonorensis is reported as a vector species⁷⁷. The addition of the non-vector species when conducting serological surveys could lead to a severe underestimation of the infection rate within the vector species. As BTV and EHDV are expanding northward into eastern Canada⁷⁸, it has been suggested that the dispersal of C. sonorensis into new areas could be to blame for this incursion⁴². Specimens assigned to C. sonorensis by Jewiss-Gaines et al. (2017) were included in the present study and cluster instead with C. albertensis (“ON”, Fig. 1). Thus, there are likely alternative reasons for the range expansion of these viruses, including an unidentified vector species outside of the C. variipennis complex, such as C. stellifer^79,80. Molecular tools for accurate species-level delimitation within this complex is sorely needed for proper vector surveillance. Additionally, the detection of hybridization between a non-vector and vector species may be evidence of recent speciation, but it also highlights a potential path of introgression for genes controlling vector competency^81,82.

Our study shows that using a population genomic approach to analyze sibling species can identify species-level divergence, fine-scale genetic structuring within species, and uncover the existence of hybrids and cryptic species in Culicoides. Radiation within the C. variipennis complex occurred despite the long-range dispersal capabilities of biting midges as well as hybridization between sympatric species. This does not preclude historical geographic isolation; however, we believe that behavioral and ecological isolation may have shaped evolution within this group or is at least maintaining the current species boundaries. Significant geographic isolation was only found between populations of C. occidentalis, but more sampling is needed to determine if the lack of gene flow between California and the other populations represents an incipient speciation event or IBD. Additionally, focusing efforts in these various hybrid zones may provide a better understanding of the evolution of reproductive isolation in this group. Cryptic species have been reported in a number of other Culicoides species complexes⁸³ and the analyses presented here could help to identify these putative species. Delimiting the species in these complexes, will not only aid in vector surveillance efforts, but continued study of the speciation of closely related vector and non-vector species could produce valuable evolutionary insights into vector competency.

Materials and methods

Sample collection and sequencing

Culicoides midges were collected from 17 sites across the United States and Canada (Table 1). Specimens were collected either as pupae and reared to adulthood, or as adults using CDC light traps baited with CO₂ and UV light (Bioquip 2836BQ). Individuals morphologically assigned to the C. variipennis complex were sorted out from the by-catch and stored in 95% ethanol at -80 °C. Total DNA was extracted from individuals using a Puregene extraction protocol (Gentra Systems, Inc., D-5500A) with the addition of glycogen (ThermoFisher, R0561) to increase yields. DNA was only extracted only from females as their larger body size (compared to the males) produced sufficient amount of DNA for next-gen sequencing. The DNA quality was checked using gel electrophoresis and DNA concentration was measured using a Qubit 3.0 fluorometer and a Qubit dsDNA HS assay kit (Invitrogen, Q33230). A total of 300–400 ng of DNA per sample was sent to Floragenex, Inc. for library preparation using the protocol from Truong et al. (2012). DNA was digested using the restriction enzymes MseI and PstI. After PCR amplification, the samples in each plate were pooled and sequenced on a lane of single-end 100 bp sequencing on a HiSeq4000 at the University of Oregon Genomics Facility, Eugene, OR.

Raw sequence filtering and processing

Raw sequence quality was first assessed using FastQC v.0.11.9 and MultiQC v.1.7^84,85, and then reads were filtered and processed using Stacks v.2.3⁸⁶. Reads with a phred score below 25 were removed as well as individuals with a > 75.0% missing data. Next, reads were aligned to the C. sonorensis genome⁸⁷ (Accession: PRJEB19938) using the Burrows-Wheeler Aligner (BWA-mem)⁸⁸. Finally, aligned reads were run through the reference-based pipeline of Stacks. Filtering options were set to only include loci found in at least half of the sampling locations (-p 8) and those occurring in at least 50% of individuals within those sites (-r 0.5)⁸⁹. The minimum allele frequency was set to 0.05 to protect against potential sequencing errors⁹⁰, and only the first SNP per locus was kept to minimize linkage disequilibrium between SNPs from influencing population structure and phylogenetic analyses. All subsequent file reformatting was done with PGDSpider v.2.1.1.5⁹¹.

Clustering analysis

Population structure in the overall dataset was evaluated using fastSTRUCTURE v.1.04, with Structure_threader utilized to parallelize distinct runs of K^92,93. Models were fitted with the number of genetic clusters (K) set to range from 1 to 10. The most suitable value of K was selected using the chooseK.py function from the fastSTRUCTURE package which selects the model that maximizes the marginal likelihood of the data. Using the output from fastSTRUCTURE and Distruct v.2.3 (http://distruct2.popgen.org), a bar plot was created where each individual is represented by a vertical line divided into K colored segments with the length of each segment being proportional to the estimated membership in each of the inferred K groups. A map of the structuring at each collection site was created using Inkscap v.1.1 (https://inkscape.org/). The clustering of individuals into the distinct genetic groups was also visualized using a principal component analysis (PCA) and a discriminant analysis of principal components (DAPC). The most likely number of genetic groups was inferred by the find.clusters algorithm for the PCA and the optimal number of principal components to inform the DAPC was defined using the function optim.a.score. Both were performed in R⁹⁴ through the adegenet package⁹⁵.

Any individual with more than 25% of their loci grouping with a second cluster in the fastSTRUCTURE analysis was marked as a hybrid and removed from the phylogenetic analysis, due to the uncertainty of assigning them to a given species. Maximum likelihood phylogeny among individuals was run using RAxML v.8.2.12⁹⁶. An acquisition bias correction was applied to the likelihood calculations as alignments were solely composed of SNPs, with each invariant site removed through Phrynomics (https://github.com/bbanbury/phrynomics)⁹⁷. The GTR + G nucleotide substitution model was used for each search. A rapid bootstrap analysis and search for the best-scoring maximum likelihood tree was executed using the extended majority rule-based bootstopping criterion to achieve a sufficient number of bootstrap replicates⁹⁸. Additionally, to cross-validate our results, a second phylogeny was inferred in W-IQ-Tree version 1.6.12⁹⁹, using the TVM + F + G4 substitution model determined by ModelFinder^100,101. Branch support was calculated using 1000 ultrafast bootstraps¹⁰² and a Shimodaira–Hasegawa like approximate likelihood-ratio test (SH-aRLT)^102,103.

To measure the amount of divergence between genetic clusters, a new SNP dataset was generated with individuals grouped by cluster rather than locality. Additionally, to measure the amount of divergence within each genetic cluster, four cluster-specific datasets (grouped by site) were also generated. SNPs were obtained from these new datasets using the same processing methods above except with more stringent filtering parameters. Only SNPs that occurred in at least 75% of the clusters or collection sites and at least half of the individuals within those groups were included. Genetic diversity estimates (F_IS, H_E, and H_O) and population differentiation (pairwise F_ST) were calculated for each species dataset using Genepop v.4.7.0¹⁰⁴. Population differentiation was not calculated for the new species found in San Diego, as only a single population of this species was uncovered. Rousset’s distance aR¹⁰⁵ and Loiselle’s kinship coefficient (LKC)¹⁰⁶ were calculated respectively with SPAGeDi v.1.5¹⁰⁷. Geographic distances among localities were calculated as both Euclidean and anisometric distances and a Mantel test and a Partial Mantel test were preformed to test for isolation-by-distance (IBD)¹⁰⁸. Tests for areas of significant genetic dissimilarity among individuals using aR were implemented in MAPI using 1000 replications¹⁰⁹.

Mitochondrial sequencing and haplotype network

Mitochondrial DNA haplotypes were obtained from a subset of 67 individuals from the five genetic clusters. PCR reactions were performed using a Taq-Pro COMPLETE kit (Denville Scientific, CB4065-4) targeting a partial region of the COI gene with the Lep50 primer set from Folmer et al. (1994) and the thermocycler profile from Herbert et al. (2003). PCR products were cleaned using an EXOSAP-IT kit (ThermoFisher, 78201.1.ML) and prepared for sequencing using a BigDye Terminator v.3.1 Cycle Sequencer Kit (Applied Biosystems, 4337454). Sanger sequencing was done using an Applied Biosystems 3500 Genetic Analyzer. Chromatograms were cleaned and aligned using the software Geneious v.9.1¹¹⁰.

A haplotype network analysis was conducted using the 67 COI sequences obtained in this study combined with 218 C. variipennis complex sequences previously collected from 25 states across the U.S.¹¹¹. Sequences were aligned in MEGA v.10.1.8¹¹² and trimmed to ensure all sequences contained identical lengths. A median-joining analysis was performed using NETWORK v.5.0.1.0¹¹³. Specimens collected in this study were assigned a color based on the results from the SNP clustering analyses while the remaining samples were left unassigned. All individuals were used to calculate the mean uncorrected p-divergence between and within the different groupings inferred from the haplotype network using MEGA.

Data availability

The data reported in this study will be deposited in the Open Science Framework database upon acceptance, https://osf.io (https://doi.org/10.17605/OSF.IO/E3Z72). Mitochondrial sequences obtained in the current study have been deposited under Genbank Accession Numbers OL604713—OL604779.

References

De Queiroz, K. Species concepts and species delimitation. Syst. Biol. 56, 879–886. https://doi.org/10.1080/10635150701701083 (2007).
Article PubMed Google Scholar
Coyne, J. A. & Orr, H. A. Speciation (Sinauer Associates Inc, 2004).
Google Scholar
Endler, J. A. Gene flow and population differentiation: studies of clines suggest that differentiation along environmental gradients may be independent of gene flow. Science 179, 243–250 (1973).
Article ADS CAS PubMed Google Scholar
Mayr, E. Systematics and the Origin of Species, from the Viewpoint of a Zoologist (Harvard University Press, 1999).
Google Scholar
Richardson, J. L., Urban, M. C., Bolnick, D. I. & Skelly, D. K. Microgeographic adaptation and the spatial scale of evolution. Trends Ecol. Evol. 29, 165–176 (2014).
Article PubMed Google Scholar
Nosil, P. Ernst Mayr and the integration of geographic and ecological factors in speciation. Biol. J. Lin. Soc. 95, 26–46 (2008).
Article Google Scholar
Kisel, Y. & Barraclough, T. G. Speciation has a spatial scale that depends on levels of gene flow. Am. Nat. 175, 316–334 (2010).
Article PubMed Google Scholar
Leliaert, F. et al. DNA-based species delimitation in algae. Eur. J. Phycol. 49, 179–196 (2014).
Article Google Scholar
Carstens, B. C., Pelletier, T. A., Reid, N. M. & Satler, J. D. How to fail at species delimitation. Mol. Ecol. 22, 4369–4383 (2013).
Article PubMed Google Scholar
Schlick-Steiner, B. C. et al. Integrative taxonomy: a multisource approach to exploring biodiversity. Annu. Rev. Entomol. 55, 421–438 (2010).
Article CAS PubMed Google Scholar
Capblancq, T., Mavárez, J., Rioux, D. & Després, L. Speciation with gene flow: evidence from a complex of alpine butterflies (Coenonympha, Satyridae). Ecol. Evol. 9, 6444–6457 (2019).
Article PubMed PubMed Central Google Scholar
Pedraza-Marrón, C. d. R. et al. Genomics overrules mitochondrial DNA, siding with morphology on a controversial case of species delimitation. Proc. R. Soc. B 286, 20182924 (2019).
Article PubMed PubMed Central Google Scholar
Hinojosa, J. C. et al. A mirage of cryptic species: genomics uncover striking mitonuclear discordance in the butterfly Thymelicus sylvestris. Mol. Ecol. 28, 3857–3868 (2019).
Article PubMed Google Scholar
Nygren, A. et al. A mega-cryptic species complex hidden among one of the most common annelids in the North East Atlantic. PLoS ONE 13, e0198356 (2018).
Article PubMed PubMed Central CAS Google Scholar
Thielsch, A., Knell, A., Mohammadyari, A., Petrusek, A. & Schwenk, K. Divergent clades or cryptic species? Mito-nuclear discordance in a Daphnia species complex. BMC Evol. Biol. 17, 1–9 (2017).
Article CAS Google Scholar
Eyer, P. A. & Hefetz, A. Cytonuclear incongruences hamper species delimitation in the socially polymorphic desert ants of the Cataglyphis albicans group in Israel. J. Evol. Biol. 31, 1828–1842 (2018).
Article CAS PubMed Google Scholar
Borkent, A. Biology of Disease Vectors. 2nd edn, i–xxiii + 1–785 (Elsevier Academic Press, 2004).
Mellor, P., Boorman, J. & Baylis, M. Culicoides biting midges: their role as arbovirus vectors. Annu. Rev. Entomol. 45, 307–340 (2000).
Article CAS PubMed Google Scholar
Rushton, J. & Lyons, N. Economic impact of Bluetongue: a review of the effects on production. Veterinaria italiana 51, 401–406 (2015).
PubMed Google Scholar
Tabachnick, W. J. Culicoides vriipennis and Bluetongue-Virus eidemiology in the United States. Annu. Rev. Entomol. 41, 23–43. https://doi.org/10.1146/annurev.en.41.010196.000323 (1996).
Article CAS PubMed Google Scholar
Wirth, W. W. & Jones, R. H. The North American Subspecies of Culicoides variipennis (Diptera, Heleidae). U. S. Dep. Agric. Tech. Bull 1170, 1–35 (1957).
Google Scholar
Holbrook, F. R. et al. Sympatry in the Culicoides variipennis Complex (Diptera: Ceratopogonidae): a Taxonomic Reassessment. J. Med. Entomol. 37, 65–76. https://doi.org/10.1603/0022-2585-37.1.65 (2000).
Article CAS PubMed Google Scholar
Hopken, M. W. Pathogen Vectors at the Wildlife-Livestock Interface: Molecular Approaches to Elucidating Culicoides (Diptera: Ceratopogonidae) Biology (University of Colorado, 2016).
Google Scholar
Shults, P. A Study of the Taxonomy, Ecology, and Systematics of Culicoides Species (Diptera: Ceratopogonidae) Including those Associated with Deer Breeding Facilities in Southeast Texas (Texas A&M University, 2015).
Google Scholar
Velten, R. K. & Mullens, B. A. Field morphological variation and laboratory hybridization of Culicoides variipennis sonorensis and C. v. occidentalis (Diptera:Ceratopogonidae) in southern California. J. Med. Entomol. 34, 277–284 (1997).
Article CAS PubMed Google Scholar
Fontaine, M. C. et al. Extensive introgression in a malaria vector species complex revealed by phylogenomics. Science 347, 1258522 (2015).
Article PubMed CAS Google Scholar
Bolnick, D. I. & Otto, S. P. The magnitude of local adaptation under genotype-dependent dispersal. Ecol. Evol. 3, 4722–4735 (2013).
Article PubMed PubMed Central Google Scholar
Slatkin, M. Isolation by distance in equilibrium and non-equilibrium populations. Evolution 47, 264–279 (1993).
Article PubMed Google Scholar
Pante, E. et al. Species are hypotheses: avoid connectivity assessments based on pillars of sand. Mol. Ecol. 24, 525–544 (2015).
Article PubMed Google Scholar
Jacquet, S. et al. Colonization of the Mediterranean basin by the vector biting midge species Culicoides imicola: an old story. Mol. Ecol. 24, 5707–5725. https://doi.org/10.1111/mec.13422 (2015).
Article CAS PubMed Google Scholar
Onyango, M. G. et al. Genotyping of whole genome amplified reduced representation libraries reveals a cryptic population of Culicoides brevitarsis in the Northern Territory, Australia. BMC Genomics 17, 769. https://doi.org/10.1186/s12864-016-3124-1 (2016).
Article CAS PubMed PubMed Central Google Scholar
Onyango, M. G. et al. Delineation of the population genetic structure of Culicoides imicola in East and South Africa. Parasit. Vectors 8, 660. https://doi.org/10.1186/s13071-015-1277-4 (2015).
Article CAS PubMed PubMed Central Google Scholar
Mignotte, A. et al. High dispersal capacity of Culicoides obsoletus (Diptera: Ceratopogonidae), vector of bluetongue and Schmallenberg viruses, revealed by landscape genetic analyses. Parasit. Vectors 14, 1–14 (2021).
Article CAS Google Scholar
Sanders, C. J. & Carpenter, S. Assessment of an immunomarking technique for the study of dispersal of Culicoides biting midges. Infect. Genet. Evol. 28, 583–587 (2014).
Article PubMed Google Scholar
Kluiters, G., Swales, H. & Baylis, M. Local dispersal of palaearctic Culicoides biting midges estimated by mark-release-recapture. Parasit. Vectors 8, 86 (2015).
Article PubMed PubMed Central Google Scholar
Ducheyne, E. et al. Quantifying the wind dispersal of Culicoides species in Greece and Bulgaria. Geospat. Health 10, 177–189 (2007).
Article Google Scholar
Purse, B. V. et al. Climate change and the recent emergence of bluetongue in Europe. Nat. Rev. Microbiol. 3, 171–181 (2005).
Article CAS PubMed Google Scholar
Jacquet, S. et al. Range expansion of the Bluetongue vector, Culicoides imicola, in continental France likely due to rare wind-transport events. Sci. Rep. https://doi.org/10.1038/srep27247 (2016).
Article PubMed PubMed Central Google Scholar
Rundle, H. D. & Nosil, P. Ecological speciation. Ecol. Lett. 8, 336–352 (2005).
Article Google Scholar
Wang, I. J. & Bradburd, G. S. Isolation by environment. Mol. Ecol. 23, 5649–5662 (2014).
Article PubMed Google Scholar
Shults, P. A Study of Culicoides Biting Midges in the Subgenus Monoculicoides: Population Genetics, Taxonomy, Systematics, and Control. Ph.D. thesis, Texas A&M University (2021).
Jewiss-Gaines, A., Barelli, L. & Hunter, F. F. First records of Culicoides sonorensis (Diptera: Ceratopogonidae), a known vector of bluetongue virus, Southern Ontario. J. Med. Entomol. 54, 757–762. https://doi.org/10.1093/jme/tjw215 (2017).
Article CAS PubMed Google Scholar
Chan, K. M. & Levin, S. A. Leaky prezygotic isolation and porous genomes: rapid introgression of maternally inherited DNA. Evolution 59, 720–729 (2005).
Article CAS PubMed Google Scholar
Harrison, R. G. Hybrid zones: windows on evolutionary process. Oxf. Surv. Evol. Biol. 7, 69–128 (1990).
Google Scholar
Harrison, R. G. Animal mitochondrial DNA as a genetic marker in population and evolutionary biology. Trends Ecol. Evol. 4, 6–11 (1989).
Article CAS PubMed Google Scholar
Després, L. One, Two or More Species? Mitonuclear Discordance and Species Delimitation. Molecular ecology 28(17), 3845–3847 (2019).
Article PubMed Google Scholar
Janes, J. K. et al. The K= 2 conundrum. Mol. Ecol. 26, 3594–3602 (2017).
Article PubMed Google Scholar
De Meester, L., Vanoverbeke, J., Kilsdonk, L. J. & Urban, M. C. Evolving perspectives on monopolization and priority effects. Trends Ecol. Evol. 31, 136–146 (2016).
Article PubMed Google Scholar
Ballard, J. W. O., Chernoff, B. & James, A. C. Divergence of mitochondrial DNA is not corroborated by nuclear DNA, morphology, or behavior in Drosophila simulans. Evolution 56, 527–545 (2002).
Article PubMed Google Scholar
Behura, S., Sahu, S., Mohan, M. & Nair, S. Wolbachia in the Asian rice gall midge, Orseolia oryzae (Wood-Mason): Correlation between host mitotypes and infection status. Insect Mol. Biol. 10, 163–171 (2001).
Article CAS PubMed Google Scholar
Covey, H. et al. Cryptic Wolbachia (Rickettsiales: Rickettsiaceae) detection and prevalence in Culicoides (Diptera: Ceratopogonidae) midge populations in the United States. J. Med. Entomol. 57, 1262–1269. https://doi.org/10.1093/jme/tjaa003 (2020).
Article PubMed Google Scholar
Pagès, N., Muñoz-Muñoz, F., Verdún, M., Pujol, N. & Talavera, S. First detection of Wolbachia-infected Culicoides (Diptera: Ceratopogonidae) in Europe: Wolbachia and Cardinium infection across Culicoides communities revealed in Spain. Parasit. Vectors 10, 582. https://doi.org/10.1186/s13071-017-2486-9 (2017).
Article CAS PubMed PubMed Central Google Scholar
Pilgrim, J. et al. Cardinium symbiosis as a potential confounder of mtDNA based phylogeographic inference in Culicoides imicola (Diptera: Ceratopogonidae), a vector of veterinary viruses. Parasit. Vectors 14, 100. https://doi.org/10.1186/s13071-020-04568-3 (2021).
Article CAS PubMed PubMed Central Google Scholar
Hare, M. P. Prospects for nuclear gene phylogeography. Trends Ecol. Evol. 16, 700–706 (2001).
Article Google Scholar
Onyango, M. G. et al. Assessment of population genetic structure in the arbovirus vector midge, Culicoides brevitarsis (Diptera: Ceratopogonidae), using multi-locus DNA microsatellites. Vet. Res. 46, 108. https://doi.org/10.1186/s13567-015-0250-8 (2015).
Article CAS PubMed PubMed Central Google Scholar
Fonseca, D. M., Smith, J. L., Kim, H.-C. & Mogi, M. Population genetics of the mosquito Culex pipiens pallens reveals sex-linked asymmetric introgression by Culex quinquefasciatus. Infect. Genet. Evol. 9, 1197–1203 (2009).
Article CAS PubMed PubMed Central Google Scholar
Goubert, C., Minard, G., Vieira, C. & Boulesteix, M. Population genetics of the Asian tiger mosquito Aedes albopictus, an invasive vector of human diseases. Heredity 117, 125–134 (2016).
Article CAS PubMed PubMed Central Google Scholar
Lehmann, T. et al. Microgeographic structure of Anopheles gambiae in western Kenya based on mtDNA and microsatellite loci. Mol. Ecol. 6, 243–253 (1997).
Article CAS PubMed Google Scholar
Chapuis, M.-P. & Estoup, A. Microsatellite null alleles and estimation of population differentiation. Mol. Biol. Evol. 24, 621–631. https://doi.org/10.1093/molbev/msl191 (2006).
Article CAS PubMed Google Scholar
Manni, M. et al. Molecular markers for analyses of intraspecific genetic diversity in the Asian Tiger mosquito, Aedes albopictus. Parasit. Vectors 8, 1–11 (2015).
Article Google Scholar
Arntzen, J. W., Jehle, R., Bardakci, F., Burke, T. & Wallis, G. P. Asymmetric viability of reciprocal-cross hybrids between Crested and Marbled Newts (Triturus cristatus and T. marmoratus). Evolution 63, 1191–1202. https://doi.org/10.1111/j.1558-5646.2009.00611.x (2009).
Article PubMed Google Scholar
Gibeaux, R. et al. Paternal chromosome loss and metabolic crisis contribute to hybrid inviability in Xenopus. Nature 553, 337. https://doi.org/10.1038/nature25188 (2018).
Article ADS CAS PubMed PubMed Central Google Scholar
Werren, J. H., Baldo, L. & Clark, M. E. Wolbachia: master manipulators of invertebrate biology. Nat. Rev. Microbiol. 6, 741 (2008).
Article CAS PubMed Google Scholar
Servedio, M. R. & Kirkpatrick, M. The effects of gene flow on reinforcement. Evolution 51, 1764–1772. https://doi.org/10.1111/j.1558-5646.1997.tb05100.x (1997).
Article PubMed Google Scholar
Howard, D. J. Reinforcement: origin, dynamics, and fate of an evolutionary hypothesis. Hybrid zones and the evolutionary process, 46–69 (1993).
Yukilevich, R. Asymmetrical patterns of speciation uniquely support reinforcement in Drosophila. Evolution 66, 1430–1446. https://doi.org/10.1111/j.1558-5646.2011.01534.x (2012).
Article PubMed Google Scholar
Downes, J. A. The Culicoides variipennis complex: a necessary re-alignment of nomenclature (Diptera: Ceratopogonidae). Can. Entomol. 110, 63–69 (1978).
Article Google Scholar
Toews, D. P. & Brelsford, A. The biogeography of mitochondrial and nuclear discordance in animals. Mol. Ecol. 21, 3907–3930 (2012).
Article CAS PubMed Google Scholar
Smith, H. & Mullens, B. A. Seasonal activity, size, and parity of Culicoides occidentalis (Diptera: Ceratopogonidae) in a coastal southern California salt marsh. J. Med. Entomol. 40, 352–355. https://doi.org/10.1603/0022-2585-40.3.352 (2003).
Article PubMed Google Scholar
Linley, J. The effect of salinity on oviposition and egg hatching in Culicoides variipennis sonorensis (Diptera: Ceratopogonidae). J. Am. Mosq. Control Assoc. 2, 79–82 (1986).
CAS PubMed Google Scholar
Gerry, A. C. & Mullens, B. A. Response of Male Culicoides variipennis sonorensis (Diptera: Ceratopogonidae) to carbon dioxide and observations of mating behavior on and near cattle. J. Med. Entomol. 35, 239–244. https://doi.org/10.1093/jmedent/35.3.239 (1998).
Article CAS PubMed Google Scholar
Nolan, D. V. et al. Rapid diagnostic PCR assays for members of the Culicoides obsoletus and Culicoides pulicaris species complexes, implicated vectors of bluetongue virus in Europe. Vet. Microbiol. 124, 82–94 (2007).
Article CAS PubMed Google Scholar
Sebastiani, F. et al. Molecular differentiation of the Old World Culicoides imicola species complex (Diptera, Ceratopogonidae), inferred using random amplified polymorphic DNA markers. Mol. Ecol. 10, 1773–1786 (2001).
Article CAS PubMed Google Scholar
Carlson, D. Identification of mosquitoes of Anopheles gambiae species complex A and B by analysis of cuticular components. Science 207, 1089–1091 (1980).
Article ADS CAS PubMed Google Scholar
Palacios, G. et al. Characterization of the Sandfly fever Naples species complex and description of a new Karimabad species complex (genus Phlebovirus, family Bunyaviridae). J. Gen. Virol. 95, 292 (2014).
Article CAS PubMed PubMed Central Google Scholar
Rivas, G., Souza, N. & Peixoto, A. A. Analysis of the activity patterns of two sympatric sandfly siblings of the Lutzomyia longipalpis species complex from Brazil. Med. Vet. Entomol. 22, 288–290 (2008).
Article CAS PubMed Google Scholar
Wilson, W. C. et al. Current status of bluetongue virus in the Americas. Bluetongue 10, 197–220 (2009).
Article Google Scholar
Allen, S. E. et al. Epizootic Hemorrhagic Disease in White-Tailed Deer, Canada. Emerg. Infect. Dis. 25, 832–834. https://doi.org/10.3201/eid2504.180743 (2019).
Article PubMed PubMed Central Google Scholar
McGregor, B. L. et al. Field data implicating Culicoides stellifer and Culicoides venustus (Diptera: Ceratopogonidae) as vectors of epizootic hemorrhagic disease virus. Parasit. Vectors 12, 258. https://doi.org/10.1186/s13071-019-3514-8 (2019).
Article PubMed PubMed Central Google Scholar
Shults, P., Ho, A., Martin, E. M., McGregor, B. L. & Vargo, E. L. Genetic diversity of Culicoides stellifer (Diptera: Ceratopogonidae) in the Southeastern United States compared with sequences from Ontario, Canada. J. Med. Entomol. 57, 1324–1327. https://doi.org/10.1093/jme/tjaa025 (2020).
Article CAS PubMed Google Scholar
Mallet, J. Hybridization as an invasion of the genome. Trends Ecol. Evol. 20, 229–237 (2005).
Article PubMed Google Scholar
Ciota, A. T., Chin, P. A. & Kramer, L. D. The effect of hybridization of Culex pipiens complex mosquitoes on transmission of West Nile virus. Parasit. Vectors 6, 1–4 (2013).
Article Google Scholar
Meiswinkel, R., Gomulski, L., Delécolle, J., Goffredo, M. & Gasperi, G. The taxonomy of Culicoides vector complexes-unfinished business. Vet. Ital. 40, 151–159 (2004).
CAS PubMed Google Scholar
Ewels, P., Magnusson, M., Lundin, S. & Käller, M. MultiQC: summarize analysis results for multiple tools and samples in a single report. Bioinformatics (Oxford, England) 32, 3047–3048. https://doi.org/10.1093/bioinformatics/btw354 (2016).
Article CAS Google Scholar
Andrews, S. Babraham bioinformatics-FastQC a quality control tool for high throughput sequence data. https://www.bioinformatics.babraham.ac.uk/projects/fastqc (2010).
Rochette, N. C., Rivera-Colón, A. G. & Catchen, J. M. Stacks 2: Analytical methods for paired-end sequencing improve RADseq-based population genomics. Mol. Ecol. 28, 4737–4754 (2019).
Article CAS PubMed Google Scholar
Morales-Hojas, R. et al. The genome of the biting midge Culicoides sonorensis and gene expression analyses of vector competence for bluetongue virus. BMC Genomics 19, 624. https://doi.org/10.1186/s12864-018-5014-1 (2018).
Article CAS PubMed PubMed Central Google Scholar
Li, H. & Durbin, R. Fast and accurate short read alignment with Burrows-Wheeler transform. Bioinformatics (Oxford, England) 25, 1754–1760 (2009).
Article CAS Google Scholar
Pante, E. et al. Use of RAD sequencing for delimiting species. Heredity 114, 450–459 (2015).
Article CAS PubMed Google Scholar
Benestan, L. M. et al. Conservation genomics of natural and managed populations: building a conceptual and practical framework. Mol. Ecol. 25, 2967–2977 (2016).
Article PubMed Google Scholar
Lischer, H. E. & Excoffier, L. PGDSpider: an automated data conversion tool for connecting population genetics and genomics programs. Bioinformatics (Oxford, England) 28, 298–299 (2012).
Article CAS Google Scholar
Pina-Martins, F., Silva, D. N., Fino, J. & Paulo, O. S. Structure_threader: An improved method for automation and parallelization of programs structure, fastStructure and MavericK on multicore CPU systems. Mol. Ecol. Resour. 17, e268–e274 (2017).
Article CAS PubMed Google Scholar
Raj, A., Stephens, M. & Pritchard, J. K. Variational Inference of Population Structure in Large SNP Datasets. bioRxiv 10, 001073 (2013).
Google Scholar
R Core Team. R: A language and environment for statistical computing. R Foundation for Statistical Computing, Vienna, Austria.http://www.R-project.org/ (2013).
Jombart, Thibaut, and Caitlin Collins. A tutorial for discriminant analysis of principal components (DAPC) using adegenet 2.0. 0. London: Imperial College London, MRC Centre for Outbreak Analysis and Modelling (2015).
Stamatakis, A. RAxML version 8: a tool for phylogenetic analysis and post-analysis of large phylogenies. Bioinformatics (Oxford, England) 30, 1312–1313 (2014).
Article CAS Google Scholar
Leaché, A. D., Banbury, B. L., Felsenstein, J., De Oca, A.N.-M. & Stamatakis, A. Short tree, long tree, right tree, wrong tree: New acquisition bias corrections for inferring SNP phylogenies. Syst. Biol. 64, 1032–1047 (2015).
Article PubMed PubMed Central CAS Google Scholar
Pattengale, N. D., Alipour, M., Bininda-Emonds, O. R., Moret, B. M. & Stamatakis, A. How many bootstrap replicates are necessary?. J. Comput. Biol. 17, 337–354 (2010).
Article MathSciNet CAS PubMed Google Scholar
Trifinopoulos, J., Nguyen, L.-T., von Haeseler, A. & Minh, B. Q. W-IQ-TREE: A fast online phylogenetic tool for maximum likelihood analysis. Nucleic Acids Res. 44, W232–W235 (2016).
Article CAS PubMed PubMed Central Google Scholar
Kalyaanamoorthy, S., Minh, B. Q., Wong, T. K., Von Haeseler, A. & Jermiin, L. S. ModelFinder: Fast model selection for accurate phylogenetic estimates. Nat. Methods 14, 587–589 (2017).
Article CAS PubMed PubMed Central Google Scholar
Nguyen, L.-T., Schmidt, H. A., Von Haeseler, A. & Minh, B. Q. IQ-TREE: a fast and effective stochastic algorithm for estimating maximum-likelihood phylogenies. Mol. Biol. Evol. 32, 268–274 (2015).
Article CAS PubMed Google Scholar
Hoang, D. T., Chernomor, O., Von Haeseler, A., Minh, B. Q. & Vinh, L. S. UFBoot2: improving the ultrafast bootstrap approximation. Mol. Biol. Evol. 35, 518–522 (2018).
Article CAS PubMed Google Scholar
Guindon, S. et al. New algorithms and methods to estimate maximum-likelihood phylogenies: Assessing the performance of PhyML 30. Syst. Biol. 59, 307–321. https://doi.org/10.1093/sysbio/syq010 (2010).
Article CAS PubMed Google Scholar
Rousset, F. genepop’007: a complete re‐implementation of the genepop software for Windows and Linux. Molecular ecology resources 8(1), 103–106 (2008).
Article Google Scholar
Rousset, F. Genetic differentiation between individuals. J Evol Biol 13, 58–62 (2000).
Article Google Scholar
Loiselle, B. A., Sork, V. L., Nason, J. & Graham, C. Spatial genetic structure of a tropical understory shrub, Psychotria officinalis (Rubiaceae). Am. J. Bot. 82, 1420–1425 (1995).
Article Google Scholar
Hardy, O. & Vekemans, X. SPAGeDi 1.5. A program for Spatial Pattern Analysis of Genetic Diversity. User's manual http://ebe.ulb.ac.be/ebe/SPAGeDi_files/SPAGeDi_1.5_Manual.pdf. Université Libre de Bruxelles, Brussells, Belgium.[Google Scholar] (2015).
Jay, F., Sjödin, P., Jakobsson, M. & Blum, M. G. Anisotropic isolation by distance: the main orientations of human genetic differentiation. Mol. Biol. Evol. 30, 513–525 (2013).
Article CAS PubMed Google Scholar
Piry, S. et al. Mapping Averaged Pairwise Information (MAPI): a new exploratory tool to uncover spatial structure. Methods Ecol. Evol. 7, 1463–1475 (2016).
Article Google Scholar
Kearse, M. et al. Geneious Basic: an integrated and extendable desktop software platform for the organization and analysis of sequence data. Bioinformatics (Oxford, England) 28, 1647–1649. https://doi.org/10.1093/bioinformatics/bts199 (2012).
Article Google Scholar
Hopken, M. W. Pathogen Vectors at The Wildlife-Livestock Interface: Molecular Approaches to Elucidating Culicoides (Diptera: Ceratopogonidae) Ph.D. thesis, Colorado State University (2016).
Kumar, S., Stecher, G., Li, M., Knyaz, C. & Tamura, K. MEGA X: molecular evolutionary genetics analysis across computing platforms. Mol. Biol. Evol. 35, 1547–1549 (2018).
Article CAS PubMed PubMed Central Google Scholar
Bandelt, H. J., Forster, P. & Rohl, A. Median-joining networks for inferring intraspecific phylogenies. Mol. Biol. Evol. 16, 37–48. https://doi.org/10.1093/oxfordjournals.molbev.a026036 (1999).
Article CAS PubMed Google Scholar

Download references

Acknowledgements

The authors thank Art Borkent, Adam Jewess-Gaines, Dustin Swanson, Bonnie Ryan, Nadja Mayerle, multiple vector control agencies, USDA-APHIS-Wildlife Services, and several landowners for access to property or assisting with collecting specimens used in this study. Funding was provided by a USDA Non-Assistance Cooperative Agreement: 58-3020-9-007 and the Urban Entomology Endowment fund at Texas A&M University.

Author information

Authors and Affiliations

Department of Entomology, Texas A&M University, College Station, TX, 77843, USA
Phillip Shults, Pierre-Andre Eyer, Alexander Blumenfeld & Edward L. Vargo
USDA, APHIS, Wildlife Services, National Wildlife Research Center, Fort Collins, CO, 80521, USA
Matthew Hopken
Department of Ecology and Conservation Biology, Texas A&M University, College Station, TX, 77843, USA
Mariana Mateos
USDA-ARS Arthropod Borne Animal Disease Research Unit, 1515 College Ave, Manhattan, KS, 66502, USA
Phillip Shults & Lee W. Cohnstaedt

Authors

Phillip Shults
View author publications
You can also search for this author in PubMed Google Scholar
Matthew Hopken
View author publications
You can also search for this author in PubMed Google Scholar
Pierre-Andre Eyer
View author publications
You can also search for this author in PubMed Google Scholar
Alexander Blumenfeld
View author publications
You can also search for this author in PubMed Google Scholar
Mariana Mateos
View author publications
You can also search for this author in PubMed Google Scholar
Lee W. Cohnstaedt
View author publications
You can also search for this author in PubMed Google Scholar
Edward L. Vargo
View author publications
You can also search for this author in PubMed Google Scholar

Contributions

P.S., M.M., L.C., and E.V. planned and designed the study, P.S., M.H., and L.C. collected samples, P.S., P.E., and A.B. carried out lab work and analyzed the data, P.S. and P.E. produced the figures, P.S. led writing; M.H., P.E., A.B., M.M., L.C., and E.V. contributed to drafting and editing the manuscript, M.M., L.C., and E.V. provided supervision, P.S., L.C., and E.V. contributed to procuring funds.

Corresponding authors

Correspondence to Phillip Shults or Lee W. Cohnstaedt.

Ethics declarations

Competing interests

The authors declare no competing interests.

Additional information

Publisher's note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Supplementary Information

Supplementary Information.

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.

Reprints and permissions

About this article

Cite this article

Shults, P., Hopken, M., Eyer, PA. et al. Species delimitation and mitonuclear discordance within a species complex of biting midges. Sci Rep 12, 1730 (2022). https://doi.org/10.1038/s41598-022-05856-x

Download citation

Received: 26 August 2021
Accepted: 17 January 2022
Published: 02 February 2022
DOI: https://doi.org/10.1038/s41598-022-05856-x

This article is cited by

Molecular insights versus morphological traits: rethinking identification of the closely related Angiostrongylus cantonensis and Angiostrongylus malaysiensis
- Chanisara Kaenkaew
- Abigail Hui En Chan
- Urusa Thaenkham
Parasites & Vectors (2024)
Development of microsatellite markers for population genetics of biting midges and a potential tool for species identification of Culicoides sonorensis Wirth & Jones
- Phillip Shults
- Megan Moran
- Pierre-Andre Eyer
Parasites & Vectors (2022)
A Review of the Vector Status of North American Culicoides (Diptera: Ceratopogonidae) for Bluetongue Virus, Epizootic Hemorrhagic Disease Virus, and Other Arboviruses of Concern
- Bethany L. McGregor
- Phillip T. Shults
- Emily G. McDermott
Current Tropical Medicine Reports (2022)

Comments

By submitting a comment you agree to abide by our Terms and Community Guidelines. If you find something abusive or that does not comply with our terms or guidelines please flag it as inappropriate.

Subjects

Abstract

Similar content being viewed by others

Introduction

Results

SNP calling and clustering analyses

Inter- and intra-species population genetics

Haplotype network

Discussion

Materials and methods

Sample collection and sequencing

Raw sequence filtering and processing

Clustering analysis

Mitochondrial sequencing and haplotype network

Data availability

References

Acknowledgements

Author information

Authors and Affiliations

Contributions

Corresponding authors

Ethics declarations

Competing interests

Additional information

Publisher's note

Supplementary Information

Rights and permissions

About this article

Cite this article

Share this article

This article is cited by

Comments

Search

Quick links