Altitude and hillside orientation shapes the population structure of the Leishmania infantum vector Phlebotomus ariasi

Prudhomme, Jorian; De Meeûs, Thierry; Toty, Céline; Cassan, Cécile; Rahola, Nil; Vergnes, Baptiste; Charrel, Remi; Alten, Bulent; Sereno, Denis; Bañuls, Anne-Laure

doi:10.1038/s41598-020-71319-w

Download PDF

Article
Open access
Published: 02 September 2020

Altitude and hillside orientation shapes the population structure of the Leishmania infantum vector Phlebotomus ariasi

Jorian Prudhomme¹^na1,
Thierry De Meeûs²^na1,
Céline Toty¹,
Cécile Cassan¹,
Nil Rahola¹,
Baptiste Vergnes¹,
Remi Charrel³,
Bulent Alten⁴,
Denis Sereno² &
…
Anne-Laure Bañuls¹

Scientific Reports volume 10, Article number: 14443 (2020) Cite this article

1088 Accesses
4 Citations
2 Altmetric
Metrics details

Subjects

Abstract

Despite their role in Leishmania transmission, little is known about the organization of sand fly populations in their environment. Here, we used 11 previously described microsatellite markers to investigate the population genetic structure of Phlebotomus ariasi, the main vector of Leishmania infantum in the region of Montpellier (South of France). From May to October 2011, we captured 1,253 Ph. ariasi specimens using sticky traps in 17 sites in the North of Montpellier along a 14-km transect, and recorded the relevant environmental data (e.g., altitude and hillside). Among the selected microsatellite markers, we removed five loci because of stutter artifacts, absence of polymorphism, or non-neutral evolution. Multiple regression analyses showed the influence of altitude and hillside (51% and 15%, respectively), and the absence of influence of geographic distance on the genetic data. The observed significant isolation by elevation suggested a population structure of Ph. ariasi organized in altitudinal ecotypes with substantial rates of migration and positive assortative mating. This organization has implications on sand fly ecology and pathogen transmission. Indeed, this structure might favor the global temporal and spatial stability of sand fly populations and the spread and increase of L. infantum cases in France. Our results highlight the necessity to consider sand fly populations at small scales to study their ecology and their impact on pathogens they transmit.

Reconstructing the post-glacial spread of the sand fly Phlebotomus mascittii Grassi, 1908 (Diptera: Psychodidae) in Europe

Article Open access 08 December 2023

The influence of anthropogenic habitat fragmentation on the genetic structure and diversity of the malaria vector Anopheles cruzii (Diptera: Culicidae)

Article Open access 22 October 2020

The changing distribution of Leishmania infantum Nicolle, 1908 and its Mediterranean sandfly vectors in the last 140 kys

Article Open access 14 August 2019

Introduction

Phlebotomus sand flies are vectors of medically important pathogens, such as Leishmania, the causative agent of leishmaniasis¹, and arthropod-borne viruses (Toscana virus, Naples virus, and Sicilian virus)². Phlebotomus ariasi Tonnoir, 1921 is the predominant sand fly species in the French Cevennes region³, and one of the two proven vectors, with Phlebotomus perniciosus Newstead, 1911, of leishmaniasis, which is caused by Leishmania infantum in the South of France⁴. Sand flies are abundant in suburban and rural environments and are often close to human and domestic animal populations^4,5. Phlebotomus ariasi is found resting in houses, animal sheds, caves and weep holes in walls, near roads, and in villages. This species has a wide geographic distribution, including many countries of the Western Mediterranean region, such as Algeria, France, Italy, Morocco, Portugal, Spain, and Tunisia^{6,7,8,9,10,11,12,13,14,15,16}.

During the last 10 years, the risk of emergence or re-emergence of leishmaniasis¹⁷ and phlebovirus infections¹⁸ has increased in France, probably linked to the recent extension of the vector distribution. However, the biology and ecology of Ph. ariasi, one of the main vectors of L. infantum, remain poorly known.

Only few studies have been performed on this species. Analysis of cuticular hydrocarbons highlighted the presence of two Ph. ariasi populations (sylvatic and domestic) in the Cevennes region¹⁹. Studies based on isoenzyme data^20,21, random amplified polymorphic DNA²², and mitochondrial cytochrome b (cytb) gene sequences¹¹ showed differences among Ph. ariasi populations in known leishmaniasis foci in Europe. Finally, two microsatellite loci and other genetic markers were used to understand Ph. ariasi expansion in Europe during the Pleistocene glacial cycles²³.

However, no population genetics study tackled the influence of geographic (spatial organization) together with environmental (altitude and hillside) factors in the distribution of sand flies. Therefore, the aims of this work were to study the structure of Ph. ariasi populations at a local scale, and the impact of environmental factors (geographical distance, altitude, and hillside) on their spatial organization. For this purpose, sampling was performed in a well-documented area (Roquedur, Hérault, France), in which sand fly populations were already ecologically²⁴ and morphometrically²⁵ described, and where human and canine leishmaniasis caused by L. infantum are endemic^5,26. The analysis was carried out using 11 previously described microsatellite loci for Ph. ariasi²⁷.

Results

Genotyping

In total, 1,253 sand flies were genotyped using the 11 loci described in the Supplementary Table S1. Sand fly DNA samples in which more than six loci could not be amplified were removed from the analyses (n = 54 individuals, Group “/” in Supplementary Table S1).

Bayesian clustering

Discriminant analysis of principal components (DAPC) identified 40 clusters with a mean assignment probability P_Ass = 0.8568. Twenty individuals were grouped in one strongly differentiated cluster (cluster 21 with P_Ass = 1) (Fig. 1). Moreover, two outliers (one from cluster 2 and one from cluster 28) were close to cluster 21. Bayesian Analysis of Population Structure (BAPS) found 28 clusters (probabilities for number of clusters = 0.93641). A cluster of 22 individuals (BAPS cluster 28) included the 20 individuals from DAPC cluster 21 and the two outliers highlighted by DAPC. The optimal number of clusters found by STRUCTURE HARVESTER (used to visualize the results of STRUCTURE analysis) was two with ΔK = 16.03 (the second biggest ΔK was 2.55). Surprisingly, the clusters found by STRUCTURE did not match the DAPC or BAPS results. These two clusters grouped individuals from several stations with no obvious relation with any ecological or geographical parameter, and with a very small average assignment probability of individuals to their cluster (P_Ass = 0.53683).The partition found by STRUCTURE and STRUCTURE harvester with K = 2 is probably meaningless.

Cytb fragment sequencing of seven individuals from cluster 21 (# 12, 17, 18, 19, 856, 872, and 874) and one of the two outliers (# 23, cluster 2) showed 99–100% similarity with Ph. ariasi (GenBank accession number: KP685539.1, and sequences in Supplementary File 1). Therefore, the taxonomic status of the 22 outliers could not be elucidated. These 22 individuals (Group A in Supplementary Table S1) were all males captured in three stations (ST02, ST11 and ST12). Neither particular environmental condition nor specific morphological character was recorded for these individuals compared with the other sand fly specimens. The 22 outliers were homozygous for allele 204 at the Aria1 locus. This allele was not found in the other 1,177 individuals (Group B in Supplementary Table S1). The dendrogram (NJTree) seems to exclude two subsamples A from all other subsamples, while a third one appeared fully included within group B (Fig. 2). This structuring appears to be robust since the same tree was obtained with 6 (Aria2, Aria3, Aria4, Aria5, Aria13, and Aria14, loci selected after Linkage Disequilibrium (LD) and F-statistics analyses, see below) or 10 loci (excluding Aria1).

Except when specified otherwise, the 22 individuals of Group A were removed from the analyses to prevent Wahlund effects.

Locus selection for the analyses of Group B sand flies

The Aria1 locus was almost monomorphic (H_S = 0.06) and was removed from the data. Using regression approach we detected a marginally not significant Short Allele Dominance (SAD) for the Aria14 locus (p value = 0.0507). As SAD results from a preferential amplification of the shortest allele in heterozygous individuals²⁸, the Aria14 microsatellite profile of each homozygous individual was checked again following recommendation suggested in De Meeûs et al.²⁹, corrected, and then analyzed using GENEMAPPER 4.0. After correction, SAD could not be detected on this locus any longer (p value = 0.0731).

The Micro-Checker analysis suggested stutter artifacts at five loci (Aria2, Aria3, Aria10, Aria14, and Aria15). The unilateral exact binomial test indicated that the proportion of significant stuttering was not significantly higher than the expected proportion under the null hypothesis for Aria2 (p value = 0.2078), Aria3 (p value = 0.2078) and Aria14 (p value = 0.5819). Nevertheless, it was marginally not significant for Aria10 (p value = 0.0503) and highly significant for Aria15 (p value = 9.728e−06). Therefore, alleles close in size were pooled, avoiding pooling together only rare alleles (i.e., presence of at least one frequent allele in the pool) for these two loci as recommended²⁹. As the Aria10 locus became monomorphic after pooling, this locus was removed from the analysis. For Aria15, alleles 117, 119 and 121 were pooled with allele 123; allele 127 was pooled with allele 125; and allele 131 was pooled with allele 129. However, after correction, the Micro-Checker analysis and the unilateral exact binomial test (p value = 9.728e−06) highlighted that stuttering still affected this locus. Therefore, Aria 15 was also removed from the analysis.

On the remaining 9 loci, the linkage disequilibrium (LD) analysis indicated that 7 locus pairs (out of 11) displayed a significant LD (19.4%). Three of these locus pairs (Aria11 and Aria 12, Aria 11 and Aria 13, Aria 12 and Aria14) remained significant after Benjamini and Yekutieli adjustment. Aria11 and Aria12 displayed outlier profiles with strongly negative F_IS (Fig. 3), above average F_ST (Fig. 4), large F_IS and F_ST variance (Figs. 3, 4), and strong LD. These results suggested the non-neutrality of these loci. Therefore, Aria11 and Aria12 were also removed from the analysis.

To check the result stability, additional DAPC analyses were performed by successively removing the following loci Aria1, Aria10, Aria11, Aria12, and Aria15. As long as Aria1 was kept, the obtained pattern was the same as in Fig. 2 (data not shown). With the remaining six loci (Aria2, Aria3, Aria4, Aria5, Aria13, and Aria14), DAPC and BAPS provided no clear structure. This suggested that the distinction between group A (the 22 outliers) and group B (the 1,177 remaining individuals) only relied on the Aria1 locus (Supplementary Table S1).

Linkage disequilibrium and F-statistics with the six remaining loci

With the remaining six loci (Aria2, Aria3, Aria4, Aria5, Aria13, and Aria14), the proportion of significant LD tests was 13.3% (two locus pairs), but none of these tests remained significant after Benjamini and Yekutieli adjustment. There was still a rather small but significant heterozygote deficit in subsamples (average F_IS = 0.059, p value = 0.0001). Variation across loci appeared pronounced (Fig. 3), but the relationship between F_IS and F_ST was not significant (Spearman's ρ = 0.2319, p value = 0.3292). There was no relationship between the number of missing data at each locus and the F_IS values (ρ = − 0.029, p value = 0.5403). The jackknife method showed that the loci standard error of F_IS was 0.055 and was 27 times bigger than that for F_ST (0.002). The Micro-Checker analysis suggested the presence of null alleles at four loci (Aria2, Aria3, Aria4, and Aria14). According to the criteria described in De Meeûs³⁰ (see “Methods” section), null alleles only explained partly (if any) the observed heterozygote deficit.

To check the result stability again, the F_IS computed for the dataset that included also Group A (complete dataset) was compared with the one obtained without it (Group B) (Supplementary Table S1). The F_IS was bigger in the complete dataset than in Group B alone (F_IS = 0.079 and F_IS = 0.057, respectively), and the difference was significant (p value = 0.0486 unilateral Wilcoxon signed rank test). This translated into a Wahlund effect when Group A individuals were included in the data. These individuals were thus kept excluded.

Regression approach

A Principal Component Analysis (PCA) was undertaken with PCAGen (developed by J. Goudet, freely available at https://www2.unil.ch/popgen/softwares/pcagen.htm). The first two axes were significant using the broken stick criterion but not by permutation testing (p value = 0.2187 for axis 1 and p value = 0.1045 for axis 2) (see “Methods” section). Axis 1 and axis 2 represented 31% and 23% of the total inertia, respectively. We undertook two generalized linear model (glm) with the coordinates of subsamples at these axes (see “Methods” section). For the first axis, the minimum model, after a stepwise procedure, was: axis1 ~ hillside + latitude + longitude + latitude:hillside. For this axis, none of the included variables played a significant role (p value > 0.05 for all tests). For axis 2, the minimum model was: axis2 ~ hillside + altitude + latitude + longitude. Altitude and hillside played a significant role (p values < 0.001) and represented 51% and 15% of the total deviance, respectively (Table 1).

Table 1 Minimum model obtained for the regression of PCA axis 2 coordinates of Phlebotomus ariasi sampling sites.

Full size table

Isolation by altitude distance

As showed above, latitude and longitude had a weak effect on the genetic structure. Geographic parameters were thus removed from the model and isolation by altitudinal distance was tested using the Mantel test for each hillside separately (South–East and North–West, two tests).

When individuals were grouped by altitude levels (see Table 2 and “Methods” section), isolation by altitude distance was significant for the Northern (p value = 0.00605) and marginally non-significant for the Southern hillside (p value = 0.07345). When combined with the generalized binomial procedure³¹, computed with the MultiTest V1³², isolation by altitude appeared highly significant (p value = 0.0054).

Table 2 Sampling stations (ST) in the study area.

Full size table

Effective population sizes and migration

The average effective population size (N_e) was 69 individuals (range: 39 to 98) across subsamples and methods (see “Methods” section). N_e was not correlated with altitude (Spearman's ρ = 0.2216, p value = 0.7864). There was no effect of hillside (Kruskal–Wallis p value = 0.6242). Nei's unbiased estimator of genetic diversities³³ was H_S = 0.52 and H_T = 0.524 for the subsamples and total sample, respectively, with Meirmans and Hedrick's G_ST’’ = 0.017³⁴, and then N_em = (1 − G_ST’’)/4 G_ST’’ = 14 (immigrants per generation in each subpopulation assuming an island model of migration). The correlation between Nei's G_ST and H_S was strongly negative (rho = − 0.94, p value = 0.0083). Therefore, according to Wang’s criterion³⁵, it is more accurate using F_ST. F_ST with the ENA correction for null alleles and 95% confidence intervals (95% CI) after 5,000 bootstraps over loci were computed with FreeNA³⁶. This provided F_ST = 0.014 (95% CI = [0.006, 0.023]), with a corresponding N_em = 18 (95% CI [10, 40]).

Discussion

Locus selection

In this study, using 11 microsatellite markers, we investigated the genetic structure of Ph. ariasi populations in 1,253 individuals collected in 17 stations in the South of France (Supplementary Table S1). Different analyses (Micro-Checker, LD, F_IS and F_ST variance) allowed identifying loci with technical problems and/or loci that may not be neutral concerning natural selection. Consequently, we removed five of the eleven loci for various reasons including the presence of incurable stutter artifacts (Aria10 and Aria15), absence of polymorphism (Aria1), or non-neutral evolution (Aria11 and Aria12). One locus with SAD could be corrected.

Presence of outlier individuals

The Bayesian analyses (DAPC and BAPS) for all remaining loci (Aria2, Aria3, Aria4, Aria5, Aria13, and Aria14) confirmed the 22 outliers (Group A). These outliers were morphologically similar to Ph. ariasi and displayed 99–100% cytb sequence similarities. A previous study, based on chromatographic analysis of cuticular components, provided evidence that there are two distinct Ph. ariasi populations in our study area: one predominantly sylvatic, and the second one domestic¹⁹. However, in our study, there was no geographical distribution difference between individuals from Group A and Group B.

This finding could reflect the existence of cryptic species, as already suspected or reported for other sand fly species (e.g., Lutzomyia longipalpis^37,38, Lutzomyia umbratilis³⁹, and Sergentomyia bailyi⁴⁰). However, the distinction between Group A and Group B mainly depends on a single genetic marker (Aria1), which is fixed for different alleles in each group, while the other markers display a rather weak, though significant, signal. This result is supported by the dendrogram as the same tree was obtained with 6 or 10 loci (excluding Aria1). Additional molecular and biological studies will be necessary to test the cryptic species hypothesis (vector competence, interbreeding, hosts and niche preferences, behavior, etc.). As the taxonomic status of Group A individuals could not be elucidated, they were removed from the analyses.

Genetic structuring in ecotypes

We observed an important and significant heterozygote deficit, instead of the heterozygote excess expected for dioecious populations with random mating⁴¹. We found no evidence of Wahlund effect with the LD-based method described by Manangwa et al.⁴². Consequently, the heterozygote deficit could (non-exclusively) be explained by (1) null alleles; (2) allelic dropout; or (3) positive assortative mating. In the last case, this would require a mutual attraction of sexual partners based on a sufficient proportion of the genome to allow the hitchhiking of microsatellite markers, which are theoretically non-coding DNA sequences.

The glm approach showed a strong influence of altitude and hillside, and a weak (if any) influence of geographic distances on the genetic data. The proportion of deviance (67%) explained by altitude and hillside suggested the existence of ecotypes in this Ph. ariasi population.

The very strong migration rate estimated in our study (more than 25% of the effective subpopulation size) is hard to reconcile with the emergence of genetically distinct ecotypes. Different scenarios can explain ecotype structuring despite the strong migration rate: (1) the death of most immigrants before they can reproduce (unlikely scenario), and (2) significant assortative mating that can also explain the heterozygote deficit observed in our data (see above). The rate of codominant assortative mating based on genes homogeneously distributed in the genome can be approximated with the same equation as for selfing, as described in Hartl et al.⁴³ (page 272), with the equation a ≈ 2F_IS/(1 + F_IS). With a F_IS = 0.057, the assortative mating in our population would be ≈ 0.11 (≈ 11% of zygotes produced).

It is worth noting that previous research on the same species in the same study area highlighted differences in wing phenotypes according to altitude and hillside²⁵. It has been demonstrated that wing configuration is associated with wing beating frequency and mate recognition^44,45. These features would lead to a preferential choice of partners that could explain assortative mating and ecotype structuring. More studies are necessary to investigate the link between phenotypic (wing configuration) and genetic (microsatellite) diversity.

Ecological and epidemiological consequences of this genetic structuration

This structuring has undoubtedly implications on the ecology and evolution of sand flies. This structure might favor the global stability of their populations. Indeed, local environmental changes would have no or low effect on the overall population because the ecotypes would be affected independently. The high levels of migration rates associated with the well-known low capacity of flying of sand flies would help to colonize and recolonize at small-scale environments⁴⁶. It also could explain the global but slow (compared with that of invasive mosquitoes, such as Aedes albopictus) geographical expansion of sand flies observed in France¹⁷.

This particular population structure can also influence Leishmania transmission. Previous studies suggested the existence of different imbricated Leishmania transmission cycles at very small scales^47,48. However, due to the low flying capacities of sand flies^46,49, migrants are expected to disperse mainly over short distances. Therefore, the spread and increase of L. infantum cases in France might be mainly due to the movement of infected hosts.

This study demonstrates for the first time that Ph. ariasi presents a genetic structure in ecotypes. These data highlight the necessity to consider sand fly populations at small and specific scales to determine their ecology and its impact on Leishmania transmission. This structure may explain the long-term stability of sand fly populations.

Not many papers compare available clustering algorithms to date and it is thus hard to really understand when and why such algorithms will converge or diverge in the best partition they offer. For some attempt comparisons see: Latch et al.⁵⁰, Kaeuffer et al.⁵¹, Frantz et al.⁵², Blair et al.⁵³, Bohling et al.⁵⁴, Manangwa et al.⁴².

This type of study needs to be extended to other sand fly species. Indeed, the large diversities of sand fly populations worldwide may correlate with different ecological vector capacities and sensitivity to control measures.

Methods

Study area

The field study was performed in the South of France, on the “massif de l’Oiselette” hill situated between the “Hérault” (Ganges, Hérault) and “Arre” (Le Vigan, Gard) valleys. Sand flies were sampled along a 14 km transect from the “Saint Julien de la Nef” to “Le Vigan” villages, including “Roquedur-le-haut” (at 601 m above sea level) (Fig. 5; Table 2). This region has a Mediterranean sub-humid climate⁵⁵, and is characterized by the presence of plant species typical of scrubland habitats⁵.

The study area was divided in two hillsides with a South–East and North–West orientation, like in previous works performed in this area^3,24,25. Stations were selected based on the paper of Rioux et al.³, first to allow the comparison with the data obtained 30 years ago in terms of species distribution, density and abundance²⁴, second because these stations were distributed along the 14 km transect and represented the altitude and hillside diversity necessary for our population genetics study. Station 7 and Station 8 were not present in Rioux et al.³. These two stations are transitional between the two hillsides, and were thus added in the present study to obtain information on the consequences of transitional ecosystems. The stations were grouped according to altitude and hillside (see Fig. 5; Table 2). This area is characterized by the presence of various domestic animals, such as chicken, sheep, ducks, geese, horses, rabbits, cats, dogs, and also many different wild animals (wild boars, foxes, rodents, lizards, birds, etc.). These animals represent potential sand fly hosts. Moreover, cases of canine leishmaniasis were observed during the collection period (J.P. personal observation).

Sand fly collection and identification

Sand flies were collected monthly using 3,589 sticky traps (20 × 20 cm white paper covered with castor oil, as described by Alten et al.⁵⁶ and Ayhan et al.⁵⁷) between May and September 2011. Seventeen localities (sampling stations) were sampled (Fig. 5; Table 2) with a mean of 189 sticky traps per station, in various biotopes, inside and around human dwellings and animal sheds, close to the vegetation, and in wall crevices. Each trap was collected after 2 days.

In total, 1,253 sand flies were captured and transferred individually into 1.5 mL Eppendorf tubes with 96% ethanol and labeled. Prior to mounting, the sand fly head, genitalia and wings were removed. Heads and genitalia were cleared in Marc-André solution (chloral hydrate/acetic acid) and mounted in chloral gum⁴⁶. Each individual specimen was identified on the basis of the morphology of the pharynges and/or male genitalia or female spermathecae, using the keys of Abonnenc⁴⁶, Lewis⁶ and Killick-Kendrick et al.⁵⁸.

DNA extraction

Sand fly DNA was extracted using the Chelex method⁵⁹ described in Prudhomme et al.²⁷. Extraction was performed at the UMR “Unité des Virus Emergents” (UVE, IHU Méditerranée Infection, Marseille, France). Each entire sand fly was ground using a Mixer Mill MM300 (QIAGEN, Venlo, Netherlands) with one 3-mm tungsten bead in 200 μL Eagle’s Minimal Essential Medium at a frequency of 30 cycles s⁻¹ for 3 min. A volume of 140 μL of each sample was then used for DNA purification by adding the Chelex resin suspension⁵⁹ or by using the Eppendorf epMotion 5075 working station and the Macherey–Nagel NucleoSpin 96 Virus kit.

DNA amplification and genotyping

Based on the microsatellite position in the sequence and the repeated pattern structure, 11 of the most polymorphic loci were selected among the previously described 16 microsatellite markers for Ph. ariasi²⁷. Each 25 μL reaction mix included 1 pmol of forward (labeled with the fluorochrome FAM, ATT0565 or HEX) and reverse primers, 5 ng of sand fly DNA sample, 6 pmol of dNTP mix, 2.5 μL of 10X buffer, and 0.25 μL of Taq polymerase (ROCHE DIAGNOSTICS, 5 UI/μL). DNA was amplified in a thermal cycler using the following conditions: initial denaturation step at 95 °C for 10 min, followed by 40 cycles at 95 °C for 30 s, the specific annealing temperature of each locus²⁷ for 30 s, 72 °C for 1 min, and a final extension step at 72 °C for 10 min. For genotyping, 1 μL of PCR product was added into a standard loading mix (0.5 μL of the internal GeneScan 500LIZ dye size standard and 12.5 μL of Hi-Di formamide) (both from APPLIED BIOSYSTEMS) and sequenced on an ABI Prism 3130 Genetic Analyzer (APPLIED BIOSYSTEMS) automated sequencer. Profiles were read and analyzed using GENEMAPPER 4.0 [APPLIED BIOSYSTEMS, Foster City (CA)].

DNA from a subset of Ph. ariasi samples was used to check the species identification (see “Results” section) by amplifying a cytb gene fragment using the primers N1N-PDR [5′-CA(T/C) ATT CAA CC(A/T) GAA TGA TA-3′] and C3B-PDR [5′-GGT A(C/T)(A/T) TTG CCT CGA (T/A)TT CG(T/A) TAT GA-3′], according to a previously published protocol^60,61 and the following conditions: initial denaturation at 94 °C of 3 min; 5 cycles of denaturation (94 °C for 30 s), annealing (40 °C for 60 s), and extension (68 °C for 60 s), followed by 40 cycles of denaturation (94 °C for 60 s), annealing (44 °C for 60 s) and extension (68 °C for 60 s), and a final extension (68 °C for 10 min). Direct sequencing in both directions was performed by Eurofins Genomics.

Data analysis

Microsatellite raw data were formatted for CREATE⁶² that allowed their transformation into the formats needed for the different analyses. Samples with more than 50% of missing data were removed from the analyses.

Bayesian clustering

Several Bayesian clustering analyses were carried out. To validate the sand fly species identification, the first analysis was based on the 11 selected loci and included all sand fly samples. Then, to study the organization of Ph. ariasi populations, data were analyzed at different scales: hillside, altitude, and station. This analysis was performed with the loci selected after Linkage Disequilibrium (LD) and F-statistics analyses (see “Results” section).

As the clustering method accuracy can vary depending on the statistical properties of specific software programs and datasets^29,42,63, three different Bayesian clustering methods were used. First, a discriminant analysis of principal components (DAPC)⁶⁴ was performed using the adegenet⁶⁵ package for R⁶⁶. This was followed by a Bayesian Analysis of Population Structure (BAPS; admixture model^67,68, maximum number of clusters: 35, repetition: 50; that is freely available at https://www.helsinki.fi/bsg/software/BAPS/), and a STRUCTURE (version 2.3.4) analysis⁶⁹ (burning period: 10,000, number of clusters from 1 to 35, with the admixture model). STRUCTURE HARVESTER vA.2⁷⁰ was used to visualize the STRUCTURE analysis results, examine the ad hoc ΔK statistic, and determine the optimal number of clusters.

Finally, a dendrogram (NJTree) was built with MEGA 7⁷¹ from a Cavalli-Sforza and Edward's chord distance (D_CSE)⁷² matrix as recommended⁷³, between subsamples defined as combinations of the group obtained by Bayesian clustering and sampling stations. Because null alleles were suspected to occur, D_CSE was computed with the INA correction with FreeNA³⁶, after recoding missing data into null homozygotes following authors' recommendation.

Linkage disequilibrium and F-statistics

LD between each locus pair was tested with the G-based permutation test with 10,000 randomizations. This test was performed with F_STAT 2.9.4, an updated version of F_STAT 2.9.3⁷⁴ available at https://www.t-de-meeus.fr/ProgMeeusGB.html. This procedure is the most powerful for testing LD across different subsamples³². There are as many p values as locus pairs. Then, the False Discovery Rate (FDR) correction for multiple non-independent tests described by Benjamini et al.⁷⁵ was applied with R 3.5.1⁶⁶.

Wright's F statistics⁷⁶ were estimated with the Weir and Cockerham's unbiased estimators⁷⁷. Significant departure from 0, for F statistics, was tested by randomizing alleles between individuals within subsamples (deviation from the local random mating test) or individuals between subsamples within the total sample (population subdivision test). The p value corresponded to the number of times a statistic measured in randomized samples was as big as (or bigger than) the observed one (unilateral tests). For local panmixia, the statistic used was f (Weir and Cockerham's F_IS estimator). To test for subdivision, the G-based test⁷⁸ was used. According to De Meeûs et al.³², the G-based test is the most powerful procedure when combining tests across loci.

The 95% confidence intervals (CI) of F-statistics were computed using the jackknife over populations method for each locus or 5,000 bootstraps over loci for the averages, as described in De Meeûs et al.⁷⁹. Parameter estimates, testing, jackknife and bootstrap computations were done with F_STAT 2.9.4.

The determination procedure described by De Meeûs³⁰ and Manangwa et al.⁴² was used to discriminate demographic from technical causes of significant homozygote excess and LD. In the case of null alleles, F_IS and F_ST artificially increase and a positive correlation is expected between the statistics, F_IS standard error is at least twice that of F_ST, and a positive correlation is also expected between F_IS and the number of missing data (putative null homozygotes). Correlations were tested with the unilateral Spearman’s rank correlation test in R 3.5.1⁶⁶. The frequency of null alleles was assessed with Micro-Checker v2.2.3⁸⁰ using Brookfield's second method⁸¹.

The presence of stutter artifacts at each locus in each subsample was evaluated with Micro-Checker v2.2.3⁸⁰. A unilateral exact binomial test was used with R 3.5.1⁶⁶ to determine whether the observed proportion of significant stutter artifacts was greater than the expected 5% under the null hypothesis.

Short Allele Dominance (SAD) was assessed with the method described by Manangwa et al.⁴². The correlation between allele size and F_IT was tested with the unilateral Spearman’s rank correlation test in R 3.5.1⁶⁶. In the case of SAD, a negative correlation is expected between allele size and F_IT⁴².

In the case of significant stutter artifacts or SAD, the incriminated loci were corrected using the method described by De Meeûs et al.²⁹. Stuttering was addressed by pooling alleles close in size. To avoid a spurious increase of heterozygosity, each pooled group contained at least one frequent allele (e.g., with p value ≥ 0.05). SAD was addressed by going back to the chromatograms of homozygous individuals and trying to find a larger size micro-peak that could have been missed in the first reading.

In some instances, F_IS values were compared between subsample groups using the Wilcoxon signed rank test for paired data (the locus was the pairing unit).

Role of environmental factors on sand fly structuring

A principal component analysis (PCA) was done with PCAGen 1.2.1 (developed by J. Goudet, freely available at https://www2.unil.ch/popgen/softwares/pcagen.htm). The significance of the first axes was tested using the broken stick criterion⁸² and 10,000 permutations of individuals across subsamples. The metric of each axis divided by the total genetic diversity corresponded to Weir & Cockerham's Theta (F_ST estimator) (Goudet’s personal communication). The coordinates of subsamples for each significant axis were used as the response variable of generalized linear models (glm). General models were as follows: axis i ~ latitude + longitude + altitude + hillside + latitude: hillside + longitude:hillside + altitude:hillside (where i corresponded to the significant axes; latitude and longitude are the latitudinal and longitudinal GPS coordinates in decimal degrees; altitude is the altitude in meters; hillside: south–east or north–west; and "X:Y" represents the interaction between the explanatory variables X and Y). All glm's were done using R version 3.5.1⁶⁶, with the package rcmdr (R-commander)^83,84. Model selection was performed using a forward stepwise model selection procedure and the Akaike Information Criterion⁸⁵.

The influence of different factors (including interactions) was tested by examining the differences between models (complete, additive, and with one variable) with analyses of variances. As the entry order of explanatory variables matters in R analyses, the mean partial R² of each variable was calculated across all possible models.

Isolation by distance

Geographic distances were computed with Genepop version 4.7.0⁸⁶ using the Euclidian distance computed with the Universal Transverse Mercator (UTM) coordinates (Table 2). Genetic distances were estimated with the Cavalli-Sforza and Edwards chord distance⁷² D_CSE, the most powerful method to detect isolation by distance with microsatellite markers in most situations⁸⁷. Due to the presence of missing data, data were converted into the FreeNA format to compute D_CSE between each station pair with the ENA correction³⁶ that provides a very good correction for the isolation by distance regression slope in the case of null alleles⁸⁷. As these station pair ended into a non‐squared matrix that could not be handled by Genepop, the relationships between genetic and geographic distances were tested with a Mantel test (10⁴ permutations)⁸⁸ in F_STAT 2.9.4. As the Mantel test in F_STAT is bilateral by default, the p value was halved in the case of positive slope, or computed as: “1-(1-bilateral p values)/2” in the case of negative slope, to obtain unilateral p values for a positive slope.

The relationships between genetic and altitudinal distances were also tested in the conditions described above. To avoid any interaction, the analyses for the South and North hillsides were done separately. The two p values obtained were then combined with the generalized binomial test³¹, with MultiTest³², taking into account all tests as recommended when the number of combined tests k < 4⁸⁹.

Effective population sizes and migration

Four different methods were used to estimate the effective population sizes (N_e). The results obtained with these methods were averaged and weighted by the number of times a usable value (after removal of “infinity” results) was obtained. To obtain a range of possible N_e values, the same approach was performed for the minimum and maximum values obtained with each method.

First, NeEstimator v2⁹⁰ was used with the LD method^91,92 that applies a correction for missing data⁹³ and the molecular co-ancestry method⁹⁴. The LD method uses several threshold allele frequencies (0.05, 0.02, 0.01, and all alleles) to compute N_e. The average across the different values obtained with these frequencies was computed.

The intra- and inter-loci correlation method was also used to compute N_e⁹⁵ using Estim v2.2⁹⁶ (available at: https://www.t-demeeus.fr/ProgMeeusGB.html).

Finally, the heterozygote-excess method (expected in dioecious populations) described by Balloux⁴¹ was used for each locus that displayed heterozygote excess, as follows: N_e = [− 1/(2 F_IS)] −F_IS/(1 + F_IS).

To determine the number of immigrants, the standardized differentiation index described by Meirmans and Hedrick was computed to correct for polymorphism excess: G_ST" = n × (H_T − H_S)/[(n × H_T − H_S) × (1 − H_S)]³⁴, where H_T and H_S are Nei's unbiased estimates of genetic diversity in the total sample or within subsamples, respectively³³, and n is the number of subsamples. Genetic diversities were estimated with F_STAT 2.9.4. Assuming an island model, this value was then used to obtain an approximation for the number of immigrants within subpopulations as N_em = (1 − G_ST")/(4 × G_ST"). However, according to Wang’s criterion³⁵ if Nei’s G_ST³³ is negatively correlated with H_S, it is wiser to use F_ST for this computation.

Data availability

All resources used in this article are provided in the Supporting Information and all the analyses are detailed allowing the assessment or verification of the manuscript’s findings.

References

Dolmatova, A. V. & Demina, N. A. Les phlébotomes (Phlebotominae) et les maladies qu’ils transmettent. ORSTOM 20, 1–169 (1966).
Google Scholar
Bichaud, L. et al. Epidemiologic relationship between toscana virus infection and Leishmania infantum due to common exposure to Phlebotomus perniciosus sandfly vector. PLoS Negl. Trop. Dis. 5(9), e1328 (2011).
PubMed PubMed Central Google Scholar
Rioux, J.-A., Killick-Kendrick, R., Perieres, J., Turner, D. & Lanotte, G. Ecologie des Leishmanioses dans le sud de la France. 13. Les sites de “flanc de coteau”, biotopes de transmission privilégiés de la Leishmaniose viscérale en Cévennes. Ann . Parasitol. Hum. Comp. 55(4), 445–453 (1980).
CAS PubMed Google Scholar
Rioux, J.-A. et al. Ecology of leishmaniasis in the South of France. 22. Reliability and representativeness of 12 Phlebotomus ariasi, P. perniciosus and Sergentomyia minuta (Diptera: Psychodidae) sampling stations in Vallespir (eastern French Pyrenees region). Parasite 20, 34 (2013).
PubMed PubMed Central Google Scholar
Rioux, J.-A. et al. Epidémiologie des leishmanioses dans le Sud de la France. Monogr. l’Inst. Natl. Santé Rech. Méd. 20, 1–228 (1969).
Google Scholar
Lewis, D. J. A taxonomic review of the genus Phlebotomus (Diptera: Psychodidae). Bull. Br. Museum 45(2), 121–209 (1982).
Google Scholar
Rossi, E. et al. Mapping the main Leishmania phlebotomine vector in the endemic focus of the Mt. Vesuvius in southern Italy. Geospat. Health 1(2), 191–198 (2007).
PubMed Google Scholar
Ballart, C., Barón, S., Alcover, M. M., Portus, M. & Gallego, M. Distribution of phlebotomine sand flies (Diptera: Psychodidae) in Andorra: First finding of P. perniciosus and wide distribution of P. ariasi. Acta Trop. 122(1), 155–159 (2012).
CAS PubMed Google Scholar
Ballart, C. et al. Importance of individual analysis of environmental and climatic factors affecting the density of Leishmania vectors living in the same geographical area: The example of Phlebotomus ariasi and P. perniciosus in northeast Spain. Geospat. Health 8(2), 389–403 (2014).
PubMed Google Scholar
Boussaa, S., Neffa, M., Pesson, B. & Boumezzough, A. Phlebotomine sandflies (Diptera: Psychodidae) of southern Morocco: Results of entomological surveys along the Marrakech-Ouarzazat and Marrakech-Azilal roads. Ann. Trop. Med. Parasitol. 104(2), 163–170 (2010).
CAS PubMed Google Scholar
Franco, F. et al. Genetic structure of Phlebotomus (Larroussius) ariasi populations, the vector of Leishmania infantum in the western Mediterranean: Epidemiological implications. Int. J. Parasitol. 40(11), 1335–1346 (2010).
PubMed Google Scholar
Ready, P. Leishmaniasis emergence in Europe. Euro Surveill. 15(10), 19505 (2010).
CAS PubMed Google Scholar
Branco, S. et al. Entomological and ecological studies in a new potential zoonotic leishmaniasis focus in Torres Novas municipality, Central Region, Portugal. Acta Trop. 125(3), 339–348 (2013).
CAS PubMed Google Scholar
Barón, S. D. et al. Risk maps for the presence and absence of Phlebotomus perniciosus in an endemic area of leishmaniasis in southern Spain: Implications for the control of the disease. Parasitology 138(10), 1234–1244 (2011).
PubMed Google Scholar
Boudabous, R. et al. The phlebotomine fauna (Diptera: Psychodidae) of the eastern coast of Tunisia. J. Med. Entomol. 46(1), 1–8 (2009).
CAS PubMed Google Scholar
European Centre for Disease Prevention and Control E. Phlebotomine sand flies maps [internet] 2019 [10/01/19]. https://www.ecdc.europa.eu/en/disease-vectors/surveillance-and-disease-data/phlebotomine-maps.
Dedet, J.-P. Les leishmanioses en France métropolitaine. BEH Hors-Sér. 2010, 9–12 (2020).
Google Scholar
Depaquit, J., Grandadam, M., Fouque, F., Andry, P.-E. & Peyrefitte, C. Arthropod-borne viruses transmitted by Phlebotomine sandflies in Europe: A review. Euro Surveill. 15(10), 19507 (2010).
CAS PubMed Google Scholar
Kamhawi, S. et al. Two populations of Phlebotomus ariasi in the Cévennes focus of leishmaniasis in the south of France revealed by analysis of cuticular hydrocarbons. Med. Vet. Entomol. 1(1), 97–102 (1987).
CAS PubMed Google Scholar
Pesson, B., Wallon, M., Floer, M. & Kristensen, A. Étude isoenzymatique de populations méditerranéennes de phlébotomes du sous-genre Larroussius. Parassitologia 33, 471–476 (1991).
PubMed Google Scholar
Ballart, C., Pesson, B. & Gallego, M. Isoenzymatic characterization of Phlebotomus ariasi and P. perniciosus of canine leishmaniasis foci from Eastern Pyrenean regions and comparison with other populations from Europe. Parasite. 25, 3 (2018).
PubMed PubMed Central Google Scholar
Martin-Sanchez, J., Gramiccia, M., Pesson, B. & Morillas-Marquez, F. Genetic polymorphism in sympatric species of the genus Phlebotomus, with special reference to Phlebotomus perniciosus and Phlebotomus longicuspis (Diptera, Phlebotomidae). Parasite 7(4), 247–254 (2000).
CAS PubMed Google Scholar
Mahamdallie, S. S., Pesson, B. & Ready, P. D. Multiple genetic divergences and population expansions of a Mediterranean sandfly, Phlebotomus ariasi, in Europe during the Pleistocene glacial cycles. Heredity 106(5), 714–726 (2010).
PubMed PubMed Central Google Scholar
Prudhomme, J. et al. Ecology and spatiotemporal dynamics of sandflies in the Mediterranean Languedoc region (Roquedur area, Gard, France). Parasit. Vectors 8(1), 1–14 (2015).
Google Scholar
Prudhomme, J. et al. Ecology and morphological variations in wings of Phlebotomus ariasi (Diptera: Psychodidae) in the region of Roquedur (Gard, France): A geometric morphometrics approach. Parasit. Vectors 9(1), 578 (2016).
PubMed PubMed Central Google Scholar
Lachaud, L. et al. Surveillance of leishmaniases in France, 1999 to 2012. Euro Surveill. 18(29), 20534 (2013).
CAS PubMed Google Scholar
Prudhomme, J. et al. New microsatellite markers for multi-scale genetic studies on Phlebotomus ariasi Tonnoir, vector of Leishmania infantum in the Mediterranean area. Acta Trop. 142, 79–85 (2015).
CAS PubMed Google Scholar
Wattier, R., Engel, C. R., Saumitou-Laprade, P. & Valero, M. Short allele dominance as a source of heterozygote deficiency at microsatellite loci: Experimental evidence at the dinucleotide locus Gv1CT in Gracilaria gracilis (Rhodophyta). Mol. Ecol. 7(11), 1569–1573 (1998).
CAS Google Scholar
De Meeûs T, Chan CT, Ludwig JM, Tsao JI, Patel J, Bhagatwala J, Beati L. Deceptive combined effects of short allele dominance and stuttering: An example with Ixodes scapularis, the main vector of Lyme disease in the U.S.A. peerreviewed and recommended by PCI Evolutionary Biology. 2019.
De Meeûs, T. Revisiting, F_IS, F_ST, Wahlund Effects, and Null Alleles. J. Hered. 109(4), 446–456 (2018).
PubMed Google Scholar
Teriokhin, A. T., De Meeûs, T. & Guegan, J. F. On the power of some binomial modifications of the Bonferroni multiple test. J. Gener. Biol. 68(5), 332–340 (2007).
CAS Google Scholar
De Meeûs, T., Guégan, J.-F. & Teriokhin, A. T. MultiTest V.1.2., a program to binomially combine independent tests and performance comparison with other related methods on proportional data. BMC Bioinform. 10(1), 443 (2009).
Google Scholar
Nei, M. & Chesser, R. K. Estimation of fixation indices and gene diversities. Ann. Hum. Genet. 47(3), 253–259 (1983).
CAS PubMed MATH Google Scholar
Meirmans, P. G. & Hedrick, P. W. Assessing population structure: F_ST and related measures. Mol. Ecol. Resour. 11(1), 5–18 (2011).
PubMed Google Scholar
Wang, J. Does G_ST underestimate genetic differentiation from marker data? Mol. Ecol. 24(14), 3546–3558 (2015).
CAS PubMed Google Scholar
Chapuis, M. P. & Estoup, A. Microsatellite null alleles and estimation of population differentiation. Mol. Biol. Evol. 24(3), 621–631 (2007).
CAS PubMed Google Scholar
Maingon, R. et al. Genetic identification of two sibling species of Lutzomyia longipalpis (Diptera: Psychodidae) that produce distinct male sex pheromones in Sobral, Ceará State, Brazil. Mol. Ecol. 12(7), 1879–1894 (2003).
CAS PubMed Google Scholar
Bauzer, L. G., Souza, N. A., Maingon, R. D. & Peixoto, A. A. Lutzomyia longipalpis in Brazil: A complex or a single species? A mini-review. Mem. Inst. Oswaldo Cruz. 102(1), 1–12 (2007).
PubMed Google Scholar
Scarpassa, V. M. & Alencar, R. B. Lutzomyia umbratilis, the main vector of Leishmania guyanensis, represents a novel species complex? PLoS One 7(5), e37341 (2012).
ADS CAS PubMed PubMed Central Google Scholar
Tharmatha, T., Gajapathy, K., Ramasamy, R. & Surendran, S. N. Morphological and molecular identification of cryptic species in the Sergentomyia bailyi (Sinton, 1931) complex in Sri Lanka. Bull. Entomol. Res. 107(1), 58–65 (2016).
PubMed Google Scholar
Balloux, F. Heterozygote excess in small populations and the heterozygote-excess effective population size. Evolution 58(9), 1891–1900 (2004).
PubMed Google Scholar
Manangwa, O. et al. Detecting Wahlund effects together with amplification problems: Cryptic species, null alleles and short allele dominance in Glossina pallidipes populations from Tanzania. Mol. Ecol. Resour. 19(3), 757–772 (2019).
PubMed Google Scholar
Hartl, D. L. & Clarck, A. G. Principles of Population Genetics 2nd edn. (Sinauer Associates Inc, Sunderland, 1989).
Google Scholar
Araki, A. S. et al. Multilocus analysis of divergence and introgression in sympatric and allopatric sibling species of the Lutzomyia longipalpis complex in Brazil. PLoS Negl Trop Dis. 7(10), e2495 (2013).
PubMed PubMed Central Google Scholar
Kyriacou, C. Sex and rhythms in sandflies and mosquitoes: an appreciation of the work of Alexandre Afranio Peixoto (1963–2013). Infect. Genet. Evol. 28, 662–665 (2014).
PubMed PubMed Central Google Scholar
Abonnenc E. Les phlébotomes de la région éthiopienne (Diptera, Psychodidae): Cahiers de l'ORSTOM, série Entomologie médicale et Parasitologie; 1972 01/01. 239.
Rougeron, V. et al. Reproductive strategies and population structure in Leishmania: Substantial amount of sex in Leishmania Viannia guyanensis. Mol. Ecol. 20(15), 3116–3127 (2011).
PubMed Google Scholar
Rougeron, V. et al. Multifaceted population structure and reproductive strategy in Leishmania donovani complex in one Sudanese village. PLoS Negl. Trop. Dis. 5(12), e1448 (2011).
PubMed PubMed Central Google Scholar
Rioux, J.-A. et al. Ecologie des Leishmanioses dans le sud de la France. 12. Dispersion horizontale de Phlebotomus ariasi Tonnoir, 1921. Experiences préliminaires. Ann. Parasitol. Hum. Comp. 54(6), 673–682 (1979).
CAS PubMed Google Scholar
Latch, E. K., Dharmarajan, G., Glaubitz, J. C. & Rhodes, O. E. Relative performance of Bayesian clustering software for inferring population substructure and individual assignment at low levels of population differentiation. Conserv. Genet. 7(2), 295–302 (2006).
Google Scholar
Kaeuffer, R., Réale, D., Coltman, D. & Pontier, D. Detecting population structure using STRUCTURE software: Effect of background linkage disequilibrium. Heredity 99(4), 374–380 (2007).
CAS PubMed Google Scholar
Frantz, A. C., Cellina, S., Krier, A., Schley, L. & Burke, T. Using spatial Bayesian methods to determine the genetic structure of a continuously distributed population: Clusters or isolation by distance? J. Appl. Ecol. 46(2), 493–505 (2009).
Google Scholar
Blair, C. et al. A simulation-based evaluation of methods for inferring linear barriers to gene flow. Mol. Ecol. Resour. 12(5), 822–833 (2012).
PubMed Google Scholar
Bohling, J. H. et al. Describing a developing hybrid zone between red wolves and coyotes in eastern North Carolina, USA. Evol. Appl. 9(6), 791–804 (2016).
PubMed PubMed Central Google Scholar
Le, D. P. bioclimat Mediterraneen: Analyse des formes climatiques par le systeme d’Emberger. Vegetation 34(2), 87–103 (1977).
Google Scholar
Alten, B. et al. Sampling strategies for phlebotomine sand flies (Diptera: Psychodidae) in Europe. Bull. Entomol. Res. 105(6), 664–678 (2015).
CAS PubMed Google Scholar
Ayhan, N. et al. Practical guidelines for studies on sandfly-borne phleboviruses: Part I: Important points to consider ante field work. Vector Borne Zoonot. Dis. 17(1), 73–80 (2017).
Google Scholar
Killick-Kendrick, R. et al. The identification of female sandflies of the subgenus Larroussius by the morphology of the spermathecal ducts. Parassitologia 33, 335–347 (1991).
PubMed Google Scholar
Wang, Q. & Wang, X. Comparison of methods for DNA extraction from a single chironomid for PCR analysis. Pak. J. Zool. 44(2), 421–426 (2012).
CAS Google Scholar
Esseghir, S., Ready, P. D., Killick-Kendrick, R. & Ben-Ismail, R. Mitochondrial haplotypes and phylogeography of Phlebotomus vectors of Leishmania major. Insect. Mol. Biol. 6(3), 221–225 (1997).
Google Scholar
Depaquit, J., Leger, N. & Randrianambinintsoa, F. J. Paraphyly of the subgenus Anaphlebotomus and creation of Madaphlebotomus subg. Nov. (Phlebotominae: Phlebotomus). Med. Vet. Entomol. 29(2), 159–170 (2015).
CAS PubMed Google Scholar
Coombs, J. A., Letcher, B. H. & Nislow, K. H. Create: A software to create input files from diploid genotypic data for 52 genetic software programs. Mol. Ecol. Resour. 8(3), 578–580 (2008).
CAS PubMed Google Scholar
Bohling, J. H., Adams, J. R. & Waits, L. P. Evaluating the ability of Bayesian clustering methods to detect hybridization and introgression using an empirical red wolf data set. Mol. Ecol. 22(1), 74–86 (2013).
PubMed Google Scholar
Jombart, T., Devillard, S. & Balloux, F. Discriminant analysis of principal components: A new method for the analysis of genetically structured populations. BMC Genet. 11(1), 94 (2010).
PubMed PubMed Central Google Scholar
Jombart, T. adegenet: A R package for the multivariate analysis of genetic markers. Bioinformatics 24(11), 1403–1405 (2008).
CAS PubMed Google Scholar
R Development Core Team RT. R: A language and environment for statistical computing. R Foundation for Statistical Computing, Vienna, Austria, 2012. https://www.R-project.org/. 2018.
Corander, J. & Marttinen, P. Bayesian identification of admixture events using multilocus molecular markers. Mol. Ecol. 15(10), 2833–2843 (2006).
PubMed Google Scholar
Corander, J., Marttinen, P., Siren, J. & Tang, J. Enhanced Bayesian modelling in BAPS software for learning genetic structures of populations. BMC Bioinform. 9, 539 (2008).
Google Scholar
Pritchard, J. K., Stephens, M. & Donnelly, P. Inference of population structure using multilocus genotype data. Genetics 155(2), 945–959 (2000).
CAS PubMed PubMed Central Google Scholar
Earl, D. A. & vonHoldt, B. M. STRUCTURE HARVESTER: A website and program for visualizing STRUCTURE output and implementing the Evanno method. Conserv. Genet. Resourc. 4(2), 359–361 (2012).
Google Scholar
Kumar, S., Stecher, G. & Tamura, K. MEGA7: Molecular evolutionary genetics analysis version 7.0 for bigger datasets. Mol. Biol. Evol. 33(7), 1870–1874 (2016).
CAS PubMed PubMed Central Google Scholar
Cavalli-Sforza, L. L. & Edwards, A. W. F. Phylogenetic analysis. Models and estimation procedures. Am. J. Hum. Genet. 19(3 Pt 1), 233–257 (1967).
CAS PubMed PubMed Central Google Scholar
Takezaki, N. & Nei, M. Genetic distances and reconstruction of phylogenetic trees from microsatellite DNA. Genetics 144(1), 389–399 (1996).
CAS PubMed PubMed Central Google Scholar
Goudet, J. FSTAT (Version 1.2): A computer program to calculate F-statistics. J. Hered. 86(6), 485–486 (1995).
Google Scholar
Benjamini, Y. & Yekutieli, D. The control of the false discovery rate in multiple testing under dependency. Ann. Stat. 29(4), 1165–1188 (2001).
MathSciNet MATH Google Scholar
Wright, S. The interpretation of population structure by F-statistics with special regard to systems of mating. Evolution 19(3), 395–420 (1965).
Google Scholar
Weir, B. S. & Cockerham, C. C. Estimating F-statistics for the analysis of population structure. Evolution 38(6), 1358–1370 (1984).
CAS PubMed Google Scholar
Goudet, J., Raymond, M., De Meeûs, T. & Rousset, F. Testing differentiation in diploid populations. Genetics 20, 144 (1996).
Google Scholar
De Meeûs, T. et al. Population genetics and molecular epidemiology or how to “débusquer la bête”. Infect. Genet. Evol. 20, 7 (2007).
Google Scholar
Van Oosterhout, C., Hutchinson, W. F., Wills, D. P. & Shipley, P. MICRO-CHECKER: Software for identifying and correcting genotyping errors in microsatellite data. Mol. Ecol. Notes. 4(3), 535–538 (2004).
Google Scholar
Brookfield, J. F. A simple new method for estimating null allele frequency from heterozygote deficiency. Mol. Ecol. 5(3), 453–455 (1996).
CAS PubMed Google Scholar
Frontier, S. Étude de la décroissance des valeurs propres dans une analyse en composantes principales: Comparaison avec le modèle du bâton brisé. J. Exp. Mar. Biol. Ecol. 25, 67–75 (1976).
Google Scholar
Fox, J. & The, R. Commander: A basic-statistics graphical user interface to R. J. Stat. Softw. 14(9), 1–42 (2005).
Google Scholar
Fox, J. Extending the R Commander by “Plug-In” Packages. R News 7(3), 46–52 (2007).
Google Scholar
Akaïke, H. A new look at the statistical model identification. IEEE Trans. Autom. Control 19(6), 716–723 (1974).
ADS MathSciNet MATH Google Scholar
Rousset, F. GENEPOP’007: A complete re-implementation of the GENEPOP software for Windows and Linux. Mol. Ecol. Resour. 20, 8 (2008).
Google Scholar
Séré, M., Thevenon, S., Belem, A. M. G. & De Meeus, T. Comparison of different genetic distances to test isolation by distance between populations. Heredity 119(2), 55–63 (2017).
PubMed PubMed Central Google Scholar
Mantel, N. The detection of disease clustering and a generalized regression approach. Cancer Res. 27(2), 209–220 (1967).
CAS PubMed Google Scholar
De Meeûs, T. Statistical decision from k test series with particular focus on population genetics tools: A DIY notice. Infect. Genet. Evol. 22, 91–93 (2014).
PubMed Google Scholar
Do, C. et al. NeEstimator v2: Re-implementation of software for the estimation of contemporary effective population size (N_e) from genetic data. Mol. Ecol. Resour. 14(1), 209–214 (2014).
CAS PubMed Google Scholar
Waples, R. S. A bias correction for estimates of effective population size based on linkage disequilibrium at unlinked gene loci*. Conserv. Genet. 7(2), 167–184 (2006).
Google Scholar
Waples, R. S. & Do, C. ldne: A program for estimating effective population size from data on linkage disequilibrium. Mol. Ecol. Resour. 8(4), 753–756 (2008).
PubMed Google Scholar
Peel, D., Waples, R. S., Macbeth, G. M., Do, C. & Ovenden, J. R. Accounting for missing data in the estimation of contemporary genetic effective population size (N_e). Mol. Ecol. Resour. 13(2), 243–253 (2013).
CAS PubMed Google Scholar
Nomura, T. Estimation of effective number of breeders from molecular coancestry of single cohort sample. Evol. Appl. 1(3), 462–474 (2008).
PubMed PubMed Central Google Scholar
Vitalis, R. & Couvet, D. Estimation of effective population size and migration rate from one- and two-locus identity measures. Genetics 157(2), 911–925 (2001).
CAS PubMed PubMed Central Google Scholar
Vitalis, R. & Couvet, D. Estim 1.0: A computer program to infer population parameters from one- and two-locus gene identity probabilities. Mol. Ecol. Notes 1(4), 354–356 (2005).
Google Scholar

Download references

Acknowledgements

This work was supported by EU Grant FP7-261504 EDENext (https://www.edenext.eu). The contents of this publication are the sole responsibility of the authors and do not necessarily reflect the views of the European Commission. The authors wish to thanks the ISEM GenSeq technical facilities, Institut des Sciences de l’Evolution de Montpellier, and the Centre Méditerranéen de l’Environnement et de la Biodiversité where some of the samples were prepared. We are also particularly grateful to Pr Rioux for sharing knowledge of the study area, Mr Lacoste and the “Fondation France”, IRD (Institut de Recherche pour le Développement), CNRS (Centre National de la Recherche Scientifique) and INFRAVEC2 project (https://infravec2.eu/) for financial support. We also thank Elisabetta Andermacher for assistance in preparing and editing the manuscript and Jérôme Goudet for useful tips regarding PCAGen.

Author information

These authors contributed equally: Jorian Prudhomme and Thierry De Meeûs.

Authors and Affiliations

MIVEGEC Univ Montpellier, IRD, CNRS, Centre IRD, 911 avenue Agropolis, 34394, Montpellier, France
Jorian Prudhomme, Céline Toty, Cécile Cassan, Nil Rahola, Baptiste Vergnes & Anne-Laure Bañuls
INTERTRYP, IRD, Cirad, Univ Montpellier, Montpellier, France
Thierry De Meeûs & Denis Sereno
Unité des Virus Emergents (UVE: Aix Marseille Univ, IRD 190, INSERM 1207, IHU Méditerranée Infection), 13385, Marseille, France
Remi Charrel
ESRL Laboratories, Department of Biology, Ecology Section, Faculty of Science, Hacettepe University, 0680, Beytepe, Ankara, Turkey
Bulent Alten

Authors

Jorian Prudhomme
View author publications
You can also search for this author in PubMed Google Scholar
Thierry De Meeûs
View author publications
You can also search for this author in PubMed Google Scholar
Céline Toty
View author publications
You can also search for this author in PubMed Google Scholar
Cécile Cassan
View author publications
You can also search for this author in PubMed Google Scholar
Nil Rahola
View author publications
You can also search for this author in PubMed Google Scholar
Baptiste Vergnes
View author publications
You can also search for this author in PubMed Google Scholar
Remi Charrel
View author publications
You can also search for this author in PubMed Google Scholar
Bulent Alten
View author publications
You can also search for this author in PubMed Google Scholar
Denis Sereno
View author publications
You can also search for this author in PubMed Google Scholar
Anne-Laure Bañuls
View author publications
You can also search for this author in PubMed Google Scholar

Contributions

D.S., B.A. and A.-L.B. designed the study. J.P. did the molecular characterization. J.P. and T.D.M. analyzed the data. D.S. J.P., C.T., C.C., N.R., B.V., R.C., B.A., D.S. and A.-L.B. contributed to field sampling, sand flies species identification and sample preparation. J.P., T.D.M. and A.-L.B. wrote the manuscript with support from C.T., C.C., N.R., B.V., R.C, B.A. All authors have read and approved the actual version of the manuscript.

Corresponding author

Correspondence to Jorian Prudhomme.

Ethics declarations

Competing interests

The authors declare no competing interests.

Additional information

Publisher's note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Supplementary information

Supplementary Table

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.

Reprints and permissions

About this article

Cite this article

Prudhomme, J., De Meeûs, T., Toty, C. et al. Altitude and hillside orientation shapes the population structure of the Leishmania infantum vector Phlebotomus ariasi. Sci Rep 10, 14443 (2020). https://doi.org/10.1038/s41598-020-71319-w

Download citation

Received: 03 April 2020
Accepted: 15 July 2020
Published: 02 September 2020
DOI: https://doi.org/10.1038/s41598-020-71319-w

Comments

By submitting a comment you agree to abide by our Terms and Community Guidelines. If you find something abusive or that does not comply with our terms or guidelines please flag it as inappropriate.

Subjects

Abstract

Similar content being viewed by others

Reconstructing the post-glacial spread of the sand fly Phlebotomus mascittii Grassi, 1908 (Diptera: Psychodidae) in Europe

The influence of anthropogenic habitat fragmentation on the genetic structure and diversity of the malaria vector Anopheles cruzii (Diptera: Culicidae)

The changing distribution of Leishmania infantum Nicolle, 1908 and its Mediterranean sandfly vectors in the last 140 kys

Introduction

Results

Genotyping

Bayesian clustering

Locus selection for the analyses of Group B sand flies

Linkage disequilibrium and F-statistics with the six remaining loci

Regression approach

Isolation by altitude distance

Effective population sizes and migration

Discussion

Locus selection

Presence of outlier individuals

Genetic structuring in ecotypes

Ecological and epidemiological consequences of this genetic structuration

Methods

Study area

Sand fly collection and identification

DNA extraction

DNA amplification and genotyping

Data analysis

Bayesian clustering

Linkage disequilibrium and F-statistics

Role of environmental factors on sand fly structuring

Isolation by distance

Effective population sizes and migration

Data availability

References

Acknowledgements

Author information

Authors and Affiliations

Contributions

Corresponding author

Ethics declarations

Competing interests

Additional information

Publisher's note

Supplementary information

Supplementary information

Supplementary Table

Rights and permissions

About this article

Cite this article

Share this article

Comments

Search

Quick links