Introduction

Malaria is considered the parasitic disease with the greatest impact on public health1. Plasmodium spp. infection is perpetuated in a cycle of disease and poverty, worsening affected individuals’ quality of life and limiting the possibility of eradicating such infections2.

Malaria is transmitted by female mosquitoes of the genus Anopheles, with humans and other mammals acting as vertebrate hosts1,3. Six species of the genus Plasmodium have been described as causing malaria in human beings: P. falciparum, P. vivax, P. malariae, P. ovale curtisi, P. ovale wallikeri and P. knowlesi4,5. Plasmodium spp. are endemic in 91 countries and cause 212 million cases of infection per year (429,000 leading to death)6.

Current mitigation measures in disease-endemic countries have not had the desired impact since an increase in malaria cases has been reported for countries such as Colombia, where 55,866 cases were confirmed in 2015 (annual parasite index: 5.4 cases per 1,000 inhabitants)7,8. Colombia thus accounts for 10% of cases of malaria in the Americas9,10,11, with Colombia’s Amazon region being the focus of an outbreak of malaria during the last few years7,12.

The Amazon basin, which covers a large part of southern Colombia (108,951 km2), is a major focus of transmission and disease burden13,14 and operates relatively independently from other Colombian regions. The Amazon region’s habitat diversity and climatic characteristics (seasonal rainfall effects) determine vector presence and abundance (i.e. Anopheles benarrochi, Anopheles oswaldoi, Anopheles darlingi). Such vectors are anthropophilic and highly efficient at transmitting parasites; together with the demographics of human settlements, clinical and housing conditions in the region and their related dynamics, they have facilitated the increase in cases of malaria amongst the region’s inhabitants14,15.

Risk factors for acquiring malaria have been described on different levels (genetic, social determinants and environmental factors) and influence exposure to parasitic infection, its course and outcomes16,17,18,19. These factors also facilitate infection by more than one Plasmodium spp. (mixed-species malaria); however, mixed-species infections are currently under-diagnosed because conventional techniques are used10. Little is currently known regarding the biology and establishment of Plasmodium mixed infections, but insight into the frequency of mixed-species infections in the population and the factors affecting their transmission is essential for developing effective disease elimination measures20,21. The factors involved in malaria transmission and those influencing mixed-species Plasmodium infection in highly endemic regions need to be determined, particularly at a time when rapid climatic changes can modify host-vector-pathogen relationship dynamics.

This study aimed to establish the frequency of three Plasmodium spp. within the population, determine the distribution of mixed infections and identify infected patient profiles in the Colombian Amazon region.

Results

Characteristics of the population being analysed

Of the 2,106 patients invited to participate in the study, 5.3% (n = 111) were excluded due to negative results with human β-globin gene amplification; 1,995 subjects thus became the object of statistical analysis. The sampling region was divided into areas in accordance with the population characteristics (Fig. 1); 344 samples were taken in area 1, 257 samples in area 2, 566 samples in area 3 and 828 samples in area 4 (Additional file 1: Table S1). The average age of the population was 26.6 years (SD: 19.8 years) and 48.2% (n = 961) reported a previous episode of malaria, mainly those living in area 4 (n = 441). Table 1 provides the distribution of sociodemographic characteristics amongst the population in accordance with the Plasmodium spp. infection stage (as determined by molecular biology).

Figure 1

Geographical locations of the 57 localities where samples were collected (this map was modified from a map downloaded from the Instituto Geográfico Agustín Codazzi, IGAC)60,61. Images are freely accessible and modifiable in accordance with IGAC policies.

Table 1 Sociodemographic characteristics of the sample population.

Detecting Plasmodium spp. by conventional microscopy and PCR

By analysing thick blood smears (TBS), 37% (n = 737/1,995) of the population were identified as positive for Plasmodium spp., 31.3% (n = 625/1,995) for P. vivax, 6.4% (n = 128/1,995) for P. falciparum and less than 1% (n = 16/1,995) had mixed-species infections (Additional file 2: Fig. S1a). Parasitaemia varied from 32 to 85,320 parasites/µL blood (mean: 10,100; SD: 11,603), being higher in P. vivax (mean: 10,585; SD: 11,920) than in P. falciparum (mean: 7,752; SD: 9,099).

Regarding parasite DNA detection, 88% (n = 1,750/1,995) of the target population were infected with Plasmodium spp., with P. vivax being the most prevalent species (71.0%; n = 1,412/1,995), followed by P. malariae (43.2%; n = 862/1,995) and P. falciparum (21.7%; n = 432/1,995). Mixed infection events (simultaneous infection by ≥2 species) were found in 42.3% (n = 844/1,995) of the target population (Additional file 2: Fig. S1b), with the P. vivax/P. malariae combination being the most frequently detected (n = 504/1,995) (Fig. 2).

Figure 2

Cumulative frequency of Plasmodium species and their contribution to malaria in 1,750 people in whom parasitic DNA was identified using molecular techniques. P.v = Plasmodium vivax, P.m = Plasmodium malariae and P.f = Plasmodium falciparum.

It was found that 75% of the cases were infected with P. vivax and P. malariae (Fig. 2). When parasite infection was evaluated with respect to age, parasite frequency ranged from 82% to 100% (Additional file 3: Fig. S2a); P. vivax was the most prevalent species in all age groups, showing a greater frequency in the 31–60-year-old age group (p = 0.002; Chi2 test) (Table 1; Additional file 3: Fig. S2b).

Evaluating the sampling areas and the types of settlement

Additional analysis evaluated the parasite infection status (single and mixed), the mean Plasmodium spp. parasitaemia and the distribution with respect to the area sampled. Sampling area 1 had the highest single infection frequency (48.3%), although this was not statistically significant (p = 0.561; Chi2 test); mixed infections appeared most frequently in area 2 (p = 0.001; Chi2 test) (Fig. 3a). Mean parasitaemia levels were lower in cases of single infection (9,854 parasites/µL) than in mixed infections (10,394 parasites/µL), but this difference was not statistically significant (p = 0.533; t-test) (Additional file 4: Fig. S3a). However, parasitaemia varied significantly depending on the area being sampled (p = 0.026; ANOVA). Pairwise comparison with Bonferroni correction showed significant differences between areas 3 and 4 (p = 0.022) (Additional file 4: Fig. S3b).

Figure 3

Relative frequency of parasite infection and Plasmodium spp. distribution with respect to the area sampled [area 1 (n = 344), area 2 (n = 257), area 3 (n = 566) and area 4 (n = 828)]. Part (a) shows the distribution of parasite infection frequency with respect to the Plasmodium spp. infection status. Blue represents the uninfected target population. Green represents the proportion of the target population infected by a single species. Dark red represents the proportion of the target population with a mixed species infection. Part (b) shows the relative frequency of Plasmodium spp.

Analysis of Plasmodium spp. distribution with respect to area showed that P. vivax had the greatest frequency (greater than 65%) in almost all localities, the exceptions being Punta Brava and Yaguas (Additional file 5: Table S3). P. malariae (40.7% to 47.7% relative frequencies) was absent in seven of the localities evaluated (Additional file 5: Table S3). P. falciparum prevalence was significantly lower in area 1 than in all other areas (p = 0.001; Fisher’s exact test) (Fig. 3b).

Parasite infection was evaluated with respect to the type of settlement; Leticia and Puerto Nariño are urban settlements; the remaining localities are rural. There were similar infection percentages for all types of settlement; however, the parasite density index (PDI) was higher for rural areas (index: 57.8). P. falciparum infection was mostly restricted to rural settlements (Additional file 6: Table S3).

Plasmodium spp. infection profiles

A clinical profile was created for each participant based on the symptoms reported in a survey conducted during sampling. Vomiting (p = 0.018; Fisher’s exact test) and diarrhoea (p = 0.005; Fisher’s exact test) occurred most frequently in the study population with single Plasmodium spp. infections, whereas severe headache was most frequently reported in the population with mixed-species infections (p = 0.001; Fisher’s exact test) (Additional file 7: Fig. S4). The distribution of symptoms was similar for all species of infecting Plasmodium, with fever being the most frequently reported symptom amongst the three species (89% to 91%) and a rash being the least frequently reported symptom in the sample population (2.1% to 3.2%) (Additional file 8: Fig. S5).

Logistic regression was used to identify the association between the variables evaluated (age, area, parasitaemia, access to basic services (public water and electricity supply, sewerage service), nearby water stagnations, use of mosquito nets and use of insecticides) and the presence of mixed-species infection. Patients having 2,000 to 4,999 parasites/µL blood parasitaemia [adjusted odds ratio (aOR) 0.61: 0.38–0.98, 95% confidence interval (CI)] or 5,000 to 9,999 parasites/µL blood parasitaemia (aOR 0.48: 0.29–0.77, 95% CI) had a lower probability of acquiring a mixed infection. No significant associations were observed for the other variables included in the model (Table 2).

Table 2 Risk factors associated with mixed infections.

Analysing the strength of the association between sociodemographic, clinical and laboratory variables (as previously mentioned), and the combination of parasite species revealed positive associations between area 1 and mixed P. vivax and P. malariae infections (aOR 2.13: 1.33–3.42, 95% CI), access to a public water supply and mixed P. malariae and P. falciparum infections (aOR 6.90: 4.98–8.28, 95% CI) and triple infections (simultaneous infection by the three species being evaluated) (aOR 3.05: 1.20–7.74, 95% CI). Variables showing less significant associations were parasitaemia (5,000 to 9,999 parasites/µL blood) in P. malariae and P. falciparum infections (aOR 0.18: 0.35–0.93, 95% CI) and triple infection events with parasitaemia higher than 2,000 parasites/µL blood and area 1 (Additional file 9: Table S4).

Multiple correspondence analysis (MCA) was used for identifying Plasmodium spp. infection profiles by compiling clinical and sociodemographic variables (Tables 3 and 4). Three main axes emerged after analysing the change in inertia in the histogram showing the eigenvalues of the active variables (Table 5 and Fig. 4). Three profiles were constructed around these axes (epidemiological and clinical variables related to P. falciparum infection, those related to triple infection (P. falciparum, P. vivax and P. malariae) and those related to double infection by P. vivax and P. malariae) (Table 5 and Fig. 4).

Table 3 Contribution, cosine squared and active variable test values.
Table 4 Test values for illustrative variables.
Table 5 Profile structure.
Figure 4

Multiple correspondence analysis (MCA). Part (a) represents the mode on axes 1 and 2. Part (b) represents the mode on axes 1 and 3. Part (c) represents the mode on axes 2 and 3. The variables contributing towards each profile are highlighted; green indicates the positive pole and red indicates the negative pole variables.

The first profile consisted of variables related to the area in which the patients reside, their sanitary conditions (i.e. nearby stagnant water, mosquito nets and insecticide use) and certain symptoms (i.e. headache, shivering and vomiting). Residing in Puerto Nariño, area 4, having no stagnant water nearby, having a history of malaria and displaying mild symptoms (i.e. mild headache without shivering) correlated with P. falciparum infection (Table 5 and Fig. 4a).

The second profile related to triple P. falciparum, P. vivax and P. malariae infection, and the variables covered age, living conditions, medical history and symptoms (i.e. 19–60 years of age, having no stagnant water nearby, having a history of malaria, symptoms including abdominal pain, normal-coloured urine, slight to moderate headache, diarrhoea and no fever) (Table 5 and Fig. 4b).

The third profile (double P. vivax and P. malariae infection) highlighted more severe symptomatology, together with parasitaemia. The variables included living in Puerto Nariño, area 4, fever, shivering, vomiting, diarrhoea, moderate headache, normal-coloured urine and parasitaemia of either >9,999 or 2,000 to 4,999 parasites/µL blood (Table 5 and Fig. 4c).

Discussion

The climatic, environmental and geographic characteristics in South America provide favourable conditions for the circulation of Plasmodium spp. and vector-borne diseases such as malaria, which thereby poses a significant threat to public health in countries such as Colombia22. The population living in the Colombian Amazon region is particularly vulnerable, showing high malarial morbidity and mortality7,9.

In this report, most of the study population resided in rural areas lacking access to water, sewerage and/or gas (i.e. public services). Their type of housing is palafitic (i.e. stilt houses over water/alongside a river supported by pillars or simple stakes, or houses built on bodies of calm water such as lakes, lagoons and slow running large rivers), often but not always having palm-leaf roofs and wooden walls, thereby exposing their inhabitants to the environment and the vector’s ecosystem, thus increasing their probability of acquiring parasitic infections23. Such living conditions result in the high prevalence of malaria in this population, and in other populations in Colombia and worldwide8,24,25.

The active search for parasite infections involved the simultaneous use of molecular and conventional microscopy techniques. This approach enabled the diagnosis of P. malariae and mixed-species malaria infections (Additional file 2: Fig. S1). TBS alone may not be sufficient as a diagnostic tool for malaria, as it leads to under-reporting (mainly of mixed-species malaria) and is limited in its ability to ensure timely treatment. Its use must thus be complemented by techniques providing greater sensitivity (i.e. molecular techniques)10,13,26,27. Prompt and accurate diagnosis of malarial infection in symptomatic populations and the identification of asymptomatic and sub-microscopic infections contributing to transmission can thus constitute part of the effective control and management of disease, with a view to eliminating malaria28,29.

Using molecular techniques enabled the identification of a large number of parasite infections and a high PDI (Additional file 6: Table S3) for Colombia; municipalities in Colombia’s Pacific region and the Antioquia region have reported similar results in terms of infection and PDI30. Molecular diagnostic tools have enabled the successful and highly sensitive detection of parasite species involved in mixed infections. In this study, more than 40% of the target population had mixed-species infections (Fig. 3 and Additional file 2: Fig. S1b), which was consistent with previous reports in India20, Thailand31, Papua New Guinea32 and Brazil33.

As previously reported for Colombia, P. vivax was associated with the highest frequency of malaria in all localities evaluated (Fig. 2)3,34; conversely, in the Peruvian Amazonian region the prevalence of this species varies in accordance with the area being evaluated35.

P. malariae was the second most highly ranked species in terms of disease frequency and contribution to infections (Fig. 2). This parasite species is known to be widespread across sub-Saharan Africa and south-eastern Asia36; however, molecular detection methods identified a higher proportion of P. malariae compared with microscopy in our study and in previous studies in Colombia and worldwide3,10,31,33.

P. falciparum showed lower prevalence and contribution to cases of malaria in the target population. A differential infection frequency was detected for this species with respect to the type of settlement, with the number of cases of infection with this parasite being greater in rural populations (Fig. 4b and Additional file 6: Table S3).

Differential parasitaemia levels were detected amongst the different areas being sampled. Individuals living in endemic areas who have been exposed to the parasite from an early age display a certain degree of immunity, as exemplified by low parasitaemia levels when exposed to new infections37,38. This may partially explain why the population inhabiting area 4 had the lowest levels of parasitaemia (Additional file 4: Fig. S3b), consistent with the fact that more than 50% of this area’s inhabitants had suffered previous episodes of malaria. However, further studies regarding the association between previous episodes of malaria and parasitaemia levels are needed.

Evaluating the factors associated with mixed-species infection revealed that high parasitaemia levels were less frequently associated with simultaneous P. falciparum and P. malariae infection (Table 2 and Additional file 9: Table S4). Cross protection has been reported for these two parasite species39, as they share common antigens37,40; host immunity may therefore limit parasitaemia in this type of mixed infection.

Parasite infection may be favoured by certain host characteristics that increase the interaction of parasites with target cells, thereby leading to greater infection success41,42; for example, the probability of being bitten and the transmission frequency are greater in endemic populations32,43,44. Some areas within the Colombian Amazon region were found to be associated with higher levels of parasite infection; mixed infections (P. vivax and P. malariae) were associated with localities in areas 1 and 4 and Puerto Nariño (Additional file 9: Table S4 and Table 5), whereas P. falciparum infection was concentrated in rural populations, mainly in localities in area 4 (Table 5 and Additional file 6: Table S3).

Spatial factors influence the parasite-host-vector interaction and contribute towards the appearance of high transmission foci or hotspots within a geographical area45,46. In these foci, high levels of parasite circulation are observed, thereby facilitating dispersion to other localities and contributing to the spread of infections47,48.

The mean parasitaemia values were similar for different types of infection (single or mixed) (Additional file 4: Fig. S3a), suggesting that the presence of more than one species of the same genus did not have an additive effect on the number of circulating parasites. Previous studies have proposed a density-dependent regulation mechanism interacting with other factors such as a species-genotype specific immune response, resulting in stabilisation of the Plasmodium population and episodes that are not dependent on infection by particular species49, which may help to explain our findings.

The coexistence of more than one parasite species in the same individual may be mediated by host and pathogen factors, such as the host immune response initially directed against the species or genotype at the highest density, thereby favouring the persistence of infection at lower density in a particular host39. The species/genotype coexistence model is controlled by parasite density-dependent regulation mechanisms; this model suggests that parasitaemia of the first infecting species (which has the highest prevalence amongst the target population) is downregulated on coinfection with the second species (which has the lowest prevalence). However, when the most prevalent species exceeds a threshold, the host’s immune response is triggered to limit the infection; this mechanism is turned off once the parasite density is under control, thereby favouring population growth of the second species in mixed infections and persistence of the parasites in the host39,44,49.

Such mechanisms are largely modulated by the host. Our study evaluated whether specific clinical profiles amongst the target population were linked to infection with particular Plasmodium species. Fever was the symptom detected at the greatest frequency with all parasite species (Additional file 8: Fig. S5), as well as headaches for mixed infections (Additional file 7: Fig. S4).

MCA revealed dependent relationships between active and illustrative variables (Tables 3 and 4) and three profiles were compiled from the results (Table 5 and Fig. 4). The first profile suggested that symptoms such as headache and diarrhoea, along with previous episodes of malaria, occurred in the target population regardless of the species or infection status (single or mixed). It has been reported that infection-derived immunity in regions with constant parasite circulation (endemic regions), such as the Colombian Amazon region, induces a clinical course with non-specific symptomatology25.

The second profile related to triple infection and a population aged from 19 to 60 years (Fig. 4b and Table 5). High mixed infection frequencies were observed in this age group (Additional file 3: Fig. S2a), i.e. the economically-active population who are potentially those most exposed to mosquito bites and therefore to parasite transmission. The target region’s economic activity is related to artisan-produced handicrafts exploiting wood, fishing, mining and small-scale cultivation in community gardens, all situations that favour the transmission of disease and limit the effectiveness of parasite control measures22,34.

The third profile related to severe symptoms (i.e. fever) and mixed P. vivax and P. malariae infections (Fig. 4c and Table 5). This profile supported the aforementioned parasite density-dependent population regulation model39,49,50. This model illustrates that a parasite species present at higher density would influence the growth of other parasite species activating typical clinical symptoms in the host and maintaining stability of the population dynamics of parasite species51.

In-depth analysis is required for defining infection hotspots. Time series analysis should be used for parasite detection to establish whether infection events are due to transient infection or transmission foci, and risk maps and the population distribution (for host and vector) should be analysed to determine the localities of disease cases16,46,48. Identifying whether a specific area has high disease transmission enables appropriate management strategies to be designed to effectively limit the parasite’s transmission cycle47,48.

Control measures implemented in Colombia have focused on reducing the disease burden by the large-scale provision of insecticide-treated mosquito nets, periodic intra-household spraying and the presence of government agencies responsible for control, diagnosis and treatment12,30,52. Although endemic countries have introduced disease mitigation measures, they have not had the desired impact as the number of malaria cases has increased, particularly in rural areas7.

The present study actively searched for symptomatic patients in geographically isolated localities lacking nearby healthcare posts. The average family income in these areas is less than $250 per month and a trip to a health centre represents a considerable family expense (around $50 per trip); many parasitic infections are therefore not seen or treated by healthcare control programmes22.

Greater malaria control efforts are required for progression towards the elimination of this disease; thus, understanding the distribution patterns of particular parasite species and the factors that influence malaria transmission in the Colombian Amazon region is crucial. The results of this study provide additional insight into malarial infections in the Colombian Amazon region, helping define the areas to be prioritised in terms of malaria prevention and control measures, with the aim of decreasing malarial incidence and approaching the long-term goal of eradication.

Methods

Study area and population

This cross-sectional study was carried out from July 2015 to April 2016; it included the population of the Colombian Amazon trapezium, comprising inhabitants of the towns of Leticia and Puerto Nariño and of rural settlements located along the banks of the Amazon and Loretuyacu rivers. The Colombian Amazon region represents 42% of Colombia’s territory and is formed by the Caquetá, Putumayo, Vaupés, Guainía, Guaviare and Amazon departments, the latter comprising the greatest geographical area12,53,54. The Amazon department has 77,088 inhabitants (population density: 1.5 inhabitants per km2)12. The town of Leticia and its surrounding communities had a projected population of 41,639 inhabitants according to the Departamento Administrativo Nacional de Estadística (DANE – Colombian Official Statistics Department) 2016 figures; Puerto Nariño and its neighbouring communities accounted for 8,279 inhabitants12.

Fifty-seven localities were sampled and grouped into four areas, taking into account their location and mobilisation towards basins converging on major tributaries (the Amazon and Loretuyacu rivers) (Fig. 1 and Additional file 1: Table S1). Area 1 included 32 localities (including the town of Leticia, the capital of the Amazon department and the remaining rural settlements), area 2 covered 10 localities (one settlement being mainly urban and the rest rural), area 3 covered seven localities (all rural) and area 4 covered eight localities (rural settlements all along the banks of the Loretuyacu river).

Ethical considerations and sample-taking

Inclusion criteria consisted of symptoms related to malarial infection being recognised when samples were taken, such as headache, fever during the previous 8 days and sweating. People without malaria symptoms were not included in the study (exclusion criterion). The aim of the study was explained to patients; those who accepted an invitation to participate signed an informed consent form. A survey was then conducted that compiled information regarding participants’ socio-demographic characteristics and risk factors for malaria infection. This study was approved and supervised by the Universidad del Rosario’s (Colombia) School of Medicine and Health Sciences (EMCS) Research Ethics Committee (Comité de Ética en Investigación - CEI) (CEI-ABN026-000161). Patients under 18 years of age who accepted the invitation to participate signed an informed consent form, which was accompanied by their guardians’ written approval. All methods and experiments were performed in accordance with the approved guidelines.

Two blood samples were collected simultaneously by capillary puncture. The first (TBS) was subjected to parasitological diagnosis by optical microscopy following Giemsa staining; the samples were processed and read on site at the time of sample collection. The second sample was stored on Flinders Technology Associates (FTA) cards and transported to the molecular biology laboratory of the FIDIC for molecular identification of the infecting parasite.

Molecular diagnosis of Plasmodium spp.

A PureLink Genomic DNA Mini Kit (Invitrogen) was used for extracting the DNA from the FTA cards, following the manufacturer’s instructions. The extracted DNA was then amplified by PCR to confirm the presence of the human β-globin constitutive gene segment3,10.

The infecting Plasmodium species (P. vivax, P. falciparum and/or P. malariae) were identified in the β-globin-positive samples by nested PCR. Specific primers against the 18S rRNA fragment were used in the first round of PCR for genus detection and a second amplification (using the first PCR product as template) was performed to distinguish the P. falciparum, P. vivax and P. malariae species. The amplification conditions for these PCRs have been described previously by our group3,10.
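Once the species-specific amplifications have been read, each sample can be labelled as uninfected, single or mixed (two or more species detected). The following is a minimal Python sketch of that labelling step only; the function names and the five example samples are hypothetical and are not taken from the study data or from the authors' workflow.

```python
from collections import Counter

# The three species targeted by the second-round PCR.
SPECIES = ("P. vivax", "P. falciparum", "P. malariae")


def infection_status(detected):
    """Label one sample from the set of species detected by PCR."""
    found = [s for s in SPECIES if s in detected]
    if not found:
        return "uninfected"
    if len(found) == 1:
        return "single"
    return "mixed"  # two or more species detected simultaneously


def summarise(samples):
    """Return {status: (count, percentage)} for a list of samples."""
    counts = Counter(infection_status(d) for d in samples)
    total = len(samples)
    return {k: (n, round(100 * n / total, 1)) for k, n in counts.items()}


# Hypothetical PCR read-outs for five samples (illustration only).
samples = [
    {"P. vivax"},
    {"P. vivax", "P. malariae"},
    set(),
    {"P. falciparum"},
    {"P. vivax", "P. falciparum", "P. malariae"},
]
summary = summarise(samples)
```

Triple infections fall under "mixed" here; a separate label could be added if the three-species combination needs to be reported on its own.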

Statistical analysis

Descriptive statistics were used to summarise the sociodemographic variables, such as the sample-taking area, access to basic services (public water and electricity supply, sewerage service) and risk factors (nearby stagnant water, mosquito nets and insecticide use); these were presented as percentages with their respective 95% confidence intervals (95% CI). Age and parasitaemia (determined by TBS as the number of parasites counted × 8,000 white cells/μL ÷ the number of white cells counted) were reported, along with their respective means and standard deviations (SD)55. The parasite density index (PDI) was taken as the number of confirmed cases of malaria/population at risk56. Mixed infections were defined as the simultaneous detection of two or more Plasmodium spp. Fisher’s exact or Chi2 tests were used for establishing statistically significant differences amongst the data. ANOVA was used for comparing means and the Bonferroni correction was used for adjusting for multiple comparisons. A t-test was used for comparing the mean values for parasitaemia with the parasite infection status (single and/or mixed infection).
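The two derived quantities defined above can be illustrated with a short Python sketch. It assumes the conventional thick-smear estimate (parasites counted, scaled by an assumed 8,000 white cells/μL and divided by the white cells counted). The function names, the `per` scaling factor for the index and the example counts are assumptions made for illustration; the text defines the PDI only as a ratio and does not state a multiplier.

```python
def parasites_per_ul(parasites_counted, wbc_counted, assumed_wbc_per_ul=8000):
    """Estimate parasitaemia (parasites/uL) from a thick blood smear count.

    assumed_wbc_per_ul is the fixed white-cell density used for scaling.
    """
    if wbc_counted <= 0:
        raise ValueError("wbc_counted must be positive")
    return parasites_counted * assumed_wbc_per_ul / wbc_counted


def parasite_index(confirmed_cases, population_at_risk, per=1000):
    """Confirmed cases divided by population at risk.

    The `per` multiplier (cases per 1,000 inhabitants) is an assumption.
    """
    if population_at_risk <= 0:
        raise ValueError("population_at_risk must be positive")
    return confirmed_cases * per / population_at_risk


# Hypothetical counts: 253 parasites seen while counting 200 white cells.
density = parasites_per_ul(253, 200)
# Hypothetical locality: 55 confirmed cases among 1,000 people at risk.
index = parasite_index(55, 1000)
```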

Logistic regression analysis was used for modelling the risk of a mixed infection, taking mixed infections as a dependent variable. Independent variables included in the model were age, residing in an urban or rural area, parasitaemia reported by TBS and housing conditions such as sewerage, gas and electricity supply, nearby stagnant water and mosquito net and insecticide use. STATA 12 software was used for analysing the data.
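To make the odds ratios and confidence intervals easier to read, the sketch below computes an unadjusted odds ratio with a Wald 95% CI from a 2 × 2 table in Python. It is not the adjusted model fitted in STATA, which estimates all covariates jointly; the function name and the counts are hypothetical.

```python
import math


def odds_ratio_ci(a, b, c, d, z=1.96):
    """Crude odds ratio and Wald confidence interval from a 2x2 table.

    a: exposed, mixed infection      b: exposed, no mixed infection
    c: unexposed, mixed infection    d: unexposed, no mixed infection
    """
    if min(a, b, c, d) <= 0:
        raise ValueError("all four cells must be positive")
    odds_ratio = (a * d) / (b * c)
    # Standard error of log(OR) under the usual Wald approximation.
    se_log_or = math.sqrt(1 / a + 1 / b + 1 / c + 1 / d)
    lower = math.exp(math.log(odds_ratio) - z * se_log_or)
    upper = math.exp(math.log(odds_ratio) + z * se_log_or)
    return odds_ratio, lower, upper


# Hypothetical counts (illustration only, not study data).
or_, lo, hi = odds_ratio_ci(30, 70, 50, 50)
```

An interval that lies entirely below 1, as in this hypothetical table, corresponds to lower odds of the outcome in the exposed group; an interval that includes 1 is compatible with no association.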

Multiple correspondence analysis (MCA) was used for establishing patient profiles, taking into account the nature of the clinical, epidemiological and laboratory variables (fundamentally categorical) estimated in this study. MCA was used for evaluating the degree to which each clinical and epidemiological variable participated in the compiling of profiles or groups of clinical significance in terms of similarity with or proximity to the different categories of variables, thus facilitating the incorporation of laboratory variables (infection presence/absence, parasitaemia) into these profiles for observing patterns57,58,59. In this way, groups were identified that had clinical significance from different groupings of categories of variables (i.e. this method was used to identify how sociodemographic characteristics and risk factors were grouped with single or multiple infections).

Two groups of variables were chosen for this analysis: active variables used for constructing factorial axes and supplementary or illustrative variables, which enriched factorial axes interpretation once they had been constructed58.

Sociodemographic, epidemiological and clinical variables were considered active variables, i.e. age, gender, origin, area, mosquito net and insecticide use, nearby stagnant water, fever, headache, vomiting, shivering, diarrhoea, urine colour, abdominal pain, outbreaks on the skin and previous episodes of malaria. The contribution values for each category were analysed for interpreting the axes compiled by the active variables, and the categories with a contribution value of more than 2.5 [mean contribution of 40 categories (100/40 = 2.5%)] were selected57.

Illustrative variables were the presence/absence of P. vivax, P. falciparum and P. malariae infection and parasitaemia. Cosine values were evaluated for estimating the quality of each active variable’s representation on each axis. The test values were used to determine whether the representation of each category on each axis significantly differed from 0 (≤−2 or ≥ 2 cut-off points), thus giving an evaluation of each category’s significance57.
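The two selection rules described above (contribution above the mean of 100/40 = 2.5% and test values of ≤−2 or ≥2) can be sketched in Python as follows. The function names and the example category values are hypothetical; the MCA itself was run in SPAD-5 and is not reproduced here.

```python
def mean_contribution_threshold(n_categories):
    """Mean contribution (%) if every category contributed equally."""
    return 100 / n_categories


def contributing_categories(contributions, n_categories):
    """Active categories whose contribution exceeds the mean contribution."""
    threshold = mean_contribution_threshold(n_categories)
    return sorted(c for c, v in contributions.items() if v > threshold)


def pole(test_value, cutoff=2.0):
    """Pole of an axis for a category, or None if not significant."""
    if test_value >= cutoff:
        return "positive"
    if test_value <= -cutoff:
        return "negative"
    return None


# Hypothetical contributions (%) on one axis for three of 40 categories.
contributions = {"fever: yes": 6.1, "shivering: no": 1.9, "area 4": 3.4}
selected = contributing_categories(contributions, n_categories=40)

# Hypothetical test values for two illustrative categories.
poles = {"P. falciparum: present": pole(-3.2), "P. vivax: present": pole(1.5)}
```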

The structure and formation of each profile were analysed using a bi-dimensional graphical representation. The active variables (epidemiological and clinical variables and risk factors) were represented on each axis by filled boxes and the nominal illustrative variables (infection by each of the three species and parasitaemia) were represented by empty rhombuses. The sign of the test values indicated each modality’s position on the positive or negative pole of each axis. Box size was proportional to each modality’s contribution on the most representative axis. Possible dependence and similarity relationships were identified, taking into account the distance between the categories represented on the graph. SPAD-5 software was used for MCA.