A machine learning COVID-19 mass screening based on symptoms and a simple olfactory test

Azeli, Youcef; Fernández, Alberto; Capriles, Federico; Rojewski, Wojciech; Lopez-Madrid, Vanesa; Sabaté-Lissner, David; Serrano, Rosa Maria; Rey-Reñones, Cristina; Civit, Marta; Casellas, Josefina; El Ouahabi-El Ouahabi, Abdelghani; Foglia-Fernández, Maria; Sarrá, Salvador; Llobet, Eduard

doi:10.1038/s41598-022-19817-x

Download PDF

Article
Open access
Published: 16 September 2022

A machine learning COVID-19 mass screening based on symptoms and a simple olfactory test

Youcef Azeli^1,2,3,
Alberto Fernández⁴,
Federico Capriles¹,
Wojciech Rojewski¹,
Vanesa Lopez-Madrid¹,
David Sabaté-Lissner⁵,
Rosa Maria Serrano^1,8,
Cristina Rey-Reñones^6,7,12,
Marta Civit⁵,
Josefina Casellas¹,
Abdelghani El Ouahabi-El Ouahabi¹,
Maria Foglia-Fernández⁹,
Salvador Sarrá¹⁰ &
…
Eduard Llobet¹¹

Scientific Reports volume 12, Article number: 15622 (2022) Cite this article

1070 Accesses
2 Citations
67 Altmetric
Metrics details

Subjects

Abstract

The early detection of symptoms and rapid testing are the basis of an efficient screening strategy to control COVID-19 transmission. The olfactory dysfunction is one of the most prevalent symptom and in many cases is the first symptom. This study aims to develop a machine learning COVID-19 predictive tool based on symptoms and a simple olfactory test, which consists of identifying the smell of an aromatized hydroalcoholic gel. A multi-centre population-based prospective study was carried out in the city of Reus (Catalonia, Spain). The study included consecutive patients undergoing a reverse transcriptase polymerase chain reaction test for presenting symptoms suggestive of COVID-19 or for being close contacts of a confirmed COVID-19 case. A total of 519 patients were included, 386 (74.4%) had at least one symptom and 133 (25.6%) were asymptomatic. A classification tree model including sex, age, relevant symptoms and the olfactory test results obtained a sensitivity of 0.97 (95% CI 0.91–0.99), a specificity of 0.39 (95% CI 0.34–0.44) and an AUC of 0.87 (95% CI 0.83–0.92). This shows that this machine learning predictive model is a promising mass screening for COVID-19.

Long COVID: major findings, mechanisms and recommendations

Article 13 January 2023

Risk of COVID-19 death in adults who received booster COVID-19 vaccinations in England

Article Open access 16 January 2024

Cerebrospinal fluid reference proteins increase accuracy and interpretability of biomarkers for brain diseases

Article Open access 01 May 2024

Introduction

Since the first cases of severe acute respiratory syndrome coronavirus 2 (SARS-COV-2) were diagnosed in December 2019 in the Chinese city of Wuhan, the coronavirus disease 2019 (COVID-19) has spread rapidly¹. The strategies applied in the vast majority of countries to control the virus transmission have been ineffective. The first results concerning the safety and effectiveness of different types of vaccines have raised optimism in the scientific community due to the possibility of controlling COVID-19². But recent real-world data indicate that the effectiveness of the BNT162b2 and ChAdOx1 vaccines against infections with symptoms or high viral burden is reduced with new variants such as delta. In addition, infections in vaccinated patients have similar viral loads compared to unvaccinated patients³. This justifies the high number of new cases and a mortality rate difficult to eradicate in some countries with high vaccination rates⁴. On the other hand, a large part of the world population remains susceptible to SARS-COV2 infection, due to lack of access to vaccines, making herd immunity likely unachievable.

The early detection of symptoms suggestive of infection, rapid and efficient testing, contact tracing and isolation are the basis of an effective screening strategy to control transmission of COVID-19 and decrease the disease burden on healthcare systems. The Achilles' heel in the fight against this disease is the large number of patients who are asymptomatic or have only a few symptoms that are difficult to differentiate from a common cold, but who are nonetheless able to transmit the disease^5,6. It was estimated that 51.9% of SARS-COV-2 infected cases were asymptomatic or had only 1 or 2 symptoms suggestive of COVID-19⁷. The reference diagnostic tool for COVID-19 is reverse transcriptase polymerase chain reaction (RT-PCR). Its accessibility may be limited for low-resource healthcare systems and its cost and time requirements preclude its use as a mass triage tool. Recently a screening tool based on a machine learning model including clinical features and symptoms has been constructed to prioritize testing for COVID-19⁸. It was found that a predictive model for COVID-19 that included the combination of symptoms and wearable sensor data performed better than a model based on symptoms alone⁹.

Olfactory dysfunction (OD) has recently been described as one of the most prevalent symptoms reaching 50% to 75% of COVID-19 patients and could be used as a means of screening to help identify people who should self-isolate^10,11,12. With the delta variant OD has been described in the 39% of the cases being as well one of the most prevalent symptom¹³.

A symptom predictive model for COVID-19 based on a smartphone app including age, sex, loss of smell and taste, persistent cough, severe fatigue and skipped meals obtained a sensitivity of 65%¹⁴. At the time of diagnosis, a recent prospective study found that 31% of patients affected by COVID-19 presented OD¹⁵. Between 11.8 and 23% of cases presented OD before any other symptoms^10,16. The validated olfactory tests are subjective and difficult to implement. A recent study showed that a simplified test based on the identification and the assessment of intensity of three different scents was able to detect unperceived OD in COVID-19 patients¹⁵.

Hydroalcoholic gels are widely distributed as they are one of the main strategies for decreasing virus transmission¹⁷. Fragrance essential oils such as lavender, eucalyptus and lemon make them more pleasant and can enhance their anti-viral effect^18,19. These features make an aromatized hydroalcoholic gel a good candidate for being used as part of a simple, fast and cost-effective large-scale olfactory screening test.

The aim of this study was to develop and validate, using cross-validation techniques, a machine learning diagnostic predictive model for COVID-19 mass screening using symptoms and a simple olfactory test based on an aromatized hydroalcoholic gel, which could be especially useful when testing resources are limited.

Results

Characteristics of the study population

During the study period 3788 patients underwent RT-PCR to diagnose COVID-19 at one of the study health centres. The inclusion of cases and RT-PCRs performed per week at the centres while participating in the study can be consulted in the supporting information Fig. S1. A total of 626 patients were initially included in the study protocol. Of these, 107 patients were excluded because of incomplete data or exclusion criteria as shown in Fig. 1.

The final analysis of the study included 519 patients, out of whom 341 patients (65.7%) were from primary care and 179 (34.3%) were from the hospital Emergency Department. According to the criteria for carrying out a RT-PCR test, 386 (74.4%) had at least one symptom suggestive of COVID-19, 118 (22.7%) were asymptomatic and were close contacts of a COVID-19 case, and 15 (2.9%) were asymptomatic and were tested for unknown reasons. A positive RT-PCR was found in 117 patients (22.5%) and a negative RT-PCR was found in 402 patients (77.5%).

The mean (SD) age of the study population was 42.3 (16.3) years, the age range was between 18 and 98 years and 48% were male. None of the patients requiring hospital admission died. Table 1 shows the background and clinical characteristics of the study population.

Table 1 Background and clinical characteristics of the study population.

Full size table

COVID-19 symptoms and olfactory test results

The mean (SD) number of days of the symptom evolution was 5.8 (5.6) for the COVID-19 positive patients and 5.1 (12.1) for the COVID-19 negative patients with an absolute difference of 0.75 (95% CI − 1.35 to 2.84; P = 0.48). The symptoms most strongly associated with COVID-19 were OD and GD. Fever, dry cough, asthenia, myalgia, headache, diarrhoea, OD, and GD were the eight symptoms associated with COVID-19. Table 2 shows the reported symptoms and olfactory test results in the population study.

Table 2 Patient reported COVID-19 symptoms and olfactory test results.

Full size table

In the total population study, the olfactory test 1 was positive in 267 patients (51.4%) and negative in 252 patients (48.6%). Among patients with a positive olfactory test result 112 cases (41.9%) identified the gel smell as alcohol, 57 cases (21.3%) as cologne, 27 cases (10.1%) as aromatic herbs, 10 cases (3.7%) as non-citrus fruits (3.7%), 6 cases (2.2%) as alcoholic beverages (2.2%) and 22 cases (8.2%) as other responses. In 25 cases (9.4%) participants reported that they “didn’t smell anything at all” and in 8 cases a “don’t know” response (3%) was reported. Among patients with a negative olfactory test result, 207 cases (82.1%) identified the gel smell as lemon, 26 cases (10.3%) as citrus, 13 cases (5.1%) as orange, 2 cases (0.8%) as tangerine, 2 cases (0.8%) as citronella, and 2 cases (0.8%) as lime.

A positive olfactory test 1 was associated with COVID-19 (OR 1.86; 95% CI 1.22–2.85, P < 0.01). The response “do not smell anything at all” was strongly associated with COVID-19 (OR 4.06; 95% CI 1.8–9.17). Among the 13 asymptomatic COVID-19 positive patients, 10 (76.9%) had a positive olfactory test 1 result and only 3 patients presented a negative olfactory test 1. An olfactory test 1 positive result in asymptomatic patients was associated with COVID-19 (OR 3.94; 95% CI 1.03–15.03). The detailed results of the olfactory test and the diagnostic values of the relevant symptoms, the combination of symptoms and olfactory test for predicting COVID-19 were available in the S1 and S2 Tables.

Results of the machine learning predictive model

Table 3 shows the results of the different classification trees constructed with machine learning according to the variables introduced in the model.

Table 3 Results of machine learning model.

Full size table

By only introducing the relevant symptoms into the model, the sensitivity was 0.86 (95 CI 0.79–0.92), the specificity was 0.37 (95% CI 0.33–0.42) and the AUC was 0.86 (0.81–0.9) for the total population study, and 0.97 (95%CI 0.92–0.99), 0.11 (95% CI 0.07–0.15) and 0.89 (95% CI 0.81–0.9) respectively for the symptomatic population. The sensitivity and specificity obtained was 0.94 (95%CI 0.88–0.98) and 0.32 (95% CI 0.28–0.37) when the olfactory test was introduced into the model for the total study population. The constructed sensitive classification tree only took into account the result of the olfactory test 1 and ignored the result of the olfactory test 2. Considering other clinical variables, the model also included sex and age, reaching a sensitivity of 0.97 (0.91–0.99), a specificity of 0.39 (0.34–0.44) and an AUC of 0.87 (95% CI 0.83–0.92) for the total study population, and 0.98 (95%CI 0.93–1), 0.31 (95% CI 0.26–0.37) and 0.89 (95% CI 0.84–0.93) for the symptomatic population, respectively. The specific classification tree built took into account the relevant symptoms and age and obtained a sensitivity of 0.29 (0.21–0.38), a specificity of 0.95 (95% CI 0.92–0.97) and an AUC of 0.85 (0.8–0.89) for the total study population.

The resulting receiver operating characteristic (ROC) curve is shown in the Fig. 2 and the precision-recall curve of the sensitive and specific tree algorithm are shown in Figs. S2 and S3.

Discussion

The combination of symptoms and a simple olfactory test based on identifying the smell of a hydroalcoholic gel made it possible to develop a predictive model with high sensitivity, which has important clinical implications.

A predictive model based on symptoms reported on a smartphone-based app obtained a lower sensitivity of 0.65 (95% CI 0.62–0.67) and a lower AUC of 0.76 (95% CI 0.74–0.78) to predict COVID-19 than our predictive model¹⁴. Another predictive model using machine learning based on symptoms, gender, age and close contacts obtained a lower sensitivity which was between 0.85 and 0.87 depending on the possible working points and a similar AUC of 0.86 (95% CI 0.85–0.87)⁸. The different results of our model, depending on the variables included, show similar or even higher diagnostic values with respect to those models proposed as population screening. The model presented has the advantage that it includes asymptomatic patients and does not include close contacts in its variables as this could be difficult to determine in a situation of community transmission. To our knowledge, this is the first model including an olfactory test and built using a prospective population-based study. It is important to highlight that the symptoms combination have the higher weight in the model results, although the low false negative rate of the olfactory test among asymptomatic COVID-19 patients, helps improving the sensitivity of the model. The sample for this population-based study was obtained from patients following current indications for RT-PCR testing, and the sensitivity and specificity figures obtained make this model useful as a population-based screening.

In the same direction, new advances are being made in the development of new point-of-care, rapid, sensitive and inexpensive diagnostic methods to detect COVID-19 that can be useful to fight this pandemic and prepare for the next ones²⁰. An effective mass screening based on antigen test detection of SARS-CoV-2 has been described²¹. But a recent review of several antigen tests on the market states that two-thirds had overall sensitivities (30.8%-68.9%) below the World Health Organization recommended standard of ≥ 80% raising concerns whether the antigen detection alone is sufficient for COVID-19 mass screening²². Combining our predictive model with an antigen test could be a promising mass screening strategy.

Regarding the olfactory test, it has obtained a sensitivity almost twice as high as a more complex olfactory test for predicting COVID-19 based on identifying the smell of three scented paper strips and a 4-item scale intensity rate¹⁵. In addition, the simplicity of the olfactory test means it can be implemented as a self-test, making it a more suitable population screening olfactory test than any test reported so far. The wide distribution of this predictive tool due to its low cost also contributes to improving the disease situational awareness of the population. This may be especially useful in those scenarios where preventive measures are gradually being relaxed and there is still a need to protect older and more vulnerable people due to the rapid waning of vaccine protection over time against new variants such as omicron^23,24.

Our work has some limitations. Our study was conducted when the alpha variant was the most predominant variant in our area. Recent data show a high prevalence of classical symptoms such as cough, fever and olfactory and taste dysfunction among vaccinated and unvaccinated COVID-19 infected patients where the delta variant is predominant suggesting that our model may be useful in this setting³. The omicron variant has been associated with a reduced capacity to penetrate olfactory epithelial cells and produce anosmia²⁵. A prospective study based on a focused questionnaire for assessing olfactory function found that the prevalence of OD caused by the omicron variant was 24.6%²⁶. This drop in the prevalence of OD with this variant may affect the sensitivity of our model. Regarding the olfactory test, a high percentage of patients identified the smell of the gel as alcohol. The alcoholic matrix of the gel could hinder olfactory recognition, explaining the low specificity found. Moreover, the patient’s capacity to identify smell may decrease in an uncomfortable scenario such as an emergency department during a pandemic.

It is important to highlight that in our study no side effects related to the inhalation of the hydroalcoholic gel were reported. One study described that repeated exposure to a hydroalcoholic gel by inhalation does not increase blood ethanol levels²⁷. The side effects described in the literature are related to the occurrence of dermatitis or are due to the ingestion of the gel^28,29.

The value added by our COVID-19 predictive model in this field is its potential applications such as its inclusion in a mass testing strategy in order to save costs. Our predictive model could be useful to quickly rule out non-infected patients and for selecting the population that could benefit from a more expensive diagnostic test such as antigen testing or RT-PCR helping to reduce the costs for the health system or for companies with a rigorous occupational risk policy such as hospitals, nursing homes or large companies. It could also be especially useful for controlling transmission in those regions where testing resources are limited due to scarce economic resources or logistical difficulties.

This predictive model has been patented (EP 21 382 524.3) and is available upon request. The effectiveness of its implementation in different epidemiological settings should be tested by performing external validations; therefore, the collaboration of the scientific community is encouraged.

Conclusion

A machine learning predictive model for COVID-19 using symptoms and a simple olfactory test based on an aromatized hydroalcoholic gel showed high sensitivity for diagnosing COVID-19. The capacity of this predictive model to detect infected SARS-COV-2 patients among asymptomatic patients makes it a promising tool for the fight against COVID-19. This predictive model could be especially useful for mass screening when testing resources are limited.

Methods

Study design and setting

This is a population-based prospective cohort study conducted following the TRIPOD Statement for multivariable diagnosis prediction model³⁰. The study was carried out in the Emergency Department of Sant Joan University Hospital of Reus, which is the reference hospital of the region, and in all five primary care centres of Reus (Catalonia, Spain) public health network.

The municipality of Reus is located in the Mediterranean area, has a surface area of 52.82 km² and at the beginning of 2020 it had a population of 106,168 inhabitants and a density of 2010.0 inhabitants/km²³¹. This study was approved by the Ethics Committee of the Pere i Virgili Health Research Institute (Ref: 120/2020) and the IDIAP Jordi Gol Clinical Research Ethics Committee (Codi: 20/114-PCV). The study was conducted in accordance with de Declaration of Helsinki and Good Clinical Practices. All study participants were required to sign an informed consent form.

Participants

The study included consecutive patients undergoing RT-PCR for the first time to rule out COVID-19 infection who consulted the hospital emergency department or their primary care centre between 15 June and 11 September, 2020. Patients were tested for presenting symptoms suggestive of COVID-19 or for being close contacts of a confirmed COVID-19 case. Close contacts were considered those persons who had shared an area with a positive case at a distance of less than 2 m, for more than 15 min, without protection and from 48 h prior to the onset of symptoms.

The study did not include patients under 18 years of age, patients who did not sign the informed consent form, and patients with pathologies or conditions that may interfere with the olfactory function, such as any degree of cognitive impairment, Parkinson's disease, chronic rhinosinusopathy, head trauma, nasal obstruction, treatment with high concentrations of oxygen, acute respiratory failure, patients with an altered state of consciousness, or who use inhaled corticosteroids.

Olfactory test development

A multidisciplinary cooperation was established for creating a hydro-alcoholic hand sanitizing gel that meets current requirements in terms of its composition³².

Based on the literature and habits of our Mediterranean study population, it was determined that the most suitable odoriferous substance was lemon³³. Tests were carried out with different concentrations of lemon essential oil and lemon fragrances of synthetic origin. The composition of the gel was adapted to attenuate the smell of alcohol. A study was carried out to determine the most effective composition with and without thickener. Gas chromatography and mass spectrometry were used to obtain semi-quantitative results. A headspace sampling technique was used to establish the effectiveness of the volatile odoriferous substance that evaporated from the hydroalcoholic gel at 37 ºC. Finally, two hydroalcoholic gels with increasing concentrations of lemon essential oil were created as an olfactory test.

Description of the olfactory test

The olfactory test was performed by appropriately trained primary care and emergency nurses before the sample for SARS-COV-2 RT-PCR was collected. Therefore, both the patient and the healthcare personnel did not know the patient’s infection status. Firstly, the test consisted of applying 1 ml of 0.3% gel (olfactory test 1) using a dispenser onto the patient’s palm. Then the patient rubbed the gel on their hands and waited for 3 s. The patient was then asked to smell their hands and to “please, identify the smell of this gel”. The answer was recorded on the basic data collection sheet regardless of the result. If the answer was not lemon or if it was inconclusive, the same test was repeated after 30 s with the 0.5% gel (olfactory test 2). The olfactory test was considered negative if the patient recognized a citrus fruit, and the olfactory test was considered positive if the patient could not smell the gel or did not recognize a citrus fruit.

Data collection

A data collection sheet was completed by the attending nurse before taking the sample for the RT-PCR test. It included the results of the two olfactory tests when both were performed. It also included age, gender, duration of symptoms (in days), and a yes/no questionnaire to check for symptoms such as fever, dry cough, dyspnoea, anorexia, myalgia, headache, diarrhoea, asthenia, productive cough, sore throat, OD or gustatory dysfunction (GD), others or no symptoms. The RT-PCR test for detecting SARS-COV-2 was considered the gold standard for diagnosis. During our study, the RT-PCR was performed by trained personnel according to the technical considerations of the manufacturer using a double sampling of the pharynx and the nose. The conservation of the sample and the transfer to the laboratory followed the channels of the usual clinical practice of the centre. RT-PCR tests were carried out with the VIASURE SARS-COV-2 Real Time PCR Detection Kit (CerTest Biotec, Zaragoza, Spain), or with the Procleix1 method in a Panther automated extractor and amplifier (Grifols Laboratories, Barcelona, Spain). Once all the data collection sheets were completed, the medical digital records were consulted and the RT-PCR test results were recorded, as well as the patient’s background, evolution and discharge diagnosis. Regarding the severity of the disease, the patients attended and discharged immediately were considered as mild, those admitted to the hospital as moderate and those requiring ICU during hospitalization as severe.

This study was conducted at the beginning of the second wave of COVID-19 in our region³⁴. The 14-day cumulative incidence of COVID-19 cases in the city of Reus increased gradually from 0.9 cases/100,000 inhabitants on 15 June to 376.09 cases/100,000 inhabitants on 24 August³⁵.

Model development and internal validation

First, an analysis was conducted to explore the independent variables associated with COVID-19. The symptoms that proved to be statistically significant in a logistic regression predictive model, were fever, dry cough, myalgia, headache, diarrhoea, asthenia, altered sense of smell, and altered sense of taste. These 8 symptoms were defined as relevant and presented as well as their combinations the strongest associations with the predicted event. Diagnostic values were calculated for each symptom separately and their combinations for the total population and the symptomatic population. The productive cough variable was also included as a relevant symptom.

In order to facilitate the search for the best combination of variables to predict the diagnosis of COVID-19, we decided to build a model based on a decision tree constructed by machine learning that could also facilitate its clinical use following guidelines³⁶. Other modelling methodologies such as random forests or artificial neural networks were discarded because they need larger training datasets and also because their interpretability is not as straightforward as that of decision trees. Priority was given to the construction of a parsimonious model using as few variables as possible, robust by minimising missing data, transparent and simple. Moreover, minimising false negatives was also a priority in the predictive model construction to allow its use as a population screening.

The 8 relevant symptoms and the result of the olfactory test were variables significantly associated with COVID-19. Sex and age were as well sequentially introduced into the model as these variables were considered clinically relevant¹⁰. The final model had 11 independent variables therefore the study sample complied with the standard rule of ten clinical events per predictive variable³⁷.

The number of relevant symptoms was counted for each patient and this new variable was used to develop the model based on classification trees using a recursive partitioning algorithm³⁸. The growth of the trees was controlled to avoid overfitting the data. Trees were pruned to the size that minimized the cross-validated error. In addition, these classification trees were built using the following parameters: the splitting index was the Gini coefficient; the minimum number of patients in any node of a tree for a split to be attempted was set at 30; the minimum number of patients in any terminal node of a tree was set at 10; node splits were only attempted if they improved the fit by a factor of 0.01; and the number of cross-validations to be run was set at 10. The sizes of the trees obtained using this strategy range between six and seven leaves (terminal nodes), which proves that overfitting has been successfully avoided.

In order to obtain different values of sensitivity and specificity in the resulting classification trees, distinct costs of false positives and false negatives were used in the loss matrix parameter that drives the splitting function of the classification tree algorithm. In particular, the specific classification tree was grown using equal cost values for false positives and false negatives, while the sensitive classification tree was grown using a cost value for false negatives that was eight times the cost value for false positives.

The internal model validation was carried out using the R package cross validation techniques in machine learning.

Statistical analysis

The quantitative variables used in this study were described using the mean, the standard deviation (SD), the median and the first and third quartiles. The differences between means and their corresponding 95% confidence interval (CI) were also used to compare groups of patients. Categorical variables were described using the number of cases, percentages and 95% CI. Comparisons between groups of patients were performed using Student’s T test for quantitative variables, while the chi-squared test was used for categorical variables. Groups of patients were also compared in terms of the risk difference and odds ratio (OR) of the binary variables, and their corresponding 95% CI. All tests were two-tailed and P-values lower than 0.05 were considered statistically significant. Diagnostic values in terms of sensitivity, specificity, positive predictive value, negative predictive value, positive likelihood ratio and negative likelihood ratio, as well as their corresponding 95% CI, were calculated for the binary variables and smell tests. Several predictive models were analysed to handle missing data in the study protocol. A data-complete analysis was adopted over other strategies due to the low relevance of the missing data in the final results of the predictive machine learning model. All statistical analyses were performed using R software version 4.0.

Data availability

The datasets generated and/or analysed during the current study are not publicly available but are available from the corresponding author on reasonable request.

References

Zhu, N. et al. A novel coronavirus from patients with pneumonia in China, 2019. N. Engl. J. Med. 382, 727–733 (2020).
Article CAS Google Scholar
Rubin, E. J., Baden, L. R. & Morrissey, S. Audio interview: Covid-19 vaccine fundamentals. N. Engl. J. Med. 383, e146 (2020).
Article CAS Google Scholar
Pouwels, K. B. et al. Effect of delta variant on viral burden and vaccine effectiveness against new SARS-CoV-2 infections in the UK. Nat. Med. https://doi.org/10.1038/s41591-021-01548-7 (2021).
Article PubMed PubMed Central Google Scholar
UK Health Security Agency. GOV.UK Coronavirus (COVID-19) in the UK. https://coronavirus.data.gov.uk.
Rapid and frequent testing. Nat. Biomed. Eng. 4, 1121–1122 (2020).
Gandhi, M., Yokoe, D. S. & Havlir, D. V. Asymptomatic transmission, the achilles’ heel of current strategies to control covid-19. N. Engl. J. Med. 382, 2158–2160 (2020).
Article CAS Google Scholar
Pollán, M. et al. Prevalence of SARS-CoV-2 in Spain (ENE-COVID): A nationwide, population-based seroepidemiological study. Lancet 396, 535–544 (2020).
Article Google Scholar
Zoabi, Y., Deri-Rozov, S. & Shomron, N. Machine learning-based prediction of COVID-19 diagnosis based on symptoms. NPJ Digit. Med. 4, 3 (2021).
Article Google Scholar
Quer, G. et al. Wearable sensor data and self-reported symptoms for COVID-19 detection. Nat. Med. 27, 73–77 (2021).
Article CAS Google Scholar
Lechien, J. R. et al. Olfactory and gustatory dysfunctions as a clinical presentation of mild-to-moderate forms of the coronavirus disease (COVID-19): A multicenter European study. Eur. Arch. Otorhinolaryngol. 277, 2251–2261 (2020).
Article Google Scholar
Xydakis, M. S. et al. Post-viral effects of COVID-19 in the olfactory system and their implications. Lancet Neurol. 20, 753–761 (2021).
Article CAS Google Scholar
Spinato, G. et al. Alterations in smell or taste in mildly symptomatic outpatients with SARS-CoV-2 infection. JAMA 323, 2089 (2020).
Article CAS Google Scholar
Chau, N. V. V. et al. An observational study of breakthrough SARS-CoV-2 Delta variant infections among vaccinated healthcare workers in Vietnam. EClinicalMedicine 41, 101143 (2021).
Article Google Scholar
Menni, C. et al. Real-time tracking of self-reported symptoms to predict potential COVID-19. Nat. Med. 26, 1037–1040 (2020).
Article CAS Google Scholar
Villerabel, C. et al. Diagnostic value of patient-reported and clinically tested olfactory dysfunction in a population screened for COVID-19. JAMA Otolaryngol. Head Neck Surg. 147, 271–279 (2021).
Article Google Scholar
Kaye, R., Chang, C. W. D., Kazahaya, K., Brereton, J. & Denneny, J. C. COVID-19 anosmia reporting tool: Initial findings. Otolaryngol. Head Neck Surg. 163, 132–134 (2020).
Article Google Scholar
Golin, A. P., Choi, D. & Ghahary, A. Hand sanitizers: A review of ingredients, mechanisms of action, modes of delivery, and efficacy against coronaviruses. Am. J. Infect. Control 48, 1062–1067 (2020).
Article Google Scholar
Astani, A., Reichling, J. & Schnitzler, P. Comparative study on the antiviral activity of selected monoterpenes derived from essential oils. Phytother. Res. 24, 673–679 (2010).
Article CAS Google Scholar
da Silva, J. K. R., Figueiredo, P. L. B., Byler, K. G. & Setzer, W. N. Essential oils as antiviral agents, potential of essential oils to treat SARS-CoV-2 infection: An in-silico investigation. Int. J. Mol. Sci. 21, 3426 (2020).
Article Google Scholar
Liu, Y. et al. A fM–aM detection of the SARS-CoV-2 antigen by advanced lateral flow immunoassay based on gold nanospheres. ACS Appl. Nano Mater. 4, 13826–13837 (2021).
Article CAS Google Scholar
Alemany, A. et al. Analytical and clinical performance of the panbio COVID-19 antigen-detecting rapid diagnostic test. J. Infect. 82, 186–230 (2021).
Article CAS Google Scholar
Wan, Z., Zhao, Y., Lu, R., Dong, Y. & Zhang, C. Rapid antigen detection alone may not be sufficient for early diagnosis and/or mass screening of COVID-19. J. Med. Virol. 93, 6462–6464 (2021).
Article CAS Google Scholar
Andrews, N. et al. Covid-19 vaccine effectiveness against the omicron (B.1.1.529) variant. N. Engl. J. Med. 386, 1532–1546 (2022).
Article CAS Google Scholar
Tartof, S. Y. et al. Durability of BNT162b2 vaccine against hospital and emergency department admissions due to the omicron and delta variants in a large health system in the USA: A test-negative case–control study. Lancet Resp. Med. https://doi.org/10.1016/S2213-2600(22)00101-1 (2022).
Article Google Scholar
Butowt, R., Bilińska, K. & von Bartheld, C. Why does the omicron variant largely spare olfactory function? Implications for the pathogenesis of anosmia in coronavirus disease 2019. J. Infect. Dis. https://doi.org/10.1093/infdis/jiac113 (2022).
Article PubMed Google Scholar
Boscolo-Rizzo, P. et al. Coronavirus disease 2019 (COVID-19)-related smell and taste impairment with widespread diffusion of severe acute respiratory syndrome-coronaviris-2 (SARS-CoV-2) Omicron variant. Int. Forum Allergy Rhinol. https://doi.org/10.1002/alr.22995 (2022).
Article PubMed PubMed Central Google Scholar
Huynh-Delerme, C. et al. Short communication: Is ethanol-based hand sanitizer involved in acute pancreatitis after excessive disinfection? An evaluation with the use of PBPK model. J. Toxicol. 2012, 1–7 (2012).
Article Google Scholar
Emadi, A. & Coberly, L. Intoxication of a hospitalized patient with an isopropanol-based hand sanitizer. N. Engl. J. Med. 356, 530–531 (2007).
Article CAS Google Scholar
Quenan, S. & Piletta, P. Hand dermatitis in healthcare workers: 15-years experience with hand sanitizer solutions. Contact Dermatitis 84, 339–340 (2020).
Article Google Scholar
Collins, G. S., Reitsma, J. B., Altman, D. G. & Moons, K. Transparent reporting of a multivariable prediction model for individual prognosis or diagnosis (TRIPOD): The TRIPOD Statement. BMC Med. 13(1), 1 (2015).
Article Google Scholar
IDESCAT. Statistical Institute of Catalonia. https://www.idescat.cat/emex/?id=431233&lang=en#h1f.
European Chemicals Agency. Information on biocides. https://echa.europa.eu/es/information-on-chemicals/biocidal-active-substances.
Moein, S. T. et al. Smell dysfunction: A biomarker for COVID-19. Int. Forum Allergy Rhinol. 10, 944–950 (2020).
Article Google Scholar
Iftimie, S. et al. First and second waves of coronavirus disease-19: A comparative study in hospitalized patients in Reus, Spain. PLoS ONE 16, e024809 (2021).
Article Google Scholar
GENCAT. Salut/Dades COVID. https://dadescovid.cat/setmanal?tipus_territori=territori&id_html=up_4_23&tipus=municipi&codi=43123.
Luo, W. et al. Guidelines for developing and reporting machine learning predictive models in biomedical research: A multidisciplinary view. J. Med. Internet Res. 18, e323 (2016).
Article ADS Google Scholar
Peduzzi, P., Concato, J., Kemper, E., Holford, T. R. & Feinstein, A. R. A simulation study of the number of events per variable in logistic regression analysis. J. Clin. Epidemiol. 49, 1373–1379 (1996).
Article CAS Google Scholar
Mola, F. Advances in Data Science and Classification 311–318 (Springer, 1998).
Book Google Scholar

Download references

Acknowledgements

We would like to thank the company LUCTA for selflessly contributing to the creation and production of the gel of the olfactory test. Special thanks to Javier Marín Hernández, the Fragrance director of LUCTA. We greatly appreciate the support of the Ernesto Ventós Foundation. Thanks to the entire team of doctors, nurses and clinical assistants of the Emergency Department of the University Hospital Sant Joan de Reus who were able to carry out their work and also help in this research project at such a difficult time. Thanks to Mauricio Blotta, Leonor Arjona, Maria del Carmen Lareu, Luis Erwin Rodríguez, Sofia Ascanio, Laia Boters, Esther Duarte, Alba Diez, Ona Benach, Nuria Garcia, Andrea Simón, María Casanovas, Aurora del Pleguezuelos, María Francisca Mestre, Cristina Mañero, Anna María García, María Jesús Sevillano, María Teresa Xicola, Anna Baúles, Felicitas Garcia, Esther Lidia Marti, Tania Fabregat, Ana Dueñas, Mar Angels, Alba Rofes, Cristina Rodríguez, María Zaragoza, Carlos Olguin, Ana Català, Andrea Martos, Marta Pont, Camila Alejandra Hidalgo, Lourdes Fernández, Victoria Hidalgo, Marta Taberner, Giuliana Correa, Ana Vicente, Ignacio Escrigas, Jose Francisco López and Laia Mir. Thanks to all the collaborators from the Institut Català de la Salut: Jonathan Caselles Martinez, Judith Alonso Sansó, Raquel Navas Nuñez, Laia Sabaté Miró, Lucia Virginia Ndjoli Malonga, Laura Aubí Masip, Eva Torne Rivas, Maria Mercè Vizcarro López, Maria del Carme Barceló Prats, Maria Isabel Tomsen Fajardo, María Cinta Prats Bartra, Irene Pijuán Robledo, Maria Jesús Sánchez Lozano and Blanca Canela Llauradó. E.L. is a recipient of the 2018 ICREA Academia Award.

Author information

Authors and Affiliations

Servei d’Urgències, Hospital Universitari Sant Joan, Reus, Spain
Youcef Azeli, Federico Capriles, Wojciech Rojewski, Vanesa Lopez-Madrid, Rosa Maria Serrano, Josefina Casellas & Abdelghani El Ouahabi-El Ouahabi
Sistema d’Emergències Mèdiques de Catalunya, Barcelona, Spain
Youcef Azeli
Institut d’Investigació Sanitària Pere i Virgili (IISPV), Tarragona, Spain
Youcef Azeli
Departament d’Enginyeria Química, Universitat Rovira i Virgili, Tarragona, Spain
Alberto Fernández
CUAP Reus, Gerència Territorial Camp de Tarragona, Institut Català de la Salut, Tarragona, Spain
David Sabaté-Lissner & Marta Civit
Research Support Unit-Camp de Tarragona, Catalan Institute of Health (ICS), Tarragona, Spain
Cristina Rey-Reñones
School of Medicine and Health Sciences, Universitat Rovira i Virgili, Reus, Spain
Cristina Rey-Reñones
Atenció Primaria CAP Maria Fortuny-Reus V, Reus, Spain
Rosa Maria Serrano
ORL Service Sant Joan University Hospital, Reus, Spain
Maria Foglia-Fernández
Direcció Médica, Hospital Universitari Sant Joan, Reus, Spain
Salvador Sarrá
MINOS-IURESCAT, Universitat Rovira i Virgili, Tarragona, Spain
Eduard Llobet
IDIAP Jordi Gol, Catalan Institute of Health (ICS), USR Camp de Tarragona, Reus, Spain
Cristina Rey-Reñones

Authors

Youcef Azeli
View author publications
You can also search for this author in PubMed Google Scholar
Alberto Fernández
View author publications
You can also search for this author in PubMed Google Scholar
Federico Capriles
View author publications
You can also search for this author in PubMed Google Scholar
Wojciech Rojewski
View author publications
You can also search for this author in PubMed Google Scholar
Vanesa Lopez-Madrid
View author publications
You can also search for this author in PubMed Google Scholar
David Sabaté-Lissner
View author publications
You can also search for this author in PubMed Google Scholar
Rosa Maria Serrano
View author publications
You can also search for this author in PubMed Google Scholar
Cristina Rey-Reñones
View author publications
You can also search for this author in PubMed Google Scholar
Marta Civit
View author publications
You can also search for this author in PubMed Google Scholar
Josefina Casellas
View author publications
You can also search for this author in PubMed Google Scholar
Abdelghani El Ouahabi-El Ouahabi
View author publications
You can also search for this author in PubMed Google Scholar
Maria Foglia-Fernández
View author publications
You can also search for this author in PubMed Google Scholar
Salvador Sarrá
View author publications
You can also search for this author in PubMed Google Scholar
Eduard Llobet
View author publications
You can also search for this author in PubMed Google Scholar

Contributions

Y.A. and F.C. conceived the idea. Y.A. and E.L. designed the study and the first CovidGel formula, W.R., V.L.M., D.S.-L., R.M.S., C.R.R., M.C., J.C., A.E.-E., M.F.F., collected the study data. M.F.F., C.R.R. and S.S. helped with technical support. Y.A., E.L. and A.F. analysed and interpreted the data results. A.F. built the machine learning model. Y.A. wrote the manuscript. All authors revised the manuscript.

Corresponding author

Correspondence to Youcef Azeli.

Ethics declarations

Competing interests

The authors declare no competing interests.

Additional information

Publisher's note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Supplementary Information

Supplementary Information.

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.

Reprints and permissions

About this article

Cite this article

Azeli, Y., Fernández, A., Capriles, F. et al. A machine learning COVID-19 mass screening based on symptoms and a simple olfactory test. Sci Rep 12, 15622 (2022). https://doi.org/10.1038/s41598-022-19817-x

Download citation

Received: 22 November 2021
Accepted: 05 September 2022
Published: 16 September 2022
DOI: https://doi.org/10.1038/s41598-022-19817-x

Comments

By submitting a comment you agree to abide by our Terms and Community Guidelines. If you find something abusive or that does not comply with our terms or guidelines please flag it as inappropriate.

Subjects

Abstract

Similar content being viewed by others

Long COVID: major findings, mechanisms and recommendations

Risk of COVID-19 death in adults who received booster COVID-19 vaccinations in England

Cerebrospinal fluid reference proteins increase accuracy and interpretability of biomarkers for brain diseases

Introduction

Results

Characteristics of the study population

COVID-19 symptoms and olfactory test results

Results of the machine learning predictive model

Discussion

Conclusion

Methods

Study design and setting

Participants

Olfactory test development

Description of the olfactory test

Data collection

Model development and internal validation

Statistical analysis

Data availability

References

Acknowledgements

Author information

Authors and Affiliations

Contributions

Corresponding author

Ethics declarations

Competing interests

Additional information

Publisher's note

Supplementary Information

Supplementary Information.

Rights and permissions

About this article

Cite this article

Share this article

Comments

Search

Quick links