Predictive factors for the presence and long-term persistence of SARS-CoV-2 antibodies in healthcare and university workers

While patient groups at risk for severe COVID-19 infections are now well identified, the risk factors associated with SARS-CoV-2 (severe acute respiratory syndrome coronavirus 2) transmission and immunization are still poorly understood. In a cohort of staff members of a Belgian tertiary academic hospital tested for SARS-CoV-2 antibodies during the early phase of the pandemic and followed-up after 6 weeks, 3 months and 10 months, we collected personal, occupational and medical data, as well as symptoms based on which we constructed a COVID-19 score. Seroprevalence was higher among participants in contact with patients or with COVID-19 confirmed subjects or, to a lesser extent, among those handling respiratory specimens, as well as among participants reporting an immunodeficiency or a previous or active hematological malignancy, and correlated with several symptoms. In multivariate analysis, variables associated with seropositivity were: contact with COVID-19 patients, immunodeficiency, previous or active hematological malignancy, anosmia, cough, nasal symptoms, myalgia, and fever. At 10 months, participants in contact with patients and those with higher initial COVID-19 scores were more likely to have sustained antibodies, whereas those with solid tumors or taking chronic medications were at higher risk to become seronegative.

Since the beginning of the COVID-19 (COronaVIrus Disease 2019) pandemic, virtually no one has been spared exposure to SARS-CoV-2 (Severe Acute Respiratory Syndrome CoronaVirus 2). Numerous epidemiological studies have been conducted in the general population or among healthcare workers, but few data are available on subject characteristics, direct exposure, and symptoms in large cohorts. Using data from our four-phase monitoring of the humoral response to SARS-CoV-2 infection in nearly 4000 volunteer healthcare and nonhealthcare workers 1 , we aimed to identify predictors of SARS-CoV-2 seropositivity and long-term persistence of these antibodies. While we used several serological tests to identify IgM, IgA and IgG in the principal study, we focused on the results of the IgG DiaSorin assay for this part of the study. SARS-CoV-2 serologic tests were performed on days 0, 45, and 90, and participants with positive results in phases 1, 2, and/or 3 (positive for ≥ 1 IgG and/or for ≥ 2 IgM or IgA tests) were again invited to undergo SARS-CoV-2 IgG testing 10 months after the first sampling.

Results
Phase 1. Between April 6 and May 5 2020, 3776 staff members were tested (74.7% women, median age 39.8 years), and 336 (8.9%) participants had a detectable IgG immune response against SARS-CoV-2 using the DiaSorin assay. As expected, IgG seroprevalence significantly increased over time (time calculated from March 1, date of the first Belgian case of COVID-19, in weeks; OR 1.044 [95% CI 1.028-1.060], p < 0.0001), from 6.3% among participants tested during the first week (week 5 of the pandemic in Belgium) to 14.0% in participants tested during the last week of phase 1 (week 8 of the pandemic in Belgium). In subsequent analyses, all other variables were therefore adjusted for time between March 1 and testing.
Among the 3776 participants, 3719 consented to share personal and health data and were included in the subsequent analyses. Seroprevalence was similar regardless of gender, age, living area or recent travels, and tended to increase with weight (although this did not reach statistical significance; OR 1.007 [95% CI 0.999-1.015], p = 0.078), but not with body mass index. IgG seropositivity was not different in smokers and non smokers, but tended to decrease, although not significantly, with the number of cigarettes smoked per day (OR 0.941 [95% CI 0.884-1.002], p = 0.059) ( Table 1). SARS-CoV-2 seropositivity was significantly higher in participants with previous or active hematological malignancy (31.3% vs 8.7%, OR 4.83 [95% CI 1.65-14.14], p = 0.004) and in those with an immunodeficient status (26.3% vs 8.7%, OR 3.28 [95% CI 1. .26], p = 0.02). There was no impact of other severe comorbidities, ongoing pregnancy, or chronic medications (Supplementary Table S1).
Regarding risk of exposure, we observed a higher seroprevalence in staff members working in contact with patients (9.5% vs 7.5%, OR 1.30 [95% CI 1.01-1.68], p = 0.04), and particularly in nurses (10.6%; OR 1.57 [95% CI 1.06-2.33] compared to the administrative staff) ( Table 2). Additional detailed comparisons (27 categories) were performed: compared to secretaries (who are not in contact with patients), seroprevalence was higher in most occupational categories in direct contact with patients (reception staff members, nurse assistants, porters, occupational therapists, nurses, physicians and physicians in training). The other functions were not identified as risk factors (Supplementary Table S2 Table 2). Laboratory workers handling respiratory specimens tended to have a higher SARS-CoV-2 seroprevalence (13.5% vs 8.8%, OR 1.82 [95% CI 0.97-3.39], p = 0.062). There was no significant difference according to type of mask used (if any), but this differed strongly with exposure (indeed, at the beginning of the pandemic, mask availability was insufficient-particularly for FFP2 masks, and their use was therefore adapted to the risk of exposure).
Multivariate analyses included all variables with p-values < 0.10 in univariate analyses (excluding the previous diagnosis of COVID -19), with symptoms either detailed separately or pooled in our COVID-19 score. In the first model including all symptoms, significant variables associated with higher SARS-CoV-2 seroprevalence were: time between March 1 and testing, immunodeficiency, previous or active hematological malignancy, type of contact with COVID-19 patients, former (> vs < 14 days) symptoms, three previous symptoms i.e. anosmia, Table 1. Predictors of SARS-Cov-2 positive serology (IgG DiaSorin) in phase1: subject characteristics. Place of residence, specific workplace, number/country/period of travel abroad, and type of mask used were also tested and found not significant. SD: standard deviation; BMI = body mass index; OR = odds ratio of univariate logistic regression models adjusted for time between March 1 and testing; CI = confidence interval. a 57 subjects did not consent to sharing personal health data. b 29 subjects who reported being smokers did not share their tobacco consumption. www.nature.com/scientificreports/ myalgia and fever, while sore throat was associated with a lower seroprevalence (Fig. 1a). In the second model where individual symptoms were replaced by the COVID-19 score, the latter was significantly associated with a higher seroprevalence, as well as the same variables as in the first model, with the addition of handling of respiratory specimens (Fig. 1b).

Phases 2 and 3.
When analyzing the evolution of IgG after 6 (phase 2) and 12 weeks (phase 3) in 3187 and 2498 subjects who completed phases 2 and 3, respectively, we observed broadly the same variables associated with seroprevalence overall in phases 1-2-3 in univariate analyses adjusted for time between March 1 and testing, except for the appearance of a negative correlation between seroprevalence and age (OR 0.989 [95% CI 0.980-0.998], p = 0.013) (Supplementary Tables S3-S6). In multivariate analysis, in the first model including all symptoms, significant variables associated with higher seroprevalence overall in phases 1-2-3 were: type of contact with COVID-19 patients, immunodeficiency, previous or active hematological malignancy, five previous symptoms i.e. anosmia, myalgia, cough, nasal symptoms and fever, while sore throat and abdominal pain were associated with lower seroprevalence. In the second model in which individual symptoms were replaced by the COVID-19 score, significant variables associated with a higher seroprevalence were time between March 1 and testing, type of contact with COVID-19 patients, immunodeficiency, previous or active hematological malignancy, and COVID-19 score, while variables associated with lower seroprevalence were age and smoking (Supplementary Fig. S1a-b).
Phase 4. Ten months later, 277 (88.5%) of 313 previously seropositive participants had detectable IgG by the DiaSorin test. Factors associated with sustained seropositivity in both univariate and multivariate analyses were contact with patients and most symptoms (only myalgia in the multivariate analysis) as well as our COVID-19 score, whereas participants with a solid tumor or taking any chronic medication were more likely to become seronegative (Fig. 2a, b).

Discussion
In our cohort, analysis of personal and occupational factors of our participants identified several predictors of SARS-CoV-2 IgG seropositivity. As expected at the beginning of a pandemic, seroprevalence increased with time throughout the study weeks, and was higher among workers in contact with patients, but also, to a lesser extent, among laboratory staff handling respiratory specimens. Contact with COVID-19 patients increased the risk of seroconversion, with a higher risk for household contacts than for contacts with infected patients or colleagues (as confirmed in other cohorts 2 ). These results highlight the value of protective equipment at work, and should encourage us to maintain strict measures in hospitals, and even to reinforce protective protocols for www.nature.com/scientificreports/ all healthcare workers in contact with patients, and particularly with COVID-19 patients (or with respiratory samples from them). We confirmed the pertinence of the COVID-19 score used to screen patients at risk for SARS-CoV-2 infection. Anosmia, fever, myalgia and cough were the strongest predictive symptoms of seropositivity. Importantly, 5.8% of asymptomatic participants had a positive serology, emphasizing the need for large-scale screening strategies to detect asymptomatic carriers as part of a risk management strategy. On the other hand, 21/82 participants diagnosed with COVID-19 based on RT-PCR had a negative serology, probably because most of them experienced a mild form of the disease (only one subject from our cohort was hospitalized). Indeed, antibody levels were correlated with symptom severity in several studies [3][4][5] , which was confirmed in our cohort. In those patients who did not develop detectable anti-SARS-CoV-2 antibodies despite PCR-proven infection, a certain degree of immune response is nevertheless possible as cellular immune responses (not evaluated here) also participate to SARS-CoV-2 immunity 6 .
When analyzing the serological evolution after 10 months, being in contact with patients and a higher COVID-19 score were associated with persistent anti-Sars-CoV2 IgG. Recurrent exposure to SARS-CoV-2 could possibly have contributed to stability of IgG levels in some healthcare workers in contact with patients. Indeed, several studies have shown the persistence of IgG + memory B cells after SARS-CoV-2 infection, which suggests the possibility of a rapid antibody response upon re-exposure 7,8 .
Participants reporting an immunocompromised status or a previous or active hematological malignancy (but not a solid tumor) had a higher seroprevalence. This could appear contradictory to other studies that have shown an impaired humoral response to natural infection with SARS-CoV-2 in patients with hematological malignancies, and to a lower extent in those with solid tumors 9 . However, in contrast to these studies, most of the subjects in our cohort who reported a history of hematological malignancy (mostly lymphoma) were young, in remission and without treatment. Weaker humoral responses to SARS-CoV-2 vaccines have also been described in patients suffering from hematological malignancies, but mostly in patients undergoing treatment  www.nature.com/scientificreports/ or recently treated with anti-B-cell therapies, or after allogeneic transplantation [10][11][12][13] . There are now consistent data showing that immunocompromised patients with hematological malignancies [14][15][16] or after allogeneic transplantation 17 developing COVID-19 infection have unfavorable outcomes. Our results might indicate that young subjects with a distant history of hematological malignancy are more susceptible to be infected when in contact with patients, but develop efficient immune responses. However, the small number of subjects do not allow us to draw firm conclusions on this topic on the basis of these data. The same hypothesis could be made regarding our group of mildly immunodeficient subjects, but the type of disease or treatment was heterogeneous  www.nature.com/scientificreports/ (mainly immunosuppressive therapy for autoimmune disease, but also leucopenia, splenectomy, IgA deficiency, …), preventing us from advancing any conclusion on this topic. In our study, long-term follow-up identified an active or previous solid tumor (and not hematological malignancy) as a risk factor for loss of seropositivity. It remains to be confirmed in large cohorts of cancer patients that protection against SARS-Cov-2 infection may be less durable in such seropositive patients. Finally, in some analyses, smoking was associated with a lower SARS-CoV-2 seroprevalence, which has also been observed in a French cohort 18 . Interestingly, smoking has been reported to increase ACE2 receptor expression 19 , to decrease the humoral response to SARS-CoV-2 vaccines 20 , and to increase the risks of COVID-19-related hospitalization and death 21 .
The main limitation of our study is that the population is mainly composed of young healthcare workers, who may not be representative of the overall population. Indeed, seroprevalence in our cohort decreased slightly with age. However, SARS-CoV-2 seroprevalence was comparable in administrative hospital staff and university researchers without occupational exposure to infected patients. On the other hand, the low numbers of subjects with severe comorbidities or with chronic medications may have precluded the identification of additional predictive factors. Nevertheless, the strength of this study is the prospective collection of many subject characteristics, including health information, exposure to COVID-19 patients, and symptoms, in a large population with a long-term follow-up.
In conclusion, we demonstrate a higher prevalence and persistence of SARS-CoV-2 natural antibodies in healthcare workers in contact with patients and in those with more severe symptoms (higher COVID-19 score). Interestingly, while immunocompromised participants were at higher risk to be seropositive, they appeared to retain this natural immunity, unlike those on chronic medications or with a solid tumor. It remains to be demonstrated whether these antibodies remain similarly protective in all patients, and whether these results can be extrapolated to vaccine immunity. As the emergence of variants weakens the collective immune protection, these data are crucial to tailor protective measures in the fight against the pandemic, especially among exposed front-line caregivers.

Methods
Study design and participants. The design of this study is described in Huynen et al. 1 . In this population, we collected general information and personal health history. For previously symptomatic participants, we calculated an empiric COVID-19 score partly implemented at CHU of Liège to screen patients before admission during the first waves of the pandemic.
This study was approved by the CHU of Liège Ethics Committee under number 2020/117 and has been performed in accordance with the Declaration of Helsinki. An informed consent form was signed by each participant. Data were recorded in a centralized database and pseudo-anonymized before statistical analysis.
Serological testing. The method of serological testing is described in Huynen et al. 1 . In this part of the study, we only used the results of the IgG DiaSorin assay.

Statistical analyses.
Results are presented as means and standard deviation (SD) for continuous variables, and as frequency tables for qualitative variables, globally and for each serologic type. Univariate logistic regression models adjusted for time between March 1, 2020, and testing were applied to test seroprevalence with respect to demographics, clinical data, symptoms, and exposure. A multivariate logistic regression was done with stepwise selection. Results are reported as odds ratios (OR) and 95% confidence intervals. Results are considered significant at the 5% level (p < 0.05). Calculations were done using SAS version 9.4, and forest plots were generated using GraphPad Prism version 8.4.3.

Data availability
The study protocol and individual participant data that underlie the results reported in this article, after deidentification, can be shared with investigators whose proposed use of the data has been approved by the ethic committee of the University Hospital of Liège. Data can be provided for meta-analysis or other projects comparing the seroprevalence estimates in different regions. Requests should be addressed to the senior author at yves. beguin@chuliege.be. www.nature.com/scientificreports/