Thank you for visiting nature.com. You are using a browser version with limited support for CSS. To obtain the best experience, we recommend you use a more up to date browser (or turn off compatibility mode in Internet Explorer). In the meantime, to ensure continued support, we are displaying the site without styles and JavaScript.

# Genetic variability in the expression of the SARS-CoV-2 host cell entry factors across populations

## Abstract

The entry of SARS-CoV-2 into host cells is dependent upon angiotensin-converting enzyme 2 (ACE2), which serves as a functional attachment receptor for the viral spike glycoprotein, and the serine protease TMPRSS2 which allows fusion of the viral and host cell membranes. We devised a quantitative measure to estimate genetic determinants of ACE2 and TMPRSS2 expression and applied this measure to >2500 individuals. Our data show significant variability in genetic determinants of ACE2 and TMPRSS2 expression among individuals and between populations, and indicate a genetic predisposition for lower expression levels of both key viral entry genes in African populations. These data suggest that host genetics related to viral entry mechanisms might influence interindividual variability in disease susceptibility and severity of COVID-19.

## Introduction

The severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2) is a novel single-stranded RNA virus of the Coronaviridae family. This recently emerged virus is the cause of a pandemic infection that can result in severe life-threatening disease (coronavirus disease-2019; COVID-19) [1, 2]. Similar to SARS-CoV (which caused SARS), SARS-CoV-2 entry into target host cells is mediated through binding of the viral spike glycoprotein to angiotensin-converting enzyme 2 (ACE2) on the cell surface [3,4,5,6]. The interaction between coronaviral spike proteins and their attachment receptor ACE2 is believed to be key for viral transmissibility and dissemination of infection to organs and tissues [7]. In addition, the host cell serine protease TMPRSS2 cleaves the spike protein of SARS-CoV-2 to allow fusion of the viral and host cell membranes, which is an essential step in viral entry [3, 8]. Other molecules have been proposed to play alternative roles in the SARS-CoV-2 entry mechanisms. It has been suggested that cathepsin B and L may cleave the spike protein in the absence of TMPRSS2 [3]. The results of other studies proposed that SARS-CoV-2 entry is primarily mediated through endocytosis and that PIKfyve, TPC2, and cathepsin L but not cathepsin B are essential for viral entry [5, 8]. In addition, it has been described that furin may have an effect along with TMPRSS2 and cathepsins on activating viral entry [8]. However, a previous study revealed inconclusive results since furin preactivation enhanced or reduced SARS-CoV-2 pseudovirus entry in different cell types [6, 8]. These data suggest that SARS-CoV-2 may use alternative mechanisms to enter the cells but the role of these mechanisms and molecules involved is less clear and needs further investigation.

To gain insight into genetic determinants of SARS-CoV-2 transmissibility and potential viremia and disseminated infection, we devised a quantitative measure to assess the cumulative effect of genetic variants upon the expression of the two key molecules involved in SARS-CoV-2 viral entry, ACE2 and TMPRSS2, and then applied this measure to 2504 individuals from 5 different populations around the world.

## Results and discussion

We calculated a cumulative genetic expression score (GES) for ACE2 and TMPRSS2, as a measure to estimate host genetic determinants of viral entry of SARS-CoV-2. We evaluated this measure in 2504 individuals across the 5 major populations included in the 1000 Genomes project (African, Admixed American, European, East Asian, and South Asian) [9]. Because ACE2 is located on the X-chromosome, we analyzed female and male individuals separately. There was a significant difference in the cumulative GES of ACE2 between populations (ANOVA P < 0.0001 for both male and female groups). Genetic determinants of highest expression of ACE2 were observed in South Asian and East Asian populations, while African populations were genetically associated with the lowest ACE2 expression levels (Fig. 1).

Similarly, significant differences for TMPRSS2 were observed in both female and male groups across populations (ANOVA P < 0.0001). East Asian populations had the highest values for genetic determinants of TMPRSS2 expression, and Africans showed genetic predisposition for the lowest TMPRSS2 expression levels across populations (Fig. 1).

As mentioned earlier, ACE2 is located on the X-chromosome (and not on the pseudoautosomal region). Therefore, female individuals will have two copies of the gene while males will only have one. Normally, X-chromosome genes are subject to random X-chromosome inactivation, silencing one gene copy in females to keep gene expression balance between females and males. However, a number of X-chromosome genes, including ACE2, are known to escape X-chromosome inactivation [10]. We did not observe differences in genetically determined ACE2 expression between male and female individuals. Previous reports showed higher expression of ACE2 in male compared to female tissues, which was predominantly attributed to non-genetic factors, consistent with our findings [10, 11]. No difference between male and female individuals for TMPRSS2 was observed in our study.

These data suggest that genetic determinants of ACE2 and TMPRSS2 expression might play a role in the variability of transmission and severity of SARS-CoV-2 between populations. African populations showed a genetic predisposition for lower expression levels of both ACE2 and TMPRSS2, which are vital for SARS-CoV-2 entry into host cells. These data suggest that a genetic component might contribute to lower numbers of reported COVID-19 cases in Africa. However, it remains likely that non-genetic factors such as age and comorbidities might play a more important role than host genetic elements, especially in determining disease severity and outcome in infected individuals. In addition, genome-wide association studies will be needed to characterize genetic susceptibility to a more severe disease course in patients infected with COVID-19. Additional studies to replicate and extend our findings and examine expression levels of ACE2 and TMPRSS2 in different cell types across populations and in patients infected with COVID-19 are warranted.

## Methods

We devised a cumulative GES to estimate genetically determined expression of ACE2 and TMPRSS2. We used expression quantitative trait loci (eQTL) data for ACE2 and TMPRSS2 in tissues included in the Genotype-Tissue Expression project (GTEx, release V8) [12]. All eQTL variants affecting ACE2 and TMPRSS2 expression across all cell types and tissues were identified, and then pruned to remove variants in linkage disequilibrium (LD). LD pruning was performed using PLINK v.1.9 and the combined haplotypes of the 1000 Genomes Project populations [13]. For variants that demonstrate eQTL effects in multiple tissues, the most significant eQTL normalized effect size value was used. The genetic variants used to calculate GES values for ACE2 and TMPRSS2 are shown in Supplementary Dataset 1. A total of 21 genetic polymorphisms that affect ACE2 expression, and 14 that affect TMPRSS2 expression were identified and used in subsequent analyses. The cumulative GES was derived using the formula: $${\mathrm{GES = }}\mathop {\sum}\nolimits_{{\mathrm{i = 1}}}^x {\left( {n_i \times {\mathrm{NES}}_i} \right)}$$, where n is the number of alternative alleles (0, 1, or 2), x is the number of evaluated variants in ACE2 and TMPRSS2, and NES is the normalized effect size which reflects the expression in the alternative allele relative to the reference allele for each variant. We calculated the GESs for ACE2 and TMPRSS2 in a total of 2504 individuals from the five major populations included in the 1000 Genomes Project phase 3 release: African, n = 661; Admixed American, n = 347; East Asian, n = 504; European, n = 503; and South Asian, n = 489 [9]. To determine if the cumulative GES was different across populations, one-way ANOVA following by Tukey’s multiple comparison test was performed using GraphPad Prism version 8.1.1 (GraphPad Software, La Jolla California USA). ANOVA P values < 0.05 and Tukey’s adjusted P values < 0.05 were considered significant.

## References

1. 1.

Huang C, Wang Y, Li X, Ren L, Zhao J, Hu Y, et al. Clinical features of patients infected with 2019 novel coronavirus in Wuhan, China. Lancet. 2020;395:497–506.

2. 2.

Zhou F, Yu T, Du R, Fan G, Liu Y, Liu Z, et al. Clinical course and risk factors for mortality of adult inpatients with COVID-19 in Wuhan, China: a retrospective cohort study. Lancet. 2020;395:1054–62.

3. 3.

Hoffmann M, Kleine-Weber H, Schroeder S, Kruger N, Herrler T, Erichsen S, et al. SARS-CoV-2 cell entry depends on ACE2 and TMPRSS2 and is blocked by a clinically proven protease inhibitor. Cell. 2020;181:271–.e8.

4. 4.

Zhou P, Yang XL, Wang XG, Hu B, Zhang L, Zhang W, et al. A pneumonia outbreak associated with a new coronavirus of probable bat origin. Nature. 2020;579:270–3.

5. 5.

Ou X, Liu Y, Lei X, Li P, Mi D, Ren L, et al. Characterization of spike glycoprotein of SARS-CoV-2 on virus entry and its immune cross-reactivity with SARS-CoV. Nat Commun. 2020;11:1620.

6. 6.

Walls AC, Park YJ, Tortorici MA, Wall A, McGuire AT, Veesler D. Structure, function, and antigenicity of the SARS-CoV-2 spike glycoprotein. Cell. 2020;181:281–92. e6.

7. 7.

Gu J, Korteweg C. Pathology and pathogenesis of severe acute respiratory syndrome. Am J Pathol. 2007;170:1136–47.

8. 8.

Shang J, Wan Y, Luo C, Ye G, Geng Q, Auerbach A, et al. Cell entry mechanisms of SARS-CoV-2. Proc Natl Acad Sci USA. 2020;117:11727–34.

9. 9.

Genomes Project C, Auton A, Brooks LD, Durbin RM, Garrison EP, Kang HM, et al. A global reference for human genetic variation. Nature. 2015;526:68–74.

10. 10.

Tukiainen T, Villani AC, Yen A, Rivas MA, Marshall JL, Satija R, et al. Landscape of X chromosome inactivation across human tissues. Nature. 2017;550:244–8.

11. 11.

Liu J, Ji H, Zheng W, Wu X, Zhu JJ, Arnold AP, et al. Sex differences in renal angiotensin converting enzyme 2 (ACE2) activity are 17beta-oestradiol-dependent and sex chromosome-independent. Biol Sex Differ. 2010;1:6.

12. 12.

Consortium GT, Laboratory DA, Coordinating Center—Analysis Working Group, Statistical Methods groups—Analysis Working Group, Enhancing Gg, Fund NIHC, et al. Genetic effects on gene expression across human tissues. Nature. 2017;550:204–13.

13. 13.

Chang CC, Chow CC, Tellier LC, Vattikuti S, Purcell SM, Lee JJ. Second-generation PLINK: rising to the challenge of larger and richer datasets. Gigascience. 2015;4:7.

## Acknowledgements

AHS is funded by the Lupus Research Alliance and the National Institute of Arthritis and Musculoskeletal and Skin Diseases of the National Institutes of Health grant number R01AR070148.

## Author information

Authors

### Contributions

LO-F Performed the analysis, interpreted that data, and edited the manuscript. AHS Conceived and designed the study, interpreted the data, and wrote the manuscript.

### Corresponding author

Correspondence to Amr H. Sawalha.

## Ethics declarations

### Conflict of interest

The authors declare that they have no conflict of interest.

Publisher’s note Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

## Rights and permissions

Reprints and Permissions

Ortiz-Fernández, L., Sawalha, A.H. Genetic variability in the expression of the SARS-CoV-2 host cell entry factors across populations. Genes Immun 21, 269–272 (2020). https://doi.org/10.1038/s41435-020-0107-7

• Revised:

• Accepted:

• Published:

• Issue Date:

• ### The immune dysregulations in COVID-19: Implications for the management of rheumatic diseases

• Fan Xiao
• , Man Han
• , Xiaoxia Zhu
• , Yuan Tang
• , Enyu Huang
• , Hejian Zou
• , Quan Jiang
•  & Liwei Lu

Modern Rheumatology (2021)

• ### Genomic and epidemiological characteristics of SARS-CoV-2 in Africa

• Jones Lamptey
• , Favour Oluwapelumi Oyelami
• , Michael Owusu
• , Bernard Nkrumah
• , Paul Oluwagbenga Idowu
• , Armin Czika
• , Philip El-Duah
• , Richmond Yeboah
• , Augustina Sylverken
• , Oluwatayo Israel Olasunkanmi
• , Ellis Owusu-Dabo
• , Christian Drosten
•  & Zvi Bentwich

PLOS Neglected Tropical Diseases (2021)

• ### High Rates of SARS-CoV-2 Family Transmission in Children of Healthcare Workers During the First Pandemic Wave in Madrid, Spain

• Ana Méndez-Echevarría
• , Talía Sainz
• , Beatriz de Felipe
• , Sonia Alcolea
• , Peter Olbrich
• , Walter A. Goycochea-Valdivia
• , Luis Escosa-García
• , Lorena Cobo
• , Cristina Calvo
•  & Olaf Neth

Pediatric Infectious Disease Journal (2021)

• ### Genetics Insight for COVID-19 Susceptibility and Severity: A Review

• Ingrid Fricke-Galindo
•  & Ramcés Falfán-Valencia

Frontiers in Immunology (2021)

• ### SARS-CoV-2 Entry Related Viral and Host Genetic Variations: Implications on COVID-19 Severity, Immune Escape, and Infectivity

• Szu-Wei Huang
•  & Sheng-Fan Wang

International Journal of Molecular Sciences (2021)