Medical genomics has focused almost entirely on those of European descent. Other ethnic groups must be studied to ensure that more people benefit, say Carlos D. Bustamante, Esteban González Burchard and Francisco M. De La Vega.
Those most in need must not be the last to benefit from genetic research
Reviewers and granting bodies must demand racial and ethnic diversity in genome studies
Global genomics needs the financial support of governments and non-profits
In the past decade, researchers have dramatically improved our understanding of the genetic basis of complex chronic diseases, such as Alzheimer's disease and type 2 diabetes, through more than 1,000 genome-wide association studies (GWAS). These scan the genomes of thousands of people for known genetic variants, to find out which are associated with a particular condition.
Yet the findings from such studies are likely to have less relevance than was previously thought for the world's population as a whole. Ninety-six per cent of subjects included in the GWAS conducted so far are people of European descent1(see 'Sampling bias'). And a recent Nature survey suggests that this bias is likely to persist in the upcoming efforts to sequence people's entire genomes2.
Geneticists worldwide must investigate a much broader ensemble of populations, including racial and ethnic minorities. If we do not, a biased picture will emerge of which variants are important, and genomic medicine will largely benefit a privileged few.
Since the 1970s, geneticists have known that most of the genetic variance between individuals stems from differences in DNA sequence (genetic variants). Of the millions of small sequence differences identified worldwide, some are 'common variants' — that is, they are found in more than 5% of people in many populations, and some of these occur widely, in people of different geographical and ethnic origins. Indeed, most GWAS have sought to find common variants associated with disease, in the hope that discoveries in one population will generalize to others.
GWAS have unearthed clear associations between common variants and many common diseases. But depending on the condition in question, these explain only between 5% and 50% of the diseases' heritability. Many of the genetic factors thought to be responsible are still 'missing'.
This suggests that 'rare' genetic variants (those that occur in less than 5% of the world's population but which comprise the bulk of genetic variants) may be disproportionately important3 — both in determining a person's risk of getting a complex disease and in predicting their response to a particular drug4. Rare variants tend to be population specific5 (see 'Comparing the uncomparable'). So if they do play a key part in disease, the lack of diversity in genetic studies will be severely skewing our understanding of which are important.
Several researchers have begun to assess the ability to generalize GWAS discoveries between different populations. Preliminary results suggest that findings from one population may not always easily translate to the rest of the world, although many more and larger comparisons are needed.
For example, in people with Native South American ancestry, a particular variant of a protein that transports cholesterol into cells is common and is strongly associated with low levels of high-density lipoprotein cholesterol, obesity and type 2 diabetes. European, Asian and African populations do not have this variant6.
Conversely, in dozens of studies in European populations, researchers have found 19 common single-nucleotide changes that are strongly associated with type 2 diabetes. In a further study of 6,000 people including European Americans, African Americans, Latinos, Japanese Americans and Native Hawaiians, 13 of these polymorphisms continue to be strongly associated with the disease7. Yet 5 of the 19 variants seem to have different effects in the different ethnic groups, and the role of one variant is unclear.
There are several reasons why findings in one population might not generalize to another. Disease-associated versions (or alleles) of a gene may vary substantially in frequency in different ethnic groups. Also, GWAS identify genetic markers associated with a particular trait, not the mutations causing the disease. If a given marker is linked to a mixture of common and rare causal alleles8, some of the rare ones are likely to differ in frequency in different populations, or even be completely absent in some5.
The degree to which common genetic markers are linked to underlying causal mutations will also vary depending on the population being studied. For example, African populations are generally more genetically diverse than European, Asian or indigenous American populations, so one might expect to see weaker associations between markers and mutations in African and African-diaspora populations (such as African-Americans).
As well as genes and the environment differing between populations, the gene–environment interactions can vary, and these could significantly change a person's likelihood of developing a disease.
Already there is evidence that measures of genetic ancestry can improve clinical care for people of mixed race. For example, physicians assessing the effects of lung disease compare measures of lung function (obtained by having the patient breathe into a spirometer) to a reference standard for healthy people of the same gender and racial group. Doctors make more accurate diagnoses when they use patients' actual genetic ancestry to make comparisons, instead of self-selected or inferred categorizations of race or ethnicity9.
Likewise, researchers this year showed that Native American ancestry is associated with a greater risk of childhood acute lymphoblastic leukaemia returning after a remission, and that children with more than 10% Native American ancestry need an additional round of chemotherapy to respond to the treatment10.
So why are geneticists reluctant to undertake studies of people with diverse ancestries?
A key concern is that as more populations are included in a study, it becomes increasingly likely that a sequence variant will be associated with disease because of differences in race or ethnicity between cases and controls, rather than because of differences in people's disease status. A lack of appropriate methods or access to the right data may make it difficult for many investigators to control for this. Certainly, it is hard to both collect samples from tens or hundreds of thousands of patients, and balance costs, statistical power and project deadlines with broad ethnic representation.
Such challenges, however, do not justify restricting the beneficiaries of medical genomic research to a small subset of humanity. Population-based studies must be carried out on a global scale. This means giving incentives to researchers in developed countries to increase the representation of minority populations in their studies and — crucially — empowering investigators in the developing world to undertake genomics research themselves.
The 'missing heritability problem' has led many to become dismissive of GWAS. A danger of this GWAS fatigue is that it deters others from applying the approach to populations where it is likely to yield excellent results. GWAS has proved most successful in relatively small homogeneous populations — in Finland, Iceland and Costa Rica, say, where people generally stay put. Large families and limited migration are common among populations in Latin America, Africa and South Asia — suggesting that new and important associations between diseases and regionally common genetic variants may be found easily in these groups.
Replicating an association study in a different ethnic group is often one-tenth the original cost.
Moreover, large-scale GWAS are a feasible option for many research groups worldwide given that it now costs less than US$250 to obtain genetic data across millions of markers per person. (Whole-genome sequencing costs about 20 times more.) In fact, replicating an association study in a different ethnic group is often one-tenth the original cost. So at the very least, associations found in Europeans should be investigated in other ethnic groups.
Key to the success of global-scale GWAS are extensive and accurate catalogues of human genomic variation. The 1,000 Genomes Project is an excellent first step to providing a reference resource for researchers worldwide. It aims to catalogue genetic variants occurring in more than 1% of various populations throughout the world — including mixed-race North and South Americans as well as diverse populations from Africa, Europe, the Far East and South Asia.
Boosting genomic studies globally will also require initiatives that foster collaboration between countries, and enable the transfer of funding and technology beyond China, the United States and the European Union. The Human Heredity and Health in Africa Initiative, for example, is empowering local researchers. Supported by the US National Institutes of Health (NIH) and the UK Wellcome Trust in London, the initiative is the first major attempt to help African investigators study the genomic and environmental determinants of common diseases in African populations.
The private and philanthropic sector can also play an important part. The Slim Initiative for Genomic Medicine, a collaboration between the Mexican National Human Genome Research Institute and the Broad Institute of the United States, was launched last year and is enabling researchers to study type 2 diabetes and cancer in Latin American populations. This project is funded by the charitable foundation of the Mexican business magnate and philanthropist, Carlos Slim Helú.
For these efforts to be successful, researchers and physicians in the participating developing countries cannot simply provide samples. In addition, local expertise, resources and technology centres must be developed so that local populations benefit directly from home-grown research — as at the BGI (formerly the Beijing Genomics Institute) in Shenzhen, China. Local researchers will often better understand the history of local populations, such as whether they have recently switched from a rural to an urban lifestyle, and so will have deeper insights into likely environmental effects.
Being more inclusive doesn't win points for grant applicants.
At the same time, medical geneticists working in wealthy nations must include their own minority and immigrant populations in their studies and develop the tools needed to compare results between populations. Many 'exome' sequencing projects (which sequence only the coding regions of the genome) are moving in this direction. For example, the US National Heart, Lung and Blood Institute aims to sequence the exomes of 7,000 people, roughly half of whom are African-Americans, to identify variants associated with cardiovascular and lung diseases.
Piecing together the mosaic
To make medical genomics truly global, geneticists need new statistical methods to dissect the contribution of genetic, socio-cultural and environmental factors to both chronic and infectious disease.
Currently, 'ancestry metrics' are used to correct for the effect of shared ancestry on the results of association studies. But these methods do not work very well for groups whose genomes are a mosaic of fragments drawn from many different populations. Reference data from the relevant ancestral populations, including historically marginalized populations such as native Americans and Australian Aborigines, will help geneticists to separate spurious from real associations. And such understudied populations must be properly on board for this to happen. Researchers should gauge local values and concerns, and invest time and money into education and outreach to explain to the people they intend to study, as well as to the general public, why studying global (and local) health is so important.
The Center for Research on Genomics and Global Health headed by Charles Rotimi at the US National Human Genome Research Institute in Bethesda, Maryland, is beginning to gather the data and formulate the methods needed to understand the complex interplay that creates health disparities among ethnic groups for diseases such as type 2 diabetes, hypertension and obesity in Africa and elsewhere. Meanwhile, the Slim, the Wellcome Trust and the Bill & Melinda Gates Foundation based in Seattle, Washington, have begun to support research in understudied populations. Ultimately, however, global genomics needs the financial support of governments.
One way to encourage researchers to branch out may be for peer reviewers and granting bodies to stress the importance of racial and ethnic diversity in medical genetic studies. The NIH mandated the inclusion of diverse subjects in 1985. In the 26 years since, just 7% of GWAS have included minorities — perhaps because being more inclusive doesn't win points for grant applicants.
It is tempting to focus on populations that are motivated, organized, medically compliant and otherwise easy to study. But by failing to develop resources, methodologies and incentives for underserved people, we risk perpetuating the health disparities that plague the medical system. Those most in need must not be the last to receive the benefits of genetic research.
Need, A. C. & Goldstein, D. B. Trends Genet. 25, 489–494 (2009).
Nature 467, 1026–1027 (2010).
Manolio, T. A. et al. Nature 461, 747–753 (2009).
McClellan, J. & King, M. C. Cell 141, 210–217 (2010).
Gravel, S. et al. Proc. Natl Acad. USA doi:10.1073/pnas.1019276108 (2011).
Acuna-Alonzo, V. et al. Hum. Mol. Genet. 19, 2877–2885 (2010).
Waters, K. M. et al. PLoS Genet. 6, e1001078 (2010).
Adeyemo, A. & Rotimi, C. Public Health Genomics 13, 72–79 (2010).
Kumar, R. et al. N. Engl. J. Med. 363, 321–330 (2010).
Yang, J. J. et al. Nature Genet. 43, 237–241 (2011).
C.D.B. is on the Scientific Advisory Board of the Slim Initiative for Genomic Medicine and part of the International Working Group for the Center for Research on Genomics and Global Health. He is a funded investigator of the National Heart, Lung, and Blood Institute's Exome Sequencing Project and of the National Human Genome Research Institute's 1,000 Genomes Project. C.D.B. and E.G.B. are unpaid consultants to 23andMe, a company in Mountain View, California, that does direct-to-consumer genetic testing. C.D.B. is also a consultant to Ancestry.com, a company in Provo, Utah, that provides direct-to-consumer products for family history tracking and genetic ancestry.
About this article
Cite this article
Bustamante, C., De La Vega, F. & Burchard, E. Genomics for the world. Nature 475, 163–165 (2011). https://doi.org/10.1038/475163a
This article is cited by
Evaluating significance of European-associated index SNPs in the East Asian population for 31 complex phenotypes
BMC Genomics (2023)
Similarity and diversity of genetic architecture for complex traits between East Asian and European populations
BMC Genomics (2023)
Nature Communications (2023)
npj Precision Oncology (2023)
Sankhya B (2023)