The sequencing of the human genome represents a major milestone that will have profound consequences for the practice of medicine. Many new disease genes will be identified, and this information may someday be used to predict a patient's risk of developing a specific disease or response to a particular drug. The following six News and Views articles discuss how The Human Genome Project will revolutionize the diagnosis and treatment of diseases including diabetes, asthma, cancer, autoimmunity and cardiac disease, as well as the potential for developing 'personalized therapies'. They also serve to remind us that although we have our 'genetic blueprint' in hand, a large amount of work remains before we fully understand how to best use it.
Type II diabetes has reached global epidemic proportions, and the availability of the human genome sequence will help elucidate the genetic basis of this complex disease. But finding the true diabetogenes will not be easy.
Type II (non-insulin dependent) diabetes is a complex group of disorders characterized by two apparently distinct pathophysiologic defectsinsulin resistance (a decrease in insulin action at the level of muscle, fat and liver), and a failure of the pancreatic -cells to compensate for this resistance by appropriately increasing insulin secretion. There is strong evidence for an important role of genetics in both of these components1; however identifying the genes responsible for the development of type II diabetes has proven problematic. Will the sequencing and analysis of the human genome2 help unravel the genetics of this multifactorial disorder?
Thus far, most of the success in defining type II diabetes genes (diabetogenes) has been achieved by studying relatively rare forms of the disease (Fig. 1). Maturity-onset diabetes of the young (MODY) is an autosomal dominant disease that only accounts for approximately 2% of all diabetic patients. To date, six forms of MODY have been defined, each involving a gene important for -cell function. These genes encode hepatic nuclear factor-4 (HNF-4, associated with MODY 1), glucokinase (associated with MODY 2), HNF-1 (associated with MODY 3), insulin promoter factor-1 (IPF-1, associated with MODY 4), HNF-1 (associated with MODY 5) and NEUROD1/2 (associated with MODY 6)3,
4,
5,
6. Although insulin resistance appears to be the earliest defect in most patients with type II diabetes1, insulin receptor mutations are very rare and are usually observed in patients with severe insulin resistance syndromes7. About 0.1−1% of type II diabetes is maternally inherited, associated with deafness and due to defects in the mitochondrial genome. Interestingly, the most common mitochondrial variant (A3243G in tRNAleu) also causes MELAS syndrome (mitochondrial encephalomyopathy, lactic acidosis and stroke-like episodes)8. Some forms of lipoatrophic diabetes appear to be caused by mutations in nuclear lamins A/C (ref. 9), whereas Wolfram's syndrome (diabetes insipidus, diabetes mellitus, optic atrophy and deafness) is caused by a mutation in an uncharacterized protein with 10 transmembrane segments10.
Figure 1. Location of potential type II diabetes genes.
Several loci have been identified through linkage studies (black). Other loci include genes for monogenic forms of diabetes (red), candidate genes that have shown some evidence of association (blue) and candidate genes with known amino acid polymorphisms (green). An arrow indicates the mitochondrial mutation that is associated with the syndromes of diabetes insipidus, diabetes mellitus, and optic atrophy and deafness, which also causes MELAS syndrome.
Attempts to identify the diabetogenes responsible for the most common forms of type II diabetes have depended on genome-wide linkage studies or studies of candidate genes. The pre-sequencing phase of the Human Genome Project provided genetic and physical maps11, allowing whole-genome screens to identify chromosomal regions that are linked (cosegregate), with type II diabetes in families (Fig. 1). A locus on 2q37, termed NIDDM1, is linked to diabetes in some Mexican-Americans and corresponds to a single nucleotide polymorphism (SNP) in an intron of the gene for the calcium-dependent protease calpain 10 (ref. 12). This locus appears to act together with a gene on chromosome 15 to increase diabetes susceptibility13. Loci on 20q12-13 and 1q21-24 have been linked to diabetes in several Caucasian populations, but the specific diabetogenes in these regions remain to be determined14,
15,
16,
17,
18,
19,
20.
More than 60 potential candidate genes involved in insulin action, insulin secretion and adipose metabolism (since both obesity and lipoatrophy are linked to diabetes) have also been examined in the search for type II diabetes susceptibility genes. Although variants have been identified in many of these, only a few have been shown to associate with diabetes or impaired protein function (Fig. 1). Amino acid variants in insulin receptor substrate-1 (IRS-1) have been shown to decrease insulin signaling21, and variants in the peroxisome proliferator-activated receptor and the -cell transcription factor IPF1 have been associated with an increased risk of diabetes22,
23,
24. However, for most individuals, the genes that underlie type II diabetes susceptibility remain unknown.
Clearly, the detailed human genome sequence information now available2 will lead to the identification of more candidate genes for type II diabetes and will expedite our efforts to pinpoint specific genes responsible for chromosomal linkage peaks. But finding the true diabetogenes will not be easy. Despite several major family and population studies, the intervals linked with type II diabetes are large and may contain hundreds, or even thousands, of genes. Prioritizing the analysis of these genes based on known function may be misleading, as illustrated by the surprising role of lamins as diabetogenes in some forms of lipoatrophic diabetes9.
Thus, fine mapping is important, and the draft genome sequence offers us an important tool for this task: a map of 1.4 million single nucleotide polymorphisms (SNPs)(ref. 25). By genotyping hundreds of SNPs, one should be able to narrow the locus to a diabetes-associated haplotype. However, this strategy may prove difficult if there is allelic heterogeneity (different variants at the same locus contributing to diabetes in different people), as observed in MODY (ref. 4). The prospects of using the SNP map to find diabetogenes in genome-wide scans are also complicated by the enormous variance of linkage disequilibrium throughout the genome, along with the uncertainty of the linkage disequilibrium between SNPs and the actual polymorphisms that contribute to diabetes26. Furthermore, a large number of SNP markers would be needed to cover the whole genome (up to 500,000 according to one simulation)27.
Another important way in which the genome sequence is likely to facilitate the search for type II diabetes genes is by suggesting new components of insulin secretion and insulin action. Although much progress has been made over in the last decade in dissecting these cellular pathways, there are still many unsolved areas, and mining the human genome should help us fill these gaps. For example, by searching the draft sequence for homology to genes involved in membrane compartment organization, Bock et al.28 have identified 31 previously unknown Rab proteins. This family of low-molecular-weight GTPases comprises essential components of the vesicle-trafficking machinery and may be key elements in the translocation of glucose transporters from intracellular compartments to the plasma membrane in response to insulin, as well as insulin secretion.
At least 18 paralogues (distantly related or unrelated proteins which perform the same function in different cellular lineages) of drug targets were also identified in the genome, including a paralogue of the insulin-like growth factor-1 receptor2. The same 'mining' approach can be applied to other protein classes, such as SH2 domain-containing proteins, to find other unknown paralogues of known mediators of insulin action or secretion. Once these new candidates are defined, the SNP database should be a useful tool for identifying variants at these loci that can be tested for association with diabetes. One discouraging finding in this regard, however, is that the functional classification of the proteins derived from the human genome suggests that more genes encode proteins involved in metabolism, transcription/translation and intracellular signaling, than virtually all other categories of function together2. Thus, the number of candidate diabetogenes is likely to be very large.
Finally, there are three major caveats for the utility of the genome sequence in our search for diabetogenes that cannot be neglected: 1) type II diabetes is genetically heterogenous; 2) common forms of type II diabetes are almost certainly polygenic; and 3) strong gene/gene and gene/environment interactions play important roles in development of type II diabetes . Thus, mutations or polymorphisms that cause only modest deficits in gene/protein function may become clinically significant when coupled with other genetic or acquired defects. the resulting imperfect correlation between genotype and phenotype makes the task of finding the diabetogenes a formidable one.
In the U.S., there has been a 39% increase in prevalence of type II diabetes between 1990 and 1999, reflecting the strong environmental input into disease expression29. Likewise, animal and cellular models have indicated that haplo-insufficiency of two genes, each of which is clinically silent, can result in disease expression due to strong epistatic (synergistic) gene/gene interactions30.
For the Human Genome Project to keep its promise of helping to define the genes for complex disorders such as type II diabetes, we must understand these gene/gene and gene/environment interactions and develop methods to study the cellular impact of genetic variation in a systematic way. Most importantly, we need good physiologic phenotyping to identify more homogeneous disease subgroups for genetic analysis. These are exciting times in the field of geneticshowever much needs to be done if we are to find the major genes underlying the current epidemic of diabetes.
Kahn, C.R. Insulin action, diabetogenes, and the cause of type 2 diabetes. Diabetes43, 1066−1084 (1994). | PubMed | ISI | ChemPort |
International Human Genome Consortium. Initial sequencing and analysis of the human genome. Nature409, 860−921 (2001). | Article | PubMed | ISI | ChemPort |
Vionnet, N. et al. Nonsense mutation in the glucokinase gene causes early-onset non-insulin-dependent diabetes mellitus Nature356, 721−722 (1992). | Article | PubMed | ISI | ChemPort |
Yamagata, K. et al. Mutations in the hepatocyte nuclear factor-1 gene in maturity-onset diabetes of the young. Nature384, 455−458 (1996). | Article | PubMed | ISI | ChemPort |
Malecki, M.T. et al. Mutations in NEUROD1 are associated with the development of type 2 diabetes mellitus. Nature Genet.23, 323−328 (1999). | Article | PubMed | ISI | ChemPort |
Krook, A. & O'RahillyS. Mutant insulin receptors in syndromes of insulin resistance. Baillieres. Clin. Endo. Metab.10, 97−122 (1996). | ISI | ChemPort |
Shackleton, S. et al. LMNA, encoding lamin A/C, is mutated in partial lipodystrophy. Nature Genet.2, 153−156 (2000).
Strom, T.M. et al. Diabetes insipidus, diabetes mellitus, optic atrophy and deafness (DIDMOAD) caused by mutations in a novel gene (wolframin) coding for a predicted transmembrane protein. Hum. Mol. Genet.7, 2021−2028 (1998). | Article | PubMed | ISI | ChemPort |
Hudson, T.J. et al. An STS-based map of the human genome. Science270, 1945−1954 (1995). | PubMed | ISI | ChemPort |
Horikawa, Y. et al. Genetic variation in the gene encoding calpain-10 is associated with type 2 diabetes mellitus. Nature Genet.26,163−175 (2000). | Article | PubMed | ISI | ChemPort |
Cox, N.J. et al. Loci on chromosomes 2 (NIDDM1) and 15 interact to increase susceptibility to diabetes in Mexican Americans. Nature Genet.21, 213−215 (1999). | Article | PubMed | ISI | ChemPort |
Ji, L. et al. New susceptibility locus for NIDDM is localized to human chromosome 20q. Diabetes46, 876−881 (1997). | PubMed | ISI | ChemPort |
Bowden, D.W. et al. Linkage of genetic markers on human chromosomes 20 and 12 to NIDDM in Caucasian sib pairs with a history of diabetic nephropathy. Diabetes46, 882−886 (1997). | PubMed | ISI | ChemPort |
Zouali, H. et al. A susceptibility locus for early-onset-insulin dependent (type 2) diabetes mellitus maps to chromosome 20q, proximal to the phosphoenolpyruvate carboxykinase gene. Hum. Mol. Genet.6, 1401−1408 (1997). | Article | PubMed | ISI | ChemPort |
Hanson, R.L. et al. An autosomal genomic scan for loci linked to type II diabetes mellitus and body-mass index in Pima Indians. Am. J. Hum. Genet.63, 1130−1138 (1998). | Article | PubMed | ISI | ChemPort |
Elbein, S.C., Hoffman, M.D., Teng, K., Leppert, M.F. & Hasstedt, S.J. A genome-wide search for type 2 diabetes susceptibility genes in Utah Caucasians. Diabetes48, 1175−1182 (1999). | PubMed | ISI | ChemPort |
Ghosh, S. et al. The Finland-United States investigation of non-insulin-dependent diabetes mellitus genetics (FUSION) study. I. An autosomal genome scan for genes that predispose to type 2 diabetes. Am. J. Hum. Genet.5, 1174−1185 (2000).
Watanabe, R.M. et al. The Finland-United States investigation of non-insulin-dependent diabetes mellitus genetics (FUSION) study. II. An autosomal genome scan for diabetes-related quantitative-trait loci. Am. J. Hum. Genet.5, 1186−1200 (2000).
Almind, K., Inoue, G., Pedersen, O. & Kahn, C.R. A common amino acid polymorphism in insulin receptor substrate-1 causes impaired insulin signaling. Evidence from transfection studies. J. Clin. Invest.97, 2569−2575 (1996). | PubMed | ISI | ChemPort |
Yen, C.J. et al. Molecular scanning of the human peroxisome proliferator activated receptor gamma (hPPAR ) gene in diabetic Caucasians: identification of a Pro12Ala PPAR 2 missense mutation. Biochem. Biophys. Res. Commun.241, 270−274 (1997). | Article | PubMed | ISI | ChemPort |
Altshuler, D. et al. The common PPAR Pro12Ala polymorphism is associated with decreased risk of type 2 diabetes. Nature Genet.26, 76−80 (2000). | Article | PubMed | ISI | ChemPort |
Hani, E.H. et al. Defective mutations in the insulin promoter factor-1 (IPF-1) gene in late-onset type 2 diabetes mellitus. J. Clin. Invest.104, R41−48 (1999). | PubMed | ISI | ChemPort |
The international SNP map working group. A map of human genome sequence variation containing 1.4 million single nucleotide polymorphisms. Nature409, 928−933 (2001). | Article | PubMed | ISI | ChemPort |
Weiss, K.M. & Terwilliger, J.D. How many diseases does it take to map a gene with SNPs? Nature Genet.26, 151−157, 2000 | Article | PubMed | ISI | ChemPort |
Kruglyak, L. Prospects for whole-genome linkage disequilibrium mapping of common disease genes. Nature Genet.22, 139−144 (1999). | Article | PubMed | ISI | ChemPort |
Mokdad, A.H. et al. Diabetes trends in the U.S.: 1990−1998. Diabetes Care9,1278−1283 (2000) and Diabetes Care10, 412 (2001).
Bruning, J.C. et al. Development of a novel polygenic model of NIDDM in mice heterozygous for IR and IRS-1 null alleles. Cell88, 561−572 (1997) | Article | PubMed | ISI | ChemPort |