Abstract
Genetic variation in the human genome is an emerging resource for studying cancer, a complex set of diseases characterised by both environmental and genetic contributions. The number of common germ-line variants is great, on the order of 10–15 million per person, and represents a remarkable opportunity to investigate the aetiology, interindividual differences in treatment response and outcomes of specific cancers. The study of genetic variation can elucidate critical determinants in environmental exposure and cancer, which could have future implications for preventive and early intervention strategies. However, we are in the initial stages of characterising the tools (i.e., the single-nucleotide polymorphism, SNP) to rigorously analyse the genetic contributions to complex diseases, such as cancer. If the promise of the genomic era is to be realised, we must integrate this information into new strategies for implementation in both public health measures and, most importantly, provision of individual cancer-related care.
Similar content being viewed by others
Genetic variation in the human genome
In the process of generating a draft sequence of the human genome, it has become clear that the extent of genetic variation is much larger than previously estimated (Lander et al, 2001; Venter et al, 2001). The most common sequence variation in the human genome is the stable substitution of a single base, the single-nucleotide polymorphism (SNP). By definition, a SNP has a minor allele frequency of greater than 1% in at least one population (Risch, 2000). Most SNPs are ‘silent’ and do not alter the function or expression of a gene. It makes sense to conceptually reserve the term ‘mutation’ for rare variants with a particularly high penetrance, usually associated with a detrimental phenotype, such as a classical monogenic disorder (e.g., sickle cell disease or haemophilia). For the purposes of this review, the focus will be on SNPs with low penetrance or no phenotypic effect.
The total number of SNPs in the human genome is estimated to be more than 10 million (Botstein and Risch, 2003) and the number of SNPs with a minor allele frequency over 10% is estimated to be perhaps as many as five million (Kruglyak and Nickerson, 2001). Single-nucleotide polymorphisms are distributed throughout the human genome, at an estimated overall frequency of at least on in every 1000 base pair (bp) (Carlson et al, 2003), but with marked regional differences. Single-nucleotide polymorphisms arise because of point mutations that are selectively maintained in populations. Single-nucleotide polymorphism frequencies are determined by: (1) the amount of time since the mutation occurred; (2) evolutionary pressure on biologically significant variants and those linked to the functional variant; (3) random genetic drift and (4) bottleneck events.
Single-nucleotide polymorphisms in the same chromosomal region are not inherited randomly, but as combinations of alleles, which form haplotype blocks. It appears that the genome is organised into distinct blocks of linkage disequilibrium (LD), intercepted by regions in which LD breaks down rapidly (Bonnen et al, 2002; Sabeti et al, 2002). Thus, the complexity of analysing SNPs in a gene or locus can be reduced by the analysis of markers inherited on a haplotype. Practically, haplotypes can be inferred by LD analysis of a region in unrelated subjects, or characterised molecularly in family pedigrees (see Figure 1). It is notable that both the frequency of SNPs and the extent of LD may vary significantly between populations. In addition, there are many population private variants (Carlson et al, 2003).
SNPs and phenotype
As the age of genomics proceeds forward in search of genetic variants (i.e. SNPs) that influence disease susceptibility and outcome, a great effort has been directed at picking SNPs for study. The promise of utilising biallelic SNPs for whole genome linkage studies, though very exciting in theory, is still some time off in the future because of issues related to the prohibitive cost and impracticality of genotyping the required thousands of SNPs, not to mention developing the databases and analytical tools needed to interpret the data (Kruglyak, 1999). Still, we can count on this approach in the future, but for now, because of limited resources and analytical tools, most investigators will continue to utilise a strategy that investigates specific genes, known as the candidate gene approach.
The candidate gene approach examines SNPs, chosen from genes that ‘make sense’. In other words, they fit a plausible understanding of the biology. Single-nucleotide polymorphisms can also be chosen from a region previously identified by linkage study or microarray expression analysis. Intense effort has focused on SNPs that alter protein function or gene expression. A hierarchy for predicting a possible phenotypic expression for a SNP has been proposed (Risch, 2000). It has been estimated that there are perhaps 50 000–250 000 SNPs which confer a biological effect, most of which are distributed in and around the 30 000 genes (Risch, 2001). Prediction of a biological effect is probably more complicated than once argued; for instance, a synonymous SNP, which does not alter the amino-acid sequence has been shown to influence the stability of the DRD2 transcript, resulting in alterated expression (Duan et al, 2003).
Since the majority of SNPs do not confer a phenotypic alteration, but lie on ancestral haplotypes, a critical distinction must be made between SNPs as genetic markers and causal factors associated with a phenotypic effect. To date, the literature has focused on SNPs that have a predicted or demonstrated functional effect. A candidate gene approach that includes haplotype-tagging SNPs investigates genetic variation across the gene or locus. Once a haplotype has been confirmed as a marker for a phenotype, it is necessary to analyse the component SNPs to determine the causal variants. In some cases, it might be necessary to study additional SNPs, to better define the haplotype structure in search of causal variants. Attempts to by-pass this and pick only ‘functionally’ important SNPs limits the opportunity to ‘mark’ a gene or region. Moreover, there is a certain amount of hubris to continue to expect that picking individual SNPs is an effective strategy to identify alterations in genes that could contribute to complex diseases, such as cancer. No doubt, preclinical studies will suggest which variants to study, but the technical and bioinformatics advances have already generated a rich resource for study, which includes many genes and SNPs, of which little is known.
Study design
Traditionally, two different methods have been applied to the study of genetic variants in human disease. In SNP and haplotype studies, the measure of effect is a change in risk, that is, attributable risk. The results indicate a nonrandom distribution of genetic variants between cases and controls. In general, a SNP finding is neither necessary nor sufficient for the complex disease under study. Firstly, in highly penetrant monogenic disorders, family linkage analysis has been successful in identifying many rare genetic disorders. So far, more than 1200 disease-causing genes have been identified (http://www.ncbi.nlm.nih.gov:80/entrez/query.fcgi?db=OMIM). For complex diseases, such as most cancers, the paucity of family pedigrees has limited the success of this strategy, especially in search of moderate or low penetrance genes (Risch, 2001). Even in instances in which rare germ-line mutations in TP53 or VHL genes have been mapped, penetrance is not complete (i.e., the presence of a mutation does not necessarily lead to cancer); this observation suggests that additional modifying genes as well as environmental factors contribute to cancer even in the high-risk setting. In studies of subjects with a monogenic disorder (i.e., cystic fibrosis or chronic granulomatous disease), it has been possible to identify modifying variants that influence the risk for well-defined phenotypes (Foster et al, 1998; Garred et al, 1999).
The second strategy, which is more commonly employed, is the genetic association study in unrelated subjects. Collection of larger data sets of cases and controls permits investigation of SNPs and haplotypes, which confer a moderate or low effect. Until recently, the primary approach utilised has been the case–control design, but, recently, many have advocated turning to population-based cohort studies. Traditionally, case–control studies have been designed to detect susceptibility loci and, rarely, protective alleles (Risch and Merikangas, 1996; Risch, 2000).
Analytical issues in SNP studies
Estimation of LD and construction of haplotypes represents a powerful tool for conducting association studies (Gabriel et al, 2002). Even if the extent of LD and the number of haplotypes vary across the genome and between populations, it appears that there is generally low haplotype diversity for each locus, with some differences observed in separate populations (Daly et al, 2001). Consequently, association studies can genotype a limited set of SNPs that contribute to common haplotypes, with haplotype-tagging SNPs (ht-SNPs) (Stram, 2003). Resequence analysis is required to capture sufficient diversity in populations of unrelated subjects, which can be analysed to infer haplotype structure across the locus (Bonnen et al, 2002). Analysis with ht-SNPs saves money and DNA, but requires new analytical tools to estimate the effect of individual haplotypes on both the main effect (i.e., susceptibility or outcome) and gene–gene interactions. Already several studies have demonstrated the utility of haplotype analysis; for instance, an association has been reported between haplotypes on chromosome 19 and basal cell carcinoma; similarly, haplotypes of the interleukin-4 promoter are associated with chronic disseminated candidiasis, a life-threatening infection in patients with acute leukaemia (Yin et al, 2002; Choi et al, 2003).
Association studies have been plagued by an inability to consistently yield reproducible results. Several factors contribute to this conundrum and are hotly debated among experts in the field. Since replication is critical for acceptance of the causal association between a SNP or haplotype and an outcome, it has been argued that false-positive associations are better tolerated than false negatives. Thus far, the problem has been that there have been too many false positives. Some have argued that the execution of a large number of tests, by definition, contributes to the high rate of false positives, but others argue that the variables are not independent, certainly not when considering haplotypes (Colhoun et al, 2003; Lohmueller et al, 2003). Another factor for lacking reproducibility is inadequately powered studies, both in the initial and follow-up studies. Admixture of populations has been advanced as a confounding factor, though the effect has been demonstrated to be less than anticipated in published examples (Wacholder et al, 2000). Still, genetic drift coupled with the rate and genomic distribution of recombination and mutation events in populations under different pressures will affect LD and, thus, the potential to discover a given association (Frisse et al, 2001; Colhoun et al, 2003; Lohmueller et al, 2003). Currently, alternative remedies have been advanced that address novel statistical analysis, Bayesian approaches and meta-analyses of published studies. It is anticipated that current analytical tools will rapidly evolve in response to revisitation of data sets.
SNPs in cancer research
Genetic association studies with SNPs targeting cancer can be divided into two broad categories, investigation of susceptibility and of outcomes (see Table 1). The latter seeks to determine prognostic information for survival, complications or response to pharmacological intervention (i.e., pharmacogenomics). To date, nearly all published studies have examined at most a few SNPs or genes, and on a rare occasion variants within genes of a pathway or related biological process, such as DNA repair enzymes (i.e., XRCC1 and XRCC3) or xenobiotic metabolism genes (i.e., NAT1 and NAT2). Though early in the study of SNPs and cancer, technical and bioinformatics advances make it possible to dramatically increase the number of genes applied to a study. It is important to bear in mind that SNP studies require replication prior to acceptance, and certainly before clinical implementation. Therefore, it is critical to consider the current published literature as very preliminary analysis of what will certainly be a complex process.
Susceptibility to cancer
The aetiology of a specific cancer is probably associated with a set of genetic variants, many of which could adversely interact with environmental factors. So far, the initial studies of single genes have established a paradigm that will eventually examine gene–gene interactions. Studies in lung cancer have interrogated genes important for tobacco metabolism and nicotine addiction. The approach focuses on the interaction of a strong environmental carcinogen, and seeks to identify genetic variants that confer susceptibility or protection from tobacco smoke. In this regard, the gene–environment interaction represents a stress on the host, and perhaps intensifies the phenotypic effect of the ‘causal’ SNP. For instance, the myeloperoxidase gene, MPO, has been extensively studied, yielding moderately reproducible data in studies with Caucasians. The functional consequence of the G to A transition at −463 of the proximal promoter leads to reduced MPO mRNA expression. Individuals homozygous for the A allele have a significantly lower risk for developing lung cancer compared to individuals with two G alleles (London et al, 1997; Cascorbi et al, 2000; Le Marchand et al, 2000).
It is notable that, in one context, the SNP or haplotype can be protective, whereas, in another, confer increased susceptibility. For example, one of the strongest cases can be made for the association of polymorphisms in N-acetyltransferase 2 (NAT2) with bladder and colon cancer. NAT1 and NAT2 encode enzymes that are important for biotransformation of aromatic and heterocyclic amines, known carcinogens. The risk of developing urinary bladder cancer is particularly high in the slowest NAT2 acetylator phenotype, and is exacerbated by the history of smoking. On the other hand, for heterocyclic amine-related colon cancer, NAT2 rapid acetylator phenotype confers a higher risk (Golka et al, 2002; Hein, 2002).
Outcome and SNPs
Variants can be associated with outcome and, thus, could be applied to clinical decision making. For instance, genetic variants could alter the risk for metastatic or aggressive tumour. To date, few studies have unequivocally shown the importance of germ-line variants as prognostic markers, but, since a tiny percentage of the known genes have been adequately studied, investigation of SNPs remains active. So far, preliminary results suggest that SNPs in CYP3A4 are associated with long-term outcome in prostate cancer. The promoter SNP A-290G in CYP3A4, a gene involved in the oxidation of testosterone to 2B-, 6B, or 15B-hydroxytestosterone, appears to be associated with the severity of disease, as measured by TNM stage and Gleason grade. The effect is stronger in older men without a family history of prostate cancer (Rebbeck et al, 1998; Paris et al, 1999).
Pharmacogenomics is the study of the inherited basis of interindividual differences in drug response. It has been estimated that inherited differences account for inter individual variation observed in drug response (Kalow et al, 1998). There have been two parallel approaches in pharmacogenomics. One approach has been the search for genetic variants that are associated with severe adverse effects, which, in turn, can be used to screen for individuals who should not receive the drug in question. The second approach has focused on identifying markers that predict drug efficacy. The former has been widely embraced by the commercial sector, whereas the latter has been more tepidly pursued. Nevertheless, the promise of pharmacogenomics is that it could lead to tailored drug therapy, which some have dubbed as ‘individualised medicine.’ The harsh reality is that the choice facing an individual will have to be made on the basis of risk assessment gathered from large, population-based studies.
Candidate genes have focused on drug metabolism, that is, uptake, activation, degradation and excretion, to identify SNPs that are associated with life-threatening adverse reactions. Armed with validated SNPs and haplotype markers that associate with adverse drug reactions, clinicians could perform screening tests and, if indicated, choose alternate therapy. However, application to clinical medicine will be daunting, since many SNPs will identify risk factors that have only moderate-to-low penetrance. In other words, the clinician will be faced with assessing the overall risk for adverse effects, which will have to be weighed against potential benefits as well as the availability of alternative therapies.
A seminal example of the importance of pharmacogenomics is the story of acute childhood leukaemia and variants in the thiopurine S-methyltransferase (TPMT) gene. The protein encoded by the TPMT gene is a catalyst for S-methylation of thiopurines, commonly used in the treatment of haematopoietic malignancies and autoimmune disorders (e.g., azathiprine, 6-mercaptopurine and thioguanine). Thiopurine drugs are activated to thioguanine nucleotides, which are cytotoxic. Cytotoxicity can be intended (e.g., antileukaemia therapy), but if prolonged, life-threatening. It was discovered that several rare variants in the TPMT gene correlated with decreased activity; homozygous individuals suffered substantial haematopoietic toxicity (Krynetski et al, 1996). Recently, it has been reported that children with inactivating TPMT variants are at a greater risk for relapse, perhaps reflecting inadequate administration of thiopurines (Black et al, 1998). Pediatric leukaemia patients with TPMT variants, who receive cranial irradiation, have a greater likelihood for secondary brain tumours.
The search for markers useful for predicting drug efficacy has focused on the cytochrome P-450 system and, specifically, the CYP2D6 gene. This gene contributes to the metabolism of many anticancer agents. Common SNPs in CYP2D6 impair the activity of CYP2D6 and perhaps alter the pharmacokinetics of anticancer drugs (Kroemer and Eichelbaum, 1995). In some circumstances, the impaired CYP2D6 activity actually enhances the drug effect, but perhaps at the expense of enhanced toxicity.
Epigenetic changes and genetic variation
Genomic instability has been characterised in many human cancers and its signature is the pattern of sequence alterations, frequently resulting in allelic imbalance. Until recently, efforts to capture global patterns of genomic imbalance have employed microsatellites, but the utility of dense SNP markers has been demonstrated in proof-of-principle studies. Global patterns of genomic imbalance can be detected by allelotyping of cancers, and point to regions where allelic imbalance could contribute to cancer. Since loss of heterozygosity (LOH) within one or more chromosomal region is a common form of allelic imbalance, sites of LOH can be investigated for the presence of tumour-suppressor genes. Studies in bladder, lung and prostate cancers have discovered previously unknown allelic imbalances in multiple sites using SNPs for LOH analysis (Lindblad-Toh et al, 2000). It is likely that patterns of LOH, through SNP analysis, could have diagnostic and prognostic implications. Specific LOH pattern could be correlated with expression array profiles to identify causal variants.
The future of SNPs and haplotypes in cancer
The analysis of SNPs and haplotypes in cancer research has pleotropic implications for clinical and public health issues, as well as cancer biology. New findings can lead to targeted therapies for cancer intervention and prevention. The initial excitement of applying genetic variation to cancer has yielded to the realisation that a complex set of challenges lie ahead, which will tax the scientific, clinical and social adaptation of the new era of genomic medicine. Already, there are examples in pharmacogenomics and cancer susceptibility that have established the importance of this field, but it must be emphasised that they represent the beginning. The next decade could witness the investigation and implementation of genetic variation in the diagnosis, prevention and treatment of cancer.
The promise of SNP and haplotype analysis is that it will yield insights into exposure and cancer, and specifically lay the foundation for primary preventive strategies pertaining to lifestyle (i.e., alterations in diet, exercise and weight control) and chemoprevention. In turn, secondary prevention will address screening and public health measures designed to identify and intervene in high-risk individuals. The latter will become more complicated as we integrate information about pharmacogenomics, second malignancies and long-term complications. This is particularly complex because the consequences of one set of SNPs could have deleterious consequences for one outcome, but yet be advantageous for another.
A major challenge ahead is the development of more sophisticated analytical tools for handling the expanding data. We will need to address the complex issue of gene–gene interactions, particularly if profiles of SNPs are to be used in the clinical venue. At the same time, it is imperative to develop structures to include more individuals in population-based studies, yet preserve both confidentiality and individuality. In many respects, this is the most daunting challenge, to derive the information to interpret the significance of specific SNPs and haplotypes from population-based studies, yet apply the information to the individual. It could have a significant impact on the economics of medical practice.
In the end, the clinical application of what we learn from SNP studies will probably require a major shift in the paradigm of practice. Practitioners could make treatment decisions or give advice based on an assessment of risk. To begin to address the central issues of SNPs, haplotypes and cancer, detailed knowledge of the scope and pattern of genetic variation will be required, namely SNPs, haplotypes and linkage disequilibrium for genes and pathways critical to cancer in different populations.
Change history
16 November 2011
This paper was modified 12 months after initial publication to switch to Creative Commons licence terms, as noted at publication
References
Black AJ, McLeod HL, Capell HA, Powrie RH, Matowe LK, Pritchard SC, Collie-Duguid ES, Reid DM (1998) Thiopurine methyltransferase genotype predicts therapy-limiting severe toxicity from azathioprine. Ann Intern Med 129: 716–718
Bonnen PE, Wang PJ, Kimmel M, Chakraborty R, Nelson DL (2002) Haplotype and linkage disequilibrium architecture for human cancer-associated genes. Genome Res 12: 1846–1853
Botstein D, Risch N (2003) Discovering genotypes underlying human phenotypes: past successes for Mendelian disease, future approaches for complex disease. Nat Genet 33 (Suppl): 228–237
Carlson CS, Eberle MA, Rieder MJ, Smith JD, Kruglyak L, Nickerson DA (2003) Additional SNPs and linkage-disequilibrium analyses are necessary for whole-genome association studies in humans. Nat Genet 33: 518–521
Cascorbi I, Henning S, Brockmoller J, Gephart J, Meisel C, Muller JM, Loddenkemper R, Roots I (2000) Substantially reduced risk of cancer of the aerodigestive tract in subjects with variant-463A of the myeloperoxidase gene. Cancer Res 60: 644–649
Choi EH, Foster CB, Taylor JG, Erichsen HC, Chen RA, Walsh TJ, Anttila VJ, Ruutu T, Palotie A, Chanock SJ (2003) Association between chronic disseminated candidiasis in adult acute leukemia and common IL4 promoter haplotypes. J Infect Dis 187: 1153–1156
Colhoun HM, McKeigue PM, Davey Smith G (2003) Problems of reporting genetic associations with complex outcomes. Lancet 361: 865–872
Daly MJ, Rioux JD, Schaffner SF, Hudson TJ, Lander ES (2001) High-resolution haplotype structure in the human genome. Nat Genet 29: 229–232
Duan J, Wainwright MS, Comeron JM, Saitou N, Sanders AR, Gelernter J, Gejman PV (2003) Synonymous mutations in the human dopamine receptor D2 (DRD2) affect mRNA stability and synthesis of the receptor. Hum Mol Genet 12: 205–216
Foster CB, Lehrnbecher T, Mol F, Steinberg SM, Venzon DJ, Walsh TJ, Noack D, Rae J, Winkelstein JA, Curnutte JT, Chanock SJ (1998) Host defense molecule polymorphisms influence the risk for immune-mediated complications in chronic granulomatous disease. J Clin Invest 102: 2146–2155
Frisse L, Hudson RR, Bartoszewicz A, Wall JD, Donfack J, Di Rienzo A (2001) Gene conversion and different population histories may explain the contrast between polymorphism and linkage disequilibrium levels. Am J Hum Genet 69: 831–843
Gabriel SB, Schaffner SF, Nguyen H, Moore JM, Roy J, Blumenstiel B, Higgins J, DeFelice M, Lochner A, Faggart M, Liu-Cordero SN, Rotimi C, Adeyemo A, Cooper R, Ward R, Lander ES, Daly MJ, Altshuler D (2002) The structure of haplotype blocks in the human genome. Science 296: 2225–2229
Garred P, Pressler T, Madsen HO, Frederiksen B, Svejgaard A, Hoiby N, Schwartz M, Koch C (1999) Association of mannose-binding lectin gene heterogeneity with severity of lung disease and survival in cystic fibrosis. J Clin Invest 104: 431–437
Golka K, Prior V, Blaszkewicz M, Bolt HM (2002) The enhanced bladder cancer susceptibility of NAT2 slow acetylators towards aromatic amines: a review considering ethnic differences. Toxicol Lett 128: 229–241
Hein DW (2002) Molecular genetics and function of NAT1 and NAT2: role in aromatic amine metabolism and carcinogenesis. Mutat Res 506–507: 65–77
Kalow W, Tang BK, Endrenyi L (1998) Hypothesis: comparisons of inter- and intra-individual variations can substitute for twin studies in drug research. Pharmacogenetics 8: 283–289
Kroemer HK, Eichelbaum M (1995) ‘Its the genes, stupid’. Molecular bases and clinical consequences of genetic cytochrome P450 2D6 polymorphism. Life Sci 56: 2285–2298
Kruglyak L (1999) Prospects for whole-genome linkage disequilibrium mapping of common disease genes. Nat Genet 22: 139–144
Kruglyak L, Nickerson DA (2001) Variation is the spice of life. Nat Genet 27: 234–236
Krynetski EY, Tai HL, Yates CR, Fessing MY, Loennechen T, Schuetz JD, Relling MV, Evans WE (1996) Genetic polymorphism of thiopurine S-methyltransferase: clinical importance and molecular mechanisms. Pharmacogenetics 6: 279–290
Lander ES, Linton LM, Birren B, Nusbaum C, Zody MC, Baldwin J, Devon K, Dewar K, Doyle M, FitzHugh W, Funke R, Gage D, Harris K, Heaford A, Howland J, Kann L, Lehoczky J, LeVine R, McEwan P, McKernan K, Meldrim J, Mesirov JP, Miranda C, Morris W, Naylor J, Raymond C, Rosetti M, Santos R, Sheridan A, Sougnez C, Stange-Thomann N, Stojanovic N, Subramanian A, Wyman D, Rogers J, Sulston J, Ainscough R, Beck S, Bentley D, Burton J, Clee C, Carter N, Coulson A, Deadman R, Deloukas P, Dunham A, Dunham I, Durbin R, French L, Grafham D, Gregory S, Hubbard T, Humphray S, Hunt A, Jones M, Lloyd C, McMurray A, Matthews L, Mercer S, Milne S, Mullikin JC, Mungall A, Plumb R, Ross M, Shownkeen R, Sims S, Waterston RH, Wilson RK, Hillier LW, McPherson JD, Marra MA, Mardis ER, Fulton LA, Chinwalla AT, Pepin KH, Gish WR, Chissoe SL, Wendl MC, Delehaunty KD, Miner TL, Delehaunty A, Kramer JB, Cook LL, Fulton RS, Johnson DL, Minx PJ, Clifton SW, Hawkins T, Branscomb E, Predki P, Richardson P, Wenning S, Slezak T, Doggett N, Cheng JF, Olsen A, Lucas S, Elkin C, Uberbacher E, Frazier M, Gibbs RA, Muzny DM, Scherer SE, Bouck JB, Sodergren EJ, Worley KC, Rives CM, Gorrell JH, Metzker ML, Naylor SL, Kucherlapati RS, Nelson DL, Weinstock GM, Sakaki Y, Fujiyama A, Hattori M, Yada T, Toyoda A, Itoh T, Kawagoe C, Watanabe H, Totoki Y, Taylor T, Weissenbach J, Heilig R, Saurin W, Artiguenave F, Brottier P, Bruls T, Pelletier E, Robert C, Wincker P, Smith DR, Doucette-Stamm L, Rubenfield M, Weinstock K, Lee HM, Dubois J, Rosenthal A, Platzer M, Nyakatura G, Taudien S, Rump A, Yang H, Yu J, Wang J, Huang G, Gu J, Hood L, Rowen L, Madan A, Qin S, Davis RW, Federspiel NA, Abola AP, Proctor MJ, Myers RM, Schmutz J, Dickson M, Grimwood J, Cox DR, Olson MV, Kaul R, Raymond C, Shimizu N, Kawasaki K, Minoshima S, Evans GA, Athanasiou M, Schultz R, Roe BA, Chen F, Pan H, Ramser J, Lehrach H, Reinhardt R, McCombie WR, de la Bastide M, Dedhia N, Blocker H, Hornischer K, Nordsiek G, Agarwala R, Aravind L, Bailey JA, Bateman A, Batzoglou S, Birney E, Bork P, Brown DG, Burge CB, Cerutti L, Chen HC, Church D, Clamp M, Copley RR, Doerks T, Eddy SR, Eichler EE, Furey TS, Galagan J, Gilbert JG, Harmon C, Hayashizaki Y, Haussler D, Hermjackob H, Hokamp K, Jang W, Johnson LS, Jones TA, Kasif S, Kaspryzk A, Kennedy S, Kent WJ, Kitts P, Koonin EV, Korf I, Kulp D, Lancet D, Lowe TM, McLysaght A, Mikkelsen T, Moran JV, Mulder N, Pollara VJ, Ponting CP, Schuler G, Schultz J, Slater G, Smit AF, Stupka E, Szustakowski J, Thierry-Mieg D, Thierry-Mieg J, Wagner L, Wallis J, Wheeler R, Williams A, Wolf YI, Wolfe KH, Yang SP, Yeh RF, Colllins F, Guyer MS, Peterson J, Felsenfeld A, Wetterstrand KA, Patrinos A, Morgan MJ, Szustakowki J, de Jong P, Catanese JJ, Osoegawa K, Shizuya H, Choi S, Chen YJ (2001) Initial sequencing and analysis of the human genome. Nature 409: 860–921
Le Marchand L, Seifried A, Lum A, Wilkens LR (2000) Association of the myeloperoxidase −463G → a polymorphism with lung cancer risk. Cancer Epidemiol Biomarkers Prev 9: 181–184
Lindblad-Toh K, Tanenbaum DM, Daly MJ, Winchester E, Lui WO, Villapakkam A, Stanton SE, Larsson C, Hudson TJ, Johnson BE, Lander ES, Meyerson M (2000) Loss-of-heterozygosity analysis of small-cell lung carcinomas using single-nucleotide polymorphism arrays. Nat Biotechnol 18: 1001–1005
Lohmueller KE, Pearce CL, Pike M, Lander ES, Hirschhorn JN (2003) Meta-analysis of genetic association studies supports a contribution of common variants to susceptibility to common disease. Nat Genet 33: 177–182
London SJ, Lehman TA, Taylor JA (1997) Myeloperoxidase genetic polymorphism and lung cancer risk. Cancer Res 57: 5001–5003
Paris PL, Kupelian PA, Hall JM, Williams TL, Levin H, Klein EA, Casey G, Witte JS (1999) Association between a CYP3A4 genetic variant and clinical presentation in African-American prostate cancer patients. Cancer Epidemiol Biomarkers Prev 8: 901–905
Rebbeck TR, Jaffe JM, Walker AH, Wein AJ, Malkowicz SB (1998) Modification of clinical presentation of prostate tumors by a novel genetic variant in CYP3A4. J Natl Cancer Inst 90: 1225–1229
Risch N (2001) The genetic epidemiology of cancer: interpreting family and twin studies and their implications for molecular genetic approaches. Cancer Epidemiol Biomarkers Prev 10: 733–741
Risch N, Merikangas K (1996) The future of genetic studies of complex human diseases. Science 273: 1516–1517
Risch NJ (2000) Searching for genetic determinants in the new millennium. Nature 405: 847–856
Sabeti PC, Reich DE, Higgins JM, Levine HZ, Richter DJ, Schaffner SF, Gabriel SB, Platko JV, Patterson NJ, McDonald GJ, Ackerman HC, Campbell SJ, Altshuler D, Cooper R, Kwiatkowski D, Ward R, Lander ES (2002) Detecting recent positive selection in the human genome from haplotype structure. Nature 419: 832–837
Stram DO, Leigh Pearce C, Bretsky P, Freedman M, Hirschhorn JN, Altshuler D, Kolonel LN, Henderson BE, Thomas DC (2003) Modeling and E-M estimation of haplotype-specific relative risks from genotype data for a case-control study of unrelated individuals. Hum Hered 55: 179–190
Wacholder S, Rothman N, Caporaso N (2000) Population stratification in epidemiologic studies of common genetic variants and cancer: quantification of bias. J Natl Cancer Inst 92: 1151–1158
Wacholder S, Rothman N, Caporaso N (2000) Population stratification in epidemiologic studies of common genetic variants and cancer: quantification of bias. J Natl Cancer Inst 92: 1151–1158
Yin J, Rockenbauer E, Hedayati M, Jacobsen NR, Vogel U, Grossman L, Bolund L, Nexo BA (2002) Multiple single nucleotide polymorphisms on human chromosome 19q13.2-3 associate with risk of Basal cell carcinoma. Cancer Epidemiol Biomarkers Prev 11: 1449–1453
Author information
Authors and Affiliations
Corresponding author
Rights and permissions
From twelve months after its original publication, this work is licensed under the Creative Commons Attribution-NonCommercial-Share Alike 3.0 Unported License. To view a copy of this license, visit http://creativecommons.org/licenses/by-nc-sa/3.0/
About this article
Cite this article
Erichsen, H., Chanock, S. SNPs in cancer research and treatment. Br J Cancer 90, 747–751 (2004). https://doi.org/10.1038/sj.bjc.6601574
Received:
Revised:
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1038/sj.bjc.6601574
Keywords
This article is cited by
-
Effect of SMAD4 gene polymorphism on breast cancer risk in Bangladeshi women
Beni-Suef University Journal of Basic and Applied Sciences (2023)
-
Sensitive electrochemical detection of polymorphisms in IL6 and TGFβ1 genes from ovarian cancer DNA patients using EcoRI and DNA hairpin-modified gold electrodes
Microchimica Acta (2023)
-
SNP+ to predict dropout rates in SNP arrays
Conservation Genetics Resources (2023)
-
Contribution of 3D genome topological domains to genetic risk of cancers: a genome-wide computational study
Human Genomics (2022)
-
Contributions of HOTAIR polymorphisms to the susceptibility of cancer
International Journal of Clinical Oncology (2021)