Main

With the mapping and sequencing of the human genome now complete, both medical geneticists1,2 and their critics3,4 expect that the practice of public health is likely to become “geneticized” such that information about genetic risks will be more widely available. Many medical geneticists expect that human genome research will reveal information about individual susceptibility to diseases so that people can learn of their risks for future illness, and thereby be enabled to practice more effective health maintenance and disease prevention.2 Public health professionals are increasingly using more genetic information to target interventions of behavioral and environmental factors that lead to many diseases.1 Given that this dramatic increase in information regarding genetically-induced illness brings with it numerous social, psychological, and ethical challenges,5 it is essential, at the least, that such genetic information is conveyed from professionals to lay individuals in an accurate manner. However, communicating about genetic information with the lay public in a fashion to achieve these goals is likely to be challenging.

Researchers have argued that the increasing mass media coverage of genetic research might promote biological determinism, thus leading to negative outcomes such as fatalism, discrimination, and lower likelihood that individuals might adopt preventive health behaviors,3,4,611 or even lead to poorer health outcomes.1214

Research also suggests that lay people are less likely to believe that a condition caused by “genetics” is amenable to treatment.15,16 In an experimental study conducted by Senior et al.,17 for example, participants were given messages that either attributed a prediction of future heart disease to the outcome of a genetic test or to that of an unspecified test. Those who were told that the results were based on a genetic test saw the disease as less preventable.

The tendency to perceive genetic causation as inconsistent with effective behavioral prevention coexists with a tendency of lay people to understand genes and the set of other factors generally grouped as “environment” to be separate from each other, rather than as interacting.18,19 Although the details of how lay people interpret this model of separate causation are not yet clearly documented, for at least some people, seeing genes and behaviors as separate may mean that they believe that when risk-increasing versions of genes are present, genes trump behaviors. This interpretation seems likely for the approximately one-quarter to one-third of the population who hold highly genetically deterministic views of heart disease20 and other common conditions.21 For such individuals, behavioral changes such as healthier diet or exercise may not be perceived as effective preventative strategies for “genetic” diseases.

As researchers have pointed out, this “separate causation” model is inaccurate.19,22 A common spectrum of disease causation ranges from completely genetic to completely environmental, and in most cases, the genetic factors and environmental factors interact synergistically. Thus, at one level of environmental exposure, allele#1 may produce a higher level of risk than allele#2, whereas at another level of environmental exposure, the risk levels may be equivalent or even reversed. As Khoury et al.19 point out, the separate causation conceptualization is often evident in the form of a pie chart used to summarize and present information about causes of a specific condition that add up to 100%.19 However, stating that some condition is 60% genetic and 40% environmental is simply misleading, because most if not all human diseases result from the interaction between genetic susceptibility and environmental factors. We call the model implicit in the pie chart an “additive” model. The additive model has been replaced in scientific research by the interactive model at the general level, and is in the process of being replaced in concrete characterizations in a wide range of disease-specific research.2325 It is desirable to understand the barriers that seem to be preventing lay people's adoption of these interaction models. The diffusion of innovation model26 suggests that new technologies and knowledge would spread through society with an S-shaped curve. The early adopters (10–25% of the society members) select the innovation first, followed by relatively rapid adoptions by the majority, until it becomes common practice.26 Regarding accurate information on gene-environment relationships, medical professionals, researchers, and institutions are the early adopters and lead the way in learning the vast amount of genetic information. They are also likely to be one set of opinion leaders27 who have the potential to channel information to the public.

There is empirical evidence that websites have become a key channel for diffusing innovations from medical experts to the lay public. According to data from the Health Information National Trends Survey,28 in 2003, 50.7% of respondents reported looking for health or medical information on the internet for themselves. In 2005, the percentage increased to 58.4%. In 2003, 45.8% of respondents reported looking for health or medical information on the internet for others, and in 2005, 59.5% of respondents reported so. When asked where they actually went for health information in 2003, 48.6% indicated they searched online first, with only 10.9% contacting their physicians first.29 Therefore, it is plausible that lay individuals seek and acquire information on genetic information from such websites. Although these are unlikely to be the sole sources of lay understanding of gene-environment relationships, an assessment of the content of these websites should allow us to offer recommendations to these institutions on how to communicate genetic information better, to avoid increasing deterministic or fatalistic beliefs among lay individuals. The goal of this article is therefore to examine the disease prevention discourse addressed to the public on the web by health organizations such as National Institute of Health, Center for Disease Control and Prevention, and WebMD (table available online only), using web content analysis.

The focus of the sample of websites used for this analysis will be disease prevention and treatment, because web site creators seem to assume that lay people go to the web looking for this information, rather than going primarily searching for causal information. However, most of these sites describe causal and risk factors as well as prevention and cure recommendations. Consequently, these are the primary sites that represent causes of disease to lay people. The pairing of causal and prevention information gains support from the developing theories that show a link between lay people's understanding of health causation and their sense of appropriate cures.9,30 Preventative or curative prescriptions are in many cases most credible or comprehensible when the prescription is consonant with the lay person's understanding of the cause or mechanism of the disease.15,17,31,32 For example, if lack of exercise is perceived as a cause of an illness, more exercise will be perceived as a credible way of getting better. In contrast, if genetics is perceived as a cause of heart disease, then neither exercise nor dietary factors will be perceived as helpful. As a result, genetic risk provision may influence not only causal attributions, but also perceptions of disease preventability and self-efficacy of engaging in preventive health behaviors.17

It is effectively impossible to analyze every disease given the variety and complexity of medical conditions. Four diseases thus were selected for this study. Because of the prevalence, seriousness, and potential for prevention of heart disease and diabetes, these two conditions were included in the analysis. Because lung cancer is the leading cause of cancer-related death in the United States33 and is widely perceived as a serious disease but under greater volitional control than heart disease and diabetes (because of smoking behavior21), it was also included to expand the potential range of understandings of gene-environment relationships. It would be desirable to understand the breadth of the range of lay people's understanding of gene-behavior interaction. Given research showing that lay people harbor a “mind-body” dualism that assigns greater causative roles to genetics for conditions perceived as “physical” and lesser for conditions perceived as “mental,”34 it was deemed important to include depression, which is the mental condition that receives the most widespread attention on the web.35,36

METHODS

Retrieval of online recommendations

Web content analysis will be used as the main methodology for this study. Content analysis uses human coders to assign statements to a set of categories that indicate the themes of interest. It employs intercoder reliability checks to achieve a systematic, objective, and quantitative method for studying the content of communication.37 As a form of the new media, the World Wide Web serves similar communication functions as other mass media, but also opens additional realms for content analytic research because of its interactivity, decentralization, and global reach.38

A two-step online search was conducted to develop the sample of online medical recommendations. First, links were obtained from the website of the American Public Health Association39 with focus on four diseases: heart disease, diabetes, lung cancer, and depression. Second, the investigators also used several popular search engines such as Google, MSN, Ask, and Yahoo to find additional websites, using the key words “heart disease prevention,” “prevent heart disease,” “risk factors of heart disease,” “heart disease risks,” “causes of heart disease,” “heart disease causes.” The other three diseases are substituted in the key words to generate relevant results. The links of relevant results from each search engine were retained and their contents were retrieved for analysis. Then the investigators examined the webpage for prevention recommendations, and information about causes and risk factors for diseases. News stories and recommendations from unidentified sources were excluded from the analysis. In total 73 unique messages on 71 webpages from 46 sponsor health institutions, foundations, and companies were retrieved for content analysis. The table available online only presents the links to, and summary of the recommendations. All websites analyzed were saved as webpage complete (. htm) files to a local hard disk and archived in August 2007, so they may be viewed as they appeared when initially accessed. Given that a website changes constantly, this is a crucial step in web content analysis.38

Content analysis of medical recommendations

The coding scheme

A coding scheme is the set of all coding categories applied to a collection of texts, in which a “coding category” identifies the extant options different messages employ within a particular content domain.40 The coding scheme for this study was adapted from a content analytic scheme developed from two interview studies (n = 50, n = 46) that collected qualitative data on lay perceptions of the roles of genes and the environment in determining health status.41 The coding scheme included categories such as disease type (e.g., if the message talks about heart disease versus diabetes versus lung cancer versus depression); message purpose (e.g., prevention or causes/risk factors), and causality (gene versus behavior versus both),. In contrast to machine based coding that uses fixed dictionaries that assign a set of words to each category, the use of human coders enables judgments about the meaning of words in context. It is therefore not possible to specify in advance all of the words included in each category, however, e.g., such key terms as “gene,” “genetics,” “family history” would in most cases be coded as a focus on genes, whereas notions of “race” based on human genetic variation might also be coded as a gene focus, because lay people tend to interpret race as having a genetic component42; whereas such key words as “exercise,” “diet,” “drinking,” “smoking” would generally be coded as an emphasis on the environment.

The text was further coded with regard to the relationship between genes and environment in determining health status: A “behavior only model” attributes one's health condition/disease to behavioral factors only and/or recommends lifestyle change as preventive measures. A “gene only model” considers only genes as causes and risk factors of health conditions and fails to offer any behavioral recommendations. An “additive model” considers both genes and behaviors as factors that determine one's health conditions; however, these two forces are viewed as independent of each other; therefore, their impact on one's health is additive. A “trigger” model is an interactive one, but it is “all or nothing” or in a compressed time span. The genes have no impact on one's health unless they are activated by the effect of a particular behavior or environmental component. The trigger model may also imply a compressed time span such that the disease occurs immediately upon exposure to the “trigger.” In what we labeled as the “interactive” model, both genes and behaviors are important factors to one's health; moreover, their impact is synergistic through time, that is, the impact of genes on health varies on different levels of behavior or vice versa. If a recommendation suggests a genetic test or checking for family history, it would generally be coded as genetic risk and the coders would then analyze the gene-environment relationships in the text.

Intercoder reliability

Following the coding scheme, two coders coded the text for 40 of the 73 recommendations. The intercoder reliability (Krieppendorff α43) was assessed upon this proportion of the data using the Concord package in the open source software package R. Krippendorff α ranged from 0.81 to 1 for all variables. They were all above the minimal requirement of 0.80 suggested by Riffeet al.44 Differences between the coders were resolved through discussion. One of the coders then coded the rest of the data.

RESULTS

Presence of genetic information

Thirty-eight of the total 73 messages presented genetic information that was denoted by the phrase/word of “gene,” “genetics,” “family history,” “race,” or some combination of the three. Direct words such as “gene” or “genetic” appeared mostly in messages about depression, whereas indirect words such as “family history” appeared comparatively often in diabetes messages. Race or ethnicity information appeared mostly in diabetes information as well. Table 1 presents the frequencies of the gene theme across the health conditions of heart disease, diabetes, lung cancer, and depression.

Table 1 Presence of genetic information across diseases

Presence of environmental information

Behavior regulation (e.g., diet, exercise, drinking, and smoking), social facilitation (e.g., family and friends relationships, and socio-economic status), and physical environment (e.g., air, working conditions, and living surroundings) are regarded as three distinctive types of nongenetic factors which we grouped as global “environment.” Each of the web pages features at least one of these three elements of the environment theme. A closer inspection showed that behavioral factors were mentioned in almost all messages (in 72 of the 73) and 19 of 73 mentioned social facilitation and 23 mentioned a physical environmental factor (Table 2). Messages about heart disease and diabetes predominantly discuss behavioral factors as the nongenetic component. Social facilitation, however, was much more discussed regarding depression compared with other diseases. Environmental factors, in contrast, were mentioned much more in messages dealing with lung cancer compared with other diseases. Whereas for heart disease and diabetes, most messages only discussed one type of nongenetic factor, for lung cancer two types of nongenetic factors were always mentioned (behavior and physical environment); and for depression, two or three types were mentioned (especially behavior and social facilitation).

Table 2 Presence of environmental information across diseases

Co-presence of gene and environment themes

In 38 of the total 73 messages, both the gene theme and environment theme are present; 35 only mention environment. The gene theme does not appear in any one of the online recommendations by itself. It is always accompanied by some forms of discussion of behavioral factors.

Relationship of gene and environment

Thirty-five of the total 73 messages depict a behavior only model. These articles focus on the behavior theme only and do not mention genetics at all. An example of such recommendation follows45:

  • Tobacco use is related to one-third of all cancer and 80% of all lung cancer. The best way to prevent lung cancer is not to smoke. If you already smoke, you should try to quit. The longer you use tobacco, the greater your risk becomes. Research shows that the chance of developing lung cancer decreases once you quit and continue to abstain from tobacco for a long period of time. You should also avoid breathing in other people's smoke. Nonsmokers exposed to secondhand smoke are at an increased risk for developing lung cancer. A nonsmoker living with a smoker has a 25% increased risk for lung cancer. Never expose children to secondhand smoke. Prolonged exposure can interfere with normal lung development and increase their risk of developing respiratory illnesses such as asthma. In the workplace, follow work and safety guidelines to reduce your exposure to hazardous chemicals and second-hand smoke.

  • A good diet with lots of fruits and vegetables may also help prevent lung cancer.

Among the thirty-eight messages that contain both the gene theme and the environment theme, 34 present an additive model, in which both genetic and behavioral factors contribute to or add to a person's chance of getting a disease, but there is no indication that these factors interact with each other. This is an example46:

A person with some or all of the following listed health risk factors may never develop type 2 diabetes. However, the latest medical findings show that the chances of getting type 2 diabetes mellitus increase the more health risk factors you have.

  • A family history of diabetes. If a parent or sibling in your family has diabetes, your risk of developing type 2 diabetes increases.

  • Age over 45. The chance of getting type 2 diabetes increases with age.

  • Race or ethnic background. The risk of type 2 diabetes is greater in Hispanics, blacks, Native Americans, and Asians.

  • Metabolic syndrome (also called insulin resistance syndrome).

  • Being overweight. If you are overweight, defined as a body mass index >25, you are at higher risk of type 2 diabetes.

  • Hypertension. High blood pressure increases the risk of developing type 2 diabetes.

  • Abnormal cholesterol levels. High density lipoprotein (“good”) cholesterol levels under 35 mg/dL and/or a triglyceride level over 250 mg/dL increases your risk of type 2 diabetes.

  • History of gestational diabetes. Getting diabetes during pregnancy or delivering a baby over nine pounds can increase your risk of type 2 diabetes.

Other health risk factors for type 2 diabetes include:

  • A history of polycystic ovary disease.

  • Habitually inactive.

  • History of vascular disease (such as stroke).

Three messages present a trigger model, where what happens in the external environment and the individual's behaviors activate a gene that predisposes the individual to a higher risk of certain disease. The following is an example of the trigger model47:

  • There is an increased risk for developing depression when there is a family history of the illness. Not everyone with a genetic predisposition develops depression. Life events, a major loss or change, chronic stress, and alcohol and drug abuse, may trigger episodes of depression.

No messages presented a codable gene-behavior interaction model. Isolated fragments of text might be interpreted by some readers as consistent with an interactive model, such as the sentence in the paragraph cited above from the WebMD46: “… the chances of getting type 2 diabetes mellitus increase the more health risk factors you have.” But it is vague and readers could interpret it either as risk factors adding to each other independently, or interacting multiplicatively.

Health conditions and gene-environment relationship

Table 3 presents the models of gene-environmental relationship within each health condition. Only depression features an interactive model, and it is the “trigger” model. For heart disease, nine recommendations present a behavior only model, nine present an additive model. Interactive models such as trigger or multiplicative are not presented in any of the websites; neither is the gene only model. For diabetes, eight present the behavior only model, 11 present the additive model, and there are no interactive or gene only models presented. For lung cancer, 11 presentations are behavior only model, and the remaining six present the additive model. For depression, seven present the behavior only model, nine represent an additive model, and the remaining three feature a trigger model.

Table 3 Gene-environment relationships across diseases

DISCUSSION AND IMPLICATION

This study examined the websites of major organizations and informational portals that offer health information and recommendations for disease prevention as potentially important sources of information for lay individuals' understanding of genetic and environmental relationships with regard to health outcomes. The content of such online medical recommendations were analyzed for four diseases: heart disease, diabetes, lung lancer, and depression, with focus on genetic and environment information.

Although more than half of the recommendations provided genetic risk information, none of the online sources attribute health status to genes only. In addition, there is a clear dominance of behavioral factors: All the messages mentioned the attribution of environmental factors to human health, including behavior, social, and physical environment. Over 98% of websites mentioned behavior regulation. When the analysis shifted to how the gene and environment information was presented, the results suggested a strong and potentially undesirable imbalance in representations of gene-environment relationships. Over 82% of recommendations on sites that discuss genetic information presented an additive model, which considers gene and environment both to contribute to one's health condition, but presents them as working independently, without interaction. Medical recommendation websites thus have not caught up with the evolving science. There are potential reasons for this, including the tendency of websites to feature concrete, disease-specific information, which is still lagging the shift to the interactive model on a more general level.

Nonetheless, as suggested in the introduction, the continued presentation of the additive model might contribute to genetically deterministic or fatalistic thinking when one's genes predispose one to higher risk, instead of lower risk, for a certain health condition, because individuals might assume that the genetic component trumps the behavioral factors. Indeed, in on-going analysis of in-depth interviews we have conducted with lay individuals, some people do seem to assume (especially with regard to diabetes, but also heart disease) that behavioral factors only really matter for those who do not have a genetic “cause” to their disease.41 The additive approach thus may enhance, for example, perceptions that to the extent that obesity or obesity-related diabetes have genetic components they are not amenable to individual behavioral control or social environmental modification. This is also consistent with French et al.48 who found a subadditive model among lay individuals wherein either heavy smoking or a family history of heart disease alone is sufficient for them to perceive high risk.

The trigger model is the only version of an interactive model of the gene-environment relationship that we found in a form fully articulated enough to be codable. It is interesting that this model occurs only in the case of depression. It has previously been identified as occurring in lay discourse about mental health issues (including addiction)34 This model is also problematic. It tends to suppress the long-term accretive nature of the impact of most health behaviors. Furthermore, although genes can provide both protection and susceptibility, meaning that they can either decrease or increase one's risks, the trigger model focuses on the negative side. It addresses how one's risk for certain health condition can increase as an outcome of gene and behavior combined, but in the many situations where a behavior cannot be completely avoided (e.g., stress, eating fat, salt, simple sugars), it may not be perceived as providing a model of how one's risks can be reduced and/or disease prevented. In other words, although the trigger model does not provide a model that is genetically deterministic or fatalistic in all cases, it may not facilitate lifestyle change in most cases either. In these cites, more generally, there is an absence of any positive casting of genes as “protective,” that is, there are not representations of the concept that some genes may provide some protection for people from some particular health hazards. The most commonly represented models, depicted in opinion leaders on health information, therefore, might be contributing to health fatalism; at least, they have little impact preventing such beliefs.

The analysis also indicates that there is some variation in causal representations according to disease. These variations do not seem to strictly track with a variation in the genetic component or the degree of interaction between genes and other factors in diseases. For example, the documentation of a genetic49 and behavioral input exists for lung cancer, but lung cancer is represented predominantly in behavioral terms. In the case of lung cancer, this may be due to the long-standing antismoking campaign's linkage to lung cancer prevention, rather than to the characteristics of disease causation or the developmental stage of the research on interaction effects. More refined attention to the variation in representations by disease seems warranted.

Perhaps the most important finding in the study is a null one—no website presented an accurate and directly articulated representation of gene-environment interaction. Genes and environment interact, which means that the impact of genes on one's health and that of behavior are synergistic. On one hand, some genes can amplify the damages from unhealthy behavior, hence accelerate, or elevate the likelihood of, the occurrence of a certain health condition. On the other hand, this also means that risks can be lowered, among those with genetic risks, once a healthier lifestyle is adopted. Understanding and appreciation of this interactive relationship has the potential to prevent and reverse health fatalism among lay individuals.

This study has not provided an account of all medical recommendations, but only those from a wide variety of frequently visited sites on the web as detailed in the table available online only. The potential effects of the contents as we have described them remain speculative, and additional empirical message effects studies are warranted. Even if studies show that these web sites have the undesirable effects indicated, this does not mean that the web sites have caused the lay attitudes in the historical sense. These web sites may simply be reflecting pre-existing lay attitudes. Genetics researchers and medical providers have themselves until recently worked primarily from an additive rather than interactive model. Human language and mental processes may even be biased toward single causation rather than interactive models.50 Consequently, it may also be more difficult to communicate the notion of an interaction than an additive model. Whatever the reasons for the existing contents of the websites, efforts to develop effective ways of communicating the somewhat complex notion of gene-behavior interaction to update the representation of genetics on medical recommendation websites seems warranted by a desire for increased accuracy and because it may be important to avoid enhancing genetic determinism and fatalism in the face of an increased presence and role for genetics in public health discourse.