Each individual has a unique and complex set of genetic, lifestyle, and environmental factors that impact clinical biomarkers and contribute to the manifestation of common conditions such as heart disease, diabetes, obesity, and hypertension. For this reason, a systems-based approach to quantifying wellness and detecting transitions to disease is well suited for prevention of chronic conditions common to modernized societies.

While there is strong scientific interest for using multi-omic data to prevent chronic diseases related to lifestyle and behavior, to date little value has been demonstrated for consumers or patients. For example, some studies have shown that simply receiving genetic information about one’s risk for chronic diseases does not lead to behavior change or actual risk reduction1, although more recent studies including an updated meta-analysis show modest behavior changes resulting from genetic information2,3. In addition, some scientists and physicians are understandably critical of providing genetic information in the absence of measuring the relevant clinical markers.

For some disease phenotypes, the relative contributions of genetics and lifestyle have been explored. One recent study found that a polygenic risk score and a lifestyle risk score had independent and additive effects on cardiovascular outcomes4. Because of the important effects of lifestyle on chronic disease risk, studies have also examined the effectiveness of health coaching on promoting clinical changes. Generally, these studies have found lifestyle coaching to be beneficial5,6. Furthermore, while there is some evidence that genetic predisposition has an impact on clinical response7,8, much less is known about the role of genetics in determining response to lifestyle change, supporting the need for further study.

To address these gaps, we developed a systems-based approach, “Scientific Wellness”, which combines genetic and clinical data with lifestyle coaching. For each individual, we generate personal, dense, dynamic data (PD3) clouds, which can aid in identifying unique actionable possibilities to optimize wellness and avoid disease. A critical component of Scientific Wellness is regular interaction with a registered dietitian or other allied healthcare provider for education and behavior modification. We previously published a pilot study that established the feasibility and potential clinical impact of this approach9.

Here we report on the results of a real-world observational study, using longitudinal blood, saliva, and anthropometric data gathered from 2531 participants in the commercial Scientific Wellness program (Arivale, Inc). We quantify the average change of each clinical marker for participants in their first year of the program, both in the population as a whole, and in clinically-relevant strata. For several clinical markers, we also examine the relationship between genetic predisposition and baseline values. Finally, we estimate the impact of genetic predisposition on changes in these clinical markers during the first year of the program.


The study cohort had a mean age of 48 years (SD = 12), approximately 60% were female, and 79% were White, 9% Asian, 4% Hispanic, and 2% Black. The number of participants with longitudinal observations for each clinical marker is shown in Table 1. The average time between measurements for blood lab data was 179 days (SD = 49), blood pressure 142 days (SD = 81), salivary cortisol 307 days (SD = 95), waist circumference 102 days (SD = 81), and weight 9 days (SD = 28).

Table 1 Clinical characteristics of the study population.

In total, 55 clinical markers were analyzed. At baseline and latest measurements, population means, SDs, and % of participants out of range (OOR) were recorded (Table 1 and Supplementary Table 1).

Longitudinal Changes

We estimated 6- and 12-month changes for the average participant, adjusted for confounding effects; we refer to these as “adjusted changes” throughout. These adjusted changes were estimated for the entire participant population, as well as for strata defined by baseline reference ranges (‘normal at baseline’, ‘low at baseline’, and ‘high at baseline’) when available and with sufficient sample size (Fig. 1, Table 2 and Supplementary Table 2).

Figure 1
figure 1

Longitudinal changes for select clinical markers. Panels a,c,e, and g: Adjusted changes for the average participant in the entire study population. Panels b,d,f,h: Adjusted average differences from the ‘normal at baseline’ strata at baseline for each baseline strata over time in the program. In panels, the points represent the estimates from the GLMMs, and the vertical bars show the 95% confidence intervals. The solid lines connecting the points visually shows the trajectories over time and the dashed horizontal lines show the starting value of the entire population or corresponding strata.

Table 2 Longitudinal changes from generalized linear mixed model analyses stratified by baseline range.

There was evidence of sustained improvements in clinical markers related to cardiometabolic risk, inflammation, nutrition, and anthropometrics. Several clinical markers, including triglycerides, gamma-glutamyl transpeptidase (GGT), hemoglobin A1c (HbA1c), omega-3 index, vitamin-D, waist circumference, and weight, had improvements in the entire population as well as in each baseline strata. Some of these clinical markers, such as HbA1c (Fig. 1a,b), had improvements from baseline to 6 months as well as from 6 to 12 months, while others, such as Vitamin D (Fig. 1c,d), had improvements from baseline at both 6 and 12 months, but remained stable between 6 and 12 months.

Other clinical markers, such as HDL-C, homocysteine, and insulin, showed improvements in the baseline OOR strata, but showed no evidence of change in the baseline normal strata. When considering the entire population, HDL-C showed no evidence of change, while homocysteine (Fig. 1e,f), showed improvements.

Lastly, markers such as LDL-C, glucose, hs-CRP, and diastolic and systolic blood pressure, had improvements in the baseline OOR strata, but had worsening in the normal strata. This pattern of changes may be indicative of regression to the mean effects arising due to measurement variability, along with using strata defined by baseline observations of the outcome variable10. Regression to the mean leads to biased overestimates of changes in strata analyses; however, it does not bias estimates of changes in the entire population, for which some of these markers showed improvements, such as LDL-C (Fig. 1g,h) and diastolic and systolic blood pressure.

Phenotypic Variation in Baseline Measures Explained by Genetic Markers

Associations were replicated between 11 of 13 genetic markers tested and the baseline measurements of clinical markers with which they were expected to be correlated. The most informative polygenic scores (PGSs) were for low-density lipoprotein cholesterol or LDL-C (11.1% variation explained), total cholesterol (8.7%), high-density lipoprotein cholesterol or HDL-C (6.9%), and triglycerides (3.9%). Compared to participants with LDL-C PGS in the second or third quartile (Q2/Q3) of the population (i.e. genetic predisposition for average LDL-C), participants with LDL-C PGS in the first quartile (Q1) of the population (genetic predisposition for lower LDL-C) had adjusted baseline LDL-C that was 15.7 mg/dL lower on average. Conversely, participants with LDL-C PGS in the fourth quartile (Q4) of the population (genetic predisposition for higher LDL-C) had adjusted baseline LDL-C that was 13.7 mg/dL higher on average.

The most informative single nucleotide polymorphisms (SNPs) were rs174537 (13.8% variation explained for arachidonic acid and 1.8% for EPA) and rs4588 (1.5% variation explained for vitamin D). Compared to participants with the GG genotype at rs4588, participants with the GT and TT genotypes had adjusted baseline vitamin D that was 1.8 ng/mL lower and 6.0 ng/mL lower on average, respectively. The percent of variation explained for PGS and SNPs used in this study were comparable to the original studies11,12. The partial r2 and estimated effect sizes for each clinical-genetic marker pair tested are presented in Table 3 and Supplementary Table 3.

Table 3 Effects of genetics on baseline levels and on longitudinal changes of clinical markers.

Effect of Genetics on Longitudinal Changes

The SNPs with the strongest genetic effects on longitudinal changes of associated clinical markers were the same SNPs that were most informative for baseline levels of those clinical markers (Table 3 and Supplementary Table 3). The G allele of rs174537 was additively associated with higher baseline levels of both arachidonic acid and EPA among participants in the program. Interestingly, having more copies of the G allele was associated with a greater increase of arachidonic acid through the course of the program (0.3% by wt. for GT vs TT, and 0.6% by wt. for GG vs. TT), but no difference in change of EPA.

We found similar longitudinal effects on differential change of clinical markers for the lipid PGSs (Table 3 and Supplementary Table 3). Adjusting for baseline LDL-C, those with an LDL-C PGS in Q1 (predisposed to lower/better LDL-C levels) had a 3.8 mg/dL greater decrease in LDL-C on average than those with an LDL-C PGS in Q2 or Q3 after the same amount of time in the program (Supplementary Figure 1). Adjusting for baseline total cholesterol, those with a total cholesterol PGS in Q1 (predisposed to lower/better total cholesterol levels) had a 4.3 mg/dL greater decrease in total cholesterol on average than those with a total cholesterol PGS in Q2 or Q3 after the same amount of time in the program. Adjusting for baseline HDL-C, those with an HDL-C PGS in Q4 (predisposed to higher/better HDL-C levels) had a 1.3 mg/dL greater increase in HDL-C on average than those with an HDL-C PGS in Q2 or Q3 after the same amount of time in the program. Lastly, adjusting for baseline triglycerides, those with a triglycerides PGS in Q4 (predisposed to higher/worse triglycerides levels) had a 5.1 mg/dL lesser decrease in triglycerides on average than those with a triglycerides PGS in Q2 or Q3 after the same amount of time in the program.


This study extends the results of previous studies9,13 and supports the importance of a Scientific Wellness approach, combining multi-omic data and personalized lifestyle coaching in a real-world setting. First, participants saw notable improvements in multiple clinical markers related to health, many of which were observed in the entire population, not just in those who began with out of range values. Second, previously reported associations between genetic markers and select clinical markers were replicated. Most intriguingly, some genetic markers were found to be associated with differences in the longitudinal changes in response to this lifestyle coaching program. These results suggest that certain genetic predispositions have an effect on the magnitude of change in clinical markers achieved through this program.

Some clinical improvements observed in this real-world study were comparable to improvements seen in diet and lifestyle randomized controlled trials (RCTs). For example, we found an adjusted average decrease in HbA1c of about 0.20% at 12 months in the entire study population. Among those participants with elevated baseline HbA1c, the adjusted average decrease was 0.26% at 12 months. The Diabetes Prevention Program, a RCT comparing intensive lifestyle intervention vs. metformin/standard care, saw slightly less than a 0.1% decrease in HbA1c at 12 months for those in the lifestyle intervention arm14. According to a meta-analysis, a 0.16% improvement in HbA1c in prediabetes is associated with at least a 1% reduction in the annualized incidence of diabetes, or an estimated 880,000 fewer cases of diabetes per year in the U.S15,16. These findings highlight that a program founded in systems biology and behavioral theory, and using scalable telephonic coaching, can provide improvements in glycemic health that compare to those seen in landmark clinical trials, and could have a meaningful impact on public health.

The estimated effect sizes of many of the genetics used in this study on baseline lab markers were clinically meaningful. For example, the difference in average adjusted baseline LDL-C was 29.4 mg/dL between LDL-C PGS Q1 and Q4. In previous studies, a reduction of similar magnitude in LDL-C (38.7 mg/dL) was found to be associated with a 23% reduction in relative risk of major vascular events17. Additionally, the LDL-C PGS was associated with differences in longitudinal changes of LDL-C, controlling for baseline values and time in the program. On average, after controlling for other risk factors, a participant with LDL-C PGS in Q1 would have seen a 3.82 mg/dL greater decrease in LDL-C in response to coaching than a participant with LDL-C PGS in Q2 or Q3. This suggests that a lifestyle coaching program may be more effective at lowering LDL-C for an individual with genetic predisposition for lower LDL-C relative to an individual with higher genetic risk. This result may not be surprising, as poor lifestyle choices may explain why someone with low genetic risk still has high LDL-C. These results are consistent with earlier studies showing high genetic predisposition for adverse lipid profiles limits the improvement in total cholesterol in response to lifestyle change7,8. Importantly, our results suggest that as the understanding of genetic predisposition continues to improve, so too will the ability to provide targeted personalized lifestyle recommendations, as well as the ability to identify when medical treatment is the best course of action.

This study has several limitations. As an observational study without a control group, we cannot separate the effect of coaching and the effect of being provided personalized data. In the future, it would be interesting to compare the full Scientific Wellness program to a standard coaching program that did not provide any clinical or genetic data to measure these effects separately. The lack of a control group may be particularly limiting for analyzing changes stratified by baseline clinical marker values, as regression to the mean could lead to biased estimates of effects10. We attempted to control for this by reporting changes in the total population as well as the out-of-range population. A pattern of improvements in the baseline OOR strata but worsening in the normal strata may be indicative of regression to the mean effects arising due to measurement variability.

Due to the personalized nature of the coaching, not all participants were working on improving all out of range clinical markers. Thus, our results may under-estimate the actual impact of health coaching on clinical outcomes. Coaches were aware of participants’ genetic predispositions when they generated personalized recommendations, which could lead to a bias in which participants with greater genetic predispositions (e.g. for higher LDL-C) received more aggressive lifestyle interventions. However, our results indicate that participants with greater genetic predispositions improve less in the program relative to participants with lower genetic predispositions. Therefore, any coaching bias to intervene more aggressively would act to attenuate our results rather than amplify them.

An additional potential limitation is the issue of compliance bias, which we were unable to address in the current study. Hypothetically, individuals who know they have higher - or lower - genetic risk for a trait may be less motivated to actively engage in lifestyle change. Some studies3 have reported greater self-reported behavior change in people who learned they were high-risk genetic carriers compared to low-risk non-carriers, but others2,18 did not find a relationship between genetic risk score and behavior change. Importantly, previous studies did not involve interactions with a trained lifestyle coach who can identify underlying core motivations and provide behavioral support and accountability to drive sustained behavior change. At the start of this study, data on participants’ actions and compliance were not collected in a way suitable for analysis; these data are now being collected and will be analyzed in the future.

Human wellness and disease are complex biological phenomena. The Scientific Wellness approach deals with this complexity by generating large amounts of multi-omic data, which we refer to as personal, dense, dynamic data (PD3) clouds, on many different biological systems for each individual. PD3 clouds can be used to understand an individual’s unique actionable possibilities for optimal wellness. This approach has the potential to transform our understanding of personalized medicine.


This real-world study of a Scientific Wellness program demonstrated not only clinical improvements in participants with out of range biomarkers at baseline, but also many clinical improvements in the overall population, presumably related to sustained engagement and lifestyle changes. Furthermore, we report that genetic predisposition for nutrition and wellness-related phenotypes impacts clinical responses to a lifestyle coaching program. We believe that investigations into the relationship between genetic predispositions and the impact of lifestyle intervention will prove a fruitful avenue for further study.



All research was conducted in accordance to regulations and guidelines for observational research in human subjects. The study was reviewed and approved by the Western IRB (Study Number 1178906). The research was performed entirely using de-identified and aggregated data of individuals who had signed a research authorization allowing the use of their anonymized data in research. Per current U.S. regulations for use of deidentified data, informed consent was not required. To be eligible to join the program, participants had to be over 18 years of age, not pregnant, and a resident of any U.S. state except New York. The participants analyzed in this study are the 92% of participants who agreed to research use as of 6/19/2018 and enrolled in the program between July 2015 and March 2018.

Personalized lifestyle coaching

Personalized, telephonic lifestyle coaching was provided to each participant in the program by registered dietitians, certified nutritionists, or registered nurses. A participant’s clinical data were available for them to view online via a data dashboard. To address specific OOR clinical markers, coaches provided lifestyle recommendations based on published scientific evidence which were further personalized in the context of the participant’s health goals and relevant genetic predispositions. Coaches did not make recommendations solely based on genetic risk, although they might take genetics into account when developing a behavioral plan for an out-of-range biomarker. For example, reducing sodium or caffeine might be recommended to any participant with high blood pressure, but if they also had risk alleles indicating enhanced susceptibility to dietary sodium or caffeine, this would be emphasized. See Supplementary Methods for details on personalized lifestyle coaching and Supplementary Table 4 for general clinical recommendations given for out-of-range biomarkers.

Lab Data

Fasting blood draws were scheduled every 6 months but actual collection times varied. Salivary cortisol measurements were collected at home using a 4-time-point collection procedure and analyzed by ZRT (Beaverton, OR). Blood pressure measurements were recorded at each blood draw, and some participants provided additional self-reported measurements between visits via the data dashboard.

All laboratory tests were performed in CLIA-approved labs. The labs provided reference ranges for a majority of these clinical markers. Reference ranges for blood pressure were defined by U.S. public guidelines19. See Supplementary Methods for details on lab data collection.

Anthropometric Data

Height, weight, and waist circumference were measured either at the blood draws (45%) or were self-reported via an online assessment or through the Fitbit Aria scale. Reference ranges for anthropometric data were defined by U.S. public health guidelines20.

Genetic Data

Genetic data were collected using whole genome sequencing for 2,380 participants or SNP microarray genotyping for 151 participants. Curated genetic markers relevant to nutrition and wellness were reported to all participants as part of the program. These included SNPs previously associated with a nutrition or wellness-related phenotype (e.g. rs4588 with Vitamin D21,22 and rs174537 with omega-3 and omega-6 fatty acids12,23), and polygenic scores (PGSs for LDL-C, HDL-C, triglycerides, BMI, and waist circumference. Each of these PGSs was constructed using publicly available summary statistics from published Genome-Wide Association Studies (GWAS)11,24,25. See Supplementary Methods for details on genotype calling.

Polygenic Score Creation

Briefly, the set of SNPs included in a PGS was determined as follows. The Benjamini-Hochberg26 procedure was applied to the p-values for all SNPs tested in the GWAS to account for multiple testing by controlling the false discovery rate (FDR) at a 5% level. This FDR filtered set of SNPs was then further pruned using linkage disequilibrium (LD): pairs of SNPs in close proximity capturing highly correlated information (r2 > 0.2) were identified, and the SNP with the smaller p-value in the pair was kept; this was repeated until all remaining SNPs were mutually uncorrelated (r2 < 0.2 for all pairs). The PGS for each individual was then calculated by summing up the published effect size for each selected SNP multiplied by the number of effect alleles the individual carried for that SNP, across all of the selected SNPs. Missing genotypes were mean imputed using the effect allele frequency. See Supplementary Table 5 for the list of variants in each polygenic score and their associated effect sizes. The homocysteine polygenic score was computed based on specific rules, which are provided in Supplementary Table 5.

Data and Sample Filtering

To be included in the analysis of a clinical marker (labs or anthropometrics), a participant was required to have a baseline measurement within 30 days of their first blood draw, and at least one follow-up measurement between 90 days and 15 months later. Measurements collected more than 15 months after the participant’s baseline blood draw were excluded. Blood draws reported as non-fasting were excluded (1.7% of all blood draws). Lipid measurements were excluded for participants who reported taking cholesterol-lowering medication; diabetes markers were excluded for participants who reported taking blood sugar medication; blood pressure measurements were excluded for participants who reported taking blood pressure medication. Additionally, 17 participants who reported having type 1 diabetes were excluded from analyses of diabetes markers.

Longitudinal Changes in Clinical Markers

Generalized linear mixed models (GLMMs) were used to estimate the average change in each clinical marker after 6 and 12 months in the program. The actual collection times of measurements varied from participant to participant. Therefore, rather than treating time in the program as a categorical variable with pre-specified collection points, linear regression splines were used to fit time as a continuous variable, allowing for differences in the trajectory of change of the clinical marker throughout the course of the program. To adjust for potential confounding effects, age at baseline, sex, enrollment channel, genetic ancestry, observation season, and observation vendor were included as fixed effects covariates in each GLMM.

Longitudinal Changes Stratified by Baseline Range

We classified participants into strata by their baseline measurements: those with baseline measurements within the healthy range (as defined by clinical reference ranges) were classified as ‘normal’, below this range as ‘low’, and above this range as ‘high’. We estimated the change at 6 and 12 months for the average participant by baseline strata using GLMMs as described above, with the addition of an interaction term between a categorical variable for baseline strata and the linear regression spline for time in the program. Changes were not estimated for baseline strata containing less than 50 participants.

Association of Genetic and Clinical Markers at Baseline

Linear regression models were used to estimate the % of variation explained by the genetic markers (SNPs or PGSs) provided to participants, as well as SNP genotype or PGS quartile effect sizes on baseline levels of the corresponding clinical markers. The same covariates included in the longitudinal change models were included in these regressions.

Impact of Genetic Markers on Longitudinal Changes

Each genetic marker tested for association with a particular clinical marker at baseline was also tested for an effect on the longitudinal change of that clinical marker. Linear mixed models (LMMs) were used to identify interaction effects between different SNP genotypes or PGS quartiles and clinical marker changes, adjusting for baseline clinical marker values after the same amount of time in the program. Fixed effects covariates based on the same potential confounding variables as described for the longitudinal change models were used in these models.

See Supplementary Methods for details on regression models. P-values for all analyses were adjusted for the effects of multiple hypotheses testing using the Benjamini-Hochberg procedure26 (Supplementary Tables 2 and 3). All of the Results discussed in this study were significant after multiple hypothesis correction.