Studies have established the importance of physical activity and fitness for long-term cardiovascular health, yet limited data exist on the association between objective, real-world large-scale physical activity patterns, fitness, sleep, and cardiovascular health primarily due to difficulties in collecting such datasets. We present data from the MyHeart Counts Cardiovascular Health Study, wherein participants contributed data via an iPhone application built using Apple’s ResearchKit framework and consented to make this data available freely for further research applications. In this smartphone-based study of cardiovascular health, participants recorded daily physical activity, completed health questionnaires, and performed a 6-minute walk fitness test. Data from English-speaking participants aged 18 years or older with a US-registered iPhone who agreed to share their data broadly and who enrolled between the study’s launch and the time of the data freeze for this data release (March 10 2015–October 28 2015) are now available for further research. It is anticipated that releasing this large-scale collection of real-world physical activity, fitness, sleep, and cardiovascular health data will enable the research community to work collaboratively towards improving our understanding of the relationship between cardiovascular indicators, lifestyle, and overall health, as well as inform mobile health research best practices.
Design Type(s): observation design · source-based data analysis objective · data collection and processing objective
Measurement Type(s): physical activity · sleep
Technology Type(s): crowd-sourced data generation
Factor Type(s): sex · height · weight · age · smoking status measurement · employment status
Sample Characteristic(s): Homo sapiens · United States of America
Machine-accessible metadata file describing the reported data (ISA-Tab format)
Background and Summary
Mobile technology, in particular advances in smartphone sensors, offers an opportunity to evaluate and monitor cardiovascular health and fitness1,2 with unprecedented connectivity. Direct measurement of activity through “always-on”, low-power motion chips allows for objective, real-world measurements of physiologic parameters. The widespread use of smartphones globally could thus transform research in this area and potentially improve clinical outcomes3,4,5,6,7.
In March 2015, MyHeart Counts (https://github.com/ResearchKit/MyHeartCounts) was launched as an observational smartphone-based study developed using Apple’s ResearchKit software development library (http://researchkit.org/). The study’s goal is to evaluate the feasibility of frequent, remote sampling of physiologic parameters, namely smartphone-based measures of fitness, activity, and sleep. These data may facilitate a more complete understanding of the association between objective measures of health, self-reported disease, and quality of life. Researchers may use these data to characterize activity profiles so as to better understand the impact of the number of activity transitions on health8. Many questions may stem from these data, each of which will require a community of researchers to explore.
Findings from the MyHeart Counts study revealed a clustering of participants by activity pattern into several distinct groups–sedentary, active, active only on workdays, active only on non-workdays. These cluster assignments were found to correlate with participants’ self-reported incidence of cardiovascular disease as well as self-reported mental well-being. Additionally, the MyHeart Counts study results suggest that patterns of activity correlate with different incidence of self-reported cardiovascular disease: individuals with multiple short bursts of physical activity throughout the day reported better health as compared to counterparts who performed the same number of minutes of physical activity, but in one longer session. These findings are described in JAMA Cardiology8.
MyHeart Counts utilized remote enrollment and consent in which participants self-guide through a visually engaging eConsent, in addition to a traditional consent form, prior to deciding to join the study9 (Fig. 1). A critical aspect of this transparent consent process is providing participants with an explicit decision point, allowing them to specify if the de-identified data they donate to the study can also be studied by qualified researchers worldwide10.
The MyHeart Counts iOS app was downloaded by 110,056 users from March 9, 2015 to October 28, 2015. The numbers of users who enrolled in the study, consented, and chose to share their data with researchers worldwide (broad sharing) or with Stanford only (narrow sharing) are shown in Fig. 1. The study cohort described here is composed of contributions from study participants who designated broad sharing of their data (n = 34,189).
Among consented participants, 4,990 (10.2%) completed a 6-minute walk test at the end of day 7 in the study. The 6-minute walk test activity collects not only distance traveled but also accelerometry during the walk. This constitutes the largest 6-minute walk data cohort to date11,12,13,14,15. Additionally, MyHeart Counts users were able to upload data from wearable devices compatible with HealthKit. The most popular wearable devices used are listed in Table 1. These additional data from wearables enabled more continuous monitoring of activity patterns than was possible through the phone alone.
Our aim in sharing data donated by MyHeart Counts participants is to encourage the consolidation of a broad, diverse, and collaborative community of mobile health researchers. We invite diverse solvers from around the world to engage in better understanding how mobile technologies can impact cardiovascular health.
These methods are expanded versions of descriptions in our related work8.
The MyHeart Counts app was made available starting in March 2015 through the Apple App Store (https://itunes.apple.com/us/app/myheart-counts/id972189947?mt=8) in the United States for iPhone 4S or newer requiring a minimum of iOS 8. Enrollment was open to individuals 18 years of age or older who were able to read and understand English and had iPhones registered in the United States. Participants then completed an interactive eConsent process that included animated icons, concise text, and links for more information16. In completing the eConsent, participants designated a data sharing preference: only with Stanford (“narrow sharing”) or more broadly with qualified researchers worldwide (no default choice was presented) (Fig. 1c).
After completing the eConsent, participants were asked to e-sign an electronically rendered traditional consent form. A copy of the signed consent document was sent to participants by email, allowing for verification of their enrollment in the study. Following enrollment, participants could choose their next actions within the study, including setting a 4-digit passcode or registering a fingerprint scan to secure the study app, or completing preliminary study activities. These data were sent to Stormpath, a service used by the Bridge server to perform login and to store PHI separately from other forms of study data. As part of onboarding, participants were invited to grant the study app access to their iPhone’s HealthKit, Motion Activity, Notifications, and Location Services. Ethical oversight of the study was obtained from Stanford University’s Research Compliance Office (Protocol #IRB-31409).
Consented participants contributed a range of data passively, as well as data that were contributed actively through forms and surveys, and via the 6-minute walk test.
Data collected as part of onboarding included participant account information (name, email, password), as well as study data (gender, height, weight, wake & sleep times). These study data were sent to the server the first time the participant opened the study application after verifying their email post-consent.
We enabled passive data collection from HealthKit and Core Motion when the participant opened their study app for the first time after verifying their email. HealthKit is a framework designed to capture, store, and facilitate sharing of health and physical activity data collected from iPhone sensors between apps. Additionally, a variety of apps and devices may write to HealthKit (e.g., Fitbit Sync Helper, Nike+ Run Club, Apple Watch, Beddit). The MyHeart Counts app captures a variety of body measurements (height, weight), physical activity data (active energy expenditure in kcal, cycling distance, flights climbed, sleep analysis, stand hours, steps, walking and running distance, workouts), health results (blood glucose), and vital signs (diastolic/systolic blood pressure, oxygen saturation) if they have been entered in HealthKit.
With users’ permission, during the initial 7-day monitoring period, motion was recorded through the Core Motion coprocessor chip of their iPhone (iPhone 5S or newer). The low-power chip integrates a number of sensor signals, including a triaxial accelerometer, gyroscope, compass, and barometer, to estimate the presence of movement, the distance traveled, and the modality of movement (i.e., walking, running, cycling, driving). Throughout the study, users were able to visualize these data on a dashboard built into the app. Data were sent to the server over Wi-Fi or cellular whenever 50 Kb had been collected or when the collected data were older than 24 h.
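The upload rule described above can be sketched as a simple predicate. This is only an illustration and not the app's actual code: the function name and parameters are hypothetical, and "50 Kb" is read here as 50,000 bytes, which is an assumption.

```python
# Hypothetical sketch of the "send when 50 Kb collected or older than 24 h" rule.
# Assumption: "50 Kb" is interpreted as 50,000 bytes.
SIZE_LIMIT_BYTES = 50_000
MAX_AGE_SECONDS = 24 * 60 * 60


def should_upload(buffered_bytes: int, oldest_record_age_s: float) -> bool:
    """Return True when buffered motion data should be sent to the server."""
    if buffered_bytes <= 0:
        # Nothing has been collected, so there is nothing to send.
        return False
    size_reached = buffered_bytes >= SIZE_LIMIT_BYTES
    too_old = oldest_record_age_s > MAX_AGE_SECONDS
    return size_reached or too_old
```

For example, a small buffer is held back until its oldest record passes the 24 h mark, whereas a buffer that has reached the size limit is sent immediately.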
On the final day of the study (eighth day post-enrollment), participants were presented with a final set of questionnaires. These consisted of the Well-Being and Risk Perception Survey with additional questions used to compute the participant’s Atherosclerotic Cardiovascular Disease Risk Score17,18 from which a Heart Age19 was calculated. All survey questions as well as app screenshots of the survey presentation are available on the Synapse MyHeart Counts Public Researcher Portal (Wiki, Data Description, Survey Data Gathered in the MyHeart Counts App)20.
During the same interval, participants were asked to complete a self-administered 6-minute walk test. The 6-minute walk test is a phone-guided task that triggers the collection of global positioning system displacement-based distances, pedometer-based distances, pedometer step counts, and accelerometer and gyroscope measurements in both raw and processed formats.
Correlation analysis was performed to determine whether a participant’s duration of app usage during the first 8 days post-enrollment was associated with responses to the above-mentioned surveys (Fig. 2). It was found that participants with self-reported heart disease, vascular disease, and family history of heart disease used the app longer (Fig. 2a). Specifically, family history of heart disease correlated with 0.23 ± 0.13 (p = 1.734e-4) more days of app usage, presence of heart disease correlated with 0.56 ± 0.18 (p = 8.78e-10) more days of app usage, and presence of vascular disease correlated with 0.47 ± 0.25 (p = 1.90e-4) more days of app usage. Similarly, participants’ mental well-being correlated with app usage (Fig. 2b). Participants who reported scores of 8–10 on the “feel worthwhile” and “happy” questions used the app longer as compared to those who reported low (1–3) or medium (4–7) values. Conversely, those who reported high values on the “worried” and “depressed” questions were found to use the app for a shorter period of time. Self-perceived risk was also significantly associated with duration of app usage (Fig. 2c). On the scale for this survey, those with a medium (score = 3) self-perceived 10-year risk used the app 0.29 ± 0.03 (p < 2.2e-16) days longer than those with low (score = 1 or 2) or high (score = 4 or score = 5) perceived risk.
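As a rough illustration of the grouping used for the well-being questions, the sketch below bins scores into the low (1–3), medium (4–7), and high (8–10) ranges given above and averages days of app usage within each bin. It is not the analysis code used for Fig. 2; the function names and the (score, days) record layout are hypothetical, and scores outside 1–10 are simply skipped, which is an assumption.

```python
from collections import defaultdict
from statistics import mean
from typing import Optional


def bin_wellbeing(score: int) -> Optional[str]:
    """Map a well-being score to the low/medium/high ranges quoted in the text."""
    if 1 <= score <= 3:
        return "low"
    if 4 <= score <= 7:
        return "medium"
    if 8 <= score <= 10:
        return "high"
    return None  # assumption: scores outside 1-10 are ignored


def mean_usage_by_bin(records):
    """records: iterable of (score, days_of_app_usage) pairs (hypothetical layout)."""
    usage = defaultdict(list)
    for score, days in records:
        label = bin_wellbeing(score)
        if label is not None:
            usage[label].append(days)
    # Average the usage days within each score bin.
    return {label: mean(days) for label, days in usage.items()}
```

A call such as mean_usage_by_bin([(9, 8), (5, 4), (2, 3)]) returns one mean per bin, which can then be compared across the low, medium, and high groups.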
Data collection and distribution
Data were sent by the app in encrypted form to Bridge Server, a RESTful API and researcher web interface developed and operated by Sage Bionetworks (http://sagebase.org/) and run on Amazon Web Services (AWS). Bridge is designed to allow collection and management of mobile health data from apps by providing apps the ability to securely create accounts for participants. The server then records consent and the identifying personal information required for account creation separately from study data. This separation is accomplished by storing personal information and accounts in a separate accounts database, and storing study data in S3 buckets on AWS. A dictionary stored in the Bridge server can convert an account identifier, used by the app when sending data, into a healthCode, used by the research team to identify an individual in the coded data (https://developer.sagebridge.org/articles/security.html).
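The idea of converting an account identifier into a healthCode can be illustrated with a small sketch. This is not Bridge's implementation: the class, function, and field names below are hypothetical, and using a random UUID as the code is an assumption made purely for illustration.

```python
import uuid


class CodeBook:
    """Hypothetical dictionary mapping account identifiers to health codes.

    In this sketch the mapping lives apart from the study data, so coded
    records alone do not reveal which account produced them.
    """

    def __init__(self):
        self._account_to_code = {}

    def health_code_for(self, account_id: str) -> str:
        # Assign a random code the first time an account is seen, then reuse it.
        if account_id not in self._account_to_code:
            self._account_to_code[account_id] = str(uuid.uuid4())
        return self._account_to_code[account_id]


def code_record(codebook: CodeBook, account_id: str, payload: dict) -> dict:
    """Return study data tagged with the health code instead of the account id."""
    return {"healthCode": codebook.health_code_for(account_id), **payload}
```

Because the same account always maps to the same code, records from one participant remain linkable in the coded data without exposing the account identifier.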
Coded study data, consisting of survey responses, mobile sensor measurements, and device data, were exported to Synapse (https://www.synapse.org/) for distribution to researchers. Synapse21 is a general-purpose data and analysis sharing service where members can work collaboratively, analyze data, share insights, and track the attribution and provenance of those insights to share with others. Synapse is developed and operated by Sage Bionetworks as a service to the biomedical research community. These Bridge and Synapse services have been used to support numerous health studies, including all five of the initial ResearchKit apps launched in March 20158,22,23 as well as subsequent studies24.
Multiple updates of the MyHeart Counts app were released during the study period to address software-related concerns and to implement new features. Because of an initial technical issue with the integration of HealthKit and ResearchKit data, demographic information is missing for a number of early participants. Participants were subsequently emailed to request they upgrade so this missing information could be provided.
Data were restricted to records shared by versions of the app released before October 28, 2015 (Version 1.0, 1.0.2, 1.0.3, 1.0.4, 1.0.5, 1.0.6, 1.0.7, 1.0.8, 1.5.0 [AKA 1.5.1 build 10]). For tables containing records lacking an AppVersion column, such as HealthKit and 6MWT Displacement data, data sent before October 28, 2015 were included. A total of 48,968 participants consented to the study and agreed to share their data broadly with the research community. 40,017 participants completed at least one survey or task after joining the study, of whom 34,189 agreed to share their data broadly. 6,870 completed all surveys presented in the first 8 days of their participation and were ages 40–70 years, allowing for computation of their 10-year risk score. 4,990 completed at least one 6MWT, for a total of 6,927 completed 6MWTs. Clinical and demographic characteristics are provided in Table 2.
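The inclusion rule for this release can be sketched as a record filter. The version list and the cutoff date come from the text above; the record layout (a dict with an optional "AppVersion" key and a hypothetical "sent_on" date) and the exact format of the version strings are assumptions, not the schema of the released tables.

```python
from datetime import date

# App versions named in the text; "1.5.0" is also known as 1.5.1 build 10.
INCLUDED_VERSIONS = {
    "1.0", "1.0.2", "1.0.3", "1.0.4", "1.0.5",
    "1.0.6", "1.0.7", "1.0.8", "1.5.0",
}
DATA_FREEZE = date(2015, 10, 28)


def in_release(record: dict) -> bool:
    """Decide whether a record belongs in this data release (illustrative only)."""
    version = record.get("AppVersion")
    if version is not None:
        # Tables with an AppVersion column are filtered by app version.
        return version in INCLUDED_VERSIONS
    # Tables without AppVersion fall back to the date the data were sent.
    return record["sent_on"] < DATA_FREEZE
```

Applying the same predicate to every table keeps the version-based and date-based rules in one place.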
The number of study participants who provided daily HealthKit step data and activity pattern data derived from the phone’s Core Motion accelerometer is illustrated in Fig. 3. As an example, the distribution of the total number of days of motion and HealthKit step data provided by users during the study period is also illustrated in Fig. 3.
For the 25,774 participants who supplied location data, we illustrate their geographic distribution (https://www.aggdata.com/free/united-states-zip-codes) by state in Fig. 4. The three states with the largest number of participants are California (n = 9,813), New Jersey (n = 3,560), and New York (n = 3,252).
The survey data provided here are participant-reported outcome responses. For the current data release, participants were allowed to enter survey responses that may not be physiologically possible. This was corrected in version 2 of MyHeart Counts, released Dec 12, 2016 (data not included in this release).
The Core Motion data provided here are derived from Apple iPhone devices with proprietary technical validation. We do not provide test-retest or other technical validation data sets here; however, others have reported technical validation of the Core Motion sensor in a different context25.
The 6-Minute Walk Test was validated in an outdoor setting by comparison of step count and distance reported by the MyHeart Counts app with corresponding values obtained in accordance with the ATS Statement: Guidelines for the Six-Minute Walk Test (n = 20)1. On a validation set of 26 tests, mean error was −3.39 yards, mean absolute error was 56.65 yards, and standard deviation was 70.28 yards8. A negative correlation (Pearson r = −0.58) was found between distance walked and Six-Minute Walk Test error. Due to limitations in how the ActiveTask was encoded, if the study app leaves the foreground during a test, the data collected may not be complete.
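For readers who wish to compute comparable summary statistics on their own paired measurements, a minimal sketch follows. It is not the validation code used in the study. Several choices are assumptions: error is defined as app-reported distance minus reference distance, the standard deviation is the sample standard deviation of the errors, and "distance walked" is taken to be the reference distance. The function names are hypothetical.

```python
from math import sqrt
from statistics import mean, stdev


def pearson(x, y):
    """Pearson correlation coefficient of two equal-length sequences."""
    mx, my = mean(x), mean(y)
    cov = sum((a - mx) * (b - my) for a, b in zip(x, y))
    sx = sqrt(sum((a - mx) ** 2 for a in x))
    sy = sqrt(sum((b - my) ** 2 for b in y))
    return cov / (sx * sy)


def validation_summary(app_yards, reference_yards):
    """Summarize agreement between app-reported and reference distances."""
    # Assumption: error = app-reported distance minus reference distance.
    errors = [a - r for a, r in zip(app_yards, reference_yards)]
    return {
        "mean_error": mean(errors),
        "mean_absolute_error": mean(abs(e) for e in errors),
        "sd_error": stdev(errors),  # sample standard deviation (assumption)
        "pearson_distance_vs_error": pearson(reference_yards, errors),
    }
```

A negative value of the last entry means that, under these conventions, the app tends to under-report relative to the reference as the distance walked increases.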
The MyHeart Counts study experienced limitations similar to those of the five other fully mobile, large-scale ResearchKit flagship studies26. Although the study recruited over 50,000 users within an interval of 6 months, users did not sustain follow-up and there was a significant drop-off rate: the mean time of engagement with the app was 4.1 days, consistent with the Asthma Health and mPower studies22,23. The high dropout rate was due to low barriers to entry and exit, which proved a double-edged sword: they increased dropout but also facilitated engagement of individuals who are difficult to reach through more traditional means. No sign of systematic bias was found in the characteristics of users who dropped out early versus those who remained in the study longer. Readers may find a more comprehensive list of limitations related to the MyHeart Counts study in a previous publication8.
Usage Notes
We have instituted governance structures that balance sharing these data for secondary research with commensurate privacy protections for participants:
Researchers interested in accessing these data should complete the following steps:
Register for a Synapse account (www.synapse.org)
Become a Synapse Certified User by passing a short quiz (www.synapse.org/#!Quiz:Certification)
Have their Synapse User Profile validated by the Synapse Access and Compliance Team (ACT)
Submit an Intended Data Use statement that is publicly posted
Agree to the data-specific Conditions for Use (see DOIs for each data source)
While certain data types may have additional Conditions for Use (e.g., HealthKit data), the overarching Conditions for Use are as follows:
You confirm that you will not attempt to re-identify research participants for any reason, including for re-identification theory research
You reaffirm your commitment to the Synapse Awareness and Ethics Pledge
You agree to abide by the guiding principles for responsible research use and data handling as described in the Synapse Governance documents
You commit to keeping these data confidential and secure
You agree to use these data exclusively as described in your submitted Intended Data Use statement
You understand that these data may not be used for commercial advertisement or to re-contact research participants
You agree to report any misuse or data release, intentional or inadvertent, to the Access and Compliance Team (ACT) within 5 business days by emailing firstname.lastname@example.org
You agree to publish findings in open access publications
You promise to acknowledge the research participants as data contributors and study investigators on all publications or presentations resulting from using these data as follows: ‘These data were contributed by users of the MyHeart Counts mobile application as part of the MyHeart Counts study developed by Stanford University and described in Synapse20.’
See the full instructions for requesting data access on the Accessing the MyHeart Counts Data page (Wiki, Accessing the MyHeart Counts Data20).
Examples of client interactions with these data are provided in GitHub: https://github.com/AshleyLab/myheartcounts/tree/DataReleaseManuscript.
The MyHeart Counts iOS app (https://github.com/ResearchKit/MyHeartCounts) was built using Apple’s ResearchKit framework (http://researchkit.org/), which is open source and available on GitHub (https://github.com/researchkit/researchkit). It leverages AppCore (https://github.com/ResearchKit/AppCore), a layer built on top of ResearchKit that was shared among the five initial ResearchKit apps. The Bridge iOS SDK (https://github.com/Sage-Bionetworks/Bridge-iOS-SDK) provides integration with Sage Bionetworks’ Bridge Server, a back-end data service designed for collection of participant-donated study data (https://developer.sagebridge.org/). The MyHeart Counts app can be downloaded from the Apple App Store (https://itunes.apple.com/us/app/myheart-counts/id972189947?mt=8).
Dorsey, E. R. et al. The Use of Smartphones for Health Research. Acad. Med. 92, 157–160 (2017).
McConnell, M. V., Turakhia, M. P., Harrington, R. A., King, A. C. & Ashley, E. A. Mobile Health Advances in Physical Activity, Fitness, and Atrial Fibrillation: Moving Hearts. J. Am. Coll. Cardiol. 71, 2691–2701 (2018).
Glynn, L. G. et al. Effectiveness of a smartphone application to promote physical activity in primary care: the SMART MOVE randomised controlled trial. Br. J. Gen. Pract. 64, e384–91 (2014).
Bort-Roig, J., Gilson, N. D., Puig-Ribera, A., Contreras, R. S. & Trost, S. G. Measuring and influencing physical activity with smartphone technology: a systematic review. Sports Med. 44, 671–686 (2014).
Saran, T., Pedrycz, A., Mucha, D. & Mucha, D. Follow-up monitoring of physical activity after rehabilitation by means of a mobile application: Effectiveness of measurements in different age groups. Adv. Clin. Exp. Med., https://doi.org/10.17219/acem/69131 (2018).
Seifert, A., Schlomann, A., Rietz, C. & Schelling, H. R. The use of mobile devices for physical activity tracking in older adults’ everyday life. Digit. Health 3, 2055207617740088 (2017).
Patel, D. N., Nossel, C., Patricios, J. & Maboreke, J. Bright spots, physical activity investments that work: Vitality Active Rewards-a smartphone app that incentivises programme members to be physically active. Br. J. Sports Med., https://doi.org/10.1136/bjsports-2018-099271 (2018).
McConnell, M. V. et al. Feasibility of Obtaining Measures of Lifestyle From a Smartphone App: The MyHeart Counts Cardiovascular Health Study. JAMA Cardiol. 2, 67–76 (2017).
Wilbanks, J. & Friend, S. H. First, design for data sharing. Nat. Biotechnol. 34, 377 (2016).
Grady, C. et al. Informed Consent. N. Engl. J. Med. 376, 856–867 (2017).
Ulrich, S. et al. Reference values for the 6-minute walk test in healthy children and adolescents in Switzerland. BMC Pulm. Med. 13, 49 (2013).
Tveter, A. T., Dagfinrud, H., Moseng, T. & Holm, I. Health-related physical fitness measures: reference values and reference equations for use in clinical practice. Arch. Phys. Med. Rehabil. 95, 1366–1373 (2014).
Kanburoglu, M. K., Ozdemir, F. M., Ozkan, S. & Tunaoglu, F. S. Reference values of the 6-minute walk test in healthy Turkish children and adolescents between 11 and 18 years of age. Respir. Care 59, 1369–1375 (2014).
Britto, R. R. et al. Reference equations for the six-minute walk distance based on a Brazilian multicenter study. Brazilian Journal of Physical Therapy 17, 556–563 (2013).
D’silva, C., Vaishali, K. & Venkatesan, P. Six-Minute Walk Test-Normal Values of School Children Aged 7–12 Y in India: A Cross-Sectional Study. Indian J. Pediatr. 79, 597–601 (2011).
Doerr, M., Suver, C. & Wilbanks, J. Developing a Transparent, Participant-Navigated Electronic Informed Consent for Mobile-Mediated Research., https://doi.org/10.2139/ssrn.2769129 (2016).
Goff, D. C. Jr. et al. 2013 ACC/AHA guideline on the assessment of cardiovascular risk: a report of the American College of Cardiology/American Heart Association Task Force on Practice Guidelines. Circulation 129, S49–73 (2014).
Lloyd-Jones, D. M. Prediction of Lifetime Risk for Cardiovascular Disease by Risk Factor Burden at 50 Years of Age. Circulation 113, 791–798 (2006).
D’Agostino, R. B. Sr. et al. General cardiovascular risk profile for use in primary care: the Framingham Heart Study. Circulation 117, 743–753 (2008).
Hershman, S. et al. The MyHeart Counts Study, lifestyle and cardiovascular health data collected using ResearchKit. Synapse, https://doi.org/10.7303/syn11269541 (2018).
Derry, J. M. J. et al. Developing predictive molecular maps of human disease through community-based modeling. Nat. Genet. 44, 127–130 (2012).
Chan, Y.-F. Y. et al. The Asthma Mobile Health Study, a large-scale clinical observational study using ResearchKit. Nat. Biotechnol. 35, 354 (2017).
Bot, B. M. et al. The mPower study, Parkinson disease mobile data collected using ResearchKit. Sci. Data 3, 160011 (2016).
Webster, D. E. et al. The Mole Mapper Study, mobile phone skin imaging and melanoma risk data collected using ResearchKit. Sci. Data 4, 170005 (2017).
Yu, Y. et al. Initial Validation of Mobile-Structural Health Monitoring Method Using Smartphones. Int. J. Distrib. Sens. Netw. 11, 274391 (2015).
ATS Committee on Proficiency Standards for Clinical Pulmonary Function Laboratories. ATS statement: guidelines for the six-minute walk test. Am. J. Respir. Crit. Care Med. 166, 111–117 (2002).
Profile 5: Population Division, U.S. Bureau of the Census. Popul. Index 45, 13 (1979).
Adams, R. Revised Physical Activity Readiness Questionnaire. Can. Fam. Physician 45, 992, 995, 1004–5 (1999).
Arena, R. et al. The role of worksite health screening: a policy statement from the American Heart Association. Circulation 130, 719–734 (2014).
OECD. OECD Guidelines on Measuring Subjective Well-being., https://doi.org/10.1787/9789264191655-en (2013).
The Division of Cardiovascular Medicine, Department of Medicine, Stanford University, received in-kind (software development) support from Apple Inc. We would like to acknowledge the contributions of all our academic and technology partners who helped with the project, with special thanks to our developers Dariusz Lesniak and Paweł Kowalczyk. Software development was partially funded by the Stanford Data Science Initiative. Steven Hershman is supported by The Division of Cardiovascular Medicine, Department of Medicine.
Dr. McConnell is an employee of Verily Life Sciences LLC. Dr. Harrington is on the board of directors for Scanadu Inc (which is privately held) but reported receiving no consulting fees and reported having stock options with no current value. Megan Doerr is a co-inventor of Cleveland Clinic’s MyFamily (MyLegacy) intellectual property portfolio, licensed to Family Care Path, Inc. As part of this license, Ms. Doerr is entitled to a share in both royalties and returns of equity.
Hershman, S.G., Bot, B.M., Shcherbina, A. et al. Physical activity, sleep and cardiovascular health data for 50,000 individuals from the MyHeart Counts Study. Sci Data 6, 24 (2019). https://doi.org/10.1038/s41597-019-0016-7