Biomarkers are physiologic, pathologic, or anatomic characteristics that are objectively measured and evaluated as an indicator of normal biologic processes, pathologic processes, or biological responses to therapeutic interventions. Recent advances in the development of mobile digitally connected technologies have led to the emergence of a new class of biomarkers measured across multiple layers of hardware and software. Quantified in ones and zeros, these “digital” biomarkers can support continuous measurements outside the physical confines of the clinical environment. The modular software–hardware combination of these products has created new opportunities for patient care and biomedical research, enabling remote monitoring and decentralized clinical trial designs. However, a systematic approach to assessing the quality and utility of digital biomarkers to ensure an appropriate balance between their safety and effectiveness is needed. This paper outlines key considerations for the development and evaluation of digital biomarkers, examining their role in clinical research and routine patient care.
Biomarkers are characteristics (such as a physiologic, pathologic, or anatomic characteristic or measurement) that are objectively measured and evaluated as an indicator of normal biologic processes, pathologic processes, or biological responses to a therapeutic intervention.1 Building on this standard definition, we describe an emerging class of biomarker, the “digital biomarker”, which has important implications for both clinical trials and clinical care. “Digital” refers to the method of collection as using sensors and computational tools, generally across multiple layers of hardware and software. The measurements are often made outside the physical confines of the clinical environment using home-based connected products2 including wearable, implantable, and ingestible devices and sensors. Digital biomarkers span a broad range of diagnostic and prognostic measurements (Table 1). We discuss development and evaluation of the digital biomarkers, outlining opportunities and challenges associated with their use in clinical research and routine care. As remote monitoring of digital biomarkers becomes increasingly prevalent, we discuss the challenges to patient privacy and patient autonomy.
Just as clinicians must evaluate a drug’s safety and effectiveness by critically appraising clinical trials, they will increasingly need to know how to evaluate, select, and “prescribe” digital health tools and biomarkers. Some biomarkers are immediately familiar to patients or physicians as they are digitized versions of well-established metrics—for example, glucometer readings transmitted by Bluetooth, or the timed six-minute walk test measured with the smartphone’s built-in gyroscope and accelerometer. Others, such as the smartphone-derived tapping test for Parkinson’s disease severity, are novel and evolving.3 Digital biomarkers are an essential component in autoregulated closed loop systems. For example, in an “artificial pancreas” model, a continuous glucose sensor linked to an insulin pump can automatically dose insulin in patients with diabetes.4
The anatomy and evaluation of digital biomarkers
An input layer such as a camera, microphone, or sensor captures a digital biomarker signal. For example, photoplethysmographs measure blood volume changes in the microvasculature using an optical sensor placed on the skin surface. A signal processing layer, typically an algorithm, converts the input signal into actionable metrics (e.g., oxygen saturation and/or heart rate), or digital biomarkers. Although measuring blood volume changes using photoplethysmography is widely accepted in medical practice, the interplay among hardware, sensors, and algorithms can make the evaluation of emerging digital biomarkers difficult. There are several challenges in deciding not only whether a digital biomarker is valid, but equally important, whether it is “fit-for-purpose”, meaning that the product has an explicit context of use, meets appropriate requirements for accuracy and precision, and is accompanied by the metadata needed for analysis and interpretation.5
Analytical verification uses engineering bench tests to ensure that the product is measuring and storing values accurately by confirming the tool’s accuracy, precision, and reliability. Confidence in the performance of digital biomarkers is an important consideration for researchers, clinicians, and patients. For example, the verification step ensures that the translation from raw data, e.g., that a heart rate sensor measuring electrical potential in millivolts, faithfully converts that signal into an accurate heart rate, expressed in beats per unit of time.
As with diagnostics, the performance of digital biomarker algorithms may vary across different patient populations, producing different rates of false-positive or false-negative outputs in different groups. Validation addresses whether the measurement is applicable in the target population and context of use,6 which would render digital biomarker “fit for purpose”. For example, a tool measuring sleep and waking periods perform against polysomnography may perform differently in a patient population with insomnia versus sleep apnea versus healthy volunteers.
Digital biomarker products can be composed of multiple individual software and hardware components. When the components are interoperable, they can be mixed and matched as modular components to assemble a diverse array of offerings. For example, the US Food and Drug Administration (FDA) recently approved the Dexcom integrated continuous glucose monitoring system as the first type of continuous glucose monitoring system that can be used in a modular fashion with other compatible medical devices and electronic interfaces, including automated insulin dosing systems and diabetes management devices.7
Software and hardware manufacturers have started to specialize in modular pieces of a connected product’s data flow tool chain (Fig. 1).
Regulation of modular components
The FDA regulatory process can often address particular, modular, components along a digital biomarker’s measurement apparatus. The FDA is piloting a program that would “pre-certify” companies and their policies8 in order to offer a streamlined path to market for their product-level approvals and modifications.
Historically, most of the software-products have been categorized as software in a medical device (SiMD), which operates the device and sensors (e.g., firmware). More recently, digital biomarker components are categorized as software as a medical device (SaMD) solutions. SaMDs can perform a medical function without being part of a hardware medical device (e.g., machine-learning based tools in mobile apps8) have novel properties and potential for wider adoption. Definitions distinguishing SaMD from SiMD are evolving. The FDA recently cleared two SaMDs compatible with the Apple Watch for detection of atrial fibrillation. The first is an “over the counter” electrocardiogram app for display of atrial fibrillation9 and the second can notify the user of an irregular rhythm.10 The hardware, the Apple Watch, serves as a component supporting digital biomarker measurement. The Apple Watch over the counter EKG app and irregular rhythm notifications a re FDA cleared as SaMDs.
While modularity enables mixing and matching across a variety of components, it can also be a source of potential error. For example, performance changes to an operating system may affect the speed of computation11 and, for example, corrupt measurement of a Parkinson’s tapping test, which uses a smartphone to calculate a digital biomarker based on timed reaction.
Potential benefits and risks of digital biomarkers
As new modalities are incorporated into connected devices, mobile apps, and software products for patients at home, a natural area of growth in biomarker collection is remote collection of patient-generated measurements. As digital biomarkers are increasingly used as endpoints in clinical trials, we anticipate that clinicians will have a growing number of validated means of gathering clinical insights on patients remotely. However, incorporation of these tools in clinical research is dependent on accelerating the development of new study designs such as those employed in decentralized clinical trials, where many of the trial participant touchpoints occur at home.12 Furthermore, verification and validation of digital biomarkers require a uniquely collaborative approach, with engineering, data science, health information technology, and clinical research functions tightly coordinated as integrated multidisciplinary units.
New digital biomarkers are directly targeting clinical management. The Empatica Embrace Watch, for example, is a “smartband” wrist-device that measures sympathetic nervous impulses at the skin and infers parasympathetic activity from heart rate variation. Its algorithm detects seizures and its associated app suite can alert care providers. There are many examples of digital biomarkers in use or actively under development today, as well as computational metrics with potential for development into digital biomarkers (Table 1). We expect that as digital biomarkers become increasingly used in clinical trials, patient and physician adoption will increase in care and self-management. Digital tools also allow deep collection of data on individual trial participants as well as patients in clinical settings, thereby providing an opportunity for “N of 1” clinical investigations, the cornerstone of evidence generation for personalization of care.
As new platforms for connected technologies emerge, “composite” biomarkers simultaneously incorporating multi-sourced physiologic parameters (e.g., blood pressure, heart rate, and oxygen saturation) and patient-reported information can have higher diagnostic and prognostic value. With more data, an algorithm’s accuracy improves. For example, incorporation of the user’s height, weight, age, and gender increases step count accuracy, because a 25-year-old’s gait is not equivalent to that of an 80-year-old. Availability of contextual information will enable more personalized algorithms (e.g., a step count algorithm designed for a population with late-stage Parkinson’s), and also can combine data sources to create novel measures for conditions that have historically struggled to have meaningful endpoints (e.g., brain and nervous system disorders).
Ensuring privacy and autonomy is paramount as digital biomarkers are incorporated into care and self-management, and incentive programs encouraging wellness and treatment plan adherence. While healthcare delivery organizations using digital biomarkers are of course Health Insurance Portability and Accountability Act (HIPAA) covered entities, when citizens engage directly with the technologies or technology companies, HIPAA does not apply.13 Social media and targeted advertising platforms typically employ end-user-license agreements and terms of service to outline data-sharing rights and privacy policies. However, like informed consent, health data rights should cover a continuum of activities over time. Therefore, data use agreements for digital biomarker development should contain clear statements on conditions for data usage especially for tools that collect near-continuous data, like movement, voice, and other sensitive biometric states.
Connected software products may pose cybersecurity challenges exposing trial participants and patients to privacy breaches or even safety risks. Just as HIPAA and the Common Rule are written to protect a patient’s medical record data and biospecimens, nascent efforts are building protections for digital “specimens”. New frameworks are emerging around the security,14 ethics,13 and informed consent challenges,15 of digital phenotyping technologies.16 One approach—a promising one for tracking security vulnerabilities and issues of performance, transparency, and accuracy—would require software manufacturers to provide, in premarket submission to the FDA, a “Software Bill of Materials” which is analogous to the ingredient list for a medication.17
A challenge to the evaluation of algorithms is that many are proprietary, patented or are trade secrets. For example, the AliveCor, Cardiogram, and Apple atrial detection algorithms and training data sets, for example, are not published. Instead, these companies offer a textual description of what the code does. The Empatica epilepsy monitor, for example, does not readily output raw signal, but instead, only the processed output interpreted by its proprietary algorithm. Hence the impact on a population of a digital biomarker-driven clinical management plan may not always be transparent to patients and clinicians. Testing characteristics, including selected thresholds for action, sensitivity, and specificity should be made transparent to the healthcare professional, regulators, and trial participant and patient users of digital biomarkers.
In recent years, digital biomarker development has begun integration into translational and clinical research. An increasing number of industry and academic investigators are at the leading edge of a new wave of innovations.
To accrue maximum benefit to the patient, a safe and effective digital biomarker ecosystem requires transparency of the algorithms, interoperable components with open interfaces to accelerate the development of new multicomponent systems, high integrity measurement systems. The time is now to give forethought to strong incentive structures to promote the safe and effective use of digital biomarkers. Generally, the verification and validation of a digital biomarker should be not construed as a one-time process, but rather, a learning digital health system should continuously collect data and handle modifications and updates overtime. Industry, researchers, regulators, clinicians, and patients have a joint responsibility to design such a learning system that can improve digital biomarker products, empower patients, and improve health and healthcare delivery for everyone
114th Congress. H.R.34—21st Century Cures Act (2015–2016). https://www.nejm.org/doi/full/10.1056/NEJMp1615745.
Byrom, B. et al. Selection of and evidentiary considerations for wearable devices and their measurements for use in regulatory decision making: recommendations from the ePRO Consortium. Value Health 21, 631–639 (2018).
Zhan, A. et al. Using smartphones and machine learning to quantify Parkinson disease severity: the mobile Parkinson disease score. JAMA Neurol. 75, 876–880 (2018).
Kovatchev, B. The artificial pancreas in 2017: the year of transition from research to clinical practice. Nat. Rev. Endocrinol. 14, 74–76 (2018).
Atreja, A. et al. Mobilizing mHealth Innovation for Real-World Evidence Generation (Duke Margolis Center for Health Policy, https://healthpolicy.duke.edu/sites/default/files/atoms/files/duke-margolis_mhealth_action_plan.pdf, 2018).
Izmailova, E. S., Wagner, J. A. & Perakslis, E. D. Wearable devices in clinical trials: hype and hypothesis. Clin. Pharmacol. Ther. 104, 42–52 (2018).
Parmar, A. FDA Clears New Dexcom CGM that Requires No Patient Calibration Earlier than Expected. https://medcitynews.com/2018/03/fda-clears-new-dexcom-cgm-requires-no-patient-calibration-earlier-expected/ (2018).
Shuren, J., Patel, B. & Gottlieb, S. FDA regulation of mobile medical apps. JAMA 320, 337–338 (2018).
U.S. Food and Drug Administration. ECG App: Electrocardiograph Software for Over-the-Counter Use. https://www.accessdata.fda.gov/cdrh_docs/pdf18/DEN180044.pdf (2018).
U.S. Food and Drug Administration. Irregular Rhythm Notification Feature: Photoplethysmograph Analysis Software for Over-the-Counter Use. https://www.accessdata.fda.gov/cdrh_docs/pdf18/DEN180042.pdf (2018).
Apple. A Message to Our Customers about iPhone Batteries and Performance. https://www.apple.com/iphone-battery-and-performance/ (2017).
Steinhubl, S. R., McGovern, P., Dylan, J. & Topol, E. J. The digitised clinical trial. Lancet 390, 2135 (2017).
Martinez-Martin, N., Insel, T. R., Dagum, P., Greely, H. T. & Cho, M. K. Data mining for health: staking out the ethical territory of digital phenotyping. npj Digit. Med. 1, 68 (2018).
Clinical Trials Transformative Initiative. CTTI Unveils Recommendations for Using Mobile Technologies in Clinical Research. https://www.ctti-clinicaltrials.org/news/ctti-unveils-recommendations-using-mobile-technologies-clinical-research (2018).
Sage Bionetworks. Elements of Informed Consent. http://sagebionetworks.org/in-the-news/elements-informed-consent/ (2018).
Torous, J., Onnela, J. P. & Keshavan, M. New dimensions and new tools to realize the potential of RDoC: digital phenotyping via smartphones and connected devices. Transl. Psychiatry 7, e1053 (2017).
U.S. Food and Drug Administration. Medical Device Safety Action Plan: Protecting Patients, Promoting Public Health. https://www.fda.gov/downloads/AboutFDA/CentersOffices/OfficeofMedicalProductsandTobacco/CDRH/CDRHReports/UCM604690.pdf. Accessed 25 Feb 2019.
Gold, M. et al. Digital technologies as biomarkers, clinical outcomes assessment, and recruitment tools in Alzheimer's disease clinical trials. Alzheimers Dement. 4, 234–242 (2018).
Ritchie, K. et al. The midlife cognitive profiles of adults at high risk of late-onset Alzheimer's disease: the PREVENT study. Alzheimers Dement. 13, 1089–1097 (2017).
Dowling, A. V., Favre, J. & Andriacchi, T. P. Inertial sensor-based feedback can reduce key risk metrics for anterior cruciate ligament injury during jump landings. Am. J. Sports Med. 40, 1075–1083 (2012).
Varela Casal, P. et al. Clinical validation of eye vergence as an objective marker for diagnosis of ADHD in children. J. Atten. Disord. https://doi.org/10.1177/1087054717749931 (2018).
Rajpurkar, P., Hannun, A., Masoumeh, H., Bourn, C. & Ng, A. Cardiologist-level arrhythmia detection with convolutional neural networks. arXiv preprint arXiv:1707.01836, https://arxiv.org/pdf/1707.01836.pdf (2017).
Gosh, S. S. & Ciccarelli, G. Speaking one's mind: vocal biomarkers of depression and Parkinson disease. J. Acoust. Soc. Am. 139, 2193 (2016).
RespApp. Diagnosing Respiratory Disease in Children Using Cough Sounds 2 (SMARTCOUGH-C-2). https://www.clinicaltrials.gov/ct2/show/NCT03392363 (2018).
Sage Bionetworks. Sage Bionetworks in Collaboration with The Michael J. Fox Foundation Announce Winners in the DREAM Parkinson’s Disease Digital Biomarker Challenge. https://www.businesswire.com/news/home/20180117006187/en (2018).
Barrett, M. A. et al. Effect of a mobile health, sensor-driven asthma management platform on asthma control. Ann. Allergy Asthma Immunol. 119, 415–421 (2017).
Wolz, R., Munro, J., Guerrero, R., Hill, D. L. & Dauvilliers, Y. Predicting sleep/wake patterns from 3-axis accelerometry using deep learning. Alzheimer Dement. 13, P1012 (2017).
Moreau, A. et al. Detection of nocturnal scratching movements in patients with atopic dermatitis using accelerometers and recurrent neural networks. IEEE J. Biomed. Health Inform. 22, 1011–1018 (2018).
Mindstrong Health. Mindstrong Health and Takeda Partner to Explore Development of Digital Biomarkers for Mental Health Conditions. https://www.prnewswire.com/news-releases/mindstrong-health-and-takeda-partner-to-explore-development-of-digital-biomarkers-for-mental-health-conditions-300604553.html (2018).
physIQ. physIQ. http://www.physiq.com/resources/.
Bosl, W. J., Tager-Flusberg, H. & Nelson, C. A. EEG analytics for early detection of autism spectrum disorder: a data-driven approach. Sci. Rep. 8, 6828 (2018).
Halcox, J. P. J. et al. Assessment of remote heart rhythm sampling using the AliveCor Heart Monitor to screen for atrial fibrillation: The REHEARSE-AF Study. Circulation 136, 1784–1794 (2017).
Kessing, L. V. Effects of Erythropoietin on Cognition and Neural Activity in Bipolar Disorder (PRETEC-EPO). https://clinicaltrials.gov/ct2/show/NCT03315897 (2017).
Padwal, R. S. Validation of the Omron HEM-9210T by the ANSI/AAMI/ISO 81060-2 with two novel cuffs: wide range and extra-large. Blood Press Monit. 22, 379 (2017).
This work was supported by the PrecisionLink initiative at Boston Children’s Hospital and by a grant from the National Institutes of Health/NIGMS R01GM104303. Andrea Coravos acknowledges support from Harvard-MIT Center for Regulatory Science. For suggesting compelling examples of digital biomarkers for the Table, we gratefully acknowledge Jessie Bakker, MS PhD, Brandon Ballinger, Chris Benko, Brian M. Bot, Jeffrey D. Bower, PhD, Ray Dorsey, MD, Ariel V. Dowling, PhD, Robert Ellis, PhD, Luca Foschini, PhD, Robert Furberg, PhD, Jennifer C. Goldsack, MChem MA MBA, Ankit Gordhandas, Elena Izmailova, PhD, Matthew Johnson, Daniel Karlin, MD MA, Ashley Mateus, PhD, Donald McLaren, PhD, Shyamal Patel, PhD, Barry Peterson, PhD, Ariel Dora Stern, PhD, Iain Simpson, PhD, A. Sofia Warner, MD, William Wood, MD MPH, and Noah Zimmerman, PhD.
A.C. has been developing an open-source pre-competitive digital biomarker catalog, at Elektra Labs, a startup company, with funding from the Harvard Business School, the NSF, and the Mount Sinai School of Medicine. The other authors declare no competing interests.
Publisher’s note: Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
About this article
Cite this article
Coravos, A., Khozin, S. & Mandl, K.D. Developing and adopting safe and effective digital biomarkers to improve patient outcomes. npj Digit. Med. 2, 14 (2019). https://doi.org/10.1038/s41746-019-0090-4
This article is cited by
Decentralized clinical trials and rare diseases: a Drug Information Association Innovative Design Scientific Working Group (DIA-IDSWG) perspective
Orphanet Journal of Rare Diseases (2023)
Psilocybin therapy for treatment resistant depression: prediction of clinical outcome by natural language processing
Annals of Biomedical Engineering (2023)
Enabling endpoint development for interventional clinical trials in individuals with Angelman syndrome: a prospective, longitudinal, observational clinical study (FREESIAS)
Journal of Neurodevelopmental Disorders (2023)
Large multicenter randomized trials in autism: key insights gained from the balovaptan clinical development program
Molecular Autism (2022)