Recent advances in our understanding of disease mechanisms have led to the development of new drugs that are enabling precision medicine. For example, the co-development of kinase inhibitors that target 'driver mutations' in metastatic non-small-cell lung cancer (NSCLC) with companion diagnostics has led to substantial improvements in the treatment of some patients. However, growing evidence suggests that most patients with metastatic NSCLC and other advanced cancers may not have tumours with single driver mutations. Furthermore, the generation of clinical evidence in genomically diverse and geographically dispersed groups of patients using traditional trial designs and multiple competing therapies is becoming more costly and challenging.
Strategies aimed at creating new efficiencies in clinical evidence generation and extending the benefits of precision medicine to larger groups of patients are driving a transformation from a reductionist approach to drug development (for example, a single drug targeting a driver mutation and traditional clinical trials) to a holistic approach (for example, combination therapies targeting complex multiomic signatures and real-world evidence). This transition is largely fuelled by the rapid expansion in the four dimensions of biomedical big data, which has created a need for greater organizational and technical capabilities (Fig. 1). Appropriate management and analysis of such data requires specialized tools and expertise in health information technology, data science and high-performance computing. For example, efforts to generate clinical evidence using real-world data are being limited by challenges such as capturing clinically relevant variables from vast volumes of unstructured content (such as physician notes) in electronic health records and organizing various structured data elements that are primarily designed to support billing rather than clinical research. So, new standards and quality-control mechanisms are needed to ensure the validity of the design and analysis of studies based on electronic health records.
As a point of convergence for biomedical data intended to support medical product approval decisions, the FDA has taken several steps in recent years to maximize its capabilities for incorporating big data into regulatory decision making. Precision FDA, a research and development portal for validating bioinformatics approaches for processing next-generation sequencing data, and the Sentinel Initiative, a post-market safety surveillance programme based on data from health insurers, are among the examples (see Further information).
Recently, the FDA's Office of Hematology and Oncology Products, in collaboration with the US Department of Health and Human Services' Innovation, Design, Entrepreneurship and Action (IDEA) laboratory, launched the Information Exchange and Data Transformation (INFORMED) initiative to build technical and organizational infrastructure for big data analytics (see Further information and Supplementary information S1 (table)). As a multi-disciplinary effort that draws on the expertise of a range of clinicians, data scientists and entrepreneurs-in-residence, INFORMED has created a new model for collaborative oncology regulatory science research. The activities under INFORMED focus on increasing capabilities for a broad range of data sets, from clinical trials to electronic health records and biosensor technologies. The effort includes a growing portfolio of projects and research collaborations aimed at improving interoperability of disparate information systems, supporting digitization of biomedical content still residing in analogue formats (such as paper and PDF files) and testing new technical frameworks such as blockchain to facilitate secure exchange of health data at scale (Nat. Rev. Drug Discov. 15, 670–671; 2016). It is our hope that by contributing to the advancement of the field of oncology through robust and collaborative regulatory science, we can support transformations that are tuned to the changing needs of cancer patients, together with innovators in the community.
The authors declare no competing financial interests.
Supplementary information S1 (table)
Mission statement, primary objectives, and core capabilities of the Information Exchange and Data Transformation (INFORMED) Initiative at FDA's oncology center of excellence (PDF 104 kb)
Rights and permissions
About this article
Cite this article
Khozin, S., Kim, G. & Pazdur, R. From big data to smart data: FDA's INFORMED initiative. Nat Rev Drug Discov 16, 306 (2017). https://doi.org/10.1038/nrd.2017.26
This article is cited by
A cross-sectional survey on the early impact of COVID-19 on the uptake of decentralised trial methods in the conduct of clinical trials
External Comparator Groups Derived from Real-world Data Used in Support of Regulatory Decision Making: Use Cases and Challenges
Current Epidemiology Reports (2022)
Is surgical axillary staging necessary in women with T1 breast cancer who are treated with breast-conserving therapy?
Cancer Communications (2019)
Artificial intelligence and machine learning in clinical development: a translational perspective
npj Digital Medicine (2019)
INFORMED: an incubator at the US FDA for driving innovations in data science and agile technology
Nature Reviews Drug Discovery (2018)