A dataset quantifying polypharmacy in the United States

Quinn, Katie J.; Shah, Nigam H.

doi:10.1038/sdata.2017.167

Download PDF

Data Descriptor
Open access
Published: 31 October 2017

A dataset quantifying polypharmacy in the United States

Scientific Data volume 4, Article number: 170167 (2017) Cite this article

7404 Accesses
22 Altmetric
Metrics details

Subjects

Abstract

Polypharmacy is increasingly common in the United States, and contributes to the substantial burden of drug-related morbidity. Yet real-world polypharmacy patterns remain poorly characterized. We have counted the incidence of multi-drug combinations observed in four billion patient-months of outpatient prescription drug claims from 2007–2014 in the Truven Health MarketScan® Databases. Prescriptions are grouped into discrete windows of concomitant drug exposure, which are used to count exposure incidences for combinations of up to five drug ingredients or ATC drug classes. Among patients taking any prescription drug, half are exposed to two or more drugs, and 5% are exposed to 8 or more. The most common multi-drug combinations treat manifestations of metabolic syndrome. Patients are exposed to unique drug combinations in 10% of all exposure windows. Our analysis of multi-drug exposure incidences provides a detailed summary of polypharmacy in a large US cohort, which can prioritize common drug combinations for future safety and efficacy studies.

Design Type(s)	observation design • subject-based data analysis objective
Measurement Type(s)	Polypharmacy
Technology Type(s)	digital curation
Factor Type(s)
Sample Characteristic(s)	United States of America

Machine-accessible metadata file describing the reported data (ISA-Tab format)

Polypharmacy, hospitalization, and mortality risk: a nationwide cohort study

Article Open access 03 November 2020

Patterns of patients with polypharmacy in adult population from Korea

Article Open access 27 October 2022

The impact of pharmacogenetic testing in patients exposed to polypharmacy: a scoping review

Article 17 June 2021

Background and Summary

Concomitant use of multiple prescription drugs (‘polypharmacy‘) is increasingly common, with 10% of the population^1,2 and 30% of older adults in the United States taking five or more drugs simultaneously^1–3. Similarly high prevalence is reported in other countries (e.g., the United Kingdom⁴, Sweden⁵, China⁶, Brazil⁷, and India⁸. The prevalence of polypharmacy is driven by high rates of comorbidities (in the United States in 2012, 26% of all adults, and 61% of adults over 65 years of age had two or more chronic conditions⁹), and exacerbated by clinical practices enabling overprescription and insufficient monitoring^10,11. Drug-related morbidity has become a substantial healthcare burden: in the United States, adverse drug reactions are prevalent (causing 4 hospitalizations per 1000 people each year¹⁰), serious (among top 10 common causes of death¹²), and expensive (with associated annual costs estimated at US$30billion¹³ to US$180billion¹⁴).

Exposure to multiple drugs puts patients at additive risk of each single drug’s potential adverse outcomes. In a study of an elderly cohort, the strongest predictor of a potentially harmful medication was the number of drug prescriptions¹⁵. But drugs can also interact to increase risk beyond ‘the sum of the parts’, either by canceling an intended drug action, enhancing existing risks, or creating new risks. It’s estimated that over 20% of adverse drug reactions are due to underlying drug interactions^16,17, and that risk of drug interaction increases with the number of drugs taken¹⁸. However, despite increasing awareness of morbidity related to polypharmacy, multi-drug exposure patterns remain poorly characterized.

Insurance claims records enable analysis of prescription practices in a large patient cohort, even for drug regimens that would be rare in smaller cohorts. The 21st Century Cures Act, enacted in December 2016, recognizes the value of, and mandates the use of, observational patient-experience data, such as insurance claims, for drug surveillance¹⁹. The relative strengths of insurance claims for characterizing population-level drug use are that data reflect prescriptions that are actually dispensed to the patient, and capture prescription information across very large cohorts. As for most sources of drug use data, whether patients actually ingested the drugs remains a limitation.

Here, we publish a dataset of multi-drug exposure incidence in a large insured cohort in the United States, both in terms of drug ingredients and drug classes. We analyze outpatient prescription drug claims from the Truven Health MarketScan® Research Databases, which contain health coverage records for over 100 million employees, dependents, and retirees in the United States from 2007–2014, amounting to over 4 billion months of patient observation. Table 1 summarizes relevant metrics of the database. Prior to our work, Sutherland et al. have reported on co-prescription trends using self-reported data from a small but nationally-representative cohort of 10,000 NHANES participants. The frequency of some drug-pair exposures among elderly participants was also reported². To our knowledge, ours is the first study to quantify the incidence of specific combinations of more than two drugs. Figure 1 summarizes our workflow of processing prescription drug claims into discrete exposure time-windows, and then counting concomitant drug exposures for drug ingredients and ATC-II drug classes.

Table 1 Summary of Truven Health MarketScan Research Database prescription data and drug combination counts.

Full size table

**Figure 1: Data analysis workflow to generate drug combination exposure incidences from prescription drug claims.**

This dataset will benefit researchers who study multi-drug safety or efficacy. The most common multi-drug combinations can be prioritized for subsequent studies of multi-drug safety or efficacy. As a side benefit, by mapping drugs to disease based on indications, the dataset can also provide a summary of comorbidities that drive the observed prescription trends.

This dataset will also benefit practitioners by enabling risk stratification of patients based on the multi-drug combinations they are on; for example, the dataset enables analyses identifying associations of specific drug combinations with health outcomes (such as emergency department visits), which could enable patient risk stratification at the time of medication reconciliation.

Ultimately combined analyses spanning both safety and risk stratification will enable systematic progress towards safe polypharmacy. With roughly 40 million individuals experiencing polypharmacy in the US and as many as 10% of all adults worldwide, the existence of such datasets is crucial for a data-driven quantification of which drug combinations are risky.

Methods

Data source

Prescription drug claim data were derived from the Truven Health MarketScan 2007 to 2014 Commercial Claims and Encounters and Medicare Supplemental and Coordination of Benefits Databases, which were accessed via the Stanford Center for Population Health Sciences Data Core. Further details about the Data Core and its operating procedures are available at http://med.stanford.edu/phs/phs-data-center.html. These databases represent the health services of approximately 100 million employees, dependents, and retirees in the United States with primary or Medicare supplemental coverage through privately insured fee-for-service, point-of-service, or capitated health plans. The Commercial Claims and Encounters, and Medicare Supplemental and Coordination of Benefits populations comprise 90 and 10% of the total cohort respectively, with a mean age of 33 and 73 years and gender fraction of 49 and 45% male, creating a combined cohort with a mean age of 37 years and 48% male. Patients are observed for a median of 29 months. We analyzed the outpatient prescription drug claims of 150 million patients over 4.3 billion months of patient enrollment. We focus on outpatient prescriptions since 90% of all prescriptions are in the outpatient setting²⁰, and inpatient drug treatment patterns differ substantially.

Drug list curation and drug mapping

We curated a set of 1429 drugs, defined by RxNORM ingredient level, by beginning with the 1165 drug ingredients occurring in all of DrugBank, RxNORM, and UMLS (previously curated in our lab²¹), adding drug ingredients occurring commonly in the Truven Health MarketScan Databases, and removing vaccines, and vitamins and minerals (which are more often obtained over-the-counter than by prescription).

Of this set of 1429 drug ingredients, 864 occur in the Truven Health MarketScan Database prescription claims. Drugs are identified in prescriptions using National Drug Codes (NDCs). We built a mapping of NDCs to RxNORM-defined drug ingredients using the NLM’s RxMix API to match on strings containing drug names, first with strict matching (which maps the majority of NDCs with very low error), then with approximate string matching (which maps the remaining NDCs but requires manual validation of string matches).

Combination drugs (e.g., Norco) count as exposure to each drug ingredient (e.g., acetaminophen and hydrocodone). The approximate drug cost per day was calculated as the median payment-per-days-supply for all patient orders of that drug. Drugs are also classified by Anatomical Therapeutic Chemical (ATC) class at the second level (‘therapeutic main group’), by mapping RxNORM identifiers to ATC codes. The 864 unique drug ingredients in the dataset map to 79 unique second-level ATC classes.

Extracting discrete exposure windows from drug prescriptions

To count concomitant drug exposures from prescription claims, we first scanned drug prescriptions into discrete exposure windows. We defined exposure periods as non-overlapping 30-day windows. We selected a 30-day window because it is the most common prescription duration, and thus a natural timescale for prescriptions. A patient is considered exposed to a drug starting from the date the prescription and for the duration of the days-of-supply. If any of those days overlap with a window, the patient is considered exposed in that window (Fig. 2a). This method is computationally efficient and provides a good approximation of concomitant exposure.

**Figure 2: Illustration of conversion of drug prescription date of service and days of supply into discrete exposures.**

As a known limitation, the method overestimates exposures, and thus concomitant exposures, if a patient does not take the prescription for its full duration. As shown in Fig. 2b, non-overlapping windows introduce error when either: 1) non-concomitant prescriptions are separated by less than 30- days yet both overlap with a particular exposure window; or conversely 2) when prescriptions separated by only a few days fall into different exposure windows. We create exposure windows using a simple integer division of patient age-in-days by 30, which is computationally efficient. However this creates partial windows of observation at the beginning and end of each patient’s eligibility period, with a mean duration of 15-days. Given that patients are observed for a median of 29 months, this error is present only in about 5% of windows. However, these 30-day non-overlapping windows simplify computation, with a low error rate, for the purposes of ranking the most common multi-drug exposures.

Using this method, individual patient prescription claims were converted into drug exposures in discrete windows (Fig. 1c), resulting in 5.1 billion drug exposures. This dataset was then used to count concomitant drug exposure.

Counting concomitant multi-drug exposures

There are two ways to count multi-drug exposure: exposure to an ‘exact’ set of drugs (and no additional drugs), and exposure to ‘at least’ a particular set of N-drugs (which may or may not be taken with additional drugs). Each of these variants captures valuable information: ‘exact’ counts quantify the absolute number of concomitant drug exposures, and how many patients are exposed to a precise sets of drugs; ‘at least’ counts are important for knowing all patients exposed to any given drug combination. See example shown in Fig. 1: Concomitant exposure to drug ingredients A (class Q), B (class Q), and C (class R) will contribute a count to A+B+C for ‘exact’ drug ingredient exposure, and each of A, B, C, A+B, A+C, B+C, and A+B+C for ‘at least’ drug exposure.

This method counted 220 million unique ‘exact’ drug combinations exposures, with patients exposed to a median of 2 drugs and 95th-percentile of 8 drugs per window (Fig. 3a). In approximately 10% of windows, patients were exposed to a unique set of drugs, never observed elsewhere in the entire database. This is in agreement with a recent study of treatment pathways that found that 10% of diabetes and depression patients and almost 25% of hypertension patients received therapeutic regimens that were unique within a 250-million-large patient cohort²².

**Figure 3: Distributions of the number of unique concomitant drug exposures per patient-months.**

To count ‘at least’ multi-drug exposures, we created a drug-based index to the summarized 220 million ‘exact’ counts. (This required much less computation than indexing on the original 5.1 billion exposure windows.) We then performed an intersect operation for each ‘at-least’ drug combination of interest. Counting all possible drug combinations is infeasible, and unnecessary since most combinations are never observed. The challenge was to create a list of N-drug combinations likely to have high concomitant exposures. We achieved this with a ‘greedy’ approach of constructing N-drug combinations from N-minus-1 subset drug combinations observed in at least 1000 exposure windows, for each of N=2, 3, 4 and 5 drugs.

An additional metric of interest is the extent to which drug combinations are concomitant beyond what would be expected by chance, given their marginal frequencies. Drug combinations’ overrepresentation was defined as the ratio of the observed-to-expected drug combination incidence in two ways: first (for N>1) based on single-drug frequencies, which gives the overall overrepresentation; and second (for N>2) based on the minimum of each of N permutations of (N-1)+1 drug subsets, which is greater than the single-drug overrepresentation, and gives the overrepresentation of the drug combination beyond its subsets. (As an example, the co-incidence of drugs A+B+C is compared to the incidence expected by chance based on the incidences of drugs A+B and C, A+B and B, and B+C and A. The smallest overrepresentation is reported. The second method is only reported for N>2, because the two methods are equivalent for N=2.)

We repeated these computations of ‘exact’ and ‘at least’ exposure counts, and their overrepresentation, for the 79 second-level ATC drug classes. Second-level ATC drug class names were extracted from the website of the WHO Collaborating Centre for Drug Statistics Methodology. Though one drug ingredient can map to multiple ATC classes, we count only the primary class. Continuing the example in Fig. 1, concomitant exposure to drugs from classes Q and R would be counted as (iii) Q+R for ‘exact’ drug class exposure, and (iv) Q, R, and Q+R for ‘at least’ drug class exposure. (Note that drug classes are counted only once, even if a patient is taking two or more drugs from a particular class). This calculation yielded 39 million unique exact drug class exposures, with patients exposed to a median of 2 and 95th-percentile of 7 drug classes per window (Fig. 3b).

Code availability

Code used to generate the dataset is available on a public github repository (https://github.com/katieq/QuantifyingPolypharmacy). To avoid disclosing the format of the Truven Health Marketscan Databases, the code begins at the step of analyzing prescription data extracted into a data-frame with columns for a patient identifier, drug identifier, age of prescription, and days of supply.

Data Records

The dataset of exposure counts for drug and drug-class combinations is publicly available online at Dryad (Data Citation 1) in 12 tab-delimited data files and a README.txt file. The tab-delimited data files are outlined below and in Table 2. The accompanying README.txt file contains filenames and descriptions of file contents. Data files can be accessed directly by their associated URLs, for example by reading into R with the readr package’s read_tsv function. Table 2 summarizes attributes of the underlying patient claims data in the 2007–2014 Truven Health MarketScan Commercial and Medicare Supplemental Databases.

Table 2 Data Records description.

Full size table

Data Record 1: Drug ingredient combination exposure counts

Data Record 1 contains the exposure incidences for the most common combinations of N=1-to-5 drugs in five files, with one row per combination. All single drugs (N=1) and drug pairs (N=2) are included; for N=3-to-5, drug combinations with at least 10,000 exposure counts are included. Exposure counts below 100 patient-windows are reported as ‘<100’ to protect patient privacy. Each row contains N+5 tab-delimited columns comprising: the name for each drug ingredient, the count of windows with concomitant exposure to this drug combination, potentially concomitant with additional drugs (atleast_exposure_count), the count of windows with concomitant exposure to this drug combination and no additional drugs (exact_exposure_count), the ratio of the two previous columns (fraction_exact), the ratio of the atleast_exposure_count to the total number of observed windows with any prescription (fraction_all_windows), overrepresentation beyond expected based on marginal frequencies of single drugs (observe_per_expect_1s) and (N-1)+1 drug subsets (observe_per_expect_N1), and an estimate of the daily cost of the drug combination (estimate_drug_combo_cost_per_day).

Data Record 2: Drug class combination exposure counts

The contents of Data Record 2 are equivalent to Data Record 1, but for level-II ATC drug classes instead of drug ingredients. The record also contains five files for combinations of N=1-to-5 drug classes, with one row per combination. As for Data Record 1, all single drug class (N=1) and drug class pairs (N=2) are included, and drug class combinations with at least 10,000 exposure counts are included for N=3-to-5; exact exposure counts of less than 100 are reported as ‘>100’. Columns are equivalent to Data Record 1’s, with drug class names replacing drug ingredient names. However daily cost can not be calculated at the drug class level.

Data Record 3: Drug mappings

Data Record 3 contains two tab-delimited files containing the list of 1429 drug ingredients and 93 corresponding ATC level-2 drug classes considered in this study. The drug ingredient file contains one drug per row sorted alphabetically by drug ingredient name, with five columns for the drug ingredient name, RxNorm CUI number, UMLS CUI, Drug Bank ID, ATC code, second-level ATC drug class name, and estimated median cost per day. The drug class mappings file contains one ATC level-2 drug class per row sorted alpha-numerically by ATC class, with two columns for the ATC code and name. (Thus the ATC level-2 class names in the drug ingredient file are redundant, but included for convenience).

Technical Validation

We validated our method in three ways. First, we compared our computational method’s results to manual counting, by reading the dates and days-supply for a random sample of ten patients’ 260 drug prescriptions. The counts of concomitant drug exposures matched perfectly, indicating that our method does indeed accurately extract concomitant drug exposures as intended without errors in arithmetic.

Second, we conducted a sensitivity analysis on the duration of the discrete exposure window, by counting concomitant drug exposures in a random sample of 10,000 patients with an exposure window of 10, 20, 30, 40, 50, 60, and 90 days. Since all prescriptions are considered ‘exposures’ for the entire duration of the window, longer windows slightly increase the mean drug exposure counts (average drug exposure count is 3.1, 3.2 and 3.8 for a 10-, 30-, or 90-day window respectively), and thus increase the relative incidence (i.e., ranking) of prescriptions with short-durations (e.g., antibiotics or short-term pain relief). Thus the choice of exposure window duration does affect the Data Records. Therefore we set the window duration equal to the most common prescription duration. Prescriptions in this claims database are most often for 30 days (50% of all prescriptions), with about 20% for 10 or fewer days. Thus a 30-day window is an appropriate timescale to capture changes in drug exposures.

Finally, we tested the sensitivity to cohort size, by comparing the drug combination incidence ranking obtained using the entire Truven Health MarketScan cohort (approximately 100 million patients) to a random sample of 100,000 (1e5) patients, and 1,000,000 (1e6) patients. As expected, analysis of smaller cohorts obtains a similar ranking of the common drug combinations, but inaccurately estimates the incidence, and thus the ranking, of rare drug combinations. In addition, smaller cohorts overestimate the patients exposed to unique drug combinations, never observed elsewhere in the database: In the complete cohort, 10% of drug combinations are observed only once, but in a cohort of 1e5 patients, that fraction is 20%. Thus while smaller cohorts are sufficient to rank the incidence of common drug combinations, a large patient cohort is required to accurately estimate the incidence of drug combinations.

Limitations

The accuracy of this dataset as a summary of multi-drug exposure incidences in the United States is limited to some extent by the underlying data source and our method of computation. The Truven Health MarketScan Research Databases cohort is commercial claims, and not a fully representative sample of the United States population. We examine drug exposure based on filled prescriptions, but patients may take none or only a fraction of the dispensed drugs. Since there is bias on adherence between drugs, this will introduce bias in the resulting single drug and drug combination incidences. However, billing data from filled prescriptions are more accurate than alternative sources, such as doctor’s notes or prescription orders that may go unfilled.

We only observe and analyze prescription drugs, but over-the-counter drugs and supplements contribute a significant portion of total drug exposures²³. Though patient surveys can offer information about exposure to over-the-counter drugs and supplements, they rely on patient memory, and lack the cohort size and accuracy of prescription records.

Our method scans prescription drug claims according to 30-day exposure windows. Shorter or longer exposure windows would increase or decrease apparent multi-drug exposures respectively. The size of the exposure window affects drug combinations’ relative incidence (i.e., ranking), with longer windows increasing the apparent incidence of combinations including drugs with short prescription durations (e.g., antibiotics or short-term pain relief). However the rankings are agnostic to exposure window duration with our choice of a window of 30-days, to match the days-of-supply of the majority of prescriptions.

Finally, our analysis uses all data from 2007–2014, ignoring the likely non-stationarity of prescription patterns²⁴ (as suggested by the increase in prevalence of polypharmacy from 8% to 15% between the 1999–2000 and 2011–2012 NHANES surveys^23,25). Nonetheless, our multi-drug exposure dataset provides a ranking of common concomitant prescription drug exposures for a large population in the United States.

Usage Notes

This summary of multi-drug prescription patterns in a large cohort enables further analysis of the trends, safety, or efficacy of multi-drug use.

Prioritize common multi-drug combinations for adverse event association analysis

The common multi-drug combinations identified here can now be prioritized for analysis of association with adverse health outcomes. An example illustrating this use case is identifying which of the common 3-drug combinations in Data Record 1 are most overrepresented in the 30-days prior to Emergency Department visits (Table 3). It is important to note that this association tells us nothing about causation, but merely identifies drug combinations taken at increased rates by patients prior to ED visits. Thus, as indicators of patients’ health state, multi-drug combinations could potentially be used to identify patients at risk of an ED visit in the near-future. Similar association analysis can be completed with any desirable or undesirable outcome, in any cohort of interest, for various study designs.

Table 3 Common 3-drug combinations most overrepresented prior to ED visits.

Full size table

Identify drugs used concomitantly with a given drug of interest

This dataset can be used to profile the common co-exposures for any drug ingredient or class of interest. Table 4 shows this analysis for the first line diabetes drug metformin and the opioid oxycodone. The summary was obtained by extracting rows containing metformin or oxycodone from the 2-drug table of Data Record 1, and normalizing by the total exposure counts for metformin or oxycodone. Code to perform this analysis for any drug ingredient or drug class of interest is provided in the R-script get_codrugs.R at the code repository (https://github.com/katieq/QuantifyingPolypharmacy). This could be repeated for larger drug combinations using the 3-, 4-, or 5-drug tables.

Table 4 Summary of the most common and most overrepresented drug ingredient co-exposures with metformin and oxycodone.

Full size table

Stratify patients by risk of adverse health outcomes, based on prescription set

This dataset can now be used to calculate the increased risk of undesirable health outcomes associated with a particular set of prescriptions. Such a risk estimate can be used to stratify patients according to risk of future adverse health events, and then to flag prescription changes that place patients in a higher risk category, or to identify prescription combination changes that lower patients’ risk category. Of course, such risk stratification implies no causality whatsoever; however, such analyses can provide a succinct report on the risks experienced by a cohort of similarly-treated patients.

Additional information

How to cite this article: Quinn, K. J. & Shah, N. H. A dataset quantifying polypharmacy in the United States. Sci. Data 4:170167 doi: 10.1038/sdata.2017.167 (2017).

Publisher’s note: Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

References

Gu, Q., Dillon, C. F. & Burt, V. L. Prescription drug use continues to increase: U.S. prescription drug data for 2007-2008. NCHS Data Brief 1–8 (2010).
Sutherland, J. J. et al. Co-prescription trends in a large cohort of subjects predict substantial drug-drug interactions. PLoS ONE 10, 3 (2015).
Google Scholar
Bushardt, R. L., Massey, E. B., Simpson, T. W., Ariail, J. C. & Simpson, K. N. Polypharmacy: misleading, but manageable. Clin. Interv. Aging 3, 383–389 (2008).
Article Google Scholar
Payne, R. A. The epidemiology of polypharmacy. Clin. Med. 16, 465–469 (2016).
Article Google Scholar
Hovstadius, B., Hovstadius, K., Astrand, B. & Petersson, G. Increasing polypharmacy-an individual-based study of the Swedish population 2005-2008. BMC Clin. Pharmacol. 10, 16 (2010).
Article Google Scholar
Dong, L., Yan, H. & Wang, D. Polypharmacy and its correlates in village health clinics across 10 provinces of Western China. J. Epidemiol. Community Health 64, 549–553 (2010).
Article Google Scholar
Oliveira, M. G., Amorim, W. W., de Jesus, S. R., Rodrigues, V. A. & Passos, L. C. Factors associated with potentially inappropriate medication use by the elderly in the Brazilian primary care setting. Int. J. Clin. Pharm. 34, 626–632 (2012).
Article Google Scholar
Rambhade, S., Chakarborty, A., Shrivastava, A., Patil, U. K. & Rambhade, A. A survey on polypharmacy and use of inappropriate medications. Toxicol. Int 19, 68–73 (2012).
Article Google Scholar
Ward, B. W., Schiller, J. S. & Goodman, R. A. Multiple Chronic Conditions Among US Adults: A 2012 Update. Prev. Chronic Dis 11, E62 (2014).
PubMed PubMed Central Google Scholar
Shehab, N. et al. US Emergency Department Visits for Outpatient Adverse Drug Events, 2013-2014. JAMA 316, 2115–2125 (2016).
Article Google Scholar
Kessler, C., Ward, M. J. & McNaughton, C. D. Reducing Adverse Drug Events: The Need to Rethink Outpatient Prescribing. JAMA 316, 2092–2093 (2016).
Article Google Scholar
Lazarou, J., Pomeranz, B. H. & Corey, P. N. Incidence of adverse drug reactions in hospitalized patients: a meta-analysis of prospective studies. JAMA 279, 1200–1205 (1998).
Article CAS Google Scholar
Sultana, J., Cutroneo, P. & Trifirò, G. Clinical and economic burden of adverse drug reactions. J. Pharmacol. Pharmacother 4, S73–S77 (2013).
Article Google Scholar
Ernst, F. R. & Grizzle, A. J. Drug-related morbidity and mortality: updating the cost-of-illness model. J. Am. Pharm. Assoc. 41, 192–199 (2001).
CAS Google Scholar
Jirón, M. et al. Trends in Prevalence and Determinants of Potentially Inappropriate Prescribing in the United States: 2007 to 2012. J. Am. Geriatr. Soc 64, 788–797 (2016).
Article Google Scholar
Magro, L., Moretti, U. & Leone, R. Epidemiology and characteristics of adverse drug reactions caused by drug-drug interactions. Expert Opin. Drug Saf. 11, 83–94 (2012).
Article CAS Google Scholar
Strandell, J., Bate, A., Lindquist, M. & Edwards, I. R. Swedish, Finnish, Interaction X-referencing Drug-drug Interaction Database (SFINX Group). Drug-drug interactions - a preventable patient safety issue? Br. J. Clin. Pharmacol. 65, 144–146 (2008).
Article Google Scholar
Johnell, K. & Klarin, I. The relationship between number of drugs and potential drug-drug interactions in the elderly: a study of over 600,000 elderly patients from the Swedish Prescribed Drug Register. Drug Saf. 30, 911–918 (2007).
Article Google Scholar
H.R.34 - 21st Century Cures Act, Congress.govhttps://www.congress.gov/bill/114th-congress/house-bill/34/text (2016).
Schumock, G. T. et al. National trends in prescription drug expenditures and projections for 2016. Am. J. Health. Syst. Pharm. 73, 1058–1075 (2016).
Article Google Scholar
Iyer, S. V., Harpaz, R., LePendu, P., Bauer-Mehren, A. & Shah, N. H. Mining clinical text for signals of adverse drug-drug interactions. J. Am. Med. Inform. Assoc 21, 353–362 (2014).
Article Google Scholar
Hripcsak, G. et al. Characterizing treatment pathways at scale using the OHDSI network. Proc. Natl. Acad. Sci. USA 113, 7329–7336 (2016).
Article CAS Google Scholar
Qato, D. M., Wilder, J., Schumm, L. P., Gillet, V. & Alexander, G. C. Changes in Prescription and Over-the-Counter Medication and Dietary Supplement Use Among Older Adults in the United States, 2005 vs 2011. JAMA Intern. Med. 176, 473–482 (2016).
Article Google Scholar
Jung, K. & Shah, N. H. Implications of non-stationarity on predictive modeling using EHRs. J. Biomed. Inform. 58, 168–174 (2015).
Article Google Scholar
Kantor, E. D., Rehm, C. D., Haas, J. S., Chan, A. T. & Giovannucci, E. L. Trends in Prescription Drug Use Among Adults in the United States From 1999-2012. JAMA 314, 1818–1831 (2015).
Article CAS Google Scholar

Data Citations

Quinn, K. J., & Shah, N. H. Dryad Digital Repository https://doi.org/10.5061/dryad.sm847 (2017)

Download references

Acknowledgements

Data for this project were accessed using the Stanford Center for Population Health Sciences Data Core. The PHS Data Core is supported by a National Institutes of Health National Center for Advancing Translational Science Clinical and Translational Science Award (UL1 TR001085) and from Internal Stanford funding. The content is solely the responsibility of the authors and does not necessarily represent the official views of the NIH.

Author information

Authors and Affiliations

Stanford Center for Biomedical Informatics Research, Stanford, CA 94305, California, USA
Katie J. Quinn & Nigam H. Shah

Authors

Katie J. Quinn
View author publications
You can also search for this author in PubMed Google Scholar
Nigam H. Shah
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Katie J. Quinn.

Ethics declarations

Competing interests

The authors declare no competing financial interests.

ISA-Tab metadata

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons license and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/ The Creative Commons Public Domain Dedication waiver http://creativecommons.org/publicdomain/zero/1.0/ applies to the metadata files made available in this article.

Reprints and permissions

About this article

Cite this article

Quinn, K., Shah, N. A dataset quantifying polypharmacy in the United States. Sci Data 4, 170167 (2017). https://doi.org/10.1038/sdata.2017.167

Download citation

Received: 18 August 2017
Accepted: 29 September 2017
Published: 31 October 2017
DOI: https://doi.org/10.1038/sdata.2017.167

Subjects

Abstract

Similar content being viewed by others

Polypharmacy, hospitalization, and mortality risk: a nationwide cohort study

Patterns of patients with polypharmacy in adult population from Korea

The impact of pharmacogenetic testing in patients exposed to polypharmacy: a scoping review

Background and Summary

Methods

Data source

Drug list curation and drug mapping

Extracting discrete exposure windows from drug prescriptions

Counting concomitant multi-drug exposures

Code availability

Data Records

Data Record 1: Drug ingredient combination exposure counts

Data Record 2: Drug class combination exposure counts

Data Record 3: Drug mappings

Technical Validation

Limitations

Usage Notes

Prioritize common multi-drug combinations for adverse event association analysis

Identify drugs used concomitantly with a given drug of interest

Stratify patients by risk of adverse health outcomes, based on prescription set

Additional information

References

References

Data Citations

Acknowledgements

Author information

Authors and Affiliations

Corresponding author

Ethics declarations

Competing interests

ISA-Tab metadata

ISA-Tab metadata

Rights and permissions

About this article

Cite this article

Share this article

Search

Quick links