Dexter Hadley thinks that artificial intelligence (AI) could do a far better job at detecting breast cancer than doctors do — if screening algorithms could be trained on millions of mammograms. The problem is getting access to such massive quantities of data. Because of privacy laws in many countries, sensitive medical information remains largely off-limits to researchers and technology companies.
So Hadley, a physician and computational biologist at the University of California, San Francisco, is trying a radical solution. He and his colleagues are building a system that allows people to share their medical data with researchers easily and securely — and retain control over it. Their method, which is based on the blockchain technology that underlies the cryptocurrency Bitcoin, will soon be put to the test. By May, Hadley and his colleagues will launch a study to train their AI algorithm to detect cancer using mammograms that they hope to obtain from between three million and five million US women.
The team joins a growing number of academic scientists and start-ups who are using blockchain to make sharing medical scans, hospital records and genetic data more attractive — and more efficient. Some projects will even pay people to use their information. The ultimate goal of many teams is to train AI algorithms on the data they solicit using the blockchain systems.
These efforts come as the public grows increasingly concerned about how tech giants mine and profit from personal data, including some medical information. In 2016, DeepMind, an AI company in London owned by Google’s parent, Alphabet, became mired in controversy after press reports revealed that a branch of the UK National Health Service had given the company access to 1.6 million patient records without adequate consent. The information included names and sensitive information, such as whether a person had sexually transmitted diseases.
“Right now, Google and Facebook have siloed repositories of data about you that you have no control over,” says Andrew Lippman, a computer scientist at the Massachusetts Institute of Technology in Cambridge. “But in the world of medicine, there is no Facebook.” Using blockchain to secure and share decentralized medical information “could be a model of data-identity control" generally, he adds.
Blockchain is a distributed electronic system that records transactions in an expanding chain of ‘blocks’ that are extremely difficult to alter. To break into one block, a hacker would have to tamper independently with all the blocks that link to it — a daunting task.
In Hadley’s study, blockchain will function as a series of switches that guide how data flow between participants, clinicians and researchers. Women taking part will be able to give or revoke access to their data using an online portal, breastwecan.org, that relies on blockchain to secure data stored in the cloud.
The researchers plan to train their AI algorithm on millions of mammograms from healthy women and those with breast cancer. The goal is to classify tumours more precisely than doctors do; physicians miss up to one-quarter of cancers present in mammograms. The accuracy of an algorithm generally grows as it is trained on more, and more varied, data, just as a radiologist’s ability to distinguish tumours improves with experience.
Hadley hopes that women will share their mammograms to improve breast-cancer screening generally — and to gain access to, and control, over information that has customarily been held by clinics. Women who participate in the study will be able to view their scans on breastwecan.org, along with standard clinical interpretations of their risk of breast cancer, based on tissue density, age and other known factors.
Other groups are developing blockchain-based marketplaces to broker data exchanges between individuals and companies or academic researchers — and arrange payment. One such effort is Nebula Genomics, a start-up co-founded by geneticist George Church of Harvard University in Cambridge, Massachusetts. Nebula aims to connect people who want their genomes sequenced with companies willing to pay for that service in return for access to the resulting data. People who pay for their own sequencing will be able to sell access to their genetic information using Nebula; payment will come in the form of digital tokens that can be exchanged for US dollars.
Church says Nebula will ensure that its partner companies keep any promises they make — on issues such as how long a company will retain a person’s data. By contrast, when customers of genomic-sequencing firms such as 23andMe in Mountain View, California, consent to share their data for research, they largely relinquish control over how it is used. Many sequencing firms sell anonymized genetic data in bulk to biotechnology and pharmaceutical firms.
Giving people more control over their medical records could also yield more-immediate health benefits, Lippman says. He and his graduate students have developed a blockchain-based system for sharing health records, called MedRec, that will be tested at Beth Israel Deaconess Medical Center in Boston this year. The system allows users to insert information into their health records, including data from wearable electronic devices such as Fitbits. Clinicians and researchers could use these extra data, with permission, to tailor treatments.
Ultimately, Hadley says, the immense amount of routine medical data that physicians collect can only yield medical advances if the information is shared and studied. “We need to engage people so that they show us their data,” he says. “So we need to think in medicine about the technologies that let us have good data governance, and blockchain happens to be one of them right now.”
Nature 555, 293-294 (2018)
See Editorial: AI diagnostics need attention
Sign up for the daily Nature Briefing email newsletter
Stay up to date with what matters in science and why, handpicked from Nature and other publications worldwide.