In 2007, DNA pioneer James Watson became the first person to have his entire genome sequenced, making all of his 6 billion base pairs publicly available for research. Well, almost all of them. He left one spot blank, on the long arm of chromosome 19, where a gene called APOE lives. Certain variations in APOE increase your chances of developing Alzheimer's, and Watson wanted to keep that information private.
Except it wasn't. Researchers quickly pointed out you could predict Watson's APOE variant based on signatures in the surrounding DNA. They didn't actually do it, but database managers wasted no time in redacting another two million base pairs surrounding the APOE gene.
This is the dilemma at the heart of precision medicine: It requires people to give up some of their privacy in service of the greater scientific good. To completely eliminate the risk of outing an individual based on their DNA records, you'd have to strip those records of the same identifying details that make them scientifically useful. But now, computer scientists and mathematicians are working toward an alternative solution. Instead of stripping genomic data, they're encrypting it.
Gill Bejerano leads a developmental biology lab at Stanford that investigates the genetic roots of human disease. In 2013, when he realized he needed more genomic data, his lab joined Stanford Hospital's Pediatrics Department, an arduous process that required extensive vetting and training of all his staff and equipment. This is how most institutions solve the privacy perils of data sharing. They restrict access to the genomes in their possession to a trusted few, and only share obfuscated summary statistics more widely.
So when Bejerano found himself sitting in on a faculty talk given by Dan Boneh, head of the applied cryptography group at Stanford, he was struck with an idea. He scribbled down a mathematical formula for one of the genetic computations he uses often in his work. Afterward, he approached Boneh and showed it to him. "Could you compute these outputs without knowing the inputs?" he asked. "Sure," said Boneh.
Last week, Bejerano and Boneh published a paper in Science that did just that. Using a cryptographic "genome cloaking" method, the scientists were able to do things like identify responsible mutations in groups of patients with rare diseases and compare groups of patients at two medical centers to find shared mutations associated with shared symptoms, all while keeping 97 percent of each participant's unique genetic information completely hidden. They accomplished this by converting variations in each genome into a linear series of values. That allowed them to conduct any analyses they needed while only revealing genes relevant to that particular investigation.
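To make the idea of computing on a "linear series of values" concrete, here is a minimal Python sketch. It is not the method from the Science paper; it swaps in a much simpler technique, additive secret sharing between two hypothetical non-colluding servers, to find genes that carry a variant in every patient of a small group. The gene names, the modulus, and all function names are assumptions made for illustration.

```python
import secrets

# Hypothetical modulus: any value larger than the number of patients works.
MODULUS = 2**61 - 1


def encode(variant_genes, gene_panel):
    """Turn a patient's set of variant-carrying genes into a 0/1 vector."""
    return [1 if gene in variant_genes else 0 for gene in gene_panel]


def share(vector):
    """Split a vector into two additive shares; each share alone looks random."""
    share_a = [secrets.randbelow(MODULUS) for _ in vector]
    share_b = [(v - a) % MODULUS for v, a in zip(vector, share_a)]
    return share_a, share_b


def add_vectors(vectors):
    """Element-wise sum modulo MODULUS, as each server would do on its shares."""
    return [sum(column) % MODULUS for column in zip(*vectors)]


def genes_shared_by_all(patients, gene_panel):
    """Return the genes that carry a variant in every patient."""
    shares_a, shares_b = [], []
    for variant_genes in patients:
        a, b = share(encode(variant_genes, gene_panel))
        shares_a.append(a)  # would be sent to server A only
        shares_b.append(b)  # would be sent to server B only
    total_a = add_vectors(shares_a)
    total_b = add_vectors(shares_b)
    # Only the combined totals are reconstructed, never a single patient's vector.
    counts = [(x + y) % MODULUS for x, y in zip(total_a, total_b)]
    return [gene for gene, c in zip(gene_panel, counts) if c == len(patients)]


panel = ["GENE_A", "GENE_B", "GENE_C", "GENE_D"]  # placeholder gene names
patients = [{"GENE_A", "GENE_C"}, {"GENE_C", "GENE_D"}, {"GENE_B", "GENE_C"}]
print(genes_shared_by_all(patients, panel))  # ['GENE_C']
```

Note that this toy version reveals the per-gene counts across the whole group, which is more than a carefully designed protocol would disclose. It only illustrates that a useful answer can be computed without any party seeing an individual genome in the clear.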
"Just like programs have bugs, people have bugs," says Bejerano. Finding disease-causing genetic traits is a lot like spotting flaws in computer code. You have to compare code that works to code that doesn't. But genetic data is much more sensitive, and people (rightly) worry that it might be used against them by insurers, or even stolen by hackers. If a patient held the cryptographic key to their data, they could get a valuable medical diagnosis while not exposing the rest of their genome to outside threats. "You can make rules about not discriminating on the basis of genetics, or you can provide technology where you can't discriminate against people even if you wanted to," says Bejerano. "That's a much stronger statement."
The National Institutes of Health has been working toward such a technology since reidentification researchers first began connecting the dots in anonymous genomics data. In 2010, the agency founded a national center for Integrating Data for Analysis, Anonymization and Sharing (iDash), housed on the campus of UC San Diego. And since 2015, iDash has been funding annual competitions to develop privacy-preserving genomics protocols. Another promising approach iDash has supported is something called fully homomorphic encryption, which allows users to run any computation they want on totally encrypted data without losing years of computing time.
Kristen Lauter, head of cryptography research at Microsoft, focuses on this form of encryption, and her team has taken home the iDash prize two years running. Critically, the method encodes the data in such a way that scientists don't lose the flexibility to perform medically useful genetic tests. Unlike previous encryption schemes, Lauter's tool preserves the underlying mathematical structure of the data. That allows computers to do the math that delivers genetic diagnoses, for example, on totally encrypted data. Scientists get a key to decode the final results, but they never see the source.
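What "preserving the mathematical structure" means can be shown with a toy example. The Python sketch below uses the Paillier cryptosystem, which is only additively homomorphic; it is not fully homomorphic encryption and it is not Lauter's tool. The tiny primes, the genotype values, the weights, and the function names are all assumptions chosen for illustration, and the key size is far too small to be secure. It needs Python 3.8 or later for the modular inverse via `pow`.

```python
import math
import secrets


def keygen(p=1009, q=1013):
    """Toy key generation from two tiny primes (insecure, illustration only)."""
    n = p * q
    lam = (p - 1) * (q - 1) // math.gcd(p - 1, q - 1)  # lcm(p - 1, q - 1)
    mu = pow(lam, -1, n)  # valid because the generator is fixed to g = n + 1
    return n, (lam, mu)


def encrypt(n, m):
    """Encrypt integer m (0 <= m < n) under public key n."""
    n_sq = n * n
    while True:
        r = secrets.randbelow(n - 1) + 1  # random value in 1..n-1
        if math.gcd(r, n) == 1:
            break
    return (pow(n + 1, m, n_sq) * pow(r, n, n_sq)) % n_sq


def decrypt(n, private_key, c):
    """Recover the plaintext from ciphertext c."""
    lam, mu = private_key
    x = pow(c, lam, n * n)
    return ((x - 1) // n * mu) % n


def add_encrypted(n, c1, c2):
    """Multiplying ciphertexts adds the hidden plaintexts."""
    return (c1 * c2) % (n * n)


def scale_encrypted(n, c, k):
    """Raising a ciphertext to the power k multiplies the hidden plaintext by k."""
    return pow(c, k, n * n)


def encrypted_score(n, encrypted_genotype, weights):
    """Weighted sum of encrypted allele counts, computed without decrypting."""
    total = encrypt(n, 0)
    for c, w in zip(encrypted_genotype, weights):
        total = add_encrypted(n, total, scale_encrypted(n, c, w))
    return total


n, private_key = keygen()
genotype = [1, 2, 0]  # hypothetical allele counts at three sites
weights = [2, 1, 3]   # hypothetical per-site weights held by the analyst
encrypted_genotype = [encrypt(n, g) for g in genotype]
result = encrypted_score(n, encrypted_genotype, weights)
print(decrypt(n, private_key, result))  # 4
```

The party running `encrypted_score` handles only ciphertexts; only the holder of the private key can decode the final score. A fully homomorphic scheme extends this idea so that both additions and multiplications of hidden values are possible, which is what makes arbitrary computations feasible.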
This is extra important as more and more genetic data moves off local servers and into the cloud. The NIH lets users download human genomic data from its repositories, and in 2014, the agency started letting people store and analyze that data in private or commercial cloud environments. But under NIH's policy, it's the scientists using the data, not the cloud service provider, who are responsible for ensuring its security. Cloud providers can get hacked, or subpoenaed by law enforcement, something researchers have no control over. That is, unless there's a viable encryption for data stored in the cloud.
"If we don't think about it now, in five to 10 years a lot of people's genomic information will be used in ways they did not intend," says Lauter. But encryption is a funny technology to work with, she says. One that requires building trust between researchers and consumers. "You can propose any crazy encryption you want and say it's secure. Why should anyone believe you?"
That's where federal review comes in. In July, Lauter's group, along with researchers from IBM and academic institutions around the world, launched a process to standardize homomorphic encryption protocols. The National Institute of Standards and Technology will now begin reviewing draft standards and collecting public comments. If all goes well, genomics researchers and privacy advocates might finally have something they can agree on.
See the rest here:
To Protect Genetic Privacy, Encrypt Your DNA - WIRED