Scientists develop a computational method to estimate the importance of each letter in the human genome
Cold Spring Harbor, NY - There are 3 billion letters in the human genome, and scientists have endlessly debated how many of them serve a functional purpose. There are those letters that encode genes, our hereditary information, and those that provide instructions about how cells can use the genes. But those sequences are written with a comparative few of the vast number of DNA letters. Scientists have long debated how much of, or even if, the rest of our genome does anything, some going so far as to designate the part not devoted to encoding proteins as "junk DNA."
In work published today in Nature Genetics, researchers at Cold Spring Harbor Laboratory (CSHL) have developed a new computational method to identify which letters in the human genome are functionally important. Their computer program, called fitCons, harnesses the power of evolution, comparing changes in DNA letters across not just related species, but also between multiple individuals in a single species. The results provide a surprising picture of just how little of our genome has been "conserved" by Nature not only across species over eons of time, but also over the more recent time period during which humans differentiated from one another.
"In model organisms, like yeast or flies, scientists often generate mutations to determine which letters in a DNA sequence are needed for a particular gene to function," explains CSHL Professor Adam Siepel. "We can't do that with humans. But when you think about it, Nature has been doing a similar experiment on a very large scale as species evolve. Mutations occur across the genome at random, but important letters are retained by natural selection, while the rest are free to change with no adverse consequence to the organism."
It was this idea that became the basis of their analysis, but it alone wasn't enough. "Massive research consortia, like the ENCODE Project, have provided the scientific community with a trove of information about genomic function over the last few years," says Siepel. "Other groups have sequenced large numbers of humans and nonhuman primates. For the first time, these big data sets give us both a broad and exceptionally detailed picture of both biochemical activity along the genome and how DNA sequences have changed over time."
Siepel's team began by sorting ENCODE consortium data based on combinations of biochemical markers that indicate the type of activity at each position. "We didn't just use sequence patterns. ENCODE provided us with information about where along the full genome DNA is read and how it is modified with biochemical tags," says Brad Gulko, a Ph.D. student in Computer Science at Cornell University and lead author on the new paper. The combinations of these tags revealed several hundred different classes of sites within the genome each having a potentially different role in genomic activity.
The researchers then turned to their previously developed computational method, called INSIGHT, to analyze how much the sequences in these classes had varied over both short and long periods of evolutionary time. "Usually, this, kind of analysis is done comparing different species - like humans, dogs, and mice - which means researchers are looking at changes that occurred over relatively long time periods," explains Siepel. But the INSIGHT model considers the changes among dozens of human individuals and close relatives, such as the chimpanzee, which provides a picture of evolution over much shorter time frames.
The scientists found that, at most, only about 7% of the letters in the human genome are functionally important. "We were impressed with how low that number is," says Siepel. "Some analyses of the ENCODE data alone have argued that upwards of 80% of the genome is functional, but our evolutionary analysis suggests that isn't the case." He added, "other researchers have estimated that similarly small fractions of the genome have been conserved over long time evolutionary periods, but our analysis indicates that the much larger ENCODE-based estimates can't be explained by gains of new functional sequences on the human lineage. We think most of the sequences designated as 'biochemically active' by ENCODE are probably not evolutionarily important in humans."
According to Siepel, this analysis will allow researchers to isolate functionally important sequences in diseases much more rapidly. Most genome-wide studies implicate massive regions, containing tens of thousands of letters, associated with disease. "Our analysis helps to pinpoint which letters in these sequences are likely to be functional because they are both biochemically active and have been preserved by evolution." says Siepel. "This provides a powerful resource as scientists work to understand the genetic basis of disease."
###
See the article here:
Harnessing data from Nature's great evolutionary experiment
- June 11th At Westport, CT: Federal Red Flags, HIPAA Security Rules and Fraud Prevention - November 7th, 2009 [November 7th, 2009]
- Do not learn Dvorak! - November 7th, 2009 [November 7th, 2009]
- You Can’t Solve Problems By Making It Illegal To Have The Problem - November 7th, 2009 [November 7th, 2009]
- A Force Fix for Healthcare - November 7th, 2009 [November 7th, 2009]
- Yahble, HIT, Bubblecon, BIZDEV!, Solid State - November 7th, 2009 [November 7th, 2009]
- 15 things that suck about the Palm Pre - November 7th, 2009 [November 7th, 2009]
- What an Indie Genomics Lab Looks Like - November 7th, 2009 [November 7th, 2009]
- Practice Fusion: Class D Felony? - February 26th, 2010 [February 26th, 2010]
- Practice Fusion Responds - March 7th, 2010 [March 7th, 2010]
- Practice Fusion: Do the math: $44,000 is a LIE - March 10th, 2010 [March 10th, 2010]
- How Much Until Doctors Approve of 23andMe? - March 10th, 2010 [March 10th, 2010]
- Biochemicals as Media, Not Methods - March 10th, 2010 [March 10th, 2010]
- More Practice Fusion Reality Distortion - March 10th, 2010 [March 10th, 2010]
- Same Test Results: 23andMe is Myriad is BRCA is Medicine - March 12th, 2010 [March 12th, 2010]
- BRCA is 23andMe is Myriad is Medicine - March 13th, 2010 [March 13th, 2010]
- Getting Serious About Genomics as Common Medical Practice - March 15th, 2010 [March 15th, 2010]
- The New John Mackey of Genetics: Linda Avey? - March 15th, 2010 [March 15th, 2010]
- Keep the Medical, Well, Medical - March 16th, 2010 [March 16th, 2010]
- If 23andMe shuts down, it won’t be for some mundane reason like the bills weren’t paid - March 16th, 2010 [March 16th, 2010]
- If I Run A Medical Practice, How Do I Use A 23andMe? - March 17th, 2010 [March 17th, 2010]
- 23andMe Contract in Bad Faith - March 19th, 2010 [March 19th, 2010]
- Doctors CANNOT Use 23andMe Due To 23andMe’s Bad Faith Contract - March 20th, 2010 [March 20th, 2010]
- Pathway Compared to 23andMe and Navigenics - March 22nd, 2010 [March 22nd, 2010]
- There’s a Word for “Views Differ” When One View Is The State - March 24th, 2010 [March 24th, 2010]
- Association for Molecular Pathology, et al. v. USPTO, et al. – Opinion - March 29th, 2010 [March 29th, 2010]
- Birth of a Super Villain - April 3rd, 2010 [April 3rd, 2010]
- “Medical Products” like 23andMe must not become the new “Financial Products” - April 4th, 2010 [April 4th, 2010]
- How I Would Apply Genomic Technology In Clinical Use Today - April 5th, 2010 [April 5th, 2010]
- Gmail Enterprise: World’s Best EMR - April 6th, 2010 [April 6th, 2010]
- Brief Primer on Health Law Compliance - April 9th, 2010 [April 9th, 2010]
- Spoiler: You ARE the “Valids” - April 9th, 2010 [April 9th, 2010]
- Rachel Lehmann-Haupt Line by Line Take Down - April 9th, 2010 [April 9th, 2010]
- Is Medicare Bankrupt? What the Hell Is Going On? - April 17th, 2010 [April 17th, 2010]
- The Big Shuffle: Medicare Cuts Rates by 21.3% (but not “technically”) - April 17th, 2010 [April 17th, 2010]
- “Tech Hiring Binge” == “Fear for Your Job, Nerds” - April 18th, 2010 [April 18th, 2010]
- How Bad is Bad? $.20 on the Private Medical Insurance Dollar - April 20th, 2010 [April 20th, 2010]
- Update: How Bad is Bad? It Used to Be $.45 on the Medical Insurance Dollar - April 20th, 2010 [April 20th, 2010]
- World’s Best “EMR” for $1000: Google Spreadsheets + iPad - April 21st, 2010 [April 21st, 2010]
- Don’t Insult Me with your “AOL Keyword” Strategy, Google Health - April 21st, 2010 [April 21st, 2010]
- How to Play LAWGAMES - April 23rd, 2010 [April 23rd, 2010]
- Top 4 Predatory Schemes Encroaching on American Medicine: Part 1 - April 25th, 2010 [April 25th, 2010]
- What’s the Big Deal About iPads? - April 27th, 2010 [April 27th, 2010]
- Got Google Android for Google I/O - April 27th, 2010 [April 27th, 2010]
- Google Enterprise meets HIPAA and HITECH Compliant Laws - April 29th, 2010 [April 29th, 2010]
- Pixels of Accuracy CHALENGE: Diagnostic Medical Imaging - April 29th, 2010 [April 29th, 2010]
- 23andMe Launder AlioGenetics Doesn’t Even Bother to Remove 23andMe Logo - April 30th, 2010 [April 30th, 2010]
- Anthem of CT Denies $600 Until “Subscriber Responds to our Coordination of Benefits Questionnaire” - May 1st, 2010 [May 1st, 2010]
- Apple And Google Team Up To Launch Revolutionary Mobile Health System - May 1st, 2010 [May 1st, 2010]
- Funny Pictures from This Year Building the Medical Practice - May 6th, 2010 [May 6th, 2010]
- Remote Medical Video Monitoring on iPad and iPhone - May 7th, 2010 [May 7th, 2010]
- Google Calendar Overhead Waiting Room Display - May 7th, 2010 [May 7th, 2010]
- Various Whiteboards on Solid State Medical Operations - May 7th, 2010 [May 7th, 2010]
- The Raw Facts about Counsyl - May 7th, 2010 [May 7th, 2010]
- Brawndo: Still Mutilating Thirst, Still Not Yet Sold at the Stop-n-Shop Pharmacy - May 9th, 2010 [May 9th, 2010]
- Video: Google Enterprise to Outsource Medical Administration - May 9th, 2010 [May 9th, 2010]
- Gattaca: “The Matrix” of Genomics - May 11th, 2010 [May 11th, 2010]
- 23andMe Now Diagnoses Fatal Tay-Sachs Disease - May 12th, 2010 [May 12th, 2010]
- Why Was Pathway Targeted for FDA Enforcement and Not 23andMe? - May 15th, 2010 [May 15th, 2010]
- John Dolan on Aging and the Horrifying Conclusion of GWAS - May 16th, 2010 [May 16th, 2010]
- Sam R. Riley Wants To Tell You About Practice Fusion - May 17th, 2010 [May 17th, 2010]
- Response to “Genomic Medicine: Lost” - May 19th, 2010 [May 19th, 2010]
- Death And Taxes: CMS to IRS - May 19th, 2010 [May 19th, 2010]
- Please Stop Antagonizing the AMA - May 26th, 2010 [May 26th, 2010]
- Dan Vorhaus, Attorney At Law, Legally Advises Medical Doctors Can Use 23andMe To Provide Medical Advice - May 28th, 2010 [May 28th, 2010]
- Singularity Summit 2010 in San Francisco to Explore Intelligence Augmentation - June 7th, 2010 [June 7th, 2010]
- OpenPCR: DNA amplification for anyone - June 10th, 2010 [June 10th, 2010]
- FDA sends letters to 5 genetic testing companies - June 11th, 2010 [June 11th, 2010]
- Amazon And The NIH Team Up To Put Human Genome In The Cloud - March 31st, 2012 [March 31st, 2012]
- ReproSource Comments on New Study Linking Infertility to Genetics - April 25th, 2012 [April 25th, 2012]
- Genetics 101 Part 1: What are genes? - Video - April 30th, 2012 [April 30th, 2012]
- Red Ice Radio - David Icke - Hour 1 - The Manipulation of Humanity - Video - April 30th, 2012 [April 30th, 2012]
- Genetics Part 5: Human Genetic Disorders - Video - April 30th, 2012 [April 30th, 2012]
- C2CAM - The Nephilim, Genetic Manipulation - April 30th, 2012 [April 30th, 2012]
- Human Nature talk with Robert Sapolsky, Gabor Mate, James Gilligan, Richard Wilkinson - Video - April 30th, 2012 [April 30th, 2012]
- Human Genetic Diseases - Video - April 30th, 2012 [April 30th, 2012]
- Alien Scientist on Genetics, Implants - April 30th, 2012 [April 30th, 2012]
- Research and Markets: Genetics, 6th Edition International Student Version Continues To Educate Today's Students for ... - May 4th, 2012 [May 4th, 2012]
- Myriad Genetics to Present at the Bank of America Merrill Lynch 2012 Health Care Conference - May 4th, 2012 [May 4th, 2012]
- Genetics may explain some people's dislike of meat - May 4th, 2012 [May 4th, 2012]
- 'Blond Genes' May Vary Around the World - May 4th, 2012 [May 4th, 2012]