Moore Foundation Selects Matthew Stephens for Data-Driven-Discovery Grant

Contact Information

Available for logged-in reporters only

Newswise The Gordon and Betty Moore Foundation today announced the University of Chicagos Matthew Stephens as the recipient of a Moore Investigator in Data-Driven Discovery award. Stephens, a professor in statistics and human genetics, is among 14 scientists from academic institutions nationwide who will receive a total of $21 million over five years to catalyze new data-driven scientific discoveries. Stephens grant is for $1.5 million.

These Moore Investigator Awards are part of a $60 million, five-year Data-Driven Discovery Initiative within the Gordon and Betty Moores Science Program. The initiativeone of the largest privately funded data scientist programs of its kindis committed to enabling new types of scientific breakthroughs by supporting interdisciplinary, data-driven researchers.

Science is generating data at unprecedented volume, variety and velocity, but many areas of science dont reward the kind of expertise needed to capitalize on this explosion of information, said Chris Mentzel, program director of the Data-Driven Discovery Initiative. We are proud to recognize these outstanding scientists, and we hope these awards will help cultivate a new type of researcher and accelerate the use of interdisciplinary, data-driven science in academia.

Stephens is a data scientist who develops statistical and computational analysis tools for the large datasets being generated in the biological sciences. Over the last 15 years, Stephens and his collaborators have made seminal contributions to several problems in population genetics, including identifying structure (clusters) in genetic data, and modeling correlations among genetic variants.

The methods for identifying structure, which Stephens developed with his collaborators (Jonathan Pritchard, Peter Donnelly and Daniel Falush), have driven scientific discoveries in hundreds of organisms. Science papers in 2002, 2003, and 2004 used their method to elucidate the genetic structure of human populations, the Heliobacter pylori stomach bacterium, and domestic dog breeds, respectively. The original paper of Stephens and his collaborators has been cited more than 11,000 times. And, in an example of the potential for cross-fertilization of ideas across disciplines, similar methods have also become popular in machine learning to identify structure in large collections of text documents.

Stephenss work modeling correlations among genetic variants began with a paper in 2003, with graduate student Na Li, PhD03. At the time scientists were grappling with a problem: they had an elegant model (based on work by UChicagos Richard Hudson, professor in ecology & evolution) relating these correlations to the underlying recombination process, which mixes a parents genetic material before transmission to an offspring, but these models were computationally intractable for even small datasets.

Li and Stephens solved this problem by simplifying the model enough to make it computationally tractable. This new simplified model has found widespread application in the last 10 years: Stephens, Li and their collaborators used their model to demonstrate that most recombination in human genes occurs in relatively narrow channels (``hotspots) rather than being spread uniformly. And thousands of scientists conducting genomic studies now make regular use of these models to impute missing genotype data to substantially improve the efficacy of their studies.

Stephenss recent focus has been on developing methods for data integration combining information on multiple related processes. An important application of these methods which he has been pursuing with collaborators, including Yoav Gilad, Jonathan Pritchard and Anna DiRienzo - is to combine information measured on cellular processes, such as gene expression, and transcription factor binding, to help understand the mechanisms of genetic regulation within living cells.

Link:

Moore Foundation Selects Matthew Stephens for Data-Driven-Discovery Grant

Related Posts

Comments are closed.