John D. Storey is the William R. Harman '63 and Mary-Love Harman Professor in Genomics at Princeton University.[1] His research is focused on statistical inference of high-dimensional data, particularly genomic data. Storey was the founding director of the Princeton University Center for Statistics and Machine Learning.[2]

John D. Storey
NationalityAmerican
Alma materStanford University Ph.D. (2002)
Known forQ-value
AwardsCOPSS Presidents' Award (2015)
Mortimer Spiegelman Award (2015)
Scientific career
FieldsStatistics
Statistical genetics
Genomics
InstitutionsPrinceton University
Doctoral advisorRobert Tibshirani
Doctoral studentsJeffrey T. Leek
Websitestoreylab.org

Research

edit

Storey's early research focused on the false discovery rate. At the time the false discovery rate had only been studied in the context of sequential p-value methods and it was not yet in widespread use. However, Storey showed that false discovery rates can be approached through point estimation[3] opening up this very active branch of statistics to false discovery rates. He simultaneously proved a result showing that the positive false discovery rate (pFDR) is exactly equal to a Bayesian posterior probability, thereby providing the first direct connection between false discovery rates and Bayesian theory.[4] In these works, he also invented the q-value, which is a false discovery rate analogue of the p-value. Storey then introduced false discovery rates and q-values as widely applicable measures of statistical significance in genomics, shifting the focus from false positive control to false discovery rate control.[5] With Jeff Leek, Storey discovered that "expression heterogeneity", or unmodeled sources of systematic variation in gene expression data, are very prevalent and need to be modeled and corrected when analyzing genome-wide gene expression data.[6] Leek and Storey introduced "surrogate variable analysis", which is a high-dimensional regression model that includes both known and unknown covariates. He has developed a number of methods for estimating this model. Recently, Storey has shifted his focus to population genomics, where he has introduced genome-wide models of allele frequencies, Hardy–Weinberg equilibrium, and F-statistics that hold under arbitrary population structures.

Honors and awards

edit

References

edit
  1. ^ "Faculty chosen for endowed professorships". News, Office of Communications, Princeton University. October 8, 2014.
  2. ^ "Storey to head new Center for Statistics and Machine Learning".
  3. ^ Storey, John D. (2002). "A direct approach to false discovery rates". Journal of the Royal Statistical Society, Series B (Statistical Methodology). 64 (3): 479–498. CiteSeerX 10.1.1.320.7131. doi:10.1111/1467-9868.00346. S2CID 122987911.
  4. ^ Storey, John D. (2003). "The positive false discovery rate: a Bayesian interpretation and the q-value". The Annals of Statistics. 31 (6): 2013–2035. doi:10.1214/aos/1074290335.
  5. ^ Storey, John D.; Tibshirani, Robert (2003). "Statistical significance for genomewide studies". PNAS. 100 (16): 9440–9445. Bibcode:2003PNAS..100.9440S. doi:10.1073/pnas.1530509100. PMC 170937. PMID 12883005.
  6. ^ Leek, Jeff; Storey, John (2007-09-28). "Capturing Heterogeneity in Gene Expression Studies by Surrogate Variable Analysis". PLOS Genetics. 3 (9): 1724–35. doi:10.1371/journal.pgen.0030161. PMC 1994707. PMID 17907809.
  7. ^ "FACULTY AWARD: Six professors named 2011 AAAS fellows".
  8. ^ "IMS Fellows announced « IMS Bulletin".
  9. ^ "Storey receives COPSS Presidents' Award for outstanding statisticians 40 or younger".
  10. ^ "FACULTY AWARD: Storey receives Mortimer Spiegelman Award for health statisticians under 40".
edit