Projecting the performance of risk prediction based on polygenic analyses of genome-wide association studies

Nilanjan Chatterjee, Bill Wheeler, Joshua Sampson, Patricia Hartge, Stephen J. Chanock, Ju Hyun Park

Research output: Contribution to journalArticlepeer-review

283 Scopus citations

Abstract

We report a new method to estimate the predictive performance of polygenic models for risk prediction and assess predictive performance for ten complex traits or common diseases. Using estimates of effect-size distribution and heritability derived from current studies, we project that although 45% of the variance of height has been attributed to SNPs, a model trained on one million people may only explain 33.4% of variance of the trait. Models based on current studies allow for identification of 3.0%, 1.1% and 7.0% of the populations at twofold or higher than average risk for type 2 diabetes, coronary artery disease and prostate cancer, respectively. Tripling of sample sizes could elevate these percentages to 18.8%, 6.1% and 12.2%, respectively. The utility of polygenic models for risk prediction will depend on achievable sample sizes for the training data set, the underlying genetic architecture and the inclusion of information on other risk factors, including family history.

Original languageEnglish
Pages (from-to)400-405
Number of pages6
JournalNature Genetics
Volume45
Issue number4
DOIs
StatePublished - Apr 2013

Fingerprint

Dive into the research topics of 'Projecting the performance of risk prediction based on polygenic analyses of genome-wide association studies'. Together they form a unique fingerprint.

Cite this