Making the Most of Clumping and Thresholding for Polygenic Scores. - Research - Institut Pasteur

A little guide for advanced search:

Tip 1. You can use quotes "" to search for an exact expression.
Example: "cell division"
Tip 2. You can use + symbol to restrict results containing all words.
Example: +cell +stem
Tip 3. You can use + and - symbols to force inclusion or exclusion of specific words.
Example: +cell -stem

e.g. searching for members in projects tagged cancer

Search for

Count

IN

OUT

Content 1

Content Type

member
team
department
center
program_project
nrc
whocc
project
software
tool
patent

Keywords

Positions

Administrative Staff
Assistant Professor
Associate Professor
Clinical Research Assistant
Clinical Research Nurse
Clinician Researcher
Department Manager
Dual-education Student
Full Professor
Honorary Professor
Lab assistant
Master Student
MD-PhD Student
Medical Staff
Non-permanent Researcher
Nursing Staff
Permanent Researcher
Pharmacist
PhD Student
Physician
Post-doc
Prize
Project Manager
Research Associate
Research Engineer
Retired scientist
Technician
Undergraduate Student
Veterinary
Visiting Scientist

Appointments

Deputy Director of Center
Deputy Director of Department
Deputy Director of National Reference Center
Deputy Head of Facility
Director of Center
Director of Department
Director of Institute
Director of National Reference Center
Group Leader
Head of Facility
Head of Operations
Head of Structure
Honorary President of the Departement
Labex Coordinator

Content 2

Content Type

member
team
department
center
program_project
nrc
whocc
project
software
tool
patent

Keywords

Positions

Administrative Staff
Assistant Professor
Associate Professor
Clinical Research Assistant
Clinical Research Nurse
Clinician Researcher
Department Manager
Dual-education Student
Full Professor
Honorary Professor
Lab assistant
Master Student
MD-PhD Student
Medical Staff
Non-permanent Researcher
Nursing Staff
Permanent Researcher
Pharmacist
PhD Student
Physician
Post-doc
Prize
Project Manager
Research Associate
Research Engineer
Retired scientist
Technician
Undergraduate Student
Veterinary
Visiting Scientist

Appointments

Deputy Director of Center
Deputy Director of Department
Deputy Director of National Reference Center
Deputy Head of Facility
Director of Center
Director of Department
Director of Institute
Director of National Reference Center
Group Leader
Head of Facility
Head of Operations
Head of Structure
Honorary President of the Departement
Labex Coordinator

Search

EN
FR

Go back

Scroll to top

Human SNORA31 variations impair cortical neuron-intrinsic immunity to HSV-1 and underlie herpes simplex encephalitis

High-Throughput Crystallization Pipeline at the Crystallography Core Facility of the Institut Pasteur

Scientific Fields

Diseases

Organisms

Applications

Technique

Published in American journal of human genetics - 05 Dec 2019

Privé F, Vilhjálmsson BJ, Aschard H, Blum MGB,

Link to Pubmed [PMID] – 31761295

Link to DOI – S0002-9297(19)30422-710.1016/j.ajhg.2019.11.001

Am J Hum Genet 2019 12; 105(6): 1213-1221

Polygenic prediction has the potential to contribute to precision medicine. Clumping and thresholding (C+T) is a widely used method to derive polygenic scores. When using C+T, several p value thresholds are tested to maximize predictive ability of the derived polygenic scores. Along with this p value threshold, we propose to tune three other hyper-parameters for C+T. We implement an efficient way to derive thousands of different C+T scores corresponding to a grid over four hyper-parameters. For example, it takes a few hours to derive 123K different C+T scores for 300K individuals and 1M variants using 16 physical cores. We find that optimizing over these four hyper-parameters improves the predictive performance of C+T in both simulations and real data applications as compared to tuning only the p value threshold. A particularly large increase can be noted when predicting depression status, from an AUC of 0.557 (95% CI: [0.544-0.569]) when tuning only the p value threshold to an AUC of 0.592 (95% CI: [0.580-0.604]) when tuning all four hyper-parameters we propose for C+T. We further propose stacked clumping and thresholding (SCT), a polygenic score that results from stacking all derived C+T scores. Instead of choosing one set of hyper-parameters that maximizes prediction in some training set, SCT learns an optimal linear combination of all C+T scores by using an efficient penalized regression. We apply SCT to eight different case-control diseases in the UK biobank data and find that SCT substantially improves prediction accuracy with an average AUC increase of 0.035 over standard C+T.

Published on: 05 Dec 2019 • Modified on: 26 Jan 2022