Skip to main navigation Skip to search Skip to main content

Prioritizing individual genetic variants after kernel machine testing using variable selection

  • Qianchuan He*
  • , Tianxi Cai
  • , Yang Liu
  • , Ni Zhao
  • , Quaker E. Harmon
  • , Lynn M. Almli
  • , Elisabeth B. Binder
  • , Stephanie M. Engel
  • , Kerry J. Ressler
  • , Karen N. Conneely
  • , Xihong Lin
  • , Michael C. Wu
  • *Corresponding author for this work
  • Fred Hutchinson Cancer Research Center
  • Harvard University
  • National Institutes of Health
  • Emory University
  • Max Planck Institute of Psychiatry
  • University of North Carolina at Chapel Hill
  • McLean Hospital

Research output: Contribution to journalArticlepeer-review

Abstract

Kernel machine learning methods, such as the SNP-set kernel association test (SKAT), have been widely used to test associations between traits and genetic polymorphisms. In contrast to traditional single-SNP analysis methods, these methods are designed to examine the joint effect of a set of related SNPs (such as a group of SNPs within a gene or a pathway) and are able to identify sets of SNPs that are associated with the trait of interest. However, as with many multi-SNP testing approaches, kernel machine testing can draw conclusion only at the SNP-set level, and does not directly inform on which one(s) of the identified SNP set is actually driving the associations. A recently proposed procedure, KerNel Iterative Feature Extraction (KNIFE), provides a general framework for incorporating variable selection into kernel machine methods. In this article, we focus on quantitative traits and relatively common SNPs, and adapt the KNIFE procedure to genetic association studies and propose an approach to identify driver SNPs after the application of SKAT to gene set analysis. Our approach accommodates several kernels that are widely used in SNP analysis, such as the linear kernel and the Identity by State (IBS) kernel. The proposed approach provides practically useful utilities to prioritize SNPs, and fills the gap between SNP set analysis and biological functional studies. Both simulation studies and real data application are used to demonstrate the proposed approach.

Original languageEnglish
Pages (from-to)722-731
Number of pages10
JournalGenetic Epidemiology
Volume40
Issue number8
DOIs
StatePublished - 1 Dec 2016
Externally publishedYes

Keywords

  • KNIFE
  • genetic association studies
  • kernel machine methods
  • set-based
  • variable selection

Fingerprint

Dive into the research topics of 'Prioritizing individual genetic variants after kernel machine testing using variable selection'. Together they form a unique fingerprint.

Cite this