Simple algorithms to calculate asymptotic null distributions of robust tests in case-control genetic association studies in R

Yong Zang, Wing Kam Fung, Gang Zheng

Research output: Contribution to journalArticle

43 Citations (Scopus)

Abstract

The case-control study is an important design for testing association between genetic markers and a disease. The Cochran-Armitage trend test (CATT) is one of the most commonly used statistics for the analysis of case-control genetic association studies. The asymptotically optimal CATT can be used when the underlying genetic model (mode of inheritance) is known. However, for most complex diseases, the underlying genetic models are unknown. Thus, tests robust to genetic model misspecification are preferable to the model-dependant CATT. Two robust tests, MAX3 and the genetic model selection (GMS), were recently proposed. Their asymptotic null distributions are often obtained by Monte-Carlo simulations, because they either have not been fully studied or involve multiple integrations. In this article, we study how components of each robust statistic are correlated, and find a linear dependence among the components. Using this new finding, we propose simple algorithms to calculate asymptotic null distributions for MAX3 and GMS, which greatly reduce the computing intensity. Furthermore, we have developed the R package Rassoc implementing the proposed algorithms to calculate the empirical and asymptotic p values for MAX3 and GMS as well as other commonly used tests in case-control association studies. For illustration, Rassoc is applied to the analysis of case-control data of 17 most significant SNPs reported in four genome-wide association studies.

Original languageEnglish (US)
Pages (from-to)1-24
Number of pages24
JournalJournal of Statistical Software
Volume33
Issue number8
DOIs
StatePublished - Jan 1 2010
Externally publishedYes

Fingerprint

Genetic Association
Robust Tests
Case-control
Null Distribution
Asymptotic distribution
Calculate
Trend Test
Model Selection
Statistics
Case-control Data
Robust Statistics
Model Misspecification
Linear dependence
Case-control Study
p-Value
Asymptotically Optimal
Genome
Monte Carlo Simulation
Genes
Model

Keywords

  • Algorithm
  • Asymptotic distributions
  • Dependence of trend tests
  • Genetic model selection
  • MAX3
  • Robust tests

ASJC Scopus subject areas

  • Software
  • Statistics and Probability
  • Statistics, Probability and Uncertainty

Cite this

Simple algorithms to calculate asymptotic null distributions of robust tests in case-control genetic association studies in R. / Zang, Yong; Fung, Wing Kam; Zheng, Gang.

In: Journal of Statistical Software, Vol. 33, No. 8, 01.01.2010, p. 1-24.

Research output: Contribution to journalArticle

@article{a4d700c34a514065b090262de7093ab2,
title = "Simple algorithms to calculate asymptotic null distributions of robust tests in case-control genetic association studies in R",
abstract = "The case-control study is an important design for testing association between genetic markers and a disease. The Cochran-Armitage trend test (CATT) is one of the most commonly used statistics for the analysis of case-control genetic association studies. The asymptotically optimal CATT can be used when the underlying genetic model (mode of inheritance) is known. However, for most complex diseases, the underlying genetic models are unknown. Thus, tests robust to genetic model misspecification are preferable to the model-dependant CATT. Two robust tests, MAX3 and the genetic model selection (GMS), were recently proposed. Their asymptotic null distributions are often obtained by Monte-Carlo simulations, because they either have not been fully studied or involve multiple integrations. In this article, we study how components of each robust statistic are correlated, and find a linear dependence among the components. Using this new finding, we propose simple algorithms to calculate asymptotic null distributions for MAX3 and GMS, which greatly reduce the computing intensity. Furthermore, we have developed the R package Rassoc implementing the proposed algorithms to calculate the empirical and asymptotic p values for MAX3 and GMS as well as other commonly used tests in case-control association studies. For illustration, Rassoc is applied to the analysis of case-control data of 17 most significant SNPs reported in four genome-wide association studies.",
keywords = "Algorithm, Asymptotic distributions, Dependence of trend tests, Genetic model selection, MAX3, Robust tests",
author = "Yong Zang and Fung, {Wing Kam} and Gang Zheng",
year = "2010",
month = "1",
day = "1",
doi = "10.18637/jss.v033.i08",
language = "English (US)",
volume = "33",
pages = "1--24",
journal = "Journal of Statistical Software",
issn = "1548-7660",
publisher = "University of California at Los Angeles",
number = "8",

}

TY - JOUR

T1 - Simple algorithms to calculate asymptotic null distributions of robust tests in case-control genetic association studies in R

AU - Zang, Yong

AU - Fung, Wing Kam

AU - Zheng, Gang

PY - 2010/1/1

Y1 - 2010/1/1

N2 - The case-control study is an important design for testing association between genetic markers and a disease. The Cochran-Armitage trend test (CATT) is one of the most commonly used statistics for the analysis of case-control genetic association studies. The asymptotically optimal CATT can be used when the underlying genetic model (mode of inheritance) is known. However, for most complex diseases, the underlying genetic models are unknown. Thus, tests robust to genetic model misspecification are preferable to the model-dependant CATT. Two robust tests, MAX3 and the genetic model selection (GMS), were recently proposed. Their asymptotic null distributions are often obtained by Monte-Carlo simulations, because they either have not been fully studied or involve multiple integrations. In this article, we study how components of each robust statistic are correlated, and find a linear dependence among the components. Using this new finding, we propose simple algorithms to calculate asymptotic null distributions for MAX3 and GMS, which greatly reduce the computing intensity. Furthermore, we have developed the R package Rassoc implementing the proposed algorithms to calculate the empirical and asymptotic p values for MAX3 and GMS as well as other commonly used tests in case-control association studies. For illustration, Rassoc is applied to the analysis of case-control data of 17 most significant SNPs reported in four genome-wide association studies.

AB - The case-control study is an important design for testing association between genetic markers and a disease. The Cochran-Armitage trend test (CATT) is one of the most commonly used statistics for the analysis of case-control genetic association studies. The asymptotically optimal CATT can be used when the underlying genetic model (mode of inheritance) is known. However, for most complex diseases, the underlying genetic models are unknown. Thus, tests robust to genetic model misspecification are preferable to the model-dependant CATT. Two robust tests, MAX3 and the genetic model selection (GMS), were recently proposed. Their asymptotic null distributions are often obtained by Monte-Carlo simulations, because they either have not been fully studied or involve multiple integrations. In this article, we study how components of each robust statistic are correlated, and find a linear dependence among the components. Using this new finding, we propose simple algorithms to calculate asymptotic null distributions for MAX3 and GMS, which greatly reduce the computing intensity. Furthermore, we have developed the R package Rassoc implementing the proposed algorithms to calculate the empirical and asymptotic p values for MAX3 and GMS as well as other commonly used tests in case-control association studies. For illustration, Rassoc is applied to the analysis of case-control data of 17 most significant SNPs reported in four genome-wide association studies.

KW - Algorithm

KW - Asymptotic distributions

KW - Dependence of trend tests

KW - Genetic model selection

KW - MAX3

KW - Robust tests

UR - http://www.scopus.com/inward/record.url?scp=77953172073&partnerID=8YFLogxK

UR - http://www.scopus.com/inward/citedby.url?scp=77953172073&partnerID=8YFLogxK

U2 - 10.18637/jss.v033.i08

DO - 10.18637/jss.v033.i08

M3 - Article

VL - 33

SP - 1

EP - 24

JO - Journal of Statistical Software

JF - Journal of Statistical Software

SN - 1548-7660

IS - 8

ER -