A hybrid parametric and empirical likelihood model for evaluating interactions in case-control Studies

Stat Interface. 2016;9(2):147-158. doi: 10.4310/sii.2016.v9.n2.a3. Epub 2015 Nov 4.

Abstract

The case-control design provides an effective way to collect covariate information conditioning on subjects' disease status. The standard logistic regression model can be used to model the interaction between two covariates under such a design, but the prospective logistic regression method might not be the most efficient one when certain appropriate constraints can be imposed on the covariate distribution. We develop a hybrid approach for the statistical inference of the interaction under the case-control design. We use a parametric model to characterize the conditional distribution of one covariate given the another covariate in the control population, while leaving the distribution of the later covariate to be fully nonparametric. A maximum hybrid parametric and empirical likelihood method is adopted for the evaluation of all parameters. The estimator and the associated test derived from the proposed semiparametric model are suitable for evaluating the interaction between two covariates of various types (discrete or continuous). Asymptotic results for both the estimators and the test statistics were established, and the advantages of the proposed method over the existing ones are demonstrated through simulation results and a real data example.

Keywords: Case-control; Genetic association studies; Hybrid parametric and empirical likelihood; Interaction test; Primary 62J12; secondary 62P10.