AUTHOR=Kuo Chia-Ling , Duan Yinghui , Grady James TITLE=Unconditional or Conditional Logistic Regression Model for Age-Matched Case–Control Data? JOURNAL=Frontiers in Public Health VOLUME=6 YEAR=2018 URL=https://www.frontiersin.org/journals/public-health/articles/10.3389/fpubh.2018.00057 DOI=10.3389/fpubh.2018.00057 ISSN=2296-2565 ABSTRACT=
Matching on demographic variables is commonly used in case–control studies to adjust for confounding at the design stage. There is a presumption that matched data need to be analyzed by matched methods. Conditional logistic regression has become a standard for matched case–control data to tackle the sparse data problem. The sparse data problem, however, may not be a concern for loose-matching data when the matching between cases and controls is not unique, and one case can be matched to other controls without substantially changing the association. Data matched on a few demographic variables are clearly loose-matching data, and we hypothesize that unconditional logistic regression is a proper method to perform. To address the hypothesis, we compare unconditional and conditional logistic regression models by precision in estimates and hypothesis testing using simulated matched case–control data. Our results support our hypothesis; however, the unconditional model is not as robust as the conditional model to the matching distortion that the matching process not only makes cases and controls similar for matching variables but also for the exposure status. When the study design involves other complex features or the computational burden is high, matching in loose-matching data can be ignored for negligible loss in testing and estimation if the distributions of matching variables are not extremely different between cases and controls.