AUTHOR=Wang Shi-Heng , Chen Wei J., Tsai Yu-Chin , Huang Yung-Hsiang , Hwu Hai-Gwo , Hsiao Chuhsing Kate TITLE=A stochastic inference of de novo CNV detection and association test in multiplex schizophrenia families JOURNAL=Frontiers in Genetics VOLUME=4 YEAR=2013 URL=https://www.frontiersin.org/journals/genetics/articles/10.3389/fgene.2013.00185 DOI=10.3389/fgene.2013.00185 ISSN=1664-8021 ABSTRACT=

The copy number variation (CNV) is a type of genetic variation in the genome. It is measured based on signal intensity measures and can be assessed repeatedly to reduce the uncertainty in PCR-based typing. Studies have shown that CNVs may lead to phenotypic variation and modification of disease expression. Various challenges exist, however, in the exploration of CNV-disease association. Here we construct latent variables to infer the discrete CNV values and to estimate the probability of mutations. In addition, we propose to pool rare variants to increase the statistical power and we conduct family studies to mitigate the computational burden in determining the composition of CNVs on each chromosome. To explore in a stochastic sense the association between the collapsing CNV variants and disease status, we utilize a Bayesian hierarchical model incorporating the mutation parameters. This model assigns integers in a probabilistic sense to the quantitatively measured copy numbers, and is able to test simultaneously the association for all variants of interest in a regression framework. This integrative model can account for the uncertainty in copy number assignment and differentiate if the variation was de novo or inherited on the basis of posterior probabilities. For family studies, this model can accommodate the dependence within family members and among repeated CNV data. Moreover, the Mendelian rule can be assumed under this model and yet the genetic variation, including de novo and inherited variation, can still be included and quantified directly for each individual. Finally, simulation studies show that this model has high true positive and low false positive rates in the detection of de novo mutation.