AUTHOR=Magnotti John F., Ma Wei Ji , Beauchamp Michael S. TITLE=Causal inference of asynchronous audiovisual speech JOURNAL=Frontiers in Psychology VOLUME=4 YEAR=2013 URL=https://www.frontiersin.org/journals/psychology/articles/10.3389/fpsyg.2013.00798 DOI=10.3389/fpsyg.2013.00798 ISSN=1664-1078 ABSTRACT=
During speech perception, humans integrate auditory information from the voice with visual information from the face. This multisensory integration increases perceptual precision, but only if the two cues come from the same talker; this requirement has been largely ignored by current models of speech perception. We describe a generative model of multisensory speech perception that includes this critical step of determining the likelihood that the voice and face information have a common cause. A key feature of the model is that it is based on a principled analysis of how an observer should solve this causal inference problem using the asynchrony between two cues and the reliability of the cues. This allows the model to make predictions about the behavior of subjects performing a synchrony judgment task, predictive power that does not exist in other approaches, such as