AUTHOR=Mann Richard P. , Mushtaq Faisal , White Alan D. , Mata-Cervantes Gabriel , Pike Tom , Coker Dalton , Murdoch Stuart , Hiles Tim , Smith Clare , Berridge David , Hinchliffe Suzanne , Hall Geoff , Smye Stephen , Wilkie Richard M. , Lodge J. Peter A. , Mon-Williams Mark TITLE=The Problem with Big Data: Operating on Smaller Datasets to Bridge the Implementation Gap JOURNAL=Frontiers in Public Health VOLUME=4 YEAR=2016 URL=https://www.frontiersin.org/journals/public-health/articles/10.3389/fpubh.2016.00248 DOI=10.3389/fpubh.2016.00248 ISSN=2296-2565 ABSTRACT=
Big datasets have the potential to revolutionize public health. However, there is a mismatch between the political and scientific optimism surrounding big data and the public’s perception of its benefit. We suggest a systematic and concerted emphasis on developing models derived from smaller datasets to illustrate to the public how big data can produce tangible benefits in the long term. In order to highlight the immediate value of a small data approach, we produced a proof-of-concept model predicting hospital length of stay. The results demonstrate that existing small datasets can be used to create models that generate a reasonable prediction, facilitating health-care delivery. We propose that greater attention (and funding) needs to be directed toward the utilization of existing information resources in parallel with current efforts to create and exploit “big data.”