AUTHOR=Clissa Luca , Lassnig Mario , Rinaldi Lorenzo TITLE=How big is Big Data? A comprehensive survey of data production, storage, and streaming in science and industry JOURNAL=Frontiers in Big Data VOLUME=6 YEAR=2023 URL=https://www.frontiersin.org/journals/big-data/articles/10.3389/fdata.2023.1271639 DOI=10.3389/fdata.2023.1271639 ISSN=2624-909X ABSTRACT=

The contemporary surge in data production is fueled by diverse factors, with contributions from numerous stakeholders across various sectors. Comparing the volumes at play among different big data entities is challenging due to the scarcity of publicly available data. This survey aims to offer a comprehensive perspective on the orders of magnitude involved in yearly data generation by some public and private leading organizations, using an array of online sources for estimation. These estimates are based on meaningful, individual data production metrics and plausible per-unit sizes. The primary objective is to offer insights into the comparative scales of major big data players, their sources, and data production flows, rather than striving for precise measurements or incorporating the latest updates. The results are succinctly conveyed through a visual representation of the relative data generation volumes across these entities.