AUTHOR=Stevenson Robert D. , Suomela Todd , Kim Heejun , He Yurong TITLE=Seven Primary Data Types in Citizen Science Determine Data Quality Requirements and Methods JOURNAL=Frontiers in Climate VOLUME=3 YEAR=2021 URL=https://www.frontiersin.org/journals/climate/articles/10.3389/fclim.2021.645120 DOI=10.3389/fclim.2021.645120 ISSN=2624-9553 ABSTRACT=

Data quality (DQ) is a major concern in citizen science (CS) programs and is often raised as an issue by critics of the CS approach. We examined CS programs and reviewed the kinds of data they produce to inform CS communities about strategies for DQ control. From our review of the literature and our experiences with CS, we identified seven primary types of data contributions. Citizens can carry instrument packages, invent or modify algorithms, sort and classify physical objects, sort and classify digital objects, collect physical objects, collect digital objects, and report observations. We found that data types were not constrained by subject domains, that a CS program may use multiple types, and that DQ requirements and evaluation strategies vary according to the data types. These types are useful for identifying structural similarities among programs across subject domains. We conclude that blanket criticism of CS data quality is no longer appropriate. In addition to the details of specific programs and variability among individuals, discussions can fruitfully focus on the data types in a program and the specific methods being used for DQ control, as dictated by or appropriate for the type. Programs can reduce doubts about their DQ by becoming more explicit in communicating their data management practices.