Frontiers | Automated Data Curation and Data Governance Automation

About this Research Topic

Submission closed

Background

While there have been great advances in data analytics in recent years including distributed computing for Big Data, machine learning including deep learning, less attention has been paid to the data curation and data governance processes supporting data analytics. A common complaint is that data scientists spend 80% of their time preparing data for analysis and only 20% of the time in the actual analysis. This is because the tools and methods used in data preparation require a substantial amount of human time and effort for tasks such as data quality analysis, data cleaning, data enhancement, data standardization, data integration, testing, and validation. Data preparation is just one phase of data curation, the management of data through its entire life cycle from acquisition to disposal. Furthermore, as organizations realize the value of their data, they are implementing data governance programs to ensure they have a complete inventory of their data and its contents, and a way to exercise authority and accountability over data as an organizational asset. As with data curation, most data governance processes require substantial human time and effort to be effective.

The aim of this Research Topic is to examine the Automated Data Curation and Data Governance Automation research to develop unsupervised methods and techniques to automate data curation and data governance processes to the greatest extent possible. The goal of fully automating data cleaning and integration has been labeled as a “data washing machine” by Richard Wang with some initial development led by John R. Talburt. Similar work has begun in the industry to develop methods for automating many of the data governance tasks, such as “positive data control” for maintaining the enterprise data catalog. Replacing human analysis with scalable, unsupervised automation of these processes will not be easy but necessary to keep pace with the increasing volume and variety of data driving modern decision systems.

Submissions to this Research Topic can address but are not limited to the following themes within the context of automated methods for:

• Data quality assessment and metrics
• Generating data quality validation rules
• Data cleansing (data washing machines)
• Spelling correction
• Missing value imputation
• Data standardization
• Multi-source data integration
• Entity and identity resolution
• Data governance policy and standards conformance
• Metadata generation
• Data catalog initialization and setup
• Updating data catalogs and business glossaries
• Data operations logging and data provenance
• Positive data control
• Generating data products
• Data as a service
• Data archiving, deletion, and disposal

Keywords: Data curation, data governance, data life cycle, data process automation, unsupervised data operations

Important note: All contributions to this Research Topic must be within the scope of the section and journal to which they are submitted, as defined in their mission statements. Frontiers reserves the right to guide an out-of-scope manuscript to a more suitable section or journal at any stage of peer review.

Topic editors

Frequently asked questions

Frontiers' Research Topics are collaborative hubs built around an emerging theme.Defined, managed, and led by renowned researchers, they bring communities together around a shared area of interest to stimulate collaboration and innovation.
Unlike section journals, which serve established specialty communities, Research Topics are pioneer hubs, responding to the evolving scientific landscape and catering to new communities.
The goal of Frontiers' publishing program is to empower research communities to actively steer the course of scientific publishing. Our program was implemented as a three-part unit with fixed field journals, flexible specialty sections, and dynamically emerging Research Topics, connecting communities of different sizes and maturity.
Research Topics originate from the scientific community. Many of our Research Topics are suggested by existing editorial board members who have identified critical challenges or areas of interest in their field.
As an editor, Research Topics will help you build your journal, as well as your community, around emerging, cutting-edge research. As research trailblazers, Research Topics attract high-quality submissions from leading experts all over the world.
A thriving Research Topic can potentially evolve into a new specialty section if there is sustained interest and a growing community around it.
Each Research Topic must be approved by the specialty chief editor, and it falls under the editorial oversight of our editorial boards, supported by our in-house research integrity team. The same standards and rigorous peer review processes apply to articles published as part of a Research Topic as for any other article we publish.
In 2023, 80% of the Research Topics we published were edited or co-edited by our editorial board members, who are already familiar with their journal's scope, ethos, and publishing model. All other topics are guest edited by leaders in their field, each vetted and formally approved by the specialty chief editor.
Publishing your article within a Research Topic with other related articles increases its discoverability and visibility, which can lead to more views, downloads, and citations. Research Topics grow dynamically as more published articles are added, causing frequent revisiting, and further visibility.
As Research Topics are multidisciplinary, they are cross-listed in several fields and section journals – increasing your reach even more and giving you the chance to expand your network and collaborate with researchers in different fields, all focusing on expanding knowledge around the same important topic.
Our larger Research Topics are also converted into ebooks and receive social media promotion from our digital marketing team.
 
Frontiers offers multiple article types, but it will depend on the field and section journals in which the Research Topic will be featured. The available article types for a Research Topic will appear in the drop-down menu during the submission process.
Check available article types here 
Yes, we would love to hear your ideas for a topic. Most of our Research Topics are community-led and suggested by researchers in the field. Our in-house editorial team will contact you to talk about your idea and whether you’d like to edit the topic. If you’re an early-stage researcher, we will offer you the opportunity to coordinate your topic, with the support of a senior researcher as the topic editor. 

Suggest your topic here 
A team of guest editors (called topic editors) lead their Research Topic. This editorial team oversees the entire process, from the initial topic proposal to calls for participation, the peer review, and final publications.
The team may also include topic coordinators, who help the topic editors send calls for participation, liaise with topic editors on abstracts, and support contributing authors. In some cases, they can also be assigned as reviewers.
As a topic editor (TE), you will take the lead on all editorial decisions for the Research Topic, starting with defining its scope. This allows you to curate research around a topic that interests you, bring together different perspectives from leading researchers across different fields and shape the future of your field. 
 
You will choose your team of co-editors, curate a list of potential authors, send calls for participation and oversee the peer review process, accepting or recommending rejection for each manuscript submitted.
As a topic editor, you're supported at every stage by our in-house team. You will be assigned a single point of contact to help you on both editorial and technical matters. Your topic is managed through our user-friendly online platform, and the peer review process is supported by our industry-first AI review assistant (AIRA).
If you’re an early-stage researcher, we will offer you the opportunity to coordinate your topic, with the support of a senior researcher as the topic editor. This provides you with valuable editorial experience, improving your ability to critically evaluate research articles and enhancing your understanding of the quality standards and requirements for scientific publishing, as well as the opportunity to discover new research in your field, and expand your professional network.
Yes, certificates can be issued on request. We are happy to provide a certificate for your contribution to editing a successful Research Topic.
Research Topics thrive on collaboration and their multi-disciplinary approach around emerging, cutting-edge themes, attract leading researchers from all over the world.
As a topic editor, you can set the timeline for your Research Topic, and we will work with you at your pace. Typically, Research Topics are online and open for submissions within a few weeks and remain open for participation for 6 – 12 months. Individual articles within a Research Topic are published as soon as they are ready.
Find out more about our Research Topics
Our fee support program ensures that all articles that pass peer review, including those published in Research Topics, can benefit from open access – regardless of the author's field or funding situation.
Authors and institutions with insufficient funding can apply for a discount on their publishing fees. A fee support application form is available on our website.
In line with our mission to promote healthy lives on a healthy planet, we do not provide printed materials. All our articles and ebooks are available under a CC-BY license, so you can share and print copies.

Share on

Frontiers in Big Data

Data Mining and Management

Impact

61kTopic views
48kArticle views
11kArticle downloads

View impact

Automated Data Curation and Data Governance Automation

About this Research Topic

Background

Topic editors

john r talburt

lisa ehrlinger

justin magruder

Frequently asked questions

Frontiers in Big Data

Data Mining and Management