- 1State Key Laboratory of Agricultural Genomics, Key Laboratory of Genomics, Ministry of Agriculture, BGI Research, Shenzhen, China
- 2School of Tropical Agriculture and Forestry (School of Agriculture and Rural Affairs, School of Rural Revitalization), Hainan University, Haikou, Hainan, China
- 3Hainan Yazhou Bay Seed Laboratory, Sanya Nanfan Researh, Sanya, China
- 4Fang Zhiyuan Academician Team Innovation Center of Hainan Province, Haikou, China
- 5School of Life Sciences and State Key Laboratory of Agrobiotechnology, The Chinese University of Hong Kong, Shatin, China
- 6College of Agriculture, Food and Natural Resources (CAFNR), Division of Plant Sciences & Technology, University of Missouri, Columbia, MO, United States
Editorial on the Research Topic
Bioinformatics, big data and agriculture: a challenge for the future
The application of bioinformatics and big data analytics to agriculture represents an exciting frontier with an immense potential (Hesami et al., 2022; Khan et al., 2022). These technologies can help drive increased productivity, sustainability, and resilience in agriculture (Sahu and Liu, 2023). However, significant challenges remain in effectively leveraging bioinformatics and big data in the agricultural sector. A key challenge is developing analytical techniques suited for the massive, complex datasets generated by agricultural systems (Aubry, 2019; Hesami et al., 2022). Sophisticated algorithms and methods are required to derive meaningful insights from diverse data sources like genomic sequences, soil nutrient profiles, and climate patterns (Chen et al., 2020; Wang et al., 2023a). Translating these insights into practical solutions will also require an interdisciplinary approach, integrated expertise in biology, computer science, engineering, and social sciences.
Another major obstacle is the lack of data sharing and collaboration in the highly competitive agricultural industry. While proprietary datasets can provide advantages to individual companies, open data platforms and public-private partnerships will be essential for fully realizing the benefits of big data in agriculture (Aubry, 2019; Lawniczak et al., 2022). Improved and efficient data integration and communication across the public and private sectors is critical moving forward. Beyond technical hurdles, deploying big data solutions on a large-scale requires tackling pervasive digital disparities (Aubry, 2019). Subsistence farmers lack of skills in utilizing sophisticated digital technologies. Considerable investments and reform policy are needed to make precision agriculture tools economically viable and accessible to those who need them most. This Research Topic highlights advances in bioinformatics and data science in agriculture in all its diversity, covering both method developments and their applications in agriculture.
The first study by Liu et al. demonstrates the potential of applying bioinformatics and machine learning techniques to analyze agricultural microbiome data for detecting crop diseases. Using a meta-analysis approach, the author(s) integrated microbiome datasets from hundreds of citrus plant samples across different geographic regions and health statuses. They discovered distinct changes in microbial diversity and composition associated with huanglongbing (HLB) disease, identified bacterial biomarkers in the leaves and roots that could accurately predict HLB infection. The random forest and bagging models achieved nearly 100% accuracy in classifying disease states based on select microbiome features. These findings highlight the power of bioinformatics and big data mining to extract valuable insights from complex agricultural datasets. The integration of next-generation sequencing and advanced analytical methods promises to enable early, non-invasive diagnosis of crop diseases like HLB before symptoms manifest.
In the next study, Yang et al. introduce a cloud-based platform called the Variety Test Platform (VTP) to improve the efficiency and quality of crop variety testing. Variety trials are critical for evaluating new crop breeds, but managing the testing process across locations is challenging. The VTP provides tools to standardize trial design, data collection, analysis, and reporting. Key features include configurable trait standards, mobile apps for recording observations, automated design and analysis, and integrated data quality checks. The system was used to manage trials for five major crops across thousands of sites in China. Overall, the integrated platform increased productivity, ensured uniform protocols, and generated shareable, analysis-ready variety of trial data. The VTP demonstrates how modern IT can transform management of the varietal evaluation process to accelerate access to improved crop breeds.
Shen et al. identified and analyzed 26 NHX genes across three Cucurbita species - C. moschata, C. maxima, and C. pepo. Promoter analysis revealed cis-regulatory elements related to salt stress responses. Transcriptome data showed NHX genes were differentially expressed under salt stress in C. moschata and C. maxima leaves. CmoNHX1 was downregulated with salt treatment. Heterologous expression of CmoNHX1 in Arabidopsis decreased salt tolerance, contrary to expectations. The results provide insights into the evolution and salt stress responses of the NHX gene family in Cucurbita. Overall, the study lays a foundation for breeding salt-tolerant Cucurbita varieties through modulating NHX genes.
Likewise, Wang et al. performed a comprehensive evolutionary analysis of the growth-regulating factor (GRF) gene family in eight important cereal crops. The authors identified 96 GRF genes and found they could be classified into 3 groups based on phylogenetic analysis. Further analysis showed that whole genome/segmental duplications played a major role in the expansion of the GRF family. Expression data revealed GRFs are induced by multiple hormones and are highly expressed in young inflorescences, implying roles in hormone signaling and inflorescence development. Notably, they found that overexpression of one GRF in rice, OjGRF11, increased sensitivity to auxin treatment and inhibited root elongation by altering the expression of auxin signaling genes. Overall, this evolutionary analysis provides important insights into the structure, function and expansion of the GRF gene family in cereal crops.
Looking ahead, bioinformatics and big data will revolutionize agriculture, enhancing productivity, sustainability, and food security globally. Yet fully integrating these emerging technologies into traditional farming requires surmounting systemic challenges. Cross-disciplinary, cooperative initiatives are key to equitably distributing the immense potential rewards. While the promise is vast, patience and nuance will be vital in transitional implementation.
Author contributions
SS: Conceptualization, Writing – original draft, Writing – review & editing. MW: Writing – review & editing. MA: Writing – review & editing.
Acknowledgments
We thank the authors of the papers published in this Research Topic for their valuable contributions and the referees for their rigorous review.
Conflict of interest
The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.
The author(s) declared that they were an editorial board member of Frontiers, at the time of submission. This had no impact on the peer review process and the final decision.
Publisher’s note
All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article, or claim that may be made by its manufacturer, is not guaranteed or endorsed by the publisher.
References
Aubry, S. (2019). The future of digital sequence information for plant genetic resources for food and agriculture. Front. Plant Sci. 10. doi: 10.3389/fpls.2019.01046
Chen, C., Chen, H., Zhang, Y., Thomas, H. R., Frank, M. H., He, Y., et al. (2020). TBtools: An integrative toolkit developed for interactive analyses of big biological data. Mol. Plant 13 (8), 1194–1202. doi: 10.1016/j.molp.2020.06.009
Hesami, M., Alizadeh, M., Jones, A. M. P., Torkamaneh, D. (2022). Machine learning: its challenges and opportunities in plant system biology. Appl. Microbiol. Biotechnol. 106 (9-10), 3507–3530. doi: 10.1007/s00253-022-11963-6
Khan, M. H. U., Wang, S., Wang, J., Ahmar, S., Saeed, S., Khan, S. U., et al. (2022). Applications of artificial intelligence in climate-resilient smart-crop breeding. Int. J. Mol. Sci. 23 (19), 11156. doi: 10.3390/ijms231911156
Lawniczak, M. K. N., Durbin, R., Flicek, P., Lindblad-Toh, K., Wei, X., Archibald, J. M., et al. (2022). Standards recommendations for the earth bioGenome project. Proc. Natl. Acad. Sci. U.S.A. 119 (4), e2115639118. doi: 10.1073/pnas.2115639118
Sahu, S. K., Liu, H. (2023). A genetic solution for the global food security crisis. J. Integr. Plant Biol. 65 (6), 1359–1361. doi: 10.1111/jipb.13500
Keywords: big data, sustainable agriculture, digital agriculture, data analysis, data
Citation: Sahu SK, Waseem M and Aslam MM (2023) Editorial: Bioinformatics, big data and agriculture: a challenge for the future. Front. Plant Sci. 14:1271305. doi: 10.3389/fpls.2023.1271305
Received: 02 August 2023; Accepted: 04 October 2023;
Published: 16 October 2023.
Edited and Reviewed by:
Mahendar Thudi, Dr. Rajendra Prasad Central Agricultural University, IndiaCopyright © 2023 Sahu, Waseem and Aslam. This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.
*Correspondence: Sunil Kumar Sahu, c3VuaWxrdW1hcnNhaHVAZ2Vub21pY3MuY24=; Muhammad Waseem, MTg0MzI4QGhhaW5hbnUuZWR1LmNu