Skip to main content

PERSPECTIVE article

Front. Plant Sci., 16 January 2024
Sec. Plant Biophysics and Modeling
This article is part of the Research Topic Plant Sensing and Computing - PlantComp 2022 View all 8 articles

Plant science in the age of simulation intelligence

  • 1KERMIT and Biobix, Department of Data Analysis and Mathematical Modelling, Ghent University, Ghent, Belgium
  • 2IDLAB-AIRO, Ghent University, imec, Ghent, Belgium
  • 3Plant Sciences Unit, Flanders Research Institute for Agriculture, Fisheries and Food, Melle, Belgium

Historically, plant and crop sciences have been quantitative fields that intensively use measurements and modeling. Traditionally, researchers choose between two dominant modeling approaches: mechanistic plant growth models or data-driven, statistical methodologies. At the intersection of both paradigms, a novel approach referred to as “simulation intelligence”, has emerged as a powerful tool for comprehending and controlling complex systems, including plants and crops. This work explores the transformative potential for the plant science community of the nine simulation intelligence motifs, from understanding molecular plant processes to optimizing greenhouse control. Many of these concepts, such as surrogate models and agent-based modeling, have gained prominence in plant and crop sciences. In contrast, some motifs, such as open-ended optimization or program synthesis, still need to be explored further. The motifs of simulation intelligence can potentially revolutionize breeding and precision farming towards more sustainable food production.

1 Introduction

As computational capabilities have grown, modeling has become a specialized area of agricultural sciences. These models can accurately simulate the performance of plants, crops, and greenhouses under various conditions and have therefore been used extensively for generating scientific hypotheses, informing and accelerating breeding programs, optimizing crop management and providing policy recommendations (Silva and Giller, 2020). However, models remain highly species-specific, complex, and difficult to calibrate due to many interlinked parameters (Wallach et al., 2021). Current developments and increased availability of phenotyping data provide an extensive source of data for model development and calibration, required for the extension of model applicability to novel or ‘forgotten’ crops and to studying impacts of pests and micronutrients. (Silva and Giller, 2020). However, more modern and powerful modeling paradigms are needed to address these issues and to infer large sets of parameters from phenotyping data. This paper introduces some fundamental concepts of the emerging field of simulation intelligence (SI) to plant science. SI is the merger of scientific computing and artificial intelligence (Lavin et al., 2021). Specifically for plant science, this will result in a new field that combines novel phenotyping approaches with modeling.

Phenotyping is quantifying (a subset of) plant traits that result from the interaction between plant genetics and the environmental conditions to which plants are exposed (Walter et al., 2015). Due to internal regulatory mechanisms, these phenotypic responses are situated at multiple organizational scales (cell, tissue, organ, plant, field) and across timescales. Biotic and abiotic external drivers also influence these mechanisms. While phenotyping ideally involves capturing the entire state of a plant in space and time, only partial observations are practically feasible, leading to the need for a wide range of sensory devices that capture part of the phenotype. A more holistic and dynamic view of phenotyping is necessary to overcome challenges involved in current approaches, including improving the temporal resolution and broadening results from specific studies to more diverse conditions (Das Choudhury et al., 2018).

Most models consist of mathematical equations for predicting plant behavior, morphology and growth as a function of environment, genetics, and management. Plant models describe and connect plant processes typically studied in isolation and consequently predict integrated responses. As such, models are often used for hypothesis development and improved understanding of plant processes, but also as decision support tools for breeding (e.g., genomic prediction), crop management (e.g., irrigation scheduling) and policy-making (e.g., climate change scenarios) (Peng et al., 2020).

Depending on their objective, models vary at the level with which processes are included (black-box ↔ mechanistic axis) and at the scale they operate in terms of space (molecule ↔ ecosystem axis) and time (second ↔ century axis). Models are considered ‘process-based’ or ‘mechanistic’ when parameters have a biophysical interpretation and their equations explicitly describe processes (e.g., photosynthesis, water transport). They operate at a different spatial and (often also) temporal scale. At each spatial level, there is an extra level of abstraction, but, interestingly, the scale does not necessarily determine whether a model is more or less mechanistic, as plants tend to adapt to their environment in an integrated way (Tardieu et al., 2020). On the other end of the spectrum, entirely data-driven models based on machine learning algorithms often lack interpretable parameters. The latter group of models is vital in breeding [e.g., genomic prediction (Korte and Farlow, 2013; Hickey et al., 2017; De Meyer et al., 2023)], but also in greenhouse climate control (Hemming et al., 2020). Consequently, models often only operate on a single point in the tempo-spatial domain, limiting their use beyond their initial conceptualization. Nevertheless, there are efforts to connect modeling scales from the molecular level up to the crop system (Peng et al., 2020).

Recently, surrogate plant models have become popular (Corrales et al., 2022; Cheng et al., 2023; Zhang et al., 2023), because these allow for creating a “digital twin” of the plant systems for decision support and control. A surrogate data-driven model is trained to mimic the mechanistic model’s output accurately. When properly trained, such a surrogate can be several magnitudes faster than the original model while behaving nearly identically (Gherman et al., 2023). Apart from the complete replacement of mechanistic models by data-driven models, these can also be combined. For example, Zhang et al. (2023) demonstrate how these can be coupled in series (e.g., crop model simulation outputs as input to a machine learning model), in parallel (e.g., data assimilation in crop models) or via modules (e.g., part of a crop model is replaced by a machine learning module). This is a stepping stone towards SI, leading to cross-pollination between scientific computing, artificial intelligence, plant modeling and phenotyping.

2 Combining scientific computing with artificial intelligence

Complex biological systems require powerful tools to study them. On the one hand, many of these systems require substantial domain knowledge, often as conservation laws and reaction mechanisms, for which traditional mechanistic modeling and simulation paradigms are well suited. This is known as “scientific computing” and relies mainly on ordinary differential equations, partial differential equations, agent-based models and their ilk. On the other hand, many mechanisms are yet to be elucidated while, at the same time, a plethora of multimodal data is available. This motivates using a data-driven approach (referred to as “artificial intelligence” or “machine learning”). Recent advances blur the lines between traditional methodologies, and so-called scientific machine learning combines both, for example, in neural ordinary differential equations (Chen et al., 2018; Innes et al., 2019; Rackauckas et al., 2021), where the solvers are treated as differentiable programs that can fit data to learn unknown dynamics of the problems.

The advances in scientific computing and machine learning and their use in studying complex, dynamical multi-scale systems gave rise to a more generalized view: the new field of simulation intelligence (SI). Lavin et al. (2021) outlined nine vital, interconnected computing technology motifs, visually represented in Figure 1:

1. Multi-scale and multi-physics modelling (Karniadakis et al., 2021): integrating different types of simulators;

2. Surrogate modelling and emulation (Purcell and Neubauer, 2023): replacing a complex model or system with a different one;

3. Simulation-based inference (Cranmer et al., 2020): using the simulator to infer parameters or states;

4. Causal modelling and inference (Schölkopf et al., 2021): including or identifying causal concepts within the model;

5. Agent-based modelling (Zhang and DeAngelis, 2020): simulating a system as a collection of semiautonomous agents;

6. Probabilistic programming (Schoot et al., 2021): interpreting code as a stochastic program;

7. Differentiable programming (Baydin et al., 2018): computing and using derivatives and gradients of computer code and simulators;

8. Open-ended optimization (Stock and Gorochowski, 2023): trying to find continuous improvements;

9. Program synthesis (David and Kroening, 2017): automatically discovering the code to solve a problem.

Figure 1
www.frontiersin.org

Figure 1 Overview of the different SI motifs for plants. (A) Multi-scale and multi-physics modeling considers the different scales (from cell to ecosystem or field) and physical processes (radiation, hydraulics, fluidics). (B) Surrorgate modeling and emulation considers a virtual digital twin of the plant system. (C) Simulation-based inference, such as approximate Bayesian computing, allows a simulator to infer parameters or states from data. (D) Causal modeling and inference takes into account the different levels of causal reasoning that are possible. (E) Agent-based modeling simulates a system as a collection of semiautonomous agents. (F) Probabilistic programming allows for general computing with stochastic components and performs general inferences about parameters and states. (G) Differential programming computing and simulation with gradients and derivatives. (H) Open-ended optimization aims at finding continuous improvements and adaptations, for example, in plant breeding. (I) Program synthesis automatically generates the code to solve a problem, e.g., extract the L-system to describe a plant.

We included a reference for each motif that covers this specific topic more in-depth.

This technology stack has enormous potential to advance fields such as material science, agriculture, chemistry, medicine, climate, and synthetic biology. Simulation intelligence can also significantly impact plant sciences. By combining modeling and phenotyping, one can uncover mechanisms underlying plant dynamics. For example, functional-structural plant modeling aims to develop holistic plant growth and function models, harmonizing architecture with (eco)physiology. The Quantitative Plant1 initiative collects plant phenotyping data sets, plant analysis tools and models. The availability of plant phenotypic data repositories and plant biophysical models are prerequisites for SI.1

Concretely, SI aims to handle the following challenges in using computational modeling for real-world problems:

Inverse problem solving, where one wants to use a model to infer hidden states or parameters from observations or measurements. For example, in root phenotyping, researchers use electrical resistance tomography measurements to infer root properties non-invasively (Whalley et al., 2017).

Uncertainty reasoning, which relates to the inherent uncertainty of dealing with biological systems, both epistemic (i.e., incomplete knowledge of the processes) and aleatoric (i.e., the irreducible noise, for example, due to biological stochasticity) (Hüllermeier and Waegeman, 2021). Quantifying uncertainty is of great importance for plant breeding and precision crop management (Asseng et al., 2013; Tao et al., 2018; Folberth et al., 2019; Nelson et al., 2019; Hernández and López, 2020; Dokoohaki et al., 2021), especially when dealing with a changing climate.

Human-machine teaming relates to the interaction between the model or machine intelligence and the breeder, farmer or other users. This includes intelligent dashboards and ways for users to query the simulator for decision-making and inject data, observations and results into the model. Bridging the gap between models and users is a significant challenge in digital agriculture (Antle et al., 2017; Slob et al., 2023; Zhang et al., 2023).

3 Nine simulation intelligence motifs for plant science

This section discusses the nine SI motifs outlined in Lavin et al. (2021) and discusses how they can relate to plant science. We speculate about the opportunities they could present in digital agriculture when we find only a few plant-related examples. Due to the broad scope of these topics, we have to be concise. Our primary aim is to inform the quantitative plant scientist of available SI tools. We refer to the work of Lavin et al. (2021) or specific overview papers for an extensive introduction.

3.1 Multi-scale and multi-physics modeling

Plant processes occur on different scales (Figure 1A). These can be spatial, from the molecular processes in the cell (micrometers) to the ecosystem (up to kilometers), or temporal, the processes of interest range from nanoseconds (e.g., photosynthesis) to months or years (e.g., growth). For example, plants generate and use various rhythms and oscillations at all scales and organization levels (Damineli et al., 2022). Plant modelers are aware of the fractal complexity of plant modeling, where lower-level processes can be abstracted away in so-called meta-mechanisms (Tardieu et al., 2020). Meta-mechanisms are, e.g., response curves of plant traits to environmental conditions, which can be characterized in a high-throughput fashion. Such meta-mechanisms are largely determined by physical trade-offs that limit evolution (Kempes et al., 2019). Meta-mechanisms can be tailored to specific plant species or cultivars, an open challenge in plant modeling (Silva and Giller, 2020).

Plant models involve various kinds of physical models, going from molecular and metabolic processes (Farquhar et al., 1980), hydraulic functioning (De Swaef et al., 2022), to soil and atmospheric physics (Liu et al., 2020b). Modern (functional-structural) plant modeling involves advanced physics simulation such as ray tracing to assess radiation (De Visser et al., 2014; Bailey, 2018; Retkute et al., 2018) and computational fluid dynamics (Bartzanas et al., 2013; Jiao et al., 2020). The latter are often computationally demanding and might require appropriate tools, such as surrogate modeling (discussed later), to make them feasible for, e.g., greenhouse control.

Physics-informed machine learning can be a powerful aid in incorporating the different scales and physics (Karniadakis et al., 2021). Here, data-driven models are fitted not only to match their training data but also to adhere to known physical laws and are ideally suited to integrate data into different physical processes. This has shown great success in biomedical modeling, for example, in modeling blood flow in an intracranial aneurysm (Raissi et al., 2020). In crop science, Cavanagh et al. (2021) used physics-informed deep learning to study the morphological changes induced by Asian soybean rust. Similar directions for holistic plant modeling will undoubtedly be fruitful.

3.2 Surrogate modeling and emulation

A surrogate model is a model that replaces an often expensive computation or process (Figure 1B). In scientific computing, expensive simulations, such as computational fluid dynamics, are often replaced by relatively inexpensive methods of training and deploying machine learning surrogates, such as Gaussian processes or artificial neural networks. For example, Cheng et al. (2023) used a data-driven surrogate model in combination with a multi-objective genetic algorithm to reduce irrigation and nitrogen fertilization by 44% and 37%, respectively.

Surrogate models play a pivotal role in developing of digital twins – real-time synchronized virtual representations of products, processes, or environments. These dynamic digital counterparts facilitate a bidirectional flow of information, leveraging real-world data while influencing management and decision-making processes. Positioned at the forefront of digital agriculture and smart farming (Verdouw et al., 2021; Slob et al., 2022; Purcell and Neubauer, 2023), digital twins seamlessly integrate with the principles of Industry 4.0 tailored for agricultural contexts.

Digital twins exhibit versatility, being applied to emulate plants, greenhouses, or entire supply chains. Their primary utility is improving cost-efficiency, such as reducing water and fertilizer consumption and elevating prediction accuracy (Ariesen-Verschuur et al., 2022). Verdouw et al. (2021) categorizes digital twins based on their relationship to virtual objects—whether it pertains to an imaginary entity (e.g., a yet-to-grow cultivar), an existing object, future states for predictive analysis, or a historical object. Additionally, digital twins serve distinct purposes, being employed for both monitoring and prescription.

In recent years, digital twins have demonstrated significant successes in agriculture. Examples include emulating various wheat development stages to predict yield (Skobelev et al., 2020), optimizing yield and minimizing energy requirements in underground hydroponic farms (Jans-Singh et al., 2020), exploring virtual replicas of greenhouses through immersive VR experiences (Slob et al., 2023), monitoring the health and quality of individual plants in orchards (Moghadam et al., 2020), and fine-tuning greenhouse control systems (Chaux et al., 2021). Using simulations to govern systems and explore hypothetical interventions aligns closely with causal reasoning, as discussed in Section 3.4.

3.3 Simulation-based inference

In plant modeling, knowledge of the processes of interest is often encoded in process-based models. Typically, given the initial conditions and the parameter values, these models can simulate data that can be compared with quantitative measurements such as biomass growth, development stage, or transpiration. Generating this data is called forward modeling. However, when one observes the data, one would often infer the likely hidden states or parameter values, a process that is a much more challenging inverse problem (Figure 1C). The field of simulation-based inference deals with developing inference methods for highly intricate simulators, i.e., to extract the parameters of a mechanistic model algorithmically (Cranmer et al., 2020). Simulation-based inference is often called likelihood-free inference – as contrasted with classical statistical estimation problems. The likelihood function implicitly defined by the simulator is often not tractable, making this a challenging endeavor. Inverse problems are usually solved using a Bayesian perspective, where the parameters or states have associated prior distributions. The simulator acts as an implicit likelihood function, linking the model with the data and parameters.

Approximate Bayesian Computation (ABC) is a widely utilized approach for simulation-based inference (Marin et al., 2012; Romero-Cuellar and Francés, 2023). In ABC, the simulator generates synthetic data by sampling parameters from a prior distribution or proposal distribution and using these parameters to perform a simulation. These synthetic datasets are characterized by summary statistics, such as the total biomass, used to compare the simulated data with collected observations. Parameter values producing synthetic data with summary statistics closely aligned with those of the actual data, often measured using Euclidean distance, are retained. These selected values provide approximate samples of the posterior distribution. ABC’s most commonly used variant operates similarly to rejection sampling, and its sampling properties are well understood. However, the conventional ABC method becomes inefficient, especially when dealing with large parameter spaces. Notably, ABC has been successfully applied in plant science to merge crop growth models with whole genome data (Technow et al., 2015), to infer root architecture (Ziegler et al., 2019) and to characterize the morphodynamic progression of Asian soybean rust (Cavanagh et al., 2021). The progression of machine learning and SI techniques, including probabilistic and differentiable programming (see Section 3.5), has significantly influenced simulation-based inference. For instance, in a study by Monti et al. (2023), the parameters of an agent-based model were learned directly from data by redefining it as a probabilistic program.

3.4 Causal modeling and inference

Data-driven modeling has achieved remarkable success across various scientific and technological domains. However, purely statistical models often fail to uncover the underlying causal mechanisms behind the observed data. As an illustration, consider a simple linear regression model predicting yield based on nutrient inputs. This model might erroneously suggest that fertilization decreases yield, neglecting the confounding effect of poorer soils where fertilizers are commonly applied.

The significance of understanding causality has been underscored by Judea Pearl in his work, including “The Book of Why” (Pearl and Mackenzie, 2018). Pearl introduced a hierarchy of causal reasoning that data-driven models can accomplish, comprising:

1. Observations: Detecting associations in data, such as estimating tree biomass from their diameter at breast height.

2. Interventions: Predicting outcomes resulting from active manipulations of the system, like projecting the effects of flower pruning on fruit production.

3. Counterfactuals: Imagining potential scenarios where conditions or interventions differed, as in assessing whether larger fruits would have resulted from more extensive flower removal.

Pearl’s mathematical insights reveal that some models are inherently limited in performing higher-level causal reasoning. Thus, plant scientists who aim to predict and manage must exercise caution when employing data-driven models from observational data because the causal link between the predictors is not exploited by default. For example, a data-driven model might conclude that watering harms a plant’s water status, as irrigation and water stress are correlated. This scenario highlights the limitations of relying solely on observational data, representing the first level of causal reasoning in Judea Pearl’s hierarchy, where associations in data are detected without considering active manipulations or counterfactual scenarios. The limitations contrast with many mechanistic models, which can often be used directly for interventions and counterfactuals. The crux lies in developing models incorporating the structural relationships between variables of interest, advocating for mechanistic and hybrid models. The evolving field of causal machine learning continues to gain prominence (Schölkopf et al., 2021) (Figure 1D).

3.5 Agent-based modelling

Agent-based models (ABMs) depict complex systems as interconnected, autonomous agents (e.g., organs or whole plants) interacting from the bottom up (Figure 1E). These models frequently encompass stochastic elements and can replicate macro-level processes stemming from micro-level interactions. Consequently, ABMs are a natural fit for elucidating multi-scale phenomena. In ecology and plant science, ABMs are widely employed (DeAngelis and Mooij, 2005; McLane et al., 2011; Zhang and DeAngelis, 2020), offering insights into growth, carbon allocation, reproduction, and more. These models can portray individual plants within functional-structural plant models (FSPMs) or capture entire plant communities, such as field ecosystems.

Interestingly, ABMs precisely capture individual plant behaviors due to plants’ modular structure, comprising elements like roots, leaves, stems, and branches. Each module functions autonomously, gathering, producing, or distributing resources for the overall plant’s advantage. Remarkably, plants lack a central controlling entity, resembling a decentralized “swarm intelligence” (Baluška et al., 2010a; Oborny, 2019; van Schijndel et al., 2022). For instance, a plant’s root tips exhibit both sensory and command center roles, independently deciding growth directions and even forming symbiotic relationships with mycorrhizal fungi (Baluška et al., 2010b). Some liken this to a “solid” brain, where individual units are fixed. In contrast, others argue for “liquid” brain aspects (van Schijndel et al., 2022), like in plants with vegetative propagation, like strawberries, exploring diverse niches to optimize their niche.

3.6 Probabilistic programming

The language of probability theory is an effective way to describe biological systems, given their inherent stochastic nature (Figure 1F). Specifically, Bayesian statistics is a consistent framework to update the scientist’s prior beliefs (encoded in prior distributions) with measurements and observations (encoded in the likelihood) into the so-called posterior distribution (Schoot et al., 2021). In plant science, Bayesian reasoning is applied in, for instance, plant pathology and epidemics (Mila and Carriquiry, 2004), modeling life stage events (Humplík et al., 2020), and predicting maize yield (Lacasa et al., 2020). Though powerful, Bayesian and probabilistic methods can be complex in practice because conditioning a distribution (e.g., computing the posterior) requires normalization, often involving computing intractable integrals or sums. Probabilistic programming is a relatively new, general approach to making probabilistic methods more accessible in the scientific community.

A probabilistic programming language (PPL) allows one to write, in principle, arbitrary complex stochastic programs from which the scientist can make inferences by sampling. Hence, a universal PPL provides two constructs: i) a way to sample from the stochastic program and ii) a way to condition during inference. For example, one can write a program to simulate flowering vines and then constrain regions where they are present. This allows one to sample vines that grow in a specific shape, such as a letter (Ritchie et al., 2016). PPLs have shown success throughout the biological sciences, for example, in inferring phylogeny (Ronquist et al., 2020), protein structure alignment (Moreta et al., 2019) and inferring signaling pathways (Merrell and Gitter, 2020). There are a plethora of PPLs available, many interfacing with scientifically popular programming languages for sciences, for example, Stan (Stan Development Team, 2023), Pyro (Bingham et al., 2019), or Turing (Holt and Cordy, 1988).

3.7 Differentiable programming

While probabilistic programming facilitates generic computations involving probability distributions, differentiable programming (Izzo et al., 2016; Innes et al., 2019) extends computation by enabling differentiation of arbitrary computer programs (Figure 1G). This empowers the fine-tuning of program behavior using gradient-based optimization techniques. This achievement relies on automatic differentiation (Baydin et al., 2018) – numerically computing (exact) derivatives by directly manipulating the computational graph – a foundational concept in deep learning. Differentiable programming has exerted a profound scientific influence, acting as a cornerstone for nearly all deep learning research over the past decade and diverse domains beyond deep learning. These domains encompass ordinary differential equations (Chen et al., 2018; Rackauckas et al., 2021; Núñez et al., 2023), scientific machine learning, robotics Degrave et al. (2019), physics, protein science (Ingraham et al., 2019; AlQuraishi and Sorger, 2021), combinatorial optimization (Liu et al., 2020a), and geosciences (Shen et al., 2023). The utility of differentiable programming extends to harmonizing process-based and data-driven models. Within plant sciences, differentiable plant models offer an avenue to assess sensitivity directly, calibrate parameters using gradients, apply probabilistic programming techniques for uncertainty quantification, and gain control over conditions for optimizing growth. Concrete achievements in plant sciences include the creation of 3D digital twin leaf models from image data (Li et al., 2022) and solving inverse problems related to photosynthesis (Aboelyazeed et al., 2023). We also propose that advancements in differentiable ray tracing (Li et al., 2018), computational fluid dynamics (Bezgin et al., 2023), and physics engines (de Avila Belbute-Peres et al., 2018; Degrave et al., 2019) hold substantial promise for enhancing plant simulations.

3.8 Open-ended optimization

Open-ended systems possess the ability to achieve limitless improvement and continuously generate novelty (Stanley and Lehman, 2015; Banzhaf et al., 2016; Stanley et al., 2017) (Figure 1H). In such systems, the focus primarily lies on creating novelty rather than being driven by a specific objective function (Stanley and Lehman, 2015). Open-endedness is a characteristic observed in various complex systems, including natural evolution and technological innovation. Its principles have been explored for diverse applications such as designing new computer architectures (Ackley and Small, 2014), software development (Fix et al., 2021), artificial neural networks (Guttenberg et al., 2018), and novel cancer treatment strategies (Balaz et al., 2021).

Our other work delves into how open-endedness and quality-diversity algorithms can contribute to biotechnology and synthetic biology (Stock and Gorochowski, 2023). Expanding this perspective, we propose that open-endedness can significantly impact plant breeding, a critical aspect in ensuring global food security (Lenaerts et al., 2019). Conventional breeding approaches often prioritize incorporating positive traits into populations, potentially at the expense of diversity (Louwaars, 2018). A noteworthy exception occurred in the 1970s when Zelder, a breeding company, intentionally bred wheat varieties to enhance diversity as a defense against yellow rust (Groenewegen, 1977). Embracing the open-ended optimization viewpoint, one could design breeding schemes capable of continually generating new cultivars with novel and desirable traits. Insights from the field of quality-diversity optimization (Pugh et al., 2016), which focuses on generating new variants that combine functionality and diversity, have the potential to revolutionize breeding strategies for developing crops and cultivars suited to a dynamically changing world. In silico evolution experiments can help to understand the allometric relations observed in plants due to environmental conditions (Eloy et al., 2017). As such, they might help to design new cultivars.

3.9 Program synthesis

Program synthesis automates software creation to tackle specific problems (David and Kroening, 2017) (Figure 1I). Here, the focus shifts towards generating optimized code by capturing users’ intentions. A notable application of this concept is evident in the recently released ChatGPT, where users use natural language queries to program or create computer code. This synthesis technique plays a pivotal role in simplifying intricate mathematical system descriptions.

Program synthesis offers avenues for extracting insights from biological experiments in mathematical modeling. For instance, Koksal et al. (2013) automatically generated biological models from mutation experiments and recommended new experiments to differentiate between potential models. This approach holds promise for analyzing high-throughput CRISPR-Cas-based knockout experiments, offering valuable insights for plant breeding (Van Huffel et al., 2022). Symbolic regression uses genetic programming to automatically discover a white-box model of one system (Angelis et al., 2023; Cranmer, 2023) – ideal to find the earlier-discussed meta-mechanisms. The DreamCoder system can uncover simple programs that generate example datasets (Ellis et al., 2023). These programs encompass diverse forms, such as regular expressions, graphics, symbolic equations, and physical laws. These techniques aid in discovering allometric laws and meta-mechanisms to support model building. They also facilitate the automatic extraction of rules for L-systems, enabling the creation of virtual plants based on a limited set of examples. In summary, custom computer algebra systems (Ma et al., 2022) and language compilers can streamline equations and code, resulting in concise and numerically stable plant models.

4 Discussion and outlook

The sections above discussed SI and its (potential) impact on the plant sciences. Here, we will give a more holistic point of view of why SI can be instrumental in discovering new and improving current practices in plant sciences. SI provides a holistic, top-down look at plant science and a systemic approach for leveraging fragmented phenotypic data and ecophysiological knowledge contained in process-based models.

Process-based plant models (including FSPMs) are continuously under development and updated with relevant knowledge. These are applied for decision support and (climate) scenario analysis, but also for answering scientific, plant-physiological questions, often in combination with plant phenotypic data. High-throughput plant phenotyping data of crop performance and development is now also being applied in crop breeding (Araus and Cairns, 2014; Gill et al., 2022). Still, the impact of these phenotypic data often needs to be more specific to the objectives of the experiments wherein these were collected. Therefore, SI concepts can facilitate the connection between phenotypic data and ecophysiological plant models and, as such, broaden the use of these phenotypic data and expand knowledge of the processes they rely on.

We identify three key prerequisites to embrace SI’s philosophy in plant sciences fully. (1) The development of cheap sensor technology for environmental monitoring and plant phenotyping enables continuous monitoring of larger populations in real-time. (2) Open datasets and code, following the FAIR principles (Wilkinson et al., 2016), which is finding its way into plant sciences (Saint Cast et al., 2022) but is already much more prevalent in other scientific domains (Scheffler et al., 2022). (3) Interdisciplinary collaborations bridge potential knowledge gaps and enable cross-disciplinary approaches to succeed faster.

The increased amount of open data, along with AI, allows the processing of larger amounts of combined data and models and opens up new or improved applications in plant breeding or greenhouse control, as seen in other domains, e.g. Degrave et al. (2022) who leveraged simulation and experimental data to learn a closed-loop controller a tokamak reactor. Recent research explores similar hybrid approaches to control plant and crop systems (Kang and Wang, 2017; Ifrim et al., 2021; Mahmood et al., 2023). Simulation and scientific computing is centered around creating mechanistic computational models to simulate real-world phenomena, while machine learning focuses on leveraging learning algorithms to extract knowledge and insights from scientific data. Both approaches have strengths and can be combined to enhance scientific understanding and decision-making.

Throughout our literature survey, we identified the SI motifs embedded in numerous plant-related projects. This finding aligns with the core objectives of SI, which are geared towards addressing the issues inherent in modeling complex systems:

● Solving inverse problems;

● Integrating mechanistic knowledge with data;

● Navigating uncertainty, and;

● Fostering effective communication between the model and the user.

In applied domains, such as plant and crop modeling, advancements often trail the cutting edge of computational methodologies. Consequently, it is unsurprising that relatively mature SI motifs, such as surrogate models, agent-based modeling, and differentiable programming, showcase the highest prevalence in plant-related examples. In contrast, motifs in their infancy, such as probabilistic programming, open-ended optimization, and program synthesis, exhibit fewer concrete applications in plant and crop science. These emerging SI topics can advance plant and agricultural sciences toward a more sustainable future.

Data availability statement

The original contributions presented in the study are included in the article/supplementary material. Further inquiries can be directed to the corresponding author.

Author contributions

MS: Writing – original draft, Writing – review & editing. OP: Writing – original draft, Writing – review & editing. TD: Writing – original draft, Writing – review & editing. FW: Writing – original draft, Writing – review & editing.

Funding

The author(s) declare financial support was received for the research, authorship, and/or publication of this article. This work was partially funded by Ghent University grant number BOF-GOA-01G01923.

Conflict of interest

The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.

Publisher’s note

All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article, or claim that may be made by its manufacturer, is not guaranteed or endorsed by the publisher.

Footnotes

  1. ^ https://quantitative-plant.org/(accessed on March 2023).

References

Aboelyazeed, D., Xu, C., Hoffman, F. M., Liu, J., Jones, A. W., Rackauckas, C., et al. (2023). A differentiable, physics-informed ecosystem modeling and learning framework for large-scale inverse problems: Demonstration with photosynthesis simulations. Biogeosciences 20, 2671–2692. doi: 10.5194/bg-20-2671-2023

CrossRef Full Text | Google Scholar

Ackley, D., Small, T. (2014). “Indefinitely scalable computing = artificial life engineering,” in ALIFE 14: The Fourteenth International Conference on the Synthesis and Simulation of Living Systems (Manhattan, New York: MIT Press), 606–613. doi: 10.1162/978-0-262-32621-6-ch098

CrossRef Full Text | Google Scholar

AlQuraishi, M., Sorger, P. K. (2021). Differentiable biology: Using deep learning for biophysics-based and data-driven modeling of molecular mechanisms. Nat. Methods 18 (10), 1169–1180. doi: 10.1038/s41592-021-01283-4

PubMed Abstract | CrossRef Full Text | Google Scholar

Angelis, D., Sofos, F., Karakasidis, T. E. (2023). Artificial intelligence in physical sciences: Symbolic regression trends and perspectives. Arch. Comput. Methods Eng. 30, 3845–3865. doi: 10.1007/s11831-023-09922-z

CrossRef Full Text | Google Scholar

Antle, J. M., Jones, J. W., Rosenzweig, C. (2017). Next generation agricultural system models and knowledge products: Synthesis and strategy. Agric. Syst. 155, 179–185. doi: 10.1016/j.agsy.2017.05.006

PubMed Abstract | CrossRef Full Text | Google Scholar

Araus, J. L., Cairns, J. E. (2014). Field high-throughput phenotyping: The new crop breeding frontier. Trends Plant Sci. 19, 52–61. doi: 10.1016/j.tplants.2013.09.008

PubMed Abstract | CrossRef Full Text | Google Scholar

Ariesen-Verschuur, N., Verdouw, C., Tekinerdogan, B., Ariesen-Verschuur, N., Verdouw, C., Tekinerdogan, B. (2022). Digital Twins in greenhouse horticulture: A review. doi: 10.1016/J.COMPAG.2022.107183

CrossRef Full Text | Google Scholar

Asseng, S., Ewert, F., Rosenzweig, C., Jones, J. W., Hatfield, J. L., Ruane, A. C., et al. (2013). Uncertainty in simulating wheat yields under climate change. Nat. Climate Change 3, 827–832. doi: 10.1038/nclimate1916

CrossRef Full Text | Google Scholar

Bailey, B. N. (2018). A reverse ray-tracing method for modelling the net radiative flux in leaf-resolving plant canopy simulations. Ecol. Model. 368, 233–245. doi: 10.1016/j.ecolmodel.2017.11.022

CrossRef Full Text | Google Scholar

Balaz, I., Petrić, T., Kovacevic, M., Tsompanas, M.-A., Stillman, N. (2021). Harnessing adaptive novelty for automated generation of cancer treatments. Biosystems 199, 104290. doi: 10.1016/j.biosystems.2020.104290

PubMed Abstract | CrossRef Full Text | Google Scholar

Baluška, F., Lev-Yadun, S., Mancuso, S. (2010a). Swarm intelligence in plant roots. Trends Ecol. Evol. 25, 682–683. doi: 10.1016/j.tree.2010.09.003

PubMed Abstract | CrossRef Full Text | Google Scholar

Baluška, F., Mancuso, S., Volkmann, D., Barlow, P. W. (2010b). Root apex transition zone: A signalling–response nexus in the root. Trends Plant Sci. 15, 402–408. doi: 10.1016/j.tplants.2010.04.007

PubMed Abstract | CrossRef Full Text | Google Scholar

Banzhaf, W., Baumgaertner, B., Beslon, G., Doursat, R., Foster, J. A., McMullin, B., et al. (2016). Defining and simulating open-ended novelty: Requirements, guidelines, and challenges. Theory Biosci. 135, 131–161. doi: 10.1007/s12064-016-0229-7

PubMed Abstract | CrossRef Full Text | Google Scholar

Bartzanas, T., Kacira, M., Zhu, H., Karmakar, S., Tamimi, E., Katsoulas, N., et al. (2013). Computational fluid dynamics applications to improve crop production systems. Comput. Electron. Agric. 93, 151–167. doi: 10.1016/j.compag.2012.05.012

CrossRef Full Text | Google Scholar

Baydin, A. G., Pearlmutter, B. A., Radul, A. A., Siskind, J. M. (2018). Automatic differentiation in machine learning: A survey. J. Mach. Learn. Res. 18, 1–43.

Google Scholar

Bezgin, D. A., Buhendwa, A. B., Adams, N. A. (2023). JAX-Fluids: A fully-differentiable highorder computational fluid dynamics solver for compressible two-phase flows. Comput. Phys. Commun. 282, 108527. doi: 10.1016/j.cpc.2022.108527

CrossRef Full Text | Google Scholar

Bingham, E., Chen, J. P., Jankowiak, M., Obermeyer, F., Pradhan, N., Karaletsos, T., et al. (2019). Pyro: deep universal probabilistic programming. J. Mach. Learn. Res. 20, 1–6.

Google Scholar

Cavanagh, H., Mosbach, A., Scalliet, G., Lind, R., Endres, R. G. (2021). Physics-informed deep learning characterizes morphodynamics of Asian soybean rust disease. Nat. Commun. 12, 6424. doi: 10.1038/s41467-021-26577-1

PubMed Abstract | CrossRef Full Text | Google Scholar

Chaux, J. D., Sanchez-Londono, D., Barbieri, G. (2021). A digital twin architecture to optimize productivity within controlled environment agriculture. Appl. Sci. 11, 8875. doi: 10.3390/APP11198875

CrossRef Full Text | Google Scholar

Chen, R. T. Q., Rubanova, Y., Bettencourt, J., Duvenaud, D. (2018). “Neural ordinary differential equations,” in Proceedings of the 32nd Conference on Advances in Neural Information Processing Systems (Montre´al, Canada), 1–13. doi: 10.1007/978-3-662-55774-73

CrossRef Full Text | Google Scholar

Cheng, D., Yao, Y., Liu, R., Li, X., Guan, B., Yu, F. (2023). Precision agriculture management based on a surrogate model assisted multiobjective algorithmic framework. Sci. Rep. 13, 1142. doi: 10.1038/s41598-023-27990-w

PubMed Abstract | CrossRef Full Text | Google Scholar

Corrales, D. C., Schoving, C., Raynal, H., Debaeke, P., Journet, E.-P., Constantin, J. (2022). A surrogate model based on feature selection techniques and regression learners to improve soybean yield prediction in southern France. Comput. Electron. Agric. 192, 106578. doi: 10.1016/j.compag.2021.106578

CrossRef Full Text | Google Scholar

Cranmer, M. (2023). Interpretable machine learning for science with PySR and SymbolicRegression.jl. doi: 10.48550/arXiv.2305.01582. Dataset.

PubMed Abstract | CrossRef Full Text | Google Scholar

Cranmer, K., Brehmer, J., Louppe, G. (2020). The frontier of simulation-based inference. Proc. Natl. Acad. Sci. 117, 30055–30062. doi: 10.1073/pnas.1912789117

CrossRef Full Text | Google Scholar

Damineli, D. S. C., Portes, M. T., Feijó, J. A. (2022). Electrifying rhythms in plant cells. Curr. Opin. Cell Biol. 77, 102113. doi: 10.1016/j.ceb.2022.102113

PubMed Abstract | CrossRef Full Text | Google Scholar

Das Choudhury, S., Bashyam, S., Qiu, Y., Samal, A., Awada, T. (2018). Holistic and component plant phenotyping using temporal image sequence. Plant Methods 14, 35. doi: 10.1186/s13007-018-0303-x

PubMed Abstract | CrossRef Full Text | Google Scholar

David, C., Kroening, D. (2017). Program synthesis: Challenges and opportunities. Philos. Trans. R. Soc. A: Mathematical Phys. Eng. Sci. 375, 20150403. doi: 10.1098/rsta.2015.0403

CrossRef Full Text | Google Scholar

DeAngelis, D. L., Mooij, W. M. (2005). Individual-based modeling of ecological and evolutionary processes. Annu. Rev. Ecology Evolution Systematics 36, 147–168. doi: 10.1146/annurev

CrossRef Full Text | Google Scholar

de Avila Belbute-Peres, F., Smith, K., Allen, K., Tenenbaum, J., Kolter, J. Z. (2018). “End-to-end differentiable physics for learning and control,” in Advances in Neural Information Processing Systems, vol. 31. (Curran Associates, Inc).

Google Scholar

Degrave, J., Felici, F., Buchli, J., Neunert, M., Tracey, B., Carpanese, F., et al. (2022). Magnetic control of tokamak plasmas through deep reinforcement learning. Nature 602, 414–419. doi: 10.1038/s41586-021-04301-9

PubMed Abstract | CrossRef Full Text | Google Scholar

Degrave, J., Hermans, M., Dambre, J., Wyffels, F. (2019). A differentiable physics engine for deep learning in robotics. Front. Neurorobotics 13. doi: 10.3389/fnbot.2019.00006

CrossRef Full Text | Google Scholar

De Meyer, S., Cruz, D. F., De Swaef, T., Lootens, P., De Block, J., Bird, K., et al. (2023). Predicting yield of individual field-grown rapeseed plants from rosette-stage leaf gene expression. PloS Comput. Biol. 19, e1011161. doi: 10.1371/journal.pcbi.1011161

PubMed Abstract | CrossRef Full Text | Google Scholar

De Swaef, T., Pieters, O., Appeltans, S., Borra-Serrano, I., Coudron, W., Couvreur, V., et al. (2022). On the pivotal role of water potential to model plant physiological processes. silico Plants 4, diab038. doi: 10.1093/insilicoplants/diab038

CrossRef Full Text | Google Scholar

De Visser, P., van der Heijden, G., Buck-Sorlin, G. (2014). Optimizing illumination in the greenhouse using a 3D model of tomato and a ray tracer. Front. Plant Sci. 5. doi: 10.3389/fpls.2014.00048

CrossRef Full Text | Google Scholar

Dokoohaki, H., Kivi, M. S., Martinez-Feria, R., Miguez, F. E., Hoogenboom, G. (2021). A comprehensive uncertainty quantification of large-scale process-based crop modeling frameworks. Environ. Res. Lett. 16, 084010. doi: 10.1088/1748-9326/ac0f26

CrossRef Full Text | Google Scholar

Ellis, K., Wong, C., Nye, M., Sable-Meyer, M., Cary, L., Morales, L., et al. (2023). DreamCoder: Growing 474 generalizable, interpretable knowledge with wake-sleep Bayesian program learning. Philosophical 475 Transactions of the Royal Society A: Mathematical, Physical and Engineering Sciences 381, 1–18. 476. doi: 10.1098/rsta.2022.0050

CrossRef Full Text | Google Scholar

Eloy, C., Fournier, M., Lacointe, A., Moulia, B. (2017). Wind loads and competition for light sculpt trees into self-similar structures. Nat. Commun. 8, 1–12. doi: 10.1038/s41467-017-00995-6

PubMed Abstract | CrossRef Full Text | Google Scholar

Farquhar, G. D., von Caemmerer, S., Berry, J. A. (1980). A biochemical model of photosynthetic CO2 assimilation in leaves of C3 species. Planta 149, 78–90. doi: 10.1007/BF00386231

PubMed Abstract | CrossRef Full Text | Google Scholar

Fix, S., Probst, T., Ruggli, O., Hanne, T., Christen, P. (2021). Open-ended automatic programming through combinatorial evolution. Intelligent Syst. Design Appl. 418, 1–12. doi: 10.48550/arXiv.2102.10475

CrossRef Full Text | Google Scholar

Folberth, C., Elliott, J., Müller, C., Balkovič, J., Chryssanthacopoulos, J., Izaurralde, R. C., et al. (2019). Parameterization-induced uncertainties and impacts of crop management harmonization in a global gridded crop model ensemble. PloS One 14, e0221862. doi: 10.1371/journal.pone.0221862

PubMed Abstract | CrossRef Full Text | Google Scholar

Gherman, I. M., Abdallah, Z. S., Pang, W., Gorochowski, T. E., Grierson, C. S., Marucci, L. (2023). Bridging the gap between mechanistic biological models and machine learning surrogates. PloS Comput. Biol. 19, e1010988. doi: 10.1371/journal.pcbi.1010988

PubMed Abstract | CrossRef Full Text | Google Scholar

Gill, T., Gill, S. K., Saini, D. K., Chopra, Y., de Koff, J. P., Sandhu, K. S. (2022). A comprehensive review of high throughput phenotyping and machine learning for plant stress phenotyping. Phenomics 2, 156–183. doi: 10.1007/s43657-022-00048-z

PubMed Abstract | CrossRef Full Text | Google Scholar

Groenewegen, L. (1977). Multilines as a tool in breeding for reliable yields. Cereal Res. Commun. 5, 125–132.

Google Scholar

Guttenberg, N., Virgo, N., Penn, A. (2018). On the potential for open-endedness in neural networks. Artif. Life 25, 145–167. doi: 10.48550/arXiv.1812.04907

CrossRef Full Text | Google Scholar

Hemming, S., de Zwart, F., Elings, A., Petropoulou, A., Righini, I. (2020). Cherry tomato production in intelligent greenhouses—Sensors and AI for control of climate, irrigation, crop yield, and quality. Sensors 20, 6430. doi: 10.3390/s20226430

PubMed Abstract | CrossRef Full Text | Google Scholar

Hernández, S., López, J. L. (2020). Uncertainty quantification for plant disease detection using Bayesian deep learning. Appl. Soft Computing 96, 106597. doi: 10.1016/j.asoc.2020.106597

CrossRef Full Text | Google Scholar

Hickey, J. M., Chiurugwi, T., Mackay, I., Powell, W. (2017). Genomic prediction unifies animal and plant breeding programs to form platforms for biological discovery. Nat. Genet. 49, 1297–1303. doi: 10.1038/ng.3920

PubMed Abstract | CrossRef Full Text | Google Scholar

Holt, R. C., Cordy, J. R. (1988). The Turing programming language. Commun. ACM 31, 1410–1423. doi: 10.1145/53580.53581

CrossRef Full Text | Google Scholar

Hüllermeier, E., Waegeman, W. (2021). Aleatoric and epistemic uncertainty in machine learning: An introduction to concepts and methods. Mach. Learn. 110, 457–506. doi: 10.1007/s10994-021-05946-3

CrossRef Full Text | Google Scholar

Humplík, J. F., Dostál, J., Ugena, L., Spíchal, L., De Diego, N., Vencálek, O., et al. (2020). Bayesian approach for analysis of time-to-event data in plant biology. Plant Methods 16, 14. doi: 10.1186/s13007-020-0554-1

PubMed Abstract | CrossRef Full Text | Google Scholar

Ifrim, G., Barbu, M., Horincar, G., Titica, M. (2021). “Data-driven multivariable control of a microalgae growth process,” in 2021 25th International Conference on System Theory, Control and Computing (ICSTCC) (Iasi, Romania), 321–326. doi: 10.1109/ICSTCC52150.2021.9607111

CrossRef Full Text | Google Scholar

Ingraham, J., Riesselman, A., Sander, C., Marks, D. (2019). “Learning protein structure with a differentiable simulator,” in 7th International Conference on Learning Representations, ICLR 2019. 1–24.

Google Scholar

Innes, M., Edelman, A., Fischer, K., Rackauckas, C., Saba, E., Shah, V. B., et al. (2019). A differentiable programming system to bridge machine learning and scientific computing. doi: 10.48550/arXiv.1907.07587

PubMed Abstract | CrossRef Full Text | Google Scholar

Izzo, D., Biscani, F., Mereta, A. (2016). Differentiable genetic programming. doi: 10.48550/arXiv.1611.04766. Dataset.

CrossRef Full Text | Google Scholar

Jans-Singh, M., Leeming, K., Choudhary, R., Girolami, M. (2020). Digital twin of an urban-integrated hydroponic farm. Data-Centric Eng. 1, e20. doi: 10.1017/DCE.2020.21

CrossRef Full Text | Google Scholar

Jiao, W., Liu, Q., Gao, L., Liu, K., Shi, R., Ta, N. (2020). Computational fluid dynamics-based simulation of crop canopy temperature and humidity in double-film solar greenhouse. J. Sensors 2020, e8874468. doi: 10.1155/2020/8874468

CrossRef Full Text | Google Scholar

Kang, M., Wang, F.-Y. (2017). From parallel plants to smart plants: Intelligent control and management for plant growth. IEEE/CAA J. Automatica Sin. 4, 161–166. doi: 10.1109/JAS.2017.7510487

CrossRef Full Text | Google Scholar

Karniadakis, G. E., Kevrekidis, I. G., Lu, L., Perdikaris, P., Wang, S., Yang, L. (2021). Physicsinformed machine learning. Nat. Rev. Phys. 3, 422–440. doi: 10.1038/s42254-021-00314-5

CrossRef Full Text | Google Scholar

Kempes, C. P., Koehl, M. A., West, G. B. (2019). The scales that limit: The physical boundaries of evolution. Front. Ecol. Evol. 7. doi: 10.3389/fevo.2019.00242

CrossRef Full Text | Google Scholar

Koksal, A. S., Pu, Y., Srivastava, S., Bodik, R., Fisher, J., Piterman, N. (2013). “Synthesis of biological models from mutation experiments,” in Proceedings of the 40th Annual ACM SIGPLAN-SIGACT Symposium on Principles of Programming Languages (New York, NY, USA: Association for Computing Machinery), 469–482, POPL ‘13. doi: 10.1145/2429069.2429125

CrossRef Full Text | Google Scholar

Korte, A., Farlow, A. (2013). The advantages and limitations of trait analysis with GWAS: A review. Plant Methods 9, 29. doi: 10.1186/1746-4811-9-29

PubMed Abstract | CrossRef Full Text | Google Scholar

Lacasa, J., Gaspar, A., Hinds, M., Jayasinghege Don, S., Berning, D., Ciampitti, I. A. (2020). Bayesian approach for maize yield response to plant density from both agronomic and economic viewpoints in North America. Sci. Rep. 10, 15948. doi: 10.1038/s41598-020-72693-1

PubMed Abstract | CrossRef Full Text | Google Scholar

Lavin, A., Zenil, H., Paige, B., Krakauer, D., Gottschlich, J., Mattson, T., et al. (2021). Simulation intelligence: Towards a new generation of scientific methods. doi: 10.48550/arXiv.2112.03235

CrossRef Full Text | Google Scholar

Lenaerts, B., Collard, B. C. Y., Demont, M. (2019). Review: Improving global food security through accelerated plant breeding. Plant Sci. 287, 110207. doi: 10.1016/j.plantsci.2019.110207

PubMed Abstract | CrossRef Full Text | Google Scholar

Li, T.-M., Aittala, M., Durand, F., Lehtinen, J. (2018). Differentiable Monte Carlo ray tracing through edge sampling. ACM Trans. Graphics 37, 222:1–222:11. doi: 10.1145/3272127.3275109

CrossRef Full Text | Google Scholar

Li, W., Zhu, D., Wang, Q. (2022). A single view leaf reconstruction method based on the fusion of ResNet and differentiable render in plantgrowthdigitaltwinsystem. Comput. Electron. Agric. 193, 106712. doi: 10.1016/j.compag.2022.106712

CrossRef Full Text | Google Scholar

Liu, J., Gao, F., Zhang, J. (2020a). Gumbel-softmax optimization: A simple general framework for combinatorial optimization problems on graphs. Stud. Comput. Intell. 881, 879–890. doi: 10.1007/978-3-030-36687-273

CrossRef Full Text | Google Scholar

Liu, Y., Kumar, M., Katul, G. G., Feng, X., Konings, A. G. (2020b). Plant hydraulics accentuates the effect of atmospheric moisture stress on transpiration. Nat. Climate Change 10, 691–695. doi: 10.1038/s41558-020-0781-5

CrossRef Full Text | Google Scholar

Louwaars, N. P. (2018). Plant breeding and diversity: A troubled relationship? Euphytica 214, 114. doi: 10.1007/s10681-018-2192-5

PubMed Abstract | CrossRef Full Text | Google Scholar

Ma, Y., Gowda, S., Anantharaman, R., Laughman, C., Shah, V., Rackauckas, C. (2022). ModelingToolkit: A composable graph transformation system for equation-based modeling. doi: 10.48550/arXiv.2103.05244. Dataset.

CrossRef Full Text | Google Scholar

Mahmood, F., Govindan, R., Bermak, A., Yang, D., Al-Ansari, T. (2023). Data-driven robust model predictive control for greenhouse temperature control and energy utilisation assessment. Appl. Energy 343, 121190. doi: 10.1016/j.apenergy.2023.121190

CrossRef Full Text | Google Scholar

Marin, J.-M., Pudlo, P., Robert, C. P., Ryder, R. J. (2012). Approximate Bayesian computational methods. Stat Computing 22, 1167–1180. doi: 10.1007/s11222-011-9288-2

CrossRef Full Text | Google Scholar

McLane, A. J., Semeniuk, C., McDermid, G. J., Marceau, D. J. (2011). The role of agent-based models in wildlife ecology and management. Ecol. Model. 222, 1544–1556. doi: 10.1016/j.ecolmodel.2011.01.020

CrossRef Full Text | Google Scholar

Merrell, D., Gitter, A. (2020). Inferring signaling pathways with probabilistic programming. Bioinformatics 36, i822–i830. doi: 10.1093/bioinformatics/btaa861

PubMed Abstract | CrossRef Full Text | Google Scholar

Mila, A. L., Carriquiry, A. L. (2004). Bayesian analysis in plant pathology. Phytopathology® 94, 1027–1030. doi: 10.1094/PHYTO.2004.94.9.1027

PubMed Abstract | CrossRef Full Text | Google Scholar

Moghadam, P., Lowe, T., Edwards, E. (2020). Digital twin for the future of orchard production systems. Proceedings 36, 92. doi: 10.3390/PROCEEDINGS2019036092

CrossRef Full Text | Google Scholar

Monti, C., Pangallo, M., De Francisci Morales, G., Bonchi, F. (2023). On learning agent-based models from data. Sci. Rep. 13, 9268. doi: 10.1038/s41598-023-35536-3

PubMed Abstract | CrossRef Full Text | Google Scholar

Moreta, L. S., Al-Sibahi, A. S., Theobald, D., Bullock, W., Rommes, B. N., Manoukian, A., et al. (2019). “A probabilistic programming approach to protein structure superposition,” in 2019 IEEE Conference on Computational Intelligence in Bioinformatics and Computational Biology (CIBCB). 1–5. doi: 10.1109/CIBCB.2019.8791469

CrossRef Full Text | Google Scholar

Nelson, N., Stubbs, C. J., Larson, R., Cook, D. D. (2019). Measurement accuracy and uncertainty in plant biomechanics. J. Exp. Bot. 70, 3649–3658. doi: 10.1093/jxb/erz279

PubMed Abstract | CrossRef Full Text | Google Scholar

Núñez, M., Barreiro, N. L., Barrio, R. A., Rackauckas, C. (2023). Forecasting virus outbreaks with social media data via neural ordinary differential equations. Sci. Rep. 13, 10870. doi: 10.1038/s41598-023-37118-9

PubMed Abstract | CrossRef Full Text | Google Scholar

Oborny, B. (2019). The plant body as a network of semi-autonomous agents: A review. Philos. Trans. R. Soc. B: Biol. Sci. 374, 20180371. doi: 10.1098/rstb.2018.0371

CrossRef Full Text | Google Scholar

Pearl, J., Mackenzie, D. (2018). The Book of Why: The New Science of Cause and Effect (New York City: Basic Books).

Google Scholar

Peng, B., Guan, K., Tang, J., Ainsworth, E. A., Asseng, S., Bernacchi, C. J., et al. (2020). Towards a multiscale crop modelling framework for climate change adaptation assessment. Nat. Plants 6, 338–348. doi: 10.1038/s41477-020-0625-3

PubMed Abstract | CrossRef Full Text | Google Scholar

Pugh, J. K., Soros, L. B., Stanley, K. O. (2016). Quality diversity: A new frontier for evolutionary computation. Front. Robotics AI 3. doi: 10.3389/frobt.2016.00040

CrossRef Full Text | Google Scholar

Purcell, W., Neubauer, T. (2023). Digital twins in agriculture: A state-of-the-art review. Smart Agric. Technol. 3, 100094. doi: 10.1016/j.atech.2022.100094

CrossRef Full Text | Google Scholar

Rackauckas, C., Ma, Y., Martensen, J., Warner, C., Zubov, K., Supekar, R., et al. (2021). Universal differential equations for scientific machine learning. doi: 10.48550/arXiv.2001.04385. Dataset.

CrossRef Full Text | Google Scholar

Raissi, M., Yazdani, A., Karniadakis, G. E. (2020). Hidden fluid mechanics: Learning velocity and pressure fields from flow visualizations. Sci. (New York N.Y.) 367, 1026–1030. doi: 10.1126/science.aaw4741

CrossRef Full Text | Google Scholar

Retkute, R., Townsend, A. J., Murchie, E. H., Jensen, O. E., Preston, S. P. (2018). Three-dimensional plant architecture and sunlit–shaded patterns: A stochastic model of light dynamics in canopies. Ann. Bot. 122, 291–302. doi: 10.1093/aob/mcy067

PubMed Abstract | CrossRef Full Text | Google Scholar

Ritchie, D., Thomas, A., Hanrahan, P., Goodman, N. D. (2016). “Neurally-guided procedural models: Amortized inference for procedural graphics programs using neural networks,” in Proceedings of the 29th Conference on Neural Information Processing Systems.

Google Scholar

Romero-Cuellar, J., Francés, F. (2023). How to assess climate change impact models: Uncertainty analysis of streamflow statistics via approximate Bayesian computation (ABC). Hydrological Sci. J. 68, 1611–1626. doi: 10.1080/02626667.2023.2231437

CrossRef Full Text | Google Scholar

Ronquist, F., Kudlicka, J., Senderov, V., Borgström, J., Lartillot, N., Lundén, D., et al. (2020). Universal probabilistic programming: A powerful new approach to statistical phylogenetics. bioRxiv 4, 244. doi: 10.1101/2020.06.16.154443

CrossRef Full Text | Google Scholar

Saint Cast, C., Lobet, G., Cabrera-Bosquet, L., Couvreur, V., Pradal, C., Tardieu, F., et al. (2022). Connecting plant phenotyping and modelling communities: Lessons from science mapping and operational perspectives. silico Plants 4, diac005. doi: 10.1093/insilicoplants/diac005

CrossRef Full Text | Google Scholar

Scheffler, M., Aeschlimann, M., Albrecht, M., Bereau, T., Bungartz, H.-J., Felser, C., et al. (2022). FAIR data enabling new horizons for materials research. Nature 604, 635–642. doi: 10.1038/s41586-022-04501-x

PubMed Abstract | CrossRef Full Text | Google Scholar

Schölkopf, B., Locatello, F., Bauer, S., Ke, N. R., Kalchbrenner, N., Goyal, A., et al. (2021). Towards causal representation learning. Proc. IEEE 109, 612–634. doi: 10.1109/JPROC.2021.3058954

CrossRef Full Text | Google Scholar

Schoot, R., Depaoli, S., King, R., Kramer, B., Mamp, K., Tadesse, M. G., et al. (2021). Bayesian statistics and modelling. Nat. Rev. Methods Primers 1, 1. doi: 10.1038/s43586-020-00001-2

CrossRef Full Text | Google Scholar

Shen, C., Appling, A. P., Gentine, P., Bandai, T., Gupta, H., Tartakovsky, A., et al. (2023). Differentiable modelling to unify machine learning and physical models for geosciences. Nat. Rev. Earth Environ. 4, 552–567. doi: 10.1038/s43017-023-00450-9

CrossRef Full Text | Google Scholar

Silva, J. V., Giller, K. E. (2020). Grand challenges for the 21st century: What crop models can and can’t (yet) do. J. Agric. Sci. 158, 794–805. doi: 10.1017/S0021859621000150

CrossRef Full Text | Google Scholar

Skobelev, P. O., Mayorov, I. V., Simonova, E. V., Goryanin, O. I., Zhilyaev, A. A., Tabachinskiy, A. S., et al. (2020). Development of models and methods for creating a digital twin of plant within the cyber-physical system for precision farming management. J. Physics: Conf. Ser. 1703, 12022. doi: 10.1088/1742-6596/1703/1/012022

CrossRef Full Text | Google Scholar

Slob, N., Hurst, W., Slob, N., Hurst, W. (2022). Digital twins and industry 4.0 technologies for agricultural greenhouses. doi: 10.3390/SMARTCITIES5030059

CrossRef Full Text | Google Scholar

Slob, N., Hurst, W., van de Zedde, R., Tekinerdogan, B. (2023). Virtual reality-based digital twins for greenhouses: A focus on human interaction. Comput. Electron. Agric. 208, 107815. doi: 10.1016/j.compag.2023.107815

CrossRef Full Text | Google Scholar

Stan Development Team (2023). Stan Modeling Language Users Guide and Reference Manual. Available at: https://mc-stan.org.

Google Scholar

Stanley, K., Lehman, J. (2015). Why Greatness Cannot Be Planned: The Myth of the Objective (Springer).

Google Scholar

Stanley, K. O., Lehman, J., Soros, L. (2017). Open-endedness: The last grand challenge you’ve never heard of. Available at: https://www.oreilly.com/radar/open-endedness-the-last-grand-challenge-youvenever-heard-of/.

Google Scholar

Stock, M., Gorochowski, T. (2023). Open-endedness in synthetic biology: a route to continual innovation for biological design. Sci. Adv. doi: 10.31219/osf.io/wv5ac

CrossRef Full Text | Google Scholar

Tao, F., Rötter, R. P., Palosuo, T., Gregorio Hernández Díaz-Ambrona, C., Mínguez, M. I., Semenov, M. A., et al. (2018). Contribution of crop model structure, parameters and climate projections to uncertainty in climate change impact assessments. Global Change Biol. 24, 1291–1307. doi: 10.1111/gcb.14019

CrossRef Full Text | Google Scholar

Tardieu, F., Granato, I. S. C., Van Oosterom, E. J., Parent, B., Hammer, G. L. (2020). Are crop and detailed physiological models equally ‘mechanistic’ for predicting the genetic variability of wholeplant behaviour? The nexus between mechanisms and adaptive strategies. silico Plants 2, diaa011. doi: 10.1093/insilicoplants/diaa011

CrossRef Full Text | Google Scholar

Technow, F., Messina, C. D., Totir, L. R., Cooper, M. (2015). Integrating crop growth models with whole genome prediction through approximate bayesian computation. PloS One 10, e0130855. doi: 10.1371/journal.pone.0130855

PubMed Abstract | CrossRef Full Text | Google Scholar

Van Huffel, K., Stock, M., Ruttink, T., De Baets, B. (2022). Covering the combinatorial design space of multiplex CRISPR/cas experiments in plants. Front. Plant Sci. 13. doi: 10.3389/fpls.2022.907095

PubMed Abstract | CrossRef Full Text | Google Scholar

van Schijndel, L., Snoek, B. L., ten Tusscher, K. (2022). Embodiment in distributed information processing: “Solid” plants versus “liquid” ant colonies. Quantitative Plant Biol. 3, e27. doi: 10.1017/qpb.2022.22

CrossRef Full Text | Google Scholar

Verdouw, C., Tekinerdogan, B., Beulens, A. J. M., Wolfert, S. (2021). Digital twins in smart farming. Agric. Syst. 189, 1–19. doi: 10.1016/J.AGSY.2020.103046

CrossRef Full Text | Google Scholar

Wallach, D., Palosuo, T., Thorburn, P., Hochman, Z., Gourdain, E., Andrianasolo, F., et al. (2021). The chaos in calibrating crop models: Lessons learned from a multi-model calibration exercise. Environ. Model. Softw. 145, 105206. doi: 10.1016/j.envsoft.2021.105206

CrossRef Full Text | Google Scholar

Walter, A., Liebisch, F., Hund, A. (2015). Plant phenotyping: From bean weighing to image analysis. Plant Methods 11, 14. doi: 10.1186/s13007-015-0056-8

PubMed Abstract | CrossRef Full Text | Google Scholar

Whalley, W., Binley, A., Watts, C., Shanahan, P., Dodd, I., Ober, E., et al. (2017). Methods to estimate changes in soil water for phenotyping root activity in the field. Plant Soil 415, 407–422. doi: 10.1007/s11104-016-3161-1

PubMed Abstract | CrossRef Full Text | Google Scholar

Wilkinson, M. D., Dumontier, M., Aalbersberg, I. J., Appleton, G., Axton, M., Baak, A., et al. (2016). The FAIR Guiding Principles for scientific data management and stewardship. Sci. Data 3, 160018. doi: 10.1038/sdata.2016.18

PubMed Abstract | CrossRef Full Text | Google Scholar

Zhang, B., DeAngelis, D. L. (2020). An overview of agent-based models in plant biology and ecology. Ann. Bot. 126, 539–557. doi: 10.1093/aob/mcaa043

PubMed Abstract | CrossRef Full Text | Google Scholar

Zhang, N., Zhou, X., Kang, M., Hu, B.-G., Heuvelink, E., Marcelis, L. F. M. (2023). Machine learning versus crop growth models: An ally, not a rival. AoB Plants 15, plac061. doi: 10.1093/aobpla/plac061

PubMed Abstract | CrossRef Full Text | Google Scholar

Ziegler, C., Dyson, R. J., Johnston, I. G. (2019). Model selection and parameter estimation for root architecture models using likelihood-free inference. J. R. Soc. Interface 16, 20190293. doi: 10.1098/rsif.2019.0293

PubMed Abstract | CrossRef Full Text | Google Scholar

Keywords: simulation intelligence, digital agriculture, phenotyping, quantified plant, digital twin, modeling, scientific computing, artificial intelligence

Citation: Stock M, Pieters O, De Swaef T and wyffels F (2024) Plant science in the age of simulation intelligence. Front. Plant Sci. 14:1299208. doi: 10.3389/fpls.2023.1299208

Received: 22 September 2023; Accepted: 07 December 2023;
Published: 16 January 2024.

Edited by:

Lars Hendrik Wegner, Foshan University, China

Reviewed by:

Renata Retkute, University of Cambridge, United Kingdom

Copyright © 2024 Stock, Pieters, De Swaef and wyffels. This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.

*Correspondence: Michiel Stock, bWljaGllbC5zdG9ja0B1Z2VudC5iZQ==

Disclaimer: All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article or claim that may be made by its manufacturer is not guaranteed or endorsed by the publisher.