Skip to main content

ORIGINAL RESEARCH article

Front. Microbiol., 21 May 2024
Sec. Microbiotechnology

Lentinula edodes substrate formulation using multilayer perceptron-genetic algorithm: a critical production checkpoint

Naser Safaie
Naser Safaie1*Mina SalehiMina Salehi2Siamak FarhadiSiamak Farhadi3Ali AligholizadehAli Aligholizadeh1Valiollah MahdizadehValiollah Mahdizadeh1
  • 1Department of Plant Pathology, Faculty of Agriculture, Tarbiat Modares University, Tehran, Iran
  • 2Department of Plant Genetics and Breeding, Faculty of Agriculture, Tarbiat Modares University, Tehran, Iran
  • 3Seed and Plant Improvement Institute, Agricultural Research, Education and Extension Organization (AREEO), Karaj, Iran

Shiitake (Lentinula edodes) is one of the most widely grown and consumed mushroom species worldwide. They are a potential source of food and medicine because they are rich in nutrients and contain various minerals, vitamins, essential macro- and micronutrients, and bioactive compounds. The reuse of agricultural and industrial residues is crucial from an ecological and economic perspective. In this study, the running length (RL) of L. edodes cultured on 64 substrate compositions obtained from different ratios of bagasse (B), wheat bran (WB), and beech sawdust (BS) was recorded at intervals of 5 days after cultivation until the 40th day. Multilayer perceptron-genetic algorithm (MLP-GA), multiple linear regression, stepwise regression, principal component regression, ordinary least squares regression, and partial least squares regression were used to predict and optimize the RL and running rate (RR) of L. edodes. The statistical values showed higher prediction accuracies of the MLP-GA models (92% and 97%, respectively) compared with those of the regression models (52% and 71%, respectively) for RL and RR. The high degree of fit between the forecasted and actual values of the RL and RR of L. edodes confirmed the superior performance of the developed MLP-GA models. An optimization analysis on the established MLP-GA models showed that a substrate containing 15.1% B, 45.1% WB, and 10.16% BS and a running time of 28 days and 10 h could result in the maximum L. edodes RL (10.69 cm). Moreover, the highest RR of L. edodes (0.44 cm d−1) could be obtained by a substrate containing 30.7% B, 90.4% WB, and 0.0% BS. MLP-GA was observed to be an effective method for predicting and consequently selecting the best substrate composition for the maximal RL and RR of L. edodes.

1 Introduction

The cultivation of the mushrooms not only lessens the environmental effects of the wastes used as substrate but also provides an economically viable alternative for producing high-quality and beneficial food and precious metabolites (Israilides and Philippoussis, 2003). Shiitake, Lentinula edodes (Berk.) Pegler, has been cultivated for thousands of years. L. edodes is one of the best-known medicinal and edible mushrooms worldwide, especially in East Asia, due to its widespread use in food and traditional medicines (Xu et al., 2014; Mahdizadeh et al., 2021; Ahmad et al., 2023). This valuable medicinal mushroom is used to treat influenza, tumors, high blood pressure, heart disease, obesity, age-related sexual dysfunction, liver problems, diabetes, fatigue, weakness, high cholesterol, and respiratory diseases (Bisen et al., 2010; Ahmad et al., 2023). Shiitake is nourishing and appetizing as a food and exhibit several pharmacological properties, including antiviral, immunomodulatory, antifungal, antibacterial, antiviral and anticancer effects (Bisen et al., 2010; Diallo et al., 2020; García-Cruz et al., 2020; Ahmad et al., 2023). The lignan-rich fraction extracted from L. edodes mycelial culture medium is promising for the treatment of AIDS and hepatitis B (Bisen et al., 2010).

The preferred substrate for L. edodes is hardwood sawdust supplemented with rice bran, wheat bran and millet powder (Philippoussis et al., 2007). Bagasse substrate supplemented with rice bran improved mushroom quality and biological efficiency of shiitake (Rossi et al., 2003). Bagasse is an important solid residue in the sugarcane industry, and hundreds of millions of tons of bagasse are produced worldwide every year (Mahmud and Anannya, 2021). The use of bagasse (Selman-Housein et al., 2000) and other agricultural residues (Devi et al., 2017), including wheat bran, is crucial from both environmental and economic perspectives. Beech sawdust is obtained in abundance from the timber industry (Jahedi et al., 2023). The reuse of these solid wastes is also crucial for environmental reasons (Gordić et al., 2014; Jahedi et al., 2023). In this study, different ratios of bagasse (B), beech sawdust (BS), and wheat bran (WB) were used as growing substrates to determine the compatibility of agricultural and industrial residues for shiitake production. The ratio of substrate components should be optimized to achieve the best running of L. edodes. A detailed analysis of the effects of the ratio of growing substrate components on the running length (RL) and running rate (RR) of L. edodes and their optimal quantity selection would lay the foundation for the commercialization of the conversion of agro-industrial residues into foods with therapeutic active ingredients. However, optimizing the ratio of growing substrate components to achieve the maximum running is expensive and time-consuming. Evaluating the relationship between the input variables “ratios of substrate components (B, BS, and WB) and running time” and the output variables “RL and RR of L. edodes” could simplify the optimization of the growing substrate for production of this valuable medicinal and edible mushroom. Analyses of biological data have been conducted using multivariate statistical methods, including multiple linear regression (MLR), stepwise regression (SR), principal component regression (PCR), ordinary least squares regression (OLSR), and partial least squares regression (PLSR; Wold, 1966; Farhadi et al., 2020; Salehi et al., 2020b). Growth is a highly nonlinear and complex biological process involving multiple interconnected signaling and biochemical pathways (Olas et al., 2020). The optimal conditions for a complex process can be accurately predicted using efficient nonlinear computational algorithms (Struik et al., 2005; Gallego et al., 2011). Traditional prediction and modeling methods, including regression models, have negligible nonlinear fitting and prediction capabilities (Farhadi et al., 2020; Salehi et al., 2020a,b). Artificial intelligence (AI) is used to address issues that cannot be explained using traditional computing techniques. One of the key aspects of AI is artificial neural networks (ANNs), which discover multifactorial nonlinear relationships between the output and input data (Farhadi et al., 2020; Hesami et al., 2020; Hesami and Jones, 2020; Salehi et al., 2020a; Jahedi et al., 2023). ANNs are brain-inspired systems that mimic the human brain’s ability to sense and think in a simplified manner to process information and recognize patterns in systems that involve ambiguity and uncertainty (Patnaik, 1999; Agatonovic-Kustrin and Beresford, 2000). ANNs learn through experience and gain intelligence by discovering hidden patterns and relationships (Agatonovic-Kustrin and Beresford, 2000). Response surface methodology (Kovalchuk et al., 2017), multivariate adaptive regression splines (Akin et al., 2020), and multilayer perceptron-genetic algorithm (MLP-GA; Jahedi et al., 2023) have been used for culture media optimization in plant and fungal research. Multilayer perceptron (MLP) is an effective tool in solving complicated non-linear problems, handles the massive data set well, quickly predicts after training, and obtains the same accuracy even with small data. Disadvantages of MLP include that the degree to which the dependent variable influences each independent variable is unknown, some hyperparameters of the MLP, such as the number of hidden neurons and layers, must be tuned, which requires time and power, model performance depends on the training quality, and the MLP is sensitive to feature scaling (Pedregosa et al., 2011). MLP, one of the most well-known feedforward neural networks, exhibits superior prediction accuracy compared to traditional statistical methods for calculating mathematical functions for analyzing and interpreting various unpredictable data sets (Salehi et al., 2020a; Jahedi et al., 2023). However, there are numerous problems in designing and training ANNs. Due to a large number of hidden neurons, the training time increases and data overfitting occurs. Low accuracy rates are caused by a few hidden neurons (Matignon, 2005). In addition, one of the main problems is the allocation of weights in the MLP structure which directly affects the model performance. Network topology and learning algorithm parameters control the weights. Learning rates, the number of memory taps, and the number of hidden nodes and layers are the network factors that can influence ANN performance (Tahmasebi and Hezarkhani, 2009). To address these issues, ANNs have been combined with other optimization techniques, such as genetic algorithms (GA; Plumb et al., 2005; Salehi et al., 2020a).

GA, developed by Holland (1992), is a popular evolutionary algorithm that excels in finding answers to problems and has been used to optimize bioprocesses (Osama et al., 2015; Farhadi et al., 2020; Salehi et al., 2020a; Jahedi et al., 2023). The GA is a search algorithm inspired by genetics and natural selection (Holland, 1992). GAs apply biologically inspired operators, including crossover, mutation, and selection, to obtain high-quality optimization solutions (Katoch et al., 2021). The principles of GA are the creation of the initial population of search solutions (chromosomes), followed by selecting the superior search solutions for crossover using a roulette wheel selection method and determining the best solution (optimal value, fittest chromosome) among them (Holland, 1992). GA obtains solutions improving over time, requires no derivative information, displays superior parallel capabilities, optimizes continuous functions and discrete functions, and multi-objective problems, is the best option for a wide range of optimization problems, and can handle a broad search space. However, GA is not an efficient method for solving simple problems and ones with available derivative information. Additionally, GAs may be computationally costly, especially when working with large problem spaces or complex fitness evaluations. Also, the quality of the final solution found using GA to a problem is not guaranteed (Sivanandam and Deepa, 2008). This study attempted to automatically determine the optimal number of neurons using GA to improve the prediction accuracy and determine the optimized values of the inputs “(B, BS, and WB ratio)” for the maximum mycelial RL and RR of shiitake (L. edodes). Furthermore, the performances of the regression and MLP-GA models were evaluated in terms of the prediction accuracy of the output variables optimized for the maximum mycelial RL and RR of L. edodes, and the most important inputs (factors) were detected for the highest RL and RR of shiitake.

2 Materials and methods

Shiitake (L. edodes) strain VM230 was obtained from the culture collection of the Plant Pathology Department at Tarbiat Modares University (Mahdizadeh et al., 2021).

2.1 Preparation of media

Bagasse (B), beech sawdust (BS) and wheat bran (WB) were used as components for the growing substrate. First, they were pulverized with a blender and passed through a sieve (pore size of 5 mm in diameter; Supplementary Figure S1). Different ratios of substrate components (Table 1) were mixed manually. It is noteworthy that the ratios reported for each component (B, WB, and BS; Table 1) are percentages of 10 g of each component. Each substrate was poured halfway (1.4 mL) into a glass Pasteur pipette (0.6 mm diameter, 11.5 cm height, and 2.8 mL volume). The Pasteur pipettes were then placed horizontally so that the substrate was placed on one side of the glass pipette and the other side was left empty for ventilation. The Pasteur pipettes were placed in a tray and some distilled water was added. They were then kept in the dark at 25°C for 12 h so that the substrate in the Pasteur pipettes completely absorbed the water. A sample of the substrates was weighed, gently dried in an oven at 45°C, and then weighed again to determine the final water content (55%). The substrates were then autoclaved twice at 121°C and 1.5 atm for 40 min at an interval of 24 h. One mycelial agar plug (4 mm diameter) per replicate was cut from the edge of the growing shiitake (L. edodes) colony previously cultured on the culture medium prepared from the extract “B (25%) + WB (50%) + BS (25%),” using a sterilized cork-borer, cultured in the Pasteur pipette (Supplementary Figure S2) containing the different ratios of substrate components listed in Table 1 and kept in the dark at 25°C. The RL (cm) of L. edodes cultured on each substrate was recorded at 5-day intervals after cultivation until the day 40. The running rate (RR; cm d−1) of L. edodes was calculated using Eq. (1).

Runningratecmday1=RunninglengthcmRunningtimeday    (1)
Table 1
www.frontiersin.org

Table 1. Different substrate compositions for the cultivation of shiitake (Lentinula edodes) and the day (after cultivation) on which the recorded highest statistically significant running length (RL).

2.2 Experimental design

The experiment was conducted based on a randomized complete block design (RCBD) with factorial arrangement containing four factors: the bagasse ratio with four levels, i.e., 25% (2.5 g), 50% (5 g), 75% (7.5), and 100% (10 g), the wheat bran ratio with four levels (25%, 50%, 75%, and 100%), the beech sawdust ratio with four levels (25%, 50%, 75%, and 100%), and the running time with eight levels (days 5, 10, 15, 20, 25, 30, 35, and 40 after cultivation), given 512 treatments, and four replicates. The substrate compositions was listed in Table 1.

2.3 Statistical analysis

Parametric statistics were used because the normality hypothesis and homogeneity of variance were satisfied. Analysis of variance (ANOVA) was used to assess the factorial experiment based on the randomized complete block design (RCBD). Least significant difference was used to compare the growing substrate compositions (64 substrate compositions are presented in Table 1) in a time-dependent manner to identify the highest significant RL of L. edodes during the running time for each substrate. Mean comparisons were performed using Student–Newman–Keuls for the factorial experiments (examining the effects of different ratios of substrate components on the RL and RR of L. edodes in B-ratio-, WB-ratio-, and BS-ratio-dependent manners).

2.4 Model development

Data were normalized using the Box–Cox transformation (Box and Cox, 1964) before the machine learning algorithm was tested. No outliers were identified by the principal component analysis. The performance of each tested model on the dataset (2,048 and 256 data lines for RL and RR, respectively) was calculated using an eight-fold cross-validation method with 10 repeats, and the model with the highest prediction accuracy for unknown data from the dataset was determined.

2.4.1 Multilayer perceptron model

Three-layered MLP modeling was used to determine the effects of the ratio of substrate components, namely B, BS, and WB, and running time on the RL and RR of L. edodes.

MLP, a supplement to the feedforward neural network, consists of three distinct layers: input, hidden, and output layers, as shown in Figure 1. The input layer (visible layer) feeds the input variables into the second layer (hidden layer). Between input and output lie one or more intermediate layers, so-called hidden layers, which form the internal brain of the network. The output layer predicts the output based on the information transmitted by the input layer. The predicted output is compared with the actual output and the error is calculated. Based on this error, the network weights are updated via the intermediate layer from the output layer to the input layer (Singhal and Sharma, 2023).

Figure 1
www.frontiersin.org

Figure 1. Schematic of multilayer perceptron (MLP) architecture.

2.4.2 Genetic algorithm

The GA was employed (i) to optimize the MLP architecture design, including the optimal number of neurons, and (ii) to determine the optimal values of the input variables (B, BS, and WB ratios and running time) in the established MLP-GA models for the maximum RL and RR of shiitake.

Holland (1992) proposed GA as a search strategy based on principles of natural selection (Goldberg, 1989; Sakawa, 2002; Guo et al., 2010). The GA begins with a population of random initial solutions (Figure 2). Each individual in a population is called a chromosome, which represents a possible solution to a problem. Chromosomes evolve over a series of iterations called generations. Chromosomes were assessed during each generation using fitness measurements (Figure 2). The next generation is created by crossing two chromosomes from the current generation or by changing one chromosome through a mutation, creating new chromosomes called offspring (Verma and Verma, 2012). To maintain a stable population size, a new generation is created by selecting parents and offspring based on fitness values and rejecting others. Fitter chromosomes have a higher chance of being selected. The best chromosome, which likely reflects the optimal solution to the problem, is obtained after several generations using algorithms (Guo et al., 2010).

Figure 2
www.frontiersin.org

Figure 2. Schematic representation of genetic algorithm (GA) as an evolutionary optimization algorithm.

The population size, crossover probability and mutation probability are the most important GA parameters. A generation number of 500, a crossover rate of 0.85, a mutation rate of 0.01, and an initial population of 50 (Haupt and Haupt, 2004; Abramson, 2007) were set to develop the fittest MLP structure and optimize the input variables (the ratios of B, BS, and WB, and running time) for the maximum output variables (the RL and RR of L. edodes).

The optimization strategy for finding the optimal MLP architecture and optimal values of input variables for the maximum RL and RR of L. edodes using the GA in the developed MLP-GA models is shown in Figure 3.

Figure 3
www.frontiersin.org

Figure 3. Flowchart of integrating multilayer perceptron (MLP) with genetic algorithm (GA) to optimize of the MLP architecture and input values to achieve the maximum level of each output.

The performance of the MLP-GA models was evaluated using the coefficient of determination (R2), root mean square error (RMSE) and mean absolute error (MAE), as previously reported (Farhadi et al., 2020; Salehi et al., 2020a,b).

2.4.3 Sensitivity analysis of the models

The MLP-GA models were subjected to sensitivity analysis to ascertain the degree of importance of the model input variables (ratios of B, BS, WB, and running time) on the model output variables, i.e., the RL and RR of L. edodes. The criterion used to determine the sensitivity of the ratios of B, BS, and WB and running time was the variable sensitivity error (VSE) value showing the performance (RMSE) of the MLP-GA model when a particular input variable was not available in the model. The variable sensitivity ratio (VSR) was derived as the ratio of the VSE and MLP-GA model errors (RMSE value) when all input variables were accessible. The estimated VSR values were then rescaled to fall within the range [0,1]. The highest importance variable of the model was the input variable with the highest VSR (Farhadi et al., 2020; Salehi et al., 2020a, 2021).

The MLP-GA and regression models were developed and evaluated mathematically using MATLAB (R2010a; MATLAB, 2010) and XLSTAT (XLSTAT, 2017), respectively, and graphics were created using GraphPad Prism 9 (2020).

2.5 Validation experiment

The ratios of bagasse, wheat bran, and beech sawdust and running time optimized by the GA were examined to assess the efficiency of the MLP-GA model in forecasting and optimizing the RL and RR of L. edodes.

3 Results

3.1 Effect of running time on Lentinula edodes running length

To determine the relationship between the running length (RL) and running time of L. edodes, the RL of shiitake cultured on growing substrate compositions (Table 1) was assessed at 5-d intervals after cultivation until the 40th day. Overall, L. edodes RL increased during the running time and its highest significant level on S25/50/0, S25/100/0, S25/100/25, S25/100/50, S25/100/100, S50/50/25, S50/50/50, S50/50/100, S50/100/0, S50/100/25, S100/100/25, S100/100/50, and S100/100/100 was recorded on 25th day after cultivation (Table 1). Accordingly, the highest statistically significant RL of L. edodes cultured on S0/25/0, S0/25/25, S0/25/50, S0/25/100, S0/50/0, S0/50/25, S0/50/50, S0/50/100, S0/100/0, S0/100/25, S0/100/50, S0/100/100, S25/0/100, S25/25/0 S25/25/25, S25/25/50, S25/25/100, S25/50/25, S25/50/50, S25/50/100, S50/0/50, S50/0/100, S50/25/0, S50/25/25, S50/25/50, S50/25/100, S50/50/0, S50/100/50, S50/100/100, S100/25/0, S100/25/25, S100/25/50, S100/25/100, S100/50/25, S100/50/50, S100/50/100, and S100/100/0 was measured 30 days after cultivation. Also, the highest significant RL of shiitake cultured on S25/0/0, S25/0/25, S25/0/50, S50/0/0, S50/0/25, and S100/50/0 was observed 35 days after cultivation. Besides, the highest statistically significant RL of L. edodes cultured on S0/0/25, S0/0/50, S0/0/100, S100/0/0, S100/0/25, S100/0/50, S100/0/100 was recorded on 40th day after cultivation (Table 1).

3.2 Effects of different substrate compositions on Lentinula edodes running length

The effects of different growing substrate compositions on running length (RL) of L. edodes were explored in running-time- (L. edodes culture age), B-ratio-, WB-ratio-, and BS-ratio-dependent manners. As mentioned previously, the highest significant level of L. edodes RL was measured on 64 substrates. Accordingly, these time points were considered as a benchmark for the RL of L. edodes and were used in the subsequent statistical analysis. ANOVA showed that the main effects of the factors “B, WB and BS ratios” and their interactions (reciprocal and trilateral effects) on the RL (Supplementary Table S1) and RR (Supplementary Table S2) of L. edodes were highly significant. This suggests that the bagasse ratio affected the RL and RR of L. edodes differently at each ratio of wheat bran and beech sawdust or vice versa (i.e., the ratio of wheat bran and beech sawdust affected the RL and RR of L. edodes differently at each bagasse ratio). To examine these significant interaction effects, the B ratio was further analyzed on each ratio of WB and BS (Tables 2, 3).

Table 2
www.frontiersin.org

Table 2. Effects of different ratios of growing substrate components on the running length (cm) of shiitake (Lentinula edodes).

Table 3
www.frontiersin.org

Table 3. Effects of the different ratios of growing substrate components on the running rate (cm d−1) of shiitake (Lentinula edodes).

A mean comparison of the RL and RR of L. edodes cultured on the 64 substrates (Table 1) is presented in Tables 2, 3, respectively. Among the 64 substrate compositions used for L. edodes cultivation, the highest RL of L. edodes was recorded in substrates “S0/100/100, S0/50/0, S50/100/50, S25/50/25, S100/25/25, and S0/100/0” (Table 2). As shown in Table 3, S25/50/0, S25/100/0, S25/100/25, S25/100/50, S50/50/25, S50/100/0, S50/100/25, S100/100/50, and S100/100/100 exhibited the highest RR of L. edodes cultures on the different substrate compositions (Table 1).

3.3 Regression analysis

Mushroom growth is influenced by the type and ratio of substrate components (Ashrafuzzaman et al., 2009). Accordingly, optimizing the growing substrate composition is the main step to maximize the mycelial RL and RR of L. edodes. Predicting the optimal ratio of substrate components and running time is crucial for large-scale production of L. edodes.

Different regression models (MLR, SR, OLSR, PCR and PLSR) were evaluated to determine the best regression method for predicting the RL and RR of L. edodes in response to the ratio of substrate components and the running time. All the regression models displayed statistically significant relationships between the input variables (ratios of B, BS, WB, and running time) and output variables “L. edodes RL and RR” (Table 4). The statistics of the developed MLR, SR, OLSR, PCR, and PLSR models for the RL and RR of L. edodes for the ratios of B, BS, and WB and running time are presented in Table 4. The goodness-of-fit, measured by R2, showed no difference in the predictive performance of the regression models developed for the RL and RR of L. edodes in the training and testing subsets (Table 4). The prediction accuracies of the MLR, SR, OLSR, PCR, and PLSR models were determined by plotting the actual (observed) values against the predicted values of the training subset (RL: R2 = 0.71, 0.70, 0.71, 0.71, and 0.71, respectively; RR: R2 = 0.52, 0.51, 0.52, 0.52, and 0.52, respectively; Figures 4, 5). No differences were observed in the prediction performance of the developed MLR, SR, OLSR, PCR, and PLSR models for the RL and RR of L. edodes (Figures 4, 5). The R2 values for the testing subset showed that the developed models accounted for 71% and 52% variability in RL and RR of L. edodes, respectively (Table 4).

Table 4
www.frontiersin.org

Table 4. Statistics on multiple linear regression (MLR), stepwise regression (SR), principal component regression (PCR), ordinary least squares regression (OLSR), partial least squares regression (PLSR), and multilayer perceptron-genetic algorithm (MLP-GA) models for the running length and running rate of Lentinula edodes cultivated on different substrates obtained from different ratios of bagasse, wheat bran, and beech sawdust.

Figure 4
www.frontiersin.org

Figure 4. Scatter plot of actual data vs. predicted values of the running length of Shiitake (Lentinula edodes) using multiple linear regression (MLR), stepwise regression (SR), principal component regression (PCR), ordinary least squares regression (OLSR), partial least squares regression (PLSR), and multilayer perceptron-genetic algorithm (MLP-GA) models in the training subset. The solid line shows a fitted simple regression line for scatter points.

Figure 5
www.frontiersin.org

Figure 5. Scatter plot of actual data vs. predicted values of shiitake (Lentinula edodes) running rate using multiple linear regression (MLR), stepwise regression (SR), principal component regression (PCR), ordinary least squares regression (OLSR), partial least squares regression (PLSR), and multilayer perceptron-genetic algorithm (MLP-GA) models in the training subset. The solid line shows a fitted simple regression line for scatter points.

3.4 Multilayer perceptron-genetic algorithm analysis

First, the ratios of B, BS, and WB, and running time were used as input variables, and running length (RL) and running rate (RR) of L. edodes were used as output variables. The output variables were then predicted according to the established MLP-GA models. The performance of the developed MLP-GA models was evaluated by plotting the predicted values vs. the actual values for training (Figures 4, 5) and testing subsets (Table 4). Excellent agreement between the forecast and observed values of L. edodes RL and RR was observed for both the training and testing subsets (Figures 4, 5; Table 4).

The goodness-of-fit criterion results of the established MLP-GA models showed that the established models could accurately (R2 = 0.97, and 0.92) predict RL and RR of shiitake (L. edodes) in the testing subset, which was not utilized in the training processes of the MLP-GA models (Table 4). Furthermore, the established MLP-GA models exhibited balanced statistical values for the training and testing subsets (Table 4).

3.5 Sensitivity analysis of the models

VSRs were computed using all the data lines (training and testing subsets: 2,048 and 256 data points for the RL and RR, respectively) to list the input variables according to their relative importance in the model. VSRs were calculated for each of the output variables (L. edodes RL and RR) for the ratios of B, BS, and WB and running time (Table 5). Analysis of the RL model revealed that L. edodes RL was most sensitive to running time (VSR = 1.000), followed by WB ratio (VSR = 0.535), B ratio (VSR = 0.008), and WB ratio (VSR = 0.000). Accordingly, the RR of L. edodes was more sensitive to the WB ratio (VSR = 1.000), followed by the B ratio (VSR = 0.062) and the BS ratio (VSR = 0.000; Table 5).

Table 5
www.frontiersin.org

Table 5. Importance (according to the sensitivity analysis) and optimal levels of the different factors, including bagasse ratio, wheat bran ratio, beech sawdust ratio, for achieving the maximum running length and running rate of Lentinula edodes cultured on different substrates using multilayer perceptron-genetics algorithm (MLP-GA) models.

3.6 Mathematical optimization

The MLP-GA was linked to the GA to find the optimal levels of the input variables (ratios of B, BS, and WB and running time) for achieving the maximum RL and RR of L. edodes. The optimization analysis on the established MLP-GA models revealed that the substrate composition of 15.1% (1.51 g) bagasse, 45.1% (4.51 g) wheat bran and 10.16% (1.01 g) beech sawdust and the running time of 28 days and 10 h could result in the maximum L. edodes RL (10.69 cm; Table 5). The highest RR of L. edodes (0.44 cm d−1) could be obtained by a substrate containing 30.7% (3.07 g) bagasse, and 90.4% (9.04 g) wheat bran (Table 5).

3.7 Comparison of MLP-GA and regression models

The statistical values showed a higher prediction accuracy of the MLP-GA models compared with that of the regression models in terms of the calculated R2 for MLP-GA vs. regression models: RL = 0.97 vs. 0.71, and RR = 0.92 vs. 0.52 (Table 4).

3.8 Validation experiment

The RL of L. edodes cultured on S15/45/10 containing B (1.51 g), WB (4.51 g), and BS (1.01 g) for 29 d (optimized input variables in the MLP-GA model using the GA) was measured to be 10.81 cm ± 0.35 cm. Furthermore, L. edodes RR of 0.448 cm d−1 ± 0.039 was obtained on S31/90/0 containing B (3.07 g), WB (9.04 g), and BS (0.00 g).

4 Discussion

Forecasting the optimal ratio of substrate components is essential to enhance shiitake production and reduce costs. This is the first report to establish a mathematical model for the prediction of the RL and RR of L. edodes according to the ratios of growing substrate components (B, WB, and BS) and running time, as well as the optimization of growing substrate composition to achieve the maximum RL and RR. An accurate modeling system is required to forecast the optimal ratio of substrate components (B, BS, and WB) and running time for the maximum RL and RR of L. edodes. Various regression models and MLP-GA modeling were used to examine the relationships among the four input variables “ratios of B, WB, and BS and running time” and the output variables “RL and RR,” and the prediction probability of RL and RR using the studied input variables. Such mathematical forecasts have not yet been reported for valuable medicinal and edible mushrooms. Previous studies (Salehi et al., 2020a,b, 2021; Jahedi et al., 2023) have also indicated that MLP-GA models have superior accuracy compared with regression models (Figures 4, 5; Table 4). R-squared (Table 4), a measure of the fit of regression models for the testing subset, showed that these models could describe 71 and 52% variability in the RL and RR of shiitake, respectively, for unseen data. The results of the present study suggest that the developed MLP-GA models correctly predicted the RL and RR of L. edodes (R2 = 0.97 and 0.92, respectively; Table 4) in the testing subset, which was not utilized in the MLP-GA training process. The closeness of the errors of the training and testing subsets (Table 4), as well as the small number of hidden neurons confirmed the absence of overlearning during training and the good generalizability of the developed MLP-GA models for unseen data (Lou and Nakai, 2001; Farhadi et al., 2020; Salehi et al., 2020a, 2021). From the statistical metrics including R2 and RMSE of the training and testing subsets (Table 4), it can be inferred that the tansig activation function worked well for modeling throughout the experiment. Furthermore, the small RMSE values (Table 4) show the great potential of the developed MLP-GA models for predicting the parameters (output variables). Mycelium running is a colonization of the substrate thoroughly by fungal hyphae. The fungus first colonizes the entire substrate and then creates new hyphae by lateral branching, increasing the density and surface area of the colony. Fungi thrive in both nutrient-rich and nutrient-poor culture media. However, their lateral branching occurs only in the nutrient-rich medium, like what is observed in sparse and dense growth of fungal hyphae on water agar and potato dextrose agar. This could explain why the optimum substrate compositions for RL and RR are different.

Regardless of previous studies on the effects of growing substrate composition and running time on RL and RR of L. edodes, the question remains: which input variables are most important for RL and RR of L. edodes? As mentioned previously, sensitivity analysis showed that running time was the most important variable affecting mycelial RL of L. edodes (Table 5). The RR of L. edodes is important for commercial cultivation. Sensitivity analysis revealed that WB ratio was the most important factor affecting RR of L. edodes (Table 5). Shiitake running is a complex bioprocess that requires reliable methods for the model development and optimization. The MLP-GA has been successfully used to solve problems in various areas that are highly challenging and for which there is no known answer (Eftekhari et al., 2018; Salehi et al., 2020a; Jahedi et al., 2023). The growing interest in ANNs is primarily due to their ability to solve problems in various domains, their ability to model complex and nonlinear relationships, their ability to predict relationships on unseen data, and the fact that they do not require the specification of statistical data distribution (Mahanta, 2017). Due to the high prediction performance of the testing subset (Table 4), the established MLP-GA correctly predicted the RL and RR of L. edodes.

5 Conclusion

In this study, the RL and RR of L. edodes were modeled and optimized using mathematical methods for the first time. The high degree of fit between the forecasted and actual values of the output variables (the RL and RR of L. edodes) confirmed the superior performance of the developed MLP-GA models. This study presents MLP-GA as a useful mathematical tool for predicting and optimizing complex systems such as the growth of medical mushrooms, and the RL and RR of L. edodes in response to the composition of the growing substrate.

Data availability statement

The original contributions presented in the study are included in the article/Supplementary material, further inquiries can be directed to the corresponding author.

Author contributions

NS: Conceptualization, Investigation, Methodology, Project administration, Resources, Supervision, Writing – review & editing. MS: Conceptualization, Formal analysis, Investigation, Methodology, Validation, Visualization, Writing – original draft, Writing – review & editing. SF: Formal analysis, Investigation, Software, Visualization, Writing – review & editing. AA: Investigation, Methodology, Writing – review & editing. VM: Resources, Writing – review & editing.

Funding

The authors acknowledge the Iran National Science Foundation (INSF, no. 95004326) and the Research Deputy of Tarbiat Modares University, Tehran, for their financial support of this research project.

Conflict of interest

The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.

The author(s) declared that they were an editorial board member of Frontiers, at the time of submission. This had no impact on the peer review process and the final decision.

Publisher’s note

All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article, or claim that may be made by its manufacturer, is not guaranteed or endorsed by the publisher.

Supplementary material

The Supplementary material for this article can be found online at: https://www.frontiersin.org/articles/10.3389/fmicb.2024.1366264/full#supplementary-material

References

Abramson, M. (2007). Genetic algorithm and direct search toolbox user’s guide. Natick, MA: The MathWorks Inc.

Google Scholar

Agatonovic-Kustrin, S., and Beresford, R. (2000). Basic concepts of artificial neural network (ANN) modeling and its application in pharmaceutical research. J. Pharm. Biomed. Anal. 22, 717–727. doi: 10.1016/S0731-7085(99)00272-1

PubMed Abstract | Crossref Full Text | Google Scholar

Ahmad, I., Arif, M., Xu, M., Zhang, J., Ding, Y., and Lyu, F. (2023). Therapeutic values and nutraceutical properties of shiitake mushroom (Lentinula edodes): A review. Trends Food Sci. Technol. 134, 123–135. doi: 10.1016/j.tifs.2023.03.007

Crossref Full Text | Google Scholar

Akin, M., Eyduran, S. P., Eyduran, E., and Reed, B. M. (2020). Analysis of macronutrient related growth responses using multivariate adaptive regression splines. Plant Cell Tiss. Org. Cult. 140, 661–670. doi: 10.1007/s11240-019-01763-8

Crossref Full Text | Google Scholar

Ashrafuzzaman, M., Kamruzzaman, A., Ismail, M. R., Shahidullah, S., and Fakir, S. (2009). Substrate affects growth and yield of shiitake mushroom. Afr. J. Biotechnol. 8, 2999–3006.

Google Scholar

Bisen, P., Baghel, R. K., Sanodiya, B. S., Thakur, G. S., and Prasad, G. (2010). Lentinus edodes: a macrofungus with pharmacological activities. Curr. Med. Chem. 17, 2419–2430. doi: 10.2174/092986710791698495

PubMed Abstract | Crossref Full Text | Google Scholar

Box, G. E., and Cox, D. R. (1964). An analysis of transformations. J. R. Stat. Soc. Series B Stat. Methodology 26, 211–243.

Google Scholar

Devi, S., Gupta, C., Jat, S. L., and Parmar, M. S. (2017). Crop residue recycling for economic and environmental sustainability: The case of India. Open Agric. 2:486–494. doi: 10.1515/opag-2017-0053

Crossref Full Text | Google Scholar

Diallo, I., Boudard, F., Morel, S., Vitou, M., Guzman, C., Saint, N., et al. (2020). Antioxidant and anti-inflammatory potential of shiitake culinary-medicinal mushroom, Lentinus edodes (Agaricomycetes), sporophores from various culture conditions. Int. J. Med. Mushrooms. 22, 535–546. doi: 10.1615/IntJMedMushrooms.2020034864

PubMed Abstract | Crossref Full Text | Google Scholar

Eftekhari, M., Yadollahi, A., Ahmadi, H., Shojaeiyan, A., and Ayyari, M. (2018). Development of an artificial neural network as a tool for predicting the targeted phenolic profile of grapevine (Vitis vinifera) foliar wastes. Front. Plant Sci. 9:837. doi: 10.3389/fpls.2018.00837

Crossref Full Text | Google Scholar

Farhadi, S., Salehi, M., Moieni, A., Safaie, N., and Sabet, M. S. (2020). Modeling of paclitaxel biosynthesis elicitation in Corylus avellana cell culture using adaptive neuro-fuzzy inference system-genetic algorithm (ANFIS-GA) and multiple regression methods. PLoS One 15:e0237478. doi: 10.1371/journal.pone.0237478

PubMed Abstract | Crossref Full Text | Google Scholar

Gallego, P. P., Gago, J., and Landín, M. (2011). “Artificial neural networks technology to model and predict plant biology process” in Artificial neural networks-methodological and biomedical applications. ed. K. Suzuki (Croatia: Intech Open Access Publisher), 197–216.

Google Scholar

García-Cruz, F., Durán-Páramo, E., Garín-Aguilar, M., del Toro, G. V., and Chairez, I. (2020). Parametric characterization of the initial pH effect on the polysaccharides production by Lentinula edodes in submerged culture. Food Bioprod. Process. 119, 170–178. doi: 10.1016/j.fbp.2019.10.016

Crossref Full Text | Google Scholar

Goldberg, D. E. (1989). Genetic algorithms in search. Optimization, and machine learning. Addison-Wesley Longman Publishing Co. Inc., Boston, MA, USA.

Google Scholar

Gordić, D., Babić, M., Jelić, D., Konćalović, D., and Vukašinović, V. (2014). Integrating energy and environmental management in wood furniture industry. Sci. World J. 2014:596958. doi: 10.1155/2014/596958

PubMed Abstract | Crossref Full Text | Google Scholar

Guo, P., Wang, X., and Han, Y., (2010). The enhanced genetic algorithms for the optimization design, In: 2010 3rd international conference on biomedical engineering and informatics. IEEE, pp. 2990–2994.

Google Scholar

Haupt, R. L., and Haupt, S. E. (2004). Practical genetic algorithms. Studies in Computational Intelligence. John Wiley & Sons, Inc., Hoboken, NJ, USA.

Google Scholar

Hesami, M., and Jones, A. M. P. (2020). Application of artificial intelligence models and optimization algorithms in plant cell and tissue culture. Appl. Microbiol. Biotechnol. 104, 9449–9485. doi: 10.1007/s00253-020-10888-2

PubMed Abstract | Crossref Full Text | Google Scholar

Hesami, M., Naderi, R., and Tohidfar, M. (2020). Introducing a hybrid artificial intelligence method for high-throughput modeling and optimizing plant tissue culture processes: the establishment of a new embryogenesis medium for chrysanthemum, as a case study. Appl. Microbiol. Biotechnol. 104, 10249–10263. doi: 10.1007/s00253-020-10978-1

PubMed Abstract | Crossref Full Text | Google Scholar

Holland, J. H. (1992). Genetic algorithms. Sci. Am. 267, 66–73. doi: 10.1038/scientificamerican0792-66

Crossref Full Text | Google Scholar

Israilides, C., and Philippoussis, A. (2003). Bio-technologies of recycling agro-industrial wastes for the production of commercially important fungal polysaccharides and mushrooms. Biotechnol. Genet. Eng. Rev. 20, 247–260. doi: 10.1080/02648725.2003.10648045

PubMed Abstract | Crossref Full Text | Google Scholar

Jahedi, A., Salehi, M., Goltapeh, E. M., and Safaie, N. (2023). Multilayer perceptron-genetic algorithm as a promising tool for modeling cultivation substrate of Auricularia cornea native to Iran. PLoS One 18:e0281982. doi: 10.1371/journal.pone.0281982

PubMed Abstract | Crossref Full Text | Google Scholar

Katoch, S., Chauhan, S. S., and Kumar, V. (2021). A review on genetic algorithm: past, present, and future. Multimed. Tools Appl. 80, 8091–8126. doi: 10.1007/s11042-020-10139-6

PubMed Abstract | Crossref Full Text | Google Scholar

Kovalchuk, I. Y., Mukhitdinova, Z., Turdiyev, T., Madiyeva, G., Akin, M., Eyduran, E., et al. (2017). Modeling some mineral nutrient requirements for micropropagated wild apricot shoot cultures. Plant Cell Tiss. Org. Cult. 129, 325–335. doi: 10.1007/s11240-017-1180-0

Crossref Full Text | Google Scholar

Lou, W., and Nakai, S. (2001). Artificial neural network-based predictive model for bacterial growth in a simulated medium of modified-atmosphere-packed cooked meat products. J. Agric. Food Chem. 49, 1799–1804. doi: 10.1021/jf000650m

PubMed Abstract | Crossref Full Text | Google Scholar

Mahanta, J. (2017). Introduction to Neural Networks, Advantages and Applications. Towards Data Sci KDnuggets News 17:n28, 96851bd1a207.

Google Scholar

Mahdizadeh, V., Safaie, N., Salehi, M., and Jahedi, A. (2021). Substrate preference of shiitake Lentinula edodes (Berk.) Pegler strains. J. Crop Prot 10, 63–74.

Google Scholar

Mahmud, M. A., and Anannya, F. R. (2021). Sugarcane bagasse-a source of cellulosic fiber for diverse applications. Heliyon 7:7771. doi: 10.1016/j.heliyon.2021.e07771

PubMed Abstract | Crossref Full Text | Google Scholar

Matignon, R. (2005). Neural network modeling using SAS enterprise miner : AuthorHouse, Bloomington, Ind, USA.

Google Scholar

MATLAB (2010). 7.10. 0 (R2010a). Natick, Massachusetts: The MathWorks Inc.

Google Scholar

Olas, J. J., Fichtner, F., and Apelt, F. (2020). All roads lead to growth: imaging-based and biochemical methods to measure plant growth. J. Exp. Bot. 71, 11–21. doi: 10.1093/jxb/erz406

PubMed Abstract | Crossref Full Text | Google Scholar

Osama, K., Mishra, B. N., and Somvanshi, P. (2015). Machine learning techniques in plant biology. The Omics of plant science: PlantOmics, 731–754.

Google Scholar

Patnaik, P. (1999). Applications of neural networks to recovery of biological products. Biotechnol. Adv. 17, 477–488. doi: 10.1016/S0734-9750(99)00013-0

PubMed Abstract | Crossref Full Text | Google Scholar

Pedregosa, F., Varoquaux, G., Gramfort, A., Michel, V., Thirion, B., Grisel, O., et al. (2011). Scikit-learn: machine learning in Python. J Machine Learn Res 12, 2825–2830.

Google Scholar

Philippoussis, A., Diamantopoulou, P., and Israilides, C. (2007). Productivity of agricultural residues used for the cultivation of the medicinal fungus Lentinula edodes. Int. Biodeter. Biodegr. 59, 216–219. doi: 10.1016/j.ibiod.2006.10.007

PubMed Abstract | Crossref Full Text | Google Scholar

Plumb, A. P., Rowe, R. C., York, P., and Brown, M. (2005). Optimisation of the predictive ability of artificial neural network (ANN) models: a comparison of three ANN programs and four classes of training algorithm. Eur. J. Pharm. Sci. 25:395–405. doi: 10.1016/j.ejps.2005.04.010

Crossref Full Text | Google Scholar

Rossi, I. H., Monteiro, A. C., Machado, J. O., Andrioli, J. L., and Barbosa, J. C. (2003). Shiitake (Lentinula edodes) production on a sterilized bagasse substrate enriched with rice bran and sugarcane molasses. Braz. J. Microbiol. 34, 66–71. doi: 10.1590/S1517-83822003000100014

Crossref Full Text | Google Scholar

Sakawa, M. (2002). Foundations of Genetic Algorithms. In: Genetic Algorithms and Fuzzy Multiobjective Optimization. Operations Research/Computer Science Interfaces Series Springer, Boston, MA. 14 doi: 10.1007/978-1-4615-1519-7_2

Crossref Full Text | Google Scholar

Salehi, M., Farhadi, S., Moieni, A., Safaie, N., and Ahmadi, H. (2020a). Mathematical modeling of growth and paclitaxel biosynthesis in Corylus avellana cell culture responding to fungal elicitors using multilayer perceptron-genetic algorithm. Front. Plant Sci. 11:1148. doi: 10.3389/fpls.2020.01148

Crossref Full Text | Google Scholar

Salehi, M., Farhadi, S., Moieni, A., Safaie, N., and Hesami, M. (2021). A hybrid model based on general regression neural network and fruit fly optimization algorithm for forecasting and optimizing paclitaxel biosynthesis in Corylus avellana cell culture. Plant Methods 17, 1–13. doi: 10.1186/s13007-021-00714-9

Crossref Full Text | Google Scholar

Salehi, M., Moieni, A., Safaie, N., and Farhadi, S. (2020b). Whole fungal elicitors boost paclitaxel biosynthesis induction in Corylus avellana cell culture. PLoS One 15:e0236191. doi: 10.1371/journal.pone.0236191

Crossref Full Text | Google Scholar

Selman-Housein, G., Lopez, M. A., Ramos, O., Carmona, E. R., Arencibia, A. D., Menendez, E., et al., (2000). Towards the improvement of sugarcane bagasse as raw material for the production of paper pulp and animal feed, plant genetic engineering: Towards the third millennium. In: Proceedings of the international symposium on plant genetic engineering, Havana, Cuba, 6–10 December, 1999. Elsevier Science Publishers, pp. 189–193.

Google Scholar

Singhal, A., and Sharma, D. K. (2023). Voice signal-based disease diagnosis using IoT and learning algorithms for healthcare, implementation of smart healthcare systems using AI, IoT, and Blockchain. Academic press, Elsevier, USA. 59–81. doi: 10.1016/B978-0-323-91916-6.00005-9

Crossref Full Text | Google Scholar

Sivanandam, S. N., and Deepa, S. N. (2008). “Genetic algorithms” in Introduction to genetic algorithms. eds. S. N. Sivanandam and S. N. Deepa (Berlin: Heidelberg, Springer), 15–37.

Google Scholar

Struik, P. C., Yin, X., and de Visser, P. (2005). Complex quality traits: now time to model. Trends Plant Sci. 10, 513–516. doi: 10.1016/j.tplants.2005.09.005

PubMed Abstract | Crossref Full Text | Google Scholar

Tahmasebi, P., and Hezarkhani, A. (2009). Application of optimized neural network by genetic algorithm, IAMG09. California: Stanford University.

Google Scholar

Verma, G., and Verma, V. (2012). Role and applications of genetic algorithm in data mining. Int. J. Comput. Appl. 48, 5–8. doi: 10.5120/7438-0267

Crossref Full Text | Google Scholar

Wold, H. (1966). Multivariate analysis edited by: Krishnaiah pr. New York, Wiley: Academic Press.

Google Scholar

XLSTAT (2017). Data analysis and statistical solution for Microsoft excel. Paris: Addinsoft.

Google Scholar

Xu, X., Yan, H., Tang, J., Chen, J., and Zhang, X. (2014). Polysaccharides in Lentinus edodes: isolation, structure, immunomodulating activity and future prospective. Crit. Rev. Food Sci. Nutr. 54, 474–487. doi: 10.1080/10408398.2011.587616

Crossref Full Text | Google Scholar

Keywords: Shiitake, medicinal and edible mushroom, running, optimization, artificial neural network

Citation: Safaie N, Salehi M, Farhadi S, Aligholizadeh A and Mahdizadeh V (2024) Lentinula edodes substrate formulation using multilayer perceptron-genetic algorithm: a critical production checkpoint. Front. Microbiol. 15:1366264. doi: 10.3389/fmicb.2024.1366264

Received: 05 January 2024; Accepted: 19 April 2024;
Published: 21 May 2024.

Edited by:

Seung Gu Shin, Gyeongsang National University, Republic of Korea

Reviewed by:

Elsherbiny A. Elsherbiny, Mansoura University, Egypt
Meleksen Akin, Iğdır Üniversitesi, Türkiye

Copyright © 2024 Safaie, Salehi, Farhadi, Aligholizadeh and Mahdizadeh. This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.

*Correspondence: Naser Safaie, bnNhZmFpZUBtb2RhcmVzLmFjLmly

Disclaimer: All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article or claim that may be made by its manufacturer is not guaranteed or endorsed by the publisher.