- 1Institute of Molecular and Cell Biology, University of Tartu, Tartu, Estonia
- 2Department of Plant Pathology, Agricultural College and Research Institute, Tamil Nadu Agricultural University, Killikulam, Tuticorin, India
- 3Department of Soil Science and Agricultural Chemistry, Agricultural College and Research Institute, Tamil Nadu Agricultural University, Killikulam, Tuticorin, India
Agricultural application of plant-beneficial bacteria to improve crop yield and alleviate the stress caused by environmental conditions, pests, and pathogens is gaining popularity. However, before using these bacterial strains in plant experiments, their environmental stress responses and plant health improvement potential should be examined. In this study, we explored the applicability of three unsupervised machine learning-based data integration methods, including principal component analysis (PCA) of concatenated data, multiple co-inertia analysis (MCIA), and multiple kernel learning (MKL), to select osmotic stress-tolerant plant growth-promoting (PGP) bacterial strains isolated from the rice phyllosphere. The studied datasets consisted of direct and indirect PGP activity measurements and osmotic stress responses of eight bacterial strains previously isolated from the phyllosphere of drought-tolerant rice cultivar. The production of phytohormones, such as indole-acetic acid (IAA), gibberellic acid (GA), abscisic acid (ABA), and cytokinin, were used as direct PGP traits, whereas the production of hydrogen cyanide and siderophore and antagonistic activity against the foliar pathogens Pyricularia oryzae and Helminthosporium oryzae were evaluated as measures of indirect PGP activity. The strains were subjected to a range of osmotic stress levels by adding PEG 6000 (0, 11, 21, and 32.6%) to their growth medium. The results of the osmotic stress response experiments showed that all bacterial strains accumulated endogenous proline and glycine betaine (GB) and exhibited an increase in growth, when osmotic stress levels were increased to a specific degree, while the production of IAA and GA considerably decreased. The three applied data integration methods did not provide a similar grouping of the strains. Especially deviant was the ordination of microbial strains based on the PCA of concatenated data. However, all three data integration methods indicated that the strains Bacillus altitudinis PB46 and B. megaterium PB50 shared high similarity in PGP traits and osmotic stress response. Overall, our results indicate that data integration methods complement the single-table data analysis approach and improve the selection process for PGP microbial strains.
Introduction
Global food production has been significantly affected by climate change and the evolution of pests and pathogens (FAO, 2022). Rice is one of the world’s most important food crops, but it is highly vulnerable to numerous abiotic and biotic stresses (Sandhu et al., 2020). Plants have several mechanisms to adapt to and establish tolerance and resistance to stress (Vats, 2018; Chauhan et al., 2022; Hidangmayum et al., 2022; Sachdev and Ansari, 2022). One such mechanism is symbiosis, wherein beneficial microorganisms living in or on plant organs directly and/or indirectly support plant growth and protect them from biotic and abiotic stresses (Shinwari et al., 2019). Given this beneficial effect, more attention has been paid to rice microbiome to increase crop yield and achieve sustainable agricultural goals. The microbial community composition of the rice rhizosphere has been extensively investigated using both culture-dependent and -independent approaches (Ding et al., 2019). Although the role of rhizobacteria in crop health improvement under abiotic stress has been widely reported (Ayuso-Calles et al., 2021; Saxena et al., 2021), the contribution of rice phyllosphere and spermosphere microbial communities and populations to this process is only partially understood (Kim and Lee, 2020).
The phyllosphere refers to a plant’s total aboveground surface, representing an unstable habitat, where microbes are subjected to extreme and highly variable environmental factors, such as light intensity, ultraviolet (UV) radiation, temperature, and dryness. Bacteria dominate this habitat, and their number estimated either by cultivation or direct microscopy can be up to 107 cells per cm2 on plant leaf surfaces (Vorholt, 2012; Remus-Emsermann and Schlechter, 2018). Microbial genetic and metabolic competence has been shown to help bacteria overcome and survive extreme environments (Sessitsch et al., 2012; Shu and Huang, 2022). Venkatachalam et al. (2016) demonstrated that the adaptability and functionality of culturable rice microbial communities were related to their diversity and abundance in the phyllosphere, while their ability to stimulate plant growth was strongly influenced by the rice cultivation method. Bacillus species can be highly resistant to extreme abiotic stress factors, such as UV radiation, high and low temperatures, and dryness, by forming resistant endospores and having higher survival rates than other bacterial species in the phyllosphere (Setlow, 2014; Saleem et al., 2017).
Under high temperatures and desiccation stress, microorganisms regularly encounter osmotic stress in the phyllosphere habitat. Under osmotic stress, most bacteria produce compatible osmolytes, such as proline, glycine betaine (GB), ectoine, trehalose, and sucrose (Santos and Da Costa, 2002). Proline reduces osmotic stress by acting as a chemical chaperone that directly breaks down the reactive oxygen species produced during stress and by means of an indirect mechanism that activates the signaling pathways that promote cell survival (Liang et al., 2013). In contrast, quaternary ammonium compounds, such as GB and choline, serve as osmoprotectants. Choline is a precursor of GB, and during osmotic stress, a certain level of choline is oxidized to GB by choline oxidase and betaine aldehyde dehydrogenase (Fitzsimmons et al., 2012). It has been demonstrated that plant growth-promoting bacteria (PGPB) that accumulate osmolytes reduce plant salinity stress (Upadhyay et al., 2011, 2012). Therefore, to understand the osmotic stress tolerance of microorganisms, it is essential to investigate their osmolyte production potential.
The levels of phytohormones in plants significantly affect their growth by regulating metabolism and defense mechanisms (Egamberdieva et al., 2017). Many plant-associated bacteria can produce beneficial phytohormones, mainly indole-acetic acid (IAA), gibberellic acid (GA), cytokinin, and abscisic acid (ABA; Belimov et al., 2014; Nutaratat et al., 2017; Baliyan et al., 2022; Mekureyaw et al., 2022). Some microbes support host plants indirectly by preventing the growth and infestation of pests and pathogens. Such microbes produce a variety of compounds that inhibit the growth of competing organisms. Some of the most important antagonistic mechanisms involve the production of hydrogen cyanide (HCN), which affects cellular respiration and siderophore production, leads to iron chelation, and limits iron accessibility to pathogens (Sayyed et al., 2013; Muthukumar et al., 2022; Singh et al., 2022). Such behavior of beneficial bacteria against host pathogens can suppress diseases in in vitro and in planta (Ramakrishna et al., 2019; Mahmud et al., 2021). Therefore, the application of phyllosphere bacteria that can produce phytohormones, have antagonistic activity against pathogens, and tolerate osmotic stress can be greatly beneficial in improving rice health through the alleviation of biotic and abiotic stress.
To select the most promising strains for plant applications, the phenotypic and, less often, the genomic data related to PGP characteristics of microbial strains are explored by applying statistical and exploratory analyses. The data obtained for microbial strains are most often examined using univariate statistical methods including t-tests, ANOVA, and linear models. Less frequently, univariate data analysis methods are complemented with multivariate approaches, such as cluster analysis or principal component analysis, which allow the elucidation of patterns among strains and relationships with their properties (da Costa et al., 2014). When several types of datasets, such as PGP traits and abiotic and biotic stress response parameters, are generated to characterize the potential of PGP-microbial strains for improving plant growth and health, an integration of these datasets is needed to facilitate the simultaneous identification of important phenotypic and genomic features during strain selection.
If the properties of microbial strains are assessed using different measurement methods and experimental conditions, the data produced by a particular method can be considered single-view data. It is possible to fuse microbial strain data from different perspectives using multi-view learning (MVL), which utilizes the consensual and complementary information between different views of the same set of microbial strains. MVL, also known as data fusion or integration from multiple feature sets, is an emerging direction in multi-view machine learning that can be used to improve generalization performance (Zhao et al., 2017; Li et al., 2018). Picard et al. (2021) recently delineated data integration methods into five different integration strategies (early, mixed, intermediate, late, and hierarchical). Data integration can be applied in two ways, depending on the nature of datasets: horizontal integration, which studies the same parameters across different microbial strains, and vertical integration, which examines multiple sets of variables on the same set of strains. Generally, machine learning methods are highly effective in data integration when datasets are appropriately transformed and combined (Picard et al., 2021; Cai et al., 2022).
Thus far, data integration methods have been applied to multi-omics datasets, where the combination of different layers of molecular information obtained for plant-related microbial communities has been analyzed (Kaul et al., 2016). The suitability of this data analysis approach for characterizing and selecting PGP microbial strains using various types of datasets has not yet been explored. We hypothesized that the application of data integration methods can enhance the selection of PGP microbial strains for plant application if multiple PGP-related datasets are measured for these strains.
The main aim of this study was to test the applicability of three unsupervised machine learning-based data integration methods for simultaneous grouping and trait evaluation of osmotic stress-tolerant PGP bacterial strains isolated from the rice phyllosphere. In addition, a set of univariate and multivariate data analysis methods conventionally used for this purpose was implemented.
Materials and methods
Study design
To assess PGP traits and osmotic stress responses in eight bacterial strains isolated from the phyllosphere of drought-tolerant rice varieties grown in Paramakudi, Tamil Nadu, India (Arun et al., 2020; Devarajan et al., 2021), two sets of different assessments were conducted. The following strains (genbank1 accession numbers in brackets) were included in this study: Bacillus endophyticus PB3 (MK969113), B. australimaris PB17 (MK979279), B. pumilus PB18 (MK979280), B. safensis PB23 (MK979280), Staphylococcus sciuri PB24 (MK994020), B. altitudinis PB37 (MK994020), B. altitudinis PB46 (MK979282), and B. megaterium PB50 (MK979284). The flowchart shows the study design in detail (Figure 1).
Figure 1. The flow chart depicts the basic workflow used in the current study to select the best phyllosphere bacterial strains for use in rice drought alleviation by screening the plant growth-promoting traits and osmotic stress response of bacterial strains. Obtained datasets were analyzed first using univariate and multivariate exploratory analyses. In the next step, the obtained datasets were integrated and analyzed using three unsupervised machine learning techniques: principal component analysis, multiple kernel learning, and multiple co-inertia analysis.
Assessment of PGP traits
Production of phytohormones
To estimate the production of bacterial phytohormones, the strains were inoculated in triplicate into tryptic soy broth (TSB) and allowed to grow at 37°C for 24 h. The growth temperature for bacterial strains in TSB broth was set at 37°C for all studies. This temperature was selected based on the air temperature (42°C) in this region and the fact that the crop leaf surface temperature is expected to be 5°C lower (37°C) than the atmospheric temperature (Deva et al., 2020). Extraction, purification, and quantitative determination of IAA, GA, zeatin, and ABA from different strains were carried out using the methods described by Karadeniz et al. (2006). Briefly, phytohormones were extracted from the bacterial supernatant using ethyl acetate and thin-layer chromatography (TLC) was developed using a mixture of isopropanol/ammonia/distilled water (10:1:1 v/v/v). IAA, GA, zeatin, and ABA bands and Rf (retardation factor) values of the samples were visualized under 254 nm of UV light according to the standards of IAA, GA, zeatin, and ABA. The detected bands were scraped from TLC plates and dissolved in methanol. The purified samples were analyzed by ultra-high-performance liquid chromatography (UHPLC) using an evaporative light scattering detector (ELSD) system with reversed phase C-18 column (Shimadzu, Japan) at an isocratic flow rate of 0.5 min ml−1 at 40°C. The wavelengths were 280, 208, 254, and 265 nm for IAA, GA, zeatin, and ABA, respectively (HiMedia, India), and the total duration for the detection of each hormone was approximately 15 min. The results of all phytohormone concentrations are expressed as μg ml−1.
Production of siderophores
Siderophores production was evaluated in all bacterial strains. Briefly, the chrome azurol sulfonate (CAS) medium was prepared by adding the CAS solution to melted King’s B agar medium at a 1:15 ratio, and 10 μl of rice phyllosphere bacterial strains actively grown in TSB at 37°C for 24 h were spot inoculated on the center of the CAS plate. Colonies with a yellow-orange halo after 3 days of incubation (28 ± 2°C) were considered positive for siderophore production (Schwyn and Neilands, 1987). For siderophore quantification, 100 μl of fresh culture was inoculated into 100 ml of iron-free succinic broth medium and incubated in a rotary shaker at 30°C for 24 h (120 rpm). Subsequently, the broth culture was centrifuged at 10,000 rpm for 10 min. The absorbance of the supernatant was measured at 400 nm using a spectrophotometer (LAMBDA 365 UV–Vis spectrophotometer, PerkinElmer, Mumbai, India), and siderophore production was calculated using the molar extinction coefficient (ε = 20,000 M−1 cm−1). The hydroxamate- and catecholate-type siderophores were characterized using Arnow’s and tetrazolium tests, respectively (Arnow, 1937). The quantification of siderophores is represented as μM, and the presence or absence of siderophore type is expressed by the signs “+” and “−,” respectively.
Production of HCN
The production of HCN by the bacterial strains was measured using the alkaline picric acid method described by Wei et al. (1991). For the qualitative measurement of HCN production, a change in the color of the filter paper strips from yellow to light brown, brown, or brick red was recorded as a mild (+), moderate (+ +), or strong (+ + +) reaction, respectively, whereas no color shift was considered as a negative (−) reaction. The color in the paper was eluted using 10 ml of distilled water, and the filter paper in a sterile medium blank was used as a control. HCN quantification was performed by measuring the absorbance of the eluted samples at 625 nm, and the results are presented as optical density (OD) units.
Evaluation of antagonistic activity
The biocontrol activity of all bacterial strains was tested against two fungal pathogens (Pyricularia oryzae and Helminthosporium oryzae) obtained from the Department of Plant Pathology, Tamil Nadu Agricultural University, Coimbatore, India, following the protocol proposed by Ji et al. (2013). In brief, the mycelial disks (5 mm diameter) of each rice pathogenic fungus were placed on the edge of potato dextrose agar media (30 mm), and each bacterium was streak inoculated close to the center of the plates that were incubated for 5–6 days at 25°C. Subsequently, the mycelial growth inhibition percentage of P. oryzae (PO.IP) and H. oryzae (HO.IP) by each bacterial strain was calculated as follows:
Evaluation of osmotic stress effect on bacterial strains
Estimation of bacterial growth and sample preparation
Phyllosphere bacterial strains were inoculated in 100 ml of TSB in side-arm flasks with different concentrations of PEG 6000 (0, 11, 21, and 32.6%) and incubated at 37°C for 24 h. The optical density (OD) of the broth cultures was estimated at 600 nm, and the growth results were expressed as OD units. The cells were then centrifuged at 10, 000 rpm for 5 min and the supernatant was collected to assess IAA and GA production. Cell pellets were used to measure the accumulation of endogenous proline and GB.
Estimation of IAA and GA production
To determine IAA concentration, 2 ml of the supernatant was mixed with 4 ml of Salkowski reagent (2% of 0.5 M FeCl3 in 35% HClO4), and sterile TSB was used as a control (Meudt and Gaines, 1967). Subsequently, the mixtures were incubated in the dark at 25°C for 24 h. The absorbance of IAA was measured at 520 nm using a spectrophotometer. The concentration of IAA in the samples was determined using the IAA standard curve and expressed as μg ml−1.
To determine GA concentrations, 2 ml of the supernatant was mixed with 2 ml of zinc acetate solution and 2 ml of potassium ferrocyanide solution before centrifugation at 8,000 rpm for 10 min (Holbrook et al., 1961). A 5-ml aliquot of supernatant was added to 5 ml of 30% hydrochloric acid and incubated at 27°C for 75 min. The absorbance was measured at 254 nm, and the GA concentration in the samples was determined using a standard GA curve and expressed as μg ml−1.
Estimation of proline and GB production
The endogenous production of proline and GB was measured using the protocol described by Qurashi and Sabri (2013). The harvested cells were boiled and centrifuged to collect the supernatant. For proline estimation, 150 μl of the supernatant was mixed with 100 ml of water and 1 ml of ninhydrin reagent (0.35% ethanol), and 150 μl of sterile water was used as a control. This mixture was heated for 20 min, and then the absorbance was measured at 520 nm. The concentration of proline was calculated using a standard curve prepared with L-proline and expressed as μg ml−1.
To estimate the endogenous accumulation of GB, the extracted supernatant was diluted (1:1) in boiled 2 N H2SO4, and sterile water was used as a control (Qurashi and Sabri, 2013). The mixture (0.50 ml) was cooled for 60 min in ice water, and 200 μl of cold KI-I2 reagent was added with gentle vortexing. The mixture was incubated for 16 h at 4°C and gradually centrifuged for 15 min at 10,000 rpm. After carefully removing the supernatant, the resulting pellet was dissolved in 9 ml of 1,2-dichloroethane, and the absorbance was measured at 365 nm. The concentration of GB was calculated using a standard curve and expressed as μg ml−1.
Statistical analysis
All experiments were conducted in triplicate, and the results are expressed as means with standard deviations. The obtained data were checked for normality and outliers and log-transformed prior to data analysis, if necessary. The following data analysis methods were used for analysis of direct and indirect microbial PGP traits: one-way ANOVA and multivariate ANOVA (MANOVA) with strain type as a single factor. Two-way ANOVA and MANOVA was applied to the set of microbial parameters (growth, production of indole-acetic acid, gibberellic acid, proline, and glycine betaine) with two factors—strain type and stress level. One-way ANOVA test was followed by Tukey’s post-hoc test. The significance level was set to 0.05 for all tests. Multivariate exploratory analyses included principal component analysis (PCA), heatmaps, k-means clustering, and spectral clustering. PCA was performed separately for two datasets (direct and indirect PGP traits and osmotic stress data). Heatmaps, k-means clustering, and spectral clustering were used as additional methods for analysis of osmotic stress dataset. The details of software packages used for the data analysis are provided in Supplementary Table S1 (Paradis et al., 2004; Lê et al., 2008; R Core Team, 2013; Meng et al., 2014; Mariette and Villa-Vialaneix, 2018; Kolde, 2019; John et al., 2020). In the case of PCA, the variables contributing significantly to the principal component (PC) axes were determined using the PCAtest package with random permutation, a bootstrap replication value of 1,000, and an alpha level of 0.05 (Camargo, 2022). The importance of the variable was ranked from each principal component axis using the total loading values.
Data integration
We used horizontal integration of the datasets to compare multiple treatment factors on similar variables and samples. We applied unsupervised machine learning techniques and implemented them using three integration techniques: early integration (PCA), mixed integration (multiple kernel learning, MKL), and intermediate integration (multiple co-inertia analysis, MCIA). Supplementary Table S10 contains information on the dataset combinations used for the integration methods.
In the case of the PCA analysis, the datasets were first combined and transformed, and then the PCA function from the factomineR package was applied to the obtained dataset (Supplementary Table S10). MCIA is a mixed integration technique in which the dataset is dimensionally reduced and transformed using ordination methods, such as PCA, correspondence analysis (COA), or non-symmetric correspondence analysis (NSCA), before being combined for analysis. In this study, the MCIA was performed on the data list (Supplementary Table S10) using the omicade4 package (Supplementary Table S1). Data dimension reduction, transformation, and analysis were performed using a single-function MCIA with singular value decomposition. The output was saved as a MCIA class object, and from that object, the variables from each dataset were visualized with regard to the sample relationship using the plotVar function.
The same data list was used in the R package mixKernel to perform MKL (Supplementary Tables S1, S10), wherein each dataset was first converted into a kernel object using the compute.kernel function with a linear kernel, thereby converting a linearly inseparable space object into a linearly separable one. The fully unsupervised MKL (UKML) method was used to combine all kernel objects, and the resulting kernels were subjected to PCA and an important variable analysis using the kernel.pca and kernel.pca.permute functions, respectively.
The results of different data integration techniques were compared using the congruence among distance matrices (CADM) approach (Campbell et al., 2011) implemented in the CADM.global function using the R package ape: analyses of phylogenetics and evolution (Supplementary Table S1).
Results
Direct and indirect PGP activities of phyllosphere bacterial strains
The analysis of direct PGP activities showed significant differences (one-way MANOVA, p < 0.001) in the production of phytohormones by the studied bacterial strains (Table 1; Supplementary Table S2). Bacillus megaterium PB50 showed the highest production of all four phytohormones, followed by B. endophyticus PB3, which produced the highest levels of IAA, GA, and ABA, and B. altitudinis PB46, which produced high levels of GA and cytokinin. The lowest levels of IAA and GA were produced by B. australimaris PB17 and those of cytokinin and ABA were produced by B. safensis PB23 and S. sciuri PB24, respectively.
Table 1. Mean and standard deviation values of the direct plant growth-promoting traits [indole-acetic acid (IAA), gibberellic acid (GA), cytokinin, and abscisic acid (ABA) production] measured for each rice phyllosphere bacterial strain (number of replicates, n = 3).
Similarly, the studied strains significantly differed in their indirect PGP trait values (one-way MANOVA, p < 0.001, Table 2; Supplementary Table S2). Siderophore production was found to be the highest in B. megaterium PB50, followed by B. pumilus PB18. The characterization of siderophores revealed that only B. megaterium PB50 produced hydroxamate-type siderophores, whereas the remaining strains produced catecholate-type siderophores (Table 2). The analysis of HCN production results showed that B. pumilus PB18 produced the highest amount of HCN, while B. safensis PB23 and S. Sciuri PB24 produced the least amount of HCN. The assessment of the antagonistic activity of phyllosphere bacterial strains against the rice foliar pathogens revealed that only B. endophyticus PB3 and B. australimaris PB17 had an antagonistic activity against P. oryzae, and only B. megaterium PB50 inhibited the growth of H. oryzae (Table 2; Supplementary Figure S1).
Table 2. Mean and standard deviation values of the indirect plant growth-promoting traits, such as siderophore (Sid.) production, qualitative (Qual.), and quantitative (Quant.) assessment of hydrogen cyanide (HCN) production, and growth inhibition of Pyricularia oryzae and Helminthosporium oryzae measured for each rice phyllosphere bacterial strain (number of replicates, n = 3).
Bacterial growth and production of phytohormones and osmolytes under different osmotic stress levels
Analysis of bacterial growth under different osmotic stress levels showed that under non-stressed conditions, B. pumilus PB18, B. safensis PB23, S. sciuri PB24, and B. megaterium PB50 reached the highest OD, with no significant difference between them. No significant difference was observed among B. australimaris PB17, B. altitudinis PB37, and B. altitudinis PB46, which had the lowest OD values (Table 3; Supplementary Table S6). The maximum growth values were achieved by B. altitudinis PB46 at 11 and 21% PEG 6000 concentrations and by B. megaterium PB50 at 32% PEG 6000 (Figure 2A). The lowest growth was observed in B. altitudinis PB37 at 11 and 21% PEG 6000 concentrations and in B. australimaris PB17 and B. altitudinis PB37 at 32.6% PEG 6000, with no significant difference between them.
Table 3. Mean and standard deviation values of growth of each rice phyllosphere bacterial strain (number of replicates, n = 3) measured under three different osmotic stress conditions and a non-stress condition.
Figure 2. Results of two-way ANOVA showing the mean differences of dependent variables according to two independent variables (strain and stress) and their interaction. Shown are mean and standard error values for two grouping variables. Plots (A) growth; (B) indole-acetic acid (IAA); (C) gibberellic acid (GA); (D) proline; and (E) glycine betaine. The strain codes refer to the following strains: Bacillus endophyticus PB3, B. australimaris PB17, B. pumilus PB18, B. safensis PB23, S. sciuri PB24, B. altitudinis PB37, B. altitudinis PB46, and B. megaterium PB50. AIC, akaike information criterion; BIC, Bayesian information criterion; and η2, effect size. Individual p values are given in Supplementary Table S5.
As the PEG 6000 concentration increased, IAA and GA production decreased in all strains (Table 4; Figures 2B,C). The strain B. megaterium PB50 produced the highest IAA amount under almost all stress conditions, except for PEG 6000 at the 21% level, when this value was the highest for B. altitudinis PB46. The strain B. megaterium PB50 also exceeded the other studied strains in the production of GA at all stress levels, followed by B. endophyticus PB3 at 11% PEG 6000 and B. altitudinis PB46 at 21 and 32.6% PEG 6000.
Table 4. Mean and standard deviation values of indole-acetic acid (IAA) and gibberellic acid (GA) production (μg ml−1) by each rice phyllosphere bacterial strain (number of replicates, n = 3) measured under three different osmotic stress conditions and a non-stress condition.
All studied strains showed a gradual increase in proline and GB production from non-stress to stress conditions at 21% PEG 6000, but at 32.6% PEG 6000, the production dropped below the non-stress level for both osmolytes (Table 5; Figures 2D,E). Nevertheless, substantial differences were observed for the different strains in osmolyte production at different stress levels. Bacillus pumilus PB18 accumulated the highest proline content under non-stress conditions, whereas B. megaterium PB50 accumulated the lowest amount under these conditions. Furthermore, under 11, 21, and 32.6% PEG 6000 concentrations, the bacterial strains B. pumilus PB18 and B. safensis accumulated the maximum amount of proline, with no significant difference between them. The strains B. australimaris PB17 and B. altitudinis PB37 accumulated the lowest amount of proline at all three stress levels, with no significant differences between them. Different strains showed different GB production patterns under variable stress conditions. Under non-stress conditions, the bacterial strain B. australimaris PB17 produced the highest GB, whereas B. altitudinis PB46 and B. megaterium PB50 produced the highest GB at 11 and 21% PEG 6000 levels. Bacillus megaterium PB50 produced the maximum GB at 32.6% PEG 6000. Under non-stress conditions as well as 11, 21, and 32.6% PEG 6000 levels, B. altitudinis PB37 had the lowest GB.
Table 5. Mean and standard deviation values of proline and glycine betaine (GB) accumulation (μg ml−1) by each rice phyllosphere bacterial strain (number of replicates, n = 3) measured under three different osmotic stress conditions and a non-stress condition.
Two-way MANOVA revealed the significance of the interaction effect (strain–osmotic stress interaction) and the main effects (strains and osmotic stress level) in the examined parameters (Supplementary Table S3). The results of one-way MANOVA revealed that the impact of bacterial strains on microbial parameters was statistically significant at each stress level (Supplementary Table S4), except for bacterial growth, which was not significant in the non-stress conditions (Supplementary Tables S6–S9). The two-way ANOVA results showed significant (p < 0.05) differences in all traits tested among bacterial strains and osmotic stress levels, as well as their interaction effects (Supplementary Table S5).
The response pattern of the studied strains based on biochemical properties and growth under various osmotic stress levels was revealed by the PCA plot (Figure 3). The PCA first (PC1) axis captured 68.4% of the overall data variation, and all variables significantly contributed to the PCA first axis (Supplementary Table S11). All strains had a generally similar response pattern that consisted of a gradual change along the PCA second (PC2) axis in the case of 11 and 21% stress levels of PEG 6000, followed by a shift in the same direction along the PC1 axis when strains were exposed to 32.6% PEG 6000 (Figure 3C). The gradual change along the PC2 axis was related mainly to the increase in GB and proline production with the increase in PEG 6000 level to 21%, while the application of 32.6% PEG 6000 was characterized by a reduction in strain growth values (Figure 3D).
Figure 3. Results of principal component analysis (PCA) based on the plant growth-promoting (PGP) trait data of rice phyllosphere bacteria (number of replicates, n = 3). (A) Score plot and (B) loading plot of variables according to first two principal component axes. Bacterial strains are indicated by 95% confidence ellipses. The plots correspond to 79.2% of the total data variance and variance proportions are shown along each principal component axis. (C) Score plot and (D) loading plot of variables along first two principal component axes based on microbial dataset [indoleacetic acid (IAA), gibberellic acid (GA), proline, glycine betaine, and growth] of rice phyllosphere bacterial strains measured at different PEG 6000 concentrations (number of replicates, n = 3). The plots correspond to 89.9% of the total data variance, and variance proportions are shown along each principal component axis. Variables with asterisk in plot B and D are significant along the first principal component axis. Abbreviations used in plot (B) are, abscisic acid (ABA), hydrogen cyanide (HCN), Helminthosporium oryzae inhibition percentage (HO_IP), and Pyricularia oryzae inhibition percentage (PO_IP). The codes of the strains in plots (A,C) refer to the following strains: Bacillus endophyticus PB3, B. australimaris PB17, B. pumilus PB18, B. safensis PB23, Staphylococcus sciuri PB24, B. altitudinis PB37, B. altitudinis PB46, and B. megaterium PB50.
The heatmap based on PGP traits showed that the strains could be divided into two main groups based on their response to osmotic stress levels. At the 32% PEG 6000 level, the strains formed one group, and the strains at other stress levels formed another cluster (Supplementary Figure S4). The results of k-means clustering indicated four optimal clusters, of which two (stress levels 0 and 32.6%) were partially homogenous (Supplementary Figure S5 A). Moreover, these results indicated that B. altitudinis PB46 and B. megaterium PB50 were the most deviant in their responses to different stress levels. Spectral clustering identified two strain groups with similar response patterns to increasing PEG 6000 stress levels. The first group consisted of four strains (B. endophyticus PB3, B. pumilus PB18, B. altitudinis PB46, and B. megaterium PB50), and the second group consisted of three strains (B. australimaris PB17, B. safensis PB23, and S. sciuri PB24; Supplementary Figure S5 B). Bacillus altitudinis PB37 was not included in either group. The main difference between the response dynamics of the two groups was related to a more profound stress response in the first group than in the second group and the strain B. altitudinis PB37. This higher stress response was more reflected at the stress levels of 11 and 21% PEG 6000 when the relative stress response magnitude was 1.5–2-fold higher in the first group than in the second group.
Integration of different datasets
Early integration approach
A PCA was used to explore the variation among strains based on a joint dataset of direct and indirect PGP variables (Figure 3). The PC1 axis explained 54.6% of the data variance, and the statistically significant variables in PCA were siderophore, IAA, GA, GB, ABA production, and H. oryzae inhibition percentage (Supplementary Table S11). The PC1 axis separated B. megaterium PB50, B. altitudinis PB46, and B. endophyticus PB3 from the other strains (Figures 3A,B). The PC2 axis emphasized the variation among strains in P. oryzae inhibition percentage and HCN production ability, indicating that strain B. pumilus PB18 exhibited the highest P. oryzae inhibition percentage and HCN production potential among the studied strains. A heatmap based on direct and indirect PGP traits was used to assess similarities among the studied bacterial strains (Supplementary Figure S2). Based on the intensity of the PGP response, the strains were separated into one large group of seven strains with two subclusters, whereas the B. megaterium PB50 strain formed a separate cluster. The large cluster comprised two sub-clusters, with B. endophyticus PB3, B. pumilus PB18, and B. altitudinis PB46 grouped together as moderate-performance strains, and B. australimaris PB17, B. safensis PB23, S. sciuri PB24, and B. altitudinis PB37 grouped separately because of their poor performance in PGP activities. Furthermore, k-means clustering led to the identification of six optimal clusters in this dataset, with B. australimaris PB17 and B. altitudinis PB37 and B. safensis PB23 and S. sciuri PB24 clustered together, and both clusters were close to each other, as shown in the PCA ordination plot (Supplementary Figure S3).
In addition to PCA on the joint PGP trait dataset (Figures 3A,B), PCA was performed by combining all osmotic stress datasets (datasets 3–6; Supplementary Table S10) and all six datasets (datasets 1–6; Supplementary Table S10). The results for PCA on all osmotic stress datasets are provided in Supplementary Figure S6, and these data were used as input later in the congruence among the distance matrices approach. The PCA results for all six datasets are shown in Figure 4. PC1 was significant (p < 0.05), accounting for 56% of the total variance, whereas PC2 accounted for only 12.5% of the total variance. Most variables had a significant impact on PC1 (Supplementary Table S11). The maximum variance was attributed by the variables measured at a stress level of 32.6% PEG 6000 32.6%.
Figure 4. Results of principal component analysis (PCA) based on the integration of plant growth-promoting traits data and osmotic stress response parameters datasets. (A) Score plot and (B) loading plot of variables according to first two principal components. The plots correspond to 68.5% of the total data variance, and variance proportions are shown along each principal component axis. Variables with asterisk in the plot (B) are significant along the first principal component axis. Abbreviations used in plot (B) are, indoleacetic acid (IAA), gibberellic acid (GA), glycine betaine (GB), abscisic acid (ABA), hydrogen cyanide (HCN), Helminthosporium oryzae inhibition percentage (HO.IP), and Pyricularia oryzae inhibition percentage (PO.IP). The codes of the strains in plot (A) refer to the following strains: Bacillus endophyticus PB3, B. australimaris PB17, B. pumilus PB18, B. safensis PB23, Staphylococcus sciuri PB24, B. altitudinis PB37, B. altitudinis PB46, and B. megaterium PB50.
Multiple co-inertia analysis
Multiple co-inertia analysis was applied to jointly analyze the six datasets (Supplementary Table S10). The graphical outputs of this analysis are shown in Figure 5; Supplementary Figure S7. The MCIA first axis captured the highest variance (78.3%) in the datasets and separated three strains, B. endophyticus PB3, B. altitudinis PB46, and B. megaterium PB50, from the remaining strains (Figure 5A). The MCIA second axis, explaining 16.0% of the data variation, emphasized the distinction between the strains B. pumilus PB18 and B. altitudinis PB37 and the other studied strains. The pseudo-eigenvalue space of six datasets (Figure 5D) indicates that three datasets (direct PGP traits as well as stress at PEG 6000 levels of 0 and 11%) contributed the most to the MCIA first axis, while the contribution of indirect PGP parameters was small. The datasets of the stress at PEG 6000 levels of 21 and 32.6% contributed to the MCIA second axis. Correlations between datasets were the highest at the stress levels of 11, 21, and 32.6% PEG 6000 (RV = 0.71–0.76) and lowest in the case of the datasets of indirect PGP traits and stress levels of 0, 11, and 21% of PEG 6000 (RV = 0.35–0.51). Projections of all variables onto the first two MCIA axes space indicated that the strains B. altitudinis PB46 and B. megaterium PB50, and to a lesser extent, B. endophyticus PB3, were associated with higher IAA and GA values (Supplementary Figure S7). The concentration of GB was an important variable for the separation of strains under non-stress conditions, and it coincided with higher siderophore and cytokinin production. The dataset correlation plot based on the RV coefficient value showed that the dataset of intermediate stress level (21% PEG 6000) was highly positively correlated with the datasets of 11 and 32.6% PEG 6000 datasets (Supplementary Figure S8 A). The positive correlation of the stress and non-stress datasets of 11% PEG 6000 with the datasets of PGP traits was higher than that of the other two stress levels, indicating that the stress level influenced the function of all studied strains. Indirect PGP traits did not positively correlate with other dataset types.
Figure 5. Multiple co-inertia analysis (MCIA) results based on six datasets [direct plant growth-promoting (PGP) traits, indirect PGP traits, microbial parameters under osmotic stress at PEG concentrations 0, 11, 21, and 32.6%]. (A) A plot of the first two components in the sample space. Each sample is represented by a shape where lines connect the six datasets for each sample to a center point (MCIA global score). (B) Variable space for each dataset. (C) A scree plot of absolute eigenvalues (bars) and the proportions of variance for the eigenvectors (line). (D) A plot of data weighting space that shows the pseudo-eigenvalues space of all datasets indicating the variance of an eigenvalue contributed by each dataset. The codes of the strains in plot (A) refer to the following strains: Bacillus endophyticus PB3, B. australimaris PB17, B. pumilus PB18, B. safensis PB23, Staphylococcus sciuri PB24, B. altitudinis PB37, B. altitudinis PB46, and B. megaterium PB50.
Multiple kernel learning
The MKL technique was applied to the same six datasets (Supplementary Table S10), and the combined kernel principal component analysis (KPCA) was used in further exploratory analysis. The results of these analyses revealed that the majority of the overall variance in the data was captured by the first axis of KPCA, which clearly separated the two strains, B. altitudinis PB46 and B. megaterium PB50, from the rest of the strains (Figure 6A). To identify the influence of the variables on KPCA, an important variable plot was computed (Figure 6B), which showed that GA and HCN production were the most important direct and indirect PGP traits, respectively. The production of IAA and GA was important in the non-stress conditions, whereas GA was also relevant at the 11% PEG 6000 stress level. Proline production was found to be an important variable in both 21 and 32.6% PEG 6000 stress levels. The kernel correlation plot showed that the dataset correlation followed a pattern similar to that observed with the MCIA analysis, indicating that the dataset correlation was heavily influenced by stress levels (Supplementary Figure S8 A).
Figure 6. Results of multiple kernel learning analysis (MKL). (A) A plot of kernel principal component analysis (KPCA) based on six datasets (direct plant growth-promoting (PGP) traits, indirect PGP traits, microbial parameters under osmotic stress at polyethylene glycol (PEG) concentrations 0, 11, 21, and 32.6%). (B) A plot for important variables in each dataset assessed using the Crone-Crosby distance. The codes of the strains in plot (A) refer to the following strains: Bacillus endophyticus PB3, B. australimaris PB17, B. pumilus PB18, B. safensis PB23, Staphylococcus sciuri PB24, B. altitudinis PB37, B. altitudinis PB46, and B. megaterium PB50. Abbreviations used in plot (B) are indoleacetic acid (IAA), gibberellic acid (GA), abscisic acid (ABA), hydrogen cyanide (HCN), Helminthosporium oryzae inhibition percentage (HO_IP), and Pyricularia oryzae inhibition percentage (PO_IP).
Comparison of data analysis results
The CADM test indicated that some of the obtained distance matrices were similar (CADM global test, p < 0.001). The most congruent were strain groupings according to the PCA based on all datasets and the PCA based on osmotic stress data (Mantel r = 0.54, p < 0.001). In addition, the MCIA and KPCA results correlated (Mantel r = 0.46, p < 0.01). The similarities among the studied data integration methods were visualized using a dendrogram based on the Mantel test correlation values (Supplementary Figure S9). The posteriori test indicated that the strain distance matrices of the KPCA and PCA based on osmotic stress data were incongruent with the rest of the distance matrices.
Discussion
Estimating strains difference using univariate and multivariate analysis
Among the eight strains included in this study, seven were spore-forming gram-positive Bacillus strains that were generally resistant to the effects of dryness, heat, UV radiation, and various other environmental stressors (Nicholson et al., 2000; Tortora et al., 2019). However, the measured biochemical and microbiological indicators revealed significant differences in osmotic stress tolerance and PGP properties among the strains.
Based on the univariate analysis results, B. endophyticus PB3, B. altitudinis PB46, and B. megaterium PB50 stood out with their phytohormone production ability, with B. megaterium PB50 producing the highest amounts of all four phytohormones. The production of these phytohormones by B. megaterium species has been reported previously (Karadeniz et al., 2006), and this ability has been associated with the stimulation of growth and stress alleviation in plants (Shahzad et al., 2017; Sun et al., 2017; Kang et al., 2019; Zerrouk et al., 2020) by a direct effect on phytohormone levels (Vejan et al., 2016).
The production of HCN and siderophores in bacterial cells serves as a defense mechanism against other microbes. While known to be good siderophore producers that inhibit the growth of several plant pathogens, Bacillus species have been shown to possess moderate HCN production ability (Muthukumar et al., 2022). In this study, all Bacillus strains could produce siderophores; however, B. megaterium PB50 was the most effective siderophore producer among the studied strains and was the only strain that produced hydroxamate-type siderophores (Ferreira et al., 2019). Bacillus megaterium is a dominant rice phyllosphere bacterial species that has shown antagonistic activity against several predominant rice fungal pathogens in vitro (Islam and Nandi, 1985; Gowdu and Balasubramanian, 1988). Moreover, B. megaterium PB50 was among the three Bacillus strains that showed antagonistic activity against rice fungal pathogens in this study. However, another strain, B. pumilus PB18, also exhibited antagonistic activity against fungal pathogens and had the highest siderophore production activity among the strains examined in this study.
In all strains, a decrease in IAA and GA production was observed under more pronounced osmotic stress conditions; however, the production was again the highest in B. megaterium PB50 at different stress levels. Similarly, a higher production of IAA by B. megaterium at the 15% PEG level was detected by Armada et al. (2014). In addition, decreased IAA, GA, and cytokinin production under stress conditions (20% PEG 6000) compared to non-stress conditions was observed in Azospirillum brasilense and B. subtilis (Ilyas et al., 2020). In addition, to B. megaterium PB50, B. altitudinis PB46 stood out from the other strains in responding to stress levels. Similar to earlier reports for different Bacillus species (Paul et al., 2015; Lee et al., 2018), the growth of the two aforementioned strains was faster at 11% PEG 6000 than under non-stress conditions.
Bacterial growth and proline and GB accumulation increased with increasing osmotic stress levels of up to 21% PEG 6000 in all strains. Phytohormone production decreased with increasing proline and GB concentrations. In contrast to B. megaterium PB50, which showed the lowest proline accumulation and highest phytohormone production under stress, B. pumilus PB18 showed the highest osmolyte accumulation and lowest phytohormone production under stress. Although all studied strains were gram-positive bacteria that could synthesize endogenous proline under osmotic stress conditions by boosting the synthesis process and then degrading proline for another metabolism (Tempest et al., 1970; Deutch, 2019), and the GB is transported from an external source or converted from imported choline during stress (Onyango and Alreshidi, 2018), the results of this study suggest that osmolyte accumulation is balanced. Each bacterium uses a specific osmolyte synthesis mechanism to overcome abiotic stress (Bremer and Krämer, 2019). It has been shown earlier that under non-stress conditions, bacteria favor phytohormone synthesis rather than osmolyte accumulation, whereas the opposite occurs under stress conditions. Such a shift in metabolic activity is common in bacterial cells, allowing them to maintain homeostasis when there is an increase in osmotic pressure in the extracellular medium (Varela et al., 2004; Lahtvee et al., 2014; Cesar et al., 2020).
Integration of bacterial strain datasets
Complementary information from several types of datasets produced for microbial strains can be exploited to obtain better insights into microbial strain grouping and to elucidate factors behind strain clustering using machine-learning-based data integration methods. We applied three data-integration approaches to obtain microbial datasets: early, intermediate, and mixed integration.
In the early integration approach, several microbial datasets were combined into a single table and processed using PCA. This approach highlighted two strains, B. altitudinis PB46 and B. megaterium PB50, which were the most distinct from the other strains. At the same time, the PCA of the concatenated data did not clearly indicate the variables important for the separation of microbial strains in the PCA plot. Although the early integration approach is appealing owing to its simplicity and easy implementation, the complexity, data imbalance, and possible noise in the underlying matrix may complicate learning. As a linear dimension reduction method, PCA is the most commonly used approach for concatenated data, whereas nonlinear methods (t-distributed stochastic neighbor embedding, t-SNA; uniform manifold approximation, and UMAP) may provide better performance depending on the dataset properties (Xiang et al., 2021).
For mixed data integration, we utilized the MKL technique, in which a linear kernel was first computed for each microbial dataset and then the obtained kernels were combined to produce a global similarity matrix that describes microbial strains across all included datasets (Mariette and Villa-Vialaneix, 2018). The resulting similarity matrix was used as the input for the PCA. The results of this method also emphasized the separation of the two strains, B. altitudinis PB46 and B. megaterium PB50, from the other studied strains. In addition, the analysis outcome provided estimates of the variable importance. Direct PGP traits, such as GA and proline production at stress levels of 21 and 32% of PEG 6000, were highlighted by this analysis. The MKL technique has the advantage of preserving the original properties of the data and combining different data types by applying appropriate transformations (Zampieri et al., 2019). However, the MKL analysis results may be challenging to interpret because the interactions and correlations among different datasets are not always well determined.
The third method, MCIA, is an intermediate strategy of data integration, where multiple microbial datasets are jointly integrated without prior transformation. Similar to MKL, the MCIA reduces the dimensionality and complexity of datasets. The MCIA results clearly showed the distinction of strain B. megaterium PB50 from the other strains. The direct PGP trait data set (particularly GA and IAA production) and stress level 11% (GA production) were the most important indicators of the variation among the microbial strains. MCIA is considered one of the best-performing algorithms for the benchmarking of joint multi-omics dimensionality reduction approaches in the case of cancer datasets and provides an effective elucidation of relationships among the studied datasets (Cantini et al., 2021). Moreover, it can be considered a variation of the canonical correspondence analysis (CCA) method (Vahabi and Michailidis, 2022). The applicability of other CCA extensions could also be tested for simultaneous feature selection and classification in microbial datasets in the future.
The three applied data integration methods did not provide a similar grouping of the studied strains, except for the ordination of microbial strains based on the PCA of concatenated data. However, all three data integration methods indicated that the strains B. altitudinis PB46 and B. megaterium PB50 had high similarity in their biochemical properties and osmotic stress response. However, when applied together with B. endophyticus PB3 in a rice growth experiment under drought conditions, these two strains had variable effects on plant growth, biochemical properties, and gene expression (Devarajan et al., 2021). Strain B. megaterium PB50, and to a lesser extent B. altitudinis PB46, induced elevated drought tolerance in rice plants, while B. endophyticus PB3 had no effect. Simultaneously, the MCIA method placed the strain B. endophyticus PB3 close to B. altitudinis PB46 and B. megaterium PB50. Both MCIA and MKL indicate that GA production is one of the main features of microbial strains. In addition, MCIA emphasized IAA and MKL proline production at higher osmotic stress values. Several studies have reported the importance of microbial IAA, GA, and proline production in mitigating drought stress in crops (Ashry et al., 2022; Fadiji et al., 2022; Uzma et al., 2022).
One option for improving strain selection and feature identification in the future is to apply a combination of unsupervised and supervised data integration methods (Singh et al., 2019). In such cases, the outcome of plant inoculation experiments could be included in the process of data integration. It could also be that the microbial parameters included in the current data analysis did not completely cover the microbial traits required for successful plant application. In addition to biochemical properties, different omics (genome, transcriptome, proteome, and metabolome) datasets could be produced for microbial strains. The analysis of such datasets could be challenging, although there are several data integration methods specifically designed for the analysis of different omics layers.
Conclusion
Overall, our findings suggest that when selecting bacterial strains to improve crop resilience under field drought conditions, various aspects of the PGP activity and osmotic stress tolerance of microbial strains should be considered simultaneously. Data integration methods could complement the single-table data analysis approach and may provide better insight into the microbial strain selection process. In addition, data integration allows for the exploration of complex microbial strain datasets within a single analytical framework. Currently, there are no general rules for selecting the most efficient data integration method for biological datasets. Thus, additional benchmarking of different joint data analysis methods with larger microbial strain datasets is advisable for PGP microbial strains. Another aspect of the data integration of PGP microbial strains, which needs further exploration, is related to the joint analysis of microbial biochemical and omics datasets. The integration of microbial biochemical and omics datasets may provide better insights for producing mixtures of PGP microbes that perform better than a single strain.
In this study, we used only joint data analysis to combine diverse datasets related to microbial strain properties. In the future, supervised data integration methods could be applied to combine greenhouse experiments and field trial data with PGP strain characterization data to improve the strain selection process.
Data availability statement
The raw data supporting the conclusions of this article will be made available by the authors, without undue reservation.
Author contributions
AD and SG designed the study. AD performed the experiments, assisted by SG and GM and wrote the first draft of the manuscript. AD and JT performed data analyses. MT, JT, SG, and GM read the first version of the manuscript. All authors contributed to the article and approved the submitted version.
Funding
This research was funded by the Department of Biotechnology, Ministry of Science and Technology, grant number BT/IN/Indo-US/Foldscope/39/2015; Department of Science and Technology, Science and Engineering Research Board, grant number EMR/2016/008061; and Estonian Research Council (grant numbers PRG548 and PRG916).
Acknowledgments
The authors thank the Department of Biotechnology, Foldscope Project (E28ACO), India; Department of Science and Technology, Science and Engineering Research Board (DST-SERB project E28ADI), New Delhi, India; Tamil Nadu Agricultural University, Coimbatore, India; ERDF Research Fund; and Estonian Research Council for providing financial support for this study.
Conflict of interest
The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.
Publisher’s note
All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article, or claim that may be made by its manufacturer, is not guaranteed or endorsed by the publisher.
Supplementary material
The Supplementary material for this article can be found online at: https://www.frontiersin.org/articles/10.3389/fmicb.2022.1058772/full#supplementary-material
Footnotes
References
Armada, E., Roldán, A., and Azcon, R. (2014). Differential activity of autochthonous bacteria in controlling drought stress in native Lavandula and salvia plants species under drought conditions in natural arid soil. Microb. Ecol. 67, 410–420. doi: 10.1007/s00248-013-0326-9
Arnow, L. E. (1937). Colorimetric determination of the components of 3, 4-dihydroxyphenylalanine-tyrosine mixtures. J. Biol. Chem. 118, 531–537. doi: 10.1016/S0021-9258(18)74509-2
Arun, K. D., Sabarinathan, K. G., Gomathy, M., Kannan, R., and Balachandar, D. (2020). Mitigation of drought stress in rice crop with plant growth-promoting abiotic stress-tolerant rice phyllosphere bacteria. J. Basic Microbiol. 60, 768–786. doi: 10.1002/jobm.202000011
Ashry, N. M., Alaidaroos, B. A., Mohamed, S. A., Badr, O. A., El-Saadony, M. T., and Esmael, A. (2022). Utilization of drought-tolerant bacterial strains isolated from harsh soils as a plant growth-promoting rhizobacteria (PGPR). Saudi J. Biol. Sci. 29, 1760–1769. doi: 10.1016/j.sjbs.2021.10.054
Ayuso-Calles, M., Flores-Félix, J. D., and Rivas, R. (2021). Overview of the role of rhizobacteria in plant salt stress tolerance. Agronomy 11:9. doi: 10.3390/agronomy11091759
Baliyan, N., Dhiman, S., Dheeman, S., Kumar, S., Arora, N. K., and Maheshwari, D. K. (2022). Optimization of gibberellic acid production in endophytic Bacillus cereus using response surface methodology and its use as plant growth regulator in chickpea. J. Plant Growth Regul. 41, 3019–3029. doi: 10.1007/s00344-021-10492-2
Belimov, A. A., Dodd, I. C., Safronova, V. I., Dumova, V. A., Shaposhnikov, A. I., Ladatko, A. G., et al. (2014). Abscisic acid metabolizing rhizobacteria decrease ABA concentrations in planta and alter plant growth. Plant Physiol. Biochem. 74, 84–91. doi: 10.1016/j.plaphy.2013.10.032
Bremer, E., and Krämer, R. (2019). Responses of microorganisms to osmotic stress. Annu. Rev. Microbiol. 73, 313–334. doi: 10.1146/annurev-micro-020518-115504
Cai, Z., Poulos, R. C., Liu, J., and Zhong, Q. (2022). Machine learning for multi-omics data integration in cancer. iScience 25:103798. doi: 10.1016/j.isci.2022.103798
Camargo, A. (2022). PCAtest: testing the statistical significance of principal component analysis in R. PeerJ. 10:e12967. doi: 10.7717/peerj.12967
Campbell, V., Legendre, P., and Lapointe, F. J. (2011). The performance of the congruence among distance matrices (CADM) test in phylogenetic analysis. BMC Evol. Biol. 11:64. doi: 10.1186/1471-2148-11-64
Cantini, L., Zakeri, P., Hernandez, C., Naldi, A., Thieffry, D., Remy, E., et al. (2021). Benchmarking joint multi-omics dimensionality reduction approaches for the study of cancer. Nat. Commun. 12:124. doi: 10.1038/s41467-020-20430-7
Cesar, S., Anjur-Dietrich, M., Yu, B., Li, E., Rojas, E., Neff, N., et al. (2020). Bacterial evolution in high-osmolarity environments. MBio 11, e01191–e01120. doi: 10.1128/mBio.01191-20
Chauhan, P. K., Upadhyay, S. K., Tripathi, M., Singh, R., Krishna, D., Singh, S. K., et al. (2022). Understanding the salinity stress on plant and developing sustainable management strategies mediated salt-tolerant plant growth-promoting rhizobacteria and CRISPR/Cas9. Biotechnol. Genet. Eng. Rev. 17, 1–37. doi: 10.1080/02648725.2022.2131958
da Costa, P. B. D., Granada, C. E., Ambrosini, A., Moreira, F., de Souza, R., dos Passos, J. F. M., et al. (2014). A model to explain plant growth promotion traits: a multivariate analysis of 2,211 bacterial isolates. PLoS One 9:e116020. doi: 10.1371/journal.pone.0116020
Deutch, C. E. (2019). L-proline catabolism by the high G+ C gram-positive bacterium Paenarthrobacter aurescens strain TC1. Antonie Van Leeuwenhoek 112, 237–251. doi: 10.1007/s10482-018-1148-z
Deva, C. R., Urban, M. O., Challinor, A. J., Falloon, P., and Svitákova, L. (2020). Enhanced leaf cooling is a pathway to heat tolerance in common bean. Front. Plant Sci. 11:19. doi: 10.3389/fpls.2020.00019
Devarajan, A. K., Muthukrishanan, G., Truu, J., Truu, M., Ostonen, I., Kizhaeral, S. S., et al. (2021). The foliar application of Rice phyllosphere bacteria induces drought-stress tolerance in Oryza sativa (L.). Plants 10:387. doi: 10.3390/plants10020387
Ding, L. J., Cui, H. L., Nie, S. A., Long, X. E., Duan, G. L., and Zhu, Y. G. (2019). Microbiomes inhabiting rice roots and rhizosphere. FEMS Microbiol. Ecol. 95:5. doi: 10.1093/femsec/fiz040
Egamberdieva, D., Wirth, S. J., Alqarawi, A. A., Abd Allah, E. F., and Hashem, A. (2017). Phytohormones and beneficial microbes: essential components for plants to balance stress and fitness. Front. Microbiol. 8:2104. doi: 10.3389/fmicb.2017.02104
Fadiji, A. E., Santoyo, G., Yadav, A. N., and Babalola, O. O. (2022). Efforts towards overcoming drought stress in crops: revisiting the mechanisms employed by plant growth-promoting bacteria. Front. Microbiol. 13:962427. doi: 10.3389/fmicb.2022.962427
FAO (2022). Crops and climate change impact briefs – climate-smart agriculture for more sustainable, resilient, and equitable food systems. Rome.
Ferreira, C. M. H., Vilas-Boas, Â., Sousa, C. A., Soares, H. M. V. M., and Soares, E. V. (2019). Comparison of five bacterial strains producing siderophores with ability to chelate iron under alkaline conditions. AMB Express 9:78. doi: 10.1186/s13568-019-0796-3
Fitzsimmons, L. F., Hampel, K. J., and Wargo, M. J. (2012). Cellular choline and glycine betaine pools impact osmoprotection and phospholipase C production in Pseudomonas aeruginosa. J. Bacteriol. 194, 4718–4726. doi: 10.1128/JB.00596-12
Gowdu, B. J., and Balasubramanian, R. (1988). Role of phylloplane micro-organisms in the biological control of foliar plant diseases. J. Plant Dis. Prot. 95, 310–331.
Hidangmayum, A., Dwivedi, P., Kumar, P., and Upadhyay, S. K. (2022). Seed priming and foliar application of chitosan ameliorate drought stress responses in Mungbean genotypes through modulation of Morpho-physiological attributes and increased Antioxidative defense mechanism. J. Plant Growth Regul. 1-18. doi: 10.1007/s00344-022-10792-1
Holbrook, A. A., Edge, W. J. W., and Bailey, F. (1961). Spectrophotometric method for determination of gibberellic acid. Adv. Chemother. 28, 159–167. doi: 10.1021/ba-1961-0028.ch018
Ilyas, N., Mumtaz, K., Akhtar, N., Yasmin, H., Sayyed, R. Z., Khan, W., et al. (2020). Exopolysaccharides producing bacteria for the amelioration of drought stress in wheat. Sustain. For. 12:8876. doi: 10.3390/su12218876
Islam, K. Z., and Nandi, B. (1985). Control of brown spot of rice by bacillus megaterium. J. Plant Dis. Prot. 92, 241–246.
Ji, S. H., Paul, N. C., Deng, J. X., Kim, Y. S., Yun, B. S., and Yu, S. H. (2013). Biocontrol activity of bacillus amyloliquefaciens CNU114001 against fungal plant diseases. Mycobiology 41, 234–242. doi: 10.5941/MYCO.2013.41.4.234
John, C. R., Watson, D., Barnes, M. R., Pitzalis, C., and Lewis, M. J. (2020). Spectrum: fast density-aware spectral clustering for single and multi-omic data. Bioinformatics 36, 1159–1166. doi: 10.1093/bioinformatics/btz704
Kang, S. M., Khan, A. L., Waqas, M., Asaf, S., Lee, K. E., Park, Y. G., et al. (2019). Integrated phytohormone production by the plant growth-promoting rhizobacterium bacillus tequilensis SSB07 induced thermotolerance in soybean. J. Plant Interact. 14, 416–423. doi: 10.1080/17429145.2019.1640294
Karadeniz, A., Topcuoğlu, Ş. F., and İnan, S. (2006). Auxin, gibberellin, cytokinin and abscisic acid production in some bacteria. World J. Microbiol. Biotechnol. 22, 1061–1064. doi: 10.1007/s11274-005-4561-1
Kaul, S., Sharma, T., and Dhar, K. M. (2016). ‘Omics’ tools for better understanding the plant–endophyte interactions. Front. Plant Sci. 7:955. doi: 10.3389/fpls.2016.00955
Kim, H., and Lee, Y. H. (2020). The rice microbiome: a model platform for crop holobiome. Phytobiomes J. 4, 5–18. doi: 10.1094/PBIOMES-07-19-0035-RVW
Kolde, R. (2019). pheatmap: Pretty Heatmaps. R package version 1. 0. 12. CRAN. Available at: https://cran.r-project.org/web/packages/pheatmap/index.html
Lahtvee, P. J., Seiman, A., Arike, L., Adamberg, K., and Vilu, R. (2014). Protein turnover forms one of the highest maintenance costs in Lactococcus lactis. Microbiology 160, 1501–1512. doi: 10.1099/mic.0.078089-0
Lê, S., Josse, J., and Husson, F. (2008). FactoMineR: an R package for multivariate analysis. J. Stat. Softw. 25, 1–18. doi: 10.18637/jss.v025.i01
Lee, Y. H., Jang, S. J., Han, J. H., Bae, J. S., Shin, H., Park, H. J., et al. (2018). Enhanced tolerance of Chinese cabbage seedlings mediated by bacillus aryabhattai H26-2 and B. siamensis H30-3 against high temperature stress and fungal infections. Plant Pathol. J. 34, 555–566. doi: 10.5423/PPJ.OA.07.2018.0130
Li, Y., Wu, F. X., and Ngom, A. (2018). A review on machine learning principles for multi-view biological data integration. Brief. Bioinform. 19, bbw113–bbw340. doi: 10.1093/BIB/BBW113
Liang, X., Zhang, L., Natarajan, S. K., and Becker, D. F. (2013). Proline mechanisms of stress survival. Antioxid. Redox Signal. 19, 998–1011. doi: 10.1089/ars.2012.5074
Mahmud, A. A., Upadhyay, S. K., Srivastava, A. K., and Bhojiya, A. A. (2021). Biofertilizers: a nexus between soil fertility and crop productivity under abiotic stress. CRSUST 3:100063. doi: 10.1016/j.crsust.2021.100063
Mariette, J., and Villa-Vialaneix, N. (2018). Unsupervised multiple kernel learning for heterogeneous data integration. Bioinformatics 34, 1009–1015. doi: 10.1093/bioinformatics/btx682
Mekureyaw, M. F., Pandey, C., Hennessy, R. C., Nicolaisen, M. H., Liu, F., Nybroe, O., et al. (2022). The cytokinin-producing plant beneficial bacterium Pseudomonas fluorescens G20-18 primes tomato (Solanum lycopersicum) for enhanced drought stress responses. J. Plant Physiol. 270:153629. doi: 10.1016/j.jplph.2022.153629
Meng, C., Kuster, B., Culhane, A. C., and Gholami, A. M. (2014). A multivariate approach to the integration of multi-omics datasets. BMC Bioinform 15:162. doi: 10.1186/1471-2105-15-162
Meudt, W. J., and Gaines, T. P. (1967). Studies on the oxidation of indole-3-acetic acid by peroxidase enzymes. I. Colorimetric determination of indole-3-acetic acid oxidation products. Plant Physiol. 42, 1395–1399. doi: 10.1104/pp.42.10.1395
Muthukumar, A., Raj, T. S., Prabhukarthikeyan, S. R., Kumar, R. N., and Keerthana, U. (2022). “Pseudomonas and bacillus: a biological tool for crop protection in New and Future Developments,” in Microbial Biotechnology and Bioengineering. eds. H. B. Singh and A. Vaishnav (Elsevier), 145–158.
Nicholson, W. L., Munakata, N., Horneck, G., Melosh, H. J., and Setlow, P. (2000). Resistance of bacillus endospores to extreme terrestrial and extraterrestrial environments. Microbiol. Mol. Biol. Rev. 64, 548–572. doi: 10.1128/MMBR.64.3.548-572.2000
Nutaratat, P., Monprasit, A., and Srisuk, N. (2017). High-yield production of indole-3-acetic acid by Enterobacter sp. DMKU-RP206, a rice phyllosphere bacterium that possesses plant growth-promoting traits. 3 Biotech 7:305. doi: 10.1007/s13205-017-0937-9
Onyango, L. A., and Alreshidi, M. M. (2018). Adaptive metabolism in staphylococci: survival and persistence in environmental and clinical settings. J. Pathog. 2018:1092632. doi: 10.1155/2018/1092632
Paradis, E., Claude, J., and Strimmer, K. (2004). APE: analyses of phylogenetics and evolution in R language. Bioinformatics 20, 289–290. doi: 10.1093/bioinformatics/btg412
Paul, S., Aggarwal, C., Thakur, J. K., Bandeppa, G. S., Khan, M., Pearson, L. M., et al. (2015). Induction of osmoadaptive mechanisms and modulation of cellular physiology help Bacillus licheniformis Strain SSA 61 adapt to salt stress. Curr Microbiol. 70, 610–617. doi: 10.1007/s00284-014-0761-y
Picard, M., Scott-Boyer, M. P., Bodein, A., Périn, O., and Droit, A. (2021). Integration strategies of multi-omics data for machine learning analysis. Comput. Struct. Biotechnol. J. 19, 3735–3746. doi: 10.1016/j.csbj.2021.06.030
Qurashi, A. W., and Sabri, A. N. (2013). Osmolyte accumulation in moderately halophilic bacteria improves salt tolerance of chickpea. Pak. J. Bot. 45, 1011–1016.
Ramakrishna, W., Yadav, R., and Li, K. (2019). Plant growth promoting bacteria in agriculture: two sides of a coin. Appl. Soil Ecol. 138, 10–18. doi: 10.1016/j.apsoil.2019.02.019
Remus-Emsermann, M. N., and Schlechter, R. O. (2018). Phyllosphere microbiology: at the interface between microbial individuals and the plant host. New Phytol. 218, 1327–1333. doi: 10.1111/NPH.15054
Sachdev, S., and Ansari, M. I. (2022). “Role of plant microbiome under stress environment to enhance crop productivity,” in Augmenting Crop Productivity in Stress Environment. eds. S. A. Ansari, M. I. Ansari, and A. Husen (Singapore: Springer), 205–221.
Saleem, M., Meckes, N., Pervaiz, Z. H., and Traw, M. B. (2017). Microbial interactions in the phyllosphere increase plant performance under herbivore biotic stress. Front. Microbiol. 8:41. doi: 10.3389/fmicb.2017.00041
Sandhu, N., Yadav, S., and Kumar, A. (2020). “Advances in developing multigene abiotic and biotic stress-tolerant rice varieties,” in Abiotic Stress in Plants. Vol. 1 eds. S. Fahad, S. Saud, Y. Chen, C. Wu, and D. Wang (London, UK: INTECH Open Access Publishers), 1–20.
Santos, H., and Da Costa, M. S. (2002). Compatible solutes of organisms that live in hot saline environments. Environ. Microbiol. 4, 501–509. doi: 10.1046/j.1462-2920.2002.00335.x
Saxena, R., Kumar, M., and Tomar, R. S. (2021). “Plant–rhizobacteria interactions to induce biotic and abiotic stress tolerance in plants,” in Plant, Soil and Microbes. eds. S. K. Dubey and S. K. Verma (Singapore: Springer), 1–18.
Sayyed, R. Z., Chincholkar, S. B., Reddy, M. S., Gangurde, N. S., and Patel, P. R. (2013). “Siderophore producing PGPR for crop nutrition and phytopathogen suppression,” in Bacteria in Agrobiology: Disease Management. ed. S. D. Maheshwari (Berlin, Heidelberg: Springer), 449–471.
Schwyn, B., and Neilands, J. B. (1987). Universal chemical assay for the detection and determination of siderophores. Anal. Biochem. 160, 47–56. doi: 10.1016/0003-2697(87)90612-9
Sessitsch, A. N., Hardoim, P. A., Döring, J., Weilharter, A., Krause, A. N., Woyke, T. A., et al. (2012). Functional characteristics of an endophyte community colonizing rice roots as revealed by metagenomic analysis. Mol. Plant-Microbe Interact. 25, 28–36. doi: 10.1094/MPMI-08-11-0204
Setlow, P. (2014). Germination of spores of bacillus species: what we know and do not know. J. Bacteriol. 196, 1297–1305. doi: 10.1128/JB.01455-13
Shahzad, R., Khan, A. L., Bilal, S., Waqas, M., Kang, S. M., and Lee, I. J. (2017). Inoculation of abscisic acid-producing endophytic bacteria enhances salinity stress tolerance in Oryza sativa. Environ. Exp. Bot. 136, 68–77. doi: 10.1016/j.envexpbot.2017.01.010
Shinwari, Z. K., Tanveer, F., and Iqrar, I. (2019). “Role of microbes in plant health, disease management, and abiotic stress management,” in Microbiome in Plant Health and Disease. eds. V. Kumar, R. Prasad, M. Kumar, and D. Choudhary (Singapore: Springer), 231–250.
Shu, W. S., and Huang, L. N. (2022). Microbial diversity in extreme environments. Nat. Rev. Microbiol. 20, 219–235. doi: 10.1038/s41579-021-00648-y
Singh, P., Chauhan, P. K., Upadhyay, S. K., Singh, R. K., Dwivedi, P., Wang, J., et al. (2022). Mechanistic insights and potential use of siderophores producing microbes in rhizosphere for mitigation of stress in plants grown in degraded land. Front. Microbiol. 13:898979. doi: 10.3389/fmicb.2022.898979
Singh, A., Shannon, C. P., Gautier, B., Rohart, F., Vacher, M., Tebbutt, S. J., et al. (2019). Diablo: an integrative approach for identifying key molecular drivers from multi-omics assays. Bioinformatics 35, 3055–3062. doi: 10.1093/bioinformatics/bty1054
Sun, Z., Liu, K., Zhang, J., Zhang, Y., Xu, K., Yu, D., et al. (2017). IAA producing Bacillus altitudinis alleviates iron stress in Triticum aestivum L. seedling by both bioleaching of iron and up-regulation of genes encoding ferritins. Plant Soil 419, 1–11. doi: 10.1007/s11104-017-3218-9
Tempest, D. W., Meers, J. L., and Brown, C. M. (1970). Influence of environment on the content and composition of microbial free amino acid pools. J. Gen. Microbiol. 64, 171–185. doi: 10.1099/00221287-64-2-171
Tortora, G. J., Funke, B. R., and Case, C. L. (2019). Microbiology: An Introduction 13th Edn. Boston: Pearson Education.
Upadhyay, S. K., Singh, J. S., Saxena, A. K., and Singh, D. P. (2012). Impact of PGPR inoculation on growth and antioxidant status of wheat under saline conditions. Plant Biol. 14, 605–611. doi: 10.1111/j.1438-8677.2011.00533.x
Upadhyay, S. K., Singh, J. S., and Singh, D. P. (2011). Exopolysaccharide-producing plant growth-promoting rhizobacteria under salinity condition. Pedosphere 21, 214–222. doi: 10.1016/S1002-0160(11)60120-3
Uzma, M., Iqbal, A., and Hasnain, S. (2022). Drought tolerance induction and growth promotion by indole acetic acid producing Pseudomonas aeruginosa in Vigna radiata. PLoS One 17:e0262932. doi: 10.1371/journal.pone.0262932
Vahabi, N., and Michailidis, G. (2022). Unsupervised multi-omics data integration methods: a comprehensive review. Front. Genet. 13:560. doi: 10.3389/FGENE.2022.854752/BIBTEX
Varela, C. A., Baez, M. E., and Agosin, E. (2004). Osmotic stress response: quantification of cell maintenance and metabolic fluxes in a lysine-overproducing strain of Corynebacterium glutamicum. Appl. Environ. Microbiol. 70, 4222–4229. doi: 10.1128/AEM.70.7.4222-4229.2004
Vejan, P., Abdullah, R., Khadiran, T., Ismail, S., and Nasrulhaq Boyce, A. (2016). Role of plant growth promoting rhizobacteria in agricultural sustainability—a review. Molecules 21:573. doi: 10.3390/molecules21050573
Venkatachalam, S., Ranjan, K., Prasanna, R., Ramakrishnan, B., Thapa, S., and Kanchan, A. (2016). Diversity and functional traits of culturable microbiome members, including cyanobacteria in the rice phyllosphere. Plant Biol. 18, 627–637. doi: 10.1111/plb.12441
Vorholt, J. A. (2012). Microbial life in the phyllosphere. Nat. Rev. Microbiol. 10, 828–840. doi: 10.1038/nrmicro2910
Wei, G., Kloepper, J. W., and Tuzun, S. (1991). Induction of systemic resistance of cucumber to Colletotrichum orbiculare by select strains of plant growth-promoting rhizobacteria. Phytopathology 81, 1508–1512. doi: 10.1094/Phyto-81-1508
Xiang, R., Wang, W., Yang, L., Wang, S., Xu, C., and Chen, X. (2021). A comparison for dimensionality reduction methods of single-cell RNA-seq data. Front. Genet. 12:320. doi: 10.3389/FGENE.2021.646936/BIBTEX
Zampieri, G., Vijayakumar, S., Yaneske, E., and Angione, C. (2019). Machine and deep learning meet genome-scale metabolic modeling. PLoS Comput. Biol. 15:e1007084. doi: 10.1371/JOURNAL.PCBI.1007084
Zerrouk, I. Z., Rahmoune, B., Auer, S., Rößler, S., Lin, T., Baluska, F., et al. (2020). Growth and aluminum tolerance of maize roots mediated by auxin-and cytokinin-producing bacillus toyonensis requires polar auxin transport. Environ. Exp. Bot. 176:104064. doi: 10.1016/j.envexpbot.2020.104064
Keywords: osmotic stress, rice phyllosphere, plant growth-promoting bacteria, data integration, strain selection
Citation: Devarajan AK, Truu M, Gopalasubramaniam SK, Muthukrishanan G and Truu J (2022) Application of data integration for rice bacterial strain selection by combining their osmotic stress response and plant growth-promoting traits. Front. Microbiol. 13:1058772. doi: 10.3389/fmicb.2022.1058772
Edited by:
José David Flores Félix, Universidade da Beira Interior, PortugalReviewed by:
Sudhir K. Upadhyay, Veer Bahadur Singh Purvanchal University, IndiaJudith Naamala, McGill University, Canada
Rashid Nazir, COMSATS University Islamabad, Abbottabad Campus, Pakistan
Copyright © 2022 Devarajan, Truu, Gopalasubramaniam, Muthukrishanan and Truu. This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.
*Correspondence: Arun Kumar Devarajan, YXJ1bi5rdW1hci5kZXZhcmFqYW5AdXQuZWU=; Sabarinathan Kuttalingam Gopalasubramaniam, c2FiYXJpbWljcm9AaG90bWFpbC5jb20=