Skip to main content

ORIGINAL RESEARCH article

Front. Chem., 09 July 2024
Sec. Theoretical and Computational Chemistry

Elucidating structure–property relationships of guar gum biomolecules: insights from M-polynomial and QSPR modeling

  • Department of Mathematics, University of Gujrat, Gujrat, Pakistan

This study investigates the quantitative structure–property relationship (QSPR) modeling of guar gum biomolecules, focusing on their structural parameters. Guar gum, a polysaccharide with diverse industrial applications, exhibits various properties such as viscosity, solubility, and emulsifying ability, which are influenced by its molecular structure. In this research, M-polynomial and associated topological indices are employed as structural descriptors to represent the molecular structure of guar gum. The M-polynomial and associated topological indices capture important structural features, including size, shape, branching, and connectivity. By correlating these descriptors with experimental data on guar gum properties, predictive models are developed using regression analysis techniques. The analysis revealed a strong correlation between the boiling point and molecular weight and all the considered topological descriptors. The resulting models offer insights into the relationship between guar gum structure and its properties, facilitating the optimization of guar gum production and application in various industries. This study demonstrates the utility of M-polynomial and QSPR modeling in elucidating structure–property relationships of complex biomolecules like guar gum, contributing to the advancement of biomaterial science and industrial applications.

1 Introduction

Guar gum, an innovative agrochemical, is derived from the endosperm of cluster beans. The species Cyamopsis tetragonoloba, belonging to the Leguminosae family, produces the seeds used in guar gum production, and these seeds are resistant to drought (Prem et al., 2005). The concept of transdomestication was introduced by Hymowitz, although the exact origins of this practice are still a matter of dispute. Further information on the subject can be found in Whistler (1948) and BeMiller (2009). Guar gum has recently piqued the curiosity of several experts for numerous reasons. Guar gum powder serves as a thickener, stabilizer, and health management tool in various industries. It consists of galactomannan polysaccharides and can be derived from different sources, including algae, plants, microorganisms, and animals (Hovgaard and Brondsted, 1996). These polysaccharides are characterized by their stability, non-toxicity, hydrophilicity, and biodegradability. Its unique properties, such as high viscosity, solubility, and stability, make it a versatile ingredient with a wide range of applications in industries like food, pharmaceuticals, cosmetics, and oil drilling. Additionally, guar gum is known for its biocompatibility and non-toxic nature, making it suitable for various biomedical and pharmaceutical applications. By studying guar gum, researchers can explore its potential uses, optimize its properties, and develop innovative products with improved performance and functionality. Overall, the selection of guar gum for research offers opportunities to advance knowledge in various fields and contribute to the development of sustainable and high-performing materials and products.

Various derivatives of guar gum exist, such as carboxymethyl guar HPG, hydroxymethyl guar CMG, hydroxypropylethyl guar, ammonium hydroxylpropyl trimethyl chloride guar, acryloyloxy guar, methacryloyl guar, guar gum esters, and carboxymethyl hydroxypropyl guar CMHPG. The computation of guar gum and its derivatives employs molecular graphs, as detailed in Shanmukha et al. (2022). The most widely used derivatives are carboxymethyl guar (CMG), hydroxypropyl guar (HPG), and carboxymethyl hydroxypropyl guar (CMHPG), which are discussed in this study with respect to topological indices.

Chemical graph theory is an interdisciplinary field that links chemistry and mathematics. Graph modeling, originating from early chemical experiments, is a crucial aspect of theoretical chemistry. The subfield of cheminformatics analyzes quantitative structure–activity relationships (QSAR) and quantitative structure–property relationships (QSPR) to predict the biological activity and characteristics of guar gum and its derivatives. Utilizing topological indices and physico-chemical substances, it is possible to infer the pharmacological activity of these compounds without conducting experiments. Noteworthy, studies related to topological indices and physico-chemical substances include Arockiaraj et al. (2023a); Arockiaraj et al. (2023b); and Arockiaraj et al. (2023c). Recently, algebraic polynomials, such as the Hosoya polynomial (Consonni and Todeschini, 2010a) and M-polynomials (Deutsch and Klavar, 2014), have gained prominence in chemistry for determining distance-based topological indices and degree-based topological indices, respectively. The M-polynomial yields closed forms for various degree-based indices, while the Hosoya polynomial focuses on distance-based indices. The M-polynomial, often denoted as M(x,y), is a polynomial used in the study of chemical graph theory, particularly in the enumeration of certain chemical structures known as molecular graphs. The M-polynomial encodes information about the molecular graph’s topology, such as its number of vertices, edges, and other structural properties. It has applications in the enumeration of molecular isomers, the prediction of molecular properties, and the study of chemical reactions. Significant knowledge regarding degree-based graph invariants can be found in the M-polynomial literature, including Munir et al. (2016a); Munir et al. (2016b); Munir et al. (2016c); Munir et al. (2016d); and Ajmal et al. (2017). There is a wealth of knowledge regarding degree-based graph invariants in the M-polynomial.

The M-polynomial of graph Γ is defined as

MΓ;x,y=stmstΓxsyt,

where mst(Γ) is the number of edges νυE(Γ) such that {dν, dυ} = {s, t}.

Table 1 contains some degree-based TIs and the M-polynomial for the graph Γ:Dx=x(g(x,y))(x), Dy=y(g(x,y))(y), Sx=0xg(t,y)tdt, Sy=0yg(x,t)tdt,I(g(x,y))=(g(x,x)), and Qα(g(x,y))=xα(g(x,y)).

Table 1
www.frontiersin.org

Table 1. M-polynomials are used to derive several degree-based topological indices.

The topological index, frequently known as the connectedness index, was introduced in 1947 as a result of Weiner’s research (Consonni and Todeschini, 2010b). The earliest and most extensively researched topological index was the Wiener index [for further information, see (Wiener, 1947; Gutman et al., 1986)]. One of the earliest topological indices, the Randić index (Randic, 1975), was first introduced by Milan Randić in 1975 and is represented by the symbol R12(Γ). Its definition is as follows:

R12Γ=υνEΓ1dυdν.(1)

In 1998, Bollobs and Erds (1998) and Amic et al. (1998) independently proposed the general Randić index, which has been extensively studied for its numerous mathematical properties (Caporossi et al., 2003; Hu et al., 2005). For a detailed survey, refer to Li et al. (2006). The general Randić index is defined as

RαΓ=υνEΓ1dυdνα,(2)

and RRα(Γ)=υνE(Γ)(dυdν)α is the definition of the inverse Randić index.

Many papers and books, such as Kier and Hall (1976) and Kier and Hall (1986), have been produced on this topological index. For drug design, the Randić index was recognized. The first and second Zagreb indices were introduced by Gutman and Trinajstić. They are denoted as follows: M1(Γ) = υνE(Γ)(dυ + dν) and M2(Γ) = υνE(Γ)(dυdν), respectively. The reader is referred to Nikolic et al. (2003); Das and Gutman (2004); Gutman and Das (2004); Vukicevic and Graovac (2004); and Trinajstic et al. (2010) for further information on these indices. Among several modifications of Zagreb indices, one is the second modified Zagreb index. According to Milicevic et al. (2004), the second modified Zagreb index for a simple connected graph Γ is defined as

M2mΓ=υνEΓ1dυdν(3)

The symmetric division index (SDD) is particularly useful in determining the total surface area for polychlorobiphenyls (Gupta et al., 2016) based on the discrete Adriatic indices. The symmetric division index of a connected graph G is defined as

SDDΓ=υνEΓmindυ,dνmaxdυ,dν+maxdυ,dνmindυ,dν.(4)

The harmonic index is an additional Randić index variation that is described as

HΓ=υνEΓ2dυ+dν.(5)

There is a relationship between graph eigenvalues and the harmonic index (Favaron et al., 1993). Using MathChem, the elegant structure of extremal graphs is used to generate the inverse sum index (Balaban, 1982), a significant predictor of the octane isomer total surface area.

IΓ=υνEΓdυdνdυ+dν.(6)

Furtula et al. (2010) is credited for the augmented Zagreb index AZI, which is characterized as

AZIΓ=υνEΓdυdνdυ+dν23.(7)

Graph invariant AZI has higher prediction power than the atom-bond connectivity index (Furtula et al., 2010) and is a useful predictive measure for analyzing the heat of formation in octanes and heptanes (for more detail, see (Estrada et al., 1998; Das, 2010)). A few well-known degree-based topological indices (which are defined in Eqs 17) with M-polynomials (Deutsch and Klavar, 2014) are related in the following Table 1.

2 Methodology

Molecular graphs and vertex and edge partitions are used to modify the molecular structure of guar gum and its chemical derivatives (Figure 1). Topological indices are derived using M-polynomial. Graphical comparisons of the aforementioned defined indices are made using vertex partition, edge partition, and combinatorial computing. The graphical representation of the outcomes and the comparative study of the findings are performed via 2D plotting in Figure 2 and 3D plotting in Figure 3 are shown by utilizing Mathematica software. In this study, physio-chemical properties of the selected guar gum and its derivatives were obtained from ChemSpider, providing a comprehensive dataset for analysis. Several topological indices were calculated using M-polynomials, extracting key molecular information relevant to biological activities. Subsequently, these indices were utilized in quantitative structure–property relationship (QSPR) analysis, employing SPSS software. The process involves constructing linear, quadratic, and logarithmic models to establish correlations between the calculated topological indices and the properties of selected guar gum and its derivatives. This meticulous methodology aims to uncover patterns and relationships within the molecular structures, contributing to a deeper understanding of the properties of selected guar gum and its derivatives.

Figure 1
www.frontiersin.org

Figure 1. Molecular structure. (A) guar gum, (B) HPG, (C) CMG, and (D) CMHPG.

Figure 2
www.frontiersin.org

Figure 2. Comparison of topological indices of guar gum in blue, HPG and CMG in yellow, and CMHPG in green, respectively. (A) M1, (B) M2, (C) mM2, (D) SDD, (E) H, (F) I, and (G) AZI.

Figure 3
www.frontiersin.org

Figure 3. 3D plots of M-polynomials. (A) Guar gum, (B) CMG, (C) CMHPG.

3 Main results and discussions

This section outlines our primary analytical findings and subdivides the material into three sections: guar gum, hydroxypropyl guar, and carboxymethyl guar. The M-polynomials and their associated topological indices are derived for the chemical structures of guar gum, which are useful in the QSPR study. Guar gum is a polysaccharide used in various industries including food, pharmaceuticals, and cosmetics. QSPR studies based on M-polynomials and topological indices can help in understanding how its structural features relate to its properties such as viscosity, solubility, and emulsifying ability. By establishing quantitative relationships between structure and properties, researchers can optimize the production and application of guar gum for various industrial purposes.

3.1 Guar gum

This section uses various topological indices to analyze the molecular graph of guar gum. Figure 4 shows the vertex and edge partitioning of guar gum.

Ei0,i1=e=υνEΓm,n:dυ=i0,dν=i1,

such that |E{1,2}| = 1, |E{1,3}| = 7n + 1, |E{2,2}| = 2n, |E{2,3}| = 14n − 1, and |E{3,3}| = 9n.

Figure 4
www.frontiersin.org

Figure 4. Guar gum molecular graph for n = 3.

Theorem 3.2. Consider a molecular graph Γ for guar gum. Then,

MΓ;x,y=xy2+7n+1xy3+2nx2y2+14n1x2y3+9nx3y3.

Proof. Using the M-polynomial of Γ, we obtain the following equation:

MΓ;x,y=i0i1mi0i1Γxi0yi1.=12m12Γx1y2+13m13Γx1y3+22m22Γx2y2+23m22Γx2y3+33m33Γx3y3.=uvE1,2m12Γx1y2+uvE1,3m13Γx1y3+uvE2,2m22Γx2y2+uvE2,3m23Γx2y3+uvE3,3m33Γx3y3.=|E1,2x1y2+|E1,3|x1y3+|E2,2|x2y2+|E2,3|x2y3+|E3,3|x3y3.=xy2+7n+1xy3+2nx2y2+14n1x2y3+9nx3y3.

Proposition 3.3. Consider a molecular graph Γ for guar gum. Then,

1.M1Γ=160n+2.2.M2Γ=194n1.3.mM2Γ=6.167n+0.67.4.RαΓ=2α+3α7n+1+22α+1n+2α3α14n1+32α+2n.5.RRαΓ=12α+7n+13α+n22α1+14n12α3α+n32α2.6.SDDΓ=75.67n+2.5.

Proof: From the edge partitioning of guar gum and using the definition M-polynomial of Γ, we obtain the following equations (Eqs 816):

gx,y=MΓ;x,y=xy2+7n+1xy3+2nx2y2+14n1x2y3+9nx3y3,
Dxgx,y=xy2+7n+1xy3+4nx2y2+214n1x2y3+27nx3y3,(8)
Dygx,y=xy2+37n+1xy3+4nx2y2+314n1x2y3+27nx3y3,(9)
DxDygx,y=xy2+37n+1xy3+8nx2y2+614n1x2y3+81nx3y3,(10)
Sxgx,y=xy2+7n+1xy3+nx2y2+1214n1x2y3+93nx3y3,(11)
SySxgx,y=12xy2+137n+1xy3+12nx2y2+1614n1x2y3+99nx3y3,(12)
DxαDyαgx,y=2αxy2+3α7n+1xy3+2.2α.2αnx2y2+2α.3α14n1x2y3,(13)
+9n3α.3αx3y3,(14)
SyαSxαgx,y=xy22α+7n+13αxy3+nx2y22α1.2α+14n12α.3αx2y3+9nx3y33α.3α,(15)
SyDxgx,y=xy22+7n+13xy3+2nx2y2+2314n1x2y3+9nx3y3,(16)
SxDygx,y=2xy2+37n+1xy3+2nx2y2+3214n1x2y3+9nx3y3.(17)

From Table 1, we obtain the following equation:

1.M1Γ=Dx+DyMΓ;x,y|x=y=1=160n+2.2.M2Γ=DxDyMΓ;x,y|x=y=1=194n1.3.mM2Γ=SxSyMΓ;x,y|x=y=1=6.167n+0.67.4.RαΓ=DxαDyαMΓ;x,y|x=y=1=2α+3α7n+1+22α+1n+2α3α14n1+32α+2n.5.RRαΓ=SxαSyαMΓ;x,y|x=y=1=12α+7n+13α+n22α1+14n12α3α+n32α2.6.SDDΓ=DxSy+SxDyMΓ;x,y|x=y=1=75.67n+2.5.

Proposition 3.4. Consider a molecular graph Γ for guar gum. Then,

1.HΓ=13.1n+0.76.2.IΓ=37.55n+0.22.3.AZIΓ=254.125n+3.375.

Proof: From the edge partitioning of guar gum and using the definition M-polynomial of Γ, we obtain the following equations (Eqs 1720):

gx,y=MΓ;x,y=xy2+7n+1xy3+2nx2y2+14n1x2y3+9nx3y3.
Igx,y=x3+7n+1x4+2nx4+214n1x5+9nx6,(18)
SxIgx,y=13x3+7n+14x4+12nx4+1514n1x5+32nx6,(19)
SxIDxDygx,y=23x3+37n+14x4+2nx4+6514n1x5+272nx6,(20)
Sx3Q2IDx3Dy3gx,y=8x+277n+18x2+16nx2+814n1x3+9.27.2764nx4.(21)

From Table 1, we obtain the following equation:

1.HΓ=2SxIMΓ;x,y|x=1=13.1n+0.76.2.IΓ=SxIDxDyMΓ;x,y|x=1=37.55n+0.22.3.AZIΓ=Sx3Q2IDx3Dy3MΓ;x,y|x=1=254.125n+3.375.

3.2 Results of HPG and CMG molecular graphs

When the chemical derivatives of guar gum, such as HPG and CMG, were developed into molecular graphs. The results for these chemical derivatives were combined, as shown below, because the vertex and edge partitions are similar. Figure 5 shows the molecular graphs of HPG and CMG. From the vertices and edges of HPG and CMG, we obtain the following equation:

Ei0,i1=e=υνECMGm,n:dυ=i0,dν=i1,

such that |E{1,2}| = 2n + 1, |E{1,3}| = 7n + 1, |E{2,2}| = 4n, |E{2,3}| = 14n − 1, and |E{3,3}| = 12n.

Figure 5
www.frontiersin.org

Figure 5. Molecular graph for n = 3: (A) HPG and (B) CMG.

Theorem 3.6. Consider a molecular graph Γ for hydroxypropyl Guar and carboxymethyl guar. Then,

MCMG;x,y=2n+1xy2+7n+1xy3+4nx2y2+14n1x2y3+12nx3y3.

Proof: From the edge partitioning of HPG and CMG and using the definition M-polynomial of Γ, we obtain the following equation:

MCMG;x,y=i0i1mi0i1Γxi0yi1.=12m12Γx1y2+13m13Γx1y3+22m22Γx2y2+23m22Γx2y3+33m33Γx3y3.=uvE1,2m12Γx1y2+uvE1,3m13Γx1y3+uvE2,2m22Γx2y2+uvE2,3m23Γx2y3+uvE3,3m33Γx3y3.=|E1,2x1y2+|E1,3|x1y3+|E2,2|x2y2+|E2,3|x2y3+|E3,3|x3y3.=2n+1xy2+7n+1xy3+4nx2y2+14n1x2y3+12nx3y3.

Proposition 3.7. Consider a molecular graph Γ for hydroxypropyl guar and carboxymethyl guar. Then,

1.M1Γ=192n+1.2.M2Γ=233n1.3.mM2Γ=7.99n+0.663.4.RαΓ=2α2n+1+3α7n+1+22α+2n+2α3α14n1+4.32α+1n.5.RRαΓ=12α2n+1+7n+13α+n22α2+14n12α3α+4n32α1.6.SDDΓ=90.66n+3.66.

Proof: From the edge partitioning of HPG and CMG and using the definition M-polynomial of Γ, we obtain the following equations (Eqs 2130):

hx,y=MCMG;x,y=2n+1xy2+7n+1xy3+4nx2y2+14n1x2y3+12nx3y3,
Dxhx,y=2n+1xy2+7n+1xy3+8nx2y2+214n1x2y3+36nx3y3,(22)
Dyhx,y=22n+1xy2+37n+1xy3+8nx2y2+314n1x2y3+36nx3y3,(23)
DxDyhx,y=22n+1xy2+37n+1xy3+16nx2y2+614n1x2y3+108nx3y3,(24)
Sxhx,y=2n+1xy2+7n+1xy3+2nx2y2+1214n1x2y3+4nx3y3,(25)
SySxhx,y=2n+12xy2+137n+1xy3+2nx2y2+1314n1x2y3+4nx3y3,(26)
DxαDyαhx,y=2α2n+1xy2+3α7n+1xy3+4.2α.2αnx2y2,(27)
+2α.3α14n1x2y3+12n3α.3αx3y3,(28)
SyαSxαhx,y=2n+1xy22α+7n+13αxy3+4nx2y22α.2α+14n12α.3αx2y3+12nx3y33α.3α,(29)
SyDxhx,y=2n+1xy22+7n+13xy3+4nx2y2+2314n1x2y3+12nx3y3,(30)
SxDyhx,y=22n+1xy2+37n+1xy3+4nx2y2+3214n1x2y3+12nx3y3.(31)

From Table 1, we obtain the following equation:

1.M1Γ=Dx+DyMCMG;x,y|x=y=1=192n+1.2.M2Γ=DxDyMCMG;x,y|x=y=1=233n1.3.mM2Γ=SxSyMCMG;x,y|x=y=1=7.99n+0.663.4.RαΓ=DxαDyαMCMG;x,y|x=y=1=2α2n+1+3α7n+1+22α+2n+2α3α14n1+4.32α+1n.5.RRαΓ=SxαSyαMCMG;x,y|x=y=1=12α2n+1+7n+13α+n22α2+14n12α3α+4n32α1.6.SDDΓ=DxSy+SxDyMCMG;x,y|x=y=1=90.66n+3.66.

Proposition 3.8. Consider a molecular graph Γ for hydroxypropyl guar and carboxymethyl guar. Then,

1.HΓ=16.44n+0.78.2.IΓ=45.38n+0.22.3.AZIΓ=320.315n+3.375.

Proof: From the edge partitioning of HPG and CMG and using the definition M-polynomial of Γ, we obtain the following equations (Eqs 3134):

hx,y=MΓ;x,y=2n+1xy2+7n+1xy3+4nx2y2+14n1x2y3+12nx3y3,
Ihx,y=2n+1x3+7n+1x4+4nx4+14n1x5+12nx6,(32)
SxIhx,y=2n+13x3+7n+14x4+nx4+1514n1x5+2nx6,(33)
SxIDxDyhx,y=232n+1x3+37n+14x4+4nx4+6514n1x5+18nx6,(34)
Sx3Q2IDx3Dy3hx,y=82n+1x+277n+18x2+32nx2+814n1x3,(35)
+12.27.2764nx4.(36)

From Table 1, we obtain the following equation:

1.HΓ=2SxIMCMG;x,y|x=1=16.44n+0.78.2.IΓ=SxIDxDyMCMG;x,y|x=1=45.38n+0.22.3.AZIΓ=Sx3Q2IDx3Dy3MCMG;x,y|x=1=320.315n+3.375.

3.3 Results for the CMHPG molecular graph

In the carboxymethyl hydroxypropyl guar molecular graph shown in Figure 6, for the vertices and edges, we obtain the following equation:

Ei0,i1=e=υνECMHPGm,n:dυ=i0,dν=i1,

such that |E{1,2}| = 3n + 1, |E{1,3}| = 6n + 1, |E{2,2}| = 5n, |E{2,3}| = 15n − 1, and |E{3,3}| = 12n.

Figure 6
www.frontiersin.org

Figure 6. Molecular graph of CMHPG for n = 3.

Theorem 3.10. Consider a molecular graph Γ for carboxymethyl hydroxypropyl guar. Then,

MCMHPG;x,y=3n+1xy2+6n+1xy3+5nx2y2+15n1x2y3+12nx3y3.

Proof: From the edge partitioning of CMHPG and using the definition M-polynomial of Γ, we obtain the following equation:

MCMHPG;x,y=i0i1mi0j1Γxi0yi1.=12m12Γx1y2+13m13Γx1y3+22m22Γx2y2+23m22Γx2y3+33m33Γx3y3.
=uvE1,2m12Γx1y2+uvE1,3m13Γx1y3+uvE2,2m22Γx2y2+uvE2,3m23Γx2y3+uvE3,3m33Γx3y3.=|E1,2x1y2+|E1,3|x1y3+|E2,2|x2y2+|E2,3|x2y3+|E3,3|x3y3.=3n+1xy2+6n+1xy3+5nx2y2+15n1x2y3+12nx3y3.

Proposition 3.11. Consider a molecular graph Γ for carboxymethyl hydroxypropyl guar. Then,

1.M1Γ=200n+2.2.M2Γ=242n1.3.mM2Γ=8.58n+0.663.4.RαΓ=2α3n+1+3α6n+1+5n.22α+2α3α15n1+12n.32α.5.RRαΓ=12α3n+1+6n+13α+5n22α+15n12α3α+12n32α.6.SDDΓ=106.03n+3.66.

Proof: From the edge partitioning of CMHPG and using the definition M-polynomial of Γ, we obtain the following equations (Eqs 3543):

Jx,y=MCMHPG;x,y=3n+1xy2+6n+1xy3+4nx2y2+15n1x2y3+12nx3y3,
DxJx,y=3n+1xy2+6n+1xy3+10nx2y2+215n1x2y3+36nx3y3,(37)
DyJx,y=23n+1xy2+36n+1xy3+10nx2y2+315n1x2y3+36nx3y3,(38)
DxDyJx,y=23n+1xy2+36n+1xy3+20nx2y2+615n1x2y3+108nx3y3,(39)
SxJx,y=3n+1xy2+6n+1xy3+52nx2y2+1215n1x2y3+4nx3y3,(40)
SySxJx,y=3n+12xy2+136n+1xy3+54nx2y2+1615n1x2y3+43nx3y3,(41)
DxαDyαJx,y=2α3n+1xy2+3α6n+1xy3+5.2α.2αnx2y2+2α.3α15n1x2y3,(42)
+12n3α.3αx3y3,(43)
SyαSxαJx,y=3n+1xy22α+6n+13αxy3+5nx2y22α.2α+15n12α.3αx2y3+12nx3y33α.3α,(44)
SyDxJx,y=3n+1xy22+6n+13xy3+5nx2y2+2315n1x2y3+12nx3y3,(45)
SxDyJx,y=23n+1xy2+36n+1xy3+5nx2y2+3215n1x2y3+12nx3y3.(46)

From Table 1, we obtain the following equation:

1.M1Γ=Dx+DyMCMHPG;x,y|x=y=1=200n+2.2.M2Γ=DxDyMCMHPG;x,y|x=y=1=242n1.3.mM2Γ=SxSyMCMHPG;x,y|x=y=1=8.58n+0.663.4.RαΓ=DxαDyαMCMHPG;x,y|x=y=1=2α3n+1+3α6n+1+5n.22α+2α3α15n1+12n.32α.5.RRαΓ=SxαSyαMCMHPG;x,y|x=y=1=12α3n+1+6n+13α+5n22α+15n12α3α+12n32α.6.SDDΓ=DxSy+SxDyMCMHPG;x,y|x=y=1=106.03n+3.66.

Proposition 3.12. Consider a molecular graph Γ for carboxymethyl hydroxypropyl guar. Then,

1. H(Γ) = 17.5n + 0.76

2. I(Γ) = 47.5n + 0.22

3. AZI(Γ) = 340.95n + 3.4

Proof: From the edge partitioning of CMHPG and using the definition M-polynomial of Γ, we obtain the following equations (Eqs 4447):

Jx,y=MCMPHG;x,y=3n+1xy2+6n+1xy3+5nx2y2+15n1x2y3+12nx3y3,
IJx,y=3n+1x3+6n+1x4+5nx4+15n1x5+12nx6,(47)
SxIJx,y=3n+13x3+6n+14x4+54nx4+1515n1x5+2nx6,(48)
SxIDxDyJx,y=233n+1x3+36n+14x4+5nx4+6515n1x5+18nx6,(49)
Sx3Q2IDx3Dy3Jx,y=83n+1x+276n+18x2+40nx2+815n1x3,(50)
+12.27.2764nx4.(51)

From Table 1, we obtain the following equation:

1.HΓ=2SxIMCMHPG;x,y|x=1=17.5n+0.76.2.IΓ=SxIDxDyMCMHPG;x,y|x=1=47.5n+0.22.3.AZIΓ=Sx3Q2IDx3Dy3MCMHPG;x,y|x=1=340.95n+3.4.

4 Molecular structures and computations of topological indices for different carbohydrates

Carbohydrates are organic molecules made up of carbon, hydrogen, and oxygen. They come in three main types: monosaccharides, disaccharides, and polysaccharides. These sugars or polymers can be aldehydes or ketones and serve various purposes in various species. Exoskeletons, found on arthropods, are made of chitin, a nitrogen-containing polymer. Guar gum, a non-ionic polysaccharide derived from the cluster bean endosperm, is a new agrochemical. Polysaccharides are polymers formed when several monomer units are linked together through condensation. They are the most common type of biomolecules and are found in various sources, including algal, plant, microbial, and animal sources. The structural parameters of polysaccharides typically define its chemical compositions, molecular weights, molecular structure, degree of substitution, viscosity, solubility, and particle size, contributing to their structural and chemical properties. These parameters collectively determine the functionality and performance of guar gum in various applications, making it a versatile ingredient in numerous industries. They are highly stable, secure, non-toxic, hydrophilic, and biodegradable as natural biomaterials.

This section examines eight carbohydrate molecules: arabinose, galactose, maltose, sucrose, sorbose, ribose, hydroxymethyl furfural, and raffinose. Their molecular structures are depicted in Figure 7, and their physical and chemical properties are listed in Table 2. Table 3 includes consideration of topological indices through vertex and edge partitioning, as defined in Tables 411.

Figure 7
www.frontiersin.org

Figure 7. Molecular structures of carbohydrates: (A) arabinose, (B) galactose, (C) maltose, (D) sucrose, (E) sorbose, (F) ribose, (G) hydroxymethyl furfural (HMF), and (H) raffinose.

Table 2
www.frontiersin.org

Table 2. Properties of aforementioned structures.

Table 3
www.frontiersin.org

Table 3. Computation of topological indices.

Table 4
www.frontiersin.org

Table 4. Vertex and edge partitioning of arabinose.

Table 5
www.frontiersin.org

Table 5. Vertex and edge partitioning of galactose.

Table 6
www.frontiersin.org

Table 6. Vertex and edge partitioning of maltose.

Table 7
www.frontiersin.org

Table 7. Vertex and edge partitioning of sucrose.

Table 8
www.frontiersin.org

Table 8. Vertex and edge partitioning of sorbose.

Table 9
www.frontiersin.org

Table 9. Vertex and edge partitioning of ribose.

Table 10
www.frontiersin.org

Table 10. Vertex and edge partitioning of HMP.

Table 11
www.frontiersin.org

Table 11. Vertex and edge partitioning of raffinose.

5 Regression models

This section discusses linear, quadratic, and logarithmic regression models. Linear regression predicts the value of one variable based on another, while quadratic models modify variables. Log-regression models linearize variables and test the significance level between topological indices and molecular structures. The mathematical expressions for the linear, quadratic, and logarithmic regression models are as follows:

P=A+BTI,
P=A+BTI+CTI2,
P=A+BlnTI.

where P is the property of the molecular structure, A is the constant, B and C are the regression coefficients, and TI is the topological index. The results of these three regression models for the aforementioned indices are calculated using SPSS statistical software, which are depicted in Tables 1218.

Table 12
www.frontiersin.org

Table 12. Statistical parameters for H(Γ).

Table 13
www.frontiersin.org

Table 13. Statistical parameters for I(Γ).

Table 14
www.frontiersin.org

Table 14. Statistical parameters for AZI(Γ).

Table 15
www.frontiersin.org

Table 15. Statistical parameters for M1(Γ).

Table 16
www.frontiersin.org

Table 16. Statistical parameters for M2(Γ).

Table 17
www.frontiersin.org

Table 17. Statistical parameters for mM2(Γ).

Table 18
www.frontiersin.org

Table 18. Statistical parameters for SDD(Γ).

6 Discussion

A regression model is a statistical technique used to predict the value of a continuous target variable based on input features. Four metrics are crucial in regression analysis: correlation coefficient (r), R-squared (r2), F-statistic, and p-value. The R-squared value measures how well the independent variables predict the dependent variable, with a higher R-squared indicating a stronger linear relationship. The F-statistic tests the overall significance of the model, with a significant F-statistic indicating a non-zero effect of at least one independent variable on the dependent variable. A small p-value indicates statistical significance, indicating that at least one independent variable has a significant effect on the dependent variable.

This study employs linear, quadratic, and logarithmic regression models to predict five physical and chemical properties of various carbohydrates, considering topological indices as independent and properties as dependent variables. It has been observed that two properties, namely, molecular weight and boiling point, give the best predicted value by each regression model (as the value of R is greater than 0.68 in each case except for linear and logarithmic regressions of the topological index H). Linear, quadratic, and logarithmic models and their four important metric values are shown in Tables 1218. The study focuses on the significance of topological descriptors in predicting the molecular structures of gaur gum and its derivatives, which are crucial in predicting the molecular weight of carbohydrates. Quantitative structure–property relationship (QSPR) methodology is a powerful approach used in the field of drug design, material science, environmental chemistry, cheminformatics, and computational chemistry. QSPR methodology focuses on establishing mathematical relationships between the chemical structure of compounds and their properties, allowing for the prediction of properties based on molecular features. One notable difference is that QSPR specifically targets physical and chemical properties of compounds, while QSAR often focuses on biological activities. Additionally, molecular modeling techniques may involve more complex simulations and calculations to predict molecular behavior. Finally, QSPR methodology offers a systematic and quantitative approach to predict physicochemical properties of chemical compounds based on their molecular structure, distinct from other methodologies such as QSAR, MD simulation, DFT, machine learning models, and hybrid QSAR/QSPR models, each with its own advantages and limitations depending on the specific application and research goals.

7 Conclusion

This study focuses on the analysis of the polysaccharide guar gum and its chemical variants, namely, HPG, CMG, and CMHPG. Initially, molecular graphs are used to represent these polysaccharides, and vertex and edge partitions are defined. The closed form of the M-polynomial is then computed for these molecular graphs, using various topological indices such as Zagreb indices, Randić index, inverse Randić index, H index, SDD index, I index, and AZI index. The molecular structures of the four polysaccharides are compared graphically based on these nine degree-based topological indices. It is important to note that polysaccharides are a type of biopolymer and have diverse applications, particularly in food preservation, the pharmaceutical industry, and petroleum extraction. The findings of this research will be valuable for chemists and pharmaceutical researchers in their respective fields of study. The results of this investigation can have various applications in the field of polymer science and material engineering. By analyzing the topological indices of guar gum, researchers can gain insights into its molecular structure, connectivity, and properties. This information can be used to predict and understand the behavior of guar gum in different environments, such as its solubility, viscosity, and interactions with other molecules.

Furthermore, these results can help in the design and optimization of guar gum-based products and formulations. By correlating the topological indices with the performance of guar gum in various applications, researchers can tailor its properties to meet specific requirements in industries such as food, pharmaceuticals, cosmetics, and agriculture. Overall, these results can contribute to a better understanding of its structure–property relationships and facilitate the development of innovative products and technologies in diverse fields.

8 Future work

The authors will investigate the polysaccharide guar gum and its chemical variants with respect to the generalized reverse degree for future work.

Data availability statement

The original contributions presented in the study are included in the article/Supplementary Material; further inquiries can be directed to the corresponding author.

Author contributions

WK: writing–original draft and writing–review and editing. SY: writing–original draft and writing–review and editing.

Funding

The author(s) declare that no financial support was received for the research, authorship, and/or publication of this article.

Conflict of interest

The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.

Publisher’s note

All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors, and the reviewers. Any product that may be evaluated in this article, or claim that may be made by its manufacturer, is not guaranteed or endorsed by the publisher.

References

Ajmal, M., Nazeer, W., Munir, M., Kang, S. M., and Jung, C. Y. (2017). The M-polynomials and topological indices of generalized prism network. Int. J. Math. analysis 11 (6), 293–303. doi:10.12988/ijma.2017.7118

CrossRef Full Text | Google Scholar

Amic, D., Belo, D., Lucic, B., Nikolic, S., and Trinajstic, N. (1998). The vertex-connectivity index revisited. J. Chem. Inf. Comput. Sci. 38 (5), 819–822. doi:10.1021/ci980039b

CrossRef Full Text | Google Scholar

Arockiaraj, M., Greeni, A. B., and Kalaam, A. R. A. (2023b). Comparative analysis of reverse degree and entropy topological indices for drug molecules in blood cancer treatment through QSPR regression models. Polycycl. Aromat. Compd., 1–18. doi:10.1080/10406638.2023.2271648

CrossRef Full Text | Google Scholar

Arockiaraj, M., Greeni, A. B., and Kalaam, A. R. A. (2023c). Linear versus cubic regression models for analyzing generalized reverse degree based topological indices of certain latest corona treatment drug molecules. Int. J. Quantum Chem. 123 (16), e27136. doi:10.1002/qua.27136

CrossRef Full Text | Google Scholar

Arockiaraj, M., Paul, D., Clement, J., Tigga, S., Jacob, K., and Balasubramanian, K. (2023a). Novel molecular hybrid geometric-harmonic-Zagreb degree based descriptors and their efficacy in QSPR studies of polycyclic aromatic hydrocarbons. Sar. QSAR Environ. Res. 34 (7), 569–589. doi:10.1080/1062936x.2023.2239149

PubMed Abstract | CrossRef Full Text | Google Scholar

Balaban, A. T. (1982). Highly discriminating distance-based topological index. Chem. Phys. Lett. 89, 399–404. doi:10.1016/0009-2614(82)80009-2

CrossRef Full Text | Google Scholar

BeMiller, J. N. (2009). One hundred years of commercial food carbohydrates in the United States. J. Agric. food Chem. 57 (18), 8125–8129. doi:10.1021/jf8039236

PubMed Abstract | CrossRef Full Text | Google Scholar

Bollobs, B., and Erds, P. (1998). Graphs of extremal weights. Ars Comb. 50, 225.

Google Scholar

Caporossi, G., Gutman, I., Hansen, P., and Pavlovic, L. (2003). Graphs with maximum connectivity index. Comput. Biol. Chem. 27 (1), 85–90. doi:10.1016/s0097-8485(02)00016-5

CrossRef Full Text | Google Scholar

Consonni, V., and Todeschini, R. (2010a). Molecular descriptors. Recent advances in QSAR studies: methods and applications, 29–102.

CrossRef Full Text | Google Scholar

Consonni, V., and Todeschini, R. (2010b). Molecular descriptors. Recent advances in QSAR studies: methods and applications. Berlin, Germany: Springer, 29–102.

CrossRef Full Text | Google Scholar

Das, K. C. (2010). Atom-bond connectivity index of graphs. Discrete Appl. Math. 158 (11), 1181–1188. doi:10.1016/j.dam.2010.03.006

CrossRef Full Text | Google Scholar

Das, K. C., and Gutman, I. (2004). Some properties of the second Zagreb index. MATCH Commun. Math. Comput. Chem. 52 (1), 3–1.

Google Scholar

Deutsch, E., and Klavar, S. (2014). M-polynomial and degree-based topological indices. Available at: https://arxiv.org/abs/1407.1592.

Google Scholar

Estrada, E., Torres, L., Rodriguez, L., and Gutman, I. (1998). An atom-bond connectivity index: modelling the enthalpy of formation of alkanes. Indian J. Chem. 37A.

Google Scholar

Fajtlowicz, S. (1987). On conjectures of Graffiti-II. Congr. Numer 60, 187–197.

Google Scholar

Favaron, O., Maheo, M., and Sacle, J. F. (1993). Some eigenvalue properties in graphs (conjectures of Graffiti — II). Discrete Math. 111 (1-3), 197–220. doi:10.1016/0012-365x(93)90156-n

CrossRef Full Text | Google Scholar

Furtula, B., Graovac, A., and Vukicevic, D. (2010). Augmented zagreb index. J. Math. Chem. 48, 370–380. doi:10.1007/s10910-010-9677-3

CrossRef Full Text | Google Scholar

Gupta, C. K., Lokesha, V., Shwetha, S. B., and Ranjini, P. S. (2016). On the symmetric division deg index of graph. Southeast asian Bull. Math. 40 (1).

Google Scholar

Gutman, I., and Das, K. C. (2004). The first Zagreb index 30 years after. MATCH Commun. Math. Comput. Chem. 50 (1), 83–92.

Google Scholar

Gutman, I., and Furtula, B. (2008). Recent results in the theory of Randic index. Kragujevac: Univ. Kragujevac.

Google Scholar

Gutman, I., Polansky, O. E., Gutman, I., and Polansky, O. E. (1986). Chemical graphs. Math. Concepts Org. Chem., 19–22. doi:10.1007/978-3-642-70982-1_4

CrossRef Full Text | Google Scholar

Hovgaard, L., and Brondsted, H. (1996). Current applications of polysaccharides in colon targeting. Crit. Rev. Ther. drug Carr. Syst. 13, 185–223. doi:10.1615/critrevtherdrugcarriersyst.v13.i3-4.10

PubMed Abstract | CrossRef Full Text | Google Scholar

Hu, Y., Li, X., Shi, Y., Xu, T., and Gutman, I. (2005). On molecular graphs with smallest and greatest zeroth-order general Randic index. MATCH Commun. Math. Comput. Chem. 54 (2), 425–434.

Google Scholar

Kier, L. B., and Hall, L. H. (1976). Molecular connectivity ill chemistry and drug research. Amsterdam, Netherlands: Elsevier.

Google Scholar

Kier, L. B., and Hall, L. H. (1986). Molecular connectivity in structure-activity analysis. Hoboken, New Jersey, United States: Wiley. (No Title).

Google Scholar

Li, X., Gutman, I., and Randi, M. (2006). Mathematical aspects of Randic-type molecular structure descriptors. Serbia: University of kragujevac, Faculty of Science.

Google Scholar

Li, X., and Shi, Y. (2008a). A survey on the Randic index. MATCH Commun. Math. Comput. Chem. 59 (1), 127–156.

Google Scholar

Li, X., and Shi, Y. (2008b). Some new results on a conjecture about the randic index, mathematical chemistry monographs No. 6. Recent results in the theory of randic index. Kragujevac, 57–72.

Google Scholar

Milicevic, A., Nikolic, S., and Trinajstic, N. (2004). On reformulated Zagreb indices. Mol. Divers. 8, 393–399. doi:10.1023/b:modi.0000047504.14261.2a

PubMed Abstract | CrossRef Full Text | Google Scholar

Munir, M., Nazeer, W., Rafique, S., and Kang, S. M. (2016a). M-polynomial and related topological indices of nanostar dendrimers. Symmetry 8 (9), 97. doi:10.3390/sym8090097

CrossRef Full Text | Google Scholar

Munir, M., Nazeer, W., Rafique, S., and Kang, S. M. (2016b). M-polynomial and degree-based topological indices of polyhex nanotubes. Symmetry 8 (12), 149. doi:10.3390/sym8120149

CrossRef Full Text | Google Scholar

Munir, M., Nazeer, W., Rafique, S., and Kang, S. M. (2016c). M-polynomial and degree-based topological indices of polyhex nanotubes. Symmetry 8 (12), 149. doi:10.3390/sym8120149

CrossRef Full Text | Google Scholar

Munir, M., Nazeer, W., Shahzadi, Z., and Kang, S. M. (2016d). Some invariants of circulant graphs. Symmetry 8 (11), 134. doi:10.3390/sym8110134

CrossRef Full Text | Google Scholar

Nikolic, S., Kovacevic, G., Milicevic, A., and Trinajstic, N. (2003). The Zagreb indices 30 years after. Croat. Chem. acta 76 (2), 113–124.

Google Scholar

Prem, D., Singh, S., Gupta, P. P., Singh, J., and Kadyan, S. P. S. (2005). Callus induction and de novo regeneration from callus in Guar (Cyamopsis tetragonoloba). Plant Cell, tissue organ Cult. 80, 209–214. doi:10.1007/s11240-004-0738-9

CrossRef Full Text | Google Scholar

Randi, M. (2001). The connectivity index 25 years after. J. Mol. Graph. Model. 20 (1), 19–35. doi:10.1016/s1093-3263(01)00098-5

PubMed Abstract | CrossRef Full Text | Google Scholar

Randi, M. (2008). On history of the Randi index and emerging hostility toward chemical graph theory. MATCH Commun. Math. Comput. Chem. 59 (1), 5–124.

Google Scholar

Randic, M. (1975). Characterization of molecular branching. J. Am. Chem. Soc. 97 (23), 6609–6615. doi:10.1021/ja00856a001

CrossRef Full Text | Google Scholar

Shanmukha, M. C., Usha, A., Siddiqui, M. K., Fufa, S. A., and Praveen, B. M. (2022). Degree-based molecular descriptors of guar gum and its chemical derivatives. J. Chem. 2022, 1–14. doi:10.1155/2022/7371538

CrossRef Full Text | Google Scholar

Trinajstic, N., Nikolic, S., Milicevic, A., and Gutman, I. (2010). About the zagreb indices. Kem. u Ind. Cas. Kem. i Kem. Inz. Hrvat. 59 (12), 577–589.

Google Scholar

Vukicevic, D., and Graovac, A. (2004). Valence connectivity versus Randic, Zagreb and modified Zagreb index: a linear algorithm to check discriminative properties of indices in acyclic molecular graphs. Croat. Chem. acta 77 (3), 501–508.

Google Scholar

Whistler, R. L. (1948). Guar-a new industrial crop. Chem. Ind. 62, 60–61.

Google Scholar

Whistler, R. L., and Hymowitz, T. (1979). Guar: agronomy, production, industrial use, and nutrition. West Lafayette, Indiana, United States: Purdue University Press.

Google Scholar

Wiener, H. (1947). Structural determination of paraffin boiling points. J. Am. Chem. Soc. 69 (1), 17–20. doi:10.1021/ja01193a005

PubMed Abstract | CrossRef Full Text | Google Scholar

Keywords: chemical graph theory, QSPR, physicochemical properties, M-polynomials, topological indices, guar gum

Citation: Khalid W and Yousaf S (2024) Elucidating structure–property relationships of guar gum biomolecules: insights from M-polynomial and QSPR modeling. Front. Chem. 12:1410876. doi: 10.3389/fchem.2024.1410876

Received: 02 April 2024; Accepted: 20 May 2024;
Published: 09 July 2024.

Edited by:

Sudip Pan, Jilin University, China

Reviewed by:

Nasarul Islam, Government of Jammu & Kashmir, India
Abhishek Kumar, University of Lucknow, India
Micheal Arockiaraj, Loyola College, India

Copyright © 2024 Khalid and Yousaf. This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.

*Correspondence: Shamaila Yousaf, shumaila.yousaf@uog.edu.pk

Disclaimer: All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article or claim that may be made by its manufacturer is not guaranteed or endorsed by the publisher.