Skip to main content

ORIGINAL RESEARCH article

Front. Phys., 18 April 2024
Sec. Chemical Physics and Physical Chemistry

Analyzing the role of reducible molecular descriptors and thermodynamic aspects of anti-tuberculosis drugs via QSPR study

Sajid Mahboob AlamSajid Mahboob Alam1Niat NigarNiat Nigar1Muhammad Waheed Rasheed
Muhammad Waheed Rasheed2*Laiba AminLaiba Amin2
  • 1School of Mathematics, Minhaj University Lahore, Lahore, Pakistan
  • 2Department of Mathematics, Division of Science and Technology, University of Education, Lahore, Pakistan

Mycobacterium tuberculosis is an organism that causes tuberculosis (TB), a common infectious disease that has a high death and morbidity rate. Topological indices are mathematical tools used to describe the structural properties of molecules or networks. They provide a quantitative measure of the connectivity and complexity of a system, and play a crucial role in numerous area such as biochemistry and bioinformatics. The purpose of topological indices is to simplify complex structures into numerical values that can be easily analyzed and compared. QSPR modeling is a technique in chemistry that relates the structure of a chemical compound to its physical or chemical properties. It is used to predict properties like boiling points, solubilities, toxicities, and even biological activities of compounds. This saves time, resources, and enables researchers to make informed decisions in drug discovery, material science, and many other areas. In this study, we conducted an analysis of several drugs used for the treatment of tuberculosis. We focused on computing the reducible topological indices based on their degrees. Several techniques and approaches are employed. To perform calculations, we used edge partition methodology, analytical techniques, theoretical graph utilities, and degree counting method. Additionally, we examined six physicochemical properties of these drugs. To establish quantitative structure-property relationship models and evaluate their effectiveness, we employed linear, quadratic, and logarithmic regression analysis. By analyzing the reducible topological indices and physicochemical properties, we aimed to gain a deeper understanding of the drugs’ characteristics and their potential impact on tuberculosis treatment. This study established a significant relationship between the defined indices with two key properties: molar mass and collision cross section. The correlation coefficients for molar mass range from 0.7 to 0.9, while the collision cross section range from 0.8 to 0.9. These results demonstrate a strong association between the indices and the properties under investigation. Furthermore, it is worth noting that both molar mass and collision cross section satisfy the requirements for p-value and F-test value across all indices. This indicates the statistical significance of the observed correlations and the reliability of our findings.

1 Introduction

A dreadful disease in which a conflict between the soul and the body is so gradual, calm, and solemn, with such a sure outcome, that the mortal half rots away day by day and grain by grain. A disease that “sometimes moves in enormous steps and sometimes at a tardy, sluggish pace, but, slow or quick, is always sure and certain.” (Nicholas Nickleby; Charles Dickens). The things Charles Dickens said are still true today [1].

Bacteria called Mycobacterium tuberculosis (MTB) are typically responsible for the infectious disease known as tuberculosis (TB). Typically, tuberculosis primarily targets the lungs; nevertheless, it has the ability to impact various other regions of the body. In cases where there are no symptoms, it is referred to as latent tuberculosis [2]. Active disease occurs in about 10 percent of latent infections and, when untreated, results in a mortality rate of approximately 50 percent among those affected. TB is a global disease, although the majority of cases occur in low and middle-income nations. There are eight countries, namely, Bangladesh, China, India, Indonesia, Nigeria, Pakistan, Philippines, and South Africa, where approximately half of all individuals with TB can be located. It is believed that approximately 25 percent of the world’s population carries a TB infection; however, the majority of individuals do not progress to acquire TB disease, and a small percentage may be able to overcome the infection. It cannot be spread by those who are infected but not (yet) unwell with the illness. The lifetime chance of developing tuberculosis is 5–10 percent for those with TB bacterium infection. Individuals with weakened immune systems, including those with HIV, diabetes, malnourishment, or tobacco use, are more susceptible to illness. People who are most likely to contract tuberculosis (TB) often fall into two distinct categories:

1. Those who have recently become infected with TB bacteria.

2. Those with weakened immune systems due to underlying medical conditions.

People who have active tuberculosis in their lungs cough, spit, speak, or sneeze can transmit the disease to others through the air. Latent tuberculosis patients do not transmit the illness. Figure 1 represented a general overview of the prevalent ways that Mycobacterium tuberculosis infections occur. In this illustration, the bacterium is released by an infected person (A) through sneezing or coughing, at which point the pathogen exits from person (A)’s body and comes as an aerosol. At B point, a healthy individual (B) inhales particles in the air containing MTB. The next stage is typically a pulmonary infection when the virus reaches the effective titre in person (B). When someone self-inoculates, the infection travels from the lung to the gastrointestinal tract. Because it occurs when person (B), who has pulmonary tuberculosis, swallows his own contaminated sputum, the term “self-inoculation” is employed.

Figure 1
www.frontiersin.org

Figure 1. The common methods through which Mycobacterium tuberculosis causes infections are portrayed in the overall image.

TB illness symptoms vary depending on the location of the TB bacteria’s growth within the body. The lungs are where TB germs typically grow called pulmonary TB. Chest pain, coughing up blood or sputum (phlegm from deep inside the lungs), and a severe cough lasting 3 weeks or longer are some of the signs of tuberculosis (TB) sickness in the lungs. Additional indications of tuberculosis illness include weakness or exhaustion, appetite loss, weight loss, chills, fever, and night sweats. TB illness symptoms vary depending on the affected area of the body. Individuals with latent tuberculosis infection do not exhibit any symptoms, do not feel ill, and are unable to infect others. Other types of tuberculosis (TB) are caused when the infection travels outside the lungs in 15–20 percent of active patients. All of these conditions are referred to as extrapulmonary tuberculosis [3, 4]. Extrapulmonary tuberculosis (TB) is more common in young children and those with weaker immune systems. This affects people who possess HIV in over 50 percent of situations. Tuberculosis can affect more than just the lungs. It can spread to different parts of the body, leading to extrapulmonary infections. Notable sites of infection include the pleura, which is the lining around the lungs, resulting in tuberculous pleurisy. The central nervous system can also be affected, causing tuberculous meningitis. The lymphatic system, responsible for immune function, can be infected, leading to scrofula of the neck. The genitourinary system, involving the reproductive and urinary organs, can also be targeted, resulting in urogenital tuberculosis. Additionally, tuberculosis can affect the bones and joints, causing Pott disease of the spine. These are just a few examples of the various extrapulmonary infection sites associated with tuberculosis. Figure 2A presented the main symptoms of variants and stages of tuberculosis.

Figure 2
www.frontiersin.org

Figure 2. (A) Symptoms of Tuberculosis (B) The global chart of the death rate of TB and HIV from 2000 to 2020.

It was estimated in 2018 that 25 percent of people on the planet may be subconsciously infected with tuberculosis. Every year, approximately 1 percent of the population contracts a new infection. After COVID-19, TB is the second most common infectious disease-related cause of mortality in 2020, with an anticipated 10 million cases of active TB and 1.5 million people dying. As of 2018, the majority of tuberculosis (TB) cases were reported in three regions: South-East Asia (44%), Africa (24%), and the Western Pacific (18%). Interestingly, more than half of all cases were diagnosed in just seven countries, namely, India (27%), China (9%), Indonesia (8%), the Philippines (6%), Pakistan (6%), Nigeria (4%), and Bangladesh (4%) [5]. These statistics shed light on the global distribution of TB, highlighting the concentration of cases in specific regions and countries. By 2021, there were only about 2% more new cases per year on average. When it comes to tuberculin testing, almost 80% of individuals in several Asian and African nations test positive, compared to 5%–10% of Americans. Humans have had tuberculosis from ancient times [6]. More than 2 million people globally per year pass away from tuberculosis (TB), the most common infectious disease-related cause of mortality. Around the world, one in three people, or 2-3 billion people, are estimated to be infected with Mycobacterium tuberculosis (M. tuberculosis), with a lifetime risk of active TB disease of 5%–15%. An estimated 9.6 million individuals were diagnosed with TB in 2014, and 1.5 million died from it, including 1.1 million HIV-negative people and 400,000 people living with HIV. Although TB exists in every country, the majority of TB patients live in low- and middle-income nations, particularly in areas like Sub-Saharan Africa and South East Asia [7]. Since 2000 until 2020, Figure 2B illustrates the worldwide trends in estimated death counts attributed to TB and HIV.

Antibiotics are used in the treatment of tuberculosis. Both the illness and infection from tuberculosis should be treated. The most often prescribed antibiotics include Lumefrantrine, Mefloquine, Piperaquine, Isoniazid, and Primaquine, among others. These drugs must be used consistently for 4–6 months in order to be effective. It is risky to quit taking the drugs too soon or without consulting a doctor. This may cause TB that is still alive to develop medication resistance.

The field of mathematics known as “graph theory” is devoted to the study of graphs’ characteristics and uses. A graph is a type of mathematical structure made up of edges (links) connecting nodes (vertices). These nodes and edges can be represented a numerous real-world scenarios, including social networks, transport systems, and computer networks. Graph theory offers a structure for using graph representations to analyze and solve difficult issues. The concept of the degree of a node in graph theory identifies the quantity of edges that are connected to that node. In order to grasp the connectivity and structure of a graph, it is crucial to understand this concept. An important area of study in graph theory is graph coloring, where colors are designated to nodes within a graph to prevent neighboring nodes from sharing the same color. This concept is helpful in the realms of scheduling, resource allocation, and solving challenges related to map coloring. Graph theory plays a critical role in computer science, operations research, and biology due to its diverse range of practical applications. The idea of graph algorithms is crucial in solving problems related to network routing, social media analysis, and data visualization in computer science. Applying graph theory in operations research leads to the optimization of transportation networks, supply chains, and communication systems. The application of graph theory in the field of biology enables researchers to map out the connections among genes, proteins, and diseases, resulting in breakthroughs in network biology and personalized medicine. In general, graph theory is an effective tool that keeps discovering new uses in a variety of scientific and commercial fields.

The study of mathematical models of chemical phenomena using graph theory is known as chemical graph theory. Chemical graph theory has a subfield called topological indices that correlates specific physico-chemical characteristics of the underlying chemical molecule. Numerous publications on topological indices have been published to date, numbering in the hundreds [8]. A topological index is a function called “Top” from a set of finite simple graphs to the set of real numbers, where “” is the property that Top(G) = Top(H) if both G and H are isomorphic. A topological index is a numerical value related to chemical structure which asserts to show a relationship between chemical structure and various physico-chemical qualities, chemical reactivity, or you could say biological activity [9]. In reality, topological indices are created by converting a molecular graph into a number that describes the topology of that graph. In molecular modeling, we investigate the connection between the structure, characteristics, and activity of chemical substances. Molecular descriptors have a key part in the main chemical structure-related fields like chemistry, pharmacology, etc [10]. In 1998, the generalized Randi index was introduced independently by Bollobas et al. [11] and Amic et al. [12]. Mathematicians and chemists both studied this index [1317]. Quantitative structure–activity/property/toxicity relationships (QSAR, QSPR, QSTR) modelling is a well-known and well-established field of study. In this type of modelling, physicochemical and molecular descriptors are associated with a bioassay of a medication that is designed to elicit a conventional pharmacological response. These connections have been recognized for a long time as an essential part of the drug discovery and development process. They provide major insights into the role that molecular characteristics play in the biological activity of compounds that are both similar to one another and unrelated to one another. The descriptor-based QSPR techniques make use of the entire structure of the compounds by modelling the property as a function of chemical descriptors. This allows for the most accurate predictions possible. These descriptors might simply be the quantity of such atoms, or they could be features that characterize the linkages between atoms or the form of the molecule in three dimensions. In this manner, a 1D descriptor is a one-dimensional linear representation of the molecule, 2D descriptors are a representation of the molecule in two dimensions on a plane, and 3D descriptors are a representation of the molecule in three dimensions of space. The majority of topological and connectivity indexes can be rearranged using two-dimensional descriptors. Three-dimensional descriptors provide a concise summary of the geometry, the surface, and the volume of the molecule. Quantitative structure-activity relationships (QSARs) are mathematical models created to correlate several types of biological activity, chemical reactivity, equilibrium, physical, and physicochemical qualities [18]. In the developing of QSAR models, twenty primary categories of mistakes have been identified by Dearden et al. They do this by utilizing bad data, overfitting, and bad training and test sets [19]. Simply see Figure 3 for the general overview of the QSPR method. Check out [2024] for further details on QSPR analysis across various different fields.

Figure 3
www.frontiersin.org

Figure 3. General methodology of a QSPR study.

The main objective of this article is to develop a quantitative structure-property relationship (QSPR) analysis for tuberculosis (TB) drugs.

• Firstly, we collected data on 13 drugs used in the treatment of TB, along with their six physicochemical properties. These properties were obtained from reputable online sources such as PubChem and ChemSpider.

• Next, we calculated the numerical results of ten degree-based reducible indices to analyze the molecular structure of the 13 TB drugs. Three techniques, namely, edge partition, vertex degree, and counting degree, were used to calculate the ten discussed reducible indices.

• The QSPR analysis was then used to establish a strong positive relationship between the indices and properties. To conduct this analysis, we employed three regression equations: linear-regression, quadratic-regression, and logarithmic-regression. These models were chosen for their strong significance in determining the relationship between the properties and indices.

• One of the statistical parameters used, known as the correlation coefficient (r), provides insight into the reliability and significance level between the physical properties and calculated the numerical values of the TB drugs.

• Finally, we utilized line graphs to visually compare all correlation coefficients based on the numerical data. This graphical representation allows for a comprehensive discussion of the relationships between the properties and indices.

In various fields, this paragraph covers the novel progress in the topic of topological indices. In 2023, Abid Mahboob et al. [25] introduced eight novel indices based on the degrees method. They determine the QSPR analysis by utilizing linear, quad, and log models to establish relationships between the novel indices and physical properties of 18 molecular structures of cancer drugs. The significance of all the results is evident as they meet the criteria of a p-value (0.05) and an F-test value (> 2.5). For example, the reducible 1st Zagreb index proves to be the most accurate predictor for MR, MW, and P, with correlation values reaching 0.9. Abid Mahboob et al. [26] introduced two new terminal descriptors called the “first” and “second” terminal Zagreb indices. These descriptors enhance the QSPR analysis by utilizing linear and quad models to determine the best fit predictor for seven physical properties of 23 molecular structures associated with kidney cancer drugs. The calculations revealed that all correlation “r” values exceeded 0.7, providing substantial evidence of a significant relationship between the indices and drug properties. Muhammad Nazri Husin and Anamila Ariffin [27] conducted a comprehensive study on various graph networks, including chain silicate networks CS1 and CSn, hexagonal networks HX2, HX3, and HX4, and oxide networks OX1 and OX2. They focused on analyzing the generalize the Randic that is based on edge, geometric arithmetic, ABC, MABC, Zagreb indices. The researchers employed the concept of line graphs to investigate these networks. In their study, Jia-Bao Liu et al. [28] conducted calculations to determine various numerical descriptors derived from multiplicative degrees. These descriptors were specifically applied to analyze the structures of three networks: the planar octahedron network [POH(2)], the triangular prism network [TP(2)], and the dominating planar octahedron network [DPOH(2)]. Muhammad Shoaib Sardaret al. [29] conducted calculations to determine various topological indices using a degree-based technique. These descriptors were calculated to analyze the structures of double silicon carbide Si2C3-I(p,q) where p = 1 and q = 1 and its D[Si2C3-I(1,1)], as well as the strong double graph of silicon carbide and its D[Si2C3-I(1,1)]. Additionally, they compared the topological indices both numerically and graphically. Zahid Raza et al. [30] calculated the expected values of the first Zagreb connection index in various structures, including a random cyclooctatetraene chain, random polyphenyls chain, and random chain network with l consisting of octagons, hexagons, and pentagons. They analyzed the maximum and minimum chains with COCl, PPCl, and PGl, specifically focusing on the expected values of these chains. In 2024, Imran Nadeem et al. [31] conducted a study on the inequalities between general Randic type graph invariants, specifically the general Randic and zeroth-general Randic indices. These indices are calculated based on the vertex-degree methodology. The general Randic index is denoted as Rα, while the zeroth-general Randic index is denoted as Qα. The researchers identified linear inequality relationships between these graph invariants. Additionally, they posed an open problem: to determine the linear inequality between the general Randic index Rα and the zeroth-general Randic index Qα for any negative real number α.

2 Fundamental definitions with literature review

Let G= [V(G), E(G)] be a simple graph having |V(G)| the order and |E(G)| the size of a graph, where V(G) is considered as node set and E(G) ⊆ V(G) × V(G) is a bond set. Every vertex is considered as an atom in a graph, and bonding within the two atoms is known as edge. The valency or degree of any node is the number of total edges which are incident to the node. Now, few useful reducible TI’s is explained which is given below:

2.1 Reducible first and second Zagreb indices

Gutman and Trinajstic [32] introduced the two graph invariants known as the first Zagreb index, denoted as M1, is a graph theoretic concept used to measure the molecular size or shape of a chemical compound. It takes into account the sum of the degrees of all the vertices in a molecule. On the other hand, the second Zagreb index, denoted as M2, calculates the sum of products of degrees for pairs of adjacent vertices in a graph. This index is related to the presence and distribution of bond lengths within a compound. These indices serve as valuable tools in exploring and understanding molecular structures with significant applications across diverse fields including drug discovery, material science, and environmental monitoring. S.R. Islam and M.Pal examined the relationship between the second Zagreb index and physicochemical properties in QSPR research of fuzzy octane isomers graphs [33]. It is their conclusion that this index provides a significantly accurate estimation of the acentric factor and entropy. For instance, the value of acentric factor and entropy for octane isomers are taken from [34]. They calculate the correlation coefficient of M2 with acentric factor is −0.977966531 and the correlation coefficients of the M2 with entropy is −0.91961647. These outcomes declare the appropriateness of this index for octane isomers in chemical compounds. The Zagreb indices and their variations have been utilized to study ideas such as heterosystems, chirality, ZE-isomerism, and molecular complexity.

Abid Mehboob et al. [25] were motivated by the framework of these indices to propose the idea of reducible first and second Zagreb indices. In QSPR modeling and chemical informatics, these indices has produced outstanding outcomes by establishing links between the structural characteristics of molecules and their corresponding properties or activities. RM1 is the notation for this index, which represents the total sum of the “n” degrees of all vertices in a relevant molecular graph, while RM2 denotes the second reducible zagreb index, defined as the summation of products of the “n” degrees for pairs of adjacent vertices in a molecular graph. In which du and dv represent the degrees of the vertices while “n” indicates the order of the vertices. These indices are defined mathematically as:

RM1G=uvEGndu+ndv,
RM2G=uvEGndu×ndv.

2.2 Reducible reciprocal randic index

Some of the most well-known topological indices in regard to chemistry and chemical graph theory was proposed by Milan Randic in 1975 [35] and is useful for evaluating the degree of branching of the carbon atom skeleton of saturated hydrocarbons. The first genuine degree-based topological index in history was the Randic index. This index can help chemists predict certain properties of molecules such as boiling points or toxicity levels. By knowing the level of complexity in a molecule using the randic index, scientists can make informed decisions about its behavior and potential uses. Mingao Yuan derived the limits of the Randic index and its variants for an inhomogeneous Erdos–Rényi random graph [36]. These results shed light on the impact of network heterogeneity on the indices and offer new insights into the Randic index and its variants.

In order to calculate the reducible randic index, you need to add up the square root of the distance between each bond. This distance is calculated by multiplying the “n” degrees of the vertices in a molecular graph. This index are defined mathematically as:

RRG=uvEGndu×ndv.

2.3 Reducible reciprocal sum connectivity index

Zhou and Trinajstic [37] were the ones who initially brought the idea of the sum connectivity index. This index was developed by drawing inspiration from the Randic-index. It is a concept used in chemistry to measure how connected different atoms are within a molecule. It is a way to determine the overall structural complexity of a molecule and its ability to interact with other molecules. This index is takes into account factors like bond lengths, angles, and types of bonds between atoms to calculate a numerical value. Muhammad Hussain et al. [38] have conducted a comprehensive analysis to determine the sum connectivity index for three subdivisions of the H3BO3 layer structure. These subdivisions include the subdivision S(H3BO3), the line graph L(H3BO3), and the caged chain C(H3BO3). The researchers employed various mathematical and computational methods to accurately calculate the sum connectivity index for each subdivision. The correlation between this index and the straight line graph is significant [39].

A novel index known as the reducible sum connectivity index has been introduced. It is calculated by summing the square root of each bonds distance, where the distance is obtained by summing the “n” degrees of the connecting vertices, and has a mathematical definition as:

RSCG=uvEGndu+ndv.

2.4 Reducible 1st and 2nd hyper Zagreb indices

Shirdel et al. [40] presented the concept of the hyper Zagreb index, which is a molecular descriptor referred to as distance-based Zagreb index. These indices are numerical values that provide important information about a molecule’s structure. The first hyper Zagreb index is obtained by summing up the squares of the degrees of all vertices in a molecule. On the other hand, the second hyper Zagreb index is calculated by multiplying the degrees of adjacent vertices and then summing them all up. By examining both indices, researchers can gain insight into the complexity of a molecule and make predictions about its chemical properties. Ishita Sarkar and Manjunath Nanjappa [41] conducted a study on the combinatorial inequalities related to edges, vertices, and their corresponding neighborhood notions. They also incorporated other molecular descriptors in their calculations to determine the exact expressions of the second hyper-Zagreb index for specific corona products involving the semi-total point graph. Abdul Hakeem et al. [42] studied the QSPR analysis on heart attack disease using degree-based indices. The researchers found that the first hyper Zagreb index exhibited a robust correlation with molecular weight and boiling point within the range of 0.9. Similarly, the second hyper Zagreb index demonstrated a strong correlation with boiling point within the same range.

In the molecular graph, the reducible first hyper Zagreb index is obtained by summing up the squares of the “n” degrees of all vertices. On the other hand, the reducible second hyper Zagreb index is calculated by multiplying the “n” degrees of adjacent vertices and then summing them all up. Mathematically these indices is defined as:

RHM1G=uvEGndu+ndv2,
RHM2G=uvEGndu×ndv2.

2.5 Reducible sigma index

The concept of Albertson index served as motivation for Gutman, leading him to propose the idea of the Sigma index [43]. His paper explores the inverse problem concerning the sigma index, demonstrating that an even value will always be obtained for this index in any given graph. Merve and Ismail conducting a calculation of the sigma index for various graphs, adding the subdivision of the cycle graph Cn, the graph of S(G), and the r-subdivision graph of Sr(G) [44]. Additionally, they provide examples of well-known graph cycles. Ozge Colakoglu Havare [45] studied the QSPR analysis with curvilinear regression model by using various molecular descriptors including Sigma index for the monocarboxylic acids.

The reducible sigma index can be obtained by subtracting the squares of the “n” degrees of each vertex in a molecule. This index is mathematically defined as:

RSG=uvEGndundv2.

2.6 Reducible forgotten index

Furtula and Gutman conducted an analysis on the latest version of Zagreb indices, referred to as the Forgotten index [46]. Its definition lies in using the sum of cubes of degrees in molecular graphs to examine the influence of structure dependency on different properties. The purpose of presenting this index is to demonstrate its numerous significant features and emphasize its exceptional effectiveness in enhancing the chemical attributes of the original Zagreb index. This index plays a crucial role in significantly enhancing the physico-chemical applicability of the first Zagreb index. Some upper bounds for the F-index and certain applications were computed by Hosamani [47]. The precise formulas for the graph operations with regards to the F-index were calculated by Nilanjan and other authors [48]. Recently, the reducible forgotten index is introduced which is mathematically defined as:

RFG=uvEGndu2+ndv2.

2.7 Reducible 1st and 2nd Gourava index

The first and second Gourava indices were introduced by V.R. Kulli [49], who drew inspiration from the definition of the Zagreb indices and their extensive usage. In calculating this index, Kulli applies the equation’s definition to both armchair and zigzag edge polyhex nanotubes, with a particular focus on a few well-known graph types. Ying Wang et al. [50] prove the graphs uniqueness on trees with given independence numbers with maximum both 1st and 2nd Gourava indices. The mathematical definition of the reducible first and second Gourava indices was recently introduced, which is stated as:

RG1G=uvEGndu+ndvndu+ndv,
RG2G=uvEGndu+ndvndu×ndv.

3 Methods and strategies

Topological indices are derived from various mathematical approaches, and one of the most popular methodologies is based on vertex degrees. The degree of a vertex in a graph represents the number of edges connected to it. By analyzing the distribution of degrees in a system, we can obtain valuable information about its structural features and properties. Degree-based topological indices are calculated using mathematical formulas that take into account the degrees of vertices in a graph. We took several steps to get at the results, including gathering data on the pharmaceutical properties of medications used to treat tuberculosis disease. The data for these properties has been collected from the PubChem website. The numerical values of the molecular descriptors are determined by employing the techniques of vertex degree and edge division. This method involves grouping or bundling the edges and vertices based on their specific attributes. To develop the QSPR analysis, a three regression-equation (linear-regression, quadratic-regression, and logarithmic-regression) and eight statistical parameters (including p-value and F-test value) were employed. Analyzing and predicting data patterns is made easier with the use of these valuable models. The coefficient correlation r results of these three regression-model are determined through the use of the SPSS software. The illustration of the 2D molecular structures of these medications is achieved through the use of software applications like Chem-Draw and Chem-Doodle. By visually representing the intricate structures, these software enable a deeper understanding of the chemical properties of various compounds. Bar graphs have been employed to visually represent the comparisons of all correlation values. These graphs are created through Microsoft Excel. All indices can be calculated using software like Matlab, whereas Maple has been exclusively designed for graph sketching purposes. This methodology has the advantage that all the indices can be computed rapidly. This manuscript presents a concise and comprehensible flowchart of the work, using Figure 4 to make it easy and understandable for readers.

Figure 4
www.frontiersin.org

Figure 4. Flowchart of this manuscript.

4 Analyzing the structural and mathematical computations associated with anti-tuberculosis drugs

This section provides an overview of the structural information and computational analysis of reducible indices for the treatment of tuberculosis (TB) disease. The article primarily focuses on the planar and 2D forms of drugs, disregarding their actual 3D structures. Within this context, the discussion revolves around thirteen prominent anti-tuberculosis drugs, namely, Isoniazid, Pyrazinamide, Ethionamide, Levofloxacin, Amikacin, Ofloxacin, Cycloserine, 4-aminosalicyclic acid, Ethambutol, Ciprofloxacin, Bedaquiline, Kanamycin, and Streptomycin. The thirteen anti-tuberculosis drugs mentioned have been extensively studied and utilized in the treatment of TB. Each drug possesses unique characteristics and mechanisms of action, making them valuable tools in combating this infectious disease. Figure 5 visually displays the chemical structures of these medicine, which are commonly used to treat tuberculosis disease. Additionally, this section provides a comprehensive overview of both the experimental data and the calculation of reducible indices. The six physicochemical properties: molar mass, XLOGP3, Complexity, LOGP, melting point and collision cross section were obtained from the PubChem website and are presented in Table 2. The values of the ten degree-based reducible topological indices were computed using the formulas illustrated in Table 1. All the indices considered in this study are based on vertex and edge degrees. It is noting that a total of thirteen structures were analyzed, each exhibiting eight distinct types of edge bundles. In addition, the maximum vertex degree is restricted to four, and thus the node degree varies between one and four. Therefore, these eight edge bundles are:

E1,2=e=uvEG|du=1,dv=2,E2,2=e=uvEG|du=2,dv=2E1,3=e=uvEG|du=1,dv=3,E3,3=e=uvEG|du=3,dv=3E1,4=e=uvEG|du=1,dv=4,E2,4=e=uvEG|du=2,dv=4E2,3=e=uvEG|du=2,dv=3,E3,4=e=uvEG|du=3,dv=4

Figure 5
www.frontiersin.org

Figure 5. Thirteen Chemical structure of drugs used to treat Tuberculosis disease.

Table 1
www.frontiersin.org

Table 1. Thirteen drugs related to tuberculosis disease with their calculated reducible topological indices.

The use of isoniazid, along with other medications, is necessary to treat active tuberculosis (TB) infections. Additionally, when an individual tests positive for TB through a skin test and is suspected of having the infection, isoniazid is administered alone to prevent the development of active TB infections. As an antibiotic, isoniazid functions by preventing bacterial growth. Only bacterial illnesses are treated by this antibiotic. Conversely, it proves ineffective against viral illnesses such as the flu or the common cold. The molecular formula of isoniazid is C6H7N3O. It consists of carbon (C), hydrogen (H), nitrogen (N), and oxygen (O) atoms. Figure 6 illustrates the 2D molecular structure of Isoniazid. The molecular structure of Isoniazid, denoted as G, is composed of 10 vertices (dots) and 10 edges (lines). To analyze this structure, a technique of degree-based has been employed. The vertex set of this structure, V(Isoniazid), consists of vertices Vv, where v ranges from 1 to 10. Similarly, the edge set, E(Isoniazid), is formed by the edges {v1v2, v2v3, v3v4, … , v9v10}. There are three vertex partitions, namely, V1, V2, and V3, that can be identified within this structure. In the given graph G, V1 represents the set of vertices that have a degree of 1. These vertices are connected to only one other vertex in the graph. On the other hand V2, consists of vertices that have a degree of 2, meaning they are connected to two other vertices in the graph. Lastly, V3 represents the set of vertices that have a degree of 3, indicating that they are connected to three other vertices in the graph. Therefore, five edge bundles of this structure is obtained. The cardinalities of these bundles are as follows: |E(1,2)| = 1, |E(1,3)| = 1, |E(2,3)| = 3, |E(3,3)| = 1, and |E(2,2)| = 4. These edge bundles are utilized to compute all ten defined reducible indices of the isoniazid structure, which will be discussed below. The same methodology can be applied to determine the remaining indices of other drug structures.

RM1Isoniazid=uvεEIsoniazidndu+ndv=1101+102+1101+103+3102+103+1103+103+4102+102=115+113.3333+38.3333+16.6667+410=100
RM2Isoniazid=uvεEIsoniazidndu×ndv=1101×102+1101×103+3102×103+1103×103+4102×102=150+133.3333+316.6667+111.1111+425=244.4445
RRIsoniazid=uvεEIsoniazidndu×ndv=1101×102+1101×103+3102×103+1103×103+1102×102=115.8113+110.5409+35.2704+13.5137+47.9057=77.2999
RSCIsoniazid=uvεEIsoniazidndu+ndv=1101+102+1101+103+3102+103+1103+103+1102+102=18.1622+16.4957+34.9144+14.3874+46.5811=60.1129
RHM1Isoniazid=twεEIsoniazidndt+ndw2=1101+1022+1101+1032+3102+1032+1103+1032+4102+1022=1225+1177.7778+369.4444+144.4444+4100=1055.5555
RHM2Isoniazid=uvεEIsoniazidndu×ndv2=1101×1022+1101×1032+3102×1032+1103×1032+4102×1022=12500+11111.1111+3277.7778+1123.4567+4625=7067.9012
RSIsoniazid=uvεEIsoniazidndundv2=11011022+11011032+31021032+11031032+41021022=125+144.4444+32.7778+10+40=77.7778
RFIsoniazid=uvεEIsoniazidndu2+ndv2=11012+1022+11012+1032+31022+1032+11032+1032+41022+1022=1125+1111.1111+336.1111+122.2222+450=566.6666
RG1Isoniazid=uvεEIsoniazidndu+ndv+ndu×ndv=1101+102+101×102+1101+103+101×103+3102+103+102×103+1103+103+103×103+4102+102+102×102=165+146.6667+325+117.7778+435=344.4445
RG2Isoniazid=uvεEIsoniazidndu+ndvndu×ndv=1101+102101×102+1101+103101×103+3102+103102×103+1103+103103×103+4102+102102×102=1750+1444.4444+3138.8889+174.0740+4250=2685.1851

Figure 6
www.frontiersin.org

Figure 6. Molecular structure of the Isoniazid drug.

5 Regression models

Regression models are fundamental tools in statistics that allow us to understand and analyze the relationship between variables. By fitting a regression model to a dataset, we can make predictions, uncover patterns, and gain insights into the underlying factors that influence a particular outcome. Regression analysis is like playing matchmaker for numbers. It helps us understand the relationship between different variables and predicts how they interact with each other. Basically, it is a statistical method to find patterns and connections within data. Regression models come in different types, Some popular types include linear regression, non-linear regression, and multiple regression analysis. This article covering three regression models such as linear-regression, quadratic-regression and logarithmic-regression.

Linear regression is like a good old-fashioned straight line. It is the simplest type of regression where we try to fit a line that best represents the relationship between an independent variable and a dependent variable. The variable used for prediction is referred to as the dependent variable (Y), while the term used to predict the value of another variable is known as the independent variable (TI). The linear regression equation is in the form:

Y=a+bTI(5.1)

Quadratic regression is a statistical modeling technique that allows us to analyze the relationship between a dependent variable and one or more independent variables. It is an extension of linear regression, where the relationship between variables is assumed to be quadratic rather than linear. Quadratic regression allows us to model relationships with a curved pattern, capturing more complexity than linear regression. The quadratic regression equation is a fancy way of expressing a curved relationship between variables. In quadratic equation (Y) is the dependent variable, (TI) is the independent variable, and a, b, and c are coefficients that determine the shape of the curve. The quadratic regression model comprises three important components: the intercept (a), the linear coefficient (b), and the quadratic coefficient (c). These components determine the position, slope, and curvature of the curve, respectively. The quadratic regression equation is in the form:

Y=a+bTI+cTI2(5.2)

One commonly used type of regression model is logarithmic regression. Logarithmic regression offers a unique approach to exploring a relationships by fitting a logarithmic function to the data. Logarithmic regression is a type of regression analysis where the relationship between the dependent variable (Y) and the independent variable (TI) is modeled using a logarithmic function. Unlike linear regression, which assumes a linear relationship between the variables, logarithmic regression acknowledges that the relationship may not be linear and allows for a more flexible curve. The logarithmic regression equation is in the form:

Y=a+b.lnTI(5.3)

The ten reducible indices mentioned above serve as models for six physical properties: molar mass, XLOGP3, Complexity, LOGP, melting point, and collision cross section of the thirteen molecular structures of the anti-tuberculosis drugs. Eqs 5.15.3, presented above, are employed to determine the regression equations for degree-based reducible TIs. These equations represent the linear model, quadratic model, and logarithmic model. All three models have been derived using SPSS statistical software. These models are widely used in various fields, including economics, finance, marketing, and social sciences. They provide a quantitative approach to understand the relationships between variables and make predictions based on the available data. By examining the patterns and trends in the data, these regression models help us gain valuable insights and make informed decisions.

5.1 Reducible first Zag-index RM1(G)

MM=121.4199+0.3174RM1GMM=0.0001RM1G2+0.4654RM1G+96.0478XLOGP3=0.44450.002938RM1GXLOGP3=0.0000RM1G2+0.0016RM1G0.3322C=254.0263+0.3989RM1GC=0.0001RM1G2+0.5621RM1G+226.0558LOGP=0.51540.001852RM1GLOGP=0.0000RM1G2+0.0052RM1G0.6961MP=262.37590.04733RM1GMP=0.0000RM1G20.0719RM1G+266.488CCS=128.5696+0.07279RM1GCCS=0.0000RM1G2+0.128RM1G+116.9132MM=524.7273+144.3139.lnRM1GXLOGP3=5.06471.1031.lnRM1GC=465.9574+165.6227.lnRM1GLOGP=2.67890.5673.lnRM1GMP=361.73222.0794.lnRM1GCCS=31.3612+34.8091.lnRM1G

5.2 Reducible second Zag-index RM2(G)

MM=197.8953+0.03537RM2GMM=0.0000RM2G2+0.0776RM2G+158.6936XLOGP3=0.98940.0001154RM2GXLOGP3=0.0000RM2G20.0007RM2G0.4226C=347.2226+0.04531RM2GC=0.0000RM2G2+0.0977RM2G+298.5519LOGP=1.114+0.0001391RM2GLOGP=0.0000RM2G20.0006RM2G0.407MP=255.51930.006822RM2GMP=0.0000RM2G20.0082RM2G+256.7221CCS=152.867+0.00754RM2GCCS=0.0000RM2G2+0.0183RM2G+139.8961MM=356.0103+95.0916.lnRM2GXLOGP3=1.65780.4286.lnRM2GC=208.1414+100.0905.lnRM2GLOGP=1.2177+0.0817.lnRM2GMP=338.266714.9125.lnRM2GCCS=10.3761+23.0143.lnRM2G

5.3 Reducible reciprocal randic index RR(G)

MM=120.8357+0.6734RRGMM=0.0004RRG2+0.973RRG+96.7815XLOGP3=0.35820.005922RRGXLOGP3=0.0000RRG2+0.0001RRG0.123C=255.4439+0.8391RRGC=0.0004RRG2+1.1469RRG+230.722LOGP=0.45140.003701RRGLOGP=0.0000RRG2+0.0087RRG0.5473MP=262.82920.1017RRGMP=0.0000RRG20.1053RRG+263.1149CCS=127.921+0.1565RRGCCS=0.0002RRG2+0.2695RRG+116.6876MM=399.3469+141.2089.lnRRGXLOGP3=3.90971.0407.lnRRGC=283.9748+154.5721.lnRRGLOGP=2.04680.5277.lnRRGMP=336.148220.3144.lnRRgCCS=6.5383+35.0199.lnRRG

5.4 Reducible reciprocal sum connectivity index RSC(G)

MM=81.6943+2.0309RSCGMM=0.0006RSCG2+2.1967RSCG+75.7318XLOGP3=0.62490.0172RSCGXLOGP3=0.0001RSCG2+0.0115RSCG0.4058C=207.3057+2.525RSCGC=0.0026RSCG2+1.8426RSCG+231.8508LOGP=0.53190.01001RSCGLOGP=0.0002RSCG2+0.0311RSCG0.9455MP=268.2718+0.4817RSCGMP=0.0004RSCG20.1936RSCG+264.3534CCS=116.8003+0.4817RSCGCCS=0.0007RSCG2+0.6722RSCG+109.0213MM=479.3429+182.2592.lnRSCGXLOGP3=4.46631.3357.lnRSCGC=389.4377+203.5928.lnRSCGLOGP=2.26720.6632.lnRSCGMP=350.016426.7631.lnRSCGCCS=24.2619+44.7407.lnRSCG

5.5 Reducible hyper first Zag-index RHM1(G)

MM=162.7938+0.00754RHM1GMM=0.0000RHM1G+20.0149RHM1G+122.6824XLOGP3=0.34020.00008324RHM1GXLOGP3=0.0000RHM1G2+0.0001RHM1G0.4118C=308.3285+0.009365RHM1GC=0.0000RHM1G2+0.0168RHM1G+267.8934LOGP=0.59040.00005927RHM1GLOGP=0.0000RHM1G2+0.0002RHM1G0.5996MP=254.0060.0009953RHM1GMP=0.0000RHM1G20.0021RHM1G+259.4173CCS=141.1509+0.001613RHM1GCCS=0.0000RHM1G2+0.0038RHM1G+124.5373
MM=544.4697+96.6785.lnRHM1GXLOGP3=5.67580.7905.lnRHM1GC=348.72+95.2908.lnRHM1GLOGP=3.31290.4423.lnRHM1GMP=315.79889.2112.lnRHM1GCCS=51.3892+24.7484.lnRHM1G

5.6 Reducible hyper first Zag-index RHM2(G)

MM=210.4404+0.00007769RHM2GMM=0.0000RHM2G2+0.0002RHM2G+178.4115XLOGP3=0.021999.748e7RHM2GXLOGP3=0.0000RHM2G2+0RHM2G0.3579C=366.7098+0.00009706RHM2GC=0.0000RHM2G2+0.0002RHM2G+326.4705LOGP=0.47167.936e7RHM2GLOGP=0.0000RHM2G2+0RHM2G0.2598MP=248.86520.00001071RHM2GMP=0.0000RHM2G2+0RHM2G+257.5147CCS=155.0311+0.00001535RHM2GCCS=0.0000RHM2G2+0RHM2G+145.1945MM=318.8317+54.8104.lnRHM2GXLOGP3=3.48660.4186.lnRHM2GC=202.41+60.5619.lnRHM2GLOGP=1.94010.2215.lnRHM2GMP=336.68678.9537.lnRHM2GCCS=21.5129+13.0429.lnRHM2G

5.7 Reducible sigma index RS(G)

MM=192.419+0.04944RSGMM=0.0000RSG2+0.129RSG+138.522XLOGP3=0.76160.0008381RSGXLOGP3=0.0000RSG2+0.0006RSG0.2409C=335.0517+0.06534RSGC=0.0000RSG2+0.1454RSG+280.8399LOGP=0.92850.0006116RSGLOGP=0.0000RSG2+0.0007RSG+0.0229MP=246.15530.004883RSGMP=0.0000RSG20.04RSG+267.6473CCS=146.93+0.01019RSGCCS=0.0000RSG2+0.0276RSG+133.0309MM=294.8106+93.1536.lnRSGXLOGP3=4.47650.8895.lnRSGC=206.2213+107.5349.lnRSGLOGP=2.82940.5261.lnRSGMP=322.168113.6067.lnRSGCCS=27.9428+21.7919.lnRSG

5.8 Reducible forgotten index RF(G)

MM=169.4351+0.01312RFGMM=0.0000RFG2+0.0267RFG+132.5XLOGP3=0.34950.0001521RFGXLOGP3=0.0000RFG2+0.0002RFG0.487C=312.626+0.01665RFGC=0.0000RFG2+0.0329RFG+268.6353LOGP=0.59320.000108RFGLOGP=0.0000RFG2+0.0003RFG0.5548MP=253.89040.001809RFGMP=0.0000RFG20.0068RFG+266.2883CCS=143.0493+0.002774RFGCCS=0.0000RFG2+0.0067RFG+128.9797MM=465.5766+95.8217.lnRFGXLOGP3=4.91330.7692.lnRFGC=357.8937+105.0634.lnRFGLOGP=2.75090.4138.lnRFGMP=347.210913.9873.lnRFGCCS=19.4979+23.3651.lnRFG

5.9 Reducible first Gourava index RG1(G)

MM=160.963+0.03131RG1GMM=0.0000RG1G2+0.061RG1G+124.4735XLOGP3=0.17750.0003094RG1GXLOGP3=0.0000RG1G2+0RG1G0.253C=304.648+0.03917RG1GC=0.0000RG1G2+0.0785RG1G+256.2399LOGP=0.44520.0002145RG1GLOGP=0.0000RG1G2+0.0005RG1G0.4838MP=256.68380.004654RG1GMP=0.0000RG1G20.0109RG1G+263.8316CCS=140.455+0.006874RG1GCCS=0.0000RG1G2+0.0155RG1G+126.0261MM=463.4443+104.0763.lnRG1GXLOGP3=4.55780.7904.lnRG1GC=346.22+112.8723.lnRG1GLOGP=2.49160.4162.lnRG1GMP=343.452514.7187.lnRG1GCCS=23.0011+25.8757.lnRG1G

5.10 Reducible second Gourava index RG2(G)

MM=191.5845+0.0007926RG2GMM=0.0000RG2G2+0.0018RG2G+156.491XLOGP3=0.091560.00000918RG2GXLOGP3=0.0000RG2G2+0RG2G0.3882C=342.2894+0.0009956RG2GC=0.0000RG2G2+0.0023RG2G+297.8582LOGP=0.47390.000006913RG2GLOGP=0.0000RG2G2+0RG2G0.4004MP=251.56990.0001121RG2GMP=0.0000RG2G20.0005RG2G+262.0989CCS=149.7374+0.0001628RG2GCCS=0.0000RG2G2+0.0004RG2G+137.149
MM=433.6237+73.2963.lnRG2GXLOGP3=4.48090.5712.lnRG2GC=298.8449+78.0269.lnRG2GLOGP=2.50810.3063.lnRG2GMP=338.415210.2879.lnRG2GCCS=16.0368+18.2656.lnRG2G

6 Statistical parameters in QSPR models

QSPR (Quantitative Structure-Property Relationship) modeling has emerged as a powerful tool in the field of computational chemistry to predict the properties and behaviors of chemicals and materials. In order to build reliable and accurate QSPR models, it is crucial to understand and utilize statistical parameters effectively. Statistical parameters play a crucial role in model selection. They help us identify the model that best captures the underlying relationship between chemical structure and property. By comparing statistical parameters across different models, we can determine which one is more accurate, precise, and reliable for making predictions. This ensures that we choose the most appropriate model for our specific needs. In QSPR modeling, several statistical parameters are used to assess model performance. These include correlation coefficient (R and R2), F-test value and p-value. By examining these parameters, we can determine whether our models are dependable and can consistently provide accurate predictions. This study delves into the discussion of eight statistical parameters for reducible topological indices within three regression models of QSPR. The variable “N” represents the size of the data set, which can vary from 9, 12, and 13. In correlation (r), there are three possible cases. The first case occurs when r>0, it means that as one term increases, the other term also tends to increase at the same time. It can be inferred that there is a powerful or undeviating connection between the two terms. The second case occurs when r<0, implying that as one term increases, the other term tends to decrease. This indicates a negative or inverse connection between the two variables. The third case occurs when r = 0, meaning that no relationship is present between the two variables. The range of r values lies between −1 and +1 for the correlation coefficient. A value of +1 indicates a perfect positive correlation, while a value of −1 signifies a perfect negative correlation. In simpler terms, when the value of r is close to 1, such as 0.9 or 0.8, it suggests a strong and direct relationship between two variables. On the other hand, if the value of r is significantly far for 1, like 0.4, 0.3, or 0.2, it indicates a very weak association between the variables. The same principle applies when the value of r is nearly equal or far to −1. The correlation coefficient (R2) is a convenient tool for evaluating the QSPR models’ quality. R2 > 0.7 is essential for a good QSPR model.

The F-test is a statistical test used to compare the variances of two populations. In simpler terms, it helps determine if the means of two groups are significantly different from each other, by comparing the variation within the groups to the variation between the groups. The p-value associated with the F-test indicates the probability of observing the data if the null hypothesis (usually that there is no difference between the groups) is true. A low p-value (typically below 0.05) suggests that there is strong evidence against the null hypothesis, meaning the difference between the groups is unlikely to be due to random chance alone. The evaluation of our model’s reliability is greatly enhanced by the utilization of two test values: the p-value and the F-test value. These values help us determine whether the relationship between properties and indices is significant or not.

In statistical analysis, a p-value ⩽0.05 and an F-test value >2.5 are often considered indicators of a statistically significant or strong relationship. On the other hand, if the p-value is greater than 0.05 and the F-test value is less than 2.5, it suggests a lack of significance, indicating a weak relationship between properties and indices. By assessing the p-value and F-test value, we ensure the validity of our models and make informed decisions. The numerical values of 8 parameters, including p-values and F-test values, related to all the defined indices and properties for tuberculosis disease are contained in Tables 312.

7 Graphical analysis

Bar graphs are a popular and effective way to visually represent data. The use of rectangular bars of different lengths to show the magnitude of data points and make comparisons between different categories or groups for numerous drugs. By presenting data in a graphical format, bar graphs make it easier for people to understand and interpret information quickly. In bar graphs, the x-axis and the y-axis are commonly known as the two axes used. The horizontal axis (x-axis) represents the all reducible indices, while the vertical axis (y-axis) showcases the values of all correlation related to properties. These vertical or horizontal rectangles represent the magnitude or value of each category. The length or height of the bars is proportional to the data which are represent. In each graph, the squared symbol denotes data points associated with the linear, quadratic, and logarithmic model. Microsoft excel has been used to create these graphs. In Figure 7, there is a visual representation of the compared correlation graph showing the relationship between various reducible degree-based indices and characteristics of tuberculosis drugs. Every single graph was created using the data from Tables 1315. Figure 7 reveals a clear visualization of the range of r values.

Figure 7
www.frontiersin.org

Figure 7. Bar graphs visually compared the r values between properties and reducible TIs.

Figure 7A illustrates the comparison of correlation values with blue, orange, and grey bars representing the linear-model, quadratic-model, and logarithmic-models, respectively. The correlation values between reducible indices and molar mass range from 0.77 to 0.99. In Figure 7B, the inverse correlation values for the linear and quadratic models lie between −0.1 and −0.6. However, only RG2(G) does not exhibit an inverse relationship in the linear model, as its correlation value is 0.2256. On the other hand, the range of positive correlation values for the quadratic model lies between 0.1 and 0.7. The correlation r results for the three models in Figure 7C are all positive and range from 0.4 to 0.7. In Figure 7D, the inverse correlation values for the linear and quadratic models lie between −0.1 and −0.5. However, only RG2(G) does not show an inverse relationship in the linear and quadratic models, as its correlation values are 0.1487 and 0.0358. On the other hand, the range of positive correlation values (r) for the quadratic model lies between 0.2 and 0.6. In Figure 7E, both the linear and quadratic models exhibit an inverse relationship due to their negative correlation values, which fall to −0.3. However, the quadratic model demonstrates positive correlation values ranging from 0.30 to 0.39. Lastly, in Figure 7F, all models exhibit a positive relationship, ranging from 0.6 to 0.9.

8 Results with discussion

The main objective of this portion is to discuss a QSPR analysis that focuses on ten specified reducible topological indices (TIs) and six distinct properties associated with tuberculosis disease. The mentioned properties comprise molar mass, XLOGP3, Complexity, LOGP, melting point, and collision cross section. The data for these properties has been sourced from the PubChem website and is presented in Table 2. Vertex partitions and edge partitions, which are degree-based TIs techniques, have been employed to compute the numerical values corresponding to the reducible molecular descriptors. The values obtained through calculation for all indices are present in Table 1. The association between reducible topological descriptors and physicochemical characteristics for anti-tuberculosis drugs has been determined in the three QSPR models that were examined. The statistical parameters were computed to obtain these results. Tables 312 presents the summarized values of eight statistical parameters that were calculated. All the results in the table include information on the p-values and F-test values for the ten reducible TIs associated with anti-tuberculosis drugs. As a result, some outcomes hold significant and strong value. For instance, Table 3 represents the statistical analysis for the reducible first Zagreb index. In this table, the linear model shows the F-test value with the highest and lowest significance values as (647.5515, 0.8791), with corresponding p-values of (0.0000, 0.3686). The correlation r values for this model are (0.9916, −0.272). In the quadratic model, the F-test value is (139.5896, 0.6068) with a p-value of (0.0000, 0.566). The correlation r values for this model are (0.9981, 0.3446). Lastly, in the logarithmic model, the F-test value is (94.6245, 1.5637) with p-values of (0.0003, 0.5241). The correlation r values for this model are (0.9861, −0.3301). This analysis provides valuable insights into the relationship between the reducible first Zagreb index and the different models. Figure 7, represents the bar graphs which are visually compared the r values between characteristics and reducible indices for anti-tuberculosis drugs. The graphs clearly illustrate the positive correlation r values observed in Figures 7A, C, F. Interestingly, Figures 7A, F exhibit a stronger positive correlation compared to Figure 7C. On the other hand, Figures 7B, D, E demonstrate that among the three models (linear, logarithmic, and quadratic), only the quadratic model displays a positive correlation value. Below the results with discussion for all indices that is based on Tables 1315 are presented.

Table 2
www.frontiersin.org

Table 2. Six physicochemical properties of anti-tuberculosis drugs.

Table 3
www.frontiersin.org

Table 3. Statistical analysis for the reducible first Zagreb index.

Table 4
www.frontiersin.org

Table 4. Statistical analysis for the reducible second Zagreb index.

Table 5
www.frontiersin.org

Table 5. Statistical analysis for the reducible reciprocal Randic index.

Table 6
www.frontiersin.org

Table 6. Statistical analysis for the reducible reciprocal sum connectivity index.

Table 7
www.frontiersin.org

Table 7. Statistical analysis for the reducible first hyper Zagreb index.

Table 8
www.frontiersin.org

Table 8. Statistical analysis for the reducible second hyper Zagreb index.

Table 9
www.frontiersin.org

Table 9. Statistical analysis for the reducible sigma index.

Table 10
www.frontiersin.org

Table 10. Statistical analysis for the reducible forgotten index.

Table 11
www.frontiersin.org

Table 11. Statistical analysis for the reducible first Gourava index.

Table 12
www.frontiersin.org

Table 12. Statistical analysis for the reducible second Gourava index.

Table 13
www.frontiersin.org

Table 13. Comparison table of correlation coefficient of linear regression.

8.1 Linear regression model

     By using Tables 313, we conclude that all of the defined reducible indices show a robust positive connection with molar mass within the range of r = 0.7 to 0.9 with r2 = 0.6 to 0.9 having F-value, grater and lower value is (685.6292 and 20.1735) with p-value is less than 0.05, as well as with collision cross section within the range of r = 0.6 to 0.9 with r2 = 0.4 to 0.9 having F-value, grater and lower value is (452.0602 and 6.4616) with p-value is less than 0.05. The complexity property demonstrates a significant correlation with RM1(G) at r = 0.7074, with r2 = 0.5004 having F-value is 11.0191, RR(G) at r = 0.7016 with r2 = 0.4923 and F-value is 10.6656, and RSC(G) at r = 0.7044 and r2 = 0.4962 having F-value is 10.833. These three properties display a strong positive correlation with the indices. On the other hand, the remaining three properties, namely, XLOGP3, LOGP, and MP, do not exhibit a strong correlation with any of the indices, as their correlation r values range from −0.1 to −0.5 with r2 = −0.4 to 0.2 having very low F-value and p-value which is greater than 0.05. These values indicate an inverse relationship, and have a very low significance level. Only XLOGP3 demonstrates a moderate inverse relationship with RS(G) at r = −0.6737 with r2 = 0.4538 and F-value is 9.1398. Consequently, the properties of molar mass and collision cross section offer valuable insights into the relationship with all of the indices.

8.2 Quadratic regression model

     Based on the results obtained from Tables 312, 14, it is clear that there exists a robust positive correlation between molar mass and all the specified indices, falling within the range of r = 0.8 to 0.9 with r2 = 0.7 to 0.9 having F-value, highest and lowest value is (1448.3066 and 12.0901) with p-value is less than 0.05. Additionally, a similar association is observed with collision cross section, ranging from r = 0.7 to 0.9 and r2 = 0.5,0.8 and 0.9 with F-value which is greater and lower is (567.848 and 3.97) having p-value which is less than 0.05. These intriguing findings have a profound impact on the advancement of anti-tuberculosis drugs, as they provide remarkably valuable insights into the development process. Only one index, namely, the reducible sigma index at r = 0.7455 with r2 = 0.5558 and F-value is 6.2551, demonstrates a good association with XLOGP3. The relationship between complexity with RM1(G), RR(G), RSC(G), RS(G), RF(G), and RG1(G) demonstrates a strong correlation, with r values exceeding 0.7 with r2 = 0.4 to 0.5. At a r = 0.6, this index demonstrates only a mediocre correlation with RM2(G), RHM1(G), RHM2(G), and RG2(G). LOGP show a mediocre correlation with only reducible sima index at r = 0.6045 with r2 = 0.3655.

Table 14
www.frontiersin.org

Table 14. Comparison table of correlation coefficient of quadratic regression.

8.3 Logarithmic regression model

     After analyzing the data presented in Tables 312, 15, we have reached the conclusion that all the reducible indices demonstrate a very high positive correlation with molar mass within the range of r = 0.8 to 0.9 with r2 varies from to 0.7 to 0.9 having F-value, greater and lower is (368.6357 and 57.5364) with p-value which is less than 0.05, as well as with collision cross section within the same r and r2 range having F-value greater and lower is (479.3848, 39.7256) and p-values which is less than 0.05. This analysis reveals a noteworthy relationship between the defined indices and both molar mass and collision cross section. The data clearly indicates a consistent and significant positive correlation within the specified range. The range of r = 0.6 with r2 varies from 0.4 to 0.3 reveals a mediocre connection between the complexity property with six reducible indices, namely, RM1(G), RR(G), RSC(G), RHM2(G), RS(G), and RF (G). It can be observed that there is no strong correlation and significance level between LOGP and melting point across all defined indices due to the fact that the value of r falls between −0.1 and −0.3 with r2 = 0.0 to 0.1.

Table 15
www.frontiersin.org

Table 15. Comparison table of correlation coefficient of logarithmic regression.

9 Conclusion

Degree-based topological indices prove exceptionally valuable in modeling and characterizing the molecular structure of medications, allowing for the best prediction of physicochemical properties without conducting time-consuming experiments. These indices, derived from the principles of graph theory, serve as numerical descriptors for molecular structures. When analyzing drugs using QSPR, degree-based reducible topological indices serve as important tools for predicting drug properties by providing molecular descriptors. This knowledge is vital for the development of effective treatments and the optimization of drug design. The selected drugs such as Isoniazid, Pyrazinamide, Ethionamide, Levofloxacin, Amikacin, Ofloxacin, Cycloserine, 4-aminosalicyclic acid, Ethambutol, Ciprofloxacin, Bedaquiline, Kanamycin and Streptomycin have been widely used in the treatment of tuberculosis. By calculating the degree-based reducible topological indices for these compounds, we can better understand their structural features and how they contribute to their pharmacological properties. Furthermore, we have developed a three-regression model to investigate the correlation between these calculated values and experimental properties, including molar mass, XLOGP3, Complexity, LOGP, melting point, and collision cross-section. Based on the statistical parameters and topological indices employed in the three QSPR model, we can conclude that both molar mass and collision cross section exhibit a highly significant association with all the specified indices. As an example, when examining the relationship between molar mass and collision cross section with RM1(G) index, we observe varying correlation coefficients (r values) across different models. In the linear model, the correlation coefficient ranges from 0.9916 to 0.9733. Meanwhile, the quadratic model yields a higher correlation coefficient, ranging from 0.9981 to 0.9947. Lastly, the logarithmic model shows a correlation coefficient ranging from 0.9742 to 0.9861. There is a good correlation between complexity and RM1(G), RR(G), and RSC(G) indices in the linear model at the value of r exceeds from 0.7. Similarly, in the quadratic model, the relationship between complexity and the RM1(G), RR(G), RSC(G), RS(G), RF(G), and RG1(G) indices also demonstrates a good correlation, with r values exceeding from 0.7. Furthermore, within the three discussed models, it can be observed that there is no strong correlation or significance level between LOGP and melting point across all defined indices. This research indicates that theoretical assessment could allow chemists and other professionals in the pharmaceutical sector to predict the characteristics of tuberculosis medications without the need for conducting experiments. The analysis revealed a wide range of reducible topological indices, suggesting that alternative formulations of these drugs could potentially be used to treat a variety of disorders. The research also determined the correlation coefficient for several reducible topological indices, providing chemists with valuable information to develop new medications by combining existing ones with high correlations. We aspire for this research to create a pathway for future breakthroughs in drug design and therapy, ultimately resulting in enhanced results for individuals afflicted with this incapacitating illness. Before ending this paper, we present the following open problem for further examination.

9.1 Open problem

The analysis of QSPR focused on the correlation between different topological indices that is based on distance-based methodology, and physicochemical properties of anti-tuberculosis drugs through the use of exponential regression model and multi linear regression model.

Data availability statement

The original contributions presented in the study are included in the article/supplementary material, further inquiries can be directed to the corresponding author.

Author contributions

SA: Conceptualization, Data curation, Formal Analysis, Funding acquisition, Investigation, Methodology, Project administration, Resources, Software, Supervision, Validation, Visualization, Writing–original draft, Writing–review and editing. NN: Conceptualization, Data curation, Formal Analysis, Funding acquisition, Investigation, Methodology, Project administration, Resources, Software, Supervision, Validation, Visualization, Writing–original draft, Writing–review and editing. MR: Conceptualization, Data curation, Formal Analysis, Funding acquisition, Investigation, Methodology, Project administration, Resources, Software, Supervision, Validation, Visualization, Writing–original draft, Writing–review and editing. LA: Conceptualization, Data curation, Formal Analysis, Funding acquisition, Investigation, Methodology, Project administration, Resources, Software, Supervision, Validation, Visualization, Writing–original draft, Writing–review and editing.

Funding

The authors declare that no financial support was received for the research, authorship, and/or publication of this article.

Conflict of interest

The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.

Publisher’s note

All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article, or claim that may be made by its manufacturer, is not guaranteed or endorsed by the publisher.

References

1. Natarajan A, Beena PM, Devnikar AV, Mali S. A systemic review on tuberculosis. Indian J Tuberculosis (2020) 67(3):295–311. doi:10.1016/j.ijtb.2020.02.005

PubMed Abstract | CrossRef Full Text | Google Scholar

2. Jasmer RM, Nahid P, Hopewell PC. Latent tuberculosis infection. New Engl J Med (2002) 347(23):1860–6. doi:10.1056/nejmcp021045

PubMed Abstract | CrossRef Full Text | Google Scholar

3. Ramirez-Lapausa M, Menendez-Saldana A, Noguerado-Asensio A. Extrapulmonary tuberculosis: an overview. Rev Esp Sanid Penit (2015) 17(1):3–11. doi:10.4321/s1575-06202015000100002

PubMed Abstract | CrossRef Full Text | Google Scholar

4. Sharma SK, Mohan A, Kohli M. Extrapulmonary tuberculosis. Expert Rev Respir Med (2021) 15(7):931–48. doi:10.1080/17476348.2021.1927718

PubMed Abstract | CrossRef Full Text | Google Scholar

5. Mirzayev F, Viney K, Linh NN, Gonzalez-Angulo L, Gegia M, Jaramillo E, et al. World Health Organization recommendations on the treatment of drug-resistant tuberculosis, 2020 update. Eur Respir J (2021) 57(6):2003300. doi:10.1183/13993003.03300-2020

PubMed Abstract | CrossRef Full Text | Google Scholar

6. Griffith DE, Kerr CM. Tuberculosis: disease of the past, disease of the present. J Perianesthesia Nurs (1996) 11(4):240–5. doi:10.1016/s1089-9472(96)80023-2

PubMed Abstract | CrossRef Full Text | Google Scholar

7. Agyeman AA, Ofori-Asenso R. Tuberculosis-an overview. J Public Health Emerg (2017) 1(7):7–11. doi:10.21037/jphe.2016.12.08

CrossRef Full Text | Google Scholar

8. Hayat S, Imran M. On degree based topological indices of certain nanotubes. J Comput Theor Nanoscience (2015) 12(8):1599–605. doi:10.1166/jctn.2015.3935

CrossRef Full Text | Google Scholar

9. Siddiqui MK, Naeem M, Rahman NA, Imran M. Computing topological indices of certain networks. J Optoelectronics Adv Mater (2016) 18:884–92.

Google Scholar

10. Hayat S, Imran M. Computation of topological indices of certain networks. Appl Math Comput (2014) 240:213–28. doi:10.1016/j.amc.2014.04.091

CrossRef Full Text | Google Scholar

11. Bollobás B, Erdös P. Graphs of extremal weights. Ars combinatoria (1998) 50:225.

Google Scholar

12. Amic D, Bešlo D, Lucic B, Nikolic S, Trinajstic N. The vertex-connectivity index revisited. J Chem Inf Comput Sci (1998) 38(5):819–22. doi:10.1021/ci980039b

CrossRef Full Text | Google Scholar

13. Hu Y, Li X, Shi Y, Xu T, Gutman I. On molecular graphs with smallest and greatest zeroth-order general Randic index. MATCH Commun Math Comput Chem (2005) 54(2):425–34.

Google Scholar

14. Caporossi G, Gutman I, Hansen P, Pavlovi? L. Graphs with maximum connectivity index. Comput Biol Chem (2003) 27(1):85–90. doi:10.1016/s0097-8485(02)00016-5

CrossRef Full Text | Google Scholar

15. Li X, Gutman I. Mathematical chemistry monographs no. 1. Kragujevac, Macedonia: University of Kragujevac (2006).

Google Scholar

16. Kier LB, Hall LH. Molecular connectivity VII: specific treatment of heteroatoms. J Pharm Sci (1976) 65(12):1806–9. doi:10.1002/jps.2600651228

PubMed Abstract | CrossRef Full Text | Google Scholar

17. Kier LB, Hall LH. Molecular connectivity in structure-activity analysis (1986). (No Title).

Google Scholar

18. Shafiei F. Relationship between topological indices and thermodynamic properties and of the monocarboxylic acids applications in QSPR. Iranian J Math Chem (2015) 6(1):15–28.

Google Scholar

19. Sepehri B. A review on created QSPR models for predicting ionic liquids properties and their reliability from chemometric point of view. J Mol Liquids (2020) 297:112013. doi:10.1016/j.molliq.2019.112013

CrossRef Full Text | Google Scholar

20. Hosamani SM, Kulkarni BB, Boli RG, Gadag VM. QSPR analysis of certain graph theocratical matrices and their corresponding energy. Appl Math Nonlinear Sci (2017) 2(1):131–50. doi:10.21042/amns.2017.1.00011

CrossRef Full Text | Google Scholar

21. Mondal S, Dey A, De N, Pal A. QSPR analysis of some novel neighbourhood degree-based topological descriptors. Complex Intell Syst (2021) 7:977–96. doi:10.1007/s40747-020-00262-0

CrossRef Full Text | Google Scholar

22. Miyao T, Kaneko H, Funatsu K. Inverse QSPR/QSAR analysis for chemical structure generation (from y to x). J Chem Inf Model (2016) 56(2):286–99. doi:10.1021/acs.jcim.5b00628

PubMed Abstract | CrossRef Full Text | Google Scholar

23. Huang R, Mahboob A, Rasheed MW, Alam SM, Siddiqui MK. On molecular modeling and QSPR analysis of lyme disease medicines via topological indices. The Eur Phys J Plus (2023) 138(3):243. doi:10.1140/epjp/s13360-023-03867-9

CrossRef Full Text | Google Scholar

24. Hasani M, Ghods M. Topological indices and QSPR analysis of some chemical structures applied for the treatment of heart patients. Int J Quan Chem (2024) 124:e27234. doi:10.1002/qua.27234

CrossRef Full Text | Google Scholar

25. Mahboob A, Rasheed MW, Amin L, Hanif I. A study of novel molecular descriptors and quantitative structure–property relationship analysis of blood cancer drugs. Eur Phys J Plus (2023) 138(9):856. doi:10.1140/epjp/s13360-023-04499-9

CrossRef Full Text | Google Scholar

26. Mahboob A, Rasheed MW, Hanif I, Amin L, Alameri A. Role of molecular descriptors in quantitative structure-property relationship analysis of kidney cancer therapeutics. Int J Quan Chem (2024) 124:e27241. doi:10.1002/qua.27241

CrossRef Full Text | Google Scholar

27. Husin MN, Ariffin A. On the edge version of topological indices for certain networks. Ital J Pure Appl Math (2022) 47:550–64.

Google Scholar

28. Liu JB, Ali H, Shafiq MK, Dustigeer G, Ali P. On topological properties of planar octahedron networks. Polycyclic Aromatic Comp (2023) 43(1):755–71. doi:10.1080/10406638.2021.2022726

CrossRef Full Text | Google Scholar

29. Sardar MS, Ali MA, Ashraf F, Cancan M. On topological indices of double and strong double graph of silicon carbide Si2 C3-I [p, q]. Eurasian Chem Commun (2023) 5:37–49. doi:10.22034/ecc.2023.356160.1519

CrossRef Full Text | Google Scholar

30. Raza Z, Akhter S, Shang Y. Expected value of first Zagreb connection index in random cyclooctatetraene chain, random polyphenyls chain, and random chain network. Front Chem (2023) 10:1067874. doi:10.3389/fchem.2022.1067874

PubMed Abstract | CrossRef Full Text | Google Scholar

31. Nadeem I, Siddique S, Shang Y. Some inequalities between general randic-type graph invariants. J Math (2024) 2024:1–7. doi:10.1155/2024/8204742

CrossRef Full Text | Google Scholar

32. Gutman I, Trinajstic N. Graph theory and molecular orbitals. Total φ-electron energy of alternant hydrocarbons. Chem Phys Lett (1972) 17(4):535–8. doi:10.1016/0009-2614(72)85099-1

CrossRef Full Text | Google Scholar

33. Islam SR, Pal M. Second Zagreb index for fuzzy graphs and its application in mathematical chemistry. Iranian J Fuzzy Syst (2023) 20(1):119–36. doi:10.22111/IJFS.2023.7350

CrossRef Full Text | Google Scholar

34. Mondal S, De N, Pal A. On neighborhood Zagreb index of product graphs. J Mol Struct (2021) 1223:129210. doi:10.1016/j.molstruc.2020.129210

PubMed Abstract | CrossRef Full Text | Google Scholar

35. Randic M. Characterization of molecular branching. J Am Chem Soc (1975) 97(23):6609–15. doi:10.1021/ja00856a001

CrossRef Full Text | Google Scholar

36. Yuan M. On the Randic index and its variants of network data. TEST (2023) 1–25. doi:10.1007/s11749-023-00887-6

CrossRef Full Text | Google Scholar

37. Zhou B, Trinajstic N. On general sum-connectivity index. J Math Chem (2010) 47:210–8. doi:10.1007/s10910-009-9542-4

CrossRef Full Text | Google Scholar

38. Hussain M, ur Rehman A, Iqbal MS, Muhammad A. Computation of randic and sum-connectivity indices for H3BO3 layer. Structure’s Derivatives (2022). doi:10.21203/rs.3.rs-2124344/v1

CrossRef Full Text | Google Scholar

39. Carballosa W, Pestana D, Sigarreta JM, Tour E. Relations between the general sum connectivity index and the line graph. J Math Chem (2020) 58:2273–90. doi:10.1007/s10910-020-01180-9

CrossRef Full Text | Google Scholar

40. Shirdel GH, Rezapour H, Sayadi AM. The hyper-Zagreb index of graph operations (2013).

Google Scholar

41. Sarkar I, Nanjappa M. On second hyper-zagreb index of corona products related to R-graphs. Int J Appl Math (2023) 36(3):301. doi:10.12732/ijam.v36i3.1

CrossRef Full Text | Google Scholar

42. Hakeem A, Muhammad Katbar N, Muhammad F, Ahmed N. QSPR analysis of some important drugs used in heart attack treatment via degree-based topological indices and regression models. Polycyclic Aromatic Comp (2023) 1–10. doi:10.1080/10406638.2023.2262697

CrossRef Full Text | Google Scholar

43. Gutman I, Togan M, Yurttas A, Cevik AS, Cangul IN. Inverse problem for sigma index. MATCH Commun Math Comput Chem (2018) 79(2):491–508.

Google Scholar

44. Ascioglu M, Cangul IN. Sigma index and forgotten index of the subdivision and r-subdivision graphs. Proc Jangjeon Math Soc (2018) 21(3):375–83. doi:10.17777/pjms2018.21.3.375

CrossRef Full Text | Google Scholar

45. Havare OC. QSPR analysis with curvilinear regression modeling and topological indices. Iranian J Math Chem (2019) 10(4):331–41. doi:10.22052/ijmc.2019.191865.1448

CrossRef Full Text | Google Scholar

46. Furtula B, Gutman I. A forgotten topological index. J Math Chem (2015) 53(4):1184–90. doi:10.1007/s10910-015-0480-z

CrossRef Full Text | Google Scholar

47. Hosamani SM. A upper bound for the F 1-index and its applications to fullerenes. MATCH Commun Math Comput Chem (2015) 74(1):97–101.

Google Scholar

48. De N, Nayeem SMA, Pal A. F-index of some graph operations. Discrete Math algorithms Appl (2016) 8(02):1650025. doi:10.1142/s1793830916500257

PubMed Abstract | CrossRef Full Text | Google Scholar

49. Kulli VR. The Gourava indices and coindices of graphs. Ann Pure Appl Math (2017) 14(1):33–8. doi:10.22457/apam.v14n1a4

CrossRef Full Text | Google Scholar

50. Wang Y, Aslam A, Idrees N, Kanwal S, Iram N, Razzaque A. On trees with given independence numbers with maximum Gourava indices. Symmetry (2023) 15(2):308. doi:10.3390/sym15020308

CrossRef Full Text | Google Scholar

Keywords: tuberculosis drugs, degree-based reducible topological indices, QSPR analysis, regression models, correlation coefficients (r)

Citation: Alam SM, Nigar N, Rasheed MW and Amin L (2024) Analyzing the role of reducible molecular descriptors and thermodynamic aspects of anti-tuberculosis drugs via QSPR study. Front. Phys. 12:1383216. doi: 10.3389/fphy.2024.1383216

Received: 07 February 2024; Accepted: 22 March 2024;
Published: 18 April 2024.

Edited by:

Vickramjeet Singh, Dr. B. R. Ambedkar National Institute of Technology Jalandhar, India

Reviewed by:

Yilun Shang, Northumbria University, United Kingdom
Imran Khan, Sultan Qaboos University, Oman

Copyright © 2024 Alam, Nigar, Rasheed and Amin. This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.

*Correspondence: Muhammad Waheed Rasheed, d2FoZWVkcmFzaGVlZDQ2MUBnbWFpbC5jb20=

Disclaimer: All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article or claim that may be made by its manufacturer is not guaranteed or endorsed by the publisher.