Skip to main content

ORIGINAL RESEARCH article

Front. Microbiol., 08 September 2022
Sec. Infectious Agents and Disease
This article is part of the Research Topic Mycobacterium spp.- Transmission, Pathogenesis, Host-pathogen interaction, Prevention and Treatment View all 9 articles

The shortcut of mycobacterial interspersed repetitive unit-variable number tandem repeat typing for Mycobacterium tuberculosis differentiation

\r\nShima Hadifar,Shima Hadifar1,2Mansour Kargarpour Kamakoli,Mansour Kargarpour Kamakoli1,2Sana EybpooshSana Eybpoosh3Mehran Nakhaeizadeh,Mehran Nakhaeizadeh4,5Mohammad Ali Kargarpour KamakoliMohammad Ali Kargarpour Kamakoli6Nasim Ebrahimifard,Nasim Ebrahimifard1,2Abolfazl Fateh,Abolfazl Fateh1,2Seyed Davar Siadat,Seyed Davar Siadat1,2Farzam Vaziri,*Farzam Vaziri1,2*
  • 1Department of Mycobacteriology and Pulmonary Research, Pasteur Institute of Iran, Tehran, Iran
  • 2Microbiology Research Center, Pasteur Institute of Iran, Tehran, Iran
  • 3Department of Epidemiology and Biostatistics, Research Centre for Emerging and Reemerging Infectious Diseases, Pasteur Institute of Iran, Tehran, Iran
  • 4Modeling in Health Research Center, Institute for Futures Studies in Health, Kerman University of Medical Sciences, Kerman, Iran
  • 5Department of Biostatistics and Epidemiology, School of Public Health, Kerman University of Medical Sciences, Kerman, Iran
  • 6Department of Mechanical Engineering, Quchan University of Technology, Quchan, Iran

The 24-loci mycobacterial interspersed repetitive unit-variable number tandem repeat (MIRU-VNTR) genotyping has been used as an international standard method for Mycobacterium tuberculosis (Mtb) genotyping. However, different optimized VNTR loci sets for improving the discrimination of specific Mtb genotypes have been proposed. In this regard, we investigated the efficacy of accumulation of the percentage differences (APDs) compared with the least absolute shrinkage and selection operator (LASSO) regression strategy to identify a customized genotype-specific VNTR loci set which provides a resolution comparable to 24-loci MIRU-VNTR in divergent Mtb populations. We utilized Spoligotyping and 24-loci MIRU-VNTR typing for genotyping 306 Mtb isolates. The APD and LASSO regression approaches were used to identify a customized VNTR set in our studied isolates. Besides, the Hunter-Gaston discriminatory index (HGDI), sensitivity, and specificity of each selected loci set were calculated based on both strategies. The selected loci based on LASSO regression compared with APD-based loci showed a better discriminatory power for identifying all studied genotypes except for T genotype, which APD-based loci showed promising discriminative power. Our findings suggested the LASSO regression rather than the APD approach is more effective in the determination of possible discriminative VNTR loci set to precise discrimination of our studied Mtb population and may be beneficial to be used in finding reduced number loci sets in other Mtb genotypes or sublineages. Moreover, we proposed customized genotype-specific MIRU-VNTR loci sets based on the LASSO regression and APD approaches for precise Mtb strains identification. As the proposed VNTR sets offered a comparable discriminatory power to the standard 24 MIRU-VNTR loci set could be promising alternatives to the standard genotyping for using in resource-limited settings.

Introduction

Understanding and monitoring the patterns and dynamics in the population structure of Mycobacterium tuberculosis (Mtb), as one of the deadly infectious agents, is imperative for epidemiological and biological purposes. Molecular typing methods play an important role in achieving this purpose and provide powerful tools to measure local and global tuberculosis (TB) surveillance (Allix-Béguec et al., 2008; Hachisu et al., 2013). Over the previous decade, different polymerase chain reaction (PCR)-based genotyping methods, such as the 24-loci mycobacterial interspersed repetitive unit-variable number tandem repeat (MIRU-VNTR) typing and Spoligotyping have been widely used as valuable tools for identification of Mtb strain (Allix-Béguec et al., 2008; Ei et al., 2016). Although Whole-Genome Sequencing (WGS) has been suggested as a proxy to trace Mtb transmission and provides a comprehensive identification; however, due to several limitations have not been in resource-limited settings compared with PCR-based methods (Satta et al., 2018).

MIRU-VNTR typing is considered a promising genotyping method, providing a comparable or superior resolution compared with other PCR-based genotyping methods like IS6110-RFLP and Spoligotyping depending on the selected VNTR loci set (De Beer et al., 2012; de Beer et al., 2013). Generally, MIRU-VNTR loci are selected based on the discrimination power of each locus, which can result in genotyping errors for different Mtb families due to the different genetic backgrounds of the populations and reduce discrimination power based on specific genotypes (Murase et al., 2008; Luo et al., 2014). It has been proposed that an optimized locus set based on the accumulation of the percentage differences (APDs) approach can efficiently discriminate specific genotypes (Beijing, NEW-1, and CAS) (Pan et al., 2017; Hadifar et al., 2019a). Accordingly, in this study, we adopted another approach to investigate the capacity and efficacy of APD compared with the least absolute shrinkage and selection operator (LASSO) regression approach to identify a customized genotype-specific MIRU-VNTR loci set, which provides a resolution comparable to 24-loci VNTR in divergent Mtb population.

Materials and methods

Clinical isolates collection

A total of 306 Mtb isolates were randomly collected from who had been diagnosed with pulmonary TB from January 2017 to January 2019. All patients had no history of anti-TB treatment. None of the samples were collected from the same patient. In addition, all Mtb isolates were confirmed by PCR for IS6110. Culture-negative pulmonary TB cases were also excluded. This study was approved by the Ethics Committee of the Pasteur Institute of Iran. All experiments were performed in accordance with guidelines approved by Pasteur Institute International networks. Ethical reviews and informed consent approval were granted by the Ethical Committee of the Pasteur Institute of Iran. Informed consent was obtained from all patients in this study. The results of this research did not influence TB patients’ treatment.

Twenty-four-loci mycobacterial interspersed repetitive unit-variable number tandem repeat typing

The extraction of Mtb genomic DNA was performed by a PROBA-NK DNA extraction kit (DNA Technology Company, Moscow, Russia) according to the manufacturer’s instructions and was used for the 24-loci MIRU-VNTR genotyping method as described previously (Supply et al., 2006).

Spoligotyping

Spoligotyping commercial kit (Mapmygenome Genomics Company, India) was used for analyzing all Mtb isolates based on the Spoligotyping protocol described by Kamerbeek et al. (1997).

Statistical analysis

Mtb genotypes were initially identified with the MIRU-VNTR technique and confirmed with the Spoligotyping method. The MIRU-VNTR and Spoligotyping data were analyzed by the MIRU-VNTRplus web server1 and SITVIT-WEB online tool2 (Demay et al., 2012), respectively. In addition, TBminer databases (Azé et al., 2015) were used to analyze and confirm the results.

Simpson’s diversity index for each VNTR locus and VNTR loci in combination, as described by Hunter and Gaston (1988), was calculated using the online tool at http://www.comparingpartitions.info/index.php. Based on the Hunter-Gaston discriminatory index (HGDI) score, each locus showed a high (h > 0.6), moderate (0.3 ≤ h ≤ 0.6), and poor (h < 0.3) discriminative power (Sola et al., 2003).

The data set consisted of variables with more than 30 percent missing values. We checked the mechanisms of missing values which were completely at random. Therefore, the missing values were imputed using Multiple Imputations via the Chained Equations (MICE) algorithm. LASSO regression method was used to provide a reduced, and hence, more cost-effective set of loci that could predict the Mtb genotypes with reasonable accuracy. The LASSO regression model, which is acknowledged to be suitable for high dimensional data regression analysis (Sauerbrei et al., 2007), is a regularization-based regression model that uses shrinkage. The lasso shrinks the coefficient estimates toward zero. It performs as a variable selection method and adds a penalty term (L1) to the cost function. Indeed, in lasso regression, we aim to find estimates of the coefficients β to βp that minimize RSS + λ*j=1p|βj| where RSS is the residual sum of squares. So, if λ = 0, then we have regular ordinary least squares regression. Accordingly, LASSO regression was performed to select the optimal set of VNTR loci for predicting Mtb genotypes.

Afterward, the predicted values, sensitivity, and specificity indices were computed and used for comparison with APD method. Sensitivity and specificity were used to evaluate the accuracy of a test that predicts dichotomous outcomes (e.g., logistic regression). Sensitivity evaluates model performance and measures a model’s capability to identify positive examples correctly. When sensitivity is used to evaluate model performance, it is often compared to specificity. Specificity measures the proportion of true negatives that the model correctly identifies. We assessed internal and external validity to validate the performance of LASSO regression. The fivefold cross-validation (CV) was used for evaluating internal validity. This approach randomly divides the data 5 times as training and test samples and runs the analysis on each fold. Then the sensitivity and specificity indices were calculated. External validity was tested by defining our dataset as a train set and an additional source of data as a test set, including the collection of 225 strains with their 24 MIRU-VNTR loci and genotype assignments provided from the MIRU-VNTRplus reference database (see text footnote 1) and a part of data from two previous studies (Homolka et al., 2008; Shuaib et al., 2022; Supplementary Table 2). Then the sensitivity and specificity indices were also calculated. Analyses were performed in R software.

To APD calculation in each VNTR locus, firstly, we categorized our MIRU-VNTR data into eight data groups, including Beijing and Non-Beijing, CAS and Non-CA, NEW-1 and Non-NEW-1, Haarlem and Non-Haarlem, Cameroon and Non- Cameroon, LAM and Non-LAM, T and Non-T, and Ural1 and Non-Ural1. Then, the frequency of repeat numbers (patterns) at any VNTR locus was determined in both groups of each category. Finally, the APD value for each VNTR locus in every category was calculated as follows:

A P D = 1 / 2 J = 1 S | A j - B j |

Where s is the total number of distinct repeat patterns, Aj (Bj) is the percentage of A (B) category strains with the jth repeat pattern from all the strains of the A (B) category.

Results

According to results of the standard 24-loci MIRU-VNTR genotyping with TBminer database, we identified eight genotypes as follows: 101 CAS1-Delhi (33.00%), 88 NEW-1 (28.76%), 56 Beijing (18.30%), followed by 16 isolates as Haarlem (5.22%), 16 isolates as Cameroon (5.22%), 12 isolates as T (3.92%), 9 isolates as Ural1 (2.94%), and 8 isolates as LAM genotype (2.61%). The Spoligotyping method showed a similar result to MIRU-VNTR data (Supplementary Table 1). HGDI of each VNTR locus was calculated for each genotype. According to the global HGDI, 8 VNTR loci showed a high discriminatory power including QUB26 (4052), MIRU10 (960), MIRU26 (2996), ETRA (2165), MTUB04 (424), MTUB21 (1955), MIRU16 (1644), and QUB11B (2163b); while VNTR MIRU24 (2687), MTUB29 (2347), MIRU02 (154), MIRU27 (3007), MIRU04 (580), and MIRU20 (2059) were poorly discriminative (HGDI < 0.3).

We previously defined a combination of loci for specific genotypes (NEW-1 and CAS1-Delhi) based on APD (Hadifar et al., 2019a). Here, we examined an additional strategy to select the customized genotype-specific MIRU-VNTR loci set based on LASSO regression and APD to improve the power of differentiation in the Mtb population. Besides, HGDI, sensitivity, and specificity of each selected loci set were investigated based on the strategies. As shown in Table 1, selected loci based on LASSO regression approach showed a better discriminatory power for identifying all the studied genotypes (especially in locally dominant Mtb genotypes) compared with those based on the APD approach except for T genotype, which APD-based loci showed the better result. Besides, external and internal validity results for the LASSO regression approach showed proper and comparable sensitivity and specificity in most of the studied genotypes (Table 1).

TABLE 1
www.frontiersin.org

Table 1. HGDI values of proposed genotype-specific MIRU-VNTR loci based on APD and LASSO regression approaches.

In the CAS-Delhi population, a combination of five VNTR loci [ETRC (577), MIRU10 (960), MIRU16 (1644), MIRU26 (2996), and MTUB21 (1955)] was found as a proper set based on the descending order of the APD values (Table 2). According to the LASSO regression value, five VNTR loci [ETRC (577), MIRU10 (960), MIRU16 (1644), MIRU26 (2996), MIRU39 (4348)] were identified as the customized genotype-specific MIRU-VNTR loci. Four out of five LASSO regression-based loci were similar to APD-based loci [VNTR ETRC (577), MIRU10 (960), MIRU16 (1644), and MIRU26 (2996)].

TABLE 2
www.frontiersin.org

Table 2. The APDs of 24 loci VNTR in all the studied genotypes.

The VNTR loci set obtained based on LASSO regression showed the similar level of discrimination as the 24 loci reference method (HGDI: 0.956). Besides, the estimated sensitivity and specificity of the selected loci set based on LASSO regression were more significant compared with the APD results (Table 1).

In the NEW-1 population, the VNTR loci set of ETRB (2461), MIRU10 (960), MTUB34 (3171), MTUB04 (424), and ETRA (2165) were identified based on the APD approach. The APD value of the selected loci was more than 59% (Table 2). Based on the LASSO regression approach, the VNTR loci set of MIRU10 (960), MTUB21 (1955), ETRB (2461), MTUB34 (3171), and MIRU31 (3192) were identified as a customized loci set. According to the HGDI value of each set, the loci set obtained based on LASSO regression exhibited the promising discriminative power in this genotype (HGDI: 0.943). The specificity and sensitivity values of the LASSO-based loci also were more significant compared with the APD results (Table 1).

In the Beijing population, as shown in Tables 1, 2, the effective VNTR loci for the discrimination of the genotype had an APD value of more than 50% consistent with our previous result (Hadifar et al., 2019a). Based on LASSO regression, MTUB21 (1955), QUB11B (2163b), ETRA (2165), MTUB30 (2401), and MIRU31 (3192) loci were identified as a customized loci set, for identification of Beijing genotype with a promising discriminatory value (HGDI: 0.934).

As depicted in Table 1, in other genotypes, including Haarlem, Cameroon, LAM, and Ural1, the LASSO-based loci set showed a higher HGDI value than the APD-based loci set. The values of sensitivity and specificity exhibited a variation between the two strategies in each genotype. In T genotype, HGDI value obtained based on APD approach showed promising discriminative power.

Discussion

The 24-loci MIRU–VNTR genotyping has been used as an international standard method for Mtb genotyping. However, regarding a specific strain background and polymorphism which can affect the resolution of discrimination, various studies in the different geographical regions have tried to explore optimized VNTR sets for improving the standard typing method and offered different VNTRs combinations (Ali et al., 2007; Asante-Poku et al., 2014). In addition, previous studies have shown that a combination of VNTR loci selected based on the discrimination power method (HGDI) displayed a low capacity for a precise classification in endemic genotypes (Pan et al., 2017; Hadifar et al., 2019a). In this regard, we used two different approaches, including APD and LASSO regression to identify an optimized VNTR set in our studied Mtb population. We first established a specific genotype panel based on two utilized methods. In the next step, a comparison of the HGDI of each VNTR set with HGDI, sensitivity, and specificity of the reference method showed that LASSO regression compared with APD was more effective in discrimination of the studied Mtb genotype, especially for CAS, NEW-1, and Beijing genotypes.

Different studies showed that there is a remarkable genotypic diversity among Mtb population in Iran (Merza et al., 2010; Mansoori et al., 2018; Hadifar et al., 2019b,2021; Kochkaksaraei et al., 2019; Vaziri et al., 2019). According to the reports, certain genotypes, such as CAS, NEW-1, and Beijing are more medically important than other genotypes in this geographical region. These genotypes showed a high capacity for drug resistance acquisition (Sinkov et al., 2016). A more recent meta-analysis study using MIRU-VNTR typing and Spoligotyping data demonstrated that NEW-1 and CAS are the most dominant Mtb genotypes in Iran. Besides, Beijing was shown as the prominent genotype circulating among the multidrug-resistant Mtb population (Hadifar et al., 2021). Based on this diversity, an in-depth study for the precise identification of Mtb genotypes can be helpful for better management and control of their infection.

Some MIRU-VNTR-based studies have suggested a set of loci including MIRU10 (960), MIRU16 (1644), MIRU26 (2996), MIRU27 (3007), MIRU31 (3192), MIRU39 (4348), MIRU40 (802), MTUB21 (1955), and QUB26 (4052) are more discriminative in the standard 24 MIRU-VNTR loci panel for identification of different Mtb isolates circulating in Iran (Zamani et al., 2016; Mansoori et al., 2018). However, the described loci were also offered based on HGDI, which previously showed that selecting VNTR loci based on this index displayed a low capacity for a precise classification in endemic genotypes (Pan et al., 2017; Hadifar et al., 2019a).

We have previously reported that using the APD approach may facilitate efficient discrimination and clustering of NEW-1 and CAS populations (Hadifar et al., 2019a). In the present study, 5 optimized VNTR loci namely, ETRC (577), MIRU10 (960), MIRU16 (1644), MIRU26 (2996), and MIRU39 (4348) (with a sensitivity of 98% and a specificity of 93.7% in internal validity and sensitivity of 100%, and a specificity of 94,.4% in external validity) and five optimized loci, including ETRC (577), MIRU10 (960), MIRU16 (1644), MIRU26 (2996), and MTUB21 (1955) (with a sensitivity of 83.1% and a specificity of 99.5%) were identified based on LASSO regression and APD approaches for CAS genotype, respectively. The APD result was consistent with our previous study (Hadifar et al., 2019a), in which VNTR loci with APDs of more than 50% could efficiently discriminate the CAS and non-CAS genotypes. In addition, the selected loci were similar to those reported in our previous study (Hadifar et al., 2019a). Four VNTR loci, including ETRC (577), MIRU10 (960), MIRU16 (1644), and MIRU26 (2996) were also found in LASSO analysis. In a study by Rasoahanitralisoa et al. (2017) a combination of ten VNTR loci [MIRU10 (960), QUB4156 (4156), MTUB04 (424), MIRU16 (1644), MIRU40 (802), ETRC (577), MTUB39 (3690), QUB26 (4052), MIRU31 (3192), and QUB11B (2163b)] have been proposed as discriminative loci for CAS genotype combined principal component analysis (PCA) and HGDI methods, which only five loci (QUB4156 (4156), MIRU10 (960), MIRU16 (1644), MIRU40 (802), and MTUB04 (424) of the panel have been selected based on the PCA method. Overall, these findings suggest that the combination of VNTR ETRC (577), MIRU10 (960), and MIRU16 (1644), might be considered a preliminary marker that facilitates the identification of the CAS population with higher discriminatory power. In addition, we found the LASSO regression-based VNTR selection process offers a more helpful strategy for precise discrimination of CAS from the non-CAS population in different settings.

In the NEW-1 population, as a member of the Euro-American super lineage, five combinations of VNTR loci based on APD and five combinations of VNTR loci based on LASSO were found with a high capacity for efficient discrimination. In addition, a combination of VNTR MIRU10 (960), ETRB (2461), and MTUB34 (3171) which were similar between the two methods could be used a primary set for identifying the NEW-1 family. The efficient implementation of these methods for genotype data is helpful for the effective selection of VNTR loci for NEW-1 discrimination. However, based on the specificity of each selected set, we can suggest that using LASSO may be more effective than APD for discrimination of the NEW-1 population.

In Beijing population, as the third dominant genotype, a set of loci, which can more effectively classify this genotype were obtained based on the LASSO method, including VNTR MTUB21 (1955), QUB11B (2163b), ETRA (2165), MTUB30 (2401), and MIRU31 (3192). (With a sensitivity of 96.4% and a specificity of 98.4% in internal validity and sensitivity of 100%, and a specificity of 83.9% in external validity, HGDI: 0.934). Five loci set selected based on APD method also showed the high discriminatory power in this genotype (with a sensitivity of 80.5% and a specificity of 98.9%, HGDI: 0.876). In a previous study, 12 optimized loci were suggested for discrimination of Beijing genotype based on the combined PCA and HGDI methods. Among these loci, only five loci [QUB11B (2163b), MIRU26 (2996), QUB26 (4052), MTUB30 (2401), and ETRC (577)] were selected based on the PCA result (Rasoahanitralisoa et al., 2017). A different combination of loci has been described in Pan et al. (2017) study based on the APD method. It can be speculated that these discordances reflect the difference in the specific circulating sub-lineage, geographical region, VNTR loci to identify a suitable set, and sample size.

Other genotypes identified in our study were a part of the Euro-American super lineage, which is the most widely dispersed lineage. Different customized VNTR loci sets were offered based on both applied approaches to high effective discrimination of these groups. However, the selected loci sets based on both methods were reliable and promising in most of these genotypes, based on HGDI value, LASSO-based VNTR loci were more discriminative than the APD-based loci. Besides, no relevant study was found on the discriminative power of the loci combination based on APD, LASSO, and/or other statistical methods for Haarlem, Cameroon, LAM, and Ural1. Further investigations are crucial to accredit the efficiency of our implemented approaches in selecting discriminative loci in the other main Mtb genotypes and some genotypes in our study with small sample size like T. Another limitation of this study is that selection of VNTR loci was based on the predefined panel for standard 24 loci MIRU-VNTR typing, so evaluating additional loci could be beneficial to improve the efficiency of the selected customized loci set in discrimination of Mtb isolates in different settings. In addition, more Mtb genotypes should have been included in the current study.

Conclusion

In summary, we proposed customized genotype-specific MIRU-VNTR loci sets based on the LASSO regression and APD approaches for precise Mtb strains identification. As the proposed VNTR set offered a comparable discriminatory power to the standard 24 MIRU-VNTR loci set could be promising alternatives to the 24-loci MIRU–VNTR genotyping for using in resource-limited-settings. Moreover, our findings suggested the LASSO regression rather than the APD approach is more effective in the determination of possible discriminative VNTR loci set to precise discrimination of our studied Mtb population and may be beneficial to be used in finding reduced number loci set in other Mtb genotypes or sublineages.

Data availability statement

The original contributions presented in this study are included in the article/Supplementary material, further inquiries can be directed to the corresponding author/s.

Ethics statement

The studies involving human participants were reviewed and approved by Pasteur Institute of Iran. The patients/participants provided their written informed consent to participate in this study.

Author contributions

SH and FV wrote the manuscript. FV supervised the project. FV, SDS, and AF designed the project. SH, NE, and MKK performed laboratory work. SE, MN, and MAKK performed the statistical analysis. All authors have read and approved the final manuscript.

Funding

This work was supported by the Iran National Science Foundation (INSF) (project no. 9800097). The funding body had no role in the design of the study and collection, analysis, and interpretation of data and in writing the manuscript.

Acknowledgments

We would like to thank all the personnel of Mycobacteriology and Pulmonary Research Department of Pasteur Institute of Iran for their assistance.

Conflict of interest

The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.

Publisher’s note

All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article, or claim that may be made by its manufacturer, is not guaranteed or endorsed by the publisher.

Supplementary material

The Supplementary Material for this article can be found online at: https://www.frontiersin.org/articles/10.3389/fmicb.2022.978355/full#supplementary-material

Footnotes

  1. ^ http://www.miru-vntrplus.org
  2. ^ http://www.pasteur-guadeloupe.fr:8081/SITVIT_ONLINE/index.jsp

References

Ali, A., Hasan, Z., Tanveer, M., Siddiqui, A. R., Ghebremichael, S., Kallenius, G., et al. (2007). Characterization of Mycobacterium tuberculosis Central Asian Strain1 using mycobacterial interspersed repetitive unit genotyping. BMC Microbiol. 7:76. doi: 10.1186/1471-2180-7-76

PubMed Abstract | CrossRef Full Text | Google Scholar

Allix-Béguec, C., Fauville-Dufaux, M., and Supply, P. (2008). Three-year population-based evaluation of standardized mycobacterial interspersed repetitive-unit-variable-number tandem-repeat typing of Mycobacterium tuberculosis. J. Clin. Microbiol. 46, 1398–1406. doi: 10.1128/JCM.02089-07

PubMed Abstract | CrossRef Full Text | Google Scholar

Asante-Poku, A., Nyaho, M. S., Borrell, S., Comas, I., Gagneux, S., and Yeboah-Manu, D. (2014). Evaluation of customised lineage-specific sets of MIRU-VNTR loci for genotyping Mycobacterium tuberculosis complex isolates in Ghana. PLoS One. 9:e92675. doi: 10.1371/journal.pone.0092675

PubMed Abstract | CrossRef Full Text | Google Scholar

Azé, J., Sola, C., Zhang, J., Lafosse-Marin, F., Yasmin, M., Siddiqui, R., et al. (2015). Genomics and Machine Learning for Taxonomy Consensus: The Mycobacterium tuberculosis Complex Paradigm. PLoS One. 10:e0130912. doi: 10.1371/journal.pone.0130912

PubMed Abstract | CrossRef Full Text | Google Scholar

De Beer, J. L., Kremer, K., Ködmön, C., Supply, P., and Van Soolingen, D., and Global Network for the Molecular Surveillance of Tuberculosis 2009. (2012). First worldwide proficiency study on variable-number tandem-repeat typing of Mycobacterium tuberculosis complex strains. J. Clin. Microbiol. 50, 662–669.

Google Scholar

de Beer, J. L., van Ingen, J., de Vries, G., Erkens, C., Sebek, M., Mulder, A., et al. (2013). Comparative study of IS6110 RFLP and VNTR typing of Mycobacterium tuberculosis in the Netherlands, based on a five year nationwide survey. J. Clin. Microbiol. 51, 1193–1198. doi: 10.1128/JCM.03061-12

PubMed Abstract | CrossRef Full Text | Google Scholar

Demay, C., Liens, B., Burguière, T., Hill, V., Couvin, D., Millet, J., et al. (2012). SITVITWEB–a publicly available international multimarker database for studying Mycobacterium tuberculosis genetic diversity and molecular epidemiology. Infect. Genet. Evol. 12, 755–766. doi: 10.1016/j.meegid.2012.02.004

PubMed Abstract | CrossRef Full Text | Google Scholar

Ei, P. W., Aung, W. W., Lee, J. S., Choi, G.-E., and Chang, C. L. (2016). Molecular strain typing of Mycobacterium tuberculosis: a review of frequently used methods. J. Korean Med. Sci. 31, 1673–1683. doi: 10.3346/jkms.2016.31.11.1673

PubMed Abstract | CrossRef Full Text | Google Scholar

Hachisu, Y., Hashimoto, R., Kishida, K., and Yokoyama, E. (2013). Analytical procedure of variable number of tandem repeats (VNTR) analysis and effective use of analysis results for tuberculosis control. Rinsho Byori 61, 1123–1135.

PubMed Abstract | Google Scholar

Hadifar, S., Fateh, A., Pourbarkhordar, V., Siadat, S. D., Mostafaei, S., and Vaziri, F. (2021). Variation in Mycobacterium tuberculosis population structure in Iran: a systemic review and meta-analysis. BMC Infect Dis. 21:2. doi: 10.1186/s12879-020-05639-7

PubMed Abstract | CrossRef Full Text | Google Scholar

Hadifar, S., Kamakoli, M. K., Fateh, A., Siadat, S. D., and Vaziri, F. (2019a). Enhancing the differentiation of specific genotypes in Mycobacterium tuberculosis population. Sci. Rep. 9, 1–9. doi: 10.1038/s41598-019-54393-7

PubMed Abstract | CrossRef Full Text | Google Scholar

Hadifar, S., Shamkhali, L., Kamakoli, M. K., Mostafaei, S., Khanipour, S., Mansoori, N., et al. (2019b). Genetic diversity of Mycobacterium tuberculosis isolates causing pulmonary and extrapulmonary tuberculosis in the capital of Iran. Mol. Phylogenet Evol. 132, 46–52. doi: 10.1016/j.ympev.2018.11.019

PubMed Abstract | CrossRef Full Text | Google Scholar

Homolka, S., Post, E., Oberhauser, B., George, A. G., Westman, L., Dafae, F., et al. (2008). High genetic diversity among Mycobacterium tuberculosis complex strains from Sierra Leone. BMC Microbiol. 8:103. doi: 10.1186/1471-2180-8-103

PubMed Abstract | CrossRef Full Text | Google Scholar

Hunter, P. R., and Gaston, M. A. (1988). Numerical index of the discriminatory ability of typing systems: an application of Simpson’s index of diversity. J. Clin. Microbiol. 26, 2465–2466. doi: 10.1128/jcm.26.11.2465-2466.1988

PubMed Abstract | CrossRef Full Text | Google Scholar

Kamerbeek, J., Schouls, L., Kolk, A., Van Agterveld, M., Van Soolingen, D., Kuijper, S., et al. (1997). Simultaneous detection and strain differentiation of Mycobacterium tuberculosis for diagnosis and epidemiology. J. Clin. Microbiol. 35, 907–914.

Google Scholar

Kochkaksaraei, M. B., Kaboosi, H., and Ghaemi, E. A. (2019). Genetic variation of the Mycobacterium tuberculosis in north of Iran; the Golestan Province. Iranian Red Crescent Med. J. 21, 1–9.

Google Scholar

Luo, T., Yang, C., Pang, Y., Zhao, Y., Mei, J., and Gao, Q. (2014). Development of a hierarchical variable-number tandem repeat typing scheme for Mycobacterium tuberculosis in China. PLoS One 9:e89726. doi: 10.1371/journal.pone.0089726

PubMed Abstract | CrossRef Full Text | Google Scholar

Mansoori, N., Yaseri, M., Vaziri, F., and Douraghi, M. (2018). Genetic diversity of Mycobacterium tuberculosis complex isolates circulating in an area with high tuberculosis incidence: Using 24-locus MIRU-VNTR method. Tuberculosis 112, 89–97. doi: 10.1016/j.tube.2018.08.003

PubMed Abstract | CrossRef Full Text | Google Scholar

Merza, M. A., Farnia, P., Salih, A. M., Masjedi, M. R., and Velayati, A. A. (2010). The most predominant spoligopatterns of Mycobacterium tuberculosis isolates among Iranian, Afghan-immigrant, Pakistani and Turkish tuberculosis patients: a comparative analysis. Chemotherapy 56, 248–257. doi: 10.1159/000316846

PubMed Abstract | CrossRef Full Text | Google Scholar

Murase, Y., Mitarai, S., Sugawara, I., Kato, S., and Maeda, S. (2008). Promising loci of variable numbers of tandem repeats for typing Beijing family Mycobacterium tuberculosis. J. Med. Microbiol. 57, 873–880. doi: 10.1099/jmm.0.47564-0

PubMed Abstract | CrossRef Full Text | Google Scholar

Pan, X.-L., Zhang, C.-L., Nakajima, C., Fu, J., Shao, C.-X., Zhao, L.-N., et al. (2017). A quantitative and efficient approach to select MIRU–VNTR loci based on accumulation of the percentage differences of strains for discriminating divergent Mycobacterium tuberculosis sublineages. Emerg. Microb. Infect. 6, 1–8. doi: 10.1038/emi.2017.58

PubMed Abstract | CrossRef Full Text | Google Scholar

Rasoahanitralisoa, R., Rakotosamimanana, N., Stucki, D., Sola, C., Gagneux, S., and Razanamparany, V. R. (2017). Evaluation of spoligotyping, SNPs and customised MIRU-VNTR combination for genotyping Mycobacterium tuberculosis clinical isolates in Madagascar. PLoS One. 12:e0186088. doi: 10.1371/journal.pone.0186088

PubMed Abstract | CrossRef Full Text | Google Scholar

Satta, G., Lipman, M., Smith, G., Arnold, C., Kon, O., and McHugh, T. (2018). Mycobacterium tuberculosis and whole-genome sequencing: how close are we to unleashing its full potential? Clin. Microbiol. Infect. 24, 604–609. doi: 10.1016/j.cmi.2017.10.030

PubMed Abstract | CrossRef Full Text | Google Scholar

Sauerbrei, W., Royston, P., and Binder, H. (2007). Selection of important variables and determination of functional form for continuous predictors in multivariable model building. Stat. Med. 26, 5512–5528. doi: 10.1002/sim.3148

PubMed Abstract | CrossRef Full Text | Google Scholar

Shuaib, Y. A., Utpatel, C., Kohl, T. A., Barilar, I., Diricks, M., Ashraf, N., et al. (2022). Origin and Global Expansion of Mycobacterium tuberculosis Complex Lineage 3. Genes 13:990. doi: 10.3390/genes13060990

PubMed Abstract | CrossRef Full Text | Google Scholar

Sinkov, V., Ogarkov, O., Mokro, I., and Zhdanova, S. (2016). Evolutionary significance of non-synonymous substitution for Mycobacterium tuberculosis of Ural genotype. Mol. Med. 14, 44–50.

Google Scholar

Sola, C., Filliol, I., Legrand, E., Lesjean, S., Locht, C., Supply, P., et al. (2003). Genotyping of the Mycobacterium tuberculosis complex using MIRUs: association with VNTR and spoligotyping for molecular epidemiology and evolutionary genetics. Infect. Genet. Evol. 3, 125–133. doi: 10.1016/s1567-1348(03)00011-x

CrossRef Full Text | Google Scholar

Supply, P., Allix, C., Lesjean, S., Cardoso-Oelemann, M., Rüsch-Gerdes, S., Willery, E., et al. (2006). Proposal for standardization of optimized mycobacterial interspersed repetitive unit-variable-number tandem repeat typing of Mycobacterium tuberculosis. J. Clin. Microbiol. 44, 4498–4510. doi: 10.1128/JCM.01392-06

PubMed Abstract | CrossRef Full Text | Google Scholar

Vaziri, F., Kohl, T. A., Ghajavand, H., Kamakoli, M. K., Merker, M., Hadifar, S., et al. (2019). Genetic diversity of multi-and extensively drug-resistant Mycobacterium tuberculosis isolates in the capital of Iran, revealed by whole-genome sequencing. J. Clin. Microbiol. 57, e1477–e1418. doi: 10.1128/JCM.01477-18

PubMed Abstract | CrossRef Full Text | Google Scholar

Zamani, S., Haeili, M., Nasiri, M. J., Imani Fooladi, A. A., Javadpour, S., and Feizabadi, M. M. (2016). Genotyping of Mycobacterium tuberculosis isolates from Hormozgan province of Iran based on 15-locus MIRU-VNTR and spoligotyping. Int J. Bacteriol. 2016:7146470. doi: 10.1155/2016/7146470

PubMed Abstract | CrossRef Full Text | Google Scholar

Keywords: Mycobacterium tuberculosis, MIRU-VNTR, LASSO regression, accumulation of the percentage differences, genotyping

Citation: Hadifar S, Kargarpour Kamakoli M, Eybpoosh S, Nakhaeizadeh M, Kargarpour Kamakoli MA, Ebrahimifard N, Fateh A, Siadat SD and Vaziri F (2022) The shortcut of mycobacterial interspersed repetitive unit-variable number tandem repeat typing for Mycobacterium tuberculosis differentiation. Front. Microbiol. 13:978355. doi: 10.3389/fmicb.2022.978355

Received: 25 June 2022; Accepted: 18 August 2022;
Published: 08 September 2022.

Edited by:

Sukhendu Mandal, University of Calcutta, India

Reviewed by:

Ana Pombo Botelho, Instituto Nacional Investigaciao Agraria e Veterinaria (INIAV), Portugal
Yih-Yuan Chen, National Chiayi University, Taiwan

Copyright © 2022 Hadifar, Kargarpour Kamakoli, Eybpoosh, Nakhaeizadeh, Kargarpour Kamakoli, Ebrahimifard, Fateh, Siadat and Vaziri. This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.

*Correspondence: Farzam Vaziri, ZmFyemFtX3ZhemlyaUB5YWhvby5jb20=, Zl92YXppcmlAcGFzdGV1ci5hYy5pcg==

Disclaimer: All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article or claim that may be made by its manufacturer is not guaranteed or endorsed by the publisher.