Tuberculosis (TB), caused by
Blood samples from patients with newly diagnosed tuberculosis and healthy controls (HC) were utilized in this study. Examining methylation microarray data from 40 whole blood samples (22TB + 18HC), we employed two procedures: signature gene methylated position analysis and signature region methylated position analysis to pinpoint distinctive methylated positions. Based on the screening results, diagnostic classifiers are constructed through machine learning, and validation was conducted through pyrosequencing in a separate queue (22TB + 18HC). Culminating in the development of a new tuberculosis diagnostic method via quantitative real-time methylation specific PCR (qMSP).
The combination of the two procedures revealed a total of 10 methylated positions, all of which were located in the promoter region. These 10 signature methylated positions facilitated the construction of a diagnostic classifier, exhibiting robust diagnostic accuracy in both cross-validation and external test sets. The LDA model demonstrated the best classification performance, achieving an AUC of 0.83, specificity of 0.8, and sensitivity of 0.86 on the external test set. Furthermore, the validation of signature methylated positions through pyrosequencing demonstrated high agreement with screening outcomes. Additionally, qMSP detection of 2 potential hypomethylated positions (cg04552852 and cg12464638) exhibited promising results, yielding an AUC of 0.794, specificity of 0.720, and sensitivity of 0.816.
Our study demonstrates that the validated signature methylated positions through pyrosequencing emerge as plausible biomarkers for tuberculosis diagnosis. The specific methylation markers in the