AUTHOR=Iddamalgoda Lahiru , Das Partha S. , Aponso Achala , Sundararajan Vijayaraghava S. , Suravajhala Prashanth , Valadi Jayaraman K. TITLE=Data Mining and Pattern Recognition Models for Identifying Inherited Diseases: Challenges and Implications JOURNAL=Frontiers in Genetics VOLUME=7 YEAR=2016 URL=https://www.frontiersin.org/journals/genetics/articles/10.3389/fgene.2016.00136 DOI=10.3389/fgene.2016.00136 ISSN=1664-8021 ABSTRACT=
Data mining and pattern recognition methods reveal interesting findings in genetic studies, especially on how the genetic makeup is associated with inherited diseases. Although researchers have proposed various data mining models for biomedical approaches, there remains a challenge in accurately prioritizing the single nucleotide polymorphisms (SNP) associated with the disease. In this commentary, we review the state-of-art data mining and pattern recognition models for identifying inherited diseases and deliberate the need of binary classification- and scoring-based prioritization methods in determining causal variants. While we discuss the pros and cons associated with these methods known, we argue that the gene prioritization methods and the protein interaction (PPI) methods in conjunction with the K nearest neighbors' could be used in accurately categorizing the genetic factors in disease causation.