AUTHOR=Castaneda Everest Uriel , Baker Erich J. TITLE=KNeXT: a NetworkX-based topologically relevant KEGG parser JOURNAL=Frontiers in Genetics VOLUME=15 YEAR=2024 URL=https://www.frontiersin.org/journals/genetics/articles/10.3389/fgene.2024.1292394 DOI=10.3389/fgene.2024.1292394 ISSN=1664-8021 ABSTRACT=
Automating the recreation of gene and mixed gene-compound networks from Kyoto Encyclopedia of Genes and Genomes (KEGG) Markup Language (KGML) files is challenging because the data structure does not preserve the independent or loosely connected neighborhoods in which they were originally derived, referred to here as its topological environment. Identical accession numbers may overlap, causing neighborhoods to artificially collapse based on duplicated identifiers. This causes current parsers to create misleading or erroneous graphical representations when mixed gene networks are converted to gene-only networks. To overcome these challenges we created a python-based KEGG NetworkX Topological (KNeXT) parser that allows users to accurately recapitulate genetic networks and mixed networks from KGML map data. The software, archived as a python package index (PyPI) file to ensure broad application, is designed to ingest KGML files through built-in APIs and dynamically create high-fidelity topological representations. The utilization of NetworkX’s framework to generate tab-separated files additionally ensures that KNeXT results may be imported into other graph frameworks and maintain programmatic access to the original