MycoWiki: Functional annotation of the minimal model organism Mycoplasma pneumoniae

Elfmann, Christoph; Zhu, Bingyao; Pedreira, Tiago; Hoßbach, Ben; Lluch-Senar, Maria; Serrano, Luis; Stülke, Jörg

doi:10.3389/fmicb.2022.935066

ORIGINAL RESEARCH article

Front. Microbiol., 25 July 2022

Sec. Evolutionary and Genomic Microbiology

Volume 13 - 2022 | https://doi.org/10.3389/fmicb.2022.935066

This article is part of the Research TopicMollicutes: From Evolution To Pathogenesis, Volume IIView all 20 articles

MycoWiki: Functional annotation of the minimal model organism Mycoplasma pneumoniae

Christoph Elfmann^‡1†

Bingyao Zhu^1†

Tiago Pedreira^‡1†

Ben Hoßbach¹

Maria Lluch-Senar²

Luis Serrano²

Jörg Stülke^‡1*

¹Department of General Microbiology, Göttingen Center for Molecular Biosciences, Georg-August University Göttingen, Göttingen, Germany
²EMBL/CRG Systems Biology Research Unit, Centre for Genomic Regulation (CRG), Universitat Pompeu Fabra (UPF), Barcelona, Spain

The human pathogen Mycoplasma pneumoniae is viable independently from host cells or organisms, despite its strongly reduced genome with only about 700 protein-coding genes. The investigation of M. pneumoniae can therefore help to obtain general insights concerning the basic requirements for cellular life. Accordingly, M. pneumoniae has become a model organism for systems biology in the past decade. To support the investigation of the components of this minimal bacterium, we have generated the database MycoWiki. (http://mycowiki.uni-goettingen.de) MycoWiki organizes data under a relational database and provides access to curated and state-of-the-art information on the genes and proteins of M. pneumoniae. Interestingly, M. pneumoniae has undergone an evolution that resulted in the limited similarity of many proteins to proteins of model organisms. To facilitate the analysis of the functions of M. pneumoniae proteins, we have integrated structure predictions from the AlphaFold Protein Structure Database for most proteins, structural information resulting from in vivo cross-linking, and protein-protein interactions based on a global in vivo study. MycoWiki is an important tool for the systems and synthetic biology community that will support the comprehensive understanding of a minimal organism and the functional annotation of so far uncharacterized proteins.

Introduction

Bacteria of the genus Mycoplasma are characterized by their strongly reduced genomes that still encode all the functions required for autonomous growth. Bacteria such as Mycoplasma genitalium and Mycoplasma pneumoniae have genomes of only 480 and 816 kb and encode about 480 and 700 proteins, respectively. These small genomes have put these bacteria into the spotlight of systems and synthetic biology, two recent disciplines in biology that aim for a complete understanding of all processes in a living cell up to mathematic modeling and for the creation of artificial forms of life, respectively.

Starting with global analyses of the metabolism, gene expression, and protein-protein interactions in 2009 (Güell et al., 2009; Kühner et al., 2009; Yus et al., 2009), M. pneumoniae has become one of the model organisms of systems biology. Many aspects of its biology such as metabolism, DNA and protein modifications, the micro-proteome, protein degradation, regulatory networks, and gene essentiality have been studied at the global level as well (Schmidl et al., 2010; van Noort et al., 2012; Lluch-Senar et al., 2013, 2015; Wodke et al., 2013; Miravet-Verde et al., 2019; Yus et al., 2019; Burgos et al., 2020; Montero-Blay et al., 2020). The small proteome of M. pneumoniae facilitates the investigation of protein function at the global scale as revealed by the first large-scale global in vivo study of protein-protein interactions. This analysis resulted in the visualization of important protein complexes and in the identification of functions of so far unknown proteins (O’Reilly et al., 2020).

In addition to its role in systems biology, M. pneumoniae is also intensively studied due to its role as a lung pathogen (Meyer Sauteur et al., 2014; Waites et al., 2017; Esposito et al., 2021). Its main virulence determinants are a specific ADP-ribosylating and vacuolating cytotoxin (CARDS, MPN372) (Kannan and Baseman, 2006; Becker et al., 2015), hydrogen peroxide which is produced by glycerol phosphate oxidase (GlpO) as a product of phospholipid and glycerol utilization (Schmidl et al., 2011; Blötz and Stülke, 2017), hydrogen sulfide is produced by the cysteine desulfurase HapE during cysteine degradation (Großhennig et al., 2016), and the immunoglobulin binding protein IbpM helps the bacteria to escape the human immune system (Blötz et al., 2020).

As a minimal pathogen, M. pneumoniae might also be useful in fighting disease by delivering therapeutics to the human host or by directly combatting other bacteria (Piñero-Lambea et al., 2015; Garrido et al., 2021). Such applications are favored by the fact that the genetic code used by M. pneumoniae is unique, thus preventing horizontal gene transfer, and by the development of methods that allow the construction of attenuated strains by deleting the genes that encode virulence factors. Indeed, this strategy has recently been employed to eliminate biofilms of the harmful and often multiresistant human pathogen Staphylococcus aureus (Garrido et al., 2021).

The importance of M. pneumoniae as a human pathogen, as a potential therapeutic agent, and its role in systems and synthetic biology suggests that this bacterium will remain the focus of intense research. This requires tools that allow easy access to all available information on the genes and proteins of M. pneumoniae and their functional and regulatory interactions. To facilitate the investigation of M. pneumoniae, we have developed MycoWiki, a database centered around the genes and proteins of this bacterium. This database shares its framework with the established databases SubtiWiki and SynWiki, which provide functional annotation of Bacillus subtilis and the artificial minimal organism Mycoplasma mycoides JCVI-syn3A, respectively (Pedreira et al., 2022a,b). MycoWiki presents the available information on the genes and proteins of M. pneumoniae in a highly intuitive manner. A particular focus on MycoWiki is the presentation of links and interactions between different genes and proteins, which allows the scientific community to develop novel hypotheses. The information provided in MycoWiki is derived from earlier annotations of the M. pneumoniae genome (Dandekar et al., 2000; Wodke et al., 2015) and the published body of knowledge.

Description of the database

MycoWiki (http://mycowiki.uni-goettingen.de) is built upon the same framework as the aforementioned databases SubtiWiki and SynWiki (Pedreira et al., 2022a,b). As a result, the general organization of data entities and their relations to each other, and the layout of the web pages, are the same. However, some features are exclusive to MycoWiki, such as the representation of cross-linking data combined with protein structures.

The structure of MycoWiki is centered around genes and their products. Most of the information represented in this database is associated with a specific gene/protein, and thus the Gene pages are the core part of MycoWiki. They integrate the most data relating to a particular gene, but also connect to separate web pages, for example, pages on certain groups of genes, such as specific functional categories. The Gene page also links to browsers, which allow exploring some aspects of a gene or the gene product and possible interactions of the encoded protein (such as the Expression, Interaction, and Pathway Browsers).

The front page

The front page of MycoWiki gives access to the Gene pages via a search bar, which can be used to query genes by unique identifiers (Figure 1). One option is to use a gene’s name, usually a mnemonic of three or four letters as it is commonly the case for bacterial genes (such as eno for enolase). Genes can also be identified via their locus tags, which are largely based on genome re-annotations (Dandekar et al., 2000; Lluch-Senar et al., 2015). For example, MPN606 is the locus tag for eno, and it is guaranteed to never change, even if the mnemonic designation of the gene should be changed. In some cases, a name has not been assigned to a gene yet, so the locus tag is the primary identifier. Aside from these two identifiers, a full-text search of a gene’s data is possible via the “Search” button.

FIGURE 1

Figure 1. Front page of MycoWiki. The central search bar allows to query genes by name or locus tag, but also facilitates full-text searches. Below, the data browsers are linked for quick access. In the top bar, direct access to an overview of categories and a list of essential genes is provided, and utility links for jumping to a random gene page and for logging in.

Moreover, the top bar of the front page gives access to an overview of the functional categories each gene/protein was assigned to, a list of essential genes, and to a random gene page. Finally, it allows the user to log into the database. These links also appear on all gene pages in the right-side bar (Figure 2D, see below). Below the search bar, links to the interactive MycoWiki browsers (see below) are provided.

FIGURE 2

Figure 2. The gene page for eno. The general structure of gene pages is the same for all genes, but individual sections and interactive elements may vary due to different information being available. (A) Gene name, strain, and description; (B) table summarizing basic information on the gene and its product; (C) embedded genomic region display; (D) sidebar containing helpful links and additional tools, such as the Structure Viewer; (E) further sections describing various aspects of the gene; and (F) list of publications as sources of information.

The gene pages

In MycoWiki, the Gene pages provide access to all data relating to a particular gene. Most of the annotation can be directly viewed on the page, and links to browsers are provided which investigate certain aspects of the gene in more detail. All gene pages share the same basic structure. Figure 2 shows the page for eno. The top bar (Figure 2A) contains links to the data browsers, the change history of the page, and a log-in pop-up. It also features the search bar, which has the same functionality as the one presented on the front page. At the top of the main view (Figure 2B), the gene name is indicated as the page title. Below, a short general description is displayed, and the name of the M. pneumoniae strain M129 is shown on the side. Next, a table summarizes some basic information about the gene such as the locus tag, function, and sequence information. The latter is accompanied by utility links used to directly BLAST (Sayers et al., 2022) the nucleotide or translated amino acid sequence. This table also features data on the gene’s product, for example, the molecular weight and enzyme commission number of the encoded protein. Further down, an interactive presentation of the genomic region is embedded in the page (Figure 2C), which allows viewing the genomic neighborhood of the gene. It does not feature all of the functionality of the full Genome Browser, which can be accessed via the top bar, and which will be explained below. On the sidebar (Figure 2D), a group of links provides access to helpful pages. Depending on the gene and the available data, additional interactive elements follow: the Structure Viewer shows 3D visualizations of the protein structure and cross-linking data, if available (see below). The Interaction overview displays a graph of the protein-protein interactions between the protein and its interaction partners. Proteins are represented by nodes that can be clicked to open the corresponding gene pages. In addition, the edges, which depict interactions, link to relevant publications.

Further down below on the page (Figure 2E), additional sections shed light on various aspects of the gene/protein, such as assigned categories, genomic coordinates, details about the gene product, and other data. At the end of the page, a list of relevant publications is featured (Figure 2F).

Browsers

MycoWiki and its siblings SubtiWiki and SynWiki feature various browsers, which are interactive, graphical displays that allow users to explore certain types of data in an intuitive manner. When they are accessed via a gene page’s top bar, the data corresponding to that gene are highlighted. However, data on other genes can be easily loaded in the browsers by the use of search bars, which allows the user to compare and contrast information on multiple genes effortlessly. These search bars are located in the top left corner of any browser. MycoWiki currently features four different browsers.

The Genome Browser (Figure 3) allows the user to see the immediate neighborhood of the gene and the orientations and lengths of genes, scroll through the genome, and adjust the zoom level. In addition, it includes the display of DNA and protein sequences. Clicking on a gene displays the corresponding sequences below the interactive genome display, where the user can search for substrings and toggle the reverse complement sequence. Flanking regions of genes or freely defined substrings of the genome can be loaded via the search bar.

FIGURE 3

Figure 3. Genome Browser page with eno selected. The interactive display allows to scroll through the organism’s genome and to load nucleotide and amino acid sequences of genes.

The Pathway Browser visualizes a curated map of metabolic pathways and related metabolites and enzymes for M. pneumoniae. In Figure 4A, the reaction catalyzed by Eno as part of the glycolytic pathway is shown. Clicking on enzymes in the map opens a small pop-up window featuring a basic summary for the corresponding gene/protein. Using the collapsible toolbar, the user can enter a full-screen mode and select enzymes or metabolites to be highlighted.

FIGURE 4

Figure 4. Interaction and Pathway Browser with Eno highlighted. (A) The Pathway Browser features a curated map of metabolic reactions. Selected enzymes and metabolites can be highlighted using the toolbar. (B) The Interaction Browser allows viewing interaction networks of genes. With the toolbar, the size and appearance of the currently viewed network can be adjusted.

Protein-protein interactions are important clues to characterize proteins of unknown function. M. pneumoniae is the first organism for which a global analysis of the in vivo interactome was performed (O’Reilly et al., 2020). The results and the outcome of other more protein-specific studies are displayed in the Interaction Browser. In this browser, networks of interacting proteins can be visualized in a dynamic and interactive manner (Figure 4B). Similar to the corresponding interactive element on the gene page sidebar, proteins and their interactions are represented by nodes and edges of a graph. However, the browser display is more flexible: the user can rearrange nodes by dragging them with the cursor, and other visualization options can be adjusted via the toolbar. More proteins can be included in the display by increasing the radius, and the distance between nodes on the screen is controlled by the spacing setting. In addition, specific proteins can be highlighted, and the color scheme can be adjusted via the “Settings” button. Left-clicking nodes open a summary pop-up window, while right-clicking somewhere on the screen triggers a context menu. The latter features options to export the displayed interaction network as an image or to download the corresponding list of interactions. In the top left corner, an info box displays the currently viewed gene, the radius of the network, and the proportion of proteins contained in the network.

With the Expression Browser (Figure 5), the user can investigate protein and transcript levels (not shown) of genes/proteins under different conditions (Yus et al., 2009; Maier et al., 2011). Additional genes can be dynamically loaded for comparison using the search bar, and descriptions of the individual conditions are available by clicking on the corresponding data points. Options for data export are provided as well.

FIGURE 5

Figure 5. The Expression Browser shows protein levels and transcript levels (not included in the screenshot) for genes under various conditions. Data on additional genes can be loaded via the search bar for comparison. Here, protein levels for Eno are compared with Pyk and Pgm.

Structure viewer

MycoWiki introduces a new 3D protein structure viewer (Figure 6), which is not yet present in either SubtiWiki or SynWiki. It is able to load and display structures from the Protein Data Bank (PDB) (Burley et al., 2019) and structure predictions from the AlphaFold Protein Structure Database (Varadi et al., 2022). In addition, it features the visualization of internal cross-links based on data from a global in vivo study of protein-protein interactions (O’Reilly et al., 2020).

FIGURE 6

Figure 6. Fullscreen view of the Structure Viewer, which can be found on the sidebar of gene pages. It displays interactive 3D representations of available protein structures loaded from PDB or AlphaFold DB, and can also feature visualizations of internal cross-links. Using the control panel to the side of the view canvas, the visibility of cross-links can be toggled. The screenshot shows the AlphaFold structure prediction for eno and corresponding cross-links.

A minimized form of the Structure Viewer can be found on the gene page sidebar. While it features full functionality, a full-screen view is also available (shown in the figure), which includes information on how to control the viewer. By using the arrow icons, the user can cycle through the available structures of a protein, which are also found in the main body of the gene page in the section “The protein > Structure.” An info text in the bottom left corner indicates the currently viewed structure, and also links to the respective PDB or AlphaFold DB page. The user can choose different molecular representation styles from the drop-down selection in the upper left corner, such as renderings of the protein surface indicating hydrophobicity or electrostatic values. The visualization of structures and cross-links is performed with NGL Viewer, a web-based tool for molecular 3D graphics (Rose and Hildebrand, 2015). To the side of the viewer, additional information is displayed, including instructions about how to control the viewer and further details about the structure, if available.

As shown in the figure, visualization of cross-linking data is also available in MycoWiki. The data result from a large-scale in vivo study (O’Reilly et al., 2020), in which whole-cell cross-linking mass spectrometry with two different cross-linkers (DSS and DSSO) was performed. For the Structure Viewer, only internal (intraprotein) cross-links were extracted and mapped to the AlphaFold structure predictions of the corresponding proteins. Of the 686 predictions assigned to MycoWiki genes, internal cross-links were available for 441 structures. Cross-linked residues of a protein are highlighted in the viewer by dashed lines indicating the Euclidean distance between them. Furthermore, if the distance between the cross-linked residues is smaller than the spacer arms of the respective cross-linker, a molecular 3D representation of the linker is fitted to the structure. This representation was calculated and rendered using the program Xwalk, which determines the Solvent Accessible Surface Distance (SASD) between cross-linked amino acids (Kahraman et al., 2011). It corresponds to the shortest path between them only using solvent-occupied space, without passing through the protein surface. For DSS and DSSO, distances of 11.4 and 10.1 Å, respectively, plus a 1.5 Å tolerance were chosen as the maximum distance for which they could be fitted to the structure. In the Structure Viewer, the visibility of the different distance representations can be toggled via controls at the side of the viewer panel. In addition, a download link for an archive file of all structures and cross-link data is provided.

Implementation and data

The MycoWiki platform shares its framework with its predecessor SubtiWiki (Pedreira et al., 2022b). Accordingly, it is implemented using the same custom PHP backend framework and frontend functionality, and uses MySQL for its relational database. The application is hosted with Apache HTTP Server. Some differences SubtiWiki and SynWiki exist in presentation due to differing availability of data for the corresponding organisms, and some frontend features slightly vary in design.

MycoWiki contains a mixture of manually curated information, which is gathered from recent publications and evaluated by experts, and individual bulk data imports from existing data sources, such as other databases or published experimental data. The platform received a lot of its original annotation from the database MyMpn (Wodke et al., 2015), which was discontinued in 2020. With its structure similar to the one of MycoWiki, many parts of MyMpn could be directly adopted, such as genomic coordinates, enzyme commission numbers, and post-translational modifications. The main body of information on protein-protein interactions comes from a global in vivo study (O’Reilly et al., 2020).

Similar to SubtiWiki and SynWiki, a specialized set of categories was conceived for MycoWiki. They are represented by a tree-like structure that classifies genes by their functions, but also groups them according to their localization, essentiality, or quality of characterization, among others. Table 1 shows the five top-level categories and their immediate subcategories, while lower-level subcategories are omitted. The main part of the categories has been adapted from SynWiki, however, “Virulence and pathogenicity” has been added as a top-level category to help characterization of the organism as a pathogen.

TABLE 1

Table 1. List of top-level categories and their subcategories used in MycoWiki, and the number of genes assigned to each of them.

As in SubtiWiki and SynWiki, a list of precomputed best-hit protein homologs in selected related bacteria based on a FASTA pipeline (Pedreira et al., 2022b) has been added to each protein. A specialized page with a corresponding table (Figure 7) can be accessed from the “Homologs” section on any gene page. In total, 16 species were deemed representative of protein homologies, among them Mycoplasma genitalium, Mycoplasma mycoides subsp. mycoides, the artificial synthetic organism M. mycoides JCVI Syn3A, Escherichia coli, and Bacillus subtilis. Identity and similarity scores are given for each potential homolog, and an indication as to whether the homolog in question is also the best hit for the protein in the other direction.

FIGURE 7

Figure 7. Protein homology table for Eno. Best BLAST hits for 16 representative related organisms are featured, and scores on identity and similarity are provided.

To keep functional genome annotation up to date, joint efforts of the corresponding scientific community are required. Therefore, MycoWiki is open to contributions from all scientists in the field of Mycoplasma research. This is a major distinctive feature as compared to other databases that include information on M. pneumoniae. In addition, MyMpn, as mentioned above, has not received updates in the past years. BioCyc, a large suite of databases for many species including M. pneumoniae (Karp et al., 2019), is only available behind a paywall after very few pages access whereas MycoWiki is freely accessible to the scientific community. Finally, PATRIC, the Pathosystems Resource Integration Center (Davis et al., 2020), has a strong focus on genes rather than proteins. Thus, we are confident that MycoWiki will become a valuable tool for the Mycoplasma research community.

Curation and community

To further enhance the information provided in MycoWiki, the Mycoplasma research community is invited to register and participate in the curation of the database. While access to the complete contents is free for everyone, only registered users are able to log in and contribute. The entries will be curated by the team behind MycoWiki to ensure continuous high quality of the information provided.

Future perspectives

With MycoWiki, we have released a new comprehensive model organism database for the minimal bacterium M. pneumoniae. It utilizes the popular framework of SubtiWiki to facilitate intuitive exploration of the available annotation. Particular focus is put on the interactions between different genes and proteins, which may support the scientific community in the development of novel research hypotheses. Customized categories are used to classify the functions and other qualities of genes and their products concisely. In addition, the inclusion of a homology analysis could help to infer the functional annotation of genes. AlphaFold structure predictions have been assigned for the proteins, allowing a visual representation even in cases where no experimentally determined structure exists. While the rendering of internal cross-links for these structures can give an idea about the quality of the prediction, interprotein cross-links could be integrated in the future to explore the interaction between proteins in more detail. We hope that MycoWiki will become a valuable tool for the M. pneumoniae research community, and in turn an asset to the ongoing systems and synthetic biology research. Moreover, the wealth of information provided in MycoWiki and the easy access to classes of proteins based on the categories will help in the further development of M. pneumoniae as a chassis to target therapeutical compounds.

For the family of databases that includes MycoWiki, SubtiWiki, and SynWiki, we will develop novel features including protein-RNA, RNA-RNA, and protein-metabolite interactions that will certainly enhance the value of the databases.

Data availability statement

The original contributions presented in this study are included in the article/supplementary material, further inquiries can be directed to the corresponding author.

Author contributions

CE, BZ, and TP developed the framework of the database and integrated the data. BH performed initial work for the development of the structure viewer and integration of cross-link data. ML-S and LS provided data for the database. JS provided funding and supervised the development of the database. CE and JS wrote the manuscript. All authors read and approved the current submission.

Acknowledgments

We are grateful to Cedric Blötz and Ole Hinrichs for their help with the initial work on MycoWiki and to Marc Weber for the help with data collection.

Conflict of Interest

ML-S was employed by Pulmobiotics Ltd., Spain.

The remaining authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.

Publisher’s note

All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article, or claim that may be made by its manufacturer, is not guaranteed or endorsed by the publisher.

References

Becker, A., Kannan, T. R., Taylor, A. B., Pakhomova, O. N., Zhang, Y., Somarajan, S. R., et al. (2015). Structure of CARDS toxin, a unique ADP-ribosylating and vacuolating cytotoxin from Mycoplasma pneumoniae. Proc. Natl. Acad. Sci. U.S.A. 112, 5165–5170. doi: 10.1073/pnas.1420308112

PubMed Abstract | CrossRef Full Text | Google Scholar

Blötz, C., and Stülke, J. (2017). Glycerol metabolism and its implication in virulence in Mycoplasma. FEMS Microbiol. Rev. 41, 640–652.

Google Scholar

Blötz, C., Singh, N., Dumke, R., and Stülke, J. (2020). Characterization of an immunoglobulin binding protein (IbpM) from Mycoplasma pneumoniae. Front. Microbiol. 11:685. doi: 10.3389/fmicb.2020.00685

PubMed Abstract | CrossRef Full Text | Google Scholar

Burgos, R., Weber, M., Martinez, S., Lluch-Senar, M., and Serrano, L. (2020). Protein quality control and regulated proteolysis in the genome-reduced organism Mycoplasma pneumoniae. Mol. Syst. Biol. 16:e9530. doi: 10.15252/msb.20209530

PubMed Abstract | CrossRef Full Text | Google Scholar

Burley, S. K., Berman, H. M., Bhikadiya, C., Bi, C., Chen, L., Costanzo, L. D., et al. (2019). Protein Data Bank: the single global archive for 3D macromolecular structure data. Nucleic Acids Res. 47, D520–D528.

Google Scholar

Dandekar, T., Huynen, M., Regula, J. T., Ueberle, B., Zimmermann, C. U., Andrade, M. A., et al. (2000). Re-annotating the Mycoplasma pneumoniae genome sequence: adding value, function and reading frames. Nucleic Acids Res. 28, 3278–3288. doi: 10.1093/nar/28.17.3278

PubMed Abstract | CrossRef Full Text | Google Scholar

Davis, J. J., Wattam, A. R., Aziz, R. K., Brettin, T., Butler, R., Butler, R. M., et al. (2020). The PATRIC bioinformatics resource center: expanding data and analysis capabilities. Nucleic Acids Res. 48, D606–D612.

Google Scholar

Esposito, S., Argentiero, A., Gramegna, A., and Principi, N. (2021). Mycoplasma pneumoniae: a pathogen with unsolved therapeutic problems. Expert. Opin. Pharmacother. 22, 1193–1202. doi: 10.1080/14656566.2021.1882420

PubMed Abstract | CrossRef Full Text | Google Scholar

Garrido, V., Piñero-Lambea, C., Rodriguez-Arce, I., Paetzold, B., Ferrar, T., Weber, M., et al. (2021). Engineering a genome-reduced bacterium to eliminate Staphylococcus aureus biofilms in vivo. Mol. Syst. Biol. 17:e10145.

Google Scholar

Großhennig, S., Ischebeck, T., Gibhardt, J., Busse, J., Feussner, I., and Stülke, J. (2016). Hydrogen sulfide is a novel potential virulence factor of Mycoplasma pneumoniae: characterization of the unusual cysteine desulfurase/desulfhydrase HapE. Mol. Microbiol. 100, 42–54. doi: 10.1111/mmi.13300

PubMed Abstract | CrossRef Full Text | Google Scholar

Güell, M., van Noort, V., Yus, E., Chen, W. H., Leigh-Bell, J., Michalodimitrakis, K., et al. (2009). Transcriptome complexity in a genome-reduced bacterium. Science 326, 1268–1271. doi: 10.1126/science.1176951

PubMed Abstract | CrossRef Full Text | Google Scholar

Kahraman, A., Malmström, L., and Aebersold, R. (2011). Xwalk: computing and visualizing distances in cross-linking experiments. Bioinformatics 27, 2163–2164. doi: 10.1093/bioinformatics/btr348

PubMed Abstract | CrossRef Full Text | Google Scholar

Kannan, T. R., and Baseman, J. B. (2006). ADP-ribosylating and vacuolating cytotoxin of Mycoplasma pneumoniae represents unique virulence determinant among bacterial pathogens. Proc. Natl. Acad. Sci. U.S.A. 103, 6724–6729. doi: 10.1073/pnas.0510644103

PubMed Abstract | CrossRef Full Text | Google Scholar

Karp, P. D., Billington, R., Caspi, R., Fulcher, C. A., Latendresse, M., Kothari, A., et al. (2019). The BioCyc collection of microbial genomes and metabolic pathways. Brief Bioinf. 20, 1085–1093.

Google Scholar

Kühner, S., van Noort, V., Betts, M. J., Leo-Macias, A., Batisse, C., Rode, M., et al. (2009). Proteome organization in a genome-reduced bacterium. Science 326, 1235–1240. doi: 10.1126/science.1176343

PubMed Abstract | CrossRef Full Text | Google Scholar

Lluch-Senar, M., Delgado, J., Chen, W. H., Lloréns-Rico, V., O’Reilly, F. J., Wodke, J. A., et al. (2015). Unraveling the hidden universe of small proteins in bacterial genomes. Mol. Syst. Biol. 11:780. doi: 10.15252/msb.20188290

PubMed Abstract | CrossRef Full Text | Google Scholar

Lluch-Senar, M., Luong, K., Lloréns-Rico, V., Delgado, J., Fang, G., Spittle, K., et al. (2013). Comprehensive methylome characterization of Mycoplasma genitalium and Mycoplasma pneumoniae at single-base resolution. PLoS Genet. 9:e1003191. doi: 10.1371/journal.pgen.1003191

PubMed Abstract | CrossRef Full Text | Google Scholar

Maier, T., Schmidt, A., Güell, M., Kühner, S., Gavin, A. C., Aebersold, R., et al. (2011). Quantification of mRNA and protein and integration with protein turnover in a bacterium. Mol. Syst. Biol. 7:511. doi: 10.1038/msb.2011.38

PubMed Abstract | CrossRef Full Text | Google Scholar

Meyer Sauteur, P. M., van Rossum, A. M., and Vink, C. (2014). Mycoplasma pneumoniae in children: carriage, pathogenesis, and antibiotic resistance. Curr. Opin. Infect. Dis. 27, 220–227. doi: 10.1097/QCO.0000000000000063

PubMed Abstract | CrossRef Full Text | Google Scholar

Miravet-Verde, S., Ferrar, T., Espadas-García, G., Mazzolini, R., Gharrab, A., Sabido, E., et al. (2019). Defining a minimal cell: essentiality of small ORFs and ncRNAs in a genome-reduced bacterium. Mol. Syst. Biol. 15:e8290. doi: 10.15252/msb.20145558

PubMed Abstract | CrossRef Full Text | Google Scholar

Montero-Blay, A., Piñero-Lambea, C., Miravet-Verde, S., Lluch-Senar, M., and Serrano, L. (2020). Inferring active metabolic pathways from proteomics and essentiality data. Cell Rep. 31:107722. doi: 10.1016/j.celrep.2020.107722

PubMed Abstract | CrossRef Full Text | Google Scholar

O’Reilly, F. J., Xue, L., Graziadei, A., Sinn, L., Lenz, S., Tegunov, D., et al. (2020). In-cell architecture of an actively transcribing-translating expressome. Science 369, 554–557. doi: 10.1126/science.abb3758

PubMed Abstract | CrossRef Full Text | Google Scholar

Pedreira, T., Elfmann, C., and Stülke, J. (2022b). The current state of SubtiWiki, the database for the model organism Bacillus subtilis. Nucleic Acids Res. 50, D875–D882.

Google Scholar

Pedreira, T., Elfmann, C., Singh, N., and Stülke, J. (2022a). SynWiki: functional annotation of the first artificial organism Mycoplasma mycoides JCVI-syn3A. Protein Sci. 31, 54–62. doi: 10.1002/pro.4179

PubMed Abstract | CrossRef Full Text | Google Scholar

Piñero-Lambea, C., Ruano-Gallego, D., and Fernández, L. Á (2015). Engineered bacteria as therapeutic agents. Curr. Opin. Biotechnol. 35, 94–102.

Google Scholar

Rose, A. S., and Hildebrand, P. W. (2015). NGL viewer: a web application for molecular visualization. Nucleic Acids Res. 43, W576–W579.

Google Scholar

Sayers, E. W., Bolton, E. E., Brister, J. R., Canese, K., Chan, J., Comeau, D. C., et al. (2022). Database resources of the national center for biotechnology information. Nucleic Acids Res. 50, D20–D26.

Google Scholar

Schmidl, S. R., Gronau, K., Pietack, N., Hecker, M., Becher, D., and Stülke, J. (2010). The phosphoproteome of the minimal bacterium Mycoplasma pneumoniae: analysis of the complete known Ser/Thr kinome suggests the existence of novel kinases. Mol. Cell. Proteomics 9, 1228–1242. doi: 10.1074/mcp.M900267-MCP200

PubMed Abstract | CrossRef Full Text | Google Scholar

Schmidl, S. R., Otto, A., Lluch-Senar, M., Piñol, J., Busse, J., Becher, D., et al. (2011). A trigger enzyme in Mycoplasma pneumoniae: impact of the glycerophosphodiesterase GlpQ on virulence and gene expression. PLoS Pathog. 7:e1002263. doi: 10.1371/journal.ppat.1002263

PubMed Abstract | CrossRef Full Text | Google Scholar

van Noort, V., Seebacher, J., Bader, S., Mohammed, S., Vonkova, I., Betts, M. J., et al. (2012). Cross-talk between phosphorylation and lysine acetylation in a genome-reduced bacterium. Mol. Syst. Biol. 8:571. doi: 10.1038/msb.2012.4

PubMed Abstract | CrossRef Full Text | Google Scholar

Varadi, M., Anyango, S., Deshpande, M., Nair, S., Natassia, C., Yordanova, G., et al. (2022). AlphaFold protein structure database: massively expanding the structural coverage of protein-sequence space with high-accuracy models. Nucleic Acids Res. 50, D439–D444. doi: 10.1093/nar/gkab1061

PubMed Abstract | CrossRef Full Text | Google Scholar

Waites, K. B., Xiao, L., Liu, Y., Balish, M. F., and Atkinson, T. P. (2017). Mycoplasma pneumoniae from the respiratory tract and beyond. Clin. Microbiol. Rev. 30, 747–809.

Google Scholar

Wodke, J. A., Alibés, A., Cozzuto, L., Hermoso, A., Yus, E., Lluch-Senar, M., et al. (2015). MyMpn: a database for the systems biology model organism Mycoplasma pneumoniae. Nucleic Acids Res. 43, D618–D623. doi: 10.1093/nar/gku1105

PubMed Abstract | CrossRef Full Text | Google Scholar

Wodke, J. A., Puchałka, J., Lluch-Senar, M., Marcos, J., Yus, E., Godinho, M., et al. (2013). Dissecting the energy metabolism in Mycoplasma pneumoniae through genome-scale metabolic modeling. Mol. Syst. Biol. 9:653. doi: 10.1038/msb.2013.6

PubMed Abstract | CrossRef Full Text | Google Scholar

Yus, E., Lloréns-Rico, V., Martínez, S., Gallo, C., Eilers, H., Blötz, C., et al. (2019). Determination of the gene regulatory network of a genome-reduced bacterium highlights alternative regulation independent of transcription factors. Cell Syst. 9, 143–158. doi: 10.1016/j.cels.2019.07.001

PubMed Abstract | CrossRef Full Text | Google Scholar

Yus, E., Maier, T., Michalodimitrakis, K., van Noort, V., Yamada, T., Chen, W. H., et al. (2009). Impact of genome reduction on metabolism and its regulation. Science 326, 1235–1240.

Google Scholar

Keywords: MycoWiki, genome annotation, essential genes, systems biology, database

Citation: Elfmann C, Zhu B, Pedreira T, Hoßbach B, Lluch-Senar M, Serrano L and Stülke J (2022) MycoWiki: Functional annotation of the minimal model organism Mycoplasma pneumoniae. Front. Microbiol. 13:935066. doi: 10.3389/fmicb.2022.935066

Received: 03 May 2022; Accepted: 01 July 2022;
Published: 25 July 2022.

Edited by:

Meghan May, University of New England, United States

Reviewed by:

Lei Shi, Chalmers University of Technology, Sweden
Wiep Klaas Smits, Leiden University, Netherlands

Copyright © 2022 Elfmann, Zhu, Pedreira, Hoßbach, Lluch-Senar, Serrano and Stülke. This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.

*Correspondence: Jörg Stülke, anN0dWVsa0Bnd2RnLmRl

^†These authors have contributed equally to this work

^‡ORCID: Christoph Elfmann, https://orcid.org/0000-0002-7872-8232; Tiago Pedreira, https://orcid.org/0000-0002-3416-5714; Jörg Stülke, https://orcid.org/0000-0001-5881-5390

Disclaimer: All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article or claim that may be made by its manufacturer is not guaranteed or endorsed by the publisher.