- 1Department of Biotechnology, Jaypee Institute of Information Technology, Noida, India
- 2Department of Microbiology and Immunology, Dalhousie University, Halifax, NS, Canada
- 3Department of Bioinformatics, Gujarat Biotechnology University, Gandhinagar, India
Legumes comprise one of the world’s largest, most diverse, and economically important plant families, known for their nutritional and medicinal benefits. Legumes are susceptible to a wide range of diseases, similar to other agricultural crops. Diseases have a considerable impact on the production of legume crop species, resulting in large yield losses worldwide. Due to continuous interactions between plants and their pathogens in the environment and the evolution of new pathogens under high selection pressure; disease resistant genes emerge in plant cultivars in the field against those pathogens or disease. Thus, disease resistant genes play critical roles in plant resistance responses, and their discovery and subsequent use in breeding programmes aid in reducing yield loss. The genomic era, with its high-throughput and low-cost genomic tools, has revolutionised our understanding of the complex interactions between legumes and pathogens, resulting in the identification of several critical participants in both the resistant and susceptible relationships. However, a substantial amount of existing information about numerous legume species has been disseminated as text or is preserved across fractions in different databases, posing a challenge for researchers. As a result, the range, scope, and complexity of these resources pose challenges to those who manage and use them. Therefore, there is an urgent need to develop tools and a single conjugate database to manage genetic information for the world’s plant genetic resources, allowing for the rapid incorporation of essential resistance genes into breeding strategies. Here, developed the first comprehensive database of disease resistance genes named as LDRGDb - LEGUMES DISEASE RESISTANCE GENES DATABASE comprises 10 legumes [Pigeon pea (Cajanus cajan), Chickpea (Cicer arietinum), Soybean (Glycine max), Lentil (Lens culinaris), Alfalfa (Medicago sativa), Barrelclover (Medicago truncatula), Common bean (Phaseolus vulgaris), Pea (Pisum sativum),Faba bean (Vicia faba), and Cowpea (Vigna unguiculata)]. The LDRGDb is a user-friendly database developed by integrating a variety of tools and software that combine knowledge about resistant genes, QTLs, and their loci, with proteomics, pathway interactions, and genomics (https://ldrgdb.in/).
Introduction
Legumes are seed plants belonging to the Leguminosae family, spanning more than 13000 species across 600 genera. Legumes are diverse; they are suitable for cultivation in a variety of environments and temperatures (Magrini et al., 2019). They are an excellent source of antioxidants, micronutrients, and proteins, finding use as natural fertilizers, medicines, and animal fodder (Shah et al., 2020; Singh et al., 2022). Apart from their health benefits, legumes naturally have the ability to fix nitrogen from the atmosphere symbiotically, which has positive effects on agriculture and soil enrichment that are both long-lasting and economically viable. Improvement in legume cultivation and their increased usage can help ensure food security and contribute to better soil fertility (Foyer et al., 2016; Singh et al., 2020).
However, one of the main impediments in the way of abundant and quality yield of legume crops is pests and diseases Figure 1. Furthermore, climate changes have made it easier for various species to move freely, which has led to the emergence of new diseases that could grow into uncontrollable epidemics and endanger food security (Piquerez et al., 2014). Thus, crop improvement is a key component of sustainable agriculture and can be accomplished using a variety of techniques, ranging from traditional breeding to genetic engineering. To increase the effectiveness of the breeding process, it is necessary to have a thorough understanding of host-pathogen interactions as well as effective resistance mechanisms at the cellular, genetic, and molecular levels. The most effective, affordable, and environmentally friendly of the various control techniques available is breeding for resistance (Rubiales et al., 2014). These factors drive plant breeders and scientists to focus their efforts on finding mechanisms for plant disease resistance (Osuna-Cruz et al., 2018). Indeed, studies into the genes and genomes of legumes have provided valuable insight into disease resistance components and other agronomic traits (Leal-Bertioli et al., 2009; Khera et al., 2018; Zhuang et al., 2019; Sampaio et al., 2020).
Resistance genes (R genes) are variable plant genes that confer resistance to vast varieties of biotrophic pathogens, including viruses. As they can specifically recognise the matching pathogen effectors or associated protein(s), resistance (R) genes are the most potent defences against pathogen invasion (Liu et al., 2007). This allows plants to mount an effective defence at the site of infection. R genes are capable of expressing PR proteins in response to physical or chemical stimuli (Rodríguez-Sifuentes et al., 2020). As PR proteins have hydrolytic activity, contact toxicity, involvement in defence signalling, and pathogen enzyme inhibition, they constitute a variety of weapons against pathogens. We have seen that plant R genes have been divided into several groups based on their typical domains (Zheng et al., 2016). A class of proteins containing nucleotide binding (NB) and leucine-rich repeat (LRR) domains is encoded by the majority of R genes (Friedman and Baker, 2007).
It is now known that R genes also produce proteins with a series of carboxy-terminal leucine-rich repeats (LRRs), a putative amino-terminal signalling domain, and a nucleotide binding site (NBS). There are two primary classes of “NBS-LRR” proteins: first which encodes an amino-terminal coiled-coiled motif (CC-NBS-LRR or CNL proteins) and second which has an amino-terminal TIR (Toll/interleukin receptor) domain (which are known as TIR-NBS-LRR or TNL proteins) (Meyers et al., 2005). Figure 2 depicts the various classes of R-Gene. Hence, the participation of multiple R genes, post-transcriptional regulators, and biotic and abiotic stressors limits the ability of plants to resist disease.
R gene mechanisms work on the gene-for-gene concept (Kaloshian, 2004). The resistance occurs only when the R gene proteins engage with the pathogen Avr gene in a certain form. They can interact with the pathogen gene in two ways, first by directly interacting with its protein product and second, if it plays a catalytic role by interacting with something created by the Avr gene product (Hulbert et al., 2001). Any attempt of an infection is thus governed by both the genotype of the host and that of the pathogen. Once this recognition has taken place, the defence reactions are triggered. Hypersensitive reaction is frequently characterized in these defence reactions, which results in the death of the initial cell or cells infected and also the local accumulation of antimicrobial compounds (Moffett, 2009). Phytophthora root and stem rot in nonhost common bean has been found to induce a strong hypersensitive response to Phytophthora sojae due to upregulation of genes promoting phaseollidin and glyceollin production, which have significant inhibitory effects on oospore production and mycelial formation (Bi et al., 2022). Transcription factors have also been implicated in multilayer defence signalling against Fusarium wilt disease in chickpea (Chakraborty et al., 2020). Legumes exhibit resistance through other means as well; a recent study on Faba beans indicated that its resistant genotype withstood the Chocolate spot infection caused by Botrytis fabae due to a better Photosystem II Repair mechanism at early stages of the infection (Castillejo et al., 2021). Leaf spot infected alfalfa, when colonized by arbuscular mycorrhizal fungus (AMF), has shown to ameliorate the effects of the infection, thus displaying the potential to serve as a biocontrol strategy for leaf spot infection of alfalfa (Li et al., 2019).
Development of cost-effective next generation sequencing platforms with enhanced performances have given way to copious amounts of data. In order for these data to be effectively analysed and compared, various data management practices are being employed (Bauchet et al., 2019). Many databases exist for comprehensive study and management of the insurmountable data generated by plant genome studies. However, these repositories either only focus individually on model genomes such as soybean (https://soybase.org/) (Grant et al., 2009), Medicago truncatula (https://www.legoo.org) (Carrere et al., 2019), (http://www.medicago.org/MtDB) (Lamblin, 2003), or a combination of multiple model species (http://www.plantgdb.org/) (Dong, 2004), (http://plantgrn.noble.org/LegumeIP/) (Li et al., 2011). Currently, there is a lack of repositories that can pool the numerous disease resistance genes with proteomics and facilitate the quick integration of crucial resistance genes into breeding methods. Keeping this in mind, we have established and developed the first comprehensive database of disease resistance genes in legume plants named as LDRGDb, that incorporates knowledge about resistant genes, QTLs, and their loci, with proteomics, pathways interactions as well as structural conformations of proteins. The database serves as a medium for comprehensive search and retrieval of relevant information, and will help researchers understand various biological phenomena with relative ease. Our database has been constructed so as to aid in research of disease resistant common legume cultivars and their underlying mechanisms. LDRGDb spans 10 legume species, with genes, QTL information such as linkage group, neighbouring marker, population, maternal and paternal parent, proteomics, informatics such as molecular weight, theoretical pI, length of amino acids, half-life of protein, and aliphatic index with emphasis on structural conformation, which can aid in a thorough understanding of various pathways and interactions involved in disease resistance, functioning as a single repository for assisting in research.
Methodology
Data collection
The database has major 10 legume crops with their common disease-related genes and QTLs incorporated. QTLs data has been combined from numerous databases for a single model organism, such as soybase (Grant et al., 2009), databases for several species, such as pulsedb (Humann et al., 2019), prgdb (Calle García et al., 2021), and other databases. Further, the QTLs related to diseases were manually curated from the available scientific literature, books and journals. We incorporated gene data from various journals and books using the Pubmed (PubMed Labs, 2021) search box and varied search queries incorporating different diseases and crops from the last 5 years. (Figures 3, 4) show the number of genes and QTLs incorporated into the database.
Annotation of genes and QTLs
We performed functional annotation studies at genes and QTLs levels to develop comprehensive information for the collected data. Uniprot (Bateman et al., 2020) was used to retrieve protein-related data of the genes while protparam (Gasteiger et al., 2005) was used to retrieve individual protein details such as theoretical pI, the total number of negatively and positively charged residues, protein’s half-life, aliphatic index, and molecular weight etc. To mine and integrate the biological, molecular, and cellular processes involved in genes Uniport was utilized, (Bateman et al., 2020) whereas Interpro (Paysan-Lafosse et al., 2022) database for protein family and domain retrieval. The Swiss model (Waterhouse et al., 2018) was used to incorporate the structure of the protein of a gene with its template included.
Database design
The database is made up of two tables: Genes and QTLs, and one-to-many relationships were utilised to create the complete database structure, allowing for the inclusion of any number of connections. The data’s detailed data flow is documented in (Figure 5). For website development, Django v4.1.3 with in-built sqlite3 has been utilized. Django offers speedy development, fast processing, and scalability for website development, while sqlite3 provides a lightweight disk-based database that doesn’t require a separate server process and allows accessing the database using a nonstandard variant of the SQL query language. Special care has been taken to protect the database’s structure, consistency, and the accuracy of the data stored.
Key features
The LDRGDb is a comprehensive search and retrieval database of significant information that serves as a centralised repository for research purposes. It is the only legume repository that includes not only QTL information such as linkage group, neighbouring marker, population, and maternal and paternal parent, but also genes with proteomics information such as theoretical pI, the total number of negatively and positively charged residues, protein half-life, aliphatic index, and molecular weight, among other things. The inclusion of protein pathway interactions with diverse biological, cellular, and molecular processes aid researchers in understanding a wide range of processes. The incorporation of protein structural conformations with the template annotated through the Swiss model is a major aspect of the LDRGDb that can aid in full knowledge of protein interactions involved in disease resistance. (Figure 6) depicts how the LDRGDb is utilized for the retrieval of significant information.
Conclusion
Disease-resistant mechanisms of plants are constrained by the involvement of various genes, post-transcriptional controls, biotic and abiotic stresses. The understanding of these mechanisms and their impact on plant proteomes and metabolomes is important for improvement of legume production. To obtain knowledge about the mechanisms of resistance, LDRGDb serves as an exemplary one stop database for quick access of disease resistance genes, QTLs, and the proteins and pathways associated with them for various legume species. Users can search using either disease names, crop names, or both, facilitating easy and comprehensive ingress into the data. The data collected from various databases and literature has been meticulously structured to allow for rapid searches for diseases, legumes and the genes associated with them using user-friendly web interfaces. The database has specific sections for particular crop characteristics, prevalent diseases, as well as a FAQ section for frequently asked questions. These are the webpage screenshots (Figure 7).
Data availability statement
The original contributions presented in the study are included in the article/supplementary material. Further inquiries can be directed to the corresponding author.
Author contributions
HS contributed to collection of data implementation and initial draft writing. AiK contributed to collection of data, initial draft writing tables, and figures. AA contributed to collection of data and initial draft writing. AK contributed to designing and editing the manuscript. NS conceived, supervised and edited the final manuscript. CJ conceived, supervised and edited the final manuscript. All authors contributed to the article and approved the submitted version.
Acknowledgments
We are thankful for Department of Biotechnology, Jaypee Institute of Information Technology, Noida, India and Department of Bioinformatics, Gujarat Biotechnology University, Gandhinagar, Gujarat, India for providing the necessary support.
Conflict of interest
The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.
Publisher’s note
All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article, or claim that may be made by its manufacturer, is not guaranteed or endorsed by the publisher.
References
Bateman, A., Martin, M.-J., Orchard, S., Magrane, M., Agivetova, R., Ahmad, S., et al. (2020). UniProt: the universal protein knowledgebase in 2021. Nucleic Acids Res. 49 (D1), D480–D489. doi: 10.1093/nar/gkaa1100
Bauchet, G. J., Bett, K. E., Cameron, C. T., Campbell, J. D., Cannon, E. K. S., Cannon, S. B., et al. (2019). The future of legume genetic data resources: challenges, opportunities, and priorities. Legume Sci. 1 (1), e16. doi: 10.1002/leg3.16
Bi, X., Song, G., Yu, H., Zhang, Z., Liu, H., Yang, Z., et al. (2022). Changes in biochemistry and cellular ultrastructure support different resistance mechanisms to phytophthora sojae in nonhost common bean and host soybean. Plant Pathol. 71 (4), 917–926. doi: 10.1111/ppa.13527
Calle García, J., Guadagno, A., Paytuvi-Gallart, A., Saera-Vila, A., Amoroso, C., D’Esposito, D., et al. (2021). PRGdb 4.0: an updated database dedicated to genes involved in plant disease resistance process. Nucleic Acids Res. 50 (D1), D1483–D1490. doi: 10.1093/nar/gkab1087
Carrere, S., Verdenaud, M., Gough, C., Gouzy, J., Gamas, P. (2019). LeGOO: an expertized knowledge database for the model legume medicago truncatula. Plant Cell Physiol. 61 (1), 203–211. doi: 10.1093/pcp/pcz177
Castillejo, M.Á., Villegas-Fernández, Á.M., Hernández-Lao, T., Rubiales, D. (2021). Photosystem II repair cycle in faba bean may play a role in its resistance to botrytis fabae infection. Agronomy 11 (11), 2247. doi: 10.3390/agronomy11112247
Chakraborty, J., Sen, S., Ghosh, P., Jain, A., Das, S. (2020). Inhibition of multiple defense responsive pathways by CaWRKY70 transcription factor promotes susceptibility in chickpea under fusarium oxysporum stress condition. BMC Plant Biol. 20 (1), 319. doi: 10.1186/s12870-020-02527-9
Dong, Q. (2004). PlantGDB, plant genome database and analysis tools. Nucleic Acids Res. 32 (90001), 354D359. doi: 10.1093/nar/gkh046
Foyer, C. H., Lam, H.-M., Nguyen, H. T., Siddique, K. H. M., Varshney, R. K., Colmer, T. D., et al. (2016). Neglecting legumes has compromised human health and sustainable food production. Nat. Plants 2 (8), 16112. doi: 10.1038/nplants.2016.112
Friedman, A. R., Baker, B. J. (2007). The evolution of resistance genes in multi-protein plant resistance systems. Curr. Opin. Genet. Dev. 17 (6), 493–499. doi: 10.1016/j.gde.2007.08.014
Gasteiger, E., Hoogland, C., Gattiker, A., Duvaud, S., Wilkins, M. R., Appel, R. D., et al. (2005). Protein identification and analysis tools on the ExPASy server. Proteomics Protoc. Handb., 571–607. doi: 10.1385/1-59259-890-0:571
Grant, D., Nelson, R. T., Cannon, S. B., Shoemaker, R. C. (2009). SoyBase, the USDA-ARS soybean genetics and genomics database. Nucleic Acids Res. 38 (suppl_1), D843–D846. doi: 10.1093/nar/gkp798
Hulbert, S. H., Webb, C. A., Smith, S. M., Sun, Q. (2001). RESISTANCE GENE COMPLEXES: Evolution and utilization. Annu. Rev. Phytopathol. 39 (1), 285–231. doi: 10.1146/annurev.phyto.39.1.285
Humann, J. L., Jung, S., Cheng, C.-H., Lee, T., Zheng, P., Frank, M., et al. (2019). “Cool season food legume genome database: A resource for pea, lentil, faba bean and chickpea genetics, genomics and breeding,” in Proceedings of the international plant and animal genome conference(San Diego, CA, USA).
Kaloshian, I. (2004). Gene-for-gene disease resistance: Bridging insect pest and pathogen defense. J. Chem. Ecol. 30 (12), 2419–2438. doi: 10.1007/s10886-004-7943-1
Khera, P., Pandey, M. K., Mallikarjuna, N., Sriswathi, M., Roorkiwal, M., Janila, P., et al. (2018). Genetic imprints of domestication for disease resistance, oil quality, and yield component traits in groundnut (Arachis hypogaea l.). Mol. Genet. Genomics 294 (2), 365–378. doi: 10.1007/s00438-018-1511-9
Lamblin, A.-F. J. (2003). MtDB: A database for personalized data mining of the model legume medicago truncatula transcriptome. Nucleic Acids Res. 31 (1), 196–201. doi: 10.1093/nar/gkg119
Leal-Bertioli, S. C., José, A., Alves-Freitas, D. M., Moretzsohn, M. C., Guimarães, P. M., Nielen, S., et al. (2009). Identification of candidate genome regions controlling disease resistance in arachis. BMC Plant Biol. 9 (1), 112. doi: 10.1186/1471-2229-9-112
Li, J., Dai, X., Liu, T., Zhao, P. X. (2011). LegumeIP: an integrative database for comparative genomics and transcriptomics of model legumes. Nucleic Acids Res. 40 (D1), D1221–D1229. doi: 10.1093/nar/gkr939
Li, Y., Duan, T., Nan, Z., Li, Y. (2019). Arbuscular mycorrhizal fungus alleviates alfalfa leaf spots caused by phoma medicaginis revealed by RNA-seq analysis. J. Appl. Microbiol. 130 (2), 547–560. doi: 10.1111/jam.14387
Liu, J., Liu, X., Dai, L., Wang, G. (2007). Recent progress in elucidating the structure, function and evolution of disease resistance genes in plants. J. Genet. Genomics 34 (9), 765–776. doi: 10.1016/s1673-8527(07)60087-3
Magrini, M.-B., Cabanac, G., Lascialfari, M., Plumecocq, G., Amiot, M.-J., Anton, M., et al. (2019). Peer-reviewed literature on grain legume species in the WoS, (1980–2018): A comparative analysis of soybean and pulses. Sustainability 11 (23), 6833. doi: 10.3390/su11236833
Meyers, B. C., Kaushik, S., Nandety, R. S. (2005). Evolving disease resistance genes. Curr. Opin. Plant Biol. 8 (2), 129–134. doi: 10.1016/j.pbi.2005.01.002
Moffett, P. (2009). Mechanisms of recognition in dominant r gene mediated resistance. Adv. Virus Res. 75, 1–33. doi: 10.1016/S0065-3527(09)07501-0
Osuna-Cruz, C. M., Paytuvi-Gallart, A., Di Donato, A., Sundesha, V., Andolfo, G., Aiese Cigliano, R., et al. (2018). PRGdb 3.0: a comprehensive platform for prediction and analysis of plant disease resistance genes. Nucleic Acids Res. 46 (D1), D1197–D1201. doi: 10.1093/nar/gkx1119
Paysan-Lafosse, T., Blum, M., Chuguransky, S., Grego, T., Pinto, B. L., Salazar, G., et al. (2022). InterPro in 2022. Nucleic Acids Res 51(D1), D418–D427. doi: 10.1093/nar/gkac993
Piquerez, S. J. M., Harvey, S. E., Beynon, J. L., Ntoukakis, V. (2014). Improving crop disease resistance: lessons from research on arabidopsis and tomato. Front. Plant Sci. 5. doi: 10.3389/fpls.2014.00671
PubMed Labs (2021). Available at: https://pubmed.ncbi.nlm.nih.gov/.
Rodríguez-Sifuentes, L., Marszalek, J. E., Chuck-Hernández, C., Serna-Saldívar, S. O. (2020). Legumes protease inhibitors as biopesticides and their defense mechanisms against biotic factors. Int. J. Mol. Sci. 21 (9), 3322. doi: 10.3390/ijms21093322
Rubiales, D., Fondevilla, S., Chen, W., Gentzbittel, L., Higgins, T. J. V., Castillejo, M. A., et al. (2014). Achievements and challenges in legume breeding for pest and disease resistance. Crit. Rev. Plant Sci. 34 (1-3), 195–236. doi: 10.1080/07352689.2014.898445
Sampaio, A. M., Araújo, S., de, S., Rubiales, D., Vaz Patto, M. C. (2020). Fusarium wilt management in legume crops. Agronomy 10 (8), 1073. doi: 10.3390/agronomy10081073
Shah, F., Khan, Z., Iqbal, A., Turan, M., Olgun, M. (2020) Recent advances in grain crops research. Available at: https://www.intechopen.com/books/8168.
Singh, N., Jain, P., Ujinwal, M., Langyan, S. (2022). Escalate protein plates from legumes for sustainable human nutrition. Front. Nutr. 9. doi: 10.3389/fnut.2022.977986
Singh, N., Rai, V., Singh, N. K. (2020). Multi-omics strategies and prospects to enhance seed quality and nutritional traits in pigeonpea. Nucleus 63 (3), 249–256. doi: 10.1007/s13237-020-00341-0
Waterhouse, A., Bertoni, M., Bienert, S., Studer, G., Tauriello, G., Gumienny, R., et al. (2018). SWISS-MODEL: homology modelling of protein structures and complexes. Nucleic Acids Res. 46 (W1), W296–W303. doi: 10.1093/nar/gky427
Zheng, F., Wu, H., Zhang, R., Li, S., He, W., Wong, F. -L., et al. (2016). Molecular phylogeny and dynamic evolution of disease resistance genes in the legume family. BMC Genomics 17 (1), 402. doi: 10.1186/s12864-016-2736-9
Keywords: disease resistance genes, genomics, LDRGDb, legumes, proteomics, QTLs data
Citation: Saxena H, Kulshreshtha A, Agarwal A, Kumar A, Singh N and Jain CK (2023) LDRGDb - Legumes disease resistance genes database. Front. Plant Sci. 14:1143111. doi: 10.3389/fpls.2023.1143111
Received: 12 January 2023; Accepted: 22 March 2023;
Published: 18 April 2023.
Edited by:
Lida Zhang, Shanghai Jiao Tong University, ChinaReviewed by:
Shyam Sundar Dey, Indian Agricultural Research Institute (ICAR), IndiaDhananjay K. Pandey, Amity University, Jharkhand, India
Copyright © 2023 Saxena, Kulshreshtha, Agarwal, Kumar, Singh and Jain. This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.
*Correspondence: Chakresh Kumar Jain, Y2tqNTIyQHlhaG9vLmNvbQ==; Nisha Singh, c2luZ2gubmlzaGE4OEBnbWFpbC5jb20=