Skip to main content

ORIGINAL RESEARCH article

Front. Plant Sci., 15 May 2023
Sec. Plant Breeding
This article is part of the Research Topic IPPS 2022 - Plant Phenotyping for a Sustainable Future View all 14 articles

Sainfoin (Onobrychis spp.) crop ontology: supporting germplasm characterization and international research collaborations

  • 1Bioengineering Department, Adana Alparslan Türkeş Science and Technology University, Adana, Türkiye
  • 2The Land Institute, Salina, KS, United States
  • 3Department of Computer Engineering, Kafkas University, Kars, Türkiye
  • 4Donald Danforth Plant Science Center, St. Louis, MO, United States
  • 5Department of Viticulture and Enology, University of California Davis, Davis, CA, United States
  • 6Plant and Environmental Sciences Department, Clemson University, Clemson, SC, United States
  • 7Department. of Biology, Saint Louis University, St. Louis, MO, United States
  • 8Department of Geography, Birzeit University, Birzeit, West Bank, Palestine

Sainfoin (Onobrychis spp.) is a perennial forage legume that is also attracting attention as a perennial pulse with potential for human consumption. The dual use of sainfoin underpins diverse research and breeding programs focused on improving sainfoin lines for forage and pulses, which is driving the generation of complex datasets describing high dimensional phenotypes in the post-omics era. To ensure that multiple user groups, for example, breeders selecting for forage and those selecting for edible seed, can utilize these rich datasets, it is necessary to develop common ontologies and accessible ontology platforms. One such platform, Crop Ontology, was created in 2008 by the Consortium of International Agricultural Research Centers (CGIAR) to host crop-specific trait ontologies that support standardized plant breeding databases. In the present study, we describe the sainfoin crop ontology (CO). An in-depth literature review was performed to develop a comprehensive list of traits measured and reported in sainfoin. Because the same traits can be measured in different ways, ultimately, a set of 98 variables (variable = plant trait + method of measurement + scale of measurement) used to describe variation in sainfoin were identified. Variables were formatted and standardized based on guidelines provided here for inclusion in the sainfoin CO. The 98 variables contained a total of 82 traits from four trait classes of which 24 were agronomic, 31 were morphological, 19 were seed and forage quality related, and 8 were phenological. In addition to the developed variables, we have provided a roadmap for developing and submission of new traits to the sainfoin CO.

1 Introduction

Sainfoin (Onobrychis spp. Fabaceae) has a long and rich history of cultivation across Asia, Europe, and North America where it is used to provide roughage for livestock and to maintain soil fertility (Frame et al., 1998). Sainfoin has been used as a perennial forage legume and in crop rotation regimes with major grains such as wheat and barley (Hayot Carbonero et al., 2011). Sainfoin use has been centered in Turkey, the Balkans, and Central and Southern Europe (Bennett et al., 2001), but historical evidence suggests it was also grown in Palestine, Syria, and Lebanon in the late 1800s (Tristram, 1885). The introduction of synthetic fertilizer-based production schemes led to a reduction in sainfoin cultivation in these regions (Hayot Carbonero et al., 2011), and a concomitant decline in research and breeding efforts. Recently, as concerns about synthetic fertilizers have grown, attention has refocused on the crucial role of legumes in agricultural systems, and as a result, interest in sainfoin has been revived (Şakiroğlu, 2021; Craine et al., 2023).

Agriculture and plant breeding are undergoing a revolution in response to calls for the development of more diverse, sustainable, agricultural systems. A key part of this is plant breeding, the improvement of existing crops and development of new ones that provide agronomic products and critical ecosystem services. For example, there is emerging interest in domesticating sainfoin as a potential novel, sustainable food source - a perennial pulse - for human consumption (Butkutė et al., 2018a; Butkutė et al., 2018b; Schlautman et al., 2018). Evidence from nutritional analyses and animal feeding studies suggest sainfoin seeds could be suitable for human and animal consumption (Ditterline, 1973; Tarasenko et al., 2015; Baldinger et al., 2016; Wijekoon et al., 2021; Craine et al., 2023). Thus, recent breeding efforts to develop sainfoin for dual-purpose perennial grain and forage production by selecting phenotypes related to grain yield and grain quality have begun at The Land Institute (Salina, KS, USA), Adana Alparslan Türkeş Science and Technology University (Adana, Turkey), and elsewhere.

As plant breeding programs expand and evolve to meet current and future agricultural needs, it is necessary to adapt existing frameworks for cataloging plant information. In the post-omics era (e.g. genomics, transcriptomics, proteomics, metabolomics, hormonomics, ionomics, and large-scale automated phenomics) the amount and complexity of data collected, stored, and shared within and among breeding and agriculture research programs has reached an all-time high (Langridge and Fleury, 2011; Leonelli et al., 2017; Li and Yan, 2020; Van Tassel et al., 2022). These post-omics era technologies promise to generate more data at lower costs than ever before, which could accelerate genetic gains in underutilized crops or even rapidly domesticate new ones. However, leveraging these technologies and large datasets when collaborating is only possible with available infrastructure to robustly store and access data. As such, there is a need for new frameworks that enable breeders to efficiently share and communicate about the multi-dimensional plant phenotypes characterized in their programs, for different breeding goals, in a broader diversity of current and emerging crop species like sainfoin.

As early as the 1990s, the need for designing databases serving multiple users with a robust set of common ontologies was recognized (Wieczorek et al., 2012). Numerous ontologies and ontology platforms have since been created to support and standardize data sharing within and among research fields such as Darwin Core (https://dwc.tdwg.org) as a standard for biodiversity data and the Planteome platform (https://www.planteome.org) and related Plant Ontology (PO), Plant Trait Ontology (TO), and Plant Experimental Conditions Ontology (PECO) frameworks, which provide a base for ontologies for plant and species-specific traits related to plant development, anatomy, physiology in the context of genomics data (Jaiswal et al., 2005; Pujar et al., 2006; Ilic et al., 2007; Avraham et al., 2008; Arnaud et al., 2012; Cooper et al., 2013; Cooper and Jaiswal, 2016; Cooper et al., 2018).

Crop Ontology (CO, https://cropontology.org, Matteis et al., 2013)) was created in 2008 by the CGIAR to provide a framework and common language to catalog crop-specific trait data, allowing traits to be easily interpretable and interoperable for further aggregation, analysis, and multidisciplinary communication (Gruber, 2009). CO currently supports the standardization of plant breeding databases such as the Integrated Breeding Platform’s BMS (IBP; https://www.integratedbreeding.net/), the Boyce Thompson Institute’s Breedbase (https://breedbase.org/, (Morales et al., 2022), and others (Arnaud et al., 2020) which allow the creation and management of annotated trial data (Crop Ontology 2022). The Minimum Information About a Plant Phenotype Experiment (MIAPPE https://www.miappe.org/; (Ćwiek-Kupczyńska et al., 2016; Papoutsoglou et al., 2020) and the Breeding Application Programming Interface (BrAPI; https://brapi.org/; (Selby et al., 2019) have both adopted the CO format, demonstrating the widespread acceptance and utility of the standard (Arnaud et al., 2020). The CO Application Programming Interface (API) is used by third-party websites and databases like the EMBL-EBI Ontology Lookup Service that replicates CO and provides term search access through its own portal. Agroportal, the registry of ontologies in agriculture and related domains, regularly synchronizes their files with CO.

Several different COs have since been developed and made accessible through the CO platform by research groups and crop specific consortiums for several commonly cultivated crops including apple, banana, cotton, corn, common bean, potato, rice, and wheat. We expect that the broader impacts made possible through international and transdisciplinary collaboration and germplasm characterization in sainfoin can be magnified through early efforts of a consortium of researchers, hereafter referred to as the “Sainfoin Consortium.” In this current work, we describe the efforts of the Sainfoin Consortium to standardize the nomenclature and data storage systems used for sainfoin research to create a sainfoin CO - the first CO developed for a perennial forage legume and grain crop. We also include a roadmap for further expansion of the sainfoin CO through a set of guidelines for the development and suggestion of new traits.

2 The sainfoin ontology framework

2.1 Ontology construction methods

We used the Crop Ontology framework guidelines (version 2.1; Pietragalla et al., 2022) and trait dictionary template to build the first version of a sainfoin crop ontology. The online database software, Airtable, was used to create the first version of the ontology as it incorporates relational data structures that can be used to easily link ontology terms across tables in the database. Figures and entity relation diagrams were constructed in DrawIO (https://github.com/jgraph/drawio). The Crop Ontology CGIAR advisory board assigned the crop code CO_369 to sainfoin, which is prepended to Variable terms in the final sainfoin ontology. The sainfoin crop ontology is available to the public, can be browsed on the Crop Ontology website (https://cropontology.org/term/CO_369:ROOT), resides in a dedicated GitHub repository (https://github.com/Planteome/CO_369-sainfoin-traits), and is maintained by a group of community curators from the Sainfoin Consortium.

2.2 Term types and structure

The CO phenotype annotation model is based on three fundamental CO term types: Trait, Method, and Scale. These three fundamental types are then used in conjunction to construct a fourth term type, Variable, which formalizes how a given trait is collected. Each CO term in the sainfoin ontology was assigned a persistent unique identifier (PUID), which is composed of the sainfoin CO crop code and a seven-digit number in the form {CO crop code}:{#######}, e.g., CO_369:0000001. While the format of the ID system is constrained as shown above, the CO guidelines do not mandate a particular system for assigning PUIDs to Variables, Traits, Methods, or Scales within an individual ontology. To maintain an incremental, identifiable PUID system, we added constraints for each term type (Variable, Trait, Method, and Scale) shown in Table 1.

TABLE 1
www.frontiersin.org

Table 1 Term type ID series composition and creation.

In addition to PUIDs, all terms in the ontology were given human-readable names and abbreviations that can be used in trait selection used in data collection. We avoided using acronyms within CO term names except where the term name would become unwieldy without its use or when the acronym is more widely known than the actual phrase which the acronym represents (e.g., ‘SPAD’ for soil plant analysis development). Term names were constrained to standard ASCII characters, and aside from acronyms, only the first letter of the first word in each name is capitalized.

2.3 Trait composition framework

A Trait in the Sainfoin CO is a subcomponent of a variable that defines what is observed. Traits are composed of a meaningful, two- to four-word phrase consisting of an Entity, the observed part of a plant, and an Attribute, a feature of an entity, in the form {Entity} {Attribute} such as {Plant} {height}. Traits are then assigned to one of the nine Trait classes specified in the CO guidelines for organizational purposes when viewing the ontology (Table 2). When possible, Entities and Attributes in the sainfoin ontology were cross referenced with terms in existing relevant ontologies such as PO, the Agronomy Ontology (AGRO, Aubert et al., 2017), The Environment Ontology (ENVO, Buttigieg et al., 2016) and The Phenotype And Trait Ontology (PATO, Gkoutos) to standardize term vocabulary across other ontology frameworks.

TABLE 2
www.frontiersin.org

Table 2 List of trait classes, descriptions, and corresponding frequencies in the Sainfoin Crop Ontology.

Extensive examination of ontologies from other species revealed inconsistent approaches for assigning Entities and Attributes, especially those that can have multiple states. This multiple state problem is common when a treatment or processing step is applied (e.g., drying, boiling, milling). Using the example of the Entity {Forage} and the attribute {mass}, which can be measured either in a fresh or dried state. The CO guidelines specify two distinct approaches for assigning the state “dry”, but each has its own challenge.

1. The “dry” state is assigned to the Entity {Dry forage} rather than the Attribute{mass}.

a. This approach creates a hierarchy of entities with multiple states rather than treating entities as a single observed part of a plant.

2. The “dry” state is not assigned to either the Entity {Forage} or Attribute {mass} but is instead included in the Method describing how Forage mass was measured either fresh or dry.

b. This approach results in multiple traits with the same name, which can create downstream challenges for users in selecting the proper Variable in tools such as Field Book (Rife and Poland, 2014) or Gridscore (Raubach et al., 2022). This is especially important in the sainfoin ontology where only the Method class abbreviation is included in the Variable name.

We chose a third approach, which was to assign the state to the attribute instead of the entity or method. In this fashion, any traits with state(s) would be constructed as {Entity} {(state) attribute} as in Forage dry mass and Forage fresh mass. With this format, we can have Methods that simply describe the Method for a given Trait(s) without having to form specific Methods for each Trait that only differ in the state of a Trait.

Finally, when choosing Attribute words, we opted for words that were specific enough to be contextually correct, but general enough that they could be used in more than one context. For example, “mass” was used instead of “weight”, and “mass” was chosen over “biomass” since the latter adds no further meaning when the context is already scoped to biological organisms.

2.4 Method composition framework

Methods are the component of a Variable that describes how an observation is made. The framework we followed for composing Methods was based on the outline specified in the CO guidelines, with some modifications that add clarity to the procedures and allow for some flexibility between different breeding programs’ goals. Our modifications fall into three main categories: Method name, Method description, and Method abbreviation.

First, we constrain the Method name to be a succinct, human-readable name appended with the Method class abbreviation. In most cases this should be the trait name followed by the Method class the Method belongs to following the format of {Trait} {Method class}. For example, ‘Leaflet SPAD measurement’ tells us this Method describes how to measure SPAD values of a leaflet. However, exceptions were made when general Methods could be assigned to multiple traits, as in the case of ‘Object equivalent diameter measurement’, which describes a simple image processing technique that could be applied to many different objects in an image. Methods were categorized into one of the seven classes (measurement, counting, estimation, computation, prediction, description, or classification) defined in the CO Guidelines.

Second, the Method description should be structured according to the format shown in Figure 1. While the Method description is allowed to be a free text field in the CO guidelines, we constrained our descriptions to include at minimum the following structured information: Description: A brief, one sentence description of the Method; Materials: A semicolon separated list of materials or supplies needed for the Method; Dependent_on: A semicolon separated list of Variables that the Method uses for computational purposes or for normalization. Variables can be specified either by the variable label (Raceme length msr [cm]), ID (CO_369:0000061), or name (RaL_RaLmsr_cm); Protocol: A detailed, ordered list of steps in the protocol needed to complete the Method. When it is infeasible to write out a lengthy protocol, it is indicated by writing “See full protocol in {publication title, author, year}”.

FIGURE 1
www.frontiersin.org

Figure 1 Schematic representation of the Method development for the Sainfoin Crop Ontology.

Many existing CO Method descriptions either present no added information beyond that which is already specified in the Method name or present methodology too vague to be followed without error. We developed our Method descriptions to be agnostic regarding the sampling procedure so that the employment of a given Method can be adapted to the end user’s specific experimental design. This modification is not entirely in line with the CO guidelines which state that the sampling should specify whether the observation is collected on a single plant or an aggregation from multiple plants (an experimental group or plot). However, this poses an issue with the scope of sample sizes (e.g., number of samples aggregated) commonly seen in breeding or agronomic trials. Specifically, a trait collected in greenhouse studies using single plant reps should, according to this guideline, have a separate method from the same trait collected in a field trial with collections of plants in a plot or sward. This leads to, at most, doubling the number of variables in the ontology. Furthermore, while the guidelines state that the experimental protocol should be distinguished from the observational protocol, the authors realize that the sampling protocol is inevitably linked to the experimental design and informs the observation procedure. Various research groups have their preferred sampling methods, dependent on goals, scale of projects, funding, etc.; therefore, we leave the specific sampling procedure up to the individual researchers and only dictate that the trait value is represented as-is for data collected on one observational unit, or the mean/aggregated value for a collection of observational units.

Finally, we constrain the abbreviation of the Method used in Variable name construction to include the Trait abbreviation, or if not possible, at least a portion of the Trait abbreviation. Ambiguous abbreviations such as ‘PH’ for ‘Plant height’ or ‘pH’ were avoided. Additionally, the standard abbreviation of the Method class is appended at the end: msr, cnt, est, cmp, prd, dsc, or cls (See Supplemental Table 3). This is to avoid confusion about what type of methodology is being employed in a Variable.

New Methods are to be developed following the above guidelines as schematized in Figure 1. Briefly, there are five obligatory Method components: Method ID, Method name, Method class, Method description, and Method abbreviation. The Method ID is assigned automatically and incrementally. The Method class must be selected among the seven different Method classes (Figure 1).

2.5 Scale composition framework

Scales are the component of a variable that describes how the observation is expressed. Scales were composed of units in the International System of Units (SI) with their associated official abbreviations. Scales were cross referenced to the Units of measurement Ontology (Gkoutos et al., 2012) or other ontologies such as PO and AGRO when applicable. Units with a ‘μ’ prefix were included even though they are not a part of the ASCII standard character set, however, ‘u’ may be used in place of ‘μ’ when convenient. Scales were specified as either a Date, Duration, Nominal, Numerical, Ordinal, Text, or Code Scale class.

2.6 Variable composition framework

Variables are the breeder’s or agronomist’s observations or measurements. The CO model defines a Variable as a unique combination of a Trait, Method, and Scale (Variable = Trait + Method + Scale), which allows for standardized data collection, storage, and sharing. Variable labels are human readable: they are composed of a Trait name followed by an associated Method class abbreviation, and a Scale enclosed in square brackets (e.g., Seed length msr [mm]); and used in scientific discussions and publications. Variable abbreviations, also referred to as names in the CO guidelines, are composed of {Trait abbreviation}_{Method abbreviation}_{Scale abbreviation} with no further modifications, (e.g., ‘SdL_Lmsr_mm’); and used as unique IDs in databases, analyses, and phenotyping applications. A detailed schematic of Variable composition is shown in Figure 2.

FIGURE 2
www.frontiersin.org

Figure 2 Schematic representation of Variable term composition in the Sainfoin Crop Ontology. In this example, the Variable ‘Canopy height msr [cm]’ is used to demonstrate how complex ontology terms are constructed of simpler terms and how the term identification number system functions. For each term type, (Variable, Trait, Method, and Scale) the PUID, term label and abbreviations are listed in order.

3 Populating the sainfoin ontology

3.1 Gathering a broad target list and shortlist of most used traits

Despite limited interest in sainfoin in the research community, a wealth of research targeting various aspects of the crop has been published in different languages. In addition to forage yield and quality traits, adaptability, resistance to biotic and abiotic stress factors, biochemical, physiological, morphological and phenological traits have been investigated. Research targeting cytogenetic, morphological, and molecular aspects along with taxonomic status of sainfoin and allied taxa have also been investigated and reported (Aktoklu, 1995; Hayot Carbonero et al., 2011; Kempf et al., 2017; Şakiroğlu, 2021).

We performed an in-depth literature review in the languages accessible to our consortium (English and Turkish) to develop a comprehensive list of Variables that have been previously measured and reported in sainfoin. The process of determining which of these variables to include, and the specific Trait, Method, and Scale terms to use, required many discussions, compromises, and decisions that spanned multiple months. The terms we included reflect the traits, methods, and scales currently being used in breeding and research programs led by the authors of this manuscript. In some instances, multiple variables are included for the same trait reflecting the differences in methods or scales used within the consortium. We selected 76 sainfoin Variables to include in sainfoin CO v1, of which the majority are from the agronomic, morphological, quality, and phenological trait classes (Table 2). We observed many traits from other classes (e.g. abiotic stress and biotic stress) during our literature review, but did not incorporate them into Variables in the first version of the sainfoin CO. We have included a list of 79 Traits that could be added to the sainfoin CO in the future as our community’s expertise or interest in these other trait classes expands (Supplementary Table 1).

3.2 Adding quality traits relevant to sainfoin as a perennial grain crop

Developing sainfoin as a new perennial pulse will require measuring new traits that are typically relevant only to cereal and grain legume crops. We reviewed the ontologies for several grain crops, including wheat, barley, oat, and common bean, that are available on the CO platform, and we compiled a list of 258 Variables from multiple trait classes that have not been previously measured in sainfoin but might be applicable to sainfoin breeding as a perennial pulse crop (Supplemental Table 2). While cereals provide a broad frame for grain related traits, common bean traits are particularly relevant, serving as a source of legume specific traits.

A comprehensive understanding of the chemical composition and nutritional quality of sainfoin seeds is needed to determine the safety of this new food source and the nutritional value of sainfoin as a novel pulse crop. Some of these quality traits, such as crude protein, dietary fiber, and phytic acid content, were recently measured in sainfoin for the first time (Craine et al., 2023). Compared to other pulses, depodded sainfoin seeds have higher protein content, as reported by Baldinger et al. (2016) (38.8%), Woodman and Evans (1947) (36.6%), Craine et al. (2023) (38.78%), and Ditterline (1973) (36.0%), and comparable iron and zinc content, as reported by Craine et al. (2023) (Fe, 56.25 - 74.24 ppm; Zn, 54.78 - 79.05 ppm), each of which plays a vital role in human health.

Of the many potential “grain” related traits, we chose to create only 8 Variables related to seed quality in the initial sainfoin CO, which were recently profiled in a study evaluating sainfoin seed attributes (Craine et al., 2023). Creating Variables and appropriate terms for these eight seed quality Traits, namely, protein, crude fat, carbohydrates, total starch, dietary fiber, iron, zinc, and phytic acid content, was simplified by their common use across many crop species (Table 3). These quality Traits are measured using methods approved by the Association of Official Agricultural Chemists (AOAC) and/or American Association of Cereal Chemists (AACC), and the appropriate AOAC and AACC method codes are referenced in the related Method description in the sainfoin CO.

TABLE 3
www.frontiersin.org

Table 3 Variables from Craine et al. (2023) that are included in the Sainfoin Crop Ontology V1.

4 Future perspectives

4.1 Guidelines for contributing to the sainfoin ontology

The sainfoin CO represents a necessary step towards making sainfoin research accessible and discernable to an international community of researchers. However, the sainfoin CO is far from complete. Sainfoin has tolerance to various biotic and abiotic stresses, and traits related to sainfoin drought and salinity tolerance would be of immense importance in agriculture (Heinrichs, 1972; Juan, 1982; Morrill et al., 1998; Meyer and Badaruddin, 2001; Irani et al., 2015; Kölliker et al., 2017). We encourage researchers with expertise and experience in areas not represented in the current sainfoin CO to contribute to expanding its scope and utility in the future.

Researchers can suggest and submit new sainfoin ontology terms (I.e., Variables, Traits, Methods, or Scales) through https://trait-requests.planteome.org/or a GitHub issues template form. Extant terms can be updated with sufficient rationale, or term synonyms can be suggested where two competing names are commonly used to describe that term. Any new terms should meet the baseline guidelines laid forth in the official CO Guidelines v2.1, and conform to the additional requirements and constraints set forth above. Such a system will aid in constructing a more helpful ontology. Two curators (English and Turkish native speakers) are actively maintaining and improving the sainfoin CO. The curators are notified upon any new term submission, and follow-up discussions about the term are handled through GitHub issues.

4.2 The roles of crop ontologies in developing new sustainable crops and cropping systems

In this manuscript, we share our experiences in building the sainfoin CO in hopes that we can continue improving research infrastructure for the international sainfoin community and to provide a template for future crop ontology development for other perennial grains, forages or minor pulse crops. There is global recognition for the growing need for agroecosystem sustainability and resilience to climate change (FAO (Food and Nations), A. O. of the United Nations, 2018; Tittonell, 2020). Sainfoin, as both a perennial pulse and perennial forage, has the potential to contribute towards these goals internationally; however, many other new and underutilized crops will be needed in various contexts. We expect that data infrastructure, such as the sainfoin CO presented herein, combined with technology to collect multi-dimensional data at scales and rates higher than ever before, will allow researchers from multiple languages and research disciplines to collaborate effectively to make rapid progress towards domesticating new perennial grains, developing new sustainable cropping systems, and preparing agriculture internationally for climate challenges in the future.

Data availability statement

The datasets presented in this study can be found in online repositories. The names of the repository/repositories and accession number(s) can be found below: https://cropontology.org/term/CO_369:ROOT https://github.com/Planteome/CO_369-sainfoin-traits.

Author contributions

Conceptualization: BS, MS. Data curation - EK, KE, ECr, MAc, MAy, SB, BM, JB. Funding acquisition - BS, MS, AM. Methodology- BM, SB, JH, EK, KE, ECr, SB, LD, OT. Project administration - BS, MS, BM. Software - MAy, BM. Supervision - MS, BS. Visualization – BM. Writing – original draft - MS, BM, BS, KE, EK, ECr, ECa, MR. Writing – review & editing - All authors. All authors contributed to the article and approved the submitted version.

Funding

The Scientific and Technological Research Council of Turkey (TUBITAK) Research Grant (number 120C136); The Foundation for Food & Agriculture Research Seeding Solutions Grant, The Perennial Agriculture Project, a joint venture between the Malone Family Foundation and The Land Institute; Clemson University Startup.

Conflict of interest

The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.

Publisher’s note

All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article, or claim that may be made by its manufacturer, is not guaranteed or endorsed by the publisher.

Supplementary material

The Supplementary Material for this article can be found online at: https://www.frontiersin.org/articles/10.3389/fpls.2023.1177406/full#supplementary-material

Supplementary Table 1 | List of important Traits and Variables in sainfoin as determined by literature review.

Supplementary Table 2 | Traits and variables from wheat, oat, barley, and common bean that might be relevant to sainfoin.

Supplementary Table 3 | List of entities and descriptions in the ontology.

Abbreviations

Crop Ontology, CO; Plant Ontology, PO; Agronomy Ontology, AGRO; Plant Trait Ontology, TO; Plant Experimental Conditions Ontology, PECO; The Environment Ontology, ENVO; The Phenotype And Trait Ontology, PATO; The Consultative Group on International Agricultural Research, CGIAR.

References

Agronomy Ontology. CGIAR Platform for big data in agriculture. Available at: https://bigdata.cgiar.org/resources/agronomy-ontology/ (Accessed August 23, 2022).

Google Scholar

Aktoklu, E. (1995). Revision of onobrychis miller (Fabaceae) species grown in Turkey. (Malatya, Türkiye: PhD Dissertation. Inönü Univ.).

Google Scholar

Arnaud, E., Cooper, L., Menda, N., Nelson, R., Matteis, L., Skofic, M., et al. (2012). Towards a Reference Plant Trait Ontology For Modeling Knowledge of Plant Traits and Phenotypes. Proceedings of the International Conference on Knowledge Engineering and Ontology Development (KEOD-2012), 220–225. doi: 10.13140/2.1.2550.3525

CrossRef Full Text | Google Scholar

Arnaud, E., Laporte, M.-A., Kim, S., Aubert, C., Leonelli, S., Miro, B., et al. (2020). The ontologies community of practice: a CGIAR initiative for big data in agrifood systems. Patterns 1, 100105. doi: 10.1016/j.patter.2020.100105

PubMed Abstract | CrossRef Full Text | Google Scholar

Aubert, C., Buttigieg, P., Laporte, M., Devare, M., Arnaud, E. (2017) CGIAR agronomy ontology. Available at: http://purl.obolibrary.org/obo/agro.owl.

Google Scholar

Avraham, S., Tung, C.-W., Ilic, K., Jaiswal, P., Kellogg, E. A., McCouch, S., et al. (2008). The plant ontology database: a community resource for plant structure and developmental stages controlled vocabulary and annotations. Nucleic Acids Res. 36, D449–D454. doi: 10.1093/nar/gkm908

PubMed Abstract | CrossRef Full Text | Google Scholar

Baldinger, L., Hagmüller, W., Minihuber, U., Matzner, M., Zollitsch, W. (2016). Sainfoin seeds in organic diets for weaned piglets–utilizing the protein-rich grains of a long-known forage legume. Renew. Agric. Food Syst. 31, 12–21. doi: 10.1017/S1742170514000386

CrossRef Full Text | Google Scholar

Bennett, S., Francis, C., Reid, B. (2001). “Minor and under-utilised legumes,” in Plant genetic resources of legumes in the Mediterranean current plant science and biotechnology in agriculture. Eds. Maxted, N., Bennett, S. J. (Dordrecht: Springer Netherlands), 207–230. doi: 10.1007/978-94-015-9823-1_12

CrossRef Full Text | Google Scholar

Butkutė, B., Padarauskas, A., Cesevičienė, J., Taujenis, L., Norkevičienė, E. (2018a). Phytochemical composition of temperate perennial legumes. Crop Pasture Sci. 69, 1020. doi: 10.1071/CP18206

CrossRef Full Text | Google Scholar

Butkutė, B., Taujenis, L., Norkevičienė, E. (2018b). Small-seeded legumes as a novel food source. variation of nutritional, mineral and phytochemical profiles in the chain: raw seeds-sprouted seeds-microgreens. Molecules 24, 133. doi: 10.3390/molecules24010133

PubMed Abstract | CrossRef Full Text | Google Scholar

Buttigieg, P., Pafilis, E., Lewis, S., Schildhauer, M., Walls, R., Mungall, C. (2016). The environment ontology in 2016: bridging domains with increased scope, semantic density, and interoperation. J. BioMed. Semant 7, 57. doi: 10.1186/s13326-016-0097-6

CrossRef Full Text | Google Scholar

Cooper, L., Jaiswal, P. (2016). The plant ontology: a tool for plant genomics. Methods Mol. Biol. 1374, 89–114. doi: 10.1007/978-1-4939-3167-5_5

PubMed Abstract | CrossRef Full Text | Google Scholar

Cooper, L., Meier, A., Laporte, M.-A., Elser, J. L., Mungall, C., Sinn, B. T., et al. (2018). The planteome database: an integrated resource for reference ontologies, plant genomics and phenomics. Nucleic Acids Res. 46, D1168–D1180. doi: 10.1093/nar/gkx1152

PubMed Abstract | CrossRef Full Text | Google Scholar

Cooper, L., Walls, R. L., Elser, J., Gandolfo, M. A., Stevenson, D. W., Smith, B., et al. (2013). The plant ontology as a tool for comparative plant anatomy and genomic analyses. Plant Cell Physiol. 54, e1. doi: 10.1093/pcp/pcs163

PubMed Abstract | CrossRef Full Text | Google Scholar

Craine, E. B., Şakiroğlu, M., Peters, T. E., Barriball, S., Schlautman, B. (2023). Nutritional quality of onobrychis viciifolia (Scop.) seeds: a potentially novel perennial pulse crop for human use. Legume Sci. 2023, e189. doi: 10.1002/leg3.189

CrossRef Full Text | Google Scholar

Ćwiek-Kupczyńska, H., Altmann, T., Arend, D., Arnaud, E., Chen, D., Cornut, G., et al. (2016). Measures for interoperability of phenotypic data: minimum information requirements and formatting. Plant Methods 12, 44. doi: 10.1186/s13007-016-0144-4

PubMed Abstract | CrossRef Full Text | Google Scholar

Ditterline, R. L. (1973) Yield and yield components of sainfoin (Obobrychis viciaefolia scop.) seed and an evaluation of its use as a protein supplement (ScholarWorks at Montana State University). Available at: https://scholarworks.montana.edu/xmlui/bitstream/handle/1/4295/31762100107232.pdf?sequence=1 (Accessed November 17, 2022).

Google Scholar

FAO. (2018). The 10 elements of agroecology: guiding the transition to sustainable food and agricultural systems (Rome: Food and Agriculture Organization of the United Nations).

Google Scholar

Frame, J., Charlton, J. F. L., Laidlaw, A. S. (1998). Temperate forage legumes (Wallingford, Oxon, UK ; New York, NY, USA: CAB International).

Google Scholar

Gkoutos, G. Phenotype and trait ontology. Available at: https://github.com/pato-ontology/pato/.

Google Scholar

Gkoutos, G., Schofield, P., Hoehndorf, R. (2012). The units ontology: a tool for integrating units of measurement int science. Database, bas033. doi: 10.1093/database/bas033

PubMed Abstract | CrossRef Full Text | Google Scholar

Gruber, T. (2009). Ontology in encyclopedia of database systems. Eds. Liu, L., Özsu, M.T.. Springer-Verlag.

Google Scholar

Hayot Carbonero, C., Mueller-Harvey, I., Brown, T. A., Smith, L. (2011). Sainfoin (onobrychis viciifolia): a beneficial forage legume. Plant Genet. Resour. 9, 70–85. doi: 10.1017/S1479262110000328

CrossRef Full Text | Google Scholar

Heinrichs, D. H. (1972). Root-zone temperature effects on flooding tolerance of legumes. Can. J. Plant Sci. 52, 985–990. doi: 10.4141/cjps72-169

CrossRef Full Text | Google Scholar

Ilic, K., Kellogg, E. A., Jaiswal, P., Zapata, F., Stevens, P. F., Vincent, L. P., et al. (2007). The plant structure ontology, a unified vocabulary of anatomy and morphology of a flowering plant. Plant Physiol. 143, 587–599. doi: 10.1104/pp.106.092825

PubMed Abstract | CrossRef Full Text | Google Scholar

Irani, S., Majidi, M. M., Mirlohi, A. (2015). Half-Sib matting and genetic analysis of agronomic, morphological, and physiological traits in sainfoin under nonstressed versus water-deficit conditions. Crop Sci. 55, 123–135. doi: 10.2135/cropsci2014.03.0235

CrossRef Full Text | Google Scholar

Jaiswal, P., Avraham, S., Ilic, K., Kellogg, E. A., McCouch, S., Pujar, A., et al. (2005). Plant ontology (PO): a controlled vocabulary of plant structures and growth stages. Comp. Funct. Genomics 6, 388–397. doi: 10.1002/cfg.496

PubMed Abstract | CrossRef Full Text | Google Scholar

Juan, N. F. (1982) Some effects of boron to the growth and chemical composition of sainfoin (Onobrychis viciaefolia scop.) : a thesis presented in partial fulfilment of the requirmeents for the degree of master of agricultural science in plant science at Massey university, palmerston north, new Zealand. Available at: https://mro.massey.ac.nz/handle/10179/13712 (Accessed January 22, 2023).

Google Scholar

Kempf, K., Malisch, C. S., Grieder, C., Widmer, F., Kölliker, R. (2017). Marker-trait association analysis for agronomic and compositional traits in sainfoin (Onobrychis viciifolia). Genet. Mol. Res. 16 (1). doi: 10.4238/gmr16019483

PubMed Abstract | CrossRef Full Text | Google Scholar

Kölliker, R., Kempf, K., Malisch, C. S., Lüscher, A. (2017). Promising options for improving performance and proanthocyanidins of the forage legume sainfoin (Onobrychis viciifolia scop.). Euphytica 213, 179. doi: 10.1007/s10681-017-1965-6

CrossRef Full Text | Google Scholar

Langridge, P., Fleury, D. (2011). Making the most of ‘omics’ for crop breeding. Trends Biotechnol. 29, 33–40. doi: 10.1016/j.tibtech.2010.09.006

PubMed Abstract | CrossRef Full Text | Google Scholar

Leonelli, S., Davey, R. P., Arnaud, E., Parry, G., Bastow, R. (2017). Data management and best practice for plant science. Nat. Plants 3, 1–4. doi: 10.1038/nplants.2017.86

CrossRef Full Text | Google Scholar

Li, Q., Yan, J. (2020). Sustainable agriculture in the era of omics: knowledge-driven crop breeding. Genome Biol. 21, 1–5. doi: 10.1186/s13059-020-02073-5

CrossRef Full Text | Google Scholar

Matteis, L., Chibon, P. Y., Espinosa, H., Skofic, M., Finkers, H. J., Bruskiewich, R., et al. (2013) Crop ontology: vocabulary for crop-related concepts. Available at: https://library.wur.nl/WebQuery/wurpubs/441015 (Accessed February 5, 2022).

Google Scholar

Meyer, D. W., Badaruddin, M. (2001). Frost tolerance of ten seedling legume species at four growth stages. Crop Sci. 41, 1838–1842. doi: 10.2135/cropsci2001.1838

CrossRef Full Text | Google Scholar

Morales, N., Ogbonna, A. C., Ellerbrock, B. J., Bauchet, G. J., Tantikanjana, T., Tecle, I. Y., et al. (2022). Breedbase: a digital ecosystem for modern plant breeding. G3 (Bethesda) 12, jkac078. doi: 10.1093/g3journal/jkac078

PubMed Abstract | CrossRef Full Text | Google Scholar

Morrill, W. L., Ditterline, R. L., Cash, S. D. (1998). Insect pests and associated root pathogens of sainfoin in western USA. Field Crops Res. 59, 129–134. doi: 10.1016/S0378-4290(98)00113-0

CrossRef Full Text | Google Scholar

Papoutsoglou, E. A., Faria, D., Arend, D., Arnaud, E., Athanasiadis, I. N., Chaves, I., et al. (2020). Enabling reusability of plant phenomic datasets with MIAPPE 1.1. New Phytol. 227, 260–273. doi: 10.1111/nph.16544

PubMed Abstract | CrossRef Full Text | Google Scholar

Pietragalla, J., Valette, L., Shrestha, R., Laporte, M.-A., Hazekamp, T., Arnaud, E. (2022) Guidelines for creating crop-specific ontologies to annotate phenotypic data: version 2.1. Available at: https://cgspace.cgiar.org/handle/10568/110906 (Accessed November 21, 2022).

Google Scholar

Pujar, A., Jaiswal, P., Kellogg, E. A., Ilic, K., Vincent, L., Avraham, S., et al. (2006). Whole-plant growth stage ontology for angiosperms and its application in plant biology. Plant Physiol. 142, 414–428. doi: 10.1104/pp.106.085720

PubMed Abstract | CrossRef Full Text | Google Scholar

Raubach, S., Schreiber, M., Shaw, P. D. (2022). GridScore: a tool for accurate, cross-platform phenotypic data collection and visualization. BMC Bioinf. 23, 214. doi: 10.1186/s12859-022-04755-2

CrossRef Full Text | Google Scholar

Rife, T. W., Poland, J. A. (2014). Field book: an open-source application for field data collection on android. Crop Sci. 54, 1624–1627. doi: 10.2135/cropsci2013.08.0579

CrossRef Full Text | Google Scholar

Şakiroğlu, M. (2021). “Population genomics of perennial temperate forage legumes,” in Population genomics (Cham: Springer International Publishing). doi: 10.1007/13836_2021_90

CrossRef Full Text | Google Scholar

Schlautman, B., Barriball, S., Ciotir, C., Herron, S., Miller, A. J. (2018). Perennial grain legume domestication phase I: criteria for candidate species selection. Sustainability 10, 730. doi: 10.3390/su10030730

CrossRef Full Text | Google Scholar

Selby, P., Abbeloos, R., Backlund, J. E., Salido, M. B., Bauchet, G., Benites-Alfaro, O. E., et al. (2019). BrAPI–an application programming interface for plant breeding applications. Bioinformatics 35, 4147. doi: 10.1093/bioinformatics/btz190

PubMed Abstract | CrossRef Full Text | Google Scholar

Tarasenko, N., Butina, E., Gerasimenko, E. (2015). Peculiarities of chemical composition of sainfoin seeds powder. Orient. J. Chem. 31, 1673–1682. doi: 10.13005/ojc/310346

CrossRef Full Text | Google Scholar

Tittonell, P. (2020). Assessing resilience and adaptability in agroecological transitions. Agric. Syst. 184, 102862. doi: 10.1016/j.agsy.2020.102862

CrossRef Full Text | Google Scholar

Tristram, H. B. (1885). The survey of western Palestine: the fauna and flora of Palestine (The Committee of the Palestine Exploration Fund: London).

Google Scholar

Van Tassel, D. L., DeHaan, L. R., Diaz-Garcia, L., Hershberger, J., Rubin, M. J., Schlautman, B., et al. (2022). Re-imagining crop domestication in the era of high throughput phenomics. Curr. Opin. Plant Biol. 65, 102150. doi: 10.1016/j.pbi.2021.102150

PubMed Abstract | CrossRef Full Text | Google Scholar

Wieczorek, J., Bloom, D., Guralnick, R., Blum, S., Döring, M., Giovanni, R., et al. (2012). Darwin Core: an evolving community-developed biodiversity data standard. PloS One 7, e29715. doi: 10.1371/journal.pone.0029715

PubMed Abstract | CrossRef Full Text | Google Scholar

Wijekoon, C., Acharya, S. N., Siow, Y. L., Sura, S., Thandapilly, S., Sabra, A. (2021). Canadian Sainfoin and fenugreek as forage and functional foods. Crop Sci. 61, 1–20. doi: 10.1002/csc2.20280

CrossRef Full Text | Google Scholar

Woodman, H. E., Evans, R. E. (1947). The chemical composition and nutritive value of ryegrass-seed meal, clover-seed meal, lucerne-seed meal and sainfoin-seed meal. J. Agric. Sci. 37, 311–315. doi: 10.1017/S0021859600008133

CrossRef Full Text | Google Scholar

Keywords: sainfoin, Onobrychis spp., crop ontology, perennial grain, pulse, forage

Citation: Karabulut E, Erkoç K, Acı M, Aydın M, Barriball S, Braley J, Cassetta E, Craine EB, Diaz-Garcia L, Hershberger J, Meyering B, Miller AJ, Rubin MJ, Tesdell O, Schlautman B and Şakiroğlu M (2023) Sainfoin (Onobrychis spp.) crop ontology: supporting germplasm characterization and international research collaborations. Front. Plant Sci. 14:1177406. doi: 10.3389/fpls.2023.1177406

Received: 01 March 2023; Accepted: 18 April 2023;
Published: 15 May 2023.

Edited by:

Philipp Von Gillhaussen, Interantional Plant Phenotyping Network (IPPN), Germany

Reviewed by:

Fabio Fiorani, Helmholtz Association of German Research Centres (HZ), Germany
Dionysia Apostolos Fasoula, Agricultural Research Insitute, Cyprus

Copyright © 2023 Karabulut, Erkoç, Acı, Aydın, Barriball, Braley, Cassetta, Craine, Diaz-Garcia, Hershberger, Meyering, Miller, Rubin, Tesdell, Schlautman and Şakiroğlu. This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.

*Correspondence: Muhammet Şakiroğlu, bXNha2lyb2dsdUBhdHUuZWR1LnRy; Brandon Schlautman, c2NobGF1dG1hbkBsYW5kaW5zdGl0dXRlLm9yZw==

Disclaimer: All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article or claim that may be made by its manufacturer is not guaranteed or endorsed by the publisher.