Phylogenetic analysis and structural prediction reveal the potential functional diversity between green algae SWEET transporters

Fleet, Jack; Ansari, Mujtaba; Pittman, Jon K.

doi:10.3389/fpls.2022.960133

ORIGINAL RESEARCH article

Front. Plant Sci., 15 September 2022

Sec. Plant Membrane Traffic and Transport

Volume 13 - 2022 | https://doi.org/10.3389/fpls.2022.960133

This article is part of the Research TopicInsights in Plant Membrane Traffic and Transport: 2021View all 4 articles

Phylogenetic analysis and structural prediction reveal the potential functional diversity between green algae SWEET transporters

Jack Fleet¹

Mujtaba Ansari²

Jon K. Pittman¹^*

¹Department of Earth and Environmental Sciences, Faculty of Science and Engineering, School of Natural Sciences, The University of Manchester, Manchester, United Kingdom
²School of Biological Sciences, Faculty of Biology, Medicine and Health, The University of Manchester, Manchester, United Kingdom

Sugar-Will-Eventually-be-Exported-Transporters (SWEETs) are an important family of sugar transporters that appear to be ubiquitous in all organisms. Recent research has determined the structure of SWEETs in higher plants, identified specific residues required for monosaccharide or disaccharide transport, and begun to understand the specific functions of individual plant SWEET proteins. However, in green algae (Chlorophyta) these transporters are poorly characterised. This study identified SWEET proteins from across representative Chlorophyta with the aim to characterise their phylogenetic relationships and perform protein structure modelling in order to inform functional prediction. The algal genomes analysed encoded between one and six SWEET proteins, which is much less than a typical higher plant. Phylogenetic analysis identified distinct clusters of over 70 SWEET protein sequences, taken from almost 30 algal genomes. These clusters remain separate from representative higher or non-vascular plant SWEETs, but are close to fungi SWEETs. Subcellular localisation predictions and analysis of conserved amino acid residues revealed variation between SWEET proteins of different clusters, suggesting different functionality. These findings also showed conservation of key residues at the substrate-binding site, indicating a similar mechanism of substrate selectivity and transport to previously characterised higher plant monosaccharide-transporting SWEET proteins. Future work is now required to confirm the predicted sugar transport specificity and determine the functional role of these algal SWEET proteins.

Introduction

Sugars are essential components of carbon metabolism, and are a key source of energy for growth and development in all organisms. Photosynthetic organisms including plants and algae are valuable sources of sugars both for nutritional uses but also for other industrial applications, such as for fermentation to produce biofuels (Bhaumik and Dhepe, 2016; Figueroa-Torres et al., 2020). The ability of many algae strains (particularly unicellular microalgae) to achieve fast growth and high biomass yield, and with the potential for cultivation under conditions that will not compete with agriculture, allows for a more scalable and sustainable source of fermentable sugars (Chen et al., 2013). It is therefore important to understand the molecular mechanisms of sugar metabolism and transport in algae in order to maximise their potential as a renewable fuel feedstock (Johnson and Alric, 2013), as well as to enhance our fundamental understanding of carbohydrate metabolism processes in these organisms.

In higher plants, there are several classes of sugar transporter that play crucial roles in sugar partitioning and carbohydrate metabolism, which include the more recently discovered Sugar Will Eventually be Exported Transporter (SWEET) family (Chen et al., 2010; Julius et al., 2017). They are bidirectional uniporters that can allow the movement of sugars across a membrane as determined by the concentration gradient (Xue et al., 2022). The discovery of SWEETs was a notable development, particularly as they were found in all Kingdoms of life, have a broad range of functionality and high level of conservation between species (Jia et al., 2017). As a result, there is significant, ongoing investigation into the evolution of SWEET proteins, their structure, and their functional purpose in organisms. In plants, SWEETs participate in both monosaccharide and disaccharide transport across the plasma membrane and organellar membranes (Chen et al., 2010, 2012) and are shown to have a diverse range of functionality. This includes providing sugars to fungal symbionts or pathogens (Gao et al., 2018; Jeena et al., 2019; Gupta et al., 2021), mediating sucrose efflux from sink tissue cells and vascular pathway cells during phloem loading or unloading (Chen et al., 2012; Lin et al., 2014), or mobilising sugars during plant development (Guan et al., 2008; Shammai et al., 2018). Recent evidence suggest SWEETs may even have the capacity to transport non-sugar substrates. Screening for gibberellin transporters revealed the capacity for low-affinity bidirectional transport by AtSWEET13 and AtSWEET14 (Kanno et al., 2016). Alternatively, analysis of OsSWEET3a revealed dual functionality as both a glucose and gibberellin transporter, mediating phloem loading and early plant development (Morii et al., 2020). Full functionality and diversity of plant SWEET proteins are yet unknown.

Phylogenetic characterisation of SWEETs in angiosperms is based around the nomenclature given to the first identified SWEETs from Arabidopsis thaliana (Chen et al., 2010). AtSWEET1 to AtSWEET17 were categorised into four distinct clades: SWEET1–3 (Clade I), SWEET4–8 (Clade II), SWEET9–15 (Clade III), and SWEET16–17 (Clade IV; Eom et al., 2015; Sui et al., 2017). SWEETs in each clade are distinguished in part by their propensity to transport monosaccharides (Clade I, II, IV) or disaccharides (Clade III). Clade I and II SWEETs exclusively transport monosaccharide sugars including glucose and mostly localise to the plasma membrane, although some isoforms show vacuole membrane localisation (Chen et al., 2010, 2015). Preferential disaccharide (mainly sucrose) transport is a characteristic of Clade III SWEETs, which localise to intracellular or plasma membranes (Chen et al., 2012; Yuan and Wang, 2013; Lin et al., 2014; Gao et al., 2018). Clade IV SWEETs are distinguished from the other clades as they exclusively localise to the vacuole and have the capacity to transport several different monosaccharide sugars (Chen et al., 2010; Chardon et al., 2013; Guo et al., 2014). This functional variation between SWEET clades can be attributed to specific amino acids within conserved motifs on SWEET proteins. For example, the addition of a conserved Trp residue in the second transmembrane (TM) helix distinguishes Clades I and II, from Clades III and IV. An additional conserved motif containing the positively charged amino acids His and Arg, distinguishes Clade II SWEETs from Clade I, whereas the conservation of Arg and Trp residues within a Clade III motif distinguishes SWEETs from Clade IV (Jia et al., 2017).

SWEETs are characterised by a distinct TM structure: plant SWEETs possess seven TM helices arranged into two TM helix bundles (THB). Each THB contains three TM helices linked by a central TM4 helix (Figure 1A), with this structure referred to as a 3-1-3 formation (Chen et al., 2010). Each THB structure contains a conserved PQ-loop repeat motif (Pfam: PF04193; Xuan et al., 2013). Crystal structures of SWEET proteins from rice (Oryza sativa; OsSWEET2b) and A. thaliana (AtSWEET13) distinguish the TM helices within the THBs, such that THB1 forms the N-terminal half of the protein arranged in a (1-3-2) + 4 structure, while THB2 forms the C-terminal half arranged in a (5-7-6) formation (Figures 1A,B). Spanning the membrane, TM4 aligns closer to THB1 and covalently fuses THB1 and THB2 asymmetrically, creating a pore for substrate translocation (Tao et al., 2015; Han et al., 2017; Jia et al., 2017). Conformational change mediated by substrate binding drives an alternating access mechanism of passive transport of the substrate through the pore (Figure 1C) with specific residues critical to this mechanism (Latorraca et al., 2017; Selvam et al., 2019). There is also evidence that plant SWEETs can form functional homo- or hetero-oligomers (Xuan et al., 2013; Tao et al., 2015; Han et al., 2017). Prokaryotic SWEET proteins are arranged with a single THB (1-3-2), hence are dubbed SemiSWEETs (Xu et al., 2014), but they can symmetrically dimerise without a TM4 inversion linker to create a pore for the transport of sugars (Xuan et al., 2013). Given the high conservation of the (1-3-2) THB arrangement between SWEETs and SemiSWEETs, it has been hypothesised that eukaryotic SWEETs evolved from a fusion of SemiSWEETs, which have also been identified in eukaryotes including higher plants and algae (Jia et al., 2017).

FIGURE 1

Figure 1. The structure and transport mechanism of SWEET proteins. (A) A schematic of a typical eukaryotic SWEET showing the seven transmembrane (TM) helices that are organised into two TM helix bundles (THB). (B) The crystal structure of AtSWEET13 with each TM helix numbered. (C) The proposed transport mechanism of SWEET proteins. Two Pro residues (such as Pro27 and Pro149, numbered according to AtSWEET13) act as molecular hinges allowing the opening and closing of intra- and extracellular cavities for the translocation of a sugar substrate across a membrane.

There has been very limited investigation into the SWEET family in unicellular eukaryotes, and particularly with regard to the various taxonomic groupings of algae. Algae SWEETs are often referred to as “outliers” to contextualise larger angiosperm clades (Eom et al., 2015; Jia et al., 2017;Hu et al., 2018; Li et al., 2018), although there is some evidence that algae SWEETs form a distinct clade to other eukaryote SWEETs alongside fungi-derived SWEETs (Hu et al., 2016; Jia et al., 2017). The evolutionary history of SWEET genes across the algae and plant lineage is not well understood. Whole genome duplication events are likely responsible for the expansion of the SWEET gene family (16–53 per species) in higher plants (Li et al., 2018). In contrast, dispersed duplication as a result of adaptation to environmental changes is a possible explanation for the low number of homologues in algae (Hu et al., 2018; Li et al., 2018). Characterisation of green algae (Chlorophyta) SWEETs is important to both further our understanding of the ancestral origins of higher plant SWEETs, since Chlorophyta SWEETs are closely related to Clade II plant SWEETs (Li et al., 2018), but also to further our understanding of sugar transport within algae themselves. Chlorophyta microalgae represent viable targets for engineered biorefineries for the generation of fermentable sugars, biodiesel lipids and other valuable chemicals (Usher et al., 2014; Banerjee et al., 2021; Figueroa-Torres et al., 2021).

Compared with higher plants, there is a considerable gap in understanding of sugar transport processes in Chlorophyta algae. Hexose uniporter transporters in Chlorella kessleri can provide low-affinity, bidirectional glucose transport (Caspari et al., 1994) while transgenic expression of one of these transporters (CkHUP1) into Chlamydomonas reinhardtii resulted in improved dark heterotrophic growth on a glucose carbon source and improved the capacity for downstream H₂ production (Doebbe et al., 2007). Beyond this, there has been little characterisation of the native function of these sugar transporters. Some putative chloroplast-localised transporters, such as a hexose transporter and a triose-phosphate translocator, have been implicated in algal carbohydrate metabolism (Johnson and Alric, 2013; Marchand et al., 2018), while intracellular maltose transport by C. reinhardtii MEX1 is important for starch metabolism (Findinier et al., 2017). Furthermore, there is very limited investigation into algal plasma membrane-localised sugar transporters with the capacity for sugar efflux or influx. One study indicated a possible hexose transporter of the green alga Micractinium conductrix that could efflux maltose or glucose to support endosymbiotic growth of Paramecium bursaria, although the gene encoding this hypothesised transporter was not identified (Arriola et al., 2018). Genomic and proteomic studies have begun to indicate the breadth of transporters present in chlorophytes (Blaby-Haas and Merchant, 2019), although experimental validation and further bioinformatic study of sugar transporters including SWEETs is still lacking at this stage.

This present study has investigated the broad presence of the SWEET transporter family across several green algae, identifying a potential “Chlorophyta clade” of SWEETs external to the known clade structure in higher plants, alongside a comparison with some selected fungal SWEET proteins. Using a combination of phylogenetic and structural analysis, this study has begun to fill the knowledge gap for SWEET proteins in green algae. We examine whether proteins from the same genus or species cluster together or whether there is evidence of similarity of the basis of putative substrate characteristics but independent of taxonomy. Furthermore, we discuss the endogenous functions of these SWEET proteins but also consider how these proteins could be manipulated and exploited for microalgae biotechnological applications.

Materials and methods

Identification of chlorophyta SWEETs

The four previously identified but not experimentally characterised C. reinhardtii SWEET gene sequences were obtained from C. reinhardtii CC-503 MT+ genome (Merchant et al., 2007) assembly and annotation data v.5.6, which was obtained from the DOE Joint Genome Institute (JGI) Phytozome v.13 genome data portal (Goodstein et al., 2011). The C. reinhardtii SWEET genome ID numbers were: Cre06.g271800, Cre06.g275000, Cre07.g340700, and Cre10.g421650, named here as CrSWEET1 to 4, respectively. Alongside the C. reinhardtii SWEETs, several previously well-characterised higher plant SWEETs from A. thaliana and O. sativa that represent a cross-section of Clade I to IV SWEETs were used as query sequences in protein homology searches. These included AtSWEET1 (UniProt accession number: Q8L9J7), AtSWEET4 (Q944M5), AtSWEET5 (Q8LBF7), AtSWEET8 (Q8LFH5), AtSWEET13 (Q9FGQ2), OsSWEET2b (Q5N8JI) and OsSWEET11 (Q6YZF3). Using these C. reinhardtii and plant amino acid sequences as references, BLASTp searches were carried out in several databases (UniprotKB/SwissProt; UniprotKB Viridiplantae) using different search engines (under E > 0.001): EBI-BLASTp, NCBI-BLASTp, JGI Phytozome-BLASTp (Camacho et al., 2009; Goodstein et al., 2011). Candidate SWEET sequences from Chlorophyta algae were logged and cross-compared to SWEET sequences present in a database of SWEETs across several Kingdoms (including Viridiplantae), as well as those identified by EBI-HMMER protein homology searches (Madeira et al., 2019).

Chlorophyta SWEET proteins were identified from several available genomes from organisms of the order Chlamydomonadales (including five Chlamydomonas species: C. reinhardtii, Chlamydomonas debaryana, Chlamydomonas eustigma, Chlamydomonas incerta and Chlamydomonas schloesseri and Dunaliella salina, Gonium pectorale, Tetrabaena socialis, and Volvox carteri), Sphaeropleales (Monoraphidium neglectum, Raphidocelis subcapitata, Scenedesmus obliquus, Tetradesmus deserticola, and Tetradesmus obliquus), Chlorellales (Auxenochlorella protothecoides, Chlorella sorokiniana, Chlorella variabilis and Helicosporidium sp.), Trebouxiales (Coccomyxa subellipsoidea and Trebouxia sp.), Chloropicales (Chloropicon primus), Chlorodendrales (Tetraselmis striata) and Mammiellales (Bathycoccus prasinos, Micromonas commoda, Ostreococcus lucimarinus and Ostreococcus tauri). The candidate proteins were identified as SWEETs following confirmation of the gene ID/genome location in the JGI PhycoCosm database (Nordberg et al., 2013), the identification of at least one copy of the MtN3/slv Pfam domain (PF03083) and/or the presence of a PQ-loop repeat Pfam domain (PF04193; found predominantly in SemiSWEET sequences) using a Pfam annotation search (Mistry et al., 2020), and the presence of at least three TM helices (forming one THB) using TOPCONS membrane topology prediction software (Tsirigos et al., 2015). The protein sequences were also confirmed as being members of the SWEET family by performing multiple sequence alignments using ClustalW against the well-characterised SWEET representatives from A. thaliana described above. Genome details and individual ID numbers or accession numbers for the protein sequences identified and used for subsequent analysis are provided in Supplementary Table 1. For many of these genomes, the transcript details have been supported by transcriptome data from a large number of RNA-Seq datasets; however, these sequences should be considered as representative Chlorophyta SWEETs and it is likely that the exact number of predicted SWEET proteins from some of these genomes may change during subsequent genome reanalysis due to incorrect annotation or genome assembly errors that can lead to genes and protein sequences being missed or incorrectly assigned.

Identification of outlier and comparative SWEETs

For the phylogenetic analysis of Chlorophyta SWEETs, several fungal and plant SWEETs were used as points of comparison. Unicellular fungal SWEETs allow for comparison of SWEETs from single celled organisms that are not derived from unicellular algae. The chosen fungal SWEETs were six SWEET proteins from Allomyces macrogynus (Jia et al., 2017), and nine SWEET proteins from Neocallimastix californiae (Podolsky et al., 2021). Lower plant SWEETs from taxa that bridge the evolutionary gap between Chlorophyta and Streptophyta were selected as outliers: eight SWEET proteins from Amborella trichopoda, seven from Selaginella moellendorffii, two from Marchantia polymorpha and one each from Physcomitrella patens and Sphagnum fallax.

Phylogenetic analysis

Phylogenetic analysis was carried out using the CIPRES Science Gateway Toolkit (Miller et al., 2010). First, a multiple sequence alignment of the Chlorophyta and fungal SWEETs and the Streptophyta outliers was generated using ClustalW (using default settings). Random Accelerated Maximum Likelihood (RAxML) Black Box was then used to generate a phylogenetic tree (Kozlov et al., 2019), using recommended parameters (Stamatakis, 2014), which included a condition to let RAxML halt bootstrapping following the end of a set run time (30 min; Pattengale et al., 2009), which generated 650 bootstrap replications. Bootstrap percentage values for each node are shown on the tree. Other parameters included use of the BLOSUM62 substitution matrix, the Gonnet 250 matrix for pairwise alignment and no empirical base frequencies. RAxML Black Box automatically generated the best-likelihood tree, based on maximum likelihood, which was evaluated and optimised under GAMMA (0.1 log likelihood). Use of the BLOSUM62 substitution matrix was evaluated using ModelTest-NG (Darriba et al., 2020) and was found to have a high relative scoring log-likelihood value, which was equivalent or better than other substitution models that were also evaluated (lnL scores of −47784.8 for BLOSUM62 compared to −47690.9 for VT, −47737.5 for PMB, −47813.9 for WAG and −47936.9 for LG, according to the Akaike information criterion). Furthermore, it was found that use of other substitution models did not alter the tree structure. For example, comparison of the BLOSUM62 and the LG substitution matrix model found that the structure of both sets of trees was virtually identical, with minor differences in bootstrap values at the nodes (Supplementary Figure 1). In addition, the BLOSUM62 model was preferred to allow consistency with its use in pairwise-similarity CLANS analysis (see below). FigTree¹ was used to generate tree images.

The Chlorophyta-fungal clade structure was determined by the maximum likelihood tree bootstrap values, then distinguished further by use of pairwise similarity Cluster Analysis of Sequences (CLANS) using CLANS Jar from the MPI Bioinformatics Toolkit (Frickey and Lupas, 2004; Gabler et al., 2020), which allowed determination of specific clusters within the Chlorophyta-fungal clade. The “attractive force” between SWEET proteins using E-values from the BLAST high-scoring pairs was calculated, with the lower the E-value demonstrating the greater the attractive force. Clustering of sequences was calculated by normalising “attractive force” to p-values between 0 and 1 (with 1 being a strong cluster and 0 being a weak cluster). The p-value was set to 1 × 10⁻⁵ and the analysis was run for 50,000 iterations, as previously found to be appropriate for phylogenetic validation (Ibuot et al., 2020). CLANS displays the attractive force visually in the Fruchterman-Reingold graph, whereby the stronger attractive forces (the higher normalised p-value, or lower E-value) between specific SWEET proteins are denoted by a darker connective line in the two-dimensional network representation.

Subcellular localisation prediction

Subcellular localisation prediction was carried out on all Chlorophyta SWEETs using DeepLoc, MULocDeep and PProwler software (Bodén and Hawkins, 2005; Almagro Armenteros et al., 2017; Jiang et al., 2021). These tools were chosen in part due to their ability to accurately be able to confirm the localisation of the A. thaliana SWEETs for which membrane localisation has been experimentally determined. A consensus prediction of Chlorophyta SWEET localisation based on the outputs of the three approaches was determined. Predictions with a confidence score > 30% were recorded unless there was no high confidence prediction, in which case the prediction with the highest score was recorded.

Protein structure modelling and analysis

Protein structure predictions of representative SWEET proteins from Chlorophyta Clusters 1 to 5 were simulated using Phyre-2 homology modelling software (Kelley et al., 2015). In each case Phyre-2 gave the prediction with the highest likelihood (100% confidence in all models; combination of the highest sequence identity and sequence coverage), which was based on the previously solved crystal structure of AtSWEET13 with a substrate analogue, 2′-deoxycytidine 5′-monophosphate (DCM) ligand (PDB id: 5XPD; DOI: 10.2210/pdb5XPD/pdb; Han et al., 2017). The defined protein structure of AtSWEET13 was visualised (Figure 1B) using SWISS-MODEL protein homology software (Waterhouse et al., 2018).

Characterisation of the TM helices and their location in the structure was carried out using TOPCONS (Tsirigos et al., 2015). Conservation across protein sequences was first visualised by Constraint Based Alignment Tool (COBALT; Papadopoulos and Agarwala, 2007) using a frequency-based difference between sequences (BLAST E-value 0.003, Gap penalties −11, −1, End gap penalties −5, −1). Conserved amino acids were visualised using Easy Sequencing in PostScript (ESPript; Robert and Gouet, 2014), with standard default parameters. To demonstrate substrate binding for candidate Chlorophyta SWEET proteins, the Phyre-2 models were run through UCSF Chimera’s in-built docking tool, AutoDock Vina (Trott and Olson, 2010; Gaillard, 2018; Nguyen et al., 2020). Using a PDB file generated from the 5XPD AtSWEET13 template, the substrate-binding pocket was estimated. AutoDock Vina predicted the ligand-binding site based on the orientation positioning of the ligand following simulated binding. Models of DCM ligands bound in representative Cluster 1 to 5 SWEET proteins were determined to be the most accurate if they possessed the most negative ligand binding score, and root-mean-square-deviation values closest to zero.

Results and discussion

Phylogenetic relationship of SWEETs in green algae

To investigate the phylogenetic and structural characteristics of SWEET proteins from Chlorophyta, protein sequences were obtained from several databases using selected SWEET protein query sequences and filtered to confirm their identification as SWEETs, including the presence of at least one and mostly two copies of the MtN3/slv Pfam domain (PF03083) in all proteins. This screen identified 70 SWEET proteins of Chlorophyta origin from 28 representative algal strains (Supplementary Table 1). Forty-nine of the proteins were from 15 different Chlorophyceae (order Chlamydomonadales and Sphaeropleales) strains, including five Chlamydomonas species, two Tetradesmus species and algae such as V. carteri and D. salina. Thirteen proteins were identified from seven Trebouxiophyceae (order Chlorellales and Trebouxiales) strains, including three Chlorella strains, four proteins were from four different Mamiellophyceae (order Mamiellales) strains and four proteins were from Chloropicophyceae (order Chloropicales) and Chlorodendrophyceae (order Chlorodendrales) strains (Table 1). Two of the proteins (ApSWEET2 from A. protothecoides and MnSWEET2 from M. neglectum) contained just one THB suggesting that these could be SemiSWEET-like proteins although further experimental validation of the sequences will be needed to confirm this is correct or whether they are partial length sequences. The rest of the sequences show characteristics of full-length SWEET proteins. Eleven of the Chlorophyta genomes appeared to possess only one SWEET sequence while the other algal genomes examined encoded between two and six SWEET proteins. This contrasts with the large numbers (usually >20) of SWEET protein isoforms typically present in a higher plant genome. Vascular plants have more complex genomes, caused by occurrence of whole-genome duplication events, leading to the expansion and diversification of SWEET families in higher plants. This allows plants to meet the requirement for more complex sugar transport processes within different cell and tissue types of these large multicellular organisms (Li et al., 2018). In contrast, unicellular algae typically have smaller genome sizes and do not require multiple transporters to mediate sugar translocation across many cell layers, and hence fewer SWEET homologues are observed per species.

TABLE 1

Table 1. Numbers and distribution of algae (Chlorophyta) and selected fungi (Blastocladiomycota and Chytridiomycota) SWEET proteins.

Before constructing a phylogenetic tree, outlier sequences were selected to aid understanding of the evolutionary relationships between the Chlorophyta SWEET proteins and the Streptophyta SWEETs. Previous analysis of SWEET proteins from plant lineages has identified SWEETs from several Bryophyte, Lycophyte, and basal Streptophyta that link to the Chlorophyta SWEETs, but are excluded from the four higher plant SWEET clades (Jia et al., 2017; Li et al., 2018). Representatives of these “linking” SWEETs were therefore selected as additional outliers: eight SWEET protein sequences from the plant A. trichopoda that is the most basal lineage of angiosperms, seven from the Lycophyte S. moellendorffii, two from the non-vascular plant M. polymorpha and one each from the Bryophytes P. patens and S. fallax. One rice and seven A. thaliana SWEET sequences were included in the tree, as Streptophyta representatives, and final outliers to the Chlorophyta-fungal cluster. For comparison, SWEETs from two fungi species were additionally used in order to understand the conservation of SWEETs across these two classes of distinct eukaryotes and help analyse the function of these transporters within less complex organisms. Nine SWEET sequences were taken from N. californiae that is ancestral to Saccharomyces cerevisiae, and six from the model fungus A. macrogynus (Jia et al., 2017; Podolsky et al., 2021). In total 112 SWEETs from selected higher and lower land plants, green algae and anaerobic fungi were used for the phylogenetic tree construction (Figure 2).

FIGURE 2

Figure 2. (A) Phylogenetic tree of selected Chlorophyta SWEET protein sequences and outliers from representative plants and fungi. The outlined region denotes the Chlorophyta-fungal clade although four fungal-derived SWEETs (NcSWEET2, NcSWEET5, AmaSWEET1, AmaSWEET2) sit outside the clade. Symbols indicate the taxonomic orders of the species where the protein sequence is derived. A consensus maximum likelihood tree following bootstrap replications is shown. The Chlorophyta-fungal clade can be further divided into two sub-clades, determined on the basis of bootstrap percentage values shown at each node, and denoted by the shaded sections of the tree. Algae-fungi SWEETs that form distinct clusters, determined on the basis of CLANS analysis, are highlighted and labelled. The branch length scale bar indicates the evolutionary distance of amino acid substitutions per site. (B) A two-dimensional cluster analysis of the SWEET sequences used for the phylogenetic tree was performed using CLANS. Each symbol represents one of the SWEET proteins and is coloured on the basis of the phylogenetic groups shown in Panel A. Grey lines represent protein connections with reciprocal BLAST hits at p < 10⁻⁵. A version of Panel B showing the names of the un-clustered proteins is shown in Supplementary Figure 2. The full list of SWEET sequences and their identification numbers is shown in Supplementary Table 1.

A clear phylogenetic distinction was observed between the Chlorophyta SWEETs and the Streptophyta SWEETs including those from Bryophytes and Lycophytes (Figure 2A). In contrast, most of the fungi SWEETs examined, except for two N. californiae and two A. macrogynus SWEETs (NcSWEET2 and 5; AmaSWEET1 and 2), clustered with all the Chlorophyta SWEETs, forming an Chlorophyta-fungi clade (Figure 2A). The conservation of fungi-derived SWEETs within this clade is consistent with previous observations (Jia et al., 2017). Bootstrap values indicate that the Chlorophyta-fungal clade can be further divided into two sub-clades (Figure 2A; Supplementary Figure 1). Phenetic sequence clustering analysis by CLANS was used to provide further distinction of the tree structure and to support the classification of clusters of specific proteins within the clade. CLANS determined the “attractive force” between SWEET protein sequences by performing an all-against-all pairwise similarity search (Frickey and Lupas, 2004). CLANS confirmed the separation of the four outlier fungal SWEETs, which were distinct from plant SWEETs and the rest of the Chlorophyta-fungal clade (Figure 2B). CLANS also supported the grouping of phylogenetically similar SWEET proteins into five clusters within the clade named Cluster 1 to 5 (Figure 2). Furthermore, this CLANS method was used to identify individual protein sequences that did not show strong clustering due to a low attractive force value with members of a cluster, and therefore were not included in one of the five clusters (Supplementary Figure 2). For example, sequence MnSWEET2 was very distant from the 10 Cluster 2 proteins in the CLANS network and so was not included in Cluster 2.

Cluster 1 and 2 are solely composed of proteins derived from the Chlamydomonadales order, whereas Cluster 3 includes both Chlamydomonadales and Sphaeropleales representatives. Clusters 4 and 5 specifically distinguish the fungi within the clade, with N. californiae SWEETs in Cluster 4 and A. macrogynus SWEETs in Cluster 5 (Figure 2). All the remaining Sphaeropleales SWEETs as well as all of the SWEET sequences derived from the Trebouxiophyceae, Chloropicophyceae, Chlorodendrophyceae and Mamiellophyceae classes of algae could not be grouped within defined clusters supported by CLANS analysis, and so were defined as “un-clustered.” While this phylogenetic analysis indicates that many of the SWEET sequences have clustered by higher order taxonomy, there is also a clear separation of several SWEETs from the same species into different clusters. For example, SWEETs from C. reinhardtii, G. pectorale and V. carteri are present in Clusters 1, 2, and 3, indicating potential functional and structural differences between the proteins, as is the case for the higher plants SWEETs. However, since there is currently no experimental characterisation of Chlorophyta SWEETs, confirmation of functional and structural differences must be initially derived from bioinformatics analyses.

Subcellular localisation of green algae SWEETs

Subcellular localisation prediction by a combination of algorithms was utilised to further characterise the Chlorophyta SWEETs and discern if there was commonality between proteins in the same cluster. DeepLoc and MULocDeep algorithms were used to indicate localisation to specific plasma membrane or organelle membranes by the presence of a signal peptide (Almagro Armenteros et al., 2017; Jiang et al., 2021). In contrast, PProwler, known for its high accuracy and sensitivity for predicting localisation in green algae (Bodén and Hawkins, 2005), complemented the predictions with indications of localisation to secretory pathway membranes (including plasma membrane, tonoplast, endoplasmic reticulum or Golgi), the mitochondrion, chloroplast or “other” (including nucleus and cytosol). To ensure an accurate estimation of localisation, a prediction was made based on the consensus of the three algorithms (Table 2). In the cases where there was significant likelihood (30%–50% score) of localisation to another membrane, or in cases where one algorithm clearly predicted localisation to another membrane, a secondary or tertiary prediction was also recorded.

TABLE 2

Table 2. Subcellular localisation predictions of Cluster 1 to 3 Chlorophyta SWEETs.

While most of the algal SWEET sequences have plasma membrane as the only significant prediction or the primary prediction, there was variation between SWEET homologues from the same species and between SWEETs from different clusters, potentially indicative of differences in protein function. Predicted localisation to the plasma membrane was conserved across all Cluster 1 SWEETs, except for CinSWEET1, which is moderately likely to localise to the mitochondrial membrane according to PProwler, or HlSWEET, which may localise to the vacuolar membrane (Table 2). Although MULocDeep and DeepLoc algorithms shared consensus over plasma membrane localisation predictions for eight out of 10 Cluster 2 SWEETs, PProwler indicated a significant likelihood of localisation to “other” membranes (>30% confidence score) in three Cluster 2 SWEETs (CrSWEET1, CschSWEET2A, and CschSWEET2B; Table 2). As such, consensus predictions indicate that half of Cluster 2 SWEETs are likely to localise to the plasma membrane, while the others have a low likelihood of localisation to the vacuole or “other” membranes (Table 2). Predictions for Cluster 3 SWEETs were less confident, with different algorithms giving secondary or tertiary predictions to the Golgi, vacuole or “other” membranes. Notably, in contrast to Clusters 1 and 2, algorithms offered consensus over vacuole/lysosomal membrane localisation for 5 out of 14 Cluster 3 SWEETs (VcSWEET1, CinSWEET4A, CinSWEET4B, CschSWEET1A, and CschSWEET1B) and confidently predicted plasma membrane localisation in only two of the Cluster 3 SWEETs (ToSWEET1 and SoSWEET1). This suggests that some SWEETs may be responsible for sugar transport across the membranes of a vacuole/lysosome-related organelle rather than the cell membrane. There is limited knowledge of vacuolar transport processes in green algae particularly relating to sugar storage or release. Many chlorophytes possess both a contractile vacuole, with potential roles in osmoregulation and protein degradation (Becker, 2007), and lysosome-like acidocalcisomes that are involved in metal storage and polyphosphate regulation (Goodenough et al., 2019). Although there is some indication of metabolite transport pathways within these organelles, there is currently no clear evidence for the need of a sugar transporter.

DeepLoc has a high sensitivity for plasma membrane localisation but low sensitivity towards vacuole membrane predictions (Almagro Armenteros et al., 2017; Savojardo et al., 2018). To validate the accuracy of these predictions, notably for Cluster 3 proteins where predictions were less certain, DeepLoc and MULocDeep predictions were performed on A. thaliana SWEETs (AtSWEET1–17) whose precise intracellular localisation has been experimentally validated (Table 3). In higher plants, Clade I to III SWEETs are broadly characterised by their plasma membrane localisation, whereas Clade IV SWEETs are distinguished by vacuolar localisation (Chen et al., 2010; Eom et al., 2015). In nearly all cases the DeepLoc and MULocDeep predictions of the Clade I – III (AtSWEET1–15) and Clade IV (AtSWEET16–17) SWEETs aligned with the experimentally validated localisation, although for vacuolar localised AtSWEET2 (Chen et al., 2015), the predictions were less confident: a plasma membrane score of 49% (both DeepLoc and MULocDeep) and a vacuole membrane score of 46% (DeepLoc) or 24% (MULocDeep). This indicates that predicting localisation for some vacuolar membrane SWEET proteins is challenging and supports the idea that some of the algal Cluster 3 SWEETs may indeed localise to a vacuole membrane, but of course experimental analysis is needed to fully validate these predictions.

TABLE 3

Table 3. Subcellular localisation predictions and known location comparisons of Clade I to IV Arabidopsis thaliana SWEETs.

Amino acid sequence conservation across green algae and fungi SWEET proteins

Multiple sequence alignments were carried out for the Cluster 1 to 5 sequences to visualise conservation between sequences via a “frequency-based difference” approach. Amino acids were scored based on the frequency of their representation in each column with the infrequent (low conservation) amino acids highlighted with darker shading. Comparison of the three algal clusters (Cluster 1, 2, and 3) found that while there are regions of high conservation within each cluster, there are noticeable differences between clusters (Figure 3). Cluster 1 SWEETs, which includes CrSWEET3 and CrSWEET4, all have an extensive, non-conserved C-terminal region (Figure 3A), which was absent from Cluster 2 and 3 proteins (Figures 3B,C). The C-terminal tail regions of SWEETs are predicted to reside on the cytosolic side of the membrane and may provide a docking platform for protein interactions. This allows oligomerisation to form homo- or hetero-dimers, or interactions with other proteins, such as kinases for phosphorylation to regulate sugar transport across the membrane (Eom et al., 2015; Han et al., 2017; Chen et al., 2022). The Cluster 2 proteins (which include CrSWEET1) and Cluster 3 proteins (which include CrSWEET2) show higher amino acid conservation than Cluster 1 proteins (Figures 3A–C), in part due to the lack of a long variable C-terminal tail, but also likely due to several proteins in this cluster all deriving from the Chlamydomonas genus and being orthologues of one another. The two fungal clusters (Cluster 4 and 5) showed very high sequence conservation, due to a small sample size within the Cluster and because all sequences were derived from the same species (Supplementary Figure 3).

FIGURE 3

Figure 3. (A–C) COBALT multiple sequence alignment of Cluster 1 (A), Cluster 2 (B), and Cluster 3 (C) algae SWEET protein sequences. (D) Comparison of AtSWEET13 with CrSWEET1 to 4. Amino acid conservation was scored based on frequency within each amino acid position (column). Grey shading indicates and identical residue at that position and darker shades of red indicate differences from residues in other rows in the alignment at that position.

Comparison of the C. reinhardtii SWEETs (CrSWEET1 to 4) with the well-characterised AtSWEET13 protein (Figure 3D) indicates discrete regions of extremely high conservation between the sequences. In particular, there are 21 residues that are identical between the four C. reinhardtii SWEETs and AtSWEET13 and other residues that are similar (Figure 4). This may indicate that some or all of these residues are of critical importance to SWEETs, and therefore functional characterisation at an amino acid level is necessary, which requires understanding of the tertiary structure of the Chlorophyta SWEETs.

FIGURE 4

Figure 4. Multiple amino acid sequence alignment of AtSWEET13 compared to CrSWEET1 to 4. Amino acids that are identical or similar are in red shading or red font, respectively. Critical residues in AtSWEET13 that are required for substrate selectivity (filled circles) and transport mechanism, either as extrafacial hinge residues (open circles) or residues required for conformational change (closed triangles) are indicated with a symbol under the sequence. The position of the TM helices for AtSWEET13 is indicated and the numbering is based on the AtSWEET13 sequence.

Structural simulation of green algae and fungi SWEETs

Understanding the structure or structural differences of algae SWEETs is critical to inform predictions of protein function and mechanism of transport. A template-based homology approach (Phyre-2) was applied to representative SWEET sequences from Clusters 1 to 5, with the assumption that protein structure prediction from solved SWEET-structure templates would give more reliable simulations. Crystal structures have been solved for the angiosperm SWEETs OsSWEET2b and AtSWEET13 (Tao et al., 2015; Han et al., 2017) and bacterial SemiSWEETs, including LbSemiSWEET, EcSemiSWEET, and TySemiSWEET (Wang et al., 2014; Xu et al., 2014; Lee et al., 2015). AtSWEET13 was chosen as the template for homology modelling for two reasons: SemiSWEETs possess only 3 TM helices (only one THB), thus provide both lower confidence alignments and alignments with less sequence coverage; and OsSWEET2b crystal structure was characterised as a homo-3-mer, thus models displayed lower sequence alignment compared to the monomeric structure of AtSWEET13. The modelled structures of the algal CrSWEET3 and 4 (Cluster 1), CrSWEET1 (Cluster 2) and CrSWEET2 (Cluster 3) proteins, the fungal NcSWEET1 (Cluster 4) and AmaSWEET3 (Cluster 5) proteins, alongside the AtSWEET13 template, are shown in Figure 5A. TOPCONS was used to estimate the residue positions of the TM helices of each protein (Figure 5B). The structural simulations show the presence of seven TM domains for each algal and fungal SWEET protein, which is characteristic of the SWEET family (Jia et al., 2017). Furthermore, each of the proteins appeared to emulate the THB structure of AtSWEET13 (Figure 1); namely THB1 and THB2 arranged in a [1-3-2] and [5-7-6] formation, respectively, and linked by a central TM4 helix (Figure 5).

FIGURE 5

Figure 5. (A) Structure models of representative Cluster 1 to 5 algae and fungi SWEETs compared to the known protein structure of AtSWEET13 (PDB ID: 5XPD) that was used as the template for modelling. Ribbon structures of Cluster 1 (CrSWEET3 and CrSWEET4), Cluster 2 (CrSWEET1), Cluster 3 (CrSWEET2), Cluster 4 (NcSWEET1), Cluster 5 (AmaSWEET3) and template (AtSWEET13) SWEET proteins viewed from within the membrane (top image) and a top-down view from the extrafacial side (bottom image). TM helices are denoted by different colours. (B) Predicted location determined by TOPCONS for the seven TM helices of each SWEET protein sequence. Numbers refer to amino acid residues.

In all cases, Phyre-2 was able to simulate SWEET structure with 100% confidence; however, the SWEETs varied with regard to alignment and homology to the template. CrSWEET1 and CrSWEET2 shared good alignment with the template: 22% identity, 94% coverage and 28% identity, 88% coverage, respectively. In contrast, both CrSWEET3 and CrSWEET4 displayed lower homology and alignment with the template: 21% identity, 61% coverage and 17% identity, 71% coverage, respectively. This could be due to the extensive non-conserved C-terminal tail region, most prominent in CrSWEET3 and 4. However, the fungal AmaSWEET3 (Cluster 5) demonstrated high homology with the template (21% identity, 92% coverage) despite also possessing an extensive non-conserved C-terminal tail (Figure 5A). For homology-based structural predictions, these values represent an accurate alignment. TOPCONS analysis was also able to identify homologous structures to either THB1 or THB2 in each assessed SWEET. Particularly, the THB2 region of CrSWEET1, CrSWEET2, NcSWEET1 and AmaSWEET3 all displayed homology to the single-THB, bacterial SemiSWEET: LbSemiSWEET (PDB: 4QNC). Similarly, the THB2 region of AtSWEET13 displayed homology to the bacterial SemiSWEET TySemiSWEET (PDB: 4rng). Although the process of SWEET evolution is not well characterised, it has been postulated that eukaryotic SWEETs evolved from a duplication and fusion event of individual SemiSWEETs, or a fusion of a bacterial SemiSWEET and archaeal SemiSWEET (Xuan et al., 2013; Hu et al., 2016). Alignment of THB2 in CrSWEET1, CrSWEET2, NcSWEET1 and AmaSWEET3 but not in CrSWEET3 or CrSWEET4, perhaps indicates a divergence of evolution of the latter two proteins.

Prediction of substrate binding and transport

Ten hydrophobic and polar residues in a modified version of AtSWEET13, which was used to generate the crystal structure, are crucial for ligand binding and have been shown to comprise the putative substrate binding pocket. These are Ser20, Leu23 (Val23 in wild type AtSWEET13), Asn54 (Ser54 in wild type AtSWEET13), Trp58, Asn76, Ser142, Met145 (Val145 in wild type AtSWEET13), Asn176 (Ser176 in wild type AtSWEET13), Trp180 and Asn196 (Han et al., 2017). Many of these residues are conserved in the C. reinhardtii SWEETs within Clusters 1, 2 and 3 (Figure 4), and in Cluster 4 and 5 fungal SWEETs (Supplementary Figure 4). To show how these residues are arranged on the modelled SWEET structures and how they could interact with a substrate, protein-ligand docking simulation was performed. The substrate analogue DCM, which was used to simulate glucose binding within the AtSWEET13 structure (Han et al., 2017), was used for substrate docking simulation in each of the representative Cluster 1 to 5 SWEETs. DCM was predicted to fit within the binding pocket of each SWEET protein although there was variation in the orientation of DCM binding as determined by the highest binding affinity score in each structure simulation (Supplementary Figure 5). By comparing residues in the known binding site and extracellular hinge of AtSWEET13 with the Cluster 1 to 5 SWEETs, it was possible to overlay and identify the critical residues in the representative algal and fungal SWEETs that may be involved in the substrate binding pocket (Figure 6) and extrafacial gate (Figure 7).

FIGURE 6

Figure 6. Predicted substrate binding site residues of CrSWEET1 to 4, NcSWEET1 and AmaSWEET3, compared with AtSWEET13. The DCM substrate surrounded by the binding pocket residues is shown. Residues were determined by alignment with the previously characterised binding site residues for AtSWEET13. The structures are viewed from the intrafacial/cytosolic side through the pore channel.

FIGURE 7

Figure 7. Predicted extrafacial hinge residues of CrSWEET1 to 4, NcSWEET1 and AmaSWEET3, compared with AtSWEET13. The DCM substrate associated with the extrafacial gate residues is shown. Residues were determined by alignment with the previously characterised extrafacial hinge residues for AtSWEET13. The structures are viewed from the lumenal or extracellular side through the pore channel.

With a few notable exceptions, these key residues identified in AtSWEET13 are conserved within the representative algae and fungi SWEET binding pocket region (Figure 6). It has been demonstrated from AtSWEET13 mutation experiments that four binding pocket residues are crucial identifiers for the selection of monosaccharide or disaccharide substrates (Han et al., 2017). Val23, Ser54, Val145, and Ser176 (numbered according to AtSWEET13) determine disaccharide (such as sucrose) selection, while Leu, Asn, Met, and Asn residues in the equivalent positions determine monosaccharide (such as glucose) selection. To simulate glucose transport in the AtSWEET13 crystal structure, four amino acid mutations (V23L, S54N, V145M, S176N) were made to convert the binding pocket from being disaccharide-specific to monosaccharide-specific, resulting in an abolishment of sucrose transport activity but maintenance of glucose transport activity (Han et al., 2017). The Leu, Asn, and Met residues are longer and possess more bulky side chains, which is thought to restrict the size of the binding pocket and prevent larger disaccharide sugar binding (Chen et al., 2010). Leu23 and Met145 do not interact with the ligand directly, but rather stabilise and restrict the cavity size of the binding pocket. In the Cluster 1 to 3 microalgae SWEETs, the equivalent positions are held by a Met residue (e.g.,Met24 in CrSWEET1) and a Tyr residue (e.g., Tyr145 in CrSWEET1; Figure 4), which possess significantly bulkier side chains (Figure 6). This would indicate a further restriction of the size of the binding pocket and the size of the sugar able to be transported. Cluster 4 fungal SWEETs, including NcSWEET1, have a bulkier Ile residue in the Val23 equivalent position (Ile28 in NcSWEET1), but retain the Val residue in the Val145 position (Val150 in NcSWEET1; Figure 6; Supplementary Figure 4). NcSWEET1 has been shown to selectively transport both hexose and pentose sugars, therefore it would appear that the restricted-size binding pocket cavity facilitates the transport of smaller sugar substrates (Podolsky et al., 2021). As such, it is possible that these algae SWEETs as well as the examined fungi SWEETs may only be able to transport small hexose or pentose sugars due to the restricted binding pocket size.

The bulky side chain of Trp residues was thought to stabilise the sugar substrate in the binding pocket (Lee et al., 2015; Han et al., 2017). Comparison of algae SWEET and AtSWEET13 sequences and structures has revealed the conservation of the crucial Trp-Asn pairs (e.g., Trp58-Asn76 and Trp180-Asn196 in AtSWEET13; Trp59-Asn75 and Trp179-Asn195 in CrSWEET1) (Figures 4, 6), indicative of a conserved ligand-binding mechanism. The solved crystal structure of EcSemiSWEET, modelled with monoolein to mimic a sugar substrate, revealed a hydrogen bond between the hydroxyl of the substrate and Asn66 (Asn76 in AtSWEET13). The aromatic group of the adjacent Trp residue, Trp50 (Trp58 in AtSWEET13), stabilised the substrate in the binding pocket (Lee et al., 2015). This process is observed in both DCM and glucose binding by AtSWEET13 and AtSWEET1 (Han et al., 2017; Selvam et al., 2019). The conservation of these critical binding pocket residues across the algae Cluster 1 to 3 SWEETs and the fungal Cluster 5 SWEETs (Supplementary Figure 4), and the conformational similarities between the SWEET simulations, strongly indicates a conserved substrate binding mechanism, despite the potential variation in the orientation of DCM in the binding pockets of each SWEET (Supplementary Figure 5).

The crystal structure of AtSWEET13, used as a template for the Cluster 1 to 5 SWEET modelling, was solved in the inward-facing (IF) state with a monomeric stoichiometry (Han et al., 2017). By aligning conserved residues known to aid in the substrate translocation from the IF to outward-facing (OF) state, we can predict which residues may play a similar role in the sugar transport pathway in the Cluster 1–5 SWEETs. A tetrad of Pro residues (Pro27, Pro47, Pro150, Pro167; numbered according to OsSWEET2b) are highly conserved across eukaryotic SWEETs, SemiSWEETs and Cluster 1 to 5 SWEETs (Figure 4; Lee et al., 2015; Selvam et al., 2019). Pro27 and Pro150 (equivalent to Pro28 and Pro148 in CrSWEET1) induce a 30° kink in TM1 and TM5, respectively (Figure 1C). This kink in the TM helix is an indication of a molecular hinge responsible for the transition from IF to OF state (Lee et al., 2015; Tao et al., 2015). The remaining Pro tetrad residues, Pro47 and Pro167 (equivalent to Pro48 and Pro167 in CrSWEET1) are found to terminate the TM2 and TM6 helices, respectively. These Pro residues are crucial for substrate transport, such that mutation of any of these residues in AtSWEET1 resulted in a complete loss of sugar transport activity (Tao et al., 2015). Adjacent residues are thought to stabilise the substrate while the protein is undergoing conformational change. Glu residues that complete the PQ loop repeat in SemiSWEETs form cross-protomer linkages, while in eukaryote SWEETs, covalent linkages with the TM4 helix replace the role Glu residues (Xu et al., 2014; Lee et al., 2015; Tao et al., 2015).

Clusters of residues in AtSWEET13 that localise to the protein extremities comprise the intra- and extrafacial gates (Han et al., 2017). As the AtSWEET13 structure was solved in the IF state, we can observe residues interacting in the extrafacial gate, including Tyr61, Tyr183, Asp189, Lys65 and Val192. Examination of the equivalent extrafacial hinge residues in the representative Cluster 1 to 5 SWEETs found that the Tyr and Asp residues are conserved across all these microalgae and fungi SWEETs, while the Lys residue is conserved in just two of the six structures (CrSWEET1 and AmaSWEET3), and there is no equivalent Val residue in any of the structures (Figure 7). Molecular dynamic analysis for OsSWEET2b revealed both Tyr61 and Asp189 are crucial for substrate translocation through the protein (Selvam et al., 2019). Asp189 is thought to hydrogen-bond to both Tyr61 and Lys65, stabilising the IF conformation, and substitution of all three residues resulted in a significant loss of transport activity (Han et al., 2017). Both Tyr residues use polar, Van der Waals and hydrophobic interactions with the substrate as it moves through and exits the transport pathway in AtSWEET1, AtSWEET13, and OsSWEET2b (Han et al., 2017; Selvam et al., 2019). The conservation of these residues across the Cluster 1 to 5 SWEETs indicates that these algal and fungal SWEETs will share the alternating access model of sugar transport (Figure 1C) that has been described for many eukaryotic and prokaryotic SWEET representatives (Latorraca et al., 2017; Selvam et al., 2019). Overall, a smaller, more restricted binding pocket cavity than typically observed in higher plant SWEETs allows us to predict that these Chlorophyta SWEETs function as small hexose or pentose transporters, despite showing broad conservation with the substrate binding and translocation mechanism of higher plant SWEETs.

Potential functions and applications of green algae SWEETs

Chlorophyta algae, such as C. reinhardtii are mainly found in aquatic environments where access to carbon is limited. These algae have evolved methods of carbon concentrating and fixation of inorganic carbon to acclimatise to nutrient-limited environments (Kono et al., 2020). The CrSWEET1 protein was previously identified in low CO₂ inducible transcriptomic screens (and has also been named as LCI36) where it was found to be moderately induced under low CO₂ conditions (Miura et al., 2004; Brueggeman et al., 2012). While an increase in glucose transport is unlikely to be directly involved in the carbon concentrating mechanism, the need to remobilise sugars within the cell may be a consequence of CO₂ deprivation and inhibited photosynthesis. In A. thaliana, sucrose-transporting SWEET proteins contribute to adaptation to osmotic stress (Durand et al., 2018), although whether algae SWEETs are involved in responses to conditions such as osmotic stress remains to be determined. There is also some evidence that algal sugar transporters including SWEETs are routes for transferring sugars to a pathogenic or symbiotic host, as is the case with some higher plant SWEETs (Gupta et al., 2021). For example, sugar transporters are thought to play a key role in the symbiotic feeding between the alga Micractinium conductrix and a Paramecium bursaria host, and although SWEET genes have been identified in this alga, they have not yet been confirmed as the sugar efflux pathway for the host (Arriola et al., 2018). Microalgae SWEETs have also been indicated to facilitate sugar scavenging from a marine algal host (the dinoflagellate Scrippsiella acuminata) by the parasite Amoebophyra spp. (Decelle et al., 2021). Although no experimental validation has been carried out in this study, expression of CrSWEET1 to 4 was observed in studies investigating transcriptomic response to environmental stress (Bell et al., 2016; Wase et al., 2019). This raises further questions of the endogenous function of these SWEETs and is the subject of ongoing research.

SWEET proteins also have potential biotechnological applications. Podolsky et al. (2021) postulated how SWEET proteins can be utilised to enhance fermentation of digested lignocellulosic biomass. Consumption of xylose represents a bottleneck for many microbial cell factories as there is a shortage of yeast strains that can metabolise xylose; glucose uptake will often competitively inhibit xylose uptake. Specific SWEET proteins from N. californiae and A. thaliana (NcSWEET1 and AtSWEET7) have demonstrated the capacity to non-discriminatively co-transport glucose and xylose, allowing for efficient fermentation of digested lignocellulosic biomass using genetically modified yeast expressing the protein (Kuanyshev et al., 2021; Podolsky et al., 2021). The present study has demonstrated how algal SWEET proteins may have a binding site more suited to the selection and transport of pentose sugars (Figure 6) and are closely related phylogenetically to NcSWEET1 (Figure 2). Bioprospecting for pentose- and hexose-utilising SWEETs, such as from algae species, could yield credible candidates for engineering novel fermentative strains of yeast. Furthermore, as some plasma membrane-localised SWEET proteins have demonstrated the capacity for glucose efflux (Chen et al., 2010; Shammai et al., 2018), particularly when overexpressed, direct engineering of algal SWEETs may yield new outlooks for novel bio-refineries. For example, over-expression of algal SWEET genes may allow the controlled efflux of glucose for use as a bioethanol feedstock, without the need for harsh, energy-intensive or expensive methods of cell pre-treatment and hydrolysis to release the sugars.

There are key questions that remain outstanding following this in silico work which require subsequent experimental validation. These include confirmation of sub-cellular localisation for specific SWEETs and experimental determination of their substrate, as well as the kinetics and mode of regulation of sugar transport. Additionally, to date, significant investigation has focused on the diverse functional role of SWEET proteins across plant lineages, including their involvement in sugar partitioning, seed development or symbiotic relationships (Gao et al., 2018; Jeena et al., 2019; Gupta et al., 2021; Xue et al., 2022). This has greatly improved our understanding of carbon flux and transport in plants; however, the role of sugar transporters in microalgae is much less well understood. The present study should lead to more investigation into the functional role of SWEET sugar transporters in carbon partitioning within individual algal cells.

Conclusion

SWEET proteins have been identified in all kingdoms of life, including in different classes of algae (Jia et al., 2017), but to our knowledge, this is the first study to perform a focused bioinformatics investigation of the SWEET protein family across the Chlorophyta taxon of algae and carry out an initial phylogenetic and structural characterisation of representative sequences. Sub-cellular localisation predictions showed possible diversity in functionality between SWEETs of different clusters. Although we have shown that Chlorophyta SWEETs, alongside selected fungal SWEETs are phylogenetically distinct from Streptophyta SWEETs, there is phylogenetic diversity within the Chlorophyta-fungal clade. Structural modelling of selected algal and fungal SWEETs, distinguished from five clusters display clear conservation of most of the key amino acid residues that are known to determine substrate specificity and transport mechanisms in higher plant and bacterial SWEETs. However, there is some divergence of binding site residues due to different side chain characteristics that would be predicted to give rise to differences in the sugar transport characteristics of distinct algal SWEETs, and differences in other structural features such as length of the C-terminal tail. Therefore we have shown that a degree of structure–function variation exists between each cluster, thus demonstrating potential functional diversity that correlates with the Chlorophyta-fungal SWEET evolutionary model.

Specifically, Cluster 1 proteins, which are composed exclusively of Chlamydomonadales-derived SWEETs, and predicted to be nearly all plasma membrane localised, have more substantial differences in THB structure with poor homology to SemiSWEET THB1 indicating possible evolutionary divergence. These SWEETs typically have a longer cytosolic C-terminal tail indicating different regulation modes including oligomerisation, have conserved Trp-Asn pairs, Pro tetrad, Tyr and Asp extrafacial gate residues, and show the presence of Met and Tyr binding pocket residues, indicating small sugar substrate selection.

Cluster 2 SWEET proteins have similar characteristics to those of Cluster 1 except that more members have possible vacuolar localisation and therefore some different cellular function. They mostly do not have a long C-terminal tail, but the THB1 domain does have stronger homology to SemiSWEET THB. Equivalent characteristics of binding pocket residues to those of Cluster 1 proteins also indicate the ability to bind and transport small sugars.

Cluster 3 SWEETs have a broader taxonomic composition as they include proteins derived from Chlamydomonadales- and Sphaeropleales species. Localisation to the vacuole is likely in five of the 14 SWEETs and only two of the Cluster 3 SWEETs have confident PM-localisation. As for the Cluster 2 SWEETs, they lack a long C-terminal domain and there is greater mismatching of residues across the proteins, particularly for the Sphaeropleales-derived SWEETs, potentially indicating some functional variation that is yet to be evaluated. Extrafacial gate and binding pocket residues are conserved in the same manner as for the Cluster 1 and 2 SWEETs, showing that residues that are seen as deterministic for substrate selection (whether monosaccharide or disaccharide sugars) were conserved among Chlorophyta SWEETs, although divergent from higher plant SWEETs.

Cluster 4 SWEETs derived from N. californiae fungi also consistently lack a long C-terminal tail, and also have homology to SemiSWEET THB structure. There is some variation in the Pro tetrad that makes up the extrafacial gate region, and the presence of an Ile residue in the binding pocket is predicted to allow pentose sugar selectivity.

Finally, Cluster 5 composed of A. macrogynus-derived SWEETs has equivalent conservation of the substrate binding and translocation mechanism residues compared to the algal SWEETs, but the presence or absence of a long C-terminal tail is not conserved across the cluster.

Data availability statement

The datasets presented in this study can be found in online repositories. The names of the repository/repositories and accession number(s) can be found in the article/Supplementary material.

Author contributions

JF, MA, and JP conceived and designed the data analysis and reviewed and edited the manuscript. JF and MA performed data analysis. All authors contributed to the article and approved the submitted version.

Funding

JF was supported by a BBSRC DTP PhD studentship (grant number BB/M011208/1).

Acknowledgments

We thank Thomas Pugh for assistance with some of the preliminary data analysis.

Conflict of interest

The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.

Publisher’s note

All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article, or claim that may be made by its manufacturer, is not guaranteed or endorsed by the publisher.

Supplementary material

The Supplementary material for this article can be found online at: https://www.frontiersin.org/articles/10.3389/fpls.2022.960133/full#supplementary-material

Footnotes

1. ^http://tree.bio.ed.ac.uk/software/figtree

References

Almagro Armenteros, J. J., Sønderby, C. K., Sønderby, S. K., Nielsen, H., and Winther, O. (2017). DeepLoc: prediction of protein subcellular localization using deep learning. Bioinformatics 33, 3387–3395. doi: 10.1093/bioinformatics/btx431

PubMed Abstract | CrossRef Full Text | Google Scholar

Arriola, M. B., Velmurugan, N., Zhang, Y., Plunkett, M. H., Hondzo, H., and Barney, B. M. (2018). Genome sequences of Chlorella sorokiniana UTEX 1602 and Micractinium conductrix SAG 241.80: implications to maltose excretion by a green alga. Plant J. 93, 566–586. doi: 10.1111/tpj.13789

PubMed Abstract | CrossRef Full Text | Google Scholar

Banerjee, S., Ray, A., and Das, D. (2021). Optimization of Chlamydomonas reinhardtii cultivation with simultaneous CO₂ sequestration and biofuels production in a biorefinery framework. Sci. Total Environ. 762:143080. doi: 10.1016/j.scitotenv.2020.143080

PubMed Abstract | CrossRef Full Text | Google Scholar

Becker, B. (2007). “Function and evolution of the vacuolar compartment in Green algae and land plants (Viridiplantae),” in International review of cytology. ed K. W. Jeon (Cambridge, Massachusetts: Academic Press), 1–24.

Google Scholar

Bell, S. A., Shen, C., Brown, A., and Hunt, A. G. (2016). Experimental genome-wide determination of RNA polyadenylation in Chlamydomonas reinhardtii. PLoS One 11:e0146107. doi: 10.1371/journal.pone.0146107

PubMed Abstract | CrossRef Full Text | Google Scholar

Bhaumik, P., and Dhepe, P. L. (2016). “Conversion of biomass into sugars,” in Biomass sugars for non-fuel applications. eds D. Murzin and O. Simakova (London: The Royal Society of Chemistry), 1–53.

Google Scholar

Blaby-Haas, C. E., and Merchant, S. S. (2019). Comparative and functional algal genomics. Annu. Rev. Plant Biol. 70, 605–638. doi: 10.1146/annurev-arplant-050718-095841

PubMed Abstract | CrossRef Full Text | Google Scholar

Bodén, M., and Hawkins, J. (2005). Prediction of subcellular localization using sequence-biased recurrent networks. Bioinformatics 21, 2279–2286. doi: 10.1093/bioinformatics/bti372

PubMed Abstract | CrossRef Full Text | Google Scholar

Brueggeman, A. J., Gangadharaiah, D. S., Cserhati, M. F., Casero, D., Weeks, D. P., and Ladunga, I. (2012). Activation of the carbon concentrating mechanism by CO₂ deprivation coincides with massive transcriptional restructuring in Chlamydomonas reinhardtii. Plant Cell 24, 1860–1875. doi: 10.1105/tpc.111.093435

PubMed Abstract | CrossRef Full Text | Google Scholar

Camacho, C., Coulouris, G., Avagyan, V., Ma, N., Papadopoulos, J., Bealer, K., et al. (2009). BLAST+: architecture and applications. BMC Bioinformatics 10:421. doi: 10.1186/1471-2105-10-421

PubMed Abstract | CrossRef Full Text | Google Scholar

Caspari, T., Will, A., Opekarová, M., Sauer, N., and Tanner, W. (1994). Hexose/H⁺ symporters in lower and higher plants. J. Exp. Biol. 196, 483–491. doi: 10.1242/jeb.196.1.483

PubMed Abstract | CrossRef Full Text | Google Scholar

Chardon, F., Bedu, M., Calenge, F., Klemens, P. A. W., Spinner, L., Clement, G., et al. (2013). Leaf fructose content is controlled by the vacuolar transporter SWEET17 in Arabidopsis. Curr. Biol. 23, 697–702. doi: 10.1016/j.cub.2013.03.021

PubMed Abstract | CrossRef Full Text | Google Scholar

Chen, L.-Q., Hou, B.-H., Lalonde, S., Takanaga, H., Hartung, M. L., Qu, X.-Q., et al. (2010). Sugar transporters for intercellular exchange and nutrition of pathogens. Nature 468, 527–532. doi: 10.1038/nature09606

PubMed Abstract | CrossRef Full Text | Google Scholar

Chen, Q., Hu, T., Li, X., Song, C.-P., Zhu, J.-K., Chen, L., et al. (2022). Phosphorylation of SWEET sucrose transporters regulates plant root:shoot ratio under drought. Nat. Plants 8, 68–77. doi: 10.1038/s41477-021-01040-7

PubMed Abstract | CrossRef Full Text | Google Scholar

Chen, H.-Y., Huh, J.-H., Yu, Y.-C., Ho, L.-H., Chen, L.-Q., Tholl, D., et al. (2015). The Arabidopsis vacuolar sugar transporter SWEET2 limits carbon sequestration from roots and restricts Pythium infection. Plant J. 83, 1046–1058. doi: 10.1111/tpj.12948

PubMed Abstract | CrossRef Full Text | Google Scholar

Chen, L.-Q., Qu, X.-Q., Hou, B.-H., Sosso, D., Osorio, S., Fernie, A. R., et al. (2012). Sucrose efflux mediated by SWEET proteins as a key step for phloem transport. Science 335, 207–211. doi: 10.1126/science.1213351

PubMed Abstract | CrossRef Full Text | Google Scholar

Chen, C.-Y., Zhao, X.-Q., Yen, H.-W., Ho, S.-H., Cheng, C.-L., Lee, D.-J., et al. (2013). Microalgae-based carbohydrates for biofuel production. Biochem. Eng. J. 78, 1–10. doi: 10.1016/j.bej.2013.03.006

CrossRef Full Text | Google Scholar

Darriba, D., Posada, D., Kozlov, A. M., Stamatakis, A., Morel, B., and Flouri, T. (2020). ModelTest-NG: a new and scalable tool for the selection of DNA and protein evolutionary models. Mol. Biol. Evol. 37, 291–294. doi: 10.1093/molbev/msz189

PubMed Abstract | CrossRef Full Text | Google Scholar

Decelle, J., Kayal, E., Bigeard, E., Gallet, B., Bougoure, J., Clode, P., et al. (2021). Intracellular development and impact of a eukaryotic parasite on its zombified microalgal host in the marine plankton. bioRxiv [Preprint]. bioRxiv: 2021.2011.2004.467241.

Google Scholar

Doebbe, A., Rupprecht, J., Beckmann, J., Mussgnug, J. H., Hallmann, A., Hankamer, B., et al. (2007). Functional integration of the HUP1 hexose symporter gene into the genome of C. reinhardtii: impacts on biological H₂ production. J. Biotechnol. 131, 27–33. doi: 10.1016/j.jbiotec.2007.05.017

PubMed Abstract | CrossRef Full Text | Google Scholar

Durand, M., Mainson, D., Porcheron, B., Maurousset, L., Lemoine, R., and Pourtau, N. (2018). Carbon source–sink relationship in Arabidopsis thaliana: the role of sucrose transporters. Planta 247, 587–611. doi: 10.1007/s00425-017-2807-4

PubMed Abstract | CrossRef Full Text | Google Scholar

Eom, J.-S., Chen, L.-Q., Sosso, D., Julius, B. T., Lin, I. W., Qu, X.-Q., et al. (2015). SWEETs, transporters for intracellular and intercellular sugar translocation. Curr. Opin. Plant Biol. 25, 53–62. doi: 10.1016/j.pbi.2015.04.005

PubMed Abstract | CrossRef Full Text | Google Scholar

Figueroa-Torres, G. M., Pittman, J. K., and Theodoropoulos, C. (2021). Optimisation of microalgal cultivation via nutrient-enhanced strategies: the biorefinery paradigm. Biotechnol. Biofuels 14:64. doi: 10.1186/s13068-021-01912-2

PubMed Abstract | CrossRef Full Text | Google Scholar

Figueroa-Torres, G. M., Wan Mahmood, W. M. A., Pittman, J. K., and Theodoropoulos, C. (2020). Microalgal biomass as a biorefinery platform for biobutanol and biodiesel production. Biochem. Eng. J. 153:107396. doi: 10.1016/j.bej.2019.107396

CrossRef Full Text | Google Scholar

Findinier, J., Tunçay, H., Schulz-Raffelt, M., Deschamps, P., Spriet, C., Lacroix, J.-M., et al. (2017). The Chlamydomonas mex1 mutant shows impaired starch mobilization without maltose accumulation. J. Exp. Bot. 68, 5177–5189. doi: 10.1093/jxb/erx343

PubMed Abstract | CrossRef Full Text | Google Scholar

Frickey, T., and Lupas, A. (2004). CLANS: a Java application for visualizing protein families based on pairwise similarity. Bioinformatics 20, 3702–3704. doi: 10.1093/bioinformatics/bth444

PubMed Abstract | CrossRef Full Text | Google Scholar

Gabler, F., Nam, S.-Z., Till, S., Mirdita, M., Steinegger, M., Söding, J., et al. (2020). Protein sequence analysis using the MPI bioinformatics toolkit. Curr. Protoc. Bioinformatics 72:e108. doi: 10.1002/cpbi.108

CrossRef Full Text | Google Scholar

Gaillard, T. (2018). Evaluation of AutoDock and AutoDock Vina on the CASF-2013 benchmark. J. Chem. Inf. Model. 58, 1697–1706. doi: 10.1021/acs.jcim.8b00312

PubMed Abstract | CrossRef Full Text | Google Scholar

Gao, Y., Zhang, C., Han, X., Wang, Z. Y., Ma, L., Yuan, D. P., et al. (2018). Inhibition of OsSWEET11 function in mesophyll cells improves resistance of rice to sheath blight disease. Mol. Plant Pathol. 19, 2149–2161. doi: 10.1111/mpp.12689

PubMed Abstract | CrossRef Full Text | Google Scholar

Goodenough, U., Heiss, A. A., Roth, R., Rusch, J., and Lee, J.-H. (2019). Acidocalcisomes: ultrastructure, biogenesis, and distribution in microbial eukaryotes. Protist 170, 287–313. doi: 10.1016/j.protis.2019.05.001

PubMed Abstract | CrossRef Full Text | Google Scholar

Goodstein, D. M., Shu, S., Howson, R., Neupane, R., Hayes, R. D., Fazo, J., et al. (2011). Phytozome: a comparative platform for green plant genomics. Nucleic Acids Res. 40, D1178–D1186. doi: 10.1093/nar/gkr944

CrossRef Full Text | Google Scholar

Guan, Y.-F., Huang, X.-Y., Zhu, J., Gao, J.-F., Zhang, H.-X., and Yang, Z.-N. (2008). RUPTURED POLLEN GRAIN1, a member of the MtN3/saliva gene family, is crucial for exine pattern formation and cell integrity of microspores in Arabidopsis. Plant Physiol. 147, 852–863. doi: 10.1104/pp.108.118026

PubMed Abstract | CrossRef Full Text | Google Scholar

Guo, W.-J., Nagy, R., Chen, H.-Y., Pfrunder, S., Yu, Y.-C., Santelia, D., et al. (2014). SWEET17, a facilitative transporter, mediates fructose transport across the tonoplast of Arabidopsis roots and leaves. Plant Physiol. 164, 777–789. doi: 10.1104/pp.113.232751

PubMed Abstract | CrossRef Full Text | Google Scholar

Gupta, P. K., Balyan, H. S., and Gautam, T. (2021). SWEET genes and TAL effectors for disease resistance in plants: present status and future prospects. Mol. Plant Pathol. 22, 1014–1026. doi: 10.1111/mpp.13075

PubMed Abstract | CrossRef Full Text | Google Scholar

Han, L., Zhu, Y., Liu, M., Zhou, Y., Lu, G., Lan, L., et al. (2017). Molecular mechanism of substrate recognition and transport by the AtSWEET13 sugar transporter. Proc. Natl. Acad. Sci. U. S. A. 114, 10089–10094. doi: 10.1073/pnas.1709241114

PubMed Abstract | CrossRef Full Text | Google Scholar

Hu, W., Hua, X., Zhang, Q., Wang, J., Shen, Q., Zhang, X., et al. (2018). New insights into the evolution and functional divergence of the SWEET family in Saccharum based on comparative genomics. BMC Plant Biol. 18:270. doi: 10.1186/s12870-018-1495-y

PubMed Abstract | CrossRef Full Text | Google Scholar

Hu, Y.-B., Sosso, D., Qu, X.-Q., Chen, L.-Q., Ma, L., Chermak, D., et al. (2016). Phylogenetic evidence for a fusion of archaeal and bacterial SemiSWEETs to form eukaryotic SWEETs and identification of SWEET hexose transporters in the amphibian chytrid pathogen Batrachochytrium dendrobatidis. FASEB J. 30, 3644–3654. doi: 10.1096/fj.201600576R

PubMed Abstract | CrossRef Full Text | Google Scholar

Ibuot, A., Dean, A. P., and Pittman, J. K. (2020). Multi-genomic analysis of the cation diffusion facilitator transporters from algae. Metallomics 12, 617–630. doi: 10.1039/d0mt00009d

PubMed Abstract | CrossRef Full Text | Google Scholar

Jeena, G. S., Kumar, S., and Shukla, R. K. (2019). Structure, evolution and diverse physiological roles of SWEET sugar transporters in plants. Plant Mol. Biol. 100, 351–365. doi: 10.1007/s11103-019-00872-4

PubMed Abstract | CrossRef Full Text | Google Scholar

Jia, B., Zhu, X. F., Pu, Z. J., Duan, Y. X., Hao, L. J., Zhang, J., et al. (2017). Integrative view of the diversity and evolution of SWEET and SemiSWEET sugar transporters. Front. Plant Sci. 8:2178. doi: 10.3389/fpls.2017.02178

PubMed Abstract | CrossRef Full Text | Google Scholar

Jiang, Y., Wang, D., Yao, Y., Eubel, H., Künzler, P., Møller, I. M., et al. (2021). MULocDeep: a deep-learning framework for protein subcellular and suborganellar localization prediction with residue-level interpretation. Comput. Struct. Biotechnol. J. 19, 4825–4839. doi: 10.1016/j.csbj.2021.08.027

PubMed Abstract | CrossRef Full Text | Google Scholar

Johnson, X., and Alric, J. (2013). Central carbon metabolism and electron transport in Chlamydomonas reinhardtii: metabolic constraints for carbon partitioning between oil and starch. Eukaryot. Cell 12, 776–793. doi: 10.1128/EC.00318-12

PubMed Abstract | CrossRef Full Text | Google Scholar

Julius, B. T., Leach, K. A., Tran, T. M., Mertz, R. A., and Braun, D. M. (2017). Sugar transporters in plants: new insights and discoveries. Plant Cell Physiol. 58, 1442–1460. doi: 10.1093/pcp/pcx090

PubMed Abstract | CrossRef Full Text | Google Scholar

Kanno, Y., Oikawa, T., Chiba, Y., Ishimaru, Y., Shimizu, T., Sano, N., et al. (2016). AtSWEET13 and AtSWEET14 regulate gibberellin-mediated physiological processes. Nat. Commun. 7:13245. doi: 10.1038/ncomms13245

PubMed Abstract | CrossRef Full Text | Google Scholar

Kelley, L. A., Mezulis, S., Yates, C. M., Wass, M. N., and Sternberg, M. J. E. (2015). The Phyre2 web portal for protein modeling, prediction and analysis. Nat. Protoc. 10, 845–858. doi: 10.1038/nprot.2015.053

PubMed Abstract | CrossRef Full Text | Google Scholar

Kono, A., Chou, T.-H., Radhakrishnan, A., Bolla, J. R., Sankar, K., Shome, S., et al. (2020). Structure and function of LCI1: a plasma membrane CO₂ channel in the Chlamydomonas CO₂ concentrating mechanism. Plant J. 102, 1107–1126. doi: 10.1111/tpj.14745

PubMed Abstract | CrossRef Full Text | Google Scholar

Kozlov, A. M., Darriba, D., Flouri, T., Morel, B., and Stamatakis, A. (2019). RAxML-NG: a fast, scalable and user-friendly tool for maximum likelihood phylogenetic inference. Bioinformatics 35, 4453–4455. doi: 10.1093/bioinformatics/btz305

PubMed Abstract | CrossRef Full Text | Google Scholar

Kuanyshev, N., Deewan, A., Jagtap, S. S., Liu, J., Selvam, B., Chen, L.-Q., et al. (2021). Identification and analysis of sugar transporters capable of co-transporting glucose and xylose simultaneously. Biotechnol. J. 16:2100238. doi: 10.1002/biot.202100238

PubMed Abstract | CrossRef Full Text | Google Scholar

Latorraca, N. R., Fastman, N. M., Venkatakrishnan, A. J., Frommer, W. B., Dror, R. O., and Feng, L. (2017). Mechanism of substrate translocation in an alternating access transporter. Cells 169, 96–107.e12. doi: 10.1016/j.cell.2017.03.010

PubMed Abstract | CrossRef Full Text | Google Scholar

Lee, Y., Nishizawa, T., Yamashita, K., Ishitani, R., and Nureki, O. (2015). Structural basis for the facilitative diffusion mechanism by SemiSWEET transporter. Nat. Commun. 6:6112. doi: 10.1038/ncomms7112

PubMed Abstract | CrossRef Full Text | Google Scholar

Li, X., Si, W., Qin, Q., Wu, H., and Jiang, H. (2018). Deciphering evolutionary dynamics of SWEET genes in diverse plant lineages. Sci. Rep. 8:13440. doi: 10.1038/s41598-018-31589-x

PubMed Abstract | CrossRef Full Text | Google Scholar

Lin, I. W., Sosso, D., Chen, L.-Q., Gase, K., Kim, S.-G., Kessler, D., et al. (2014). Nectar secretion requires sucrose phosphate synthases and the sugar transporter SWEET9. Nature 508, 546–549. doi: 10.1038/nature13082

PubMed Abstract | CrossRef Full Text | Google Scholar

Madeira, F., Park, Y. M., Lee, J., Buso, N., Gur, T., Madhusoodanan, N., et al. (2019). The EMBL-EBI search and sequence analysis tools APIs in 2019. Nucleic Acids Res. 47, W636–W641. doi: 10.1093/nar/gkz268

PubMed Abstract | CrossRef Full Text | Google Scholar

Marchand, J., Heydarizadeh, P., Schoefs, B., and Spetea, C. (2018). Ion and metabolite transport in the chloroplast of algae: lessons from land plants. Cell. Mol. Life Sci. 75, 2153–2176. doi: 10.1007/s00018-018-2793-0

PubMed Abstract | CrossRef Full Text | Google Scholar

Merchant, S. S., Prochnik, S. E., Vallon, O., Harris, E. H., Karpowicz, S. J., Witman, G. B., et al. (2007). The Chlamydomonas genome reveals the evolution of key animal and plant functions. Science 318, 245–250. doi: 10.1126/science.1143609

PubMed Abstract | CrossRef Full Text | Google Scholar

Miller, M. A., Pfeiffer, W., and Schwartz, T. (2010). Creating the CIPRES science gateway for inference of large phylogenetic trees. Paper presented at the 2010 Gateway Computing Environments Workshop (GCE).

Google Scholar

Mistry, J., Chuguransky, S., Williams, L., Qureshi, M., Salazar, G. A., Sonnhammer, E. L. L., et al. (2020). Pfam: the protein families database in 2021. Nucleic Acids Res. 49, D412–D419. doi: 10.1093/nar/gkaa913

CrossRef Full Text | Google Scholar

Miura, K., Yamano, T., Yoshioka, S., Kohinata, T., Inoue, Y., Taniguchi, F., et al. (2004). Expression profiling-based identification of CO₂-responsive genes regulated by CCM1 controlling a carbon-concentrating mechanism in Chlamydomonas reinhardtii. Plant Physiol. 135, 1595–1607. doi: 10.1104/pp.104.041400

PubMed Abstract | CrossRef Full Text | Google Scholar

Morii, M., Sugihara, A., Takehara, S., Kanno, Y., Kawai, K., Hobo, T., et al. (2020). The dual function of OsSWEET3a as a gibberellin and glucose transporter is important for young shoot development in rice. Plant Cell Physiol. 61, 1935–1945. doi: 10.1093/pcp/pcaa130

PubMed Abstract | CrossRef Full Text | Google Scholar

Nguyen, N. T., Nguyen, T. H., Pham, T. N. H., Huy, N. T., Bay, M. V., Pham, M. Q., et al. (2020). Autodock Vina adopts more accurate binding poses but Autodock4 forms better binding affinity. J. Chem. Inf. Model. 60, 204–211. doi: 10.1021/acs.jcim.9b00778

PubMed Abstract | CrossRef Full Text | Google Scholar

Nordberg, H., Cantor, M., Dusheyko, S., Hua, S., Poliakov, A., Shabalov, I., et al. (2013). The genome portal of the Department of Energy Joint Genome Institute: 2014 updates. Nucleic Acids Res. 42, D26–D31. doi: 10.1093/nar/gkt1069

CrossRef Full Text | Google Scholar

Papadopoulos, J. S., and Agarwala, R. (2007). COBALT: constraint-based alignment tool for multiple protein sequences. Bioinformatics 23, 1073–1079. doi: 10.1093/bioinformatics/btm076

PubMed Abstract | CrossRef Full Text | Google Scholar

Pattengale, N. D., Alipour, M., Bininda-Emonds, O. R. P., Moret, B. M. E., and Stamatakis, A. (2009). “How many bootstrap replicates are necessary?” in Research in computational molecular biology. ed. S. Batzoglou (Berlin, Heidelberg: Springer), 184–200.

Google Scholar

Podolsky, I. A., Seppälä, S., Xu, H., Jin, Y.-S., and O'Malley, M. A. (2021). A SWEET surprise: anaerobic fungal sugar transporters and chimeras enhance sugar uptake in yeast. Metab. Eng. 66, 137–147. doi: 10.1016/j.ymben.2021.04.009

PubMed Abstract | CrossRef Full Text | Google Scholar

Robert, X., and Gouet, P. (2014). Deciphering key features in protein structures with the new ENDscript server. Nucleic Acids Res. 42, W320–W324. doi: 10.1093/nar/gku316

PubMed Abstract | CrossRef Full Text | Google Scholar

Savojardo, C., Martelli, P. L., Fariselli, P., Profiti, G., and Casadio, R. (2018). BUSCA: an integrative web server to predict subcellular localization of proteins. Nucleic Acids Res. 46, W459–W466. doi: 10.1093/nar/gky320

PubMed Abstract | CrossRef Full Text | Google Scholar

Selvam, B., Yu, Y.-C., Chen, L.-Q., and Shukla, D. (2019). Molecular basis of the glucose transport mechanism in plants. ACS Cent. Sci. 5, 1085–1096. doi: 10.1021/acscentsci.9b00252

PubMed Abstract | CrossRef Full Text | Google Scholar

Shammai, A., Petreikov, M., Yeselson, Y., Faigenboim, A., Moy-Komemi, M., Cohen, S., et al. (2018). Natural genetic variation for expression of a SWEET transporter among wild species of Solanum lycopersicum (tomato) determines the hexose composition of ripening tomato fruit. Plant J. 96, 343–357. doi: 10.1111/tpj.14035

PubMed Abstract | CrossRef Full Text | Google Scholar

Stamatakis, A. (2014). RAxML version 8: a tool for phylogenetic analysis and post-analysis of large phylogenies. Bioinformatics 30, 1312–1313. doi: 10.1093/bioinformatics/btu033

PubMed Abstract | CrossRef Full Text | Google Scholar

Sui, J.-L., Xiao, X.-H., Qi, J.-Y., Fang, Y.-J., and Tang, C.-R. (2017). The SWEET gene family in Hevea brasiliensis – its evolution and expression compared with four other plant species. FEBS Open Bio. 7, 1943–1959. doi: 10.1002/2211-5463.12332

PubMed Abstract | CrossRef Full Text | Google Scholar

Tao, Y., Cheung, L. S., Li, S., Eom, J.-S., Chen, L.-Q., Xu, Y., et al. (2015). Structure of a eukaryotic SWEET transporter in a homotrimeric complex. Nature 527, 259–263. doi: 10.1038/nature15391

PubMed Abstract | CrossRef Full Text | Google Scholar

Trott, O., and Olson, A. J. (2010). AutoDock Vina: improving the speed and accuracy of docking with a new scoring function, efficient optimization, and multithreading. J. Comput. Chem. 31, 455–461. doi: 10.1002/jcc.21334

PubMed Abstract | CrossRef Full Text | Google Scholar

Tsirigos, K. D., Peters, C., Shu, N., Käll, L., and Elofsson, A. (2015). The TOPCONS web server for consensus prediction of membrane protein topology and signal peptides. Nucleic Acids Res. 43, W401–W407. doi: 10.1093/nar/gkv485

PubMed Abstract | CrossRef Full Text | Google Scholar

Usher, P. K., Ross, A. B., Camargo-Valero, M. A., Tomlin, A. S., and Gale, W. F. (2014). An overview of the potential environmental impacts of large-scale microalgae cultivation. Biofuels 5, 331–349. doi: 10.1080/17597269.2014.913925

CrossRef Full Text | Google Scholar

Wang, J., Yan, C., Li, Y., Hirata, K., Yamamoto, M., Yan, N., et al. (2014). Crystal structure of a bacterial homologue of SWEET transporters. Cell Res. 24, 1486–1489. doi: 10.1038/cr.2014.144

PubMed Abstract | CrossRef Full Text | Google Scholar

Wase, N., Tu, B., Rasineni, G. K., Cerny, R., Grove, R., Adamec, J., et al. (2019). Remodeling of Chlamydomonas metabolism using synthetic inducers results in lipid storage during growth. Plant Physiol. 181, 1029–1049. doi: 10.1104/pp.19.00758

PubMed Abstract | CrossRef Full Text | Google Scholar

Waterhouse, A., Bertoni, M., Bienert, S., Studer, G., Tauriello, G., Gumienny, R., et al. (2018). SWISS-MODEL: homology modelling of protein structures and complexes. Nucleic Acids Res. 46, W296–W303. doi: 10.1093/nar/gky427

PubMed Abstract | CrossRef Full Text | Google Scholar

Xu, Y., Tao, Y., Cheung, L. S., Fan, C., Chen, L.-Q., Xu, S., et al. (2014). Structures of bacterial homologues of SWEET transporters in two distinct conformations. Nature 515, 448–452. doi: 10.1038/nature13670

PubMed Abstract | CrossRef Full Text | Google Scholar

Xuan, Y. H., Hu, Y. B., Chen, L.-Q., Sosso, D., Ducat, D. C., Hou, B.-H., et al. (2013). Functional role of oligomerization for bacterial and plant SWEET sugar transporter family. Proc. Natl. Acad. Sci. U. S. A. 110, E3685–E3694. doi: 10.1073/pnas.1311244110

PubMed Abstract | CrossRef Full Text | Google Scholar

Xue, X., Wang, J., Shukla, D., Cheung, L. S., and Chen, L.-Q. (2022). When SWEETs turn tweens: Updates and perspectives. Annu. Rev. Plant Biol. 73, 379–403. doi: 10.1146/annurev-arplant-070621-093907

CrossRef Full Text | Google Scholar

Yuan, M., and Wang, S. (2013). Rice MtN3/saliva/SWEET family genes and their homologs in cellular organisms. Mol. Plant 6, 665–674. doi: 10.1093/mp/sst035

PubMed Abstract | CrossRef Full Text | Google Scholar

Keywords: Chlorophyta, evolution, green algae, phylogeny, protein structure, sugar transport, SWEET

Citation: Fleet J, Ansari M and Pittman JK (2022) Phylogenetic analysis and structural prediction reveal the potential functional diversity between green algae SWEET transporters. Front. Plant Sci. 13:960133. doi: 10.3389/fpls.2022.960133

Received: 02 June 2022; Accepted: 25 August 2022;
Published: 15 September 2022.

Edited by:

Stanislav Valentinovich Isayenkov, National Academy of Sciences of Ukraine (NAN Ukraine), Ukraine

Reviewed by:

Volodymyr Radchuk, Leibniz Institute of Plant Genetics and Crop Plant Research (IPK), Germany
Artur Conde, University of Minho, Portugal

Copyright © 2022 Fleet, Ansari and Pittman. This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.

*Correspondence: Jon K. Pittman, am9uLnBpdHRtYW5AbWFuY2hlc3Rlci5hYy51aw==

Disclaimer: All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article or claim that may be made by its manufacturer is not guaranteed or endorsed by the publisher.

Phylogenetic analysis and structural prediction reveal the potential functional diversity between green algae SWEET transporters

Introduction

Materials and methods

Identification of chlorophyta SWEETs

Identification of outlier and comparative SWEETs

Phylogenetic analysis

Subcellular localisation prediction

Protein structure modelling and analysis

Results and discussion

Phylogenetic relationship of SWEETs in green algae

Subcellular localisation of green algae SWEETs

Amino acid sequence conservation across green algae and fungi SWEET proteins

Structural simulation of green algae and fungi SWEETs

Prediction of substrate binding and transport

Potential functions and applications of green algae SWEETs

Conclusion

Data availability statement

Author contributions

Funding

Acknowledgments

Conflict of interest

Publisher’s note

Supplementary material

Footnotes

References

94% of researchers rate our articles as excellent or good

94% of researchers rate our articles as excellent or good