ORIGINAL RESEARCH article

Front. Microbiol., 15 January 2020

Sec. Food Microbiology

Volume 10 - 2019 | https://doi.org/10.3389/fmicb.2019.02960

Investigation of Genetic Relationships Between Hanseniaspora Species Found in Grape Musts Revealed Interspecific Hybrids With Dynamic Genome Structures

  • 1. Micalis Institute, INRA, AgroParisTech, Université Paris-Saclay, Jouy-en-Josas, France

  • 2. Micalis Institute, INRA, AgroParisTech, CIRM-Levures, Université Paris-Saclay, Jouy-en-Josas, France

  • 3. SPO, Univ Montpellier, INRA, Montpellier SupAgro, Montpellier, France

Abstract

Hanseniaspora, a predominant yeast genus of grape musts, includes sister species recently reported as fast evolving. The aim of this study was to investigate the genetic relationships between the four most closely related species, at the population level. A multi-locus sequence typing strategy based on five markers was applied on 107 strains, confirming the clear delineation of species H. uvarum, H. opuntiae, H. guilliermondii, and H. pseudoguilliermondii. Huge variations were observed in the level of intraspecific nucleotide diversity, and differences in heterozygosity between species indicate different life styles. No clear population structure was detected based on geographical or substrate origins. Instead, H. guilliermondii strains clustered into two distinct groups, which may reflect a recent step toward speciation. Interspecific hybrids were detected between H. opuntiae and H. pseudoguilliermondii. Their characterization using flow cytometry, karyotypes and genome sequencing showed different genome structures in different ploidy contexts: allodiploids, allotriploids, and allotetraploids. Subculturing of an allotriploid strain revealed chromosome loss equivalent to one chromosome set, followed by an auto-diploidization event, whereas another auto-diploidized tetraploid showed a segmental duplication. Altogether, these results suggest that Hanseniaspora genomes are not only fast evolving but also highly dynamic.

Introduction

Grape must is a complex ecosystem that combines grape and cellar micro-organisms. Many species are interacting with each other, including yeasts and bacteria (Jolly et al., 2014; Capozzi et al., 2015; Raymond Eder et al., 2017). It is now well recognized that natural microbial populations play an important role in winemaking, notably by increasing the complexity of wine aromas (Gamero et al., 2016; Padilla et al., 2016; Bagheri et al., 2018; Escribano et al., 2018). The occurrence of microorganisms in grapes is dependent on biotic and abiotic factors such as geographic location, soil, grapevine cultivar, viticultural practices, and climate (Bokulich et al., 2014; Drumonde-Neves et al., 2016). Yeast biodiversity in must is thus generally vintage dependent (Vigentini et al., 2015). Whereas Saccharomyces cerevisiae, already well known for its importance in diverse fermented food production, is the dominant species at the end of vinification, the yeast species commonly found on grapes and in musts at the beginning of spontaneous fermentations are rather Saccharomycotina than Basidiomycotina. They mostly belong to genera Hanseniaspora, Lachancea, Metschnikowia, Pichia, and Starmerella (Wang et al., 2015; Varela and Borneman, 2017; Cioch-Skoneczny et al., 2018; Lorenzini and Zapparoli, 2019), and the genus Hanseniaspora is generally the most abundant at the onset of the fermentation.

The Saccharomycotina genus Hanseniaspora belongs to the Saccharomycodaceae family. Due to their lemon-shaped cell structure, Hanseniaspora species have been called apiculate yeasts, together with the closely related species of the genera Saccharomycodes and Nadsonia. The genus includes 21 described species, which can be separated into two lineages based on phylogenetic relationships deduced from D1D2 domains of the 28S rRNA subunit (Boekhout et al., 1994; Ouoba et al., 2015; Martin et al., 2018) or from the concatenation of taxonomic markers (Cadez et al., 2019). Recently, based on whole genome sequence comparison, Steenwyk et al. (2019) confirmed Hanseniaspora to be composed of two lineages, a fast-evolving lineage (FEL) and a slow-evolving lineage (SEL), which differ by their evolution rate and the extent of their gene loss. Species found on grapes and musts belong to both FEL (mostly H. uvarum) and SEL (H. vinae). Several studies report that they play an important role in wine fermentation by producing flavors, modulating the growth and metabolism of Saccharomyces cerevisiae and affecting wine color (reviewed in Martin et al., 2018). However, population structure and genetic relationships between Hanseniaspora species remain unclear. Nine species of FEL are particularly close phylogenetically and some of them could be difficult to differentiate probably because they diverged quite recently (Cadez et al., 2002). D1D2 regions of the ribosomal subunit, which are generally used for taxonomic classification and phylogenetic trees, differ by less than seven nucleotides between pairs of species and only by two nucleotides between H. opuntiae and H. guilliermondii, and between H. meyeri and H. clermontiae.

Four of these closely related species are associated with grapes and wine environment: H. uvarum, H. opuntiae, H. pseudoguilliermondii, and H. guilliermondii. Although frequently found in grapes, population studies have been performed only in H. uvarum (Albertin et al., 2016). To clarify the phylogenetic relationships within this species complex, and eventually to detect hidden sub-species clustering, we first designed multi locus sequence typing (MLST) markers to distinguish strains at the inter- and intraspecies level. MLST method allows comparison of variable sequences between strains or species inside highly conserved housekeeping genes. This method, initially developed for the identification of clones within populations of pathogenic bacteria (Maiden et al., 1998), is progressively becoming classic for genomic diversity studies in yeasts from different environments, and many markers have been reported (Bougnoux et al., 2002; Ayoub et al., 2006; Odds and Jacobsen, 2008; Jacques et al., 2017; Tittarelli et al., 2018). In this study, MLST analysis performed on Hanseniaspora strains was used to clarify the complex of the four wine-growing species examined and allowed detection of inter-specific hybrids, which were then investigated by flow cytometry and karyotyping. Finally, genome sequencing and comparison to parental species genomes provided more details into the contribution of each of them to the genome of the hybrids.

Materials and Methods

Hanseniaspora Strains

A total of 120 Hanseniaspora strains were examined in this study, 35 of them were collected from French grape must in Occitanie region, France in 2015 and 2016 (Table 1). The other strains have been collected from grape musts or from other substrates in distant French regions and in countries mainly from Europe, South America and Africa. Yeast cells were cultivated on YPD medium (yeast extract 10 g/L, peptone 10 g/L, glucose 10 g/L) at 28°C.

TABLE 1

GenusSpeciesNameSynonymsSubstrateCountryIsolation date
Hanseniasporaguilliermondii11-1173Toumodi Vin de rônierIvory Coast2016
Hanseniasporaguilliermondii11-1176FlowerIvory Coast2016
HanseniasporaguilliermondiiCBS 1972Grape juiceItaly<1978
HanseniasporaguilliermondiiCBS 2591Trachea of beeFrance<1978
HanseniasporaguilliermondiiCBS 95Fermenting bottled tomatoesNetherlands<1978
HanseniasporaguilliermondiiCLIB 510TCBS 465TInfected nailSouth Africa1978
HanseniasporaguilliermondiiCLIB 1559LemonFrench Guiana2010
HanseniasporaguilliermondiiCLIB 3085Toumodi Vin de rônierIvory Coast2016
HanseniasporaguilliermondiiCLIB 3092V2-10Grape must ViognierFrance (Sommières, Occitanie)2015
HanseniasporaguilliermondiiCLIB 3093S1-2Grape must SauvignonFrance (Puylacher, Occitanie)2015
HanseniasporaguilliermondiiCLIB 3094S1-3Grape must SauvignonFrance (Puylacher, Occitanie)2015
HanseniasporaguilliermondiiCLIB 3095S2-34Grape must SauvignonFrance (Saint Mathieu de Tréviers, Occitanie)2015
HanseniasporaguilliermondiiCLIB 3096S4-9Grape must SauvignonFrance (Ouveillan, Occitanie)2015
HanseniasporaguilliermondiiCLIB 3097S4-16Grape must SauvignonFrance (Ouveillan, Occitanie)2015
HanseniasporaguilliermondiiCLIB 3098V2-1Grape must ViognierFrance (Sommières, Occitanie)2015
HanseniasporaguilliermondiiCLIB 3099V2-2Grape must ViognierFrance (Sommières, Occitanie)2015
HanseniasporaguilliermondiiCLIB 3100V5-12Grape must ViognierFrance (Pech Rouge, Occitanie)2015
HanseniasporaguilliermondiiCLIB 3206V1-9Grape must ViognierFrance (Saint Mathieu de Tréviers, Occitanie)2015
HanseniasporaguilliermondiiCLIB 3207BIA-V1L15Grape must ViognierFrance (Saint Mathieu de Tréviers, Occitanie)2015
HanseniasporaguilliermondiiCLIB 3208V5-13Grape must ViognierFrance (Pech Rouge, Occitanie)2015
HanseniasporaguilliermondiiCLIB 3209V5-30Grape must ViognierFrance (Pech Rouge, Occitanie)2015
HanseniasporaguilliermondiiCLIB 3210S4-54Grape must SauvignonFrance (Ouveillan, Occitanie)2015
HanseniasporaguilliermondiiCLIB 3228R18-212Fermented juice of sugar cane (3 days)Réunion Island2018
Hanseniasporaopuntiae x pseudoguilliermondiiCCY46-1-3Fruit; plum tree (Prunus domestica L. ‘Stanley’)Slovakia (Malé Leváre)2009
Hanseniasporaopuntiae x pseudoguilliermondiiCCY46-1-3aSingle cell colony of CCY46-1-3Lab strain2019
Hanseniasporaopuntiae x pseudoguilliermondiiCCY46-1-3bSingle cell colony of CCY46-1-3Lab strain2019
Hanseniasporaopuntiae x pseudoguilliermondiiCCY46-1-3cSingle cell colony of CCY46-1-3Lab strain2019
Hanseniasporaopuntiae x pseudoguilliermondiiCCY46-1-3dSingle cell colony of CCY46-1-3Lab strain2019
Hanseniasporaopuntiae x pseudoguilliermondiiCLIB 3101V3-28Grape must ViognierFrance (Puylacher, Occitanie)2015
Hanseniasporaopuntiae x pseudoguilliermondiiDBVPG 5828Soil close to plum treeAlgeria2010
Hanseniasporaopuntiae x pseudoguilliermondiiCLIB 3263M18-204Fermented pineappleMayotte Island2018
Hanseniasporaopuntiae x pseudoguilliermondiiCLIB 3313M18-207Fermented pineappleMayotte Island2018
Hanseniasporaopuntiae x pseudoguilliermondiiCLIB 3265M18-215Fermented pineappleMayotte Island2018
Hanseniasporaopuntiae11-1102Fruit papayaGuatemala (Flores)2009
Hanseniasporaopuntiae11-1139Fallen fruitEl Salvador (San Salvador)2009
Hanseniasporaopuntiae11-1184Flower (Hybiscus)Palau (Ngerekebesang)2010
Hanseniasporaopuntiae11-1196Rotten fruit (Syzygium malaccense)Palau (Koror)2010
HanseniasporaopuntiaeCLIB 1203IvyFrench Guiana2008
HanseniasporaopuntiaeCLIB 1208PapayaFrench Guiana2008
HanseniasporaopuntiaeCLIB 1557PapayaFrench Guiana2010
HanseniasporaopuntiaeMUCL 49139TCBS 8733TRotHawaii Island<2003
HanseniasporaopuntiaeCLIB 1564AntFrench Guiana2010
HanseniasporaopuntiaeCLIB 3102V1-109Grape must ViognierFrance (Saint Mathieu de Tréviers, Occitanie)2015
HanseniasporaopuntiaeCLIB 3103V3-16Grape must ViognierFrance (Puylacher, Occitanie)2015
HanseniasporaopuntiaeCLIB 3104V4-31Grape must ViognierFrance (Fabrezan, Occitanie)2015
HanseniasporaopuntiaeCLIB 3105S1-11Grape must SauvignonFrance (Puylacher, Occitanie)2015
HanseniasporaopuntiaeCLIB 3106S1-14Grape must SauvignonFrance (Puylacher, Occitanie)2015
HanseniasporaopuntiaeCLIB 3107BIA-S2L2Grape must SauvignonFrance (Saint Mathieu de Tréviers, Occitanie)2015
HanseniasporaopuntiaeCLIB 3108V1-1Grape must ViognierFrance (Saint Mathieu de Tréviers, Occitanie)2015
HanseniasporaopuntiaeCLIB 3109V3-27Grape must ViognierFrance (Puylacher, Occitanie)2015
HanseniasporaopuntiaeCLIB 3203V2-12Grape must ViognierFrance (Sommières, Occitanie)2015
HanseniasporaopuntiaeCLIB 3204V2-31Grape must ViognierFrance (Sommières, Occitanie)2015
HanseniasporaopuntiaeCLIB 3205S2-25Grape must SauvignonFrance (Saint Mathieu de Tréviers, Occitanie)2015
HanseniasporaopuntiaeCLIB 3227R18-143”Grenadille” fruitRéunion Island2018
HanseniasporaopuntiaeCLIB 3230R18-219Lemonade (3 days fermentation)Réunion Island2018
HanseniasporaopuntiaeCLIB 3234R18-484Lemonade (3 days fermentation)Réunion Island2018
Hanseniasporapseudoguilliermondii11-494Fruit papayaThe Philippines (Manila)2011
HanseniasporapseudoguilliermondiiCBS 8772TOrange juice concentrateunknown<2006
HanseniasporapseudoguilliermondiiCLIB 1441OrangeFrench Guiana2010
HanseniasporapseudoguilliermondiiCLIB 3226R18-113Fruit EviRéunion Island2018
HanseniasporapseudoguilliermondiiCLIB 3229R18-218Lemonade (3 days fermentation)Réunion Island2018
HanseniasporapseudoguilliermondiiCLIB 3233FruitFrench Guiana2010
Hanseniasporauvarum10-1471GrapeSlovakia (Malá Tr̀ňa)2015
Hanseniasporauvarum11-1148FlowerGuatemala (Guatemala City)2009
Hanseniasporauvarum11-1288Flower (Leguminosae)Romania (Bucuresti)2006
HanseniasporauvarumA1Grape must Pinot noir Côte de nuitFrance (Burgundy)2005
HanseniasporauvarumA4Grape must Pinot noir Côte de nuitFrance (Burgundy)2005
HanseniasporauvarumB2Grape must Pinot noir Côte de nuitFrance (Burgundy)2005
HanseniasporauvarumC4Grape must Pinot noir Côte de beauneFrance (Burgundy)2005
HanseniasporauvarumCBS 2583Fermenting cucumber brineUnited States?
HanseniasporauvarumCBS 2585Sour doughPortugal1978
HanseniasporauvarumCBS 2588Tanning fluidFrance?
HanseniasporauvarumCBS 286SoilIndonesia<1934
HanseniasporauvarumCCY25-6-34Grape must (7 days fermentation), variety Green VeltlinerSlovakia (Strekov)2008
HanseniasporauvarumCCY25-6-36Soil adjacent to apricot treeSlovakia (Malé Zálužie)2013
HanseniasporauvarumCCY46-1-2Fresh-water lakeSlovakia (Plaveckẏ Štvrtok)1987
HanseniasporauvarumCLIB 303TCBS 314TMuscatel grapeUkraine<1978
HanseniasporauvarumCLIB 512SoilDanemark1978
HanseniasporauvarumCLIB 979Sylvaner start AFFrance (Alsace)2001
HanseniasporauvarumCLIB 1207Cacao BerryFrench Guiana2008
HanseniasporauvarumCLIB 1209Saul flowerFrench Guiana2008
HanseniasporauvarumCLIB 1210Saul flowerFrench Guiana2008
HanseniasporauvarumCLIB 1563ButterflyFrench Guiana2010
HanseniasporauvarumCLIB 1627Saul flowerFrench Guiana2008
HanseniasporauvarumCLIB 1650VineyardFrance (Alsace)2007
HanseniasporauvarumCLIB 3110S1-8Grape must SauvignonFrance (Puylacher, Occitanie)2015
HanseniasporauvarumCLIB 3111V1-6Grape must ViognierFrance (Saint Mathieu de Tréviers, Occitanie)2015
HanseniasporauvarumCLIB 3114V7-44Grape must ViognierFrance (Pech Rouge, Occitanie)2016
HanseniasporauvarumCLIB 3115S6-45Grape must SauvignonFrance (Pech Rouge, Occitanie)2016
HanseniasporauvarumCLIB 3117V4-17Grape must ViognierFrance (Fabrezan, Occitanie)2015
HanseniasporauvarumCLIB 3118V3-8Grape must ViognierFrance (Puylacher, Occitanie)2015
HanseniasporauvarumCLIB 3119dV7-11Grape must ViognierFrance (Pech Rouge, Occitanie)2016
HanseniasporauvarumCLIB 3120dV7-39Grape must ViognierFrance (Pech Rouge, Occitanie)2016
HanseniasporauvarumCLIB 3123dV7-66Grape must ViognierFrance (Pech Rouge, Occitanie)2016
HanseniasporauvarumCLIB 3224R18-8Coffee beanRéunion Island2018
HanseniasporauvarumCLIB 3225R18-18Fermented coffeeRéunion Island2018
HanseniasporauvarumCLIB 3231R18-221GrapeRéunion Island2018
HanseniasporauvarumCLIB 3232R18-409Badamier fruitRéunion Island2018
HanseniasporauvarumCLIB 3237R18-2Coffee beanRéunion Island2018
HanseniasporauvarumCLIB 3239R18-388GrapeRéunion Island2018
HanseniasporauvarumCLIB 3241R18-185Badamier fruitRéunion Island2018
HanseniasporauvarumD8Grape must Pinot noir Côte de beauneFrance (Burgundy)2005
HanseniasporauvarumE6Grape must Pinot noir Côte de ChalonnaisFrance (Burgundy)2005
HanseniasporauvarumMAFF 516149UnknownJapan<2016
HanseniasporauvarumNZ15Grape mustNew Zealand2009
HanseniasporauvarumNZ234Grape mustNew Zealand2009
HanseniasporauvarumPMH-15-56UnknownLituania2015
HanseniasporauvarumRNH-15-14UnknownLituania2015
HanseniasporauvarumRWH-15-1UnknownLituania2015
HanseniasporauvarumS246-OACherryGermany (Württemberg)2014
HanseniasporauvarumS382-CBGrapesGermany (Württemberg)2014
HanseniasporaclermontiaeY-27515TCBS 8821TRotten stem of a lobelioid plantHawaï, United States2003
HanseniasporajakobseniiCBS 12942TBandji, a traditional palm wine of the palm tree Borassus akeassiiBurkina Faso<2012
HanseniasporalachanceiY-27514TCBS 8818TFermenting agave juiceMexico<2000
HanseniasporameyeriY-27513TCBS 8734TFruit of Sapindus sp.Hawaii Island<2003
HanseniasporanectarophilaCBS 13383TFlower of Siphocampylus corymbiferusBrazil2006
HanseniasporasingularisCBS 10840TFlowerThailand<2009
HanseniasporathailandicaCBS 10841TLichenThailand<2009
HanseniasporavalbyensisY-1626TCBS 479TSoilDanemark1912
KloeckerahatyaiensisCBS 10842TRoots of Chamaedaphne calyculata, from rotted woodThailand<2009

List of strains.

Isolation date is anterior to publication or sequence deposition related to this strain.

DNA Extraction

DNA extractions were carried out on cells grown in complete medium to stationary phase with two different methods: extraction with the Masterpure yeast DNA purification kit (Epicentre, France), and with an in-house protocol involving a mechanical and chemical lysis. Briefly, cells were resuspended in 200 μL of lysis buffer (Tris 10 mM pH8, EDTA 1 mM, NaCl 100 mM, Triton 2% and SDS 1%), 200 μL of phenol chloroform isoamyl alcohol 25:24:1, and 0.3 g of glass beads. This step was followed by 4 min of vortexing and 5 min of centrifugation at 13,000 rpm. Then, the aqueous phase was mixed with ethanol and the precipitated DNA pellet washed twice with ethanol 70%, dried and resuspended in 100 μL of TE with 1 μL of RNase A at 10 mg/mL.

Amplification and Sequencing of D1D2 and ITS Regions

All strains, which did not originate from international collections, were identified by amplification and sequencing of the D1D2 and ITSs regions of the ribosomal subunit. The identification of strains from international collections was verified likewise only when there was a doubt about the initial identification after MLST results. Sequences obtained were compared to reference strains by basic local alignment search tool (BLAST) on the YeastIP server1. The primers used for these amplifications were ITS1 (TCCGTAGGTGAACCTGCGG) and NL4 (GGTCCGTGTTTCAAGACGG). Amplifications were carried out by polymerase chain reaction (PCR) with the kit Taq Mix (Dongsheng Biotech II) in a 40 μL reaction volume with a first step of DNA denaturation at 94°C for 3 min, followed by 30 cycles of DNA denaturation at 94°C for 30 s, primer hybridization at 55°C for 30 s, and elongation at 72°C for 1 min. A final elongation was performed at 72°C for 3 min. All PCR reaction were performed in a SimpliAmpTM thermal Cycler (Applied BiosystemsTM).

Multilocus Sequence Typing

Five housekeeping genes previously used for MLST studies in other yeast species (Bougnoux et al., 2002; Munoz et al., 2009) were selected: ACC1, ADP1, GLN4, RPN2, and VSP13 (Supplementary Table S1). For each marker, homologous sequences of H. guilliermondii, H. opuntiae and H. uvarum were retrieved from available genomes (Sternes et al., 2016; Seixas et al., 2019) and aligned with Multalin (Corpet, 1988). A set of 11 primer pairs were designed in regions highly conserved surrounding variable sequences of 200–1100 nucleotides. Three additional markers were chosen in intergenic regions of H. uvarum genome using Artemis (Rutherford et al., 2000) as a visualization tool: between the homolog of MET5 and the upstream gene, between homologs of SKI2 and DUS3, and between RNR2 and CRM1. Then, homologous sequences of H. guilliermondii, and H. opuntiae were extracted from EMBL files with Artemis and aligned with that of H. uvarum using Multalin. Five primer pairs were designed in these regions the same way as for housekeeping genes (Supplementary Table S1). Selection of the best primer pairs was performed on an initial set of four strains of Hanseniaspora: H. uvarum CLIB 303T, H. opuntiae MUCL 49139T, H. guilliermondii CLIB 510T, and H. pseudoguilliermondii CLIB 1441. Markers of these strains were amplified and sequenced with each pair of primers. The PCR conditions in a 40 μL reaction volume were: 3 min at 94°C followed by 30 cycles of 30 s at 94°C, 30 s at 55°C, and 1 min at 72°C, with a final step of 2 min at 72°C. Agarose gel electrophoresis was performed to select the best primer pairs. To this end, two criteria were used: (1) amplicon of good intensity in the four strains, (2) a single band per amplicon. Among the 16 primer pairs tested, five were retained and additional pairs of primers were designed specifically to amplify the selected markers in the divergent strain Y-1626 of H. valbyensis, considered as the outgroup for phylogenetic trees. In total, 107 strains of Hanseniaspora were amplified and sequenced likewise with each of the five selected primer pairs, in order to detect heterozygous sites, to count polymorphic sites inside each species, and to build phylogenetic trees. Sequences of homologous genes in H. guilliermondii UTAD222 and H. uvarum AWRI 3580 were added to the alignments. For each marker sequence, heterozygous sites were looked for on Chromas (Technelysium)2, and replaced by the corresponding degenerated base. Resulting sequences were aligned with clustal, manually cleaned for complex regions with gaps and highly variable positions, and then concatenated using Seaview (Gouy et al., 2010).

For phylogenetic analysis at the species level, concatenated sequences were converted into bi-allelic sequences using an in-house python script. Phylogenetic trees were constructed by maximum likelihood using phyML (Guindon et al., 2009) with a GTR substitution model. Robustness of the tree was assessed by the approximate likelihood ratio test approach (aLRT) and bootstrap of 100 replicates.

Population Structure Analysis

The concatenated sequence file for the five MLST markers was converted into a STRUCTURE (Pritchard et al., 2000) compatible format with an in-house Perl script where each base is encoded by two digits with values between 0 and 3, allowing consideration of bi-allelic positions. Then, population structure was analyzed with STRUCTURE and InStruct (Gao et al., 2007). Ten runs (STRUCTURE) and 15 runs (InStruct) were performed and the best partitioning was determined for each software output from different criteria: the best likelihood, the increase of likelihood, and the variation between the runs as proposed by CLUMPAK (Kopelman et al., 2015), and using the best DIC (Deviance information criterion) for Instruct. The most frequent consensus partition was then inferred at the optimal number of groups K with CLUMPAK.

Analysis of Genetic Diversity

Nucleotide diversity was compared between each marker at the species level using the statistics Π and π (Nei and Li, 1979). For that purpose, non-concatenated marker sequences were turned into bi-allelic sequences and analyzed with the R package PopGenome (Pfeifer et al., 2014). In order to infer the mating system, selfing rates were estimated for each species from the Fis obtained with the R package Genepop v1.13 after conversion of the structure data file to the genepop format using PGDspider (Lischer and Excoffier, 2012), and using the RMES software (David et al., 2007) from the heterozygosity profile obtained with a custom script.

Flow Cytometry

For flow cytometry analysis, cells were first grown overnight in YPD medium at 28°C under 180 rpm agitation, and then diluted to OD600 0.1 in 10 mL YPD and placed in the same conditions during 5 h. After OD measuring, about 107 cells were centrifuged 1 min at 10,000 rpm with 1 mL water. The cell pellets were then suspended in 1 mL water, added drop by drop in 8 mL of ethanol 75% with permanent vortexing and finally stored at 5°C. After one night at 5°C, the cells were centrifuged 5 min at 3,000 rpm, suspended in 1 mL of PBS buffer and centrifuged once again 1 min at 13,000 rpm. Cell pellets were then suspended in 500 μL of RNase A (2 mg/mL in 10 mM Tris-Cl and 15 mM NaCl) and incubated 1 h at 37°C. Finally, cells were treated 1 h at 50°C with 200 μL of 1 mg/mL of proteinase K diluted in PBS buffer, centrifuged and suspended in 500 μL of PBS buffer before being sonicated 15 s in a Branson Sonifier 250 sonicator at 10% of the maximum power. About 106 cells were labeled with SYTOX® green (Invitrogen) in 250 μL at a final concentration of 1 μM. The DNA content was determined on a C6 Accuri (Ann Arbor, MI, United States) spectrophotometer with an excitation wavelength of 488 nm and an emission wavelength of 530 ± 15 nm. Acquisition was performed on 30,000 events observed with a gating on forward scatter/side scatter signal. The flow rate was set to approximately 2,000 events per second (medium flow, 35 μL/min; core, 16 μm). The Python (v. 2.7.13) module FlowCytometryTools (v. 0.5.0) was used for data extraction and manipulation and an “in house” R (v. 3.3.3) script was developed for graphical representations.

Karyotyping

Yeast karyotyping was achieved by contour-clamped homogeneous electric field (CHEF) gel electrophoresis. Plugs of yeast chromosomes were prepared according to (Vezinhet et al., 1990). The CHEF-DR III pulsed-field gel electrophoresis system (Bio-Rad, Hercules, CA, United States) was set, for a first karyotype, to 3 V/cm with a switching time of 360 s for 23 h and then to 6 V/cm with pulses of 70–90 s for 20 h in 0.5X TBE buffer at 13.5°C in 1% Seakem GTG agarose (FMC) gel. A second karyotype was made to better separate the high molecular weight chromosomes by setting the apparatus to 3 V/cm with pulses of 480 s for 18.1 h, to 4 V/cm with pulses of 300 s for 24 h and to 6 V/cm with pulses 90 s for 12 h in 0.5X TBE buffer at 13°C. We used S. cerevisiae CLIB 112 (=YNN295) chromosomal DNA as a marker. The agarose gels were stained with ethidium bromide (0.5 μg/mL) and washed with water before being visualized under UV.

Genome Sequencing and Analysis

The DNAs of strains CCY46-1-3, CLIB 3101, DBVPG 5828, CLIB 3263, CLIB 3313, CCY-46-1-3a, and S382-CB were used to construct shotgun 400-bp insert libraries, which were sequenced in paired-end (2 × 150 bp) using the Illumina HiSeq2000 platform (Supplementary Table S2). Sequencing reads were cleaned using Fastp v. 0.19.4 (Chen et al., 2018) and used for retrieving MLST marker sequences and whole genome analysis.

First, the cleaned reads of the six interspecific hybrids were used for mapping against the five amplified markers of strains CLIB 1441 (H. pseudoguilliermondii) and MUCL 49139T (Type strain of H. opuntiae) using HiSat2 v. 2.0.4 (Kim et al., 2015) and the samtools package v. 1.9 (Li et al., 2009). SNPs and Indels were visualized using Artemis to determine manually the sequence of each allele. The different haplotypes obtained in this way were finally considered for the MLST analyses. Second, the cleaned reads were used to estimate the proportion of each parental species using SppIDer (Langdon et al., 2018). This tool only requires the genome sequences of each parental species, in FASTA format, and the sequencing reads of hybrid strains. It is rooted on read mapping and provides various statistics and graphical representations of read coverage against the considered parental genomes. The genome assembly of H. pseudoguilliermondii ZIM213 (Shen et al., 2018) and H. opuntiae AWRI 3578 (Sternes et al., 2016) were used as references. In order to ease the reading and the interpretation of SppIDer analyses, the scaffolds of these two genomes were preliminary re-ordered to make their sequences collinear as much as possible. To do so, we used MUMmer tool suite v4,0,0beta (Marcais et al., 2018). Maximal unique matches were retrieved with nucmer and the optimal scaffold order was obtained with the -layout option of mummerplot. The dotplot illustrating the re-ordered genome sequences is available in Supplementary Figure S1 on which we also reported the genomic position of the five MLST markers used in this study.

Genome-wide genotyping of hybrid strains was performed as follows. First, cleaned reads were separately aligned on both aforementioned reference genomes using HiSat2 under no-mixed and no-spliced-alignment options, leading to two SAM/BAM files per strain. Mapped read pairs were extracted from these files with samtools, and ambiguous reads that aligned on both reference genomes were removed using BBMap from the BBTools suite3 (Supplementary Table S2). Optical and PCR duplicates were then removed with the MarkDuplicates command from Picard v. 2.9.04. Variant calling at the strain level was finally performed using the HaplotypeCaller from GATK v. 3.7, the Genome Analysis Toolkit (McKenna et al., 2010). At this step, raw SNPs and Indels identified were outputted into genomic VCF files (gVCF).

In order to visualize the level of heterozygosity along both parental genomes, each individual gVCF file (two per strain) was first converted into a VCF file using GATK GenotypeGVCFs tool. Raw SNPs were extracted and submitted to a hard-filtering procedure. SNPs that matched the following criteria were filtered out: QualByDepth (QD) < 5.0, FisherStrand (FS) > 55, StrandOddsRation (SOR) > 2.0, RMSMappingQuality (MQ) < 40, MappingQualityRankSumTest (MQRankSum) <−5.0, and ReadPosRankSumTest (ReadPosRankSum) <−5.0. These cutoff thresholds were chosen after visual inspection of the respective distributions of these annotation fields. The rational here was to find a balance between a global transition/transversion ratio (Ts/Tv) close to 2 – an indicator of an effective false positive cleaning – and minimizing the loss of true biological SNPs. Following the hard-filtering procedure, the final call sets were constituted by retaining only biallelic positions. Heterozygous SNPs were then extracted using SnpSift v. 4.3 (Cingolani et al., 2012) and were quantified within 10-kb sliding windows along reference genomes. For that purpose, bedtools v. 2.27.1 (Quinlan and Hall, 2010) was used for both genome splitting into consecutive windows and SNP counting. The minor allele coverage (in percentage of the total read coverage) at biallelic positions was computed for each hybrid genome on 2-kb non-overlapping sliding windows across H. opuntiae reference genome.

Assessment of the genetic distance between hybrid strains relied on a similar genotyping approach as that described above, the only difference being that it was based on a joint variant analysis. Briefly, individual gVCF files obtained from one of the parental genome were first pooled together into a cohort gVCF file using the CombineGVCFs tool from GATK. This cohort file was then converted into a VCF file, and the final SNP call set was constituted as previously described. Only an extra filtering step was added in order to remove all positions showing missing data. This cohort call set was first used to estimate nucleotide divergence between strains with the help of Plink v. 1.9 (Purcell et al., 2007) in order to quantify the number of SNPs sharing 0 and 1 Identity By State (IBS0 and IBS1, respectively) in pairwise comparisons for all strains. Principal component analyses were performed on cohort SNP data of each sub-genome using the R package adegenet v. 2.1.1 (Jombart, 2008; Jombart and Ahmed, 2011).

In order to estimate the level of heterozygosity in H. uvarum triploid strain S382-CB, we measured the read coverage at each heterozygous position. For that purpose, we genotyped the strain as described above, using H. uvarum strain AWRI 3580 as reference genome (Sternes et al., 2016). Only the hard-filtering procedure was applied to the individual VCF file so that all levels of ploidy were retained in the final individual-level call set. Bi- and tri-allelic positions were manually extracted according to the content of the GT and AD fields. Briefly, all positions enclosing either a 0/1- or a 1/2-GT field with less than 5% of total reads supporting the reference allele (AD field) were considered bi-allelic. Conversely, all positions containing a 1/2-GT field with reads supporting both reference and two alternative alleles were considered tri-allelic. Read coverage values were normalized in order to take into account variations of sequencing depth between positions.

Gene Annotation

As the genome annotation of H. pseudoguilliermondii ZIM213 (Shen et al., 2018) was not available, we proceeded to the annotation of protein-coding genes of 576-kb corresponding to the region of CLIB 3263, which showed a 1.4 X coverage compared to the rest of the genome. Non-overlapping open reading frames larger than 180 nt, i.e., with a translated sequence larger than 60 amino acids were compared to S. cerevisiae proteome using blastp with an E-value threshold of 1.e-10. Sequences without any hit were compared to nr database limited to fungi at the NCBI using blastp. Sequences with positive matches were manually curated for initiator methionine (iMet) and introns. Amino acid sequences with an iMET and smaller than 100 aa without any homologs were discarded. Functional annotations, if any, were transferred from their putative homologs. Similarly, the H. pseudoguilliermondii homologous sequence of 126-kb corresponding to the region of H. opuntiae lost in CCY 46-1-3a was annotated the same way and manually compared to H. opuntiae for synteny and gene model prediction using artemis.

MAT Locus Analysis

The MAT loci of the reference genomes ZIM213 and AWRI 3578 were annotated (Supplementary Figure S2). Sequencing reads of the hybrid genomes were aligned on reference MAT loci using HiSat2 under no-mixed, no-discordant and no-spliced-alignment options and k = 1 parameter. Mapped read pairs were extracted with samtools view and bam2fastq. The recovered reads were then assembled with SPAdes with default parameters (Bankevich et al., 2012).

Results

Species Delineation in Hanseniaspora

Among 16 primer pairs designed in eight markers, five were selected because they amplified and gave an amplicon of the same size for H. uvarum, H. opuntiae, H. guilliermondii, and H. pseudoguilliermondii, as estimated on agarose gel. The selected loci and primer pairs were GLN4 Glutamine tRNA synthetase (GLN4F2/GLN4R2), ADP1 ATP-dependent permease (ADP1F2/ADP1R2), intergenic region upstream of MET5 sulfite reductase beta subunit (MET5F1/MET5R1), RPN2 subunit of the 26S proteasome (RPN2F2/RPN2R2) and VPS13 vacuolar protein sorting (VPS13F2/VPS13R2). 107 strains were successfully amplified and sequenced for MET5, ADP1, RPN2, VPS13 and GLN4, whereas strains of more divergent Hanseniaspora species could not be amplified with some primer pairs (Supplementary Table S3). Among the 107 strains, three showed complex chromatograms, characteristic of hybrid markers. After unsuccessful attempts to clone the different alleles due to sequence recombination probably during the PCR step, we finally decided to sequence the whole genomes of CLIB 3101, CCY46-1-3 and DBVPG 5828. Their respective alleles were manually reconstructed by mapping the reads against the five markers of both CLIB 1441 (H. pseudoguilliermondii) and MUCL 49139T (H. opuntiae). Two to three divergent alleles per marker were obtained for each of the three strains, suggesting that the strains are H. pseudoguilliermondii × H. opuntiae hybrids. The only exception was an absence of H. pseudoguilliermondii MET5 in DBVPG 5828, a region that may have been lost or rearranged.

Phylogenetic trees were constructed from the concatenation of the five markers. Four clearly distinct groups of strains emerged from the tree, corresponding to the four studied species (Figure 1). The topology of the concatenated tree, which included H. valbyensis as outgroup and H. lachancei and H. clermontiae as internal references, was coherent with the topology based on concatenated datasets of actin, D1D2 and ITS gene sequences (Cadez et al., 2006) or on 1,034 orthologous proteins (Steenwyk et al., 2019). As expected, strains of H. pseudoguilliermondii appeared as a separate group, very close to the group of H. opuntiae strains. Alleles of CLIB 3101, CLIB3263 (its genome was also sequenced, see below) and CCY46-1-3, grouped with both H. pseudoguilliermondii and H. opuntiae strains, confirming that these strains are hybrids that derive from a cross between the two most closely related species, H. pseudoguilliermondii and H. opuntiae.

FIGURE 1

Population Structure in H. uvarum, H. guilliermondii, and H. opuntiae

In order to explore more thoroughly the phylogenetic relationships between strains of the same species, and to find putative links between strains of the same substrate or geographical origin, we drew trees relying on the concatenated marker sequences, species by species for H. uvarum, H. guillermondii and H. opuntiae. H. pseudoguilliermondii was excluded from this analysis due to the insufficient number of strains. For H. uvarum (Supplementary Figure S3) and H. opuntiae (Supplementary Figure S4), the tree topology was generally supported by low bootstrap values and no clear correlation to substrate origin emerged. Nevertheless, for H. uvarum, a somehow clear phylogenetic signal was obtained for a few strains isolated from the French Guyana and for a group enriched in strains from La Réunion Island, which separated from the other isolates. This signal was echoed by the population structure analysis performed with STRUCTURE and InStruct that provided the same clustering for K equal to three. In contrast, the phylogeny of H. guillermondii presented two distinct populations and a single separate strain, CLIB 3085 from Ivory Coast, which attests of the presence of a probable third population (Supplementary Figure S5A). STRUCTURE and InStruct analysis confirmed the partitioning of the strains into three distinct populations (Supplementary Figure S5B). Interestingly, three strains, which were isolated in the same grape must in Ouveillan (Occitanie, France), belong to population 1 (S4–54) or population 2 (S4–16 and S4–9). Similarly, V5–13 and V5–30 belong to population 1 and V5–12 to population 2; all of them were isolated in Pech Rouge (Occitanie, France). Last, for H. opuntiae, STRUCTURE and Instruct analyses did not provide any convergent population structure in agreement with the MLST data (Supplementary Figure S4).

Hanseniaspora Species Exhibit Differences in Nucleotide Variability

Analysis of heterozygous and polymorphic sites permitted us to estimate the relative divergence of markers within each species (Table 2). Whereas the most conserved marker was GLN4 in all species, the intergenic region MET5 displayed the highest diversity with up to 10.24% of polymorphic sites and a π value up to 40.55 nucleotides per site in H. opuntiae. In this species, MET5 alleles showed two different sizes differing by an insertion of five nucleotides in the intergenic region; some strains showed the presence of both types. Insertions of 3–4 nt were also found in MET5 from H. guilliermondii CLIB 3085 and some strains of H. pseudoguilliermondii. We also found an insertion of 3 nt in the coding sequence of RPN2 in H. uvarum, which corresponds to an additional amino acid. From these data, we could also observe differences between species. H. opuntiae showed the highest percentages of polymorphic sites and π values in all markers but GLN4, even if the number of strains studied was two times less than in H. uvarum. In contrast, H. guilliermondii appeared the least variable species, with values quite similar to that of H. pseudoguilliermondii for which we studied three times fewer strains.

TABLE 2

H. uvarum
H. guilliermondii
H. opuntiae
H. pseudoguilliermondii
MLST locusADP1GLN4MET5RPN2VPS13ADP1GLN4MET5RPN2VPS13ADP1GLN4MET5RPN2VPS13ADP1GLN4MET5RPN2VPS13
# Sequences50505050502424242424262626262699999
# total sites445559909951566440556917973563419564859961551432556918953561
# heterozygous sites110165450210205021880593345012
# polymorphic sites2172159602474211152288860355516145
% polymorphic sites3.823.766.496.314.241.590.722.291.130.895.251.4210.246.246.351.150.901.741.470.89
Nucleotide diversity (Π)34.352.8714.5217.3310.002.521.7911.126.533.374.622.2140.5516.916.763.394.119.8910.893.33

Divergence of markers in the four species of Hanseniaspora.

1The number of heterozygous sites refers to the number of positions where at least one strain has a heterozygous position. 2The number of polymorphic sites refers to the number of positions where at least two alleles are present in the strain population, including heterozygous positions. 3Nucleotide diversity is measured by the average number of nucleotide differences between all pairs of strains in each alignment.

As the analysis of the genetic variation in each gene revealed high proportions of heterozygous loci variable across species, we wondered if these differences might originate from different life styles. The estimation of the different allelic frequencies revealed a highly significant deficit of heterozygosity for the four species. However, in relation with the Fis value of each population that varied from 0.1 to 1, the estimate of the selfing rates s varied from 0.21 for H. opuntiae, 0.61 for H. uvarum, 0.74 for H. pseudoguilliermondii and 0.99 for H guillermondii (Table 3). In order to avoid the impact of hidden population structure, we estimated Fis and s from a subset of H. uvarum devoid of strains from Guyane or la Reunion and we obtained similar results indicating that population structure does not explain Fis and s value for H uvarum. Last, selfing rates inferred from the heterozygosity profile with RMES varied also in similar proportions with those obtained from Fis.

TABLE 3

H. uvarumH. guilliermondiiH. opuntiaeH. pseudoguilliermondii
# Individuals5024236
# Polymorphic sites1814821329
Nucleotide diversity (π) per kb14.317.3424.169.24
Fis0.43720.9740.1160.5825
Deficit of heterozygosity P-value reject null hypothesis0.0000.0000.0000.000
Selfing estimated from Fis0.610.990.210.74
Selfing estimated from heterozygosity profile0.63 ± 0.081.000.45 ± 0.080.80 ± 0.14

Statistics of genetic diversity applied to Hanseniaspora sequences.

The three interspecific hybrids were removed from this analysis.

One major feature is the complete absence of heterozygous sites in all strains of H. guilliermondii but one, CLIB 3085, which contrasts with the substantial heterozygosity of the other species. In order to check if these differences in heterozygosity might be due to differences in ploidy, we analyzed the ploidy of the strains of our strain set.

Variable Strain Ploidy in Hanseniaspora Species

We used flow cytometry and compared the level of Sitox green intensity in H. guilliermondii cells and in other Hanseniaspora strains that were found heterozygous for the MLST markers, and are thus probably diploid. As shown in Figure 2A, the intensity of all H. guilliermondii strains is of the same order as diploid strains of other species, suggesting that these strains are also diploid. As for the hybrids, DBVPG 5828 (Figure 2B) and CLIB 3101 (Figure 2C) were clearly found triploid whereas it was more difficult to evaluate the ploidy of CCY46-1-3 (Supplementary Figure S6). We also found another triploid strain, S382-CB, for which only H. uvarum alleles were found in MLST. We can suspect that this strain is not an interspecies hybrid but rather a H. uvarum triploid (Figure 2D). Considering the quite high number of hybrids that we found randomly, we decided to screen a collection of additional strains with MET5 as a selective marker. We found three candidate strains isolated from the same biological sample, fermented pineapple from Mayotte Island. All of them have H. opuntiae and H. pseudoguilliermondii MET5 alleles. Two of them are diploid, CLIB 3313 and CLIB 3265, whereas the third one is tetraploid, CLIB 3263 (Figures 2A,E). As for the first hybrids, the genomes of strains CLIB 3313 and CLIB 3263 were sequenced and the deduced MLST marker sequences, identical for both strains, were added to the phylogenetic tree in Figure 1.

FIGURE 2

Genomic Structure of Hybrids

Considering that H. opuntiae and H. pseudoguilliermondii are closely related but distinct species, we hypothesized that they might have different karyotypes as this is the case for sister species in Saccharomyces (Fischer et al., 2000) and thus that this could provide us with further clues about the genomic structure of hybrids. However, karyotypes of the type strains of parental species did not indicate any chromosome length polymorphism, and this was the case for the hybrids too (Supplementary Figure S7). We thus decided to sequence their genome with a shotgun strategy with Illumina sequencing chemistry, and to map the reads to reference genomes with SppIDer. To this end, the genome of H. pseudoguilliermondii ZIM213 (Shen et al., 2018) and H. opuntiae AWRI 3578 (Sternes et al., 2016) were used as references. A preliminary analysis with H. guillermondii UTA222, H. uvarum AWRI 3580, H. osmophila AWRI 3579 and H. vineae T02/19AF suggested that the hybrids did not have a third parent (Supplementary Figure S8). The genomes of CLIB 3101, DBVPG 5828 and CCY46-1-3 showed an overall proportion of 2:1, suggesting that these strains are allotriploid H. opuntiae (2n) × H. pseudoguilliermondii (1n), with the nomenclature proposed by Nguyen and Boekhout (Nguyen and Boekhout, 2017). In CLIB 3101, the reads were homogeneously distributed along the chromosomes whereas in DBVPG5828, differences in parental genome contribution was observed along the chromosomes, accounting for numerous chromosomal rearrangements leading, in some cases, to losses of H. pseudoguilliermondii genomic regions, which were counterbalanced by a triploidization of H. opuntiae homologous regions (Figure 3). One of the lost regions contained the MET5 allele of H. pseudoguilliermondii, which could not be amplified in the MLST approach. The pattern of CCY46-1-3 was much more difficult to analyze as the read coverage sometimes showed intermediate values to 1n, 2n, or 3n. Together with the flow cytometry pattern, this result suggested that CCY46-1-3 might be a population of cells with different genomic contents. We thus selected four individual colonies from CCY46-1-3. They had the same pattern in flow cytometry, suggesting that they were all tetraploid (Supplementary Figure S5). Sequence analysis of the genome of one of them, CCY46-1-3a, showed large H. opuntiae genomic regions of probable ploidy 2n and 4n if we consider CCY46-1-3a is a tetraploid. Homologous regions originated from the H. pseudoguilliermondii parent were 2n or lost, respectively. A region of 126.5 kb long was lost in H. opuntiae and is 4n in H. pseudoguilliermondii. One extremity of this region corresponds to the extremity of the scaffolds in both reference genomes (PPNX01000020 in ZIM213; LPNL01000005 in AWRI 3578), which might be a subtelomeric region. The other extremity is internal to the scaffolds, at the locus of homologs of the tandem genes CSH2 and CSH3. Other changes in read coverage occurred internal to H. opuntiae scaffolds (e.g., in LPNL01000002, LPNL01000007, and LPNL01000008), which may suggest chromosomal rearrangements rather than chromosome loss and gain.

FIGURE 3

Sequencing of strains CLIB 3263 and CLIB 3313 showed the same proportion of sequences from H. opuntiae and H. pseudoguilliermondii parents, with the exception of a H. pseudoguilliermondii region of about 576 kb in CLIB3263, which had a coverage ratio of about 1.4 compared to the rest of the genome. Knowing that CLIB 3263 is tetraploid, this suggests that either a segmental or a chromosome duplication occurred in only one of the two H. pseudoguilliermondii homologous chromosomes – but in that case the ratio should be 1.5, not 1.4 – or that CLIB 3263 is a population of heterogeneous cells having undergone or not an event of segmental or chromosome duplication.

Genetic Diversity of Hybrids

To get clues about the event that led to the hybrid formation, we first investigated their MAT loci by comparison to those of reference genomes. H. pseudoguilliermondii ZIM 213 has a single MAT locus with only MATalpha1 gene. To recover the MATa locus we used CBS 8772, which is diploid and possesses both MAT loci. The reference genome of H. opuntiae AWRI 3578 contains a MAT locus with both MATalpha1 and MATa2 genes (Supplementary Figure S2A). Mapping and assembly of the mapped reads of the six hybrids revealed that CCY46-1-3, CCY46-1-3a and DBVPG 5828 had both MATa and MATalpha from H. opuntiae, suggesting that the parental diploids were MATa/MATalpha, whereas CLIB 3101 had only a MATa locus (Supplementary Figure S2B). CCY46-1-3 and CLIB 3101 had also a H. pseudoguilliermondii MATalpha locus whereas CCY46-1-3a and DBVPG 5828 have lost this genomic region, which has been replaced by its H. opuntiae counterpart (Figure 3). CLIB 3263 and CLIB 3313 had a single MAT locus per subgenome, MATa from H. opuntiae and MATalpha from H. pseudoguilliermondii.

Then, we investigated the level of heterozygosity in both subgenomes of each hybrid. A high level of heterozygosity, up to 0.503%, was observed for H. opuntiae subgenome in strains CLIB 3101, DBVPG 5828, CCY46-1-3 and its derivative CCY46-1-3a, which suggests that H. opuntiae parental strains were heterozygous diploids in each case (Table 4). This level is probably under-estimated, as only bi-allelic positions in the population were considered. The distribution of heterozygous positions was not homogeneous along the chromosomes (Figure 4 and Supplementary Figure S9), and showed some regions of loss of heterozygosity (LOH). Interestingly, the largest LOH in CLIB 3101 and DBVPG 5828 covers the same region. In contrast, in these four strains a very low level of heterozygosity was observed for the H. pseudoguilliermondii subgenomes, even in the duplicated or triplicated chromosomal regions, which may correspond to false positive, sequencing errors or single nucleotide mutations after duplication. In strains CLIB 3263 and CLIB 3313, very few heterozygous sites were observed, neither for H. opuntiae, nor for H. pseudoguilliermondii subgenomes. This is coherent with the fact that CLIB 3313 is a diploid with 1n chromosomes from each parent and this suggests that the tetraploid strain CLIB 3263 derives from an auto-diploidization event. This also corroborates the subsequent 576-kb segmental duplication event in CLIB 3263.

TABLE 4

Strain% H. opuntiae% H. pseudoguilliermondii
DBVPG 58280.4060.007
CCY46-1-30.4240.009
CLIB 31010.5030.009
CCY46-1-3a0.3650.003
CLIB 32630.0080.004
CLIB 33130.0090.005

Percentage of heterozygosity in H. opuntiae and H. pseudoguilliermondii subgenomes of hybrids.

FIGURE 4

As MLST marker sequences were almost identical in CLIB 3101 and DBVPG 5828 and as their regions of LOH were at the same genomic position, we suspected that both strains could derive from the same hybridization event. To address this hypothesis, we examined the pairwise allele sharing between strains across the genomic regions present in all hybrids, which represents more than 95% of H. opuntiae and about 30% of H. pseudoguilliermondii reference genomes. It appeared that CLIB 3101 and DBVPG 5828 showed a major divergence regarding their respective H. opuntiae subgenomes. There are 7648 positions with no nucleotide in common (IBS0), and 49,107 positions (IBS1) where only one allele is in common (Figure 5A). These data suggest that the parental diploids were different and consequently that CLIB 3101 and DBVPG 5828 may derive from two distinct hybridization events. The H. pseudoguilliermondii part is much less divergent with only 282 different positions between the two strains, which may be linked to the low level of divergence observed in H. pseudoguilliermondii with the MLST analysis. The same analysis performed on the other interspecific hybrid genomes showed that the three European strains are different between them at least for H. opuntiae subgenomes and very distant from the two strains from Mayotte Island for both subgenomes (Figure 5B).

FIGURE 5

Assessment of the heterozygosity in H. uvarum triploid strain S382-CB revealed 55,355 bi-allelic positions with one allele in common with the reference strain AWRI 3580, 160 bi-allelic positions with both alleles being alternative to the reference allele, and 20 tri-allelic positions. Bi-allelic positions presented a distribution of read coverage of 1/3-2/3, which confirm the triploid nature of the strain (Supplementary Figure S10).

Discussion

The aim of this study was to characterize and to clarify the genetic relationships between strains of a species complex with a method that would facilitate further studies of Hanseniaspora strains, especially strains isolated from vineyards. The results obtained from the MLST approach showed that even if five loci are not representative of the whole genome, our markers are pertinent to discriminate species and therefore hybrids. Indeed, the topology of our multi-species tree is consistent with that of Steenwyk et al. (2019) based on 1,034 orthologous groups, with strong bootstrap values. In comparison, trees based on classical taxonomic markers harbor variable topologies sometimes poorly supported. For instance, the trees of Cadez et al. (2006, 2019) showed different topologies due to the additional use of EF-1α in the latter tree. It is therefore essential to have relevant markers that allow resolving accurately species delineation. Our combination of markers is also sufficiently divergent to unveil the presence of subpopulations, as this is the case for H. guilliermondii populations. However, we failed to establish a clear population structure related to substrate origin or geographical localization except for H. uvarum for which a group of strains isolated from Guyana and a group enriched in strains from La Réunion Island separated from the other strains. Using microsatellite analysis, Albertin et al. (2016) reported some population structure for H. uvarum oenological strains according to geographic origin, i.e., South Africa versus other origins, primarily from France and New Zealand, and to sampling year. However, this clustering might be a fuzzy rule, as some Bordeaux isolates such as strain CRB1430 were identical to South Africa isolates.

Another aspect of our MLST analysis is the assessment of the level of genetic diversity according to species and markers. As expected, the four markers designed in exons of housekeeping genes (GLN4, ADP1, RPN2 and VPS13) were generally less divergent than MET5, which includes an intergenic region, known to be highly variable. It was, however, surprising to reach almost 10% of polymorphic sites in H. opuntiae MET5 with only 26 studied strains, whereas it is only 2.29% in H. guilliermondii with nearly the same number of strains. This clearly denotes a species-specific variability, with a significant difference even in the two most closely related species, H. opuntiae and H. pseudoguilliermondii. Another interesting result emerged from the level of heterozygosity. No heterozygous sites were observed in strains of H. guilliermondii, except in CLIB 3085. This finding does not result from a difference of ploidy, as all of the H. guilliermondii strains were found diploid by flow cytometry, like most of the strains from the other species. This rather indicates a major difference in life style compared to the other species. While we observed a high genetic diversity, a high heterozigosity for most species and some population structure with admixture for H. uvarum, the absence of heterozygosity suggests an absence of random mating in both population 1 and population 2 of H. guilliermondii. A whole genome sequencing strategy at the population level may provide clues to address this hypothesis.

The use of MLST markers compatible with multiple species amplification allowed us to detect interspecific hybrids with a surprisingly high frequency, i.e., six strains among 107 studied strains. Yeast hybrids have been extensively studied in Saccharomyces sister species. They appeared to occur rarely in nature but much more frequently in anthropogenic environments, where they present a great interest for their biotechnological potentials in winemaking and lager brewing (Sipiczki, 2018). With the development of second and third generation sequencing technologies, interspecies hybrids have been reported for a number of other Saccharomycotina species, such as Pichia sorbitophila (Louis et al., 2012), Zygosaccharomyces bailii (Mira et al., 2014; Ortiz-Merino et al., 2017), Zygosaccharomyces parabailii (Braun-Galleani et al., 2018), Zygosaccharomyces rouxii (Gordon and Wolfe, 2008; Solieri et al., 2008; Bizzarri et al., 2019), Dekkera bruxellensis (Borneman et al., 2014), Candida orthopsilosis (Schroder et al., 2016), or Saccharomycopsis fibuligera (Farh et al., 2017).

We exclusively found hybrids between the two closest related species H. opuntiae and H. pseudoguilliermondii. They showed different genomic structures. CLIB 3101, DBVPG 5828 and CCY46-1-3 are allotriploids with different degrees of chimerism. CCY46-1-3a, a single colony of CCY46-1-3, is an allotetraploid. CLIB3313 and CLIB 3263 are allodiploid and allotetraploid, respectively. We propose different scenarios of hybrid creation in Figure 6. Genome sequencing of recently isolated strains CLIB 3101, CLIB 3313 and CLIB 3263 showed an absence of mosaicism between the parental strains. They probably derive from a recent rare mating between a diploid cell of H. opuntiae and a haploid cell of H. pseudoguilliermondii for CLIB 3101 and from a mating between two haploid cells for CLIB 3313 and CLIB 3263. These scenarios are in agreement with the organization of their MAT loci. Indeed, CLIB 3313 and CLIB 3263 are H. opuntiae MATa and H. pseudoguilliermondii MATalpha. CLIB 3101 is homozygous MATa/MATa for the mating type of H. opuntiae subgenome and MATalpha in H. pseudoguilliermondii part, which enables the mating program (Sipiczki, 2018). As there are no silent cassettes in H. opuntiae, there are only two ways to become MATa/MATa, i.e., the conversion of MATalpha to MATa in a diploid cell, or auto-diploidization of a haploid MATa strain. As heterozygosity was observed in H. opuntiae subgenome, we favored the conversion hypothesis. In contrast, DBVPG 5828 and CCY46-1-3 have complex genomes with multiple chimeric chromosomes and unexpectedly they possess both MATa and MATalpha loci in the H. opuntiae subgenome, which suggests more intricate creation and evolution scenarios than in CLIB 3101. Genome sequences and karyotypes of hybrids, which are similar to those of parental species strains, showed that at least some genomic rearrangements occurred by non-reciprocal translocations between chromosomes and replacement of intrachromosomal regions by their homologs from the other subgenomes. Duplication/loss of entire chromosomes may have also occurred but as the reference genomes have not been assembled at the chromosome level (Sternes et al., 2016; Shen et al., 2018), it is therefore difficult to validate it. Both strains have been conserved in collection for 10 years, which addresses the question of the stability and evolution of conserved hybrid strains. Indeed, CCY46-1-3 comprises a population of cells deriving from the initial mating. Isolation of four single colonies showed a unique tetraploid pattern in flow cytometry. Sequencing of CCY46-1-3a showed that, compared to CCY46-1-3, the equivalent of a 1n genome of H. pseudoguilliermondii has been lost, leading to a 2n equivalent genome. This implies that the isolate has been further duplicated by auto-diploidization to form the resulting mosaic allotetraploid (Figure 6). Similarly, as the genome of CLIB 3313 and CLIB 3263, which have been isolated in the same pineapple fermentation, were almost identical in sequence, it is likely that CLIB 3263 derives from CLIB 3313 by auto-duplication. Rare mating leading to allotriploid genomes have been widely reported in Saccharomyces genus, with for instance S. cerevisiae (2n) × S. kudriavzevii (1n) hybrid VIN7 (Borneman et al., 2012) or S. cerevisiae (1n) xS. eubayanus (2n) strains of S. pastorianus group I/Saaz-type (Monerawela and Bond, 2017). D. bruxellensis is another species example in which AWRI 1499 and AWRI 1608 are D. bruxellensis (2n) × Dekkera sp.x (1n) and D. bruxellensis (2n) × Dekkera sp.y (1n), respectively, with Dekkera sp.x and Dekkera sp.y two distinct unknown species (Borneman et al., 2014).

FIGURE 6

Hybridization is known to provide selective advantage in a stressful environment or to cumulate advantageous properties such as the ability to grow at low temperature together with robust fermentation characteristics as reported for the lager yeast S. pastorianus (Gibson and Liti, 2015). It is also a mechanism for restoration of fertility (Ortiz-Merino et al., 2017; Watanabe et al., 2017; Braun-Galleani et al., 2018). To get clues about the benefit obtained by Hanseniaspora hybrids, we investigated the differences in gene content of particular regions between both subgenomes. Thus, we investigated the H. pseudoguilliermondii triplicated 126-kb region in CCY46-1-3a. By comparing reference genomes, we found 68 genes in H. opuntiae and 67 homologs in H. pseudoguilliermondii. The difference comes from two consecutive genes g2511 and g2512 corresponding to pseudogenes in H. opuntiae, which form a single gene in H. pseudoguilliermondii encoding the peroxisomal biogenesis factor 6. Whether the replacement of this pseudogene by its functional counterpart was the reason of this triplication will require further experimental data. Similarly, we investigated the 576-kb region that has been duplicated in H. pseudoguilliermondii subgenome of CLIB 3263. 312 putative protein-coding genes were annotated, 45 were specific to Hanseniaspora genus with unknown function. For the other genes, we considered the function of S. cerevisiae homologs, but failed to identify any bias, except a putative overrepresentation of mitochondrial proteins.

Conclusion

This study provides efficient markers that could be used to identify rapidly species and hybrids from closely related Hanseniaspora recovered from grapes and musts. Our approach revealed the presence of some putative population structure in three species, and showed differences in the species lifestyle, which make it an interesting yeast species cluster to explore further, especially in the context of the potential adaptation to the wine environment. As in many other Saccharomycotina yeasts, we have found hybrid genomes in Hanseniaspora. The presence of different types and origins of these strains, i.e., allotriploids with different degrees of mosaicism, allodiploids and allotetraploids, is one of the most interesting aspect. Moreover, the genomes seem particularly instable with frequent auto-diploidization. This is therefore a unique model to study the evolution and stability of genomes in a genetic context that do not derives from the whole genome duplication.

Statements

Data availability statement

Sequencing reads of H. opuntiae × H. pseudoguilliermondii hybrids have been deposited at the EMBL-ENA under the accession numbers ERR3456268, ERR3456265, ERR3456264, ERR3456266, ERR3456262, and ERR3456263 (project PRJEB33345).

Author contributions

CN conceived the project. MS, CN, and J-LL designed the experiments. MS, CB, CG, MP, and CN performed the experiments. HD, LP, JL-L, MS, and CN performed the bioinformatics analyses. MS and CN wrote the first draft. HD, LP, J-LL, CG, MS, and CN edited the manuscript. All authors read and approved the final version.

Funding

This work was supported by the French National Research Agency (ANR) through the project PeakYeast (ANR-15-CE20-0010).

Acknowledgments

We thank Waren Albertin, Michael Brysch-Herzberg, Elena Serviené, Delphine Sicard, Matthias Sipiczki, Renata Vadkertiova, as well as Pernod SAS and the NRRL for giving us Hanseniaspora strains.

Conflict of interest

The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.

Supplementary material

The Supplementary Material for this article can be found online at: https://www.frontiersin.org/articles/10.3389/fmicb.2019.02960/full#supplementary-material

References

  • 1

    AlbertinW.SetatiM. E.Miot-SertierC.MostertT. T.Colonna-CeccaldiB.CoulonJ.et al (2016). Hanseniaspora uvarum from winemaking environments show spatial and temporal genetic clustering.Front. Microbiol.6:1569. 10.3389/fmicb.2015.01569

  • 2

    AyoubM. J.LegrasJ. L.SalibaR.GaillardinC. (2006). Application of multi locus sequence typing to the analysis of the biodiversity of indigenous Saccharomyces cerevisiae wine yeasts from Lebanon.J. Appl. Microbiol.100699711. 10.1111/j.1365-2672.2006.02817.x

  • 3

    BagheriB.ZambelliP.VigentiniI.BauerF. F.SetatiM. E. (2018). Investigating the effect of selected non-saccharomyces species on wine ecosystem function and major volatiles.Front. Bioeng. Biotechnol.6:169. 10.3389/fbioe.2018.00169

  • 4

    BankevichA.NurkS.AntipovD.GurevichA. A.DvorkinM.KulikovA. S.et al (2012). SPAdes: a new genome assembly algorithm and its applications to single-cell sequencing.J. Comput. Biol.19455477. 10.1089/cmb.2012.0021

  • 5

    BizzarriM.CassanelliS.BartoliniL.PryszczL. P.DuskovaM.SychrovaH.et al (2019). Interplay of chimeric mating-type loci impairs fertility rescue and accounts for intra-strain variability in zygosaccharomyces rouxii interspecies Hybrid ATCC42981.Front. Genet.10:137. 10.3389/fgene.2019.00137

  • 6

    BoekhoutT.KurtzmanC. P.O’donnellK.SmithM. T. (1994). Phylogeny of the yeast genera Hanseniaspora (anamorph Kloeckera), Dekkera (anamorph Brettanomyces), and Eeniella as inferred from partial 26S ribosomal DNA nucleotide sequences.Int. J. Syst. Bacteriol.44781786. 10.1099/00207713-44-4-781

  • 7

    BokulichN. A.ThorngateJ. H.RichardsonP. M.MillsD. A. (2014). Microbial biogeography of wine grapes is conditioned by cultivar, vintage, and climate.Proc. Natl. Acad. Sci. U.S.A.111E139E148. 10.1073/pnas.1317377110

  • 8

    BornemanA. R.DesanyB. A.RichesD.AffourtitJ. P.ForganA. H.PretoriusI. S.et al (2012). The genome sequence of the wine yeast VIN7 reveals an allotriploid hybrid genome with Saccharomyces cerevisiae and Saccharomyces kudriavzevii origins.FEMS Yeast Res.128896. 10.1111/j.1567-1364.2011.00773.x

  • 9

    BornemanA. R.ZeppelR.ChambersP. J.CurtinC. D. (2014). Insights into the Dekkera bruxellensis genomic landscape: comparative genomics reveals variations in ploidy and nutrient utilisation potential amongst wine isolates.PLoS Genet.10:e1004161. 10.1371/journal.pgen.1004161

  • 10

    BougnouxM. E.MorandS.D’enfertC. (2002). Usefulness of multilocus sequence typing for characterization of clinical isolates of Candida albicans.J. Clin. Microbiol.4012901297.

  • 11

    Braun-GalleaniS.Ortiz-MerinoR. A.WuQ.XuY.WolfeK. H. (2018). Zygosaccharomyces pseudobailii, another yeast interspecies hybrid that regained fertility by damaging one of its MAT loci.FEMS Yeast Res.18:foy079. 10.1093/femsyr/foy079

  • 12

    CadezN.BelloraN.UlloaR.HittingerC. T.LibkindD. (2019). Genomic content of a novel yeast species Hanseniaspora gamundiae sp. nov. from fungal stromata (Cyttaria) associated with a unique fermented beverage in Andean Patagonia. Argentina.PLoS One14:e0210792. 10.1371/journal.pone.0210792

  • 13

    CadezN.RasporP.De CockA. W.BoekhoutT.SmithM. T. (2002). Molecular identification and genetic diversity within species of the genera Hanseniaspora and Kloeckera.FEMS Yeast Res.1279289. 10.1016/s1567-1356(01)00041-1

  • 14

    CadezN.RasporP.SmithM. T. (2006). Phylogenetic placement of hanseniaspora-kloeckera species using multigene sequence analysis with taxonomic implications: descriptions of Hanseniaspora pseudoguilliermondii sp. nov. and Hanseniaspora occidentalis var. citrica var. nov.Int. J. Syst. Evol. Microbiol.5611571165. 10.1099/ijs.0.64052-0

  • 15

    CapozziV.GarofaloC.ChiriattiM. A.GriecoF.SpanoG. (2015). Microbial terroir and food innovation: the case of yeast biodiversity in wine.Microbiol. Res.1817583. 10.1016/j.micres.2015.10.005

  • 16

    ChenS.ZhouY.ChenY.GuJ. (2018). fastp: an ultra-fast all-in-one FASTQ preprocessor.Bioinformatics34i884i890. 10.1093/bioinformatics/bty560

  • 17

    CingolaniP.PatelV. M.CoonM.NguyenT.LandS. J.RudenD. M.et al (2012). Using drosophila melanogaster as a model for genotoxic chemical mutational studies with a new program. SnpSift.Front. Genet.3:35. 10.3389/fgene.2012.00035

  • 18

    Cioch-SkonecznyM.SatoraP.SkotnicznyM.SkonecznyS. (2018). Quantitative and qualitative composition of yeast microbiota in spontaneously fermented grape musts obtained from cool climate grape varieties ‘Rondo’ and ‘Regent’.FEMS Yeast Res.18:foy089. 10.1093/femsyr/foy089

  • 19

    CorpetF. (1988). Multiple sequence alignment with hierarchical clustering.Nucleic Acids Res.161088110890. 10.1093/nar/16.22.10881

  • 20

    DavidP.PujolB.ViardF.CastellaV.GoudetJ. (2007). Reliable selfing rate estimates from imperfect population genetic data.Mol. Ecol.1624742487. 10.1111/j.1365-294x.2007.03330.x

  • 21

    Drumonde-NevesJ.Franco-DuarteR.LimaT.SchullerD.PaisC. (2016). Yeast biodiversity in vineyard environments is increased by human intervention.PLoS One11:e0160579. 10.1371/journal.pone.0160579

  • 22

    EscribanoR.Gonzalez-ArenzanaL.PortuJ.GarijoP.Lopez-AlfaroI.LopezR.et al (2018). Wine aromatic compound production and fermentative behaviour within different non-Saccharomyces species and clones.J. Appl. Microbiol.12415211531. 10.1111/jam.13735

  • 23

    FarhM. E.ChoY.LimJ. Y.SeoJ. A. (2017). A diversity study of Saccharomycopsis fibuligera in rice wine starter nuruk, reveals the evolutionary process associated with its interspecies hybrid.J. Microbiol.55337343. 10.1007/s12275-017-7115-y

  • 24

    FischerG.JamesS. A.RobertsI. N.OliverS. G.LouisE. J. (2000). Chromosomal evolution in Saccharomyces.Nature405451454. 10.1038/35013058

  • 25

    GameroA.QuintillaR.GroenewaldM.AlkemaW.BoekhoutT.HazelwoodL. (2016). High-throughput screening of a large collection of non-conventional yeasts reveals their potential for aroma formation in food fermentation.Food Microbiol.60147159. 10.1016/j.fm.2016.07.006

  • 26

    GaoH.WilliamsonS.BustamanteC. D. (2007). A markov chain monte carlo approach for joint inference of population structure and inbreeding rates from multilocus genotype data.Genetics17616351651. 10.1534/genetics.107.072371

  • 27

    GibsonB.LitiG. (2015). Saccharomyces pastorianus: genomic insights inspiring innovation for industry.Yeast321727. 10.1002/yea.3033

  • 28

    GordonJ. L.WolfeK. H. (2008). Recent allopolyploid origin of Zygosaccharomyces rouxii strain ATCC 42981.Yeast25449456. 10.1002/yea.1598

  • 29

    GouyM.GuindonS.GascuelO. (2010). SeaView version 4: a multiplatform graphical user interface for sequence alignment and phylogenetic tree building.Mol. Biol. Evol.27221224. 10.1093/molbev/msp259

  • 30

    GuindonS.DelsucF.DufayardJ. F.GascuelO. (2009). Estimating maximum likelihood phylogenies with PhyML.Methods Mol. Biol.537113137. 10.1007/978-1-59745-251-9_6

  • 31

    JacquesN.MalletS.LaaghouitiF.TinsleyC. R.CasaregolaS. (2017). Specific populations of the yeast Geotrichum candidum revealed by molecular typing.Yeast34165178. 10.1002/yea.3223

  • 32

    JollyN. P.VarelaC.PretoriusI. S. (2014). Not your ordinary yeast: non-Saccharomyces yeasts in wine production uncovered.FEMS Yeast Res.14215237. 10.1111/1567-1364.12111

  • 33

    JombartT. (2008). adegenet: a R package for the multivariate analysis of genetic markers.Bioinformatics2414031405. 10.1093/bioinformatics/btn129

  • 34

    JombartT.AhmedI. (2011). adegenet 1.3-1: new tools for the analysis of genome-wide SNP data.Bioinformatics2730703071. 10.1093/bioinformatics/btr521

  • 35

    KimD.LangmeadB.SalzbergS. L. (2015). HISAT: a fast spliced aligner with low memory requirements.Nat. Methods12357360. 10.1038/nmeth.3317

  • 36

    KopelmanN. M.MayzelJ.JakobssonM.RosenbergN. A.MayroseI. (2015). Clumpak: a program for identifying clustering modes and packaging population structure inferences across K.Mol. Ecol. Resour.1511791191. 10.1111/1755-0998.12387

  • 37

    LangdonQ. K.PerisD.KyleB.HittingerC. T. (2018). sppIDer: a species identification tool to investigate hybrid genomes with high-throughput sequencing.Mol. Biol. Evol.3528352849. 10.1093/molbev/msy166

  • 38

    LiH.HandsakerB.WysokerA.FennellT.RuanJ.HomerN.et al (2009). The sequence Alignment/Map format and SAMtools.Bioinformatics2520782079. 10.1093/bioinformatics/btp352

  • 39

    LischerH. E.ExcoffierL. (2012). PGDSpider: an automated data conversion tool for connecting population genetics and genomics programs.Bioinformatics28298299. 10.1093/bioinformatics/btr642

  • 40

    LorenziniM.ZapparoliG. (2019). Yeast-like fungi and yeasts in withered grape carposphere: characterization of Aureobasidium pullulans population and species diversity.Int. J. Food Microbiol.289223230. 10.1016/j.ijfoodmicro.2018.10.023

  • 41

    LouisV. L.DesponsL.FriedrichA.MartinT.DurrensP.CasaregolaS. (2012). Pichia sorbitophila, an interspecies yeast hybrid, reveals early steps of genome resolution after polyploidization.G32299311. 10.1534/g3.111.000745

  • 42

    MaidenM. C.BygravesJ. A.FeilE.MorelliG.RussellJ. E.UrwinR.et al (1998). Multilocus sequence typing: a portable approach to the identification of clones within populations of pathogenic microorganisms.Proc. Natl. Acad. Sci. U.S.A.9531403145. 10.1073/pnas.95.6.3140

  • 43

    MarcaisG.DelcherA. L.PhillippyA. M.CostonR.SalzbergS. L.ZiminA. (2018). MUMmer4: a fast and versatile genome alignment system.PLoS Comput. Biol.14:e1005944. 10.1371/journal.pcbi.1005944

  • 44

    MartinV.ValeraM. J.MedinaK.BoidoE.CarrauF. (2018). Oenological impact of the Hanseniaspora/Kloeckera yeast genus on wines—a review.Fermentation4:76. 10.3390/fermentation4030076

  • 45

    McKennaA.HannaM.BanksE.SivachenkoA.CibulskisK.KernytskyA.et al (2010). The genome analysis toolkit: a MapReduce framework for analyzing next-generation DNA sequencing data.Genome Res.2012971303. 10.1101/gr.107524.110

  • 46

    MiraN. P.MunsterkotterM.Dias-ValadaF.SantosJ.PalmaM.RoqueF. C.et al (2014). The genome sequence of the highly acetic acid-tolerant Zygosaccharomyces bailii-derived interspecies hybrid strain ISA1307, isolated from a sparkling wine plant.DNA Res.21299313. 10.1093/dnares/dst058

  • 47

    MonerawelaC.BondU. (2017). Brewing up a storm: the genomes of lager yeasts and how they evolved.Biotechnol. Adv.35512519. 10.1016/j.biotechadv.2017.03.003

  • 48

    MunozR.GomezA.RoblesV.RodriguezP.CebolleroE.TaberaL.et al (2009). Multilocus sequence typing of oenological Saccharomyces cerevisiae strains.Food Microbiol.26841846. 10.1016/j.fm.2009.05.009

  • 49

    NeiM.LiW. H. (1979). Mathematical model for studying genetic variation in terms of restriction endonucleases.Proc. Natl. Acad. Sci. U.S.A.7652695273. 10.1073/pnas.76.10.5269

  • 50

    NguyenH. V.BoekhoutT. (2017). Characterization of Saccharomyces uvarum (Beijerinck, 1898) and related hybrids: assessment of molecular markers that predict the parent and hybrid genomes and a proposal to name yeast hybrids.FEMS Yeast Res.17:fox014. 10.1093/femsyr/fox014

  • 51

    OddsF. C.JacobsenM. D. (2008). Multilocus sequence typing of pathogenic Candida species.Eukaryot. Cell710751084.

  • 52

    Ortiz-MerinoR. A.KuanyshevN.Braun-GalleaniS.ByrneK. P.PorroD.BranduardiP.et al (2017). Evolutionary restoration of fertility in an interspecies hybrid yeast, by whole-genome duplication after a failed mating-type switch.PLoS Biol.15:e2002128. 10.1371/journal.pbio.2002128

  • 53

    OuobaL. I.NielsenD. S.AnyoguA.KandoC.DiawaraB.JespersenL.et al (2015). Hanseniaspora jakobsenii sp. nov., a yeast isolated from Bandji, a traditional palm wine of Borassus akeassii.Int. J. Syst. Evol. Microbiol.6535763579. 10.1099/ijsem.0.000461

  • 54

    PadillaB.GilJ. V.ManzanaresP. (2016). Past and future of non-saccharomyces yeasts: from spoilage microorganisms to biotechnological tools for improving wine aroma complexity.Front. Microbiol.7:411. 10.3389/fmicb.2016.00411

  • 55

    PfeiferB.WittelsburgerU.Ramos-OnsinsS. E.LercherM. J. (2014). PopGenome: an efficient Swiss army knife for population genomic analyses in R.Mol. Biol. Evol.3119291936. 10.1093/molbev/msu136

  • 56

    PritchardJ. K.StephensM.DonnellyP. (2000). Inference of population structure using multilocus genotype data.Genetics155945959.

  • 57

    PurcellS.NealeB.Todd-BrownK.ThomasL.FerreiraM. A.BenderD.et al (2007). PLINK: a tool set for whole-genome association and population-based linkage analyses.Am. J. Hum. Genet.81559575. 10.1086/519795

  • 58

    QuinlanA. R.HallI. M. (2010). BEDTools: a flexible suite of utilities for comparing genomic features.Bioinformatics26841842. 10.1093/bioinformatics/btq033

  • 59

    Raymond EderM. L.ReynosoC.LauretS. C.RosaA. L. (2017). Isolation and identification of the indigenous yeast population during spontaneous fermentation of isabella (Vitis labrusca L.) Grape Must.Front. Microbiol.8:532. 10.3389/fmicb.2017.00532

  • 60

    RutherfordK.ParkhillJ.CrookJ.HorsnellT.RiceP.RajandreamM. A.et al (2000). Artemis: sequence visualization and annotation.Bioinformatics16944945. 10.1093/bioinformatics/16.10.944

  • 61

    SchroderM. S.Martinez De San VicenteK.PrandiniT. H.HammelS.HigginsD. G.BagagliE.et al (2016). Multiple origins of the pathogenic yeast candida orthopsilosis by separate hybridizations between two parental species.PLoS Genet.12:e1006404. 10.1371/journal.pgen.1006404

  • 62

    SeixasI.BarbosaC.Mendes-FaiaA.GuldenerU.TenreiroR.Mendes-FerreiraA.et al (2019). Genome sequence of the non-conventional wine yeast Hanseniaspora guilliermondii UTAD222 unveils relevant traits of this species and of the Hanseniaspora genus in the context of wine fermentation.DNA Res.266783. 10.1093/dnares/dsy039

  • 63

    ShenX. X.OpulenteD. A.KominekJ.ZhouX.SteenwykJ. L.BuhK. V.et al (2018). Tempo and mode of genome evolution in the budding yeast subphylum.Cell175:1533-1545.e20. 10.1016/j.cell.2018.10.023

  • 64

    SipiczkiM. (2018). Interspecies hybridisation and genome chimerisation in saccharomyces: combining of gene pools of species and its biotechnological perspectives.Front. Microbiol.9:3071. 10.3389/fmicb.2018.03071

  • 65

    SolieriL.CassanelliS.CroceM. A.GiudiciP. (2008). Genome size and ploidy level: new insights for elucidating relationships in Zygosaccharomyces species.Fungal Genet. Biol.4515821590. 10.1016/j.fgb.2008.10.001

  • 66

    SteenwykJ. L.OpulenteD. A.KominekJ.ShenX.-X.ZhouX.LabellaA. L.et al (2019). Extensive loss of cell cycle and DNA repair genes in an ancient lineage of bipolar budding yeasts.BioRxiv[Preprint]. 10.1371/journal.pbio.3000255

  • 67

    SternesP. R.LeeD.KutynaD. R.BornemanA. R. (2016). Genome sequences of three species of hanseniaspora isolated from spontaneous wine fermentations.Genome Announc.4:e1287-16. 10.1128/genomeA.01287-16

  • 68

    TittarelliF.VarelaJ. A.GethinsL.StantonC.RossR. P.SuzziG.et al (2018). Development and implementation of multilocus sequence typing to study the diversity of the yeast Kluyveromyces marxianus in Italian cheeses.Microb. Genom.4:e000153. 10.1099/mgen.0.000153

  • 69

    VarelaC.BornemanA. R. (2017). Yeasts found in vineyards and wineries.Yeast34111128. 10.1002/yea.3219

  • 70

    VezinhetF.BlondinB.HalletJ. N. (1990). Chromosomal DNA patterns and mitochondrial DNA polymorphism as tools for identification of enological strains of Saccharomyces cerevisiae.Appl. Microbiol. Biotechnol.32568571.

  • 71

    VigentiniI.De LorenzisG.FabrizioV.ValdetaraF.FaccincaniM.PanontC. A.et al (2015). The vintage effect overcomes the terroir effect: a three year survey on the wine yeast biodiversity in franciacorta and oltrepo pavese, two northern Italian vine-growing areas.Microbiology161362373. 10.1099/mic.0.000004

  • 72

    WangC.Garcia-FernandezD.MasA.Esteve-ZarzosoB. (2015). Fungal diversity in grape must and wine fermentation assessed by massive sequencing, quantitative PCR and DGGE.Front. Microbiol.6:1156. 10.3389/fmicb.2015.01156

  • 73

    WatanabeJ.UeharaK.MogiY.TsukiokaY. (2017). Mechanism for restoration of fertility in hybrid zygosaccharomyces rouxii generated by interspecies hybridization.Appl. Environ. Microbiol.83:AEM.1187-17. 10.1128/AEM.01187-17

Summary

Keywords

MLST, yeast, biodiversity, evolution, Hanseniaspora uvarum, Hanseniaspora guilliermondii

Citation

Saubin M, Devillers H, Proust L, Brier C, Grondin C, Pradal M, Legras J-L and Neuvéglise C (2020) Investigation of Genetic Relationships Between Hanseniaspora Species Found in Grape Musts Revealed Interspecific Hybrids With Dynamic Genome Structures. Front. Microbiol. 10:2960. doi: 10.3389/fmicb.2019.02960

Received

19 September 2019

Accepted

09 December 2019

Published

15 January 2020

Volume

10 - 2019

Edited by

Matthias Sipiczki, University of Debrecen, Hungary

Reviewed by

Lisa Solieri, University of Modena and Reggio Emilia, Italy; Giuseppe Blaiotta, Università degli Studi di Napoli Federico II, Italy; Anthony Borneman, The Australian Wine Research Institute, Australia

Updates

Copyright

*Correspondence: Cécile Neuvéglise,

This article was submitted to Food Microbiology, a section of the journal Frontiers in Microbiology

Disclaimer

All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article or claim that may be made by its manufacturer is not guaranteed or endorsed by the publisher.

Outline

Figures

Cite article

Copy to clipboard


Export citation file


Share article

Article metrics