1. Introduction
Yarrowia lipolytica is a dimorphic and oleaginous yeast from the Saccharomycetes/Hemiascomycetes class, which strains are found in various natural or oil-polluted environments as well as in many foodstuffs, notably in fermented dairy and meat products [
1,
2]. This industrially-relevant yeast has been used since the 1950s as cattle feed, applied to bioremediation and, more recently, engineered for multiple white biotechnology applications, as extensively reviewed previously [
2,
3,
4,
5,
6,
7,
8,
9]. As a biosafety class 1 microorganism,
Y. lipolytica has gained a GRAS (Generally Regarded as Safe) status (FDA, USA) [
1]. Most natural isolates are haploid (
MATA or
MATB), as are nearly all laboratory strains, although diploidisation can be used as a strategy for assembling interesting strain characteristics [
2]. The French strain W29 (aka CLIB 89 and CBS 7504) and a series of its genetically engineered derivatives, Po1d, Po1f, Po1g and Po1h strains, are among the more widely used
Y. lipolytica strains [
2]. Since the first sequencing of a
Y. lipolytica genome (from E150, aka CLIB 122, a derivative of a crossing between W29 and the American strain CBS 6124-2, aka CLIB 78), by the Génolevures Consortium in 2004 [
10], a total of 27 sequences of
Y. lipolytica genomes have been made publicly available [
11]. These include the sequence of W29 in 2016 [
12], and that of its most used derivative, Po1f, in 2014 [
13].
A very large array of expression/secretion/surface display vectors and other engineering tools have been developed since a few decades for use in
Y. lipolytica, and a number of them have been made publicly available through the Addgene website (
https://www.addgene.org), as extensively reviewed in 2021 [
14]. These tools include notably tunable recombinant promoters, various toolkits for vector construction making use of Golden Gate technology, disruption and marker rescue systems, transposomics tools and various CRISPR-based genome engineering applications [
14]. Since then,
Y. lipolytica technology has continued its exponential development, with the mining of new promoters [
15], strategies for hybrid promoter engineering [
16], new CRISPR/Cas9-mediated integration toolkits [
17,
18], the YALIcloneNHEJ and YALIcloneHR modular cloning toolkits for integrating multigene pathways [
19,
20], an iterative gene integration system [
21], an improved CRISPR/Cas9 genome-editing system [
22], the design of the searchable YaliFunTome database for transcription factors driving a phenotype of interest [
23], and the dissection of W29 carbon metabolism using genome-scale metabolic modelling [
24]. Most of these recent works, as well as a lot of their predecessors, make use of W29, or one of its derivative of the Po1 series, as recipient strain.
Despite the present availability of several different modular cloning toolkits for the design of expression vectors for complex engineering of
Y. lipolytica metabolic pathways, the authors consider that there remains a need for simpler, ready-to-use expression vectors, for establishing proofs of concepts, or for dedicated uses. Such “Yarrowia for dummies” tools could either help beginners to get started with yeast engineering and test rapidly the expression of their genes, or constitute an easy-to-use alternative for projects involving a single gene, like protein engineering. Some vectors developed two decades ago at INRA (presently INRAE), based on an efficient recombinant promoter, are good candidates for such a purpose. Their common promoter, hp4d, is a construct composed of 4 direct copies of an UAS (upstream activating sequence) from
Y. lipolytica XPR2 promoter, linked to a TATA box from
YlLEU2 gene [
25]. This strong promoter is mainly active at the beginning of stationary phase, allowing a natural decoupling of growth and expression phases, which made it particularly successful for heterologous expression and one of the prominent tools for
Y. lipolytica engineering, as reviewed previously [
6,
14]. Two different expression systems using hp4d were constructed: (i) a series of expression/secretion vectors based on the pBR322 bacterial plasmid, designed to be targeted to a pBR docking platform integrated into the Po1g recipient strain [
25]; (ii) a series of expression/secretion/surface display vectors from which the bacterial moiety can be removed to liberate a yeast cassette (expression cassette + selective marker) bordered by zeta sequences (LTRs - long terminal repeats - from Ylt1 retrotransposon) that is used for transformation [
26,
27,
28]. The pBR-based expression system, characterized by a high transformation efficiency and an efficient targeting to the genomic pBR docking platform by homologous recombination, is commercialized by Yeastern Biotech Co (Taiwan). It has been successfully used for heterologous protein production [
29], and is well adapted for protein engineering since the precise targeting of a single copy allows a direct comparison of the characteristics of the transformants obtained with different constructs [
30,
31]. However, the fact that some bacterial DNA from pBR322 is present in the final construct strongly limits the use of this expression system for a number of applications. In contrast, the zeta-based expression system generates final constructs devoid of any bacterial DNA, which thus retain the GRAS status of
Y. lipolytica. The zeta-bordered yeast cassette, obtained by
NotI digestion of any zeta-based vector, carries a URA3 allele as selective marker and can be integrated either by homologous recombination in a Ura
- recipient strain bearing the Ylt1 retrotransposon, or by NHEJ (non homologous end joining) in a Ura
- recipient strain devoid of Ylt1 (as are most
Y. lipolytica strains, notably W29 and its Po1 derivatives). In that latter case of random integration, the efficiency of transformation is reduced by approximately an order of magnitude and the insertion of the yeast cassette can possibly disrupt a cell function. In addition, the level of expression of the heterologous gene may be affected (positively or negatively) by nearby genomic sequences [
27]. Therefore, a high number of transformants need to be checked in order to select one which growth is not affected and which is a good producer of the protein of interest. The characteristics of the transformants may thus vary for a same construct, which impair the direct comparison of the effect of different constructs. Despite these limitations, the zeta-based expression system has been transferred to numerous laboratories around the world, through more than hundred MTAs (material transfer agreement), and remains still used nowadays.
The purpose of the present work is to combine the advantages of these two expression sytems: (i) the efficiency and reliability of targeted integration, like with the pBR-based system, and (ii) the large array of vectors and the absence of bacterial DNA, like with the zeta-based system. This can be achieved by constructing some new strains equipped with an integrated zeta docking platform, in order to be able to integrate the
NotI yeast cassette of (pre-existing or new) zeta-based vectors by homologous recombination, at a precise genomic location. The authors choose to integrate a zeta docking platform into two derivative of W29, the Po1a and the protease-free Po1f strains, in order to generate respectively the Po1z and the Po1zΔP strains. As its parent strain Po1f, Po1zΔP is deleted for both extracellular proteases. As the other GM strains of the Po1 series, both Po1z and Po1zΔP carry a
ura3-302 allele that allows them to metabolize sucrose-based substrates, such as molasses [
32].
The newly designed strains have been tested, and the interest of targeting integration of yeast cassettes has been validated, through two examples of (sur)expression of a secreted homologous protein, respectively the Y. lipolytica CTS1 chitinase in Po1z, and the AEP protease in Po1zΔP.
4. Discussion
Through the examples of surexpression of the Y. lipolytica CTS1 chitinase in Po1z, and of expression/secretion of the AEP protease in Po1zΔP, this work demonstrated the interest of targeting the integration of yeast cassettes at the docking platform of these new zeta-bearing strains: transformation efficiencies were considerably enhanced and the transformants obtained exhibited more reliable and reproducible performances.
Both of these new strains could be transformed with high efficiency by the yeast cassette of any zeta-based vector, through homologous recombination with their zeta docking platform. Depending of the nature of each research project, either the less modified Po1z, or the more modified protease-free Po1zΔP, could be preferred as the recipient strain. The Po1zΔP strain appears more adapted to the production of secreted heterologous proteins, due to the absence of native extracellular proteases likely to degrade the protein of interest.
The array of zeta-based vectors available, already encompassing vectors for intracellular expression, for expression/secretion or for surface display in
Y. lipolytica, was completed with vectors allowing a more powerful expression through promoter engineering, as shown in
Table 2.
In order to make this user-friendly zeta-targeted expression system freely available to all, the newly designed
Y. lipolytica strains and vectors were deposited at CIRM-Levures (
https://cirm-levures.bio-aware.com). Numerous
Y. lipolytica strains, including W29 and the whole Po1 series are already available, through MTAs, from the CIRM-Levures catalog. A range of vectors for expression, secretion and surface display in
Y. lipolytica (including the whole series of zeta-based vectors) are also available; they may not appear yet on the webpage (the addition of a new tab for
Y. lipolytica engineering tools being still under progress when this paper was submitted) but can nevertheless be requested from CIRM-Levures (contact: cirmlevures@inrae.fr).
The new zeta-targeted expression system developed in this work has been fully validated through the expression examples described. Together with the whole series of zeta-based vectors, the new zeta-bearing strains constitute a user-friendly and reliable expression system, particularly adapted to basic metabolic engineering or to protein engineering projects.