Trimming the genomic fat: minimising and re-functionalising genomes using synthetic biology

Xu, Xin; Meier, Felix; Blount, Benjamin A.; Pretorius, Isak S.; Ellis, Tom; Paulsen, Ian T.; Williams, Thomas C.

doi:10.1038/s41467-023-37748-7

Download PDF

Review Article
Open access
Published: 08 April 2023

Trimming the genomic fat: minimising and re-functionalising genomes using synthetic biology

Nature Communications volume 14, Article number: 1984 (2023) Cite this article

7764 Accesses
8 Citations
43 Altmetric
Metrics details

Subjects

Abstract

Naturally evolved organisms typically have large genomes that enable their survival and growth under various conditions. However, the complexity of genomes often precludes our complete understanding of them, and limits the success of biotechnological designs. In contrast, minimal genomes have reduced complexity and therefore improved engineerability, increased biosynthetic capacity through the removal of unnecessary genetic elements, and less recalcitrance to complete characterisation. Here, we review the past and current genome minimisation and re-functionalisation efforts, with an emphasis on the latest advances facilitated by synthetic genomics, and provide a critical appraisal of their potential for industrial applications.

Application of combinatorial optimization strategies in synthetic biology

Article Open access 15 May 2020

Building genomes to understand biology

Article Open access 02 December 2020

The automated Galaxy-SynBioCAD pipeline for synthetic biology design and engineering

Article Open access 29 August 2022

Introduction

Modern genomes represent the culmination of ~3.8 billion years of life’s evolution on Earth and encode the stunning complexity we observe in the biosphere. Despite ~70 years of molecular biology research, we are still yet to understand precisely how this complexity is encoded in a genome. Our complete understanding is hindered by two major factors. Firstly, genomes have evolved to facilitate reproduction and survival in diverse environments, through incompletely characterised mechanisms, and thus appear to be incredibly complex to us. Furthermore, under laboratory conditions, a large proportion of genes are individually non-essential^1,2,3,4, yet essential when removed combinatorially. Adding to these challenges is the fact that a significant proportion of genes in any given genome have functions that are yet to be defined. For example, in the genome of Escherichia coli, only 48.9% of genes have been characterized, while in the genome of Saccharomyces cerevisiae, over 1000 of the ~6000 genes have unknown functions^1,5,6. To explain this phenomenon, it is hypothesised that genes with unknown functions are either redundant, or their functions are not needed in the lab conditions, and are only important under specific conditions. Secondly, genetic interactions and regulations are overwhelmingly complex, and emergent phenomena are difficult to define in well-studied organisms, let alone rationally designed ones^7,8. As biotechnological capabilities advance, scientists are not only working to understand the biology, but are also increasingly ambitious in engineering biological systems to solve existing problems. Synthetic biology is a young interdisciplinary field that combines biology with cutting-edge engineering techniques and can benefit agricultural, manufacturing, fuel, environmental and medical sectors. With advances in synthetic biology, complex heterologous pathways have been engineered for wide applications, including the production of value-added molecules, the utilisation of inexpensive nutrition sources and the detection of pollutants or diseases. However, the complexity of biological systems has hindered our ability to modify an existing genome, which might result in genetic incompatibility, instability of the heterologous pathways, and low product yields due to competition for cellular resources. Unexpected results are often obtained after rational engineering, and thus require a laborious trial and error process. As whole-genome synthesis becomes achievable and cheaper, one solution is to unlock a more complete understanding of biology at the genomic level by construction of minimal genomes. Theoretically, a minimal genome consists of the smallest possible number of genes required to support a living cell under a defined set of conditions. Minimal genomes are therefore almost as difficult to define as they are to create, since a minimal gene set will vary according to the environment and construction method. In practice, genome minimisation is more commonly aimed at building a genome with a reduced set of genes relative to its wild-type counterpart, rather than the absolute lowest number of genes. These genomes are intended to be easier to understand and engineer, have fewer uncharacterised genetic elements, and less complex regulatory networks.

Engineering minimal genomes will also facilitate a greater understanding of fundamental genome biology. These methods will allow us to understand what constitutes the minimal genome requirement of a functional cell in different contexts. Synthetic minimal genomes will also provide insights into the extent to which genomes can be defragmented and refactored, the roles of non-coding DNA and repetitive elements, and the extent to which global epigenetic regulation can be engineered through genome redesign. Moreover, they can serve as a simplified and superior cell chassis for biotechnological applications due to improved stability, increased predictability via modelling, and greater biosynthetic capacity (Fig. 1).

**Fig. 1: Applications of synthetic and synthetic-minimal genomes.**

Here we review past and current genome minimisation efforts, with a focus on the novel genome minimisation strategies enabled by cutting-edge synthetic genomics technologies. Pros and cons of constructing a minimal genome are considered carefully, and the future scope and applications of minimal genomes are discussed.

Top-down non-synthetic genome minimisation

There are two broad approaches used to generate minimal genomes, termed ‘top-down’ and ‘bottom-up’. ‘Top-down’ minimal genomes are generated by reducing the gene number and genome size of an existing genome.

Minimisation of an E. coli genome

Since the early 2000s, ‘top-down’ genome reduction has been attempted in several bacterial species as well as in the fission yeast Schizosaccharomyces pombe^{9,10,11,12,13}. These genomes were steamlined via a series of sequential deletions, based on known essential genomic regions and comparative genomics analysis. A selection of previous ‘top down’ genome minimisation approaches and the resulting phenotypes are shown in Table 1. A standard scheme for genome minimization of bacterial systems was also reviewed by Kurasawa et al.¹⁴ As E. coli is the most characterized prokaryotic organism, construction of a simpler E. coli cell by genome reduction has drawn great interest. Mizoguchi et al. constructed a 3.62 Mb E. coli genome, with a 22.2% genome size reduction compared to the parental strain W3110^11,15 (Table 1). Specifically, regions of more than ten consecutive non-essential genes were selected as candidates for deletion by comparative genomics analysis between E. coli and Buchnera sp., which has a small ~600 kb genome and is thought to share a common ancestor with E. coli, while essential genes and genes required for E. coli growth in the minimal medium were excluded. In addition, transporter genes, insertion sequences (ISs) and toxin–antitoxin pairs were also designed for deletion. In total, 103 candidate regions were selected and deleted individually using lambda-mediated homologous recombination, by which a target region was replaced by a selection cassette, which was subsequently recycled via another round of recombination. The regions that didn’t affect normal growth when absent were then removed in a single strain via 28 cycles of deletions via P1 transduction¹⁵. This genome-reduced strain, designated as MGF-01, had 1.5 times higher final cell density and a 2.4-fold increase in threonine yield from an engineered pathway compared to the wild-type. With similar approaches, the MGF-01 genome was further reduced by removing the remaining IS sites, generating a 2.98 Mb genome (strain DGF-298)¹⁶. DGF-298 showed no auxotrophic phenotype and better growth in a medium commonly used in industry, demonstrating its potential for industrial applications.

Table 1 Top-down genome minimisation projects

Full size table

In an earlier study, Hashimoto et al. reduced the genome of E. coli MG1655 from 4.64 Mb to 3.26 Mb¹⁷ using lambda homologous recombination and P1 transduction (Table 1). However, the minimised strain △16 had a much slower growth rate, and abnormal cell shape and nucleoid organisation. A further 430 kb was then deleted, yielding E. coli △33a with the smallest E. coli genome reported so far (2.83 Mb)¹⁸. It was shown that △33a was sensitive to oxidative stress, which might preclude its use in industrial fermetation settings without further modification. From the studies above, we note that the phenotypes of minimised E. coli strains are different in each study. This likely results from the different engineering approaches, specific regions of deletions, and the mutations arisen during the construction.

Minimisation of Bacillus subtilis, Streptomyces avermitilis and Schizosaccharomyces pombe genomes

Bacillus subtilis is a model Gram-positive bacterium with gene essentiality now well characterised^3,4,19. In the first study of B. subtilis genome minimisation, prophages and AT-rich islands were removed by homologous recombination, producing a strain with 7.7% genome reduction (strain Δ6)¹⁰ (Table 1). The genome-reduced strain had normal growth, and comparable heterologous protein production and secretion compared to the wild-type. With more gene functions discovered, the regions not required for cell survival in the rich medium were selected and deleted stepwise, and a 36.5 % decrease in genome size was achieved (strain PS38)²⁰ (Table 1). The genome-minimised B. subtilis had comparable growth rate to the wild-type in a rich medium. Interestingly, unknown genes still represented 18% of the total genes in PS38, while the proteins of unknown function only represented 2.5% in the total expressed proteome. The finding suggested that the unknown genes were poorly expressed generally and might only be useful under specific conditions. It is therefore possible the B. subtilis genome could be further reduced by removing a large proportion of genes with unknown functions.

Streptomyces avermitilis is an industrial microorganism able to produce various secondary metabolites. A sub-telomeric region >1.4 Mb, which did not contain essential genes, was deleted sequentially from the 9.02-Mb linear chromosome of S. avermitilis. This generated a series of deletion mutants, whose sizes corresponded to around 80% of the wild-type chromosome¹² (Table 1). After integration with gene clusters encoding the production of streptomycin and cephamycin C, the deletion mutants produced both antibiotics at higher levels than the natural hosts.

In an example of eukaryotic genome reduction, 657.3 kb was deleted from the terminal regions on chromosomes I and II of the fission yeast Schizosaccharomyces pombe by the Latour method²¹, which included integration of a ura4 + marker by homologous recombination and subsequent counter-seletion for deletion of the inserted ura4 + marker using 5-FOA medium¹³ (Table 1). The resulting strain had decreased uptake of glucose and some amino acids, but had increased levels of heterologous protein production.

These ‘top-down’ genome minimisation or reduction studies have improved our understanding of fundamental biology, and showed that reduction of genome size does not necessarily generate strains with impaired fitness. Some genome-reduced strains have similar or even superior fitness and industrially favourable phenotypes relative to their wild-type parents. These top-down approaches are straightforward, generally affordable, and are often the method of choice for generating a small number of deletions, or deletion of a large non-essential gene cluster. However, for genome-wide minimisation, the reduction process can be very challenging. For example, tens or even hundreds of rounds of transformation might be required to yield and identify a strain with significant genome reduction. In addition, the deletion regions were limited to the segments of genes with characterised functions. Unknown genes, intergenic regions, introns, and non-annotated genomic features were not targeted for deletion. Moreover, the target regions for deletion were chosen based on the essentiality of single genes, while genetic interactions and synthetic lethality were not taken into consideration. Thus, it is very difficult to generate a minimal genome for practical applications using previous non-multiplexed gene knock out approaches. A faster and more systematic approach is needed for genome-level minimisation and eventual minimisation.

Synthetic genomics unlocks new possibilities for genome minimisation

With the decreased DNA synthesis costs and the development of large-scale DNA assembly techniques, it is now possible for ‘bottom-up’ genome minimisation and re-functionalisation via whole-genome design and synthesis. These ‘bottom-up’ approaches rely on either the de novo synthesis of a new genome, or the stepwise replacement of an existing genome with rationally designed and chemically synthesised DNA.

Synthesis of Mycoplasma genomes

In 2002, the poliovirus cDNA (~7.5 kb) was the first to be chemically synthesised²². Following the success of the synthetic poliovirus genome, Gibson et al. synthesized the first prokaryotic genome, a 582,970–base pair genome of Mycoplasma genitalium, which is a bacterium with the smallest genome grown in pure culture²³. Since M. genitalium has a very slow growth rate, two faster-growing Mycoplasma species were selected for subsequent research. M. mycoides was chosen for de novo genome synthesis, and M. capricolum as the recipient cell for the synthetic M. mycoides genome. The synthetic genome of M. mycoides (JCVI-syn1.0) was successfully completed by Gibson et al. in 2010 with four watermark sequences, designed to differentiate the synthetic genome from the wild-type genome²⁴. The synthetic genome was assembled by transformation and homologous recombination in yeast of 1078 overlapping 1 kb DNA cassettes into a 1.08 Mb genome, and then transplanted into a M. capricolum recipient cell to create a new synthetic strain showing a similar phenotype to M. mycoides (Fig. 2a).

**Fig. 2: Construction of synthetic genomes.**

Synthesis of a minimal Mycoplasma mycoides genome

In a following study, the team applied whole-genome design and synthesis to minimise the M. mycoides genome²⁵ (Fig. 3). The minimal genome was initially designed according to existing transposon mutagenesis data and molecular biology knowledge. However, the initial design did not generate a viable strain. Subsequently a global Tn5 mutagenesis study was conducted to determine the essential genes, non-essential genes, alongside quasi-essential genes. Quasi-essentiality, a concept developed during the design of the minimal synthetic Mycoplasma genome, describes genes whose deletion wouldn’t result in cell-death immediately, but would cause minimal to severe growth impairments. Whilst not being strictly essential, these genes are needed for long term fitness²⁵. By retaining quasi-essential genes and avoiding deleting synthetic lethal pairs, a viable minimised genome was obtained (JCVI-syn2.0). In addition, 42 more genes were removed after another round of Tn5 mutagenesis in syn2.0, yielding an approximately minimal genome (JCVI-syn3.0) with removal of 428 genes in total. The syn3.0 strain has a genome of 531 kb, smaller than any autonomously replicating cell known in nature, has 51% genome reduction compared to syn1.0 and a doubling time of ~180 min, which is slower than of syn1.0, which has a doubling time of ~60 min. However, its growth is much faster than the 16-h doubling time of M. genitalium. It can be inferred that there is a trade-off between removing as many genes as possible and maintaining a certain level of growth fitness. In a follow-up study, adaptive laboratory evolution (ALE) was conducted to improve the growth rate of JCVI-syn3.0 by 15%²⁶.

**Fig. 3: ‘Bottom up’ construction of a minimal *Mycoplasma mycoides* genome (JCVI-syn3.0).**

Synthesis of an essential Caulobacter crescentus genome

Chemical synthesis has also been applied to rebuild the genome of Caulobacter crescentus, a model system for cell cycle, cellular differentiation, and cell division studies. This synthetic C. crescentus genome, C. eth-2.0, was designed to only contain essential genes²⁷. Initially, a 785,701-bp genome was designed computationally according to characterisation data from transposon gene knockout studies. However, the natural sequences failed to be commercially synthesized due to synthesis constraints such as homopolymers and repetitive sequences. Thus, computational DNA design algorithms were applied and resulted in 10,172 base substitutions to facilitate DNA synthesis. In addition, 123,141 base substitutions were introduced within protein-coding sequences to reduce the number of hypothetical genetic elements from 6290 to 799. These removed elements included alternative open reading frames (ORFs), transcriptional start sites (TSSs) and ribosome binding sites (RBSs) within the coding sequences (CDSs). In total, 56.1% of all codons were replaced by synonymous codons in C. eth-2.0.

The C. eth-2.0 genome was constructed from the assembly of 236 DNA blocks of 3-4 kb into 37 large segments of 19–22 kb, and further into 16 ‘mega-segments’, and finally into the full-length chromosome via transformation-associated recombination (TAR) in yeast. Functional analysis showed that 81.5% of all synonymously recoded essential genes had no significant influence on their functionality, which demonstrated the potential of synonymous recoding to facilitate de novo genome synthesis. This analysis also revealed that 98 genes lost their function due to rewriting, which may have been a result of inaccurate annotations causing other important features to be modified.

The fully assembled C. eth-2.0 failed to replace the native genome and generate a living cell. A follow-up study compared the transcriptional profiles of genes expressed from plasmid-borne C. eth-2.0 segments to those on the native C. crescentus genome, with the intention of uncovering important elements that had been disrupted by synonymous recoding. The analysis resulted in 60 promoter annotations being refined and showed that in C. eth-2.0, 18 termination elements and 77 transcription start sites had been unintentionally introduced²⁸. Translational regulations for 20 CDSs and an essential translational regulatory element for the expression of ribosomal protein were also identified²⁸.

Synthesis of a recoded E. coli genome

Advances in synthetic genomics have also facilitated global codon reassignment. Total synthesis was implemented in E. coli for genome-wide removal of three codons, generating a synthetic E. coli genome with 61 codons²⁹. In the synthetic E. coli genome, two of the serine codons and a stop codon were replaced, and an essential transfer RNA gene was freed up. 10 kb synthetic constructs were assembled into 37 fragments of around 100 kb each onto bacterial artificial chromosomes (BACs) by TAR in yeast (Fig. 2b). The 100 kb synthetic fragments were integrated into different E. coli strains in parallel, by ‘replicon excision for enhanced genome engineering through programmed recombination’ (REXER), which used CRISPR-Cas9 and lambda-mediated recombination to replace the genomic DNA with the recoded DNA from BACs. Seven strains with partially synthetic genomes were generated by integrating four or five fragments of around 100 kb via consecutive REXER cycles with alternating uses of positive- and negative-selection, enabling genome stepwise interchange synthesis (GENESIS) (Fig. 2b). The recoded large sections were at last combined into a full synthetic recoded genome by conjugative transfer and recombination, designated as ‘Syn61’. The resulting codon compressed strain Syn61 provides huge potential for production of proteins with novel functions via codon reassignment, as well as for industrial bioprocesses since they are resistant to phage contamination through genetic code incompatibility. This was demonstrated in a later study where the three previously freed-up codons were reassigned to enable the incorporation of non-canonical amino acids. Cells with reassigned codons were resistant to viral infection and were able to produce novel polymers and macrocycles³⁰. However, it is very challenging to free up more codons since large scale genome recoding will not only increase the technical difficulty of DNA synthesis and assembly, but also affect GC content, protein expression, and global epigenetic signals, potentially resulting in severe fitness-defects or lethality³¹. Ostrov et al. reported the design, synthesis, and testing progress of a 57-codon E. coli genome in 2016, in which they have validated the function of 63% of recoded genes, and they are still working on the assembly of the fully recoded strain^31,32.

Synthesis of a S. cerevisiae genome

In parallel to the success of de novo synthesis of bacterial genomes, a global consortium led by Jef Boeke at New York University has been pursuing an ambitious project Sc2.0, aiming to build the first synthetic eukaryotic genome, that of S. cerevisiae. The aim of the project is not only to gain insights into yeast genomics, but also to create a simpler version of a yeast cell with comparable fitness to wild-type, which could be streamlined and refactored for different engineering purposes. The following changes were implemented in the design of Sc2.0: unstable or redundant elements including retrotransposons, subtelomeric repeats and introns were removed; the repetitive transfer RNA (tRNA) genes were relocated to a ‘neo-chromosome’ to test their functions and stability separately; TAG stop codons were swapped with TAA for future codon reassignment; native telomeres were replaced with a standardised synthetic version; strings of codons were recoded to synonymous codons as ‘PCRtags’, which can be used as watermarks to distinguish the synthetic sequences and wild-type sequences^33,34. Initially, the construction of synthetic chromosomes started from oligonucleotide assembly into 750 bp building blocks, then assembled in yeast to produce minichunks³⁴ (Fig. 2c). Several overlapping DNA minichunks, with an auxotrophic marker (LEU2 or URA3) in the last minichunk, were co-transformed in yeast cell to replace the native DNA³⁵. The integration of the next group of minichunks would over-write the previous marker, enabling both positive and negative selection by the ‘SWAP-In’ method^35,36 (Fig. 2c). Ultimately, this would generate a complete synthetic chromosome. Most recently, chromosome assembly has been further expedited by starting with 6–10 kb commercially synthesized ‘chunks’. Four or five chunks were ligated in vitro for the assembly of a 30–50 kb megachunk, before integration into yeast with an auxotrophic marker. The megachunks were ‘SWAPPED-In’ gradually to create a synthetic chromosome (Fig. 2c). Thus far, the construction of all synthetic chromosomes is close to completion, with nine strains containing one synthetic chromosome reported to have comparable growth with the wild-type strain^{37,38,39,40,41,42,43,44,45}. The global team is on track to build an entirely synthetic yeast genome. Once completed, the synthetic genome will have a nearly 8% genome size reduction, and will serve as a whole-genome diversification and minimisation platform³⁶.

‘Bottom-up’ genome construction enables implementation of novel design changes at the whole-genome level. However, it is still currently very costly for genome-scale synthesis, especially for eukaryotic genomes. There will also be regions difficult to synthesise that require recoding. As more knowledge of genome biology and gene regulation is gained through the study and rewriting of genomes, our inability to design a functional minimal genome from scratch is continually highlighted. However, the ensuing iterative design, build, test and learn cycles needed to generate a final functional minimal genome will ultimately refine our understanding and capabilities in genome design^25,27,45,46.

Sc2.0 SCRaMbLE: ‘bottom-up’ genome engineering meets top-down pruning for genome minimisation

In the wild-type S. cerevisiae genome, most of the genes are individually non-essential, and many genes have homologues originating from historical duplication events. This indicates a great potential for genome minimisation. Complementing these existing factors, a novel genome diversification and minimisation approach has been designed into the Sc2.0 genome. In the synthetic genome, all non-essential genes and major landmarks have been designed to be flanked by the symmetrical loxPsym recombination sites. This enables the most dramatic novel ability of Sc2.0 strains, the Synthetic Chromosome Rearrangement and Modification by LoxPsym-mediated Evolution (SCRaMbLE) system. Upon induction of the SCRaMbLE system by an active Cre-recombinase, an effectively infinite number of genome rearrangements can be generated, including gene deletions, duplications, inversions, and translocations between any two loxPsym sites (Fig. 4a). The ability of SCRaMbLE to generate genome diversity had been confirmed in a previous study, in which 156 deletions, 89 inversions, 94 duplications, and 55 additional complex rearrangements were identified from deep sequencing of 64 synIXR SCRaMbLEd strains⁴⁷. Moreover, each SCRaMbLEd strain has a unique genome. Since deletions are the most frequent recombination events arising via SCRaMbLE, it can generate a near-infinite number of variable reduced genomes that are sufficient to keep the cells alive and functional. Compared to previous genome reduction approaches, SCRaMbLE-assisted minimisation can be achieved by sequential ‘bottom up’ genome synthesis followed by ‘top down’ SCRaMbLE-mediated genome reduction approaches.

**Fig. 4: Genome minimisation by SCRaMbLE.**

One major challenge involved in applying the Sc2.0 SCRaMbLE system for genome minimisation is the loss in cell viability after SCRaMbLE, which decreases the chances of long sequence deletions. This is because although loxPsym sites were inserted 3 bp downstream of the stop codon of non-essential genes, in many cases, there are essential genes and non-essential genes present between two sequential loxPsym sites. As a result, these non-essential genes have to be deleted by SCRaMbLE together with the adjacent essential genes, leading to loss of viability. Fortunately, a complementary ‘bottom-up’ approach can address this challenge by introducing a minimal-essential chromosome containing all individually essential genes but without loxPsym sites so that the extra copy of the essential genes can be stably maintained during SCRaMbLE. This principle has been tested via SCRaMbLE-ing of synIII and the synXII left arm (synXIIL), which demonstrated the capacity for deletion of large regions containing the essential genes, now complemented by the supplemental copies^48,49. With an extra copy of all essential genes from the genome, this would increase the post-SCRaMbLE population viability, and increase the probability of finding strains with smaller genomes. Even in this scenario, there is still likely to be a loss in population viability through synthetic lethality, where the loss of individually non-essential genes in combination causes lethality⁴⁹.

After SCRaMbLE, there will be a mixed population of genomes, except for the reduced genomes, some without changes, some with a net increase in genome sizes, and some with undesired or deleterious rearrangements. Thus, the other challenge for SCRaMbLE-assisted minimisation is how to identify SCRaMbLEd strains with reduced genomes efficiently. One approach is to select deletions by integration of marker genes. URA3 insertion and 5-FOA counterselection was successfully applied to compact the synthetic chromosome XII left arm⁴⁸. With the aid of an essential gene array, 64 kb of a total 170 kb was deleted in syn XIIL via only one round of SCRaMbLE-based genome compaction (SGC)⁴⁸. After another two rounds of SGC, a strain with 58% reduction of synXIIL (~100 kb deleted from 170 kb) was generated and had comparable growth with wild-type strain. This study has demonstrated SCRaMbLE is an efficient system for yeast genome minimisation. However, selection of deletion at a specific locus does not rule out the possibility of duplications at another locus. Another approach we propose is to determine genome sizes assisted by fluorescence-activated cell sorting (FACS). Non-lethal double-strand DNA-specific dyes are able to sort out the cells with different genome sizes (preliminary data from our group), and can be used to stain and sort out the SCRaMbLEd cells with smaller genomes. After staining, FACS could then be applied for high-throughput screening of reduced genomes. This approach would enable the screening of 2000–5000 cells per second for their approximate genome size, easily providing the throughput necessary to find cells with rare large deletions. The phenotypes of sorted strains could then be tested, and analysed using systems biology approaches, which will shed light on the genetic elements that are common to minimal yeast genomes, and suggest paths towards rational genome design. Iterative rounds of SCRaMbLE, selection, test and learn could be conducted to explore the compactability of the yeast genome (Fig. 4b). Without the need to identify the essentiality of each genetic element, multiple rounds of SCRaMbLE-based selection provide an evolutionary process to enrich cells with smaller, and eventually minimal genomes.

In previous studies, minimal genomes were constructed in one specific condition, usually in rich media, which might not be useful for industrial settings. This, in-part, has led to pervasive arguments that minimised genomes cannot be industrially relevant. However, the SCRaMbLE-based minimisation process provides the opportunity for ‘industrial minimal’ genome selection by carrying out the iterative minimisation process with outgrowth under industrially relevant conditions, or with alternative selection pressures to co-select for desired industrial traits along with minimisation. This approach would reflect the fact that while there may be only one truly minimal genome, there are likely to be near infinite possibilities for genomes that are simultaneously minimised and selected for other desirable traits. Such industrial minimal genomes could be co-selected for phenotypes such as temperature tolerance, stress tolerance, or the bioproduction of specific proteins and metabolites.

Genome minimisation: does the ends justify the means?

Despite the applications of cutting-edge synthetic genomics and engineering approaches, genome minimisation projects are still relatively costly and time consuming. Thus, there is ongoing debate as to whether it is cost effective to build a minimal genome.

One school of thought posits that construction of minimal genomes can bring significant impacts for research and industrial purposes. First of all, it facilitates a deeper understanding of functions and interactions of genome components, and uncovers knowledge of how a genome is programmed into a living and functioning cell. In the redesign and chemical synthesis of C. eth-2.0, 52 instances of inaccurate annotations of the Caulobacter genome were identified via analysis of non-functional genes, and 27 regulatory elements within protein-coding sequences were discovered²⁷. To construct a minimal genome of M. mycoides, gene essentiality was re-identified by whole-genome Tn5 mutagenesis, and a class of quasi-essential genes, which do not result in lethality directly but are required for robust growth, were identified and retained in JCVI-syn3.0²⁵. Reorganisation of essential genes from Sc2.0 chrIII had little effect on their transcriptional level despite altered gene order and orientation, demonstrating the feasibility of defragmentation and reorganisation of the yeast genome⁴⁹.

Secondly, reducing the genome size can improve fitness or biomass yield possibly by avoiding unnecessary energetic costs. E. coli MGF-01 with 1030 kb removed had improved growth in M9 minimal medium^11,15. The reduced genome E. coli strain DGF-298, with 1670- kb deleted, had better growth fitness and cell yield in a rich medium than the wild-type strain¹⁶. In addition, mobile elements, recombinogenic and repetitive DNA are often deleted, which leads to better stability and more efficient and predictable genetic modifications^37,50. The E. coli multiple-deletion (MDS) series with removal of IS elements was subsequently free of IS-mediated mutagenesis, thus enabling more stable propagation of recombinant genes and plasmids⁵⁰. The MDS42 strain also showed more than 180-fold higher efficiency of DNA transformation by electroporation of a 2.7 kb pUC plasmid than its parental strain, which is comparable to, or even better than, the efficiencies of commercial competent cells⁵⁰. Further deletions were made from MDS42 to generate commercial strains ‘Clean Genome® E. coli’ by Scarab Genomics, which serves as a superior host for protein and nucleic acid production^51,52. With major IS elements deleted in Corynebacterium glutamicum, improved production of recombinant proteins was observed possibly due to the increased stability of plasmids⁵³. Thirdly and more importantly, minimal genomes could serve as better chassis cells for industrial applications. Genome minimisation is likely to reduce physiological complexity and therefore make metabolic modelling more predictive and systems biology more informative. Engineered heterologous pathways will also be less likely to be affected by complex native metabolism. In theory, minimal genomes will only contain the smallest set of genes required for survival and replication within a given environment, and their biosynthetic capacity will therefore be liberated to produce desired proteins and metabolites. This concept is supported by several previous studies. For example, an increase in threonine production was shown in the genome-reduced E. coli strain MGF-01¹¹, S. avermitilis with more than 1.4 Mb deletion enabled higher streptomycin and cephamycin C production than their native hosts, and B. subtilis PG10, with a 36% genome reduction, showed substantially higher secretory protein production⁵⁴. Furthermore, SCRaMbLE of synthetic chromosomes in yeast has been utilised to increase the yields of several valuable compounds including carotenoids, aromatics and antibiotics^55,56,57. Although these SCRaMbLEd strains did not have reduced genomes, SCRaMbLE has been used previously to streamline individual chromosomes^48,49, making the use of SCRaMbLE to simultaneously streamline genomes and improve biosynthetic capacity an intriguing near-term possibility.

In contrast to the optimistic prospect of higher biosynthetic capacity and reduced complexity in genome-reduced strains, another school of thought maintains that minimal genomes are of little value to industrial applications. It usually takes a long-time to assemble or engineer a minimal genome, followed by a lengthy trial and error process to return a minimal genome strain to comparable fitness and function with the wild-type strain. It is also worth noting that some of the minimised genomes reported thus far are probably not industrially applicable, nor were they designed to be. For example, the M. mycoides JCVI-syn3.0 grew slowly, and its metabolism is extremely reduced^25,58. The genome-reduced E. coli △16 had a severe growth defect and abnormal nucleoid organisation, while △33a with a genome of only 2.83 Mb was sensitive to oxidative stress^17,18 In addition, a minimal genome is affected by the conditions under which it is constructed. Currently, most minimal genome projects are constructed in nutrient-rich medium. The genetic elements encoding stress response and tolerance are often discarded, which might result in decreased fitness under industrial fermentation conditions. Furthermore, the resulting minimal genome strain might not have intermediates or co-factors for expressing a heterologous pathway.

The shortcomings mentioned above could be overcome via ALE or genetic engineering to either select for the mutations to overcome the stress, re-insert the required genes, or select for their function during a ‘top-down’ minimisation process. ALE is an effective approach that has been demonstrated to improve the fitness of JCVI-syn3.0, a recoded E. coli genome, as well as the synXIV strain from Sc2.0^14,26,45,59. Another approach is to build customised neo-chromosomes or entire minimal genomes for different requirements, such as for different fermentation conditions or for producing different categories of compounds. Sc2.0 is constructed based on the genome background of laboratory strain S288c, which lacks genetic diversity compared to industrial yeast strains. To address this, Kutyna et al. constructed a synthetic pan-genome neo-chromosome (PGNC), which incorporated 75 predicted ORFs from industry, human pathogen and natural isolates⁶⁰. With the presence of PGNC, the resulting strain was able to utilise a wider range of carbon sources beyond the Sc2.0 parental strain. This is a clear example of how construction of neo-chromosome can improve the industrially favourable features and expand the applications of the synthetic strain. However, given the current state of biological knowledge, much more effort is required to realise this idealised view of synthetic genome design. Given the large amount of work to construct industrial applicable minimal genomes, optimisation of the pathway via genetic engineering directly is clearly a more straightforward and promising approach in the immediate future. However, the weight of evidence from the existing literature suggests that minimised genomes hold great promise for both understanding biology and engineering superior industrial strains.

Conclusion and future prospects

Overall, genome minimisation and re-functionalisation are very attractive research topics. The key current bottlenecks for the construction of novel minimal genomes are the workload and cost required to assemble comparatively small synthesised DNA chunks into ever-larger fragments, transplant these into host organisms, and modify and ‘debug’ these synthetic genomes^46,61. Enabling technologies such as enzymatic DNA synthesis, together with automated DNA assembly platforms can greatly reduce the cost of synthesis, as well as shorten the time and labour required for genome synthesis projects. Therefore, it could become economically viable to create minimal genomes for a wide range of species that are specific to particular applications and enable rapid iteration in design-build-test cycles^62,63.

However, the increasing dominance of the ‘bottom-up’ approach via synthesis shouldn’t be taken for granted. The ever-expanding ecosystem of CRISPR-based tools for genome editing has recently started to produce new methods for rapid and precise removal of large regions of genomes, especially those of mammalian cells. In particular, the Prime Editing system of Liu and co-workers can insert pairs of site-specific recombinase sequences, allowing a targeted genome section to be cleanly deleted by a recombinase^64,65. Two variations of Prime Editing, called PRIME-Del and PEDAR go further, doing precise programmed deletions over 10 kb at a time without the need for any recombinase^66,67. Multiplexing these methods to allow many large deletions from a genome at the same time is the next challenge, but such multiplexing has already been shown to be possible for Prime Editing’s precursor, Base Editing, where over 13,000 edits in a human genome can be achieved in a cell in parallel⁶⁸. For large genomes such those for mammalian cell lines, multiplex CRISPR-guided precise deletions are likely to be the quickest and cheapest route to minimal genomes, and will hopefully ensure that this important set of organisms for industrial biology and research do not miss out on the many opportunities that minimal and synthetic genomes can offer.

Rapid and inexpensive generation of streamlined genomes can lead to a diverse set of context-dependent minimal genomes. Systems analysis of both fundamental and applied minimal genomes would provide a rich repository of omics data for design guidelines for future simple and more robust genomes, and even de novo new-to-nature genomes⁶⁹. A minimal genome could function as a platform for modular plug-and-play integration of metabolic pathways, such as ‘feedstock modules’ for efficient utilisation of non-conventional carbon sources, ‘production modules’ for making different groups of compounds, ‘stress and toxic compound resistance modules’, ‘biosensing modules’ for in vivo metabolite measurements and feedback control, ‘cell-to-cell communication modules’, ‘data-storage modules’ and ‘bio-computation modules’ (Fig. 5). Each module could also be optimised via ALE separately and subsequently reintegrated while leveraging reduced genomic complexity for troubleshooting. This could allow for interoperability of modules when relying on the same minimal genome strain and enable a greater degree of orthogonality when designing each individual module. As we uncover the minimal genetic modules required for different cellular processes and metabolic functions, together with machine learning-assisted design and modelling, we may be able to combine these off-the-shelf to generate entirely novel genomes and organisms designed de novo for specific applications with vastly reduced turnaround times than any past and current genome writing projects^33,70. Moreover, the simpler host can also be applied as a chassis to unravel the complexities of microbial ecosystems. With the growing capacity of genome synthesis and genome transplantation, it might be possible to incorporate a synthetic metagenome into a streamlined host cell. The resulting meta-synthetic cell will gain the traits from the ecosystem and have much improved industrial potential, with the metagenome containing all their keystone genes⁷¹. The meta-synthetic cell can also serve as a platform to study the communications and interactions of the eco-systems. Above all, synthetic minimal genomes could provide a path to inch closer to an answer to the genetic aspects of age-old questions: What constitutes life? To what extent can we reduce genomes while retaining industrial relevance? What is the trade-off between minimisation and application?

**Fig. 5: Future smart design, construction, and applications of minimal genomes.**

References

Serres, M. H. et al. A functional update of the Escherichia coli K-12 genome. Genome Biol. 2, 1–7 (2001).
Article Google Scholar
Goffeau, A. et al. Life with 6000 genes. Science 274, 546–567 (1996).
Article ADS CAS PubMed Google Scholar
Juhas, M., Reuß, D. R., Zhu, B. & Commichau, F. M. Bacillus subtilis and Escherichia coli essential genes and minimal cell factories after one decade of genome engineering. Microbiology 160, 2341–2351 (2014).
Article CAS PubMed Google Scholar
Kobayashi, K. et al. Essential Bacillus subtilis genes. Proc. Natl Acad. Sci. USA 100, 4678–4683 (2003).
Article ADS CAS PubMed PubMed Central Google Scholar
Pena-Castillo, L. & Hughes, T. R. Why are there still over 1000 uncharacterized yeast genes? Genetics 176, 7–14 (2007).
Article CAS PubMed PubMed Central Google Scholar
Keseler, I. M. et al. The EcoCyc database in 2021. Front. Microbiol. 12, 711077 (2021).
Article PubMed PubMed Central Google Scholar
Boone, C., Bussey, H. & Andrews, B. J. Exploring genetic interactions and networks with yeast. Nat. Rev. Genet. 8, 437–449 (2007).
Article CAS PubMed Google Scholar
Tong, A. H. Y. et al. Global mapping of the yeast genetic interaction network. Science 303, 808–813 (2004).
Article ADS CAS PubMed Google Scholar
Yu, B. J. et al. Minimization of the Escherichia coli genome using a Tn 5-targeted Cre/loxP excision system. Nat. Biotechnol. 20, 1018–1023 (2002).
Article CAS PubMed Google Scholar
Westers, H. et al. Genome engineering reveals large dispensable regions in Bacillus subtilis. Mol. Biol. Evol. 20, 2076–2090 (2003).
Article CAS PubMed Google Scholar
Mizoguchi, H., Mori, H. & Fujio, T. Escherichia coli minimum genome factory. Biotechnol. Appl. Biochem. 46, 157–167 (2007).
Article CAS PubMed Google Scholar
Komatsu, M., Uchiyama, T., Ōmura, S., Cane, D. E. & Ikeda, H. Genome-minimized Streptomyces host for the heterologous expression of secondary metabolism. Proc. Natl Acad. Sci. USA 107, 2646–2651 (2010).
Article ADS CAS PubMed PubMed Central Google Scholar
Sasaki, M., Kumagai, H., Takegawa, K. & Tohda, H. Characterization of genome-reduced fission yeast strains. Nucleic Acids Res. 41, 5382–5399 (2013).
Article PubMed PubMed Central Google Scholar
Kurasawa, H., Ohno, T., Arai, R. & Aizawa, Y. A guideline and challenges toward the minimization of bacterial and eukaryotic genomes. Curr. Opin. Syst. Biol. 24, 127–134 (2020).
Article Google Scholar
Mizoguchi, H., Sawano, Y., Kato, J.-I. & Mori, H. Superpositioning of deletions promotes growth of Escherichia coli with a reduced genome. DNA Res. 15, 277–284 (2008). They generated an E. coli genome with 22.2% reduction compared to the parental strain, and the strain had improved growth in minimal medium and enhanced threonine yield, demonstrating the industrial potential of minimal genomes.
Article CAS PubMed PubMed Central Google Scholar
Hirokawa, Y. et al. Genetic manipulations restored the growth fitness of reduced-genome Escherichia coli. J. Biosci. Bioeng. 116, 52–58 (2013).
Article CAS PubMed Google Scholar
Hashimoto, M. et al. Cell size and nucleoid organization of engineered Escherichia coli cells with a reduced genome. Mol. Microbiol. 55, 137–149 (2005).
Article CAS PubMed Google Scholar
Iwadate, Y., Honda, H., Sato, H., Hashimoto, M. & Kato, J.-I. Oxidative stress sensitivity of engineered Escherichia coli cells with a reduced genome. FEMS Microbiol. Lett. 322, 25–33 (2011).
Article CAS PubMed Google Scholar
Commichau, F. M., Pietack, N. & Stülke, J. Essential genes in Bacillus subtilis: a re-evaluation after ten years. Mol. Biosyst. 9, 1068–1075 (2013).
Article CAS PubMed Google Scholar
Reuß, D. R. et al. Large-scale reduction of the Bacillus subtilis genome: consequences for the transcriptional network, resource allocation, and metabolism. Genome Res. 27, 289–299 (2017).
Article PubMed PubMed Central Google Scholar
Hirashima, K., Iwaki, T., Takegawa, K., Giga-Hama, Y. & Tohda, H. A simple and effective chromosome modification method for large-scale deletion of genome sequences and identification of essential genes in fission yeast. Nucleic Acids Res. 34, e11–e11 (2006).
Article PubMed PubMed Central Google Scholar
Cello, J., Paul, A. V. & Wimmer, E. Chemical synthesis of poliovirus cDNA: generation of infectious virus in the absence of natural template. Science 297, 1016–1018 (2002).
Article ADS CAS PubMed Google Scholar
Gibson, D. G. et al. Complete chemical synthesis, assembly, and cloning of a Mycoplasma genitalium genome. Science 319, 1215–1220 (2008).
Article ADS CAS PubMed Google Scholar
Gibson, D. G. et al. Creation of a bacterial cell controlled by a chemically synthesized genome. Science 329, 52–56 (2010).
Article ADS CAS PubMed Google Scholar
Hutchison, C. A. et al. Design and synthesis of a minimal bacterial genome. Science 351, aad6253 (2016). The first construction of a minimial genome via whole genome design and synthesis.
Article PubMed Google Scholar
Sandberg, T. E. et al. Adaptive evolution of a minimal organism with a synthetic genome. Available at SSRN: https://ssrn.com/abstract=4147935.
Venetz, J. E. et al. Chemical synthesis rewriting of a bacterial genome to achieve design flexibility and biological functionality. Proc. Natl Acad. Sci. USA 116, 8070–8079 (2019).
Article ADS CAS PubMed PubMed Central Google Scholar
van Kooten, M. J., Scheidegger, C. A., Christen, M. & Christen, B. The transcriptional landscape of a rewritten bacterial genome reveals control elements and genome design principles. Nat. Commun. 12, 1–13 (2021).
Google Scholar
Fredens, J. et al. Total synthesis of Escherichia coli with a recoded genome. Nature 569, 514–518 (2019). Whole genome recoding was applied to generate an E. coli strain with 61 codons.
Article ADS CAS PubMed PubMed Central Google Scholar
Robertson, W. E. et al. Sense codon reassignment enables viral resistance and encoded polymer synthesis. Science 372, 1057–1062 (2021).
Article ADS CAS PubMed PubMed Central Google Scholar
Ostrov, N. et al. Synthetic genomes with altered genetic codes. Curr. Opin. Syst. Biol. 24, 32–40 (2020).
Article Google Scholar
Ostrov, N. et al. Design, synthesis, and testing toward a 57-codon genome. Science 353, 819–822 (2016).
Article ADS CAS PubMed Google Scholar
Pretorius, I. & Boeke, J. Yeast 2.0—connecting the dots in the construction of the world’s first functional synthetic eukaryotic genome. FEMS Yeast Res. 18, foy032 (2018).
Article PubMed PubMed Central Google Scholar
Dymond, J. S. et al. Synthetic chromosome arms function in yeast and generate phenotypic diversity by design. Nature 477, 471–476 (2011). The first chemical synthesis of a partially synthetic eukaryotic chromosome, and application SCRaMbLE for generating genetic diversity was demonstrated in this study.
Article ADS CAS PubMed PubMed Central Google Scholar
Jovicevic, D., Blount, B. A. & Ellis, T. Total synthesis of a eukaryotic chromosome: redesigning and SCRaMbLE‐ing yeast. Bioessays 36, 855–860 (2014).
Article CAS PubMed Google Scholar
Richardson, S. M. et al. Design of a synthetic yeast genome. Science 355, 1040–1044 (2017).
Article ADS CAS PubMed Google Scholar
Annaluru, N. et al. Total synthesis of a functional designer eukaryotic chromosome. Science 344, 55–58 (2014).
Article ADS CAS PubMed PubMed Central Google Scholar
Mitchell, L. A. et al. Synthesis, debugging, and effects of synthetic chromosome consolidation: synVI and beyond. Science 355, eaaf4831 (2017).
Article PubMed Google Scholar
Shen, Y. et al. Deep functional analysis of synII, a 770-kilobase synthetic yeast chromosome. Science 355, eaaf4791 (2017).
Article PubMed PubMed Central Google Scholar
Wu, Y. et al. Bug mapping and fitness testing of chemically synthesized chromosome X. Science 355, eaaf4706 (2017).
Article ADS PubMed PubMed Central Google Scholar
Xie, Z.-X. et al. “Perfect” designer chromosome V and behavior of a ring derivative. Science 355, eaaf4704 (2017).
Article PubMed Google Scholar
Zhang, W. et al. Engineering the ribosomal DNA in a megabase synthetic chromosome. Science 355, eaaf3981 (2017).
Article PubMed Google Scholar
Blount, B. A. et al. Synthetic yeast chromosome XI design enables extrachromosomal circular DNA formation on demand. Preprint at bioRxiv https://doi.org/10.1101/2022.07.15.500197 (2022).
Shen, Y. et al. Dissecting aneuploidy phenotypes by constructing Sc2. 0 chromosome VII and SCRaMbLEing synthetic disomic yeast. Preprint at bioRxiv https://doi.org/10.1101/2022.09.01.506252 (2022).
Williams, T. C. et al. Laboratory evolution and polyploid SCRaMbLE reveal genomic plasticity to synthetic chromosome defects and rearrangements. Preprint at bioRxiv https://doi.org/10.1101/2022.07.22.501046 (2022).
Xie, Z.-X., Zhou, J., Fu, J. & Yuan, Y.-J. Debugging: putting the synthetic yeast chromosome to work. Chem. Sci. 12, 5381–5389 (2021).
Article CAS PubMed PubMed Central Google Scholar
Shen, Y. et al. SCRaMbLE generates designed combinatorial stochastic diversity in synthetic chromosomes. Genome Res. 26, 36–49 (2016).
Article CAS PubMed PubMed Central Google Scholar
Luo, Z. et al. Compacting a synthetic yeast chromosome arm. Genome Biol. 22, 1–18 (2021). In this study, SCRaMbLE was applied to compact the synthetic yeast chromosome XII left arm, demonstrating of the feasilbity of application of SCRaMbLE for whole genome minimisation.
Article Google Scholar
Wang, P. et al. SCRaMbLEing of a synthetic yeast chromosome with clustered essential genes reveals synthetic lethal interactions. ACS Synth. Biol. 9, 1181–1189 (2020).
Article CAS PubMed Google Scholar
Posfai, G. et al. Emergent properties of reduced-genome Escherichia coli. Science 312, 1044–1046 (2006).
Article ADS CAS PubMed Google Scholar
Scarab Genomics, LLC. https://www.scarabgenomics.com/products/clean-genome-e-coli/.
Zhang, W., Mitchell, L. A., Bader, J. S. & Boeke, J. D. Synthetic genomes. Annu. Rev. Biochem. 89, 77–101 (2020).
Article CAS PubMed Google Scholar
Choi, J. W., Yim, S. S., Kim, M. J. & Jeong, K. J. Enhanced production of recombinant proteins with Corynebacterium glutamicum by deletion of insertion sequences (IS elements). Microb. Cell Fact. 14, 1–12 (2015).
Article Google Scholar
Aguilar Suárez, R. O., Stülke, J. R. & van Dijl, J. M. Less is more: toward a genome-reduced Bacillus cell factory for “difficult proteins”. ACS Synth. Biol. 8, 99–108 (2018).
Article PubMed Google Scholar
Liu, W. et al. Rapid pathway prototyping and engineering using in vitro and in vivo synthetic genome SCRaMbLE-in methods. Nat. Commun. 9, 1–12 (2018).
ADS Google Scholar
Blount, B. et al. Rapid host strain improvement by in vivo rearrangement of a synthetic yeast chromosome. Nat. Commun. 9, 1–10 (2018).
Article ADS CAS Google Scholar
Jia, B. et al. Precise control of SCRaMbLE in synthetic haploid and diploid yeast. Nat. Commun. 9, 1–13 (2018).
Article ADS Google Scholar
Vickers, C. E. The minimal genome comes of age. Nat. Biotechnol. 34, 623–624 (2016).
Article CAS PubMed Google Scholar
Wannier, T. M. et al. Adaptive evolution of genomically recoded Escherichia coli. Proc. Natl Acad. Sci. USA 115, 3090–3095 (2018).
Article ADS CAS PubMed PubMed Central Google Scholar
Kutyna, D. R. et al. Construction of a synthetic Saccharomyces cerevisiae pan-genome neo-chromosome. Nat. Commun. 13, 1–9 (2022). Construction of a neo-chromosome containing industrial relevant genes improved the industrially favourable features and expanded the applciations of the lab strain.
Article Google Scholar
Ostrov, N. et al. Technological challenges and milestones for writing genomes. Science 366, 310–312 (2019).
Article ADS CAS PubMed Google Scholar
Eisenstein, M. Enzymatic DNA synthesis enters new phase. Nat. Biotechnol. 38, 1113–1116 (2020).
Article CAS PubMed Google Scholar
Wang, L. et al. Synthetic genomics: from DNA synthesis to genome design. Angew. Chem. Int. Ed. 57, 1748–1756 (2018).
Article CAS Google Scholar
Anzalone, A. V. et al. Programmable deletion, replacement, integration and inversion of large DNA sequences with twin prime editing. Nat. Biotechnol. 40, 731–740 (2022).
Article CAS PubMed Google Scholar
Anzalone, A. V. et al. Search-and-replace genome editing without double-strand breaks or donor DNA. Nature 576, 149–157 (2019).
Article ADS CAS PubMed PubMed Central Google Scholar
Jiang, T., Zhang, X.-O., Weng, Z. & Xue, W. Deletion and replacement of long genomic sequences using prime editing. Nat. Biotechnol. 40, 227–234 (2022).
Article CAS PubMed Google Scholar
Choi, J. et al. Precise genomic deletions using paired prime editing. Nat. Biotechnol. 40, 218–226 (2022).
Article CAS PubMed Google Scholar
Smith, C. J. et al. Enabling large-scale genome editing at repetitive elements by reducing DNA nicking. Nucleic Acids Res. 48, 5183–5195 (2020).
Article CAS PubMed PubMed Central Google Scholar
Antonakoudis, A., Barbosa, R., Kotidis, P. & Kontoravdi, C. The era of big data: Genome-scale modelling meets machine learning. Comput. Struct. Biotechnol. J. 18, 3287–3300 (2020).
Article CAS PubMed PubMed Central Google Scholar
Steensels, J. et al. Improving industrial yeast strains: exploiting natural and artificial diversity. FEMS Microbiol. Rev. 38, 947–995 (2014).
Article CAS PubMed Google Scholar
Belda, I., Williams, T. C., de Celis, M., Paulsen, I. T. & Pretorius, I. S. Seeding the idea of encapsulating a representative synthetic metagenome in a single yeast cell. Nat. Commun. 12, 1–8 (2021).
Article Google Scholar

Download references

Acknowledgements

External support for Macquarie University’s Synthetic Biology initiative is acknowledged from Bioplatforms Australia, the New South Wales (NSW) Chief Scientist and Engineer, and the NSW Government’s Department of Primary Industries. Australian Government funding through its investment agency, the Australian Research Council, towards the Macquarie University-led ARC Centre of Excellence for Synthetic Biology is gratefully acknowledged. T.C.W. and I.S.P. acknowledge the support of ARC Discovery Project DP200100717. B.A.B. was supported by the University of Nottingham through a Nottingham Research Fellowship.

Author information

Authors and Affiliations

ARC Centre of Excellence in Synthetic Biology and School of Natural Sciences, Macquarie University, Sydney, NSW, 2109, Australia
Xin Xu, Felix Meier, Isak S. Pretorius, Ian T. Paulsen & Thomas C. Williams
School of Life Sciences, University of Nottingham, Nottingham, NG7 2RD, UK
Benjamin A. Blount
Imperial College Centre for Synthetic Biology, Imperial College London, London, SW7 2AZ, UK
Tom Ellis
Department of Bioengineering, Imperial College London, London, SW7 2AZ, UK
Tom Ellis
Wellcome Trust Sanger Institute, Cambridgeshire, CB10 1SA, UK
Tom Ellis

Authors

Xin Xu
View author publications
You can also search for this author in PubMed Google Scholar
Felix Meier
View author publications
You can also search for this author in PubMed Google Scholar
Benjamin A. Blount
View author publications
You can also search for this author in PubMed Google Scholar
Isak S. Pretorius
View author publications
You can also search for this author in PubMed Google Scholar
Tom Ellis
View author publications
You can also search for this author in PubMed Google Scholar
Ian T. Paulsen
View author publications
You can also search for this author in PubMed Google Scholar
Thomas C. Williams
View author publications
You can also search for this author in PubMed Google Scholar

Contributions

X.X., T.C.W. and F.M. conceived and developed the first draft of the manuscript. B.A.B, I.S.P., T.E., and I.T.P contributed sections of the paper and/or reviewed, edited and refined the various drafts of the manuscript. The order of authors reflects the order of the contributions in the review.

Corresponding authors

Correspondence to Xin Xu or Thomas C. Williams.

Ethics declarations

Competing interests

T.C.W. is a co-founder of Number 8 Bio Pty Ltd. The remaining authors declare no competing interests.

Peer review

Peer review information

Nature Communications thanks Alexander Wagner and the other, anonymous, reviewer(s) for their contribution to the peer review of this work.

Additional information

Publisher’s note Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons license and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/.

Reprints and permissions

About this article

Cite this article

Xu, X., Meier, F., Blount, B.A. et al. Trimming the genomic fat: minimising and re-functionalising genomes using synthetic biology. Nat Commun 14, 1984 (2023). https://doi.org/10.1038/s41467-023-37748-7

Download citation

Received: 17 November 2022
Accepted: 30 March 2023
Published: 08 April 2023
DOI: https://doi.org/10.1038/s41467-023-37748-7

Comments

By submitting a comment you agree to abide by our Terms and Community Guidelines. If you find something abusive or that does not comply with our terms or guidelines please flag it as inappropriate.