Engineering rules that minimize germline silencing of transgenes in simple extrachromosomal arrays in C. elegans

Aljohani, Mohammed D.; El Mouridi, Sonia; Priyadarshini, Monika; Vargas-Velazquez, Amhed M.; Frøkjær-Jensen, Christian

doi:10.1038/s41467-020-19898-0

Download PDF

Article
Open access
Published: 09 December 2020

Engineering rules that minimize germline silencing of transgenes in simple extrachromosomal arrays in C. elegans

Nature Communications volume 11, Article number: 6300 (2020) Cite this article

9000 Accesses
25 Citations
69 Altmetric
Metrics details

Subjects

Abstract

Transgenes are prone to progressive silencing due to their structure, copy number, and genomic location. In C. elegans, repressive mechanisms are particularly strong in the germline with almost fully penetrant transgene silencing in simple extrachromosomal arrays and frequent silencing of single-copy transgene insertions. A class of non-coding DNA, Periodic A_n/T_n Clusters (PATCs) can prevent transgene-silencing in repressive chromatin or from small interfering RNAs (piRNAs). Here, we describe design rules (codon-optimization, intron and PATC inclusion, elevated temperature (25 °C), and vector backbone removal) for efficient germline expression from arrays in wildtype animals. We generate web-based tools to analyze PATCs and reagents for the convenient assembly of PATC-rich transgenes. An extensive collection of silencing resistant fluorescent proteins (e.g., gfp, mCherry, and tagBFP) can be used for dissecting germline regulatory elements and a set of enhanced enzymes (Mos1 transposase, Cas9, Cre, and Flp recombinases) enable efficient genetic engineering in C. elegans.

Improving prime editing with an endogenous small RNA-binding protein

Article Open access 03 April 2024

Jun Yan, Paul Oyler-Castrillo, … Britt Adamson

Genome engineering with Cas9 and AAV repair templates generates frequent concatemeric insertions of viral vectors

Article 08 April 2024

Fabian P. Suchy, Daiki Karigane, … Hiromitsu Nakauchi

Spatially organized cellular communities form the developing human heart

Article Open access 13 March 2024

Elie N. Farah, Robert K. Hu, … Neil C. Chi

Introduction

Cells protect themselves by limiting foreign DNA expression, including transposons and transgenes, via small-RNA pathways and heterochromatin formation¹. In mammals, conventional plasmid vectors are transcriptionally silenced in vivo in somatic cells, limiting our ability to develop efficient gene therapies². Removing the bacterial backbone by recombination (minicircles)³ or optimizing the codon-usage of transgenes⁴ can increase expression. However, the effects of codon-optimization vary and may negatively impact the safety and efficacy of therapeutic proteins⁵. Alternatively, increasing the vector backbone’s A_n/T_n composition can reduce transgene-silencing for improved gene therapy⁶. Transgene silencing is not limited to animal cells. In plants, transgene silencing is also a significant roadblock to introducing desirable traits⁷. Transgene silencing often occurs in a stochastic manner over time, but expression can be stimulated through unknown mechanisms by the inclusion of introns⁸. In plants, silencing is linked to transgene structure, copy number, and double-strand RNA (dsRNA) generation⁹. Thus, despite some advances in limiting transgene silencing, this phenomenon remains a significant barrier to the development of transgenic technologies for biomedical and biotechnological use.

In the nematode Caenorhabditis elegans, the easiest and most commonly used method to generate transgenic animals is the injection of DNA into the germline syncytium, where semi-stable extrachromosomal and repetitive arrays are formed^10,11. Persistent transgene expression in somatic cells is readily achieved from simple arrays¹², whereas germline expression is only observed for a few generations before transgene-silencing occurs¹³. This difference between cell types has likely evolved because the silencing of foreign DNA, such as transposable elements, is of particular importance in germ cells to prevent heritable defects leading to reduced fitness. Thus, germ cells face an inherent problem of balancing opposing pathways that repress and promote germline expression, respectively¹⁴. Candidate gene approaches^15,16,17, and genome-wide RNA interference screens^10,18 have identified small-RNA, chromatin, and splicing pathways that mediate active silencing processes in the germline. These studies have been essential for understanding the mechanistic basis of transgene silencing. However, they are of limited experimental use for biomedical or biotechnological transgene expression because mutant backgrounds frequently show germline defects that include maternal-effect sterility¹⁹, accumulation of mutations caused by transposons²⁰, or progressive transgenerational loss of germline immortality²¹.

Instead, several technical approaches independent of genetic background have been developed to overcome germline silencing of C. elegans transgenes: mimicking a genomic environment by co-injecting genomic DNA¹³, low-copy transgene insertion by biolistic transformation²², single-copy transgene insertion into defined or random genomic locations^23,24, and CRISPR/Cas9 insertion²⁵. However, none of these methods entirely prevent silencing, and all are substantially more labor- and time-intensive compared with generating simple, extrachromosomal arrays¹⁰. From a scientific and practical perspective, there remains, therefore, considerable interest in understanding transgene silencing and developing methods that prevent this silencing.

A pervasive non-coding DNA structure named Periodic A_n/T_n Clusters (PATCs) comprises a substantial fraction (6–10%) of the C. elegans genome²⁶. We and others have previously shown that the inclusion of PATC-rich introns into single-copy integrated transgenes offered significant protection from germline silencing^{27,28,29,30,31}. Together these experiments suggested that PATCs may be generally useful for protecting transgenes from silencing in the germline. However, the application of PATCs to prevent silencing has been limited by the technical difficulty of identifying PATC-rich DNA sequences and generating PATC-rich transgenes. Furthermore, best practices for generating silencing-resistant transgenes are spread over a disparate collection of published and unpublished manuscripts³².

In this work, we develop a suite of tools and determine a set of engineering rules that largely prevent transgene silencing in simple extrachromosomal arrays in the germline of wildtype C. elegans. We develop an integrated web interface (www.wormbuilder.org/PATC/) that computes PATC scores and allows interactive browsing of pre-computed PATC values for all protein-coding genes in C. elegans. We show that insertion of PATC-rich introns in a one-pot reaction using validated and standardized reagents generates silencing-resistant transgenes. We use this ease of engineering transgenes and generating array animals to test the effects of codon adaptation, number and placement of introns, transgene concentration, optimal placement of fluorescent tags, temperature, and removal of the plasmid backbone, in addition to characterizing the protective effects of PATC-rich introns. Finally, we use these rules to generate collections of silencing-resistant fluorescent proteins that recapitulate endogenous gene regulation in the germline and high-efficiency gene-editing enzymes (e.g., Mos1 transposase and Cas9). In aggregate, these resources will broadly facilitate experiments across C. elegans laboratories.

Results

Online tools to identify and analyze PATC-rich sequences

We developed a user-friendly, versatile web server to identify and quantify PATCs. The published PATC algorithm²⁶ is written in a narrowly used language (Pascal) and needs to be compiled for a specific operating system. Thus, the “activation energy” for studying PATCs is relatively high and requires a certain level of bioinformatics expertize. To facilitate the identification of PATCs, we updated our analysis of PATCs using the C. elegans genome build (ce11) and developed a set of online tools with an interactive graphical interface that can be accessed at www.wormbuilder.org/PATC/. The online app allows the computation of PATC content and phasing of any DNA sequence by either uploading a FASTA-formatted text file or by simple “copy-paste” (Fig. 1a). The tools also allow users to identify protein-coding genes with high PATC content or identify genomic regions with PATCs using a genome browser (Fig. S1). These tools make it significantly easier for other researchers to use or study the role of PATCs.

**Fig. 1: A set of tools to analyze and engineer transgenes with periodic A_n/T_n clusters (PATCs).**

Efficient generation of PATC-rich transgenes

Our second aim was to facilitate the insertion of PATC-rich introns into transgenes for use in C. elegans or other organisms⁶. Incorporating introns is technically challenging with homology-based methods such as “Gibson” cloning³³. Therefore, we developed an efficient protocol to insert up to four introns into a synthetic fluorescent proteins by Golden Gate cloning³⁴ (Fig. 1b) using a collection of PATC-rich introns (Table 1). Donor plasmids with introns do not contain splice acceptor and donor sequences, and are, therefore, compatible with species specific-splicing signals³⁵ incorporated in the synthetic “acceptor” transgene.

Table 1 Golden gate-compatible endogenous C. elegans introns.

Full size table

PATC-rich transgenes are expressed in the germline from simple arrays

An experimentalist building a transgene faces many design decisions: should the coding sequence be optimized? If introns are added, how many are necessary, and where should they be placed? Should these introns contain PATCs to mitigate silencing? Moreover, what regulatory elements (e.g., promoters and 3′-UTRs) and strategies for co-expression (e.g., operons or viral 2A peptides) are most efficient? What genetic contexts are compatible with expression? Is single-copy transgene integration into a safe harbor required, or will simpler extrachromosomal arrays suffice? If arrays are suitable, how much DNA needs to be injected, and what form of DNA (circular or linear) gives the most consistent expression? We set out to answer these questions and determine a set of practical and reliable design rules that minimize transgene silencing in the C. elegans germline from simple extrachromosomal repetitive arrays (“arrays”). We have focused on the germline as this tissue is the most difficult for expression and is the subject of persistent efforts to understand small-RNA-mediated silencing mechanisms.

We based our initial transgene designs on two gfp-tagged genes, smu-1 and smu-2, which are unusual because they are expressed in the germline from X-ray integrated simple arrays^36,37. These genes are highly enriched for PATCs²⁶, suggesting that PATCs might generally enable germline expression from arrays. To determine germline expression requirements, we generated a gfp-tagged smu-1 transgene based on Spike et al.³⁶. We observed reproducible germline (and somatic) expression from extrachromosomal arrays containing the smu-1::gfp transgene (Fig. 1c and Fig. S2). Arrays containing the smu-1::gfp transgene were expressed at high frequency using different combinations of germline promoters (Psmu-2, Ppie-1, and Pmex-5) and 3′-UTRs (smu-2 and tbb-2)^38,39 (Fig. S3). We conclude that genomic integration is not required for germline expression of a multi-copy array and that several commonly used promoters and 3′-UTRs can be used for expression.

These data suggest that sequences or signals intrinsic to the smu-1-coding region are significant determinants of germline expression from arrays. To define general rules for anti-silencing by PATCs, we designed transgenes encoding green fluorescent protein (GFP) that do not contain any homology to endogenous coding sequences or known piRNAs, which can silence germline genes⁴⁰. These synthetic genes also lacks homology to 22 G RNAs, which are thought to protect genes from silencing through a pathway dependent on the Argonaute protein CSR-1^41,42,43. The transgenes were designed using a popular web-based platform for C. elegans codon adaptation⁴⁴, and a custom algorithm to eliminate sequences homologous to known piRNA sequences^40,45. We generated several GFP variants that contained several (two to four) small, synthetic introns that were designed to enable subsequent insertion of large stretches of PATC-rich sequence. When this optimized “ce-gfp_PATC” containing PATC-rich introns was inserted between the promoter and 3′-UTR from the smu-1 gene, GFP was robustly expressed (Fig. 1d).

We generated additional ce-gfp transgenes using our Golden-Gate-based cloning approach (Fig. 1b) to test the role of PATC-rich introns in isolation. We exchanged the synthetic introns in ce-gfp for 250 bp and 900 bp native introns with or without PATCs and quantified germline expression from arrays using a pie-1 promoter (Fig. 2a). A commonly used gfp that is not codon-optimized for C. elegans (distributed in the Fire lab vector kits, Andrew Fire unpublished reagents) was expressed at low frequency in the germline. Codon-optimization modestly increased the frequency of expression but did not reach statistical significance (Fig. 2a). In contrast, germline expression was observed at high frequency in all animals with arrays carrying ce-gfp_PATC transgenes with native 250 bp and 900 bp introns. To address whether the enhanced germline expression was the result of using native introns, we replaced the PATC-rich introns with 250 bp and 900 bp endogenous introns lacking PATCs and observed no enhanced germline expression (Fig. 2a).

**Fig. 2: PATCs promote germline expression from simple extrachromosomal arrays.**

We tested the modest effect of codon-optimization using the smu-1 promoter (Fig. 2b). With Psmu-1, we observed substantial germline array expression of ce-gfp with synthetic introns and a further increase from adding PATC-rich introns (Fig. 2b). A codon-optimized mCherry (ce-mCherry) with synthetic introns was poorly expressed using Psmu-1, but PATC-rich introns significantly increased expression (Fig. 2c). Similarly, a smu-1 promoter from C. briggsae (Pcbr-smu-1) also required addition of PATCs for robust germline expression (Fig. S4). These results demonstrate that codon-optimization in itself does not necessarily ensure germline expression.

How many PATC-rich introns are required for this anti-silencing effect? We tested the effect of individual smu-2 introns (rather than all four smu-2 introns as in Fig. 1d). A single intron (intron 3) significantly stimulated germline expression (Fig. 2d). In contrast, two smu-2 introns (introns 5 and 6) resulted in only a modest but non-significant increase in expression, and one intron (intron 4) had no effect (Fig. 2d). This unequal effect was not due to where the introns were positioned in the transgene as intron three increased germline expression from arrays at all four locations (Fig. 2e). Combining intron three with the other smu-2 introns did not further increase expression, and artificial introns with high levels of PATCs were ineffective at increasing expression (Fig. S5). Although codon-optimization and inclusion of PATCs are generally effective improvements, we were unable to generate a silencing-resistant tandem dimer Tomato (tdTomato)⁴⁶. We observed only infrequent expression in the germline that was prone to rapid bleaching (Fig. S6). tdTomato silencing was not due to the two tandem repeats as an analogous tandem dimer ce-gfp was expressed at high frequency (Fig. S6). Although not perfect, we conclude that PATC-rich introns generally stimulate germline expression from arrays and that not all PATC-rich introns are equally effective.

Fluorescently tagged endogenous genes inserted as single-copy transgenes by MosSCI are sensitive to germline silencing depending on where the gfp is positioned. For example, rde-3 and cdk-1 with N-terminal gfps were frequently silenced, whereas C-terminally tagged genes were rarely silenced⁴⁷. We observed the same effect in arrays: a smu-1::gfp transgene was frequently expressed in the germline whereas a gfp::smu-1 transgene was consistently silenced (Fig. 2f). Thus, we suggest inserting foreign DNA sequences at the C-terminus of endogenous genes, if possible, for optimal germline expression.

Complex extrachromosomal arrays approximate euchromatin by injecting high concentrations of genomic carrier DNA (50–100 ng/ul) and low concentrations of the transgene (1–2 ng/ul), which can prevent germline silencing¹³. Complex arrays are infrequently used as they are challenging to generate, and expression is not easily maintained. To test if our experimental conditions are similar to complex arrays, we tested the effect of transgene concentration on germline expression (Fig. 2g). For simple arrays, germline expression of Ppie-1::ce-gfp_PATC increased with higher transgene concentrations (Fig. 2g). Furthermore, PATC-rich carrier DNA did not prevent transgene silencing. Instead, high concentrations of carrier DNA with PATCs occasionally caused unusual aberrant germline morphology (Fig. S7). We, therefore, recommend injecting transgenes at high concentrations and using standard DNA ladder as carrier DNA.

In aggregate, these results establish a basic set of design rules that improve germline expression. Germline expression from transgenes in simple arrays is possible, and PATC-rich introns stimulate expression. We have found no evidence that PATC-rich promoters or 3′-UTRs improve germline expression. Transgenes consisting of fluorescently tagged endogenous genes are more efficiently expressed when tagged at the C-terminus, which is in agreement with previous observations from single-copy insertions⁴⁷. Finally, higher transgene concentrations in the injection mix result in more-frequent germline expression but inclusion of PATC-rich carrier DNA appears to be toxic. Although we primarily focused on determining a set of applicable rules for improved germline expression, two of our results provide biological insight into requirements for transgene expression. First, there is no strict requirement for protein-coding sequences from endogenous genes (smu-1 or smu-2), suggesting that PATCs and the csr-1 (which depends on homology to coding regions) pathway are complementary. Second, array silencing in the germline was proposed to result from divergent transcription and dsRNA intermediates leading to RNAi-mediated silencing⁴⁸. This model is difficult to reconcile with the observation that higher plasmid concentrations increase germline expression unless PATCs prevent antisense transcription.

Early introns and backbone removal increase germline expression

Transposons are detected in the yeast Cryptococcus Neoformans by nuclear RNAi machinery that scans for sub-optimal introns⁴⁹. In C. elegans, a similar mechanism has been proposed, with introns acting as a barrier to repressive nuclear RNAi pathways that act via the EMB-4 RNA helicase¹⁵. Perhaps, N-terminal gfp-tagged genes are prone to silencing owing to unusual codon-usage or intron structure, specifically at the 5′ end of genes? To explore this possibility, and the general requirement for introns to enable germline expression, we generated chimeric transgenes consisting of smu-1 genomic DNA and cDNA with a C-terminal gfp tag (Fig. 3a). We observed robust germline fluorescence from transgenes with introns at the 5′ end of the gene but virtually no expression from a smu-1 cDNA or a smu-1 mini-gene lacking the first five introns (Fig. 3a). A single synthetic intron at the 5′ end of smu-1 restored germline fluorescence, establishing that endogenous introns are not required. Similarly, trans-spliced promoters (that have “half” of a splicing reaction in the 5′ -UTR) also partially restored germline expression from arrays in the absence of 5′ introns (Fig. S8). This improved expression could be due to improved transcription and mRNA processing or by enhanced translation efficiency⁵⁰. Surprisingly, the smu-1 cDNA transgene was expressed in the germline from single-copy transgene insertions, whereas the chimeric transgene remained silenced, showing that transgene context can play a role in transgene silencing or detection (Fig. 3b).

**Fig. 3: The effects of splicing and the plasmid backbone on germline expression.**

We tested the role of splicing in detail by generating Psmu-1::ce-gfp transgenes with a variable number of synthetic introns (all lacking PATCs) at various locations and monitoring germline expression from arrays (Fig. 3c–e). ce-gfps with no introns or a single intron were infrequently expressed in the germline (Figs. 3c and S9). In contrast, ce-gfps with two introns were expressed at consistently high frequency when one intron was located near the 5′ end of the coding region (at base number 48) (Fig. 3c). Further experiments showed that efficient germline expression required a short first exon (<350 base pairs and preferably shorter than 150 base pairs) combined with a second intron anywhere (Fig. 3d). This is similar to observations in human cells, where short first exons (~250–500 base pairs) serve as position-dependent transcriptional enhancers that act via activating histone modifications (H3K4me3 and H3K9ac) leading to higher expression levels⁵¹. Furthermore, short first exons promote transcriptional accuracy and reduce antisense transcription by repressing transcriptional initiation within the first exon⁵¹. Antisense transcription is a potent trigger for small-RNA mediated silencing in the germline⁵² and transgenes with long first exons may, therefore, be actively silenced. In support of this model, Makeyeva and colleagues have recently shown that genes from which introns have been removed become default targets of small-RNA silencing in the germline (Y.V. Makeyeva and C.C. Mello, personal communication 2020).

Are transgenes with no introns at the 3′ end of the coding region expressed at all? We tested several ce-gfp transgenes in different genetic contexts to determine whether they were expressed in some circumstances (Figs. 3e and S9). We observed robust germline expression from these sub-optimal single-copy transgenes when inserted into a permissive genomic environment (ttTi5605), illustrating the importance of transgene copy number and chromatin context for germline silencing. Somatic transgene expression in C. elegans is improved by removing the plasmid backbone by PCR amplification or restriction digest and gel purification⁵³, similar to how backbone removal increases the perdurance of transgene expression in mammals³. We observed frequent germline expression when ce-gfp transgenes were PCR amplified or gel-purified (Fig. 3e). Restriction enzyme digestion of the vector backbone alone was also sufficient to increase germline fluorescence (Fig. 3e), a somewhat more convenient approach for large transgenes. Many of our experiments were done using the pCFJ150 backbone for cbr-unc-119(+) selection in arrays and to make transgenes compatible with single-copy insertion. The pCFJ150 backbone contains two 1.5 kb genomic homology regions (in addition to the 2.1 kb cbr-unc-119 selection marker) flanking the transgene, which could conceivably shield from silencing. However, transgenes inserted into a backbone vector (pDESTR4-R3) with no other nematode DNA were expressed in the germline at similar or higher frequency (Fig. S10). Other vector backbones may result in reduced or increased silencing from circular plasmids, but cloning is not limited to a single vector.

We conclude that the inclusion of two introns, with one in the first 150 base pairs of the coding sequence and removing the cloning vector backbone, stimulates germline expression. These observations raise questions about how the germline’s silencing machinery identifies and silences foreign DNA elements based on a combination of copy number and transgene structure.

Germline co-expression using viral 2A peptides and operons

The relative ease of expressing a fluorescent protein located at the 3′-end of a PATC-rich gene suggested that it might be possible to bypass silencing by expressing transgenes downstream of an endogenous gene (e.g., a gfp downstream of smu-1). We tested this strategy using two different methods for co-expressing genes in C. elegans: viral 2A peptides and operons. 2A peptides allow co-expression of two or more genes by ribosomal skip mechanisms that occur during protein translation, an approach that has been validated in C. elegans⁵⁴. We tested four different 2A peptide sequences (E2A, F2A, P2A, and T2A) for expressing ce-gfp downstream of a full-length smu-1 gene using smu-1 or pie-1 promoters (Fig. 4a, b). Three of the four 2A peptides allowed co-expression but at reduced frequency compared to smu-1::gfp fusions, despite codon-optimizing ce-gfp (Fig. 4a, b).

**Fig. 4: The effect of viral 2A peptides and operon sequences on germline co-expression of genes.**

The second co-expression strategy relied on endogenous operons⁵⁵, which is a common organization for germline-expressed genes⁵⁶. We tested smu-1 and ce-gfp co-expression using the frequently used mai-1/gpd-2 operon and three additional operons with high PATC content (Fig. 4c, d). All four operons allowed germline expression at high frequency from Psmu-1 (Fig. 4c), whereas gpd-2 and par-4 resulted in more-frequent expression when using a pie-1 promoter (Fig. 4d). These results indicate that high PATC content in intergenic operon sequences is unnecessary and does not promote germline expression from arrays.

We conclude that a transgene (ce-gfp) can be expressed downstream of 2A peptides and intergenic operon sequences. Owing to the higher efficiency and native function in C. elegans, we recommend using the gpd-2 operon sequence for this strategy.

Germline expression is stable but temperature dependent

In C. elegans, growth at high temperature (25 °C) partially prevents gradual silencing of germline-expressed transgenes⁵⁷ via poorly understood mechanisms. All experiments described until now were therefore performed at 25 °C. We tested long-term germline expression and temperature-dependent silencing by establishing transgenic lines at 25 °C and transiently shifting one group of animals to 20 °C for two generations. For Psmu-1::ce-gfp transgenes, we observed persistently high expression for many generations at 25 °C but gradually reversible silencing at 20 °C (Fig. 5a, b). Transgenes containing endogenous coding sequences encoded by full-length and chimeric smu-1::gfp showed similar temperature-dependent silencing with one difference: smu-1 transgenes were fully de-silenced in the first generation after returning animals to 25 °C (Figs. 5c, d, and S11).

**Fig. 5: Reversible, temperature-dependent silencing of PATC-rich transgenes.**

We conclude that transgenes in simple arrays can be indefinitely expressed in the germline when animals are maintained at 25 °C. The expression state can be reversed over a few generations by switching between 20 °C and 25 °C. To our knowledge, such reproducible and full reversibility has not been observed before and could be a useful paradigm for studying mechanisms that lead to transgenerational silencing in response to a simple environmental change⁵⁸.

PATC-rich transgenes recapitulate endogenous germline expression

Germline expression from PATC-rich transgenic arrays may facilitate experimentation to understand germline regulatory elements (e.g., promoter bashing⁵⁹ or 3′-UTR regulation³⁸). However, such experiments depend on PATC-rich introns not influencing expression themselves by, for example, acting as enhancers.

To test if more accurate promoter expression patterns can be captured from expressing optimized fluorescent proteins in arrays, we tested a 4.7 kb promoter from synaptobrevin (Psnb-1) driving the expression of three fluorescent proteins: gfp, ce-gfp, and ce-gfp_PATC. Synaptobrevin has a role in neurotransmission and is primarily expressed in neurons based on antibody staining⁶⁰. However, mRNA expression suggests expression in the germline⁶¹. Transgenic animals with arrays showed consistent germline expression only when using the ce-gfp_PATC transgene (Fig. 6a). Germline expression is unlikely to be driven by endogenous germline enhancers in the PATC-rich introns because the same PATC-rich ce-gfp was not expressed in the germline when paired with the minimal pes-10 promoter⁶². The absence of germline expression is not because enhancers placed downstream of the minimal promoter are not active; tissue-specific enhancers⁶³ inserted into PATC-rich introns yielded expression in seam cells and the ventral cord neurons (Fig. 6b). We note an important caveat to these experiments: germline and somatic promoters may have fundamentally different architectures⁶¹. Therefore, the minimal promoter from the “soma only” pes-10 gene⁶² may not accurately capture germline enhancer activity. However, to our knowledge, no alternative minimal promoter has been used to study germline enhancers. PATC-rich fluorescent proteins could help identify germline-specific minimal promoters and experimental validation of differences between germline and somatic promoters.

**Fig. 6: PATC-rich fluorescent proteins are not ectopically expressed.**

3′-UTRs regulate stage-specific expression within the germline³⁸, and we tested if optimized fluorescent proteins capture this regulation. We generated Pmex-5::ce-gfp_PATC transgenes with two 3′-UTRs known to regulate gene expression in specific regions of the germline (fbf-2 and spn-4 3′-UTRs), and one 3′-UTR that permits ubiquitous expression in all germ cells (pgl-3 3′-UTR)³⁸. All these constructs showed the expected expression patterns from arrays (Fig. 6c).

We conclude that arrays with ce-gfp_PATC transgenes can accurately report known germline regulation via promoters and 3′-UTRs. We have generated and validated an extensive collection of codon-optimized fluorescent proteins with or without PATCs (Table 2) for use with arrays or single-copy insertions (e.g., MosSCI or CRISPR tagging) and have deposited the collection with Addgene.

Table 2 Optimized fluorescent proteins and tags.

Full size table

Efficient genetic engineering with optimized transgenes

More efficient genetic engineering can accelerate a diverse range of research in laboratories using C. elegans. Many genetic engineering techniques (e.g., MosSCI, miniMos, CRISPR/Cas9, and in vivo recombination with FRT or LoxP sites^{23,24,25,64,65,66}) rely on transient germline expression of injected DNA. Therefore, we reasoned that a set of gene-editing enzymes optimized for consistent and sustained germline expression was likely to improve gene-editing efficiency.

First, we generated optimized, PATC-rich transgenes encoding the Mos1 transposase (ce-Mos1_PATC) and tested the efficiency of generating MosSCI insertions at one safe harbor insertion site on Chr. V (oxTi365)²⁴. Injection of Psmu-1::ce-Mos1_PATC generated MosSCI insertions at significantly higher frequency (Fig. 7a) with the highest insertion frequency achieved when using the transgene at 10 ng/ul (Fig. S12).

**Fig. 7: PATC-rich transgenes mediate efficient gene-editing.**

Transgenic animals carrying arrays are relatively easy to generate compared with single-copy insertions (e.g., MosSCI or CRISPR-tagged alleles). Therefore, it would be advantageous if single-copy insertions could be reliably generated from the continued propagation of a few “founding” array animals. However, single-copy integrations occur almost exclusively in the first few generations²⁴, presumably owing to progressive transgene silencing. To test if a silencing-resistant ce-Mos1_PATC could extend this editing window, we co-injected Psmu-1::Mos1_PATC with a miniMos transposon carrying a 6.0 kb Peft-3::ce-gfp transgene. We purposely picked array animals with no single-copy insertions segregating in the first three generations to identify insertions generated in later generations insertions. We observed a continuous increase in the number of independent insertions in these transgenic lines until we stopped the experiment after eight generations (Fig. 7b). The optimized Psmu-1::Mos1_PATC also reduced the previously observed strong temperature-dependence of insertion frequency²⁴. Generating insertions by simply propagating strains may be an appealing protocol for researchers with limited injection experience and could potentially also be used to generate many independent insertions for large-scale transposon collections (e.g., enhancer or gene traps).

We also generated an optimized Cas9 transgene (Cas9_PATC) with piRNA homology removed and tested the efficiency for CRISPR-based gfp tagging at the endogenous his-72 locus²⁵. A comparison between a commonly used Peft-3::Cas9 plasmid (pDD133) and Psmu-1::Cas9_PATC showed modest but significantly higher insertion frequency after optimization (Fig. 7c).

Finally, we tested the efficiency of removing a single-copy integrated rescue marker (cbr-unc-119(+)) by recombination with optimized Cre (Psmu-2::Cre_PATC) or enhanced flp (Psmu-2::eFlp_PATC) recombinases. Both recombinases excised the cassette at high efficiency (~70–80%), quantified by Unc animals on plates two generations after injection (Fig. 7d).

In our laboratory, optimized enzymes with PATCs have consistently improved genetic engineering efficiency and enabled gene-editing for more generations from array animals, presumably by increasing enzyme levels and the duration of germline expression. We have deposited a small collection of optimized gene-editing enzymes at Addgene (Table 3). We propose that including PATCs in enzymes will be a generally useful way to improve the efficiency of current and future gene-editing technologies in C. elegans.

Table 3 Optimized gene-editing enzymes.

Full size table

Discussion

Here, we have generated reagents and determined a set of rules that allow persistent expression of most transgenes in the germline from simple, extrachromosomal arrays. Rule 1: codon-adaption⁴⁴, piRNA removal⁶⁷, and the addition of introns can improve expression but is rarely sufficient in itself. Rule 2: PATC-rich introns in the coding region improve expression, whereas PATCs in the promoter or 3′-UTR appear to be dispensable. Several PATC-rich introns can be inserted in a single reaction by Golden-gate based cloning or, alternatively, a single intron (intron three from smu-2) can be inserted by standard cloning. The shorter 250 bp introns are in most cases preferable although the longer 900 bp introns were better at preventing transgene silencing in repressive chromatin²⁹. Rule 3: fluorescent proteins inserted at the C-terminus are less prone to silencing. Rule 4: high transgene concentration (25 ng/ul) enhances expression from simple arrays. Rule 5: two introns, with one intron placed in the first 150 base pairs, stimulate expression. Rule 6: removal of the vector backbone by PCR or restriction digest can prevent silencing of non-optimal transgenes. Rule 7: viral 2A peptides and operons can be used to co-express endogenous genes and transgenes. Rule 8: propagating transgenic strains at high temperature (25 °C) allows persistent transgene expression and enables gene-editing for additional generations.

We hope that a description of general transgene engineering rules together with a set of standardized reagents to generate transgenes will facilitate experiments for researchers working on the C. elegans germline or those that wish to engineer the genome. In addition, we have aimed to enable research on an enigmatic class of non-coding DNA that has a striking effect on preventing gene silencing in C. elegans. Further investigation by us and others will expand upon and reveal mechanisms underlying the resources developed here.

Methods

Strains

C. elegans strains were cultured on nematode growth media (NGM) feeding on OP50 or HB101 bacteria and maintained at 15 °C, 20 °C, or 25 °C. unc-119(ed3) animals were cultured at 15 to 20 °C on HB101 bacteria, whereas rescued, transgenic array animals were cultured on OP50 bacteria.

Transgenic animals

We followed standard protocols for generating strains with extrachromosomal arrays¹⁰ or single-copy insertions by MosSCI, MiniMos, or CRISPR/Cas9^23,24,25.

Extrachromosomal arrays: we injected into unc-119(ed3) animals derived from a 10× outcrossed mutant strain (PS6038) or into the N2 wildtype strain. Selection for arrays was provided by Unc-119 rescue (for plasmids with cbr-unc-119 in backbone) or antibiotic resistance to hygromycin B⁶⁸ using the HygroR plasmid pCFJ782 by adding 500 µl of a 4 mg/ml stock solution (Gold Biotechnology, cat. no. H-270-10) to seeded NGM plates. Every transgenic array line was derived from an independently injected animal.

MosSCI insertions: we inserted single-copy transgenes cloned into pCFJ150 into the universal MosSCI insertion site oxTi365 on Chr. V by injection into the strain EG8082. The injection mix consisted of 10 ng/ul of the targeting vector pCFJ113 (Peft-3::ce-gfp::tbb-2 3′-UTR) in a pCFJ150 backbone, 10 ng/ul of pCFJ1532 (Psmu-1::mosase_PATC), 10 ng/ul pCFJ104 (Pmyo-3::mCherry), 10 ng/ul pGH8 (Prab-3::mCherry), 2.5 ng/ul pCFJ90 (Pmyo-2::mCherry), 10 ng/ul pMA122 (hs::peel-1) and 47.5 ng/ul 1 kb DNA ladder SM1331 (ThermoFisher). We identified insertions by selecting for Unc-119 rescued animals with no fluorescent co-injection markers and no lethality in response to heat-shock expression of the peel-1 toxin.

MiniMos insertions: we injected 10 ng/ul of a miniMos element (pCFJ1402 - Peft-3::ce-gfp_PATC::tbb-2 3′-UTR cbr-unc-119(+)) into unc-119(ed3) animals. The injection mix consisted of 10 ng/ul pCFJ1532 (Psmu-1::mosase_PATC), 10 ng/ul pCFJ104 (Pmyo-3::mCherry), 10 ng/ul pGH8 (Prab-3::mCherry), 2.5 ng/ul pCFJ90 (Pmyo-2::mCherry) and 57.5 ng/ul 1 kb DNA ladder SM1331 (ThermoFisher). The backbone of pCFJ1402 contains the negative selection marker hs::peel-1 to kill animals with extrachromosomal arrays in response to heat-shock. To test for insertions across generations, we picked three independent transgenic array lines that did not segregate miniMos insertions in the first three generation and propagated the lines at 20 °C and 25 °C on 10 plates each. For every generation, we transferred approx. ten animals to new plates before heat-shocking starved plates to identify miniMos insertions based on the lack of fluorescent co-injection markers. Any plate that gave an insertion was not propagated for more generations to avoid counting any transposon insertion twice.

CRISPR/Cas9 insertions: we tagged the his-72 locus with gfp. We generated a 4.5 kb repair template (pMNK17) derived from pDD129²⁵ with 400 bp homology regions and cbr-unc-119(+) rescue. Importantly, the repair template does not contain the his-72 promoter and no GFP fluorescence is observed prior to successfully tagging the endogenous his-72 locus. A single guide RNA with the spacer 5′-AGCTTAAGCACGTTCTCCG-3′ was expressed from a plasmid (pMNK18) using a U6 promoter. We expressed Cas9 from plasmids pCFJ1646 (Peft-3::Cas9) or pCFJ2474 (Psmu-2::Cas9_PATC::sl2::tagRFP). The injection mix consisted of 25 ng/ul of the Cas9 plasmid (pCFJ1646 or pCFJ2474), 10 ng/ul of the repair template (pMNK17), 25 ng/ul of the sgRNA (pMNK18), and 40 ng/ul 1 kb DNA ladder SM1331 (ThermoFisher). We injected this mix into unc-119(ed3) animals and identified insertions based on Unc-119 rescue and ubiquitous GFP expression, including the germline.

All injection strains are available from the Caenorhabditis elegans Genetics Center (CGC).

Imaging

All measurements were taken from distinct samples (defined as an independently generated transgenic animal), except for time-course measurements where the same sample was measured repeatedly every two generations. The sample sizes and all primary data (percentage of animals with fluorescent germline) are included in Source Data. Transgenic animals were generated and imaged in a stereotyped way, as described below, to ensure consistency. In all, 1–2 injected animals were placed on individual NGM plates seeded with HB101 at 25 °C in a temperature-controlled incubator. Plates were allowed to starve out and inspected for rescued F2 progeny, indicating that a plate contained stable transgenic lines. Such plates were “chunked” to a new plate, and a single young F2 adult animal with eggs was picked two days later, ensuring that only a single independent line was picked from any injected animal. Three days later, the F3 progeny of this clonal animal was scored for germline fluorescence by mounting animals on agarose pads (2%) and anesthetizing the animals with 50 mM sodium azide. We imaged animals on upright, non-motorized compound microscopes (Leica DM2500 and Zeiss Axioimager Z.2) with ×42 or ×60 oil immersion objectives and scored germline fluorescence in 11 animals from each independent strain. Both gonad arms were scored for GFP fluorescence and every animal was quantified in a binary way (“on” or “off”). The experimenter was not blinded to the genotype of the transgenic animals.

Molecular biology

Molecular biology was performed using standard protocols and commercial available reagents. A step-by-step protocol describing the Golden-Gate-based method for inserting PATC-rich introns into a synthetic transgene can be found at Protocol Exchange⁶⁹. All reactions were designed using the free molecular biology editor “A plasmid Editor” (ApE) developed and maintained by M Wayne Davis. Annotated DNA sequences for all plasmids are included in the Source Data file. All plasmids are available upon request from Addgene or from the corresponding author.

Statistical analysis

The statistical analysis was performed using GraphPad Prism v8 for macOS. The specific tests performed are listed in the legends of individual figures, and the primary data for every figure is included in the Source Data file. In general, fluorescence expression is stochastically silenced with frequent “all or none” observations (i.e., complete silencing or full expression), and the data do not follow a Gaussian distribution. Therefore, most of the utilized statistical tests are non-parametric tests.

Online analysis

The website www.wormbuilder.org/PATC/ was written in R programming language. Its online execution occurs through an Amazon Web Services (AWS) Elastic Computing Cloud (EC2) instance. The source code can be obtained at: https://github.com/AmhedVargas/PATC_2_0. Please see “Software and code” in the accompanying Reporting Summary for detailed information on all software used, including version numbers.

Reporting summary

Further information on research design is available in the Nature Research Reporting Summary linked to this article.

Data availability

All data generated or analyzed during this study are included in this published article (and its supplementary information files). Any other relevant data are available from the authors upon reasonable request. Source data are provided with this paper.

Code availability

The source code⁷⁰ for www.wormbuilder.org/PATC/ is available at: https://github.com/AmhedVargas/PATC_2_0. Please see Supplementary Methods and the Reporting Summary for detailed information on all software used, including version numbers.

References

Holoch, D. & Moazed, D. RNA-mediated epigenetic regulation of gene expression. Nat. Rev. Genet. 16, 71–84 (2015).
Article CAS PubMed PubMed Central Google Scholar
Kay, M. A. State-of-the-art gene-based therapies: the road ahead. Nat. Rev. Genet. 12, 316–328 (2011).
Article CAS PubMed Google Scholar
Chen, Z.-Y., He, C.-Y., Ehrhardt, A. & Kay, M. A. Minicircle DNA vectors devoid of bacterial DNA result in persistent and high-level transgene expression in vivo. Mol. Ther. J. Am. Soc. Gene Ther. 8, 495–500 (2003).
Article CAS Google Scholar
Brown, H. C. et al. Target-cell-directed bioengineering approaches for gene therapy of hemophilia A. Mol. Ther. Methods Clin. Dev. 9, 57–69 (2018).
Article CAS PubMed PubMed Central Google Scholar
Mauro, V. P. & Chappell, S. A. A critical analysis of codon optimization in human therapeutics. Trends Mol. Med. 20, 604–613 (2014).
Article CAS PubMed PubMed Central Google Scholar
Lu, J., Zhang, F., Fire, A. Z. & Kay, M. A. Sequence-modified antibiotic resistance genes provide sustained plasmid-mediated transgene expression in mammals. Mol. Ther. 25, 1187–1198 (2017).
Article CAS PubMed PubMed Central Google Scholar
Rajeev Kumar, S., Anunanthini, P. & Ramalingam, S. Epigenetic silencing in transgenic plants. Front. Plant Sci. 6, 693 (2015).
Gallegos, J. E. & Rose, A. B. The enduring mystery of intron-mediated enhancement. Plant Sci. 237, 8–15 (2015).
Article CAS PubMed Google Scholar
Velten, J., Cakir, C., Youn, E., Chen, J. & Cazzonelli, C. I. Transgene silencing and transgene-derived siRNA production in Tobacco plants homozygous for an introduced AtMYB90 construct. PLoS ONE 7, e30141 (2012).
Article ADS CAS PubMed PubMed Central Google Scholar
Mello, C. C., Kramer, J. M., Stinchcomb, D. & Ambros, V. Efficient gene transfer in C.elegans: extrachromosomal maintenance and integration of transforming sequences. EMBO J. 10, 3959–3970 (1991).
Article CAS PubMed PubMed Central Google Scholar
Stinchcomb, D. T., Shaw, J. E., Carr, S. H. & Hirsh, D. Extrachromosomal DNA transformation of Caenorhabditis elegans. Mol. Cell Biol. 5, 3484–3496 (1985).
Article CAS PubMed PubMed Central Google Scholar
Fire, A., Harrison, S. W. & Dixon, D. A modular set of lacZ fusion vectors for studying gene expression in Caenorhabditis elegans. Gene 93, 189–198 (1990).
Article CAS PubMed Google Scholar
Kelly, W. G., Xu, S., Montgomery, M. K. & Fire, A. Distinct requirements for somatic and germline expression of a generally expressed Caernorhabditis elegans gene. Genetics 146, 227–238 (1997).
CAS PubMed PubMed Central Google Scholar
Frøkjær-Jensen, C. A balance between silencing foreign DNA and protecting self in Caenorhabditis elegans. Curr. Opin. Syst. Biol. 13, 1–16 (2019).
Akay, A. et al. The helicase aquarius/emb-4 is required to overcome intronic barriers to allow nuclear rnai pathways to heritably silence transcription. Dev. Cell 42, 241–255.e6 (2017).
Article CAS PubMed PubMed Central Google Scholar
Kelly, W. G. & Fire, A. Chromatin silencing and the maintenance of a functional germline in Caenorhabditis elegans. Dev. Camb. Engl. 125, 2451–2456 (1998).
CAS Google Scholar
Tabach, Y. et al. Identification of small RNA pathway genes using patterns of phylogenetic conservation and divergence. Nature 493, 694–698 (2013).
Article ADS CAS PubMed Google Scholar
Kim, J. K. et al. Functional genomic analysis of RNA interference in C. elegans. Science 308, 1164–1167 (2005).
Article ADS CAS PubMed Google Scholar
Holdeman, R., Nehrt, S. & Strome, S. MES-2, a maternal protein essential for viability of the germline in Caenorhabditis elegans, is homologous to a Drosophila Polycomb group protein. Development 125, 2457–2467 (1998).
CAS PubMed Google Scholar
Ketting, R. F., Haverkamp, T. H., van Luenen, H. G. & Plasterk, R. H. Mut-7 of C. elegans, required for transposon silencing and RNA interference, is a homolog of Werner syndrome helicase and RNaseD. Cell 99, 133–141 (1999).
Article CAS PubMed Google Scholar
Buckley, B. A. et al. A nuclear Argonaute promotes multigenerational epigenetic inheritance and germline immortality. Nature 489, 447–451 (2012).
Article ADS CAS PubMed PubMed Central Google Scholar
Praitis, V., Casey, E., Collar, D. & Austin, J. Creation of low-copy integrated transgenic lines in Caenorhabditis elegans. Genetics 157, 1217–1226 (2001).
CAS PubMed PubMed Central Google Scholar
Frøkjær-Jensen, C. et al. Single-copy insertion of transgenes in Caenorhabditis elegans. Nat. Genet. 40, 1375–1383 (2008).
Article PubMed PubMed Central CAS Google Scholar
Frøkjær-Jensen, C. et al. Random and targeted transgene insertion in Caenorhabditis elegans using a modified Mos1 transposon. Nat. Methods 11, 529–534 (2014).
Article PubMed PubMed Central CAS Google Scholar
Dickinson, D. J., Ward, J. D., Reiner, D. J. & Goldstein, B. Engineering the Caenorhabditis elegans genome using Cas9-triggered homologous recombination. Nat. Methods 10, 1028–1034 (2013).
Article CAS PubMed PubMed Central Google Scholar
Fire, A., Alcazar, R. & Tan, F. Unusual DNA structures associated with germline genetic activity in Caenorhabditis elegans. Genetics 173, 1259–1273 (2006).
Article CAS PubMed PubMed Central Google Scholar
Artiles, K. L., Fire, A. Z. & Frøkjær-Jensen, C. Assessment and maintenance of unigametic germline inheritance for C. elegans. Dev. Cell 48, 827–839.e9 (2019).
Article CAS PubMed PubMed Central Google Scholar
Fielmich, L.-E. et al. Optogenetic dissection of mitotic spindle positioning in vivo. eLife 7, e38198 (2018).
Article PubMed PubMed Central Google Scholar
Frøkjær-Jensen, C. et al. An abundant class of non-coding DNA can prevent stochastic gene silencing in the C. elegans germline. Cell 166, 343–357 (2016).
Article PubMed PubMed Central CAS Google Scholar
Rog, O., Köhler, S. & Dernburg, A. F. The synaptonemal complex has liquid crystalline properties and spatially regulates meiotic recombination factors. eLife 6, e21455 (2017).
Article PubMed PubMed Central Google Scholar
Zhang, D. et al. The piRNA targeting rules and the resistance to piRNA silencing in endogenous genes. Science 359, 587–592 (2018).
Article ADS CAS PubMed PubMed Central Google Scholar
Nance, J. & Frøkjær-Jensen, C. The Caenorhabditis elegans transgenic toolbox. Genetics 212, 959–990 (2019).
Article CAS PubMed PubMed Central Google Scholar
Gibson, D. G. et al. Enzymatic assembly of DNA molecules up to several hundred kilobases. Nat. Methods 6, 343–345 (2009).
Article CAS PubMed Google Scholar
Engler, C., Kandzia, R. & Marillonnet, S. A one pot, one step, precision cloning method with high throughput capability. PloS ONE 3, e3647 (2008).
Article ADS PubMed PubMed Central CAS Google Scholar
Sheth, N. et al. Comprehensive splice-site analysis using comparative genomics. Nucleic Acids Res. 34, 3955–3967 (2006).
Article CAS PubMed PubMed Central Google Scholar
Spike, C. A., Shaw, J. E. & Herman, R. K. Analysis of smu-1, a gene that regulates the alternative splicing of unc-52 pre-mRNA in Caenorhabditis elegans. Mol. Cell Biol. 21, 4985–4995 (2001).
Article CAS PubMed PubMed Central Google Scholar
Spartz, A. K., Herman, R. K. & Shaw, J. E. SMU-2 and SMU-1, Caenorhabditis elegans homologs of mammalian spliceosome-associated proteins RED and fSAP57, work together to affect splice site choice. Mol. Cell Biol. 24, 6811–6823 (2004).
Article CAS PubMed PubMed Central Google Scholar
Merritt, C., Rasoloson, D., Ko, D. & Seydoux, G. 3′-UTRs are the primary regulators of gene expression in the C. elegans germline. Curr. Biol. 18, 1476–1482 (2008).
Article CAS PubMed PubMed Central Google Scholar
Zeiser, E., Frøkjær-Jensen, C., Jorgensen, E. & Ahringer, J. MosSCI and gateway compatible plasmid toolkit for constitutive and inducible expression of transgenes in the C. elegans germline. PloS ONE 6, e20082 (2011).
Article ADS CAS PubMed PubMed Central Google Scholar
Bagijn, M. P. et al. Function, targets, and evolution of Caenorhabditis elegans piRNAs. Science 337, 574–578 (2012).
Article ADS CAS PubMed PubMed Central Google Scholar
Claycomb, J. M. et al. The argonaute CSR-1 and Its 22G-RNA cofactors are required for holocentric chromosome segregation. Cell 139, 123–134 (2009).
Article CAS PubMed PubMed Central Google Scholar
Seth, M. et al. The C. elegans CSR-1 argonaute pathway counteracts epigenetic silencing to promote germline gene expression. Dev. Cell 27, 656–663 (2013), https://doi.org/10.1016/j.devcel.2013.11.014.
Wedeles, C. J., Wu, M. Z. & Claycomb, J. M. protection of germline gene expression by the C. elegans argonaute CSR-1. Dev. Cell (2013), https://doi.org/10.1016/j.devcel.2013.11.016.
Redemann, S. et al. Codon adaptation-based control of protein expression in C. elegans. Nat. Methods 8, 250–252 (2011).
Article CAS PubMed Google Scholar
Batista, P. J. et al. PRG-1 and 21U-RNAs interact to form the piRNA complex required for fertility in C. elegans. Mol. Cell 31, 67–78 (2008).
Article CAS PubMed PubMed Central Google Scholar
Shaner, N. C. et al. Improved monomeric red, orange and yellow fluorescent proteins derived from Discosoma sp. red fluorescent protein. Nat. Biotechnol. 22, 1567–1572 (2004).
Article CAS PubMed Google Scholar
Shirayama, M. et al. piRNAs initiate an epigenetic memory of nonself RNA in the C. elegans germline. Cell 150, 65–77 (2012).
Article CAS PubMed PubMed Central Google Scholar
Knight, S. W. & Bass, B. L. The Role of RNA Editing by ADARs in RNAi. Mol. Cell 10, 809–817 (2002).
Article CAS PubMed Google Scholar
Dumesic, P. A. et al. Stalled spliceosomes are a signal for RNAi-mediated genome defense. Cell 152, 957–968 (2013).
Article CAS PubMed PubMed Central Google Scholar
Yang, Y.-F. et al. Trans-splicing enhances translational efficiency in C. elegans. Genome Res. 27, 1525–1535 (2017).
Article CAS PubMed PubMed Central Google Scholar
Bieberstein, N. I., Carrillo Oesterreich, F., Straube, K. & Neugebauer, K. M. First exon length controls active chromatin signatures and transcription. Cell Rep. 2, 62–68 (2012).
Article CAS PubMed Google Scholar
Tabara, H. et al. The rde-1 Gene, RNA Interference, and Transposon Silencing in C. elegans. Cell 99, 123–132 (1999).
Article CAS PubMed Google Scholar
Etchberger, J. F. & Hobert, O. Vector-free DNA constructs improve transgene expression in C. elegans. Nat. Methods 5, 3–3 (2008).
Article CAS PubMed Google Scholar
Ahier, A. & Jarriault, S. Simultaneous expression of multiple proteins under a single promoter in Caenorhabditis elegans via a versatile 2A-based toolkit. Genetics 196, 605–613 (2014).
Article CAS PubMed Google Scholar
Spieth, J., Brooke, G., Kuersten, S., Lea, K. & Blumenthal, T. Operons in C. elegans: polycistronic mRNA precursors are processed by trans-splicing of SL2 to downstream coding regions. Cell 73, 521–532 (1993).
Article CAS PubMed Google Scholar
Reinke, V. & Cutter, A. D. Germline expression influences operon organization in the Caenorhabditis elegans genome. Genetics 181, 1219–1228 (2009).
Article CAS PubMed PubMed Central Google Scholar
Strome, S. et al. Spindle dynamics and the role of γ-tubulin in early Caenorhabditis elegans embryos. Mol. Biol. Cell 12, 1751–1764 (2001).
Article CAS PubMed PubMed Central Google Scholar
Perez, M. F. & Lehner, B. Intergenerational and transgenerational epigenetic inheritance in animals. Nat. Cell Biol. 21, 143–151 (2019).
Article CAS PubMed Google Scholar
Okkema, P. G., Harrison, S. W., Plunger, V., Aryana, A. & Fire, A. Sequence requirements for myosin gene expression and regulation in Caenorhabditis elegans. Genetics 135, 385–404 (1993).
CAS PubMed PubMed Central Google Scholar
Nonet, M. L., Saifee, O., Zhao, H., Rand, J. B. & Wei, L. Synaptic transmission deficits in Caenorhabditis elegans synaptobrevin mutants. J. Neurosci. 18, 70–80 (1998).
Article CAS PubMed PubMed Central Google Scholar
Serizay, J. et al. Distinctive regulatory architectures of germline-active and somatic genes in C. elegans. Genome Res. (2020). https://doi.org/10.1101/gr.265934.120.
Seydoux, G. & Fire, A. Soma-germline asymmetry in the distributions of embryonic RNAs in Caenorhabditis elegans. Dev. Camb. Engl. 120, 2823–2834 (1994).
CAS Google Scholar
Natarajan, L., Jackson, B. M., Szyleyko, E. & Eisenmann, D. M. Identification of evolutionarily conserved promoter elements and amino acids required for function of the C. elegans beta-catenin homolog BAR-1. Dev. Biol. 272, 536–557 (2004).
Article CAS PubMed Google Scholar
Davis, M. W., Morton, J. J., Carroll, D. & Jorgensen, E. M. Gene activation using FLP recombinase in C. elegans. PLOS Genet. 4, e1000028 (2008).
Article PubMed PubMed Central CAS Google Scholar
Friedland, A. E. et al. Heritable genome editing in C. elegans via a CRISPR-Cas9 system. Nat. Methods 10, 741–743 (2013).
Article CAS PubMed PubMed Central Google Scholar
Nonet, M. L. Efficient transgenesis in Caenorhabditis elegans using Flp recombinase-mediated cassette exchange. Genetics (2020), https://doi.org/10.1534/genetics.120.303388.
Wu, W.-S. et al. pirScan: a webserver to predict piRNA targeting sites and to avoid transgene silencing in C. elegans. Nucleic Acids Res. 46, W43–W48 (2018).
Article CAS PubMed PubMed Central Google Scholar
Radman, I., Greiss, S. & Chin, J. W. Efficient and rapid C. elegans transgenesis by bombardment and hygromycin B selection. PloS ONe 8, e76019 (2013).
Article ADS CAS PubMed PubMed Central Google Scholar
Frøkjær-Jensen, C. Insertion of PATC-rich C. elegans introns into synthetic transgenes by golden-gate-based cloning. Protoc. Exch. (2020), https://doi.org/10.21203/rs.3.pex-1253/v1.
Vargas-Velazquez, A. M. Engineering rules that minimize germline silencing of transgenes in simple extra-chromosomal arrays in C. elegans. PATC WebApp Version V100 (2020), https://doi.org/10.5281/zenodo.4159578.
Green, R. A. et al. Expression and imaging of fluorescent proteins in the C. elegans gonad and early embryo. in Methods Cell Biology (ed. Sullivan, K. F.) vol. 85, 179–218 (Academic Press, 2008).
Lee, H.-C. et al. C. elegans piRNAs mediate the genome-wide surveillance of germline transcripts. Cell 150, 78–87 (2012).
Article CAS PubMed PubMed Central Google Scholar

Download references

Acknowledgements

We thank Andrew Z. Fire and Erik M. Jorgensen for experimental support, M. Wayne Davis for bioinformatic assistance, and Kam Hoe for technical assistance. Some strains were provided by the CGC, which is funded by NIH Office of Research Infrastructure Programs (P40 OD010440). We thank the KAUST Bioscience Core Labs and the Linux and Advanced Platforms team for expert assistance and Faisal Alkhaldi for assistance developing www.wormbuilder.org. This work was funded by a KAUST intramural grant and the funders had no role in study design, data collection and analysis, decision to publish, or preparation of the manuscript.

Author information

These authors contributed equally: Mohammed D. Aljohani, Sonia El Mouridi, Monika Priyadarshini, Amhed M. Vargas-Velazquez.

Authors and Affiliations

King Abdullah University of Science and Technology (KAUST), Biological and Environmental Science and Engineering Division (BESE), KAUST Environmental Epigenetics Program (KEEP), Thuwal, 23955-6900, Saudi Arabia
Mohammed D. Aljohani, Sonia El Mouridi, Monika Priyadarshini, Amhed M. Vargas-Velazquez & Christian Frøkjær-Jensen

Authors

Mohammed D. Aljohani
View author publications
You can also search for this author in PubMed Google Scholar
Sonia El Mouridi
View author publications
You can also search for this author in PubMed Google Scholar
Monika Priyadarshini
View author publications
You can also search for this author in PubMed Google Scholar
Amhed M. Vargas-Velazquez
View author publications
You can also search for this author in PubMed Google Scholar
Christian Frøkjær-Jensen
View author publications
You can also search for this author in PubMed Google Scholar

Contributions

Christian Frøkjær-Jensen: conceptualization, methodology, formal analysis, investigation, resources, writing—original draft, writing—review & editing, visualization, funding acquisition. Mohammed D. Aljohani: investigation, writing—review & editing. Sonia El Mouridi: methodology, investigation, writing—review & editing, visualization. Monika Priyadarshini: investigation, writing—review & editing. Amhed M. Vargas-Velazquez: software, data curation, writing—review & editing, visualization.

Corresponding author

Correspondence to Christian Frøkjær-Jensen.

Ethics declarations

Competing interests

The authors declare no competing interests.

Additional information

Peer review information Nature Communications thanks Arshad Desai and Abby Dernburg for their contribution to the peer review of this work. Peer reviewer reports are available.

Publisher’s note Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Supplementary information

Supplementary Information

Reporting Summary

Peer Review File

Source data

Source Data

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons license and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/.

Reprints and permissions

About this article

Cite this article

Aljohani, M.D., El Mouridi, S., Priyadarshini, M. et al. Engineering rules that minimize germline silencing of transgenes in simple extrachromosomal arrays in C. elegans. Nat Commun 11, 6300 (2020). https://doi.org/10.1038/s41467-020-19898-0

Download citation

Received: 18 June 2020
Accepted: 05 November 2020
Published: 09 December 2020
DOI: https://doi.org/10.1038/s41467-020-19898-0

This article is cited by

Endogenous DAF-16 spatiotemporal activity quantitatively predicts lifespan extension induced by dietary restriction
- Javier Huayta
- Joseph P. Crapster
- Adriana San-Miguel
Communications Biology (2023)
Intermediate filaments associate with aggresome-like structures in proteostressed C. elegans neurons and influence large vesicle extrusions as exophers
- Meghan Lee Arnold
- Jason Cooper
- Monica Driscoll
Nature Communications (2023)
Reprogramming the piRNA pathway for multiplexed and transgenerational gene silencing in C. elegans
- Monika Priyadarshini
- Julie Zhouli Ni
- Christian Frøkjær-Jensen
Nature Methods (2022)

Comments

By submitting a comment you agree to abide by our Terms and Community Guidelines. If you find something abusive or that does not comply with our terms or guidelines please flag it as inappropriate.

Subjects

Abstract

Similar content being viewed by others

Introduction

Results

Online tools to identify and analyze PATC-rich sequences

Efficient generation of PATC-rich transgenes

PATC-rich transgenes are expressed in the germline from simple arrays

Early introns and backbone removal increase germline expression

Germline co-expression using viral 2A peptides and operons

Germline expression is stable but temperature dependent

PATC-rich transgenes recapitulate endogenous germline expression

Efficient genetic engineering with optimized transgenes

Discussion

Methods

Strains

Transgenic animals

Imaging

Molecular biology

Statistical analysis

Online analysis

Reporting summary

Data availability

Code availability

References

Acknowledgements

Author information

Authors and Affiliations

Contributions

Corresponding author

Ethics declarations

Competing interests

Additional information

Supplementary information

Source data

Rights and permissions

About this article

Cite this article

Share this article

This article is cited by

Comments

Search

Quick links