Engineering cell sensing and responses using a GPCR-coupled CRISPR-Cas system

G-protein-coupled receptors (GPCRs) are the largest and most diverse group of membrane receptors in eukaryotes and detect a wide array of cues in the human body. Here we describe a molecular device that couples CRISPR-dCas9 genome regulation to diverse natural and synthetic extracellular signals via GPCRs. We generate alternative architectures for fusing CRISPR to GPCRs utilizing the previously reported design, Tango, and our design, ChaCha. Mathematical modeling suggests that for the CRISPR ChaCha design, multiple dCas9 molecules can be released across the lifetime of a GPCR. The CRISPR ChaCha is dose-dependent, reversible, and can activate multiple endogenous genes simultaneously in response to extracellular ligands. We adopt the design to diverse GPCRs that sense a broad spectrum of ligands, including synthetic compounds, chemokines, mitogens, fatty acids, and hormones. This toolkit of CRISPR-coupled GPCRs provides a modular platform for rewiring diverse ligand sensing to targeted genome regulation for engineering cellular functions.

E ukaryotic cells have evolved diverse classes of transmembrane receptors to transduce various extracellular signals into intracellular responses. Binding of receptors and their cognate ligands triggers their intracellular enzymatic activities. Inside the cell, this leads to complex downstream signaling protein activity and secondary messenger functions that ultimately transduce signals to genomic programs. The complexity of these signaling cascades has made it challenging to harness natural signaling pathways for cell engineering in a flexible and programmable manner 1 .
G-coupled protein receptors (GPCRs) are attractive candidates for receptor and cell engineering because they can sense a diverse repertoire of ligands, including endogenous hormones, growth factors, and natural or synthetic small molecules [2][3][4][5][6] . At least 800 GPCRs have been identified from the human genome and are important for regulating human physiology. Consequently, many GPCRs are implicated in diseases and are the targets of 30-40% of modern drugs 4 . Previous work in GPCR engineering replaced the intracellular domains with a proteolytically cleavable artificial transcription factor (e.g., GAL4, rtTA) to create a genetic reporter of receptor activity, called the Tango system 7,8 . However, the use of artificial transcription factors restricted the system's utility to exogenous genes and was unable to regulate endogenous targets. Furthermore, the system behaves as "one ligand in, one transcription factor out", potentially limiting its efficiency. In this work, we focus on harnessing GPCRs as a generic synthetic device that can regulate the expression of endogenous genes in response to their diverse ligands.
CRISPR-Cas9 technologies have revolutionized programmable genome manipulation. Cas9 can be precisely targeted to a genomic locus of interest for gene editing using a customized single guide RNA (sgRNA) [9][10][11] . Beyond editing, the nucleasedeactivated Cas9 (dCas9) molecule can be fused with effector protein domains to regulate transcription of target genes or to modify the epigenome [12][13][14][15][16][17][18] . In addition, multiple engineered sgRNAs can be used to simultaneously regulate multiple genes and drive complex gene expression programs in response to certain ligands 14,15,[19][20][21][22] . The flexibility of CRISPR-dCas9 systems for genome regulation makes them powerful tools for transducing receptor signaling directly to genomic targets and for programming novel sensor-coupled cell behaviors.
Here we repurpose GPCRs to convert extracellular signal sensing into programmed transcriptional responses via CRISPR-dCas9. Using an evolved GPCR that recognizes a synthetic ligand 5 , we compare two architectures for CRISPR-dCas9 gene regulation. Our design, named CRISPR ChaCha, outperforms a CRISPR Tango design 7,8 . We formulate a simple mathematical model supported by experimental data to suggest a "one ligand in, multiple effectors out" feature of the CRISPR ChaCha system. We demonstrate that the CRISPR ChaCha system can activate individual or multiple endogenous genes targeted by dCas9/sgRNA in response to GPCR ligands, and such regulation is dose-dependent and reversible. Importantly, we devise a CRISPR ChaCha toolkit consisting of 8 GPCR-CRISPR systems that sense various synthetic compounds, chemokines, mitogens, fatty acids, and hormones. The CRISPR ChaCha toolkit presented here serves as a useful platform for understanding receptor signaling and for engineering cell-based therapies.

Results
Implementation of two GPCR-coupled CRISPR designs. We created and tested two designs for proteolytically coupling dCas9 function to GPCRs ( Fig. 1a and Supplementary Fig. 1). The first design is derived from a previously reported Tango system 7,8 , where the C-terminus of a GPCR is fused with a V2 tail sequence from arginine vasopressin receptor 2 (AVPR2) and typically a transcription factor (e.g, tTa). Instead, we replaced this transcription factor with a dCas9 effector (Fig. 1a, left) to create CRISPR Tango. An adaptor protein, Beta-Arrestin-2 (ARRB2), which interacts with V2 upon GPCR activation, is fused to a Tobacco Etch Virus protease (TEVp). To release the dCas9 effector, TEVp specifically cleaves the TEV cleavage sequence (TCS) placed at the N-terminus of dCas9. In the second design, we implemented a configuration, where the dCas9 effector is instead fused at the C-terminus of the ARRB2 adaptor via the TCS, and the TEVp is fused at the C-terminus of GPCR via the V2 tail (Fig. 1a, right). In this case, ligand-activated GPCR-V2-TEVp proteolytically cleaves ARRB2-TCS-dCas9 to release the dCas9 effector. After cleavage, the dCas9-effector then translocates into the nucleus to modulate genes of interest. We called this second system ChaCha, in contrast to the Tango design 7, 8 , as both represent a functional interaction between two molecules.
The CRISPR ChaCha outperforms the CRISPR Tango. Both ChaCha and Tango designs were implemented with the evolved human muscarinic 3 GPCR (hM3-DREADD or hM3D), which recognizes the synthetic small molecule, clozapine-N-oxide (CNO) 5 . For gene activation, we used the Streptococcus pyogenes dCas9 fused to a tripartite transcriptional activator composed of VP64, p65 activation domain, and Rta (dCas9-VPR) 23 . We co-transfected these constructs and a TRE3G targeting sgRNA into HEK293T reporter cells, harboring a genomically integrated TRE3G promoter driving a GFP reporter ( Fig. 1a and Supplementary Fig. 1; see Supplementary Table 2 for sgRNA sequences). We found that the hM3D-CRISPR ChaCha exhibited better reporter gene activation (4.4 fold) compared to the hM3D-CRISPR Tango (1.2 fold) after 1-day treatment with CNO ( Fig. 1b). The Tango design displayed higher leakiness than the ChaCha design with significant GFP expression without CNO treatment. The observation that ChaCha outperforms Tango may suggest that fusing the small TEVp (28 kDa) to the receptor likely preserves the conformational fidelity and activity of the GPCR molecule, while fusing a larger domain such as dCas9-VPR (220 kDa) to GPCR may compromise its conformation by exposing the C-terminal tail of a GPCR and permit leaky interactions with ARRB2.
We applied mathematical modeling to the ChaCha architecture to characterize system behavior. Intuitively, by fusing the TEVp to the receptor, we avoid the generation of a "dead" receptor pool that can no longer be used to catalyze the release of dCas9-VPR effector. Thus, there is a potential for the ChaCha to release multiple effector across the lifetime of a receptor. To test this, we investigated the number of released dCas9-VPR molecules (defined as n in Fig. 1c) via experiments and mathematical modeling (see Methods section for experimental procedure and model derivation, Supplementary Data 1-4 for raw data, and Supplementary Data 5 for R scripts). We formulated a set of simple rate equations to model the effects of ARRB2-TCS-dCas9-VPR-mCherry levels and CNO concentration on target gene (GFP) activation 24 . Using a stable HEK293T cell line containing a Doxycycline (Dox)-inducible dCas9-VPR-mCherry that activated UAS promoter-driven GFP expression, we first verified that dCas9-VPR activation of GFP was not cooperative (Hill coefficient < = 1, Supplementary Fig. 2a & c). We generated another stable HEK293T reporter cell line containing the ChaCha design with a Dox-inducible ARRB2-TCS-dCas9-VPR-mCherry and a constitutively expressed receptor (Methods section), which released free dCas9-VPR molecules to activated the UAS promoter-driven GFP reporter ( Fig. 1c and Supplementary  Fig. 2b). Our experimental data and modeling showed GFP expression was a hyperbolic function of ARRB2-TCS-dCas9-VPR-mCherry and CNO concentration (Fig. 1c, Methods section). An unbiased, nonlinear regression fitting of GFP expression in response to varying levels of ARRB2-TCS-dCas9-VPR and CNO (Fig. 1c) revealed that the number of released dCas9 molecules per receptor (n) was 2.33 (r = 0.95), implying that multiple dCas9 molecules are released during the lifetime of a receptor.
Design parameters that impact CRISPR ChaCha performance. We characterized various design parameters that modulate the efficiency of CRISPR ChaCha, using hM3D as a model system. These parameters included the following: (1) the linker length between ARRB2, TCS, and dCas9; (2) the proteolytic cleavage efficiency of different TCS sequences 25 ; and (3) the expression of hM3D-V2-TEVp via different promoters ( Supplementary Fig. 3).
Among the three parameters, our data suggested that proper placement of a flexible linker between ARRB2, TCS, and dCas9 is essential for a robust ChaCha design. More importantly, the proteolytic cleavage efficiency affects gene activation (Supplementary Fig. 3a). Somewhat surprisingly, we observed that a weaker proteolytic activity achieved the highest dynamic range of the reporter gene ( Supplementary Fig. 3b). More efficient proteolytic cleavage dramatically increased the basal activation of the reporter gene in the absence of CNO, and reduced the dynamic range of activation (~2-fold). Furthermore, the receptor (hM3D-V2-TEVp) expression level affected gene activation by using different promoters ( Supplementary Fig. 3c). Quantification of the expression levels of the adaptor and the receptor with different promoters suggested that a higher stoichiometry ratio of adaptor to receptor increased the dynamic range for gene activation ( Supplementary Fig. 3c, d). As a result, proper choice of promoters to modulate stoichiometry between receptor and adaptor is an important consideration for optimizing the performance of GPCR-CRISPR ChaCha for a given cell type.
Kinetics and dose response of the CRISPR ChaCha system. To characterize the kinetics of gene regulation in the ChaCha system, we tracked GFP activation via live-cell time-lapse microscopy ( Fig. 2a, Supplementary Fig. 4, and Supplementary Movies 1-3). We transfected the SFFV promoter-driven ChaCha variant into cells and treated with CNO after 24 h. We saw that mCherrytagged ARRB2-TCS-dCas9-VPR proteins were predominantly . In contrast, samples without CNO treatment showed little GFP activation. The observation that gene activation occurs within 12 to 24 h suggests that the ChaCha system can be used to generate relatively fast cell responses to environmental stimuli. We next investigated how gene activation by the hM3D-CRISPR ChaCha varied with CNO ligand dose using the SFFVdriven ChaCha variant ( Supplementary Fig. 3c). We observed a relatively linear dose-response curve for hM3D-CRISPR ChaCha, spanning over 2 orders of magnitude of ligand induction (Fig. 2b, Supplementary Fig. 5). The fitted dose-response curve based on the Hill Equation revealed the effective CNO concentration to achieve half-maximal GFP level (EC 50 ) at 105 ± 4 nM (r = 0.97, Fig. 2b), which is comparable to previously reported EC 50 values using alternative methods 5 .
We further studied if the hM3D-CRISPR ChaCha is reversible upon removal of the CNO ligand. To do this, we treated the stable HEK293T reporter cell line ( Supplementary Fig. 2b) with 1 μg/mL Dox for 7 days to reach steady-state expression of ARRB2-TCS-dCas9-VPR. We then treated cells with 10 μM CNO for 1 day while maintaining Dox and measured GFP expression every day for 7 days. We observed that GFP expression continued to rise for two days (due to existing free dCas9-VPR) after stopping CNO treatment and subsequently decreased to the basal level after another 3 days. In contrast, cells with sustained CNO treatment maintained GFP expression throughout the measurement (Fig. 2c). This observation that the GPCR-CRISPR ChaCha system is reversible is consistent with reports that inducible CRISPR-dCas9 gene regulation is reversible 26 .
CRISPR ChaCha efficiently activates endogenous genes. A key advantage of coupling GPCR signaling to the CRISPR-dCas9 system is dCas9's ability to flexibly regulate the mammalian genome in a programmable manner. To demonstrate this ability in the GPCR-CRISPR ChaCha system, we chose to activate four endogenous genes in HEK293T cells: interleukin 2 (IL2) and interferon gamma (IFNG), which are cytokines that functionally modulate the cell-killing activity of leukocytes, and beta-globin (HBB) and gamma-globin (HBG), two genes in the human βglobin locus. All four genes are not actively expressed in HEK293T cells. We designed multiple sgRNAs that targeted the promoter region of each gene using the dCas9-VPR system, and measured the effects of sgRNAs for activating IL2 and IFN-γ using the Enzyme-linked immunosorbent assay (ELISA) and the effects for activating HBB and HBG using qPCR ( Supplementary  Fig. 6, see Methods section). For each gene, we then used the most effective sgRNA in combination with the hM3D-CRISPR ChaCha system for ligand-mediated endogenous gene activation. After 48 h of CNO treatment, we observed efficient activation of all four genes individually tested (3.4-fold for IL2, and 11-fold for IFN-γ, Fig. 3a The dose-response curve of hM3D-CRISPR ChaCha in HEK293T cells after 1-day treatment of different CNO concentrations. EC 50 , the effective ligand concentration to achieve half-maximal GFP induction, and is shown as mean ± standard deviation of three technical replicates. The biological replicate experimental data is shown in Supplementary Fig. 5. c Reversibility of the hM3D-CRISPR ChaCha system. 10 μM CNO was added to cells for 1 day (shaded magenta area) and removed after day 1 (magenta curve). As a positive control, cells were grown with CNO for 7 days (black). Each data point represents the mean of GFP fluorescence of six technical replicates from two independent experiments, and the error bars represent the standard error of the mean. See Methods section for detailed experimental procedure ChaCha system can activate different endogenous genes in response to its ligand. CRISPR ChaCha activates multiple endogenous genes. We reasoned that the CRISPR ChaCha could be used to control multiple genes simultaneously, serving as a single input, multiple output (SIMO) genetic device. To investigate this, we cloned a dual sgRNA expression vector to encode two different sgRNAs (Fig. 3c, left; Methods section). We first validated activation of IL2 and IFN-γ by the dual sgRNA vector as measured by ELISA. The control experiments indicated that the dual sgRNA paired with dCas9-VPR allowed activation of each gene, and each sgRNA was specific without affecting expression of the other gene (Supplementary Fig. 8a). When we transfected the dual sgRNA vector with the hM3D-CRISPR ChaCha system, we observed simultaneous activation of both cytokines in response to CNO treatment (3.3-fold for IL2 and 11-fold fold IFN-γ, Fig. 3c). Compared to using a single sgRNA, we observed similar activation fold change ( Fig. 3a and Supplementary Fig. 8b), suggesting the system is suitable for activating multiple genes without comprised efficiency. Together, our data confirmed that the GPCR-coupled CRISPR ChaCha system could efficiently modulate endogenous genes in response to GPCR signals in human cells.
Generation of a CRISPR ChaCha toolkit using different GPCRs. GPCRs are the largest and most diverse group of membrane receptors in the human body, with Class A GPCRs constituting over 80% of the entire family 4,8 . We next tested the generality of the ChaCha design for more Class A GPCRs. We chose seven additional GPCRs that were classified on different branches of the Class A GPCR phylogenetic tree (Fig. 4a). These included another evolved GPCR, the kappa opioid receptor DREADD (KORD), which senses Salvinorin B 6 and six natural GPCRs: CXCR4, which senses the chemokine stromal derived factor 1 (SDF1); NMBR, which senses the mitogen neuromedin B; LPAR1, which senses the fatty acid lysophosphatidic acid; ADRB2, which senses the hormone epinephrine and similar compounds; AVPR2, which senses the hormone vasopressin; and TRHR, which senses thyrotropin-releasing hormone 8 Fig. 3 The ChaCha system enables activation of individual and of multiple genes. a Induction of endogenous IL2 and IFN-γ expression and secretion using the hM3D-CRISPR ChaCha system measured by ELISA. Gray, activation using free dCas9-VPR with a targeting (sgIL2 or sgIFNG) or non-targeting sgRNA (sgGAL4). Blue, activation of IL2 or IFN-γ using a targeting sgRNA (sgIL2 for IL2, sgIFNG for IFN-γ) with or without 10 μM CNO; orange, activation using a non-targeting sgRNA with or without 10 μM CNO. The dotted line represents the detection limit of ELISA (4 pg/mL). b Activation of HBB and HBG using the hM3D-CRISPR ChaCha system measured by quantitative PCR (qPCR) with or without 10 μM CNO. c Simultaneous activation of two endogenous genes using the hM3D-CRISPR ChaCha system. Left, the plasmids of ChaCha system used for multiplexed gene regulation. Notably, the ChaCha design worked efficiently for all seven additional GPCRs tested, with NMBR exhibiting the best activity in response to its ligand (Fig. 4b). Interestingly, we observed efficient gene activation for TRHR, which has not been reported for the Tango design 8 . We also tested different linkers for fusing TEVp to the C-terminal of a subset of these GPCRs (CXCR4, NMBR, LPAR1), and observed that the alterations in the V2 tail length and composition was important for better ON/OFF activity ( Supplementary Fig. 9).
Using NMBR-CRISPR ChaCha as an example, we further measured the dose-response behavior. In contrast to the hM3D receptor, the NMBR receptor displayed a narrower induction range ( Fig. 4c and Supplementary Fig. 10). The fitted dose-response curve using the Hill Equation revealed an effective NMB concentration to achieve half-maximal GFP level (EC 50 ) at 13 ± 4.2 nM (r = 0.97, Fig. 4c) Supplementary Table 4 for ligand concentrations). The data were normalized to the free dCas9-VPR without a targeting sgRNA (for KORD, LPAR, CXCR4, NMBR) or with a non-targeting sgRNA (for ADRB2, AVPR, TRHR). The data represent two independent experiments with 2-3 technical replicates, and the bars represent the mean. For CXCR4, NMBR, and LPAR, we chose the best performing version characterized from a few variants as shown in Supplementary Fig. 9. See Methods section for the experimental procedure. c The dose-response curve of NMBR-CRISPR ChaCha in HEK293T cells after 1-day treatment of different CNO concentrations. EC 50 , the effective ligand concentration to achieve half-maximal GFP induction, and is shown as mean ± s.d. of three technical replicates. The replicate experimental data is shown in Supplementary Fig. 10. d Induction of endogenous IFN-γ by NMBR-CRISPR ChaCha in HEK293T cells after 2 days of 0.5 μM NMB treatment. sgGAL4, nontargeting sgRNA; sgIFNG, IFNG-targeting sgRNA.+/− indicates with or without Neuromedin B. The fold of activation displayed on top of bars compares+/− treatment conditions. The bars represent the mean, and the data represent two independent experiments with 3 technical replicates We also tested whether a natural GPCR-CRISPR ChaCha can activate endogenous genes. Using the NMBR-CRISPR ChaCha, we observed 20-fold activation of the endogenous IFN-γ expression in HEK293T cells after 2-day NMB treatment (Fig. 4d). The toolkit of 8 CRISPR ChaCha systems reported here greatly expands the ligand-inducible control of CRISPR-dCas9 for efficient endogenous gene regulation, suggesting that the strategy should be generalizable to many other Class A GPCRs.

Discussion
Here we established a strategy named the ChaCha design to couple programmable gene regulation function of CRISPR-dCas9 with the ligand-sensing ability of GPCRs. We demonstrated this strategy by devising a CRISPR-coupled GPCR toolkit consisting of two synthetic GPCRs and six natural GPCRs. These CRISPR ChaCha systems can modulate gene expression in response to diverse ligands including synthetic compounds (CNO, Salvinorin B, and Isoproterenol), hormones (vasopressin, thyrotropinreleasing hormone), mitogens (neuromedin B), chemokines (stromal derived factor 1), and fatty acids (lysophosphatidic acid), which greatly expands the tools for inducible control of the human genome via the CRISPR-Cas system. Together, the developed CRISPR ChaCha systems provide an efficient and modular platform that employs diverse GPCRs for programmable sensing of a broad spectrum of ligands for modulating gene expression. Given the generality of the ChaCha design, we expect this approach can be applied to more GPCRs for more ligand inputs.
The CRISPR-coupled GPCR ChaCha system presents several advantages. First, fusing the small TEVp to the receptor likely preserves the conformational fidelity and activity of the GPCR molecule. Indeed, our data showed that the hM3D ChaCha improved the dynamic range of gene activation compared to the hM3D Tango (Fig. 1b). Second, we used a simple rate model and experimental data regression (Methods section) to show that the ChaCha design could allow for the release of multiple dCas9 effectors (estimated to be 2.33) during the life cycle of a receptor while the Tango design can maximally release one effector per receptor. Third, the programmability of CRISPR-Cas system allows us to easily redirect the same (synthetic or natural) signal to different genomic outputs (e.g. endogenous IL2, IFN-γ, HBB, and HBG). This greatly expands previous approaches beyond the use of synthetic transcription factors (e.g., GAL4 or tTA) that can only produce synthetic outputs from delivered transgenes.
The ChaCha system can be further optimized to overcome a few current limitations. While the ChaCha system is less leaky than Tango (Fig. 1b), we observed some level of leakiness. We performed experiments to probe the sources of observed leakiness. Our data suggested that the basal signaling activity of receptor without the ligand and the basal cleavage of TCS by the TEVp were the main source of the leakiness ( Supplementary  Fig. 11). Adopting a weaker TEVp, a weaker TCS ( Supplementary  Fig. 3b), or an inducible TEVp may lower this leakiness. Furthermore, using promoters that optimize the stoichiometric ratio between the receptor and adaptor molecules could increase the dynamic range of gene regulation (Supplementary Fig. 3c).
The ChaCha system can benefit from diverse CRISPR tools that are being developed. For example, combining the system with diverse species of dCas9 should broaden the targetable genome space 29,30 ; coupling the system with the dCas9-mediated repressors may result in ligand-mediated repression of endogenous genes; combining with epigenetic regulators may further enable ligand-mediated epigenome editing of the mammalian genome 16,18,31 .
While this manuscript was under review, another work was reported that coupled the CRISPR-Cas system to GPCRs using the Tango architecture and a split dCas9 strategy 32 . In their system, the split dCas9 strategy reduced the leakiness of the system by limiting the activity of full dCas9 molecules for undesired transcription control. Our ChaCha architecture provides an alternative, complementary strategy, which could also reduce the leakiness by avoiding fusing a large dCas9 molecule to the receptor, and potentially allow for the release of multiple effectors per receptor to enhance target gene activation. We expect that a combination of both strategies should provide an approach to achieve robust, sensitive, efficient CRISPR-coupled GPCR devices in the future.
Probing GPCR activity in vivo has been a useful avenue for understanding their function 33 and the ligands of many orphan GPCRs remain elusive 34 . The engineered ChaCha systems may help understand GPCR signaling and probe GPCR activity in response to their ligands. For example, one of the receptors tested (TRHR) was classified as "non-orphan, but also non-optimized" for transcriptional read-out by the Tango assay 8 , despite being described to interact with ARRB2 27 . Our result showed TRHR ChaCha worked well, suggesting it may be possible for a broader family of GPCRs to be adopted to transcriptional outputs with the ChaCha design. This would not only facilitate profiling GPCR signaling, but further expand diverse convertible inputs across the GPCR phylogenetic tree for dCas9 genome regulation.
We further envision that the CRISPR ChaCha system can be used to create novel cell therapeutic programs. Given the diverse ligand spectrum of GPCRs and their importance in development, physiology, and disease 2, 4 , harnessing and rewiring GPCR pathways may incorporate physiological or disease-relevant stimuli that cannot be detected using antibody-based approaches 35,36 . For example, human primary T cells engineered with chimeric antigen receptor (CAR) 37 or SynNotch 35 could be equipped with CRISPR ChaCha to detect GPCR ligands in the tumor microenvironment 38 and conditionally execute tumoricidal functions. Additionally, given the pharmacological relevance of GPCRs, the toolkit also offers an avenue for external control of the CRISPR function using synthetic compounds in vivo 5,6,39 .
Altogether, the CRISPR ChaCha system represents a significant methodological advancement for connecting diverse extracellular signals to the CRISPR-dCas9 gene regulation program, and is useful for both understanding receptor biology and devising cellbased therapeutics.

Methods
Generation of genetic constructs. Standard molecular cloning techniques were performed to assemble all constructs used in this paper and they are included in Supplementary Table 1. Human codon-optimized S. pyogenes dCas9 was fused at the C-terminus with the tripartite VPR activator. 23 VPR is a fusion of VP64, p65 activation domain, and Rta via two GS linkers. An SV40 nuclear localization signal (NLS, PKKKRKV) was inserted C-terminal to VP64. For visualization, mCherry was fused at the Cterminus of the construct. The fusion construct was cloned into a pcDNA3 vector with a CMV promoter driving the expression of dCas9-VPR-mCherry. For the mathematical model, a lentiviral pHR vector with a Doxycycline (Dox)-inducible TRE3G promoter was used instead.
ARRB2-TCS-dCas9-VPR was assembled by fusing ARRB2 (Human cDNA, NM_004313.3; Origene) with dCas9-VPR-mCherry and cloned into a pcDNA3 vector. The TCS sequence ENLYFQ/X was inserted in between and was flanked with GS linkers of varying lengths (Supplementary Fig. 3). Two nuclear export signals (NES, LALKLAGLDI) flanked ARRB2 to ensure cytoplasmic localization of the chimera. For the mathematical model, a lentiviral pHR vector with a Doxinducible TRE3G promoter was used instead.
Synthetic GPCRs, natural GPCRs and TEV protease (Addgene #8835) were all PCR amplified and cloned into a pHR lentiviral vector by InFusion (Takara Clontech) cloning. The V2 sequence (derived from AVPR2) 7 was inserted in between GPCR and TEVp as primer overhangs via InFusion cloning. For visualization, p2A-BFP was fused C-terminal to TEVp. Expression of GPCR-V2-TEVp-p2A-BFP was driven by an EF1a, PGK or SFFV promoter. See Supplementary Table 3 for plasmid sources for receptors.
All sgRNAs were cloned into a pHR lentiviral U6-driven expression vector that co-expressed puromycin-p2A-BFP or upstream of the GPCR-V2-TEVp locus for ease of transfection of the three-component GPCR-CRISPR ChaCha system. Alternative sgRNA sequences were generated by PCR and inserted by InFusion cloning into the vector digested with BstXI and NotI (New England Biolabs).
For multiplexing experiments, we also cloned a dual sgRNA vector to otherwise reduce false positives in bulk measurements (e.g., ELISA). This consists of two sgRNA cassettes in tandem driven by mouse U6 (mU6) and human U6 (hU6) promoters, respectively, and a co-expressed puromycin-p2A-BFP cloned into a pHR lentiviral vector. Here, the mU6 vectors are cloned using InFusion (Clontech) to insert PCR products into a modified vector digested with BstXI and SpeI. The hU6 sgRNA vector was cloned inserting PCR productions with InFusion cloning into a parent vector digested with XbaI and SpeI. After sequence verification, vectors were combined by digesting the XU6 sgRNA with XbaI and SalI, taking the insert and ligating into a SpeI and SalI digested XU6 vector.
Spacer sequences for all sgRNAs used can be found in Supplementary Table 2.
Cell culture and generation of stable cell lines. HEK293T cells (Lenti-X TM , Clontech) were maintained in DMEM High Glucose with GlutaMAX TM media (Thermo Fisher) supplemented with 10% Tet System Approved FBS (Clontech) and 100 U/mL of penicillin and streptomycin (Gibco) at 37 o C with 5% CO 2 . We did not independently authenticate these cell lines and they were not tested for mycoplasma contamination. For transfection, HEK293T cells (Lenti-X TM , Clontech) with 3 μL of Mirus TransIT-LT1 reagent per μg of plasmid added, and then incubated at room temperature for 15-30 min. Unless otherwise noted, GPCR ligands were added at the following concentrations at day 3 as specified in Supplementary Table 4. This table also specifies the media conditions used for each receptor to generate the data in Fig. 4.
Flow cytometry analysis. Cell were dissociated using 0.05% Trypsin-EDTA (Life Technologies) and analyzed for reporter fluorescence in the Stanford Shared FACS facility with a Scanford FACScan analyzer (Becton Dickinson), or with a CytoFLEX S flow cytometer (Beckman Coulter). We collected 10,000 cells containing constructs of interest for analysis (BFP and mCherry double positive). The data presented are normalized to either a free dCas9-VPR with no sgRNA, or non-targeting sgRNA control as specified.
Time-lapse microscopy. At day 0, HEK293T TRE3G-GFP reporter cells were plated at 1 × 10 5 cells per 24-well well (μ-Plate 24 well; ibidi). At day 1, 250 ng of each plasmid was transfected (see Fig. 2a and Supplementary Fig. 4). At day 2, 20 μM of CNO was added to appropriate wells and immediately imaged. Time-lapse microscopy was performed on a Leica DMi8 inverted microscope equipped with, Lumencor SOLA SMII 405, Leica DFC9000 GT camera and Oko-Lab cage incubation system at 37 o C with 5% CO 2 . Leica Application Software was used to set up time-lapse imaging. Images from phase contrast, mCherry (filter cube TXR, No. 11525310), and GFP (filter cube GFP, Cat. No. 11525314) channels were taken every 0.5 h for 48 h with a 20x/0.40NA corr PH1 objective using Leica Adaptive Focus control. Image processing was performed in Fiji (ImageJ).
Endogenous cytokine activation and secretion assays. A day before transfection, HEK293T cells were seeded in 24-well plates at a density of 5 × 10 4 cells per well. On day 1, cells were transfected with 250 ng of each plasmid (i.e., the CRISPR ChaCha components: GPCR-V2-TEVp of interest, the ARRB2-TCS-dCas9-VPR, and an sgRNA). On day 2, controls were transfected, consisting of the GPCR of interest, dCas9-VPR, and an sgRNA. Media on the ChaCha-containing cells was then changed to those with or without ligand treatment (10 μM for CNO; 0.5 μM for NMB).
Supernatants from cell cultures were harvested on day 4, and stored at −80 o C. Secreted proteins were quantified using the ELISA MAX Deluxe kits for human IL2 and IFN-γ (BioLegend). Absorbance at 450 nm and 570 nm was measured for samples in technical triplicates with a Synergy H1 plate reader (BioTek). Samples were standardized by subtracting measurements at 570 nm from those at 450 nm. Protein concentrations were then determined by standard curves fitted to a power law using Excel (Microsoft).
qPCR analysis of gene expression. Cells were transfected as described in the proceeding section. On day 4, cells were harvested and RNA was extracted using a RNeasy Midi Plus Kit (Qiagen). cDNA was then prepared using 500 ng of RNA per 20 μL reaction via iSCRIPT cDNA synthesis (BioRad). Following synthesis, cDNA was stored at −30°C until qPCR.
qPCR was conducted in 10 μL reactions using 384 well plates, using 15 ng of cDNA, a 400 nM final concentration of primers, and iTaq Universal SYBR Green Supermix (BioRad). See Supplementary Table 5 for primers used. From transfection, there were three technical triplicates that were then ran in technical duplicate for qPCR reactions. Thermocycling was done as follows: 95°for 1 min, 95°for 10 s, and 60°for 30 s. The latter two steps cycled for 50 repeats with plate reads taken after the 60°step 40 on a CFX384 Touch Real-Time PCR thermocycler (BioRad). To reduce technical variation in loading 384 well plates, each independent experiment was ran on the same day with the same aliquots of a qPCR reaction master mix. We applied a Ct threshold of 35 cycles after running water controls for each primer. Thus, any Ct values that were over 35 or not reported after 50 cycles were then set to a Ct of 35 cycles.
The data were analyzed using the ΔΔC t method. ΔC t was calculated about the housekeeping gene GAPDH. Then ΔΔC t was calculated using the ΔC t t from the gene of interest (GOI), subtracted from the ΔC t of the free dCas9-VPR and sgGal4 condition (M 0 ). We then report relative expression as the following: Fold changes are reported as the ratio of relative expression between the CNO treated and untreated conditions. Class A GPCR phylogenetic tree construction. The phylogenetic tree in Fig. 4a was constructed using GPCRdb 41,42 . Human GPCRs from the Swiss-Prot database were used as reference, without any selection for G protein preference. One GPCR from each family of Class A/Rhodopsin Family GPCRs was used to construct the tree, including those utilized in this study. Full-length sequences of receptors were considered for tree construction. No bootstrapping was performed, and distance calculation utilized the neighbor-joining method, with the regular branch lengths option. The tree was then rendered using T-REX 43 .
Modeling GFP activation by doxycycline-inducible dCas9-VPR. We construct rate equations to model the induction of dCas9-VPR-mCherry (referred hereafter simply as dCas9) by Dox (D) and the dCas9-induced activation of the target reporter gene, GFP.
where α 1 and α 2 are first-order rate constants for dox-induced dCas9 (C) production and subsequent dCas9-induced production of GFP (G), respectively; the Hill coefficient n and K D are the cooperativity and affinity constants of dox induction, respectively; the exponent m is a lumped parameter that captures the following processes in series: dCas9 binding to the gene target (GFP), transcription, and translation of GFP; β 1 and β 2 are first-order degradation rate constants for dCas9 and GFP, respectively.
A simple mathematical rate model of the CRISPR ChaCha system. We construct rate equations to model four connected processes, which are: (i) conversion of inactive hM3D-TEV (R) receptor to an activated state (R * ) upon CNO ligand (L) binding, which leads to (ii) the cleavage of dCas9-VPR-mCherry (C) from ARRB2-dCas9-VPR-mCherry (A, referred hereafter simply as ARRB2-dCas9) that can be (iii) induced with doxycycline (D), and (iv) the subsequent activation of the target reporter gene, GFP (G), by cleaved dCas9-VPR.
Where α R , α R* and α A are production rate constants for inactive receptor, ligandactivated receptor, and ARRB2-dCas9; β R , β R* , β A , β C , and β G are first-order degradation rate constants for inactive receptor, active receptor, ARRB2-dCas9, cleaved dCas9, and GFP, respectively; γ C and γ G are reaction rate constants for active receptor-mediated cleavage of ARRB2-dCas9 to release dCas9, and subsequent dCas9-induced production of GFP, respectively; n is the number of ARRB2-dCas9-VPR molecules recruited per one active receptor. At steady state, all time derivatives go to zero, which yield the following steady state (ss) formulae for relevant molecules Substituting equations (12), (13), (14) into Equation (15) yields a steady-state formula for GFP as a function of CNO ligand and ARRB2-dCas9, or simply, where G max ¼ γ G β G Á αR γ G þβ C represents the theoretical maximum GFP level at high saturating levels of L and A, and it is a function of the rate constants for receptor production, dCas9 degradation, and GFP degradation; K L ¼ β R =α R Ã represents the set-point concentration for the CNO ligand to produce half-maximal GFP levels; K A ¼ β R Ã =γ C represents the ratio of active receptor-mediated ARRB2-dCas9 cleavage and active receptor degradation rate constants.
Data presentation and analyses. Data are displayed as individual points with sample size indicated in figure legends. No sample size estimates were performed, and the sample sizes used in this study are consistent with those used by similar genome editing and gene regulation studies. Experiments were performed independently at least two times. Values reported are relative to indicated control conditions. No randomization or blinding was performed.
Statistical analysis was performed using SPSS Statistics 21 (version 22, IBM Corporation), or Prism 7 (Graphpad). Equal variance between populations was not assumed. To account for unequal variance among conditions, Welch's two-sided t test was performed when comparing two conditions, and Welch's ANOVA was performed followed by Games-Howell post hoc tests when comparing more than two conditions with each other. All statistical data analyses are compiled in Supplementary Table 6.
Data availability. All relevant data can be provided by the authors. In the manuscript we provide the raw data (Supplementary Data 1-4) and R scripts (Supplementary Data 5) used to generate in Fig. 1 and Supplementary Fig. 2.