Abstract
Genome and expressed sequence tag projects are rapidly cataloguing and cloning the genes of higher organisms, including humans. An emerging challenge is to rapidly uncover the functions of genes and to identify gene products with desired properties. We have developed a microarray-driven gene expression system for the functional analysis of many gene products in parallel. Mammalian cells are cultured on a glass slide printed in defined locations with different DNAs. Cells growing on the printed areas take up the DNA, creating spots of localized transfection within a lawn of non-transfected cells. By printing sets of complementary DNAs cloned in expression vectors, we make microarrays whose features are clusters of live cells that express a defined cDNA at each location. Here we demonstrate two uses for our approach: as an alternative to protein microarrays for the identification of drug targets, and as an expression cloning system for the discovery of gene products that alter cellular physiology. By screening transfected cell microarrays expressing 192 different cDNAs, we identified proteins involved in tyrosine kinase signalling, apoptosis and cell adhesion, and with distinct subcellular distributions.
The growing collection of gene sequences and cloned cDNAs demands the development of systematic and high-throughput approaches to characterizing the gene products. Strategies comparable to DNA microarrays for transcriptional profiling1, 2 and to yeast two-hybrid arrays for determining protein–protein interactions3 do not exist to analyse the function, within mammalian cells, of large sets of genes. At present, in vivo gene analysis can be done—on a gene-by-gene scale—by expressing a DNA construct within cells that directs the overproduction of a gene product or inhibits its synthesis or function. The effects on cellular physiology of altering the level of a gene product is then detected using a variety of functional assays. We describe a strategy for the high-throughput analysis of gene function in mammalian cells. We have developed a system suitable for rapidly screening large sets of cDNAs or DNA constructs for those genes encoding desired products or causing cellular phenotypes of interest. Using slides printed with sets of cDNAs in expression vectors, we create living microarrays of cell clusters expressing the gene products. The cell clusters can be screened for any property detectable on a surface and the identity of the responsible cDNA determined from the coordinates of the cell cluster with a phenotype of interest.
To create these microarrays, we simultaneously transfect distinct and defined areas of a lawn of cells with different plasmid DNAs (Fig. 1a). This is done without the use of individual wells to sequester the DNAs. Nanolitre volumes of plasmid DNA in an aqueous gelatin solution are printed on a glass slide using a robotic arrayer. After drying, the DNA spots are briefly exposed to a lipid transfection reagent; the slide is then placed in a culture dish and covered with adherent mammalian cells in medium. The cells growing on the DNA and gelatin spots express the DNA and divide 2–3 times in the process of creating a microarray with features consisting of clusters of transfected cells (which we call 'transfected cell microarrays'). We call the method to make the arrays 'reverse transfection' because, compared with conventional transfection, we have reversed the order of addition of DNA and adherent cells. For detailed protocols of this and alternative methods see http://staffa.wi.mit.edu/sabatini_public/reverse_transfection.htm.
Figure 1: Well-less transfection of plasmid DNAs in defined areas of a lawn of mammalian cells.

a, Protocol for making microarrays of transfected cells. b, Laser scan image of a GFP-expressing microarray made from a slide printed in a 14
10 pattern with a GFP expression construct. c, Higher magnification image obtained with fluorescence microscopy of the cell cluster boxed in b. Scale bar, 100
m. d, Expression levels of cell clusters in a microarray are proportional, over a fourfold range, to the amount of plasmid DNA printed on the slide. Indicated amounts of the GFP construct assume a 1-nl printing volume. The graph shows the mean
s.d. of the fluorescence intensities of the cell clusters (n = 6). The fluorescent image is from a representative experiment. e, Co-transfection is possible with transfected cell microarrays. Arrays with elements containing expression constructs for HA–GST, GFP or both were transfected and processed for immunofluorescence and imaged with a laser scanner. Cy3, cell clusters expressing HA–GST; GFP, cell clusters expressing GFP; merged, superimposition of Cy3 and GFP signals. Yellow colour indicates co-expression. Scale bar, 100
m. f, Enlarged view of boxed area of scan image from e.
To illustrate the method, we printed an array with elements containing an expression construct for the green fluorescent protein (GFP). HEK293 cells were plated on the slide for transfection and the fluorescence of the cells detected with a laser fluorescence scanner. A low magnification scan shows a regular pattern of fluorescent spots that matches the pattern in which we printed the GFP expression construct (Fig. 1b). A higher magnification image obtained through fluorescence microscopy shows that each spot is about 120–150
m in diameter and consists of a cluster of 30–80 fluorescent cells (Fig. 1c). As in a conventional transfection, the total expression level in the clusters is proportional over a defined range to the amount of plasmid DNA used (Fig. 1d). As it is often useful to express two different plasmids in the same cells, we examined whether our technique is compatible with co-transfection. Arrays with elements containing expression constructs for GFP, haemagglutinin (HA)-tagged glutathione S-transferase (GST) or both were prepared and transfected. The cells growing on elements printed with both cDNAs express both encoded proteins, indicating that co-transfection does occur (Fig. 1e).
We first determined whether transfected cell microarrays could be used to identify gene products on the basis of their intrinsic properties. As a test case, we sought to use an array to identify the receptor for FK506, a clinically important immunosuppressant whose pharmacologically relevant target, FKBP12, is an intracellular protein4, 5, 6. We printed on a slide, in an easily recognizable pattern, elements containing expression constructs for Myc-tagged FKBP12, GFP or both. After the microarray of transfected cell clusters formed, we added radiolabelled FK506 to the tissue culture medium for 1 h before processing the slide for immunofluorescence and autoradiography. The radiolabelled FK506 bound to the array in a pattern of spots that exactly matched the pattern of cell clusters expressing FKBP12 (Fig. 2a). The GFP-expressing clusters and the non-transfected cells surrounding the clusters had background levels of signal (Fig. 2a). Detection of the bound FK506 with autoradiographical emulsion confirms, at the cellular level, colocalization between FKBP12 expression and FK506 binding (Fig. 2b). The clusters overexpressing FKBP12 had only a 3.1
0.2-fold increase in the binding of radiolabelled FK506 compared with the GFP-expressing clusters (Fig. 2c), probably reflecting the high levels of endogenous FK506-binding proteins expressed in the cells7. Consistent with this, the prior addition of excess rapamycin, a competitive antagonist of FK506, not only eliminated the specific signal but also greatly lowered the background signal (Fig. 2c).
Figure 2: Detection of drug receptors on transfected cell microarray.

a, Increased binding of tritiated FK506 to cell clusters expressing FKBP12. Laser scans show clusters expressing GFP, Myc-FKBP12 or both (merged). Film autoradiography detects binding of tritiated FK506 to the same array (autorad film). b, Colocalization between FKBP12-expressing cells and FK506-binding. Higher magnification image obtained by fluorescence microscopy of an FKBP12-expressing cluster (FKBP12). Emulsion autoradiography detects, with cellular resolution, binding of tritiated FK506 to the same cluster (autorad emulsion). c, Rapamycin competes with FK506 for binding to FKBP12-expressing clusters. Mean
s.d. (n = 10) is shown in arbitrary units. d, Binding of tritiated SCH23390 to cell clusters expressing the dopamine D1 (DA) but not the serotonin 1A (5-HT) receptor (autorad film). Each receptor was expressed from a plasmid encoding a bicistronic messenger RNA containing an IRES–GFP element downstream of the receptor cDNA. Clusters on the left and right were transfected with 20 and 30 pg plasmid DNA, respectively. Fluorescence microscopy shows that all clusters express GFP.
We also detected binding of a small molecule to a membrane protein expressed in specific clusters of a transfected cell microarray. We transfected slides printed with plasmids encoding an IRES–GFP cassette downstream of the cDNA for the dopamine D1 or serotonin 1A receptor. Cell clusters transfected with either plasmid expressed GFP, but a dopamine antagonist bound only to the cells transfected with the plasmid for the dopamine receptor (Fig. 2d).
We next used transfected cell microarrays to screen a collection, enriched for signalling molecules, of 192 epitope-tagged cDNAs cloned in expression vectors. All screens were performed in a blinded fashion without knowledge of the identity of the encoded proteins. We screened for gene products that increase the activity of kinase signalling pathways, induce apoptosis or are involved in cell–cell adhesion. After transfection of HEK293T cells on the array, we used phosphospecific antibodies to identify cell clusters with increased levels of phosphotyrosine or of the activated phosphorylated forms of several members of the mitogen-activated protein kinase (MAPK) family. Six cell clusters (1A2, 1C7, 1C9, 1C11, 1E3 and 1F6) had phosphotyrosine levels above background (Fig. 3a). The coordinates of the clusters match those of the wells of a microtitre plate containing the source cDNAs, and were used to ascertain the identity of the transfected cDNAs. This revealed that five of these clusters were expressing known tyrosine kinases (TrkC, Syk, Syn, Lck and Blk) whereas the sixth (1C9) encodes a protein of unknown function (Table 1). Screening of duplicate arrays for phospho-MAPKs showed that the clusters expressing Syk and Blk also had increased levels of activated phospho-JNK and phospho-p38 (Table 1). Syk is a known activator of these MAPKs8, but this function has not been ascribed previously to Blk. Overexpression of Syn and Lck did not activate JNK or p38, indicating that, at least in HEK293 cells, not all cytoplasmic tyrosine kinases have this effect. To identify gene products that might have a function in apoptosis or cell adhesion, we examined the microarray for clusters containing cells with abnormal morphologies. We found one cluster (2E8) in which the cells were fragmented (Fig. 3b) and positive for the TdT-mediated dUTP nick end-labelling9 reaction (data not shown), which is consistent with their expression of the TRAIL receptor 2, an apoptosis-inducing protein10. In another cluster (2F7), the plasma membranes of adjoining cells were in close contact to each other (Fig. 3b). This cluster expressed CD36, a cell-surface protein with functions in cell–cell recognition and adhesion11. Other cell phenotypes that are consistent with the potential functions of the cDNAs expressed in the clusters were also present on the microarray (Table 1). Thus, screens using simple detection methods (phosphospecific antibodies and visual inspection) revealed diverse cellular phenotypes in the transfected cell microarray and identified both known and new roles for proteins.
Figure 3: Identification of gene products inducing cellular phenotypes of interest.

a, Detecting components of tyrosine kinase signalling cascades. V5-epitope-tagged cDNAs (192) in expression vectors were printed in two 8
12 subgrids named grid 1 and 2. For ease of determining the coordinates of cell clusters within the arrays, a border around each array was printed with a GFP expression construct. After transfection, separate slides were processed for anti-V5 or anti-phosphotyrosine (anti-p-Tyr) immunofluorescence and Cy3 and GFP fluorescence was detected. Merged images of grid 1 show location of clusters expressing V5-tagged proteins (left panel) and having increased level of phosphotyrosine (right panel). In this particular array, no DNA was printed at coordinates F10–12. b, Two examples of the morphological phenotypes detectable in the transfected cell microarrays described in a. Clusters shown are E8 and F7 from grid 2.
Table 1: Phenotypes induced and subcellular distributions for cDNAs expressed in transfected cell microarray
The subcellular distribution of a protein can provide crucial insights to its cellular functions, and its determination is an important part of characterizing a gene of unknown function. Therefore, we also examined, through visual inspection, the subcellular distribution of the proteins expressed in the cell microarrays made with the cDNA collection. We identified 22 clusters expressing proteins with distinct, non-cytoplasmic subcellular distributions (Table 1). Several proteins that are known transcription factors, including ETR101, MEF2C and SP1-like transcription factor, were mainly located in the cell nucleus. This was also true for other proteins with potential nuclear roles, such as deoxycytidine kinase, 3pK (chromosome 3p kinase) and TIP60, as well as others, like phosphatase 1-
(PP1
), whose subcellular distribution has not been ascertained previously. The location of PP1
in the nucleus was unexpected and differs from that of its close homologue, PP1
, which is also expressed on the array but was found exclusively in the cytoplasm (data not shown).
Transfected cell microarrays have distinct advantages over conventional expression cloning strategies using fluorescence-activated cell sorting or sib selection12. First, cDNAs do not need to be isolated from the cells showing the phenotype of interest. This significantly accelerates the pace of expression cloning and allows for screens using a variety of detection methods (including those incompatible with cell viability) such as autoradiography and in situ hybridization. Our experiments took days to perform instead of the weeks or months necessary with other expression cloning strategies. Second, as the signal is concentrated in a well defined small area in a transfected cell microarray, cDNAs can be identified whose expression produces only small increases in signal over background. This capacity is well demonstrated by our detection of the receptor for FK506 in host cells with high levels of endogenous FK506-binding proteins. The relatively small increase in binding of the drug over background caused by the overexpression of FKBP12 would have been difficult to detect in the early rounds of sib selection. Third, transfected cell microarrays can be used to screen live cells, allowing the detection of transient phenotypes, such as a change in the intracellular calcium concentration or the movement of a GFP-tagged protein. Fourth, being compact and easy to handle, transfected cell microarrays have economies of scale. Because the microarrays are printed with the same robotic arrayers as traditional DNA arrays, it is feasible to achieve densities of up to 6,000–10,000 cell clusters per standard slide. At these densities, the entire set of human genes could be expressed on a small number of slides. The main difficulty in creating such pan-genomic arrays is the work required in making the cDNA expression constructs to print on the slides. This disadvantage is shared by other microarray-based technologies that use defined sets of cDNAs. Despite the effort required, projects to make collections of full-length cDNAs for all human genes are underway13, 14, 15, 16 (see also http://www.hip.harvard.edu/research.html) and will be valuable tools for making the arrays we describe.
Transfected cell microarrays also have certain advantages over protein microarrays, a recently introduced technology in which pure proteins are immobilized on a surface17, 18, 19. Although of potentially broad use, highly representational protein microarrays are difficult to make because large numbers of individually purified proteins are needed. Furthermore, it is unclear for how long the proteins on the arrays will be stable once the array is printed. Transfected cell microarrays can be substituted for protein microarrays for some applications, such as in the identification of small molecule targets. Our approach has the advantage that microarrays of DNA, which are stable for months, can be converted when needed into arrays of mammalian cell clusters expressing the encoded gene products.
We have described arrays in which the transfected plasmids direct gene overexpression. However, as antisense technology improves or other methods emerge for decreasing gene function in mammalian cells, it may be possible to also use transfected cell microarrays to screen for phenotypes caused by loss of gene function. The key to spatially restricting transfection without wells is the immobilization of the plasmid DNA in a gel from which it is only accessible to nearby cells. This concept may be generally applicable to the high-throughput and well-less screening in mammalian cells of other molecules, such as chemical compounds, proteins or oligonucleotides.
Methods
Plasmids
EGFP (enhanced GFP) was expressed from the retroviral vector pBABEpuro20, HA–GST and Myc-tagged FKBP12 from the cauliflower mosaic virus (CMV)-driven prk5 vector21, and the human dopamine D1 and serotonin 1A receptors from the LTR (long terminal repeat) driven, bicistronic pMSCV–IRES–EGFP vector. The Genestorm cDNAs are cloned into the CMV-driven pcDNA3.1/GS vector (Invitrogen). These clones were picked at random by Invitrogen from their collection of cDNAs and the clone names are provided in the Supplementary Information. DNA was purified with the Plasmid Maxi or QIAprep 96 Turbo Miniprep kits (Qiagen), and always had an A260/A280 greater than 1.7.
Microarray printing
A robotic arrayer (PixSys 5500; Cartesian Technologies) equipped with stealth pins (ArrayIt SMP4; Telechem) was used to print a plasmid DNA/gelatin solution contained in a 384-well plate onto CMT GAPS glass slides (2549, Corning). The pins deposited about 1-nl volumes 400
m apart using a 25–50-ms pin-down-slide time in a 55% relative humidity environment. Printed slides can be stored at 4 °C or at room temperature in a vacuum dessicator. Storage for long periods of time (>3 months) has not appreciably affected performance. Preparation of aqueous gelatin solution is important and is as follows. 0.2% gelatin (w/v) (G-9391; Sigma) was dissolved in MilliQ water by heating and gentle swirling in a 60 °C water bath for 15 min. The solution was cooled slowly to room temperature and filtered through a 0.45-
m cellular acetate membrane and stored at 4 °C. We diluted concentrated solutions of DNA in the gelatin solution to keep the gelatin concentration >0.17%. Unless otherwise specified, final plasmid DNA concentrations were 0.033
g
l-1. For the Genestorm clones, the final DNA concentration was 0.040
g
l-1.
Reverse transfection of microarrays
We used the Effectene transfection kit (301425; Qiagen) as follows. In a 1.5-ml microcentrifuge tube, 16
l enhancer was added to 150
l EC buffer, mixed, and pre-incubated for 5 min at room temperature. 25
l Effectene lipid was added, mixed and the entire volume pipetted onto a 40
20-mm cover well (PC200; Grace Bio-Labs). A slide with the printed side down was placed on the cover well such that the solution covered the entire arrayed area while also creating an airtight seal. After a 10–20-min incubation, the cover well was prised off the slide with forceps and the transfection reagent removed carefully by vacuum aspiration. The slide was placed printed side up in a 100
100
10 mm square tissue culture dish and 1
107 actively growing HEK293T cells in 25 ml medium (DMEM with 10% IFCS (inactivated fetal calf serum), 50 units ml-1 penicillin and 50
g ml-1 streptomycin) were poured into the dish. Three slides can be transfected side by side in this fashion. The cells grew on the slide for 40 h before fixing for 20 min at room temperature in 3.7% paraformaldehyde/4.0% sucrose in PBS. We have also tested other commonly used mammalian cells lines, such as HeLa and A549 cells, and obtained similar results but with transfection efficiencies of 30–50% of those obtained with HEK293T cells.
Immunofluorescence
For immunofluorescence staining, the cells were fixed as above, permeabilized in 0.1% Triton X-100 in PBS for 15 min and probed with primary and secondary antibodies as described21. Primary mouse monoclonal antibodies (500
l per slide) were used at the following concentrations: 1:500 anti-HA ascites fluid (BaBCo), 2
g ml-1 anti-Myc 9E-10 (Calbiochem), 2
g ml-1 anti-V5 (Invitrogen), and 10
g ml-1 4G10 anti-phosphotyrosine (Upstate Biotechnologies). Primary rabbit polyclonal antibodies (500
l per slide) were used at the following concentrations: 0.7
g ml-1 anti-p38 pTGpY (Promega), 0.7
g ml-1 anti-JNK pTPpY (Promega), and 0.5
g ml-1 anti-ERK1/2pTEpY (NEB). Secondary antibodies were Cy3-labelled anti-mouse or anti-rabbit antibodies produced in donkeys (Jackson Immunoresearch) and were used at 3.1
g ml-1.
Laser scanning and fluorescence microscopy
We imaged microarrays at a resolution of 5
m with a laser fluorescence scanner (ScanArray 5000; GSI Lumonics). GFP and Cy3 emission was measured separately after sequential excitation of the two fluorophores. To obtain images at cellular resolution, cells were photographed with a conventional fluorescence microscope. To measure fluorescence intensity, cell clusters were photographed and the signal intensity quantified with Image Quant (Fuji). All images were pseudocoloured and superimposed using Photoshop 5.5 (Adobe Systems).
FK506 and SCH23390 autoradiography
We added 5 nM dihydro-FK506 [propyl-3H] (NEN) to the cell medium containing the microarrays for 1 h before rinsing once with PBS at room temperature and fixation. When used, rapamycin was added at 1
M for 30 min before FK506 addition. Before autoradiography, slides were processed for anti-Myc immunofluorescence, scanned at 5-
m resolution and photographed using a fluorescence microscope. We then exposed slides to tritium-sensitive film (Hyperfilm, Amersham) for 4 days. Autoradiographical emulsion was performed as described by the manufacturer (EM-1; Amersham). Image Quant (Fuji) was used to quantify 0.01-mm2 areas of autoradiograms. For the dopamine receptor experiments, 1 nM SCH23390 [N-methyl-3H] (NEN) was added to the cell medium containing the microarrays for 1 h before rinsing once with PBS at room temperature and fixation. Before processing for autoradiography as above, we photographed GFP fluorescence using a fluorescence microscope.


