Differential binding cell-SELEX method to identify cell-specific aptamers using high-throughput sequencing

Pleiko, Karlis; Saulite, Liga; Parfejevs, Vadims; Miculis, Karlis; Vjaters, Egils; Riekstina, Una

doi:10.1038/s41598-019-44654-w

Article
Open access
Published: 31 May 2019

Scientific Reports volume 9, Article number: 8142 (2019)

Abstract

Aptamers have in recent years emerged as a viable alternative to antibodies. High-throughput sequencing (HTS) has revolutionized aptamer research by increasing the number of reads from a few (using Sanger sequencing) to millions (using an HTS approach). Despite the availability and advantages of HTS compared to Sanger sequencing, there are only 50 aptamer HTS sequencing samples available on public databases. HTS data in aptamer research are primarily used to compare sequence enrichment between subsequent selection cycles. This approach does not take full advantage of HTS because the enrichment of sequences during selection can be due to inefficient negative selection when using live cells. Here, we present a differential binding cell-SELEX (systematic evolution of ligands by exponential enrichment) workflow that adapts the FASTAptamer toolbox and bioinformatics tool edgeR, which are primarily used for functional genomics, to achieve more informative metrics about the selection process. We propose a fast and practical high-throughput aptamer identification method to be used with the cell-SELEX technique to increase the aptamer selection rate against live cells. The feasibility of our approach is demonstrated by performing aptamer selection against a clear cell renal cell carcinoma (ccRCC) RCC-MF cell line using the RC-124 cell line from healthy kidney tissue for negative selection.

Introduction

Aptamers are short (20–100 nt) oligonucleotides that, contrary to most other functional nucleic acids, bind specific molecular targets due to their folded three-dimensional (3D) structures¹. Most aptamers are developed for therapeutic or diagnostic purposes^2,3. Several aptamer candidates are currently being tested in clinical trials to treat age-related macular degeneration⁴, Duchenne muscular dystrophy⁵, chronic lymphocytic leukaemia⁶ and other illnesses⁷.

Aptamer-based diagnostic assays have a great potential to become point-of-care diagnostics: they are affordable, sensitive, specific, user-friendly, robust and can be performed outside a laboratory or hospital. Several commercial aptamer diagnostic platforms have entered the market in recent years⁷.

OTA-Sense aptamer-based technology to detect ochratoxin, a mycotoxin and potential carcinogen in agricultural products, has been developed by Neoventures Biotechnology Inc. An aptamer-based detection system for aflatoxin is currently marketed as AflaSense. The company is developing similar diagnostic applications for other major mycotoxins including zearalenone, fumonisin and deoxynivalenol⁸.

ApolloDx’s aptamer-based diagnostic platform for food safety testing is marketed by CibusDx. The technology is based on test strips with aptamer-based APOLLOMER™ probes that bind specific targets of foodborne and waterborne pathogens, toxins and viruses present in the test sample⁹.

The SOMAScan and SOMAmer aptamer array platforms marketed by SomaLogic use modified aptamers with multiplexed proteomics technology enabling high throughput screening of multiple biomarkers in limited sample volumes (150 μl). With SOMAmer technology, more than 1,305 human proteins at less than pg levels in body fluids can be detected¹⁰.

The OLIGOBIND©Thrombin activity assay, marketed by Sekisui Diagnostics, GmbH, is a novel oligonucleotide-based enzyme capture methodology that accurately measures thrombin levels through an aptamer-based enzyme-capture fluorescent assay^11,12.

AptoCyto, marketed by AptSci, is a line of aptamer products developed for flow cytometry applications. AptoCyto technology uses magnetic bead-based cell isolation kits that can efficiently isolate CD-31, EGFR, HGFR, ICAM-2, VEGFR-2 or HER-2 positive cells^7,13.

The aptamer technology market was estimated at USD 1.0 Bn in 2016, with a compound annual growth rate of over 20% projected from 2017 to 2025¹⁴. In comparison, the monoclonal antibodies market size was USD 85.4 Bn in 2015 and is expected to reach USD 138.6 Bn by 2024¹⁵.

Since the initial description of an aptamer selection method termed SELEX (systematic evolution of ligands by exponential enrichment)¹⁶, several aptamer selection methods have been developed, among them cell-SELEX¹⁷, in which live cells are used. The first high-throughput SELEX (HT-SELEX) experiment, a variation of the SELEX process that uses high-throughput sequencing (HTS) methods instead of Sanger sequencing, was described by Zhao et al.¹⁸. The subsequent adaptation of HTS methods for aptamer research further improved selection procedure outcomes^19,20.

Subsequently, RNA aptamer selection against the active and inactive conformation of β₂ adrenoreceptor described by Kahsai et al. employed HTS methods to characterize the fold change enrichment of particular sequences during the selection against each individual target in parallel¹⁹. However, in cell-SELEX, this approach might be of very limited use due to the high diversity of protein targets on the cell surface that would cause the enrichment of non-specifically bound sequences if no negative selection were performed.

Several research teams have developed tools to analyse HTS data in aptamer selection, notably FASTAptamer, a toolkit developed by Alam et al. that can be used to track the evolutionary trajectory during the SELEX process of individual oligonucleotide sequences²⁰. Recently, AptaSUITE, a comprehensive bioinformatics framework that includes most of the previously published functionalities of different tools (data pre-processing, sequence clustering, motif identification and mutation analysis) has been introduced²¹.

RNA-sequencing (RNA-seq) experiments are used to quantify the differential expression of gene transcripts between samples²². We speculated that it might be possible to adapt data analysis tools currently used for RNA-seq to be used with HT cell-SELEX experiments. During the cell-SELEX experiment, the goal is to select aptamers that bind to the target cells in larger numbers compared to the control cells, making the experimental design similar to RNA-seq analysis. Here, we provide a differential binding cell-SELEX method that can be used to identify differentially abundant aptamers on the surface of target cells and negative control cells during the cell-SELEX experiments and to calculate the statistical significance of these differences. Similar approaches for protein SELEX, in which parallel aptamer selection is performed followed by HTS, have been previously performed to identify broad-spectrum aptamers against the primate lentiviral family of reverse transcriptases²³. A comparative binding analysis of a protein SELEX approach in combination with HTS, followed by a detailed characterization of aptamer sequences, has been used to identify high relevance aptamers against serpin plasminogen activator inhibitor-1²⁴. However, the risk of enrichment of non-specific sequences during cell-SELEX experiments is substantially higher than in protein SELEX due to the inherent complexity of the target in cell-SELEX. While in protein SELEX there is usually only one target, in cell-SELEX the target protein is unknown and selection is performed in live cells with all non-specifically expressed proteins present. Consequently, a statistical analysis of binding differences between the target and negative control cells included in the differential binding cell-SELEX could be a valuable approach for aptamer selection against complex targets. Our analysis includes the use of edgeR²⁵, a common tool for analysis in RNA-seq experiments. 
edgeR employs a negative binomial distribution to identify differentially expressed genes. Our workflow also uses the FASTAptamer²⁰ toolbox to estimate read counts, cutadapt²⁶ to remove the constant primer binding regions of aptamers and a bespoke R script for reuse. Moreover, we combine our approach with a sequence enrichment analysis already used by other groups for aptamer selection to identify the most relevant sequences.

Results

Aptamer selection

To identify ccRCC-specific aptamers, the initial randomized oligonucleotide library was subjected to cell-SELEX for 11 selection cycles using the RCC-MF cell line as the target cell line and RC-124 cells as a negative control cell line to reduce nonspecific binding. Cell-specific aptamer sequence enrichment was monitored using flow cytometry (Guava 8HT) after the 4^th, 8^th and 11^th selection cycles. After the 4^th and 8^th selection cycles, there was only a slight difference between the binding of the initial randomized oligonucleotide library and that of the enriched libraries. After the 11^th selection cycle, we observed binding of the enriched library to more than 95% of cells. However, the observed binding was nonspecific: the selected aptamer sequences bound to both the RC-124 (Fig. 1a) and RCC-MF (Fig. 1b) cell lines.

During further selection and process optimization by changing the incubation time, library concentration, FBS concentration and temperature, complete selectivity against the RCC-MF cell line was not achieved up to the 11^th cycle.

We concluded that complete selectivity against ccRCC cells was not achieved. However, the low-concentration binding measured for the enriched library from the 11^th pool (Fig. 2) did not exclude the possibility that the library contained ccRCC cell-specific sequences. To explore the differences that might exist within the library, we developed a differential binding cell-SELEX approach.

Differential binding cell-SELEX

The differential binding cell-SELEX process (Fig. 3) was performed after the 4^th and 11^th selection cycles. After incubation with identically split aptamer libraries and the retrieval of bound sequences to both RC-124 and RCC-MF, we performed two subsequent overlap PCR reactions and confirmed that both constructs after the 1^st overhang PCR and 2^nd overhang PCR are of expected size (Figs 4, S1). Quantification of the final libraries was performed using the NEBNext Library Quant Kit (New England BioLabs) to quantify only those sequences that have flow cell adapters attached to them (Table 1). Overall, our sequencing results also confirm the technical feasibility of the cell-SELEX experiments performed based on the developed protocols (Table 1).

Table 1 Aptamer concentration determined by qPCR before sequencing and sequencing reads per sample for the sequenced aptamer libraries.

Full size table

Data analysis for differential binding cell-SELEX

Sequencing was performed after the 4^th and 11^th selection cycles. The reads per sample after the initial quality filtration, adapter and constant primer binding region removal and length filtration (40 nt) varied from 169,024 to 1,142,856 (Table 1).
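
The clean-up step described above (constant region removal followed by a 40 nt length filter) can be illustrated with a minimal Python sketch. This is not the authors' pipeline, which used cutadapt; the function name extract_insert is hypothetical, and exact string matching is a simplifying assumption (cutadapt tolerates mismatches, this sketch does not). The two constant regions are taken from the library design given in the Methods.

```python
# Constant primer binding regions from the library design in the Methods.
FWD = "ATCCAGAGTGACGCAGCA"   # 5' constant region
REV = "TGGACACGGTGGCTTAGT"   # 3' constant region
INSERT_LEN = 40              # length of the randomized region


def extract_insert(read):
    """Return the randomized region of a read, or None if the read is rejected.

    Assumption: constant regions must match exactly (no mismatch tolerance).
    """
    start = read.find(FWD)
    if start == -1:
        return None
    insert_start = start + len(FWD)
    end = read.find(REV, insert_start)
    if end == -1:
        return None
    insert = read[insert_start:end]
    # Length filter: keep only inserts of exactly 40 nt.
    return insert if len(insert) == INSERT_LEN else None
```

Reads that lack either constant region, or whose insert is not exactly 40 nt long, are dropped, which mirrors the order of operations stated in the text.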

Combining all replicates from both samples after data clean-up, we identified 3,627,938 unique sequences within the 4^th selection cycle experiment and 503,107 unique sequences in the 11^th selection cycle experiment. After filtering the reads by edgeR to remove the sequences that had lower counts per million (CPM) than two per sample and that were present in less than two replicates, we were left with 1,015 unique sequences for the 4^th cycle aptamers and 35,859 sequences for the 11^th cycle aptamers.
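
As a rough illustration of the abundance filter, the following Python sketch computes CPM per sample and keeps a sequence only if it reaches the CPM threshold in at least two samples. This is one reading of the filter described above, written in plain Python rather than edgeR; the function names, the use of "greater than or equal to" at the threshold and the example library sizes are assumptions.

```python
def cpm(counts, library_sizes):
    """Counts per million for one sequence across samples."""
    return [c * 1_000_000 / n for c, n in zip(counts, library_sizes)]


def keep_sequence(counts, library_sizes, min_cpm=2.0, min_samples=2):
    """True if at least `min_samples` samples reach `min_cpm`.

    Assumption: thresholds of CPM >= 2 in >= 2 samples, following the text.
    """
    values = cpm(counts, library_sizes)
    return sum(v >= min_cpm for v in values) >= min_samples


# Hypothetical library sizes (reads per sample) for four samples.
sizes = [1_000_000, 1_000_000, 1_000_000, 1_000_000]
print(keep_sequence([3, 2, 0, 0], sizes))  # passes in two samples
print(keep_sequence([5, 0, 0, 0], sizes))  # present in one sample only
```

Working in CPM rather than raw counts makes the threshold comparable between samples with different sequencing depths, which matters here because reads per sample varied several-fold.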

For differential binding data analysis (Fig. 5), we further used the selected sequences to run the edgeR package, statistical analysis software that is used to estimate differential expression from RNA-seq data. The resulting data were adjusted for multiple comparisons using the built-in Benjamini-Hochberg approach and filtered by removing all sequences that had an absolute log₂ fold change (logFC) below two or an adjusted p-value above 0.0001.
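
The adjustment and cut-off step can be sketched in a few lines of Python. This is a generic implementation of the Benjamini-Hochberg procedure plus the two thresholds named above, not the edgeR code the authors ran; the function names and the example rows are hypothetical, and the raw p-values would in practice come from the edgeR test.

```python
def benjamini_hochberg(p_values):
    """Return Benjamini-Hochberg adjusted p-values in the original order."""
    m = len(p_values)
    order = sorted(range(m), key=lambda i: p_values[i])
    adjusted = [0.0] * m
    running_min = 1.0
    # Walk from the largest p-value to the smallest, enforcing monotonicity.
    for rank in range(m, 0, -1):
        i = order[rank - 1]
        running_min = min(running_min, p_values[i] * m / rank)
        adjusted[i] = running_min
    return adjusted


def select_hits(rows, min_abs_logfc=2.0, max_adj_p=1e-4):
    """rows: list of (sequence_id, logFC, raw_p). Return ids passing both cut-offs."""
    adj = benjamini_hochberg([p for _, _, p in rows])
    return [sid for (sid, lfc, _), q in zip(rows, adj)
            if abs(lfc) >= min_abs_logfc and q <= max_adj_p]


# Hypothetical example rows: (id, logFC, raw p-value).
rows = [("a", 3.0, 1e-9), ("b", -2.5, 1e-8), ("c", 0.5, 1e-9), ("d", 4.0, 0.2)]
print(select_hits(rows))
```

The absolute value of logFC is used so that sequences preferring either cell line are retained; the sign then indicates which cell line a sequence favours.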

Comparing differential binding datasets using the 4^th selection cycle enriched library, we were unable to identify any significantly differentially bound sequences based on the counts per million (CPM) of each sequence and a fold change (FC) comparison between the two cell lines (Fig. 6a). Most of the sequences bound from the 4^th cycle enriched library had a low abundance. However, an analysis of the 11^th selection cycle enriched library identified 195 statistically significant differentially bound sequences according to the same criteria as described for the first experiment (multiple comparison adjusted p-value < 0.0001, |log₂FC| > 2) (Fig. 6b). 178 sequences had log₂FC < −2 compared to 17 sequences that had log₂FC > 2 (Supplementary Table 1), indicating that more cell type-specific sequences were identified for the control RC-124 cells than for the target RCC-MF cells (Fig. 6c).

Enrichment analysis identified 720 unique sequences that have log₂(meanCPM@11^th cycle/meanCPM@4^th cycle) > 5 or sequence enrichment in CPM terms 32 times from the 4^th to 11^th cycle (Supplementary Table 2). We further combined differential binding results that resulted in 17 unique sequences with 720 sequences obtained from enrichment analysis. We identified only 6 sequences that were present in both datasets (Supplementary Table 3) as the most likely candidates to specifically target ccRCC cells (if the log₂ cut off value is decreased to 5, it is possible to identify 6 sequences that can be found in both the differential binding analysis and enrichment analysis results). We also ordered all unique sequences that were present in the 11^th pool by CPM and calculated the log₂ enrichment value between the 4^th and 11^th cycle (Supplementary Table 4). Log₂ enrichment values for the top 10 most abundant sequences ranged from 4.7 to 6.2, and 7 of 10 sequences had a Log₂ value above 5, meaning that these sequences are also included in the enrichment analysis results. These 10 most abundant sequences contribute to approximately 27% of all sequencing reads from the 11^th pool. However, none of the top 10 most abundant sequences passed the statistical significance threshold or FC threshold in the differential binding analysis.
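
The enrichment criterion and its combination with the differential binding results amount to a log₂ ratio test followed by a set intersection. The Python sketch below illustrates this under stated assumptions: the function names and the toy CPM values are hypothetical, and sequences with no reads in the earlier cycle are simply skipped, which may differ from how the authors handled them.

```python
import math


def log2_enrichment(mean_cpm_late, mean_cpm_early):
    """log2 of the ratio between late-cycle and early-cycle mean CPM."""
    return math.log2(mean_cpm_late / mean_cpm_early)


def enriched(cpm_early, cpm_late, cutoff=5.0):
    """Ids whose log2(late / early) exceeds the cut-off.

    Assumption: sequences absent (or zero) in the early cycle are skipped.
    """
    return {s for s, late in cpm_late.items()
            if cpm_early.get(s, 0) > 0
            and log2_enrichment(late, cpm_early[s]) > cutoff}


# Toy mean CPM values per sequence id (hypothetical).
cpm_4th = {"s1": 1.0, "s2": 10.0, "s3": 2.0}
cpm_11th = {"s1": 64.0, "s2": 100.0, "s3": 40.0, "s4": 500.0}
differential_hits = {"s1", "s2"}  # ids from a differential binding analysis

candidates = enriched(cpm_4th, cpm_11th) & differential_hits
print(candidates)
```

A cut-off of log₂ > 5 corresponds to more than a 2⁵ = 32-fold increase in CPM, which is the equivalence stated in the text.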

Differential binding results confirm that it is possible to use edgeR within our pipeline to identify the most likely candidate molecules for further testing.

Functional testing of selected lead aptamers

For lead aptamer testing using flow cytometry, we chose 11 sequences identified by different data analysis methods (DB, differential binding; EN, enrichment and MB, most abundant). The top three sequences in each data analysis method were chosen. Differential binding cell-SELEX analysis alone sorted by CPM identified sequences DB-1, DB-2 and DB-3. Differential binding cell-SELEX together with enrichment analysis sorted by log₂FC identified the DB-3, DB-4 and DB-5 sequences. Enrichment analysis between the 4^th and 11^th pools by log₂CPM enrichment identified sequences EN-1, EN-2 and EN-3. The three most abundant sequences bound to the RCC-MF cells were MB-1, MB-2 and MB-3 (Table 2). We estimated a population shift as a mode of fluorescence intensity (MFI) for each aptamer sample (n = 3). The data were corrected by subtracting the MFI from a sample that was incubated with a randomized starting library (MFI_{lead-sequence} − MFI_{random-library}).

Table 2 Lead sequences used for the confirmatory cell binding test by flow cytometry.

Full size table

Corrected MFIs were compared with the t-test (significance defined as p < 0.05, n = 3) using GraphPad Prism to determine if our identified sequences altogether bind more to RCC-MF cells than to RC-124 cells. Three sequences (DB-4, EN-2, MB-3) were confirmed to be differentially bound using flow cytometry by comparing MFIs (Figs 7, S2). While MB-3, identified as the 3^rd most abundant sequence, was significantly (p = 0.002) differentially bound, it was targeted towards RC-124 cells. The EN-2 sequence was identified using enrichment analysis and was statistically significantly (p = 0.013) binding to RCC-MF cells. DB-4 was significantly (p = 0.019) more bound to RCC-MF cells and was identified through a combined differential binding cell-SELEX and enrichment approach.

The binding of selected aptamer sequences identified through differential binding cell-SELEX (DB-1, DB-2, DB-3, DB-4) and of one of the most abundant sequences (MB-3) was further characterized by flow cytometry analysis. The sequence/randomized library fluorescence intensity ratios at 15 nM, 31 nM, 62 nM, 125 nM, 250 nM, 500 nM and 1000 nM concentrations are plotted in Fig. 8. A ratio below one was observed for sequences DB-1 and DB-2 when incubated with the RC-124 cell line, indicating that these sequences bind to the control cell line less than the randomized library does. More detailed comparisons for each sequence binding to both cell lines are included in Fig. S3.

Discussion

A recent review on aptamer discovery notes that there are 141 entries of aptamer selection against live cells as of 2017. For comparison, proteins as targets have 584 entries and small molecules have 234 research entries¹. This is not surprising considering the advanced technological procedure involved in the cell-SELEX method compared to protein or small molecule SELEX. Several methods have been developed in recent years to improve the success rate of cell-SELEX; for example, HT-SELEX¹⁸, FACS-SELEX²⁷ and cell-internalization SELEX²⁸. An HTS adaptation for aptamer sequencing has been described as one of the most fundamental changes to aptamer selection technology²⁹.

The main goal achieved in this research is the development of a differential binding cell-SELEX method. This method can identify cell type-specific aptamer sequences from cell-SELEX selection pools that would not be selected by other cell-SELEX methods and thus would remain overlooked by the investigators.

Currently, the most often used analysis for aptamer finding using HTS data includes enrichment analysis, which means a comparison of the abundance of one particular sequence at the beginning of the SELEX procedure to the abundance of the same sequence after the SELEX procedure. Enrichment analysis can identify a large number of oligonucleotides with very similar log₂ enrichment values, as can be seen in our results (Supplementary Table 2). However, enrichment analysis is rarely useful for cell-SELEX because of the high possibility to enrich non-specific sequences. Using enrichment analysis with a cut-off value of log₂ > 5, we identified 720 sequences to be further tested. However, when the same sequencing dataset was submitted for differential binding analysis using edgeR, we identified 17 sequences that were more abundant on the surface of RCC-MF cells than on RC-124 cells.

Enrichment analysis identified one sequence (EN-2) that was significantly (p = 0.013) more bound to target RCC-MF cells, as also confirmed by flow cytometry (Fig. 7). We were able to confirm using flow cytometry that another sequence (DB-4), identified with a combined differential binding cell-SELEX and enrichment analysis, was significantly (p = 0.019) more bound to the RCC-MF cells. Importantly, DB-4 was found among the 720 sequences identified using enrichment analysis, but only as the 528^th most enriched sequence. This provides evidence that our approach can be used to identify lead aptamers that most likely would be lost during enrichment analysis.

MB-3, one of the most abundant sequences in the dataset, showed significant binding to RC-124 cells. MB-3 was identified neither in the enrichment analysis results nor in the differential binding cell-SELEX results. However, seven of the 10 most abundant aptamer sequences after the cell-SELEX process were enriched above the set cut-off value log₂ > 5 and thus did appear in the enrichment analysis results. None of these sequences appeared in the differential binding results because they did not pass the statistical significance test applied to logFC. These observations are in line with previous statements that the most abundant aptamer sequences are not necessarily the best binders³⁰. This supports the value of the differential binding approach for excluding the non-specifically enriched sequences during the cell-SELEX procedure.

After noticing high guanine abundance in several of the identified lead sequences, we searched for G-quadruplex (G4) forming motifs in the sequences using QuadBase2³¹ TetraplexFinder with high stringency (G₃L₁₋₃) settings. Non-overlapping G4 motifs were identified in three (MB-3, DB-1, DB-2) out of the 11 sequences that we previously tested using flow cytometry. It is also worth noting that the DB-1 and DB-2 sequences were outliers: unlike the other tested sequences, they had below-library fluorescence intensity when binding to the negative control RC-124 cell line. The G4 motifs are labelled in the aptamer structures predicted by mfold³² (Fig. 9) using 4 °C as the folding temperature, with 5 mM Mg²⁺ and 157 mM Na⁺ concentrations. All three sequences contain the G4 motifs in their randomized regions. Further sequence shortening might be of interest to determine the role of the G4 motifs in these sequences.
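
A G₃L₁₋₃ motif (four runs of three guanines separated by loops of one to three nucleotides) can be approximated with a regular expression. The Python sketch below is only an approximation of such a search and is not the QuadBase2 TetraplexFinder algorithm, whose exact matching rules may differ; the function name find_g4 and the example sequence are hypothetical.

```python
import re

# Four G-runs of length 3, separated by loops of 1 to 3 nucleotides.
G4_PATTERN = re.compile(r"G{3}[ACGT]{1,3}G{3}[ACGT]{1,3}G{3}[ACGT]{1,3}G{3}")


def find_g4(sequence):
    """Return (start, motif) pairs for non-overlapping G4-like motifs."""
    return [(m.start(), m.group())
            for m in G4_PATTERN.finditer(sequence.upper())]


# Hypothetical example sequence, not one of the lead aptamers.
print(find_g4("ATGGGTGGGAGGGTTGGGCA"))
```

Because re.finditer scans left to right and resumes after each match, the reported motifs are non-overlapping, which matches the way the motifs are described in the text.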

Differential binding cell-SELEX uses edgeR to compare how all sequences that can be found in the final enriched aptamer library interact with the control and target cells; it is also used to estimate the statistical significance of these differences. There are several bioinformatics tools available to analyse the statistical significance of the differential expression for RNA-seq data^25,33,34. To the best of our knowledge, none of these tools have been applied to estimate differentially bound aptamers on the cell surface. edgeR was chosen because it is compatible with the existing data analysis workflows in R³⁵. A combination of enrichment analysis and the differential binding approach provides an algorithm to choose target sequences for further analysis.

Altogether, we demonstrate a combined analysis pipeline that can be used to identify lead aptamers from low binding selectivity aptamer libraries after cell-SELEX experiments. We propose a fast and practical high throughput aptamer identification method to be used with the cell-SELEX technique to increase the successful aptamer selection rate against live cells.

A higher number of sequencing reads during differential binding cell-SELEX could even further increase the likelihood to identify low abundance, but differentially bound sequences specific to cells of interest. Sequences that were present only in one replicate from each selection pool were discarded. After the 4^th selection cycle, only a few sequences were present in more than one sequencing replicate (Fig. 6a) compared to the 11^th cycle (Fig. 6b). An increased number of reads would cover more diverse libraries and would make it possible to identify differentially bound aptamers using fewer selection cycles.

The cell-SELEX design described in this research uses commercially available human RCC-MF and RC-124 cells both as a target and a negative control. We are the first to use these cell lines for aptamer selection with a cell-SELEX approach. However, it could be more suitable to use patient-matched primary cells isolated from the tumour site and adjacent healthy kidney tissue within a few passages after isolation, when cells are most likely to represent the diversity found in clinical settings³⁶.

The differential binding cell-SELEX method developed here can be used to accelerate aptamer selection based on HTS analysis. Additional information from differential binding cell-SELEX reduces the time needed to identify aptamers. This can lead to the broader use of the cell-SELEX technique not only to identify aptamers against cell lines but also against primary cells isolated from patient samples.

We conclude that the differential binding cell-SELEX method can be used to characterize not only sequence enrichment between selection cycles, but also to select aptamer sequences that selectively bind to the target and control cells. We demonstrate the feasibility of our approach by showing cell-line specific aptamer identification against the ccRCC cell line RCC-MF as well as the RC-124 cell line from healthy kidney tissue.

Material and Methods

Cell culturing and buffer solutions

The kidney epithelial cell line RC-124 (Cell Lines Service GmbH), established from non-tumour kidney tissue, and the carbonic anhydrase 9 (CA9)-positive ccRCC cell line RCC-MF (Cell Lines Service GmbH), established from renal clear cell carcinoma pT2, N1, Mx/GII-III (lung metastasis), were used for the cell-SELEX process as negative control and target cells, respectively. RCC-MF cells were cultured in RPMI 1640 (Gibco), and RC-124 cells were cultured in McCoy’s 5A medium (Sigma-Aldrich). Both culture media were supplemented with 10% foetal bovine serum (FBS) (Gibco), 50 U/ml penicillin and 50 µg/ml streptomycin (Gibco). The cells were propagated at 37 °C, 5% CO₂ and 95% relative humidity.

Washing buffer containing 4.5 mg/ml D-glucose and 5 mM MgCl₂ in phosphate-buffered saline (PBS) (Sigma-Aldrich, D8537, contains K⁺ at 4.45 mM, Na⁺ at 157 mM concentrations) was filtered through a 0.22 µm syringe filter (Corning). The binding buffer contained 4.5 mg/ml D-glucose, 5 mM MgCl₂, 1 mg/ml bovine serum albumin (Sigma-Aldrich) and 0.1 mg/ml baker’s yeast tRNA (Sigma-Aldrich) in phosphate-buffered saline and was filtered through a 0.22 µm syringe filter.

Oligonucleotide library

A randomized oligonucleotide library with a 40 nt randomized region flanked on both sides by 18 nt constant primer binding regions (5′-ATCCAGAGTGACGCAGCA-N40-TGGACACGGTGGCTTAGT-3′) was adapted from Sefah et al.³⁷. A FAM label was attached to one primer (5′-FAM-ATCCAGAGTGACGCAGCA-3′) for flow cytometry monitoring, and biotin was attached at the end of the second primer for ssDNA preparation after each cell-SELEX cycle (5′-biotin-ACTAAGCCACCGTGTCCA-3′). Oligonucleotides were ordered from Metabion or Invitrogen.
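
The relationship between the library design and the two primers can be checked with a short Python sketch: the FAM-labelled primer is identical to the 5′ constant region, and the biotinylated primer is the reverse complement of the 3′ constant region. The helper names below (reverse_complement, random_library_member) are hypothetical, and the random sequence generator is only an illustration of the N40 design, not a model of the synthesized library.

```python
import random

COMPLEMENT = str.maketrans("ACGT", "TGCA")

FWD_CONST = "ATCCAGAGTGACGCAGCA"      # 5' constant region (FAM primer sequence)
REV_CONST = "TGGACACGGTGGCTTAGT"      # 3' constant region
BIOTIN_PRIMER = "ACTAAGCCACCGTGTCCA"  # biotin-labelled primer
N_RANDOM = 40                         # length of the randomized region


def reverse_complement(seq):
    """Reverse complement of a DNA sequence written 5' to 3'."""
    return seq.translate(COMPLEMENT)[::-1]


def random_library_member(rng):
    """One illustrative library molecule: constant-N40-constant."""
    insert = "".join(rng.choice("ACGT") for _ in range(N_RANDOM))
    return FWD_CONST + insert + REV_CONST


print(reverse_complement(REV_CONST) == BIOTIN_PRIMER)
print(len(random_library_member(random.Random(0))))
```

Each full-length library molecule is therefore 18 + 40 + 18 = 76 nt long.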

Cell-SELEX procedure

The cell-SELEX protocol was adapted from Sefah et al.³⁷. The aptamer library was prepared in binding buffer at a 14 µM concentration for the first selection cycle, heated at 95 °C for 5 min, folded on ice for at least 15 min, and added to fully confluent RCC-MF cells in a 100-mm Petri plate (Sarstedt) that had been washed twice with washing buffer before the addition of the library. In the first selection cycle, the initial library was incubated for 1 hour on ice with RCC-MF cells only, without negative selection on RC-124 cells. After incubation with the oligonucleotide library, the cells were washed with 3 ml of washing buffer for 3 min and collected with a cell scraper after adding 1 ml of DNase free water. DNase free water was used to collect sequences only for the first cycle; in subsequent cycles, binding buffer was used to retrieve the bound sequences. After collection, the cell suspension was heated at 95 °C for 10 min to remove the bound sequences from the target proteins and centrifuged at 13,000 g; the supernatant containing the selected aptamer sequences was collected.

In subsequent selection cycles, the aptamer library was prepared at a 500 nM concentration and incubated with negative selection cell line RC-124 beforehand. Solution containing unbound sequences was collected and applied to the RCC-MF cell line after washing the cells as described previously. As the selection cycle was increased, a number of modifications were made to the selection procedure: after the 4^th selection cycle, 60 mm plates were used instead of 100 mm plates, an increasing concentration of FBS (10–20%) was added to the library after folding without changing the final concentration of the aptamer library, the wash volume was increased to 5 ml, the wash time was increased to 5 min and the number of wash times was increased to 3 after incubation.

PCR optimization

After each selection cycle, PCR optimization was performed to determine the optimal number of PCR cycles. For PCR optimization and preparative PCR cycling, the conditions involved a 12 min initial activation at 95 °C, followed by repeated denaturation for 30 sec at 95 °C, annealing at 56.3 °C and elongation at 72 °C.

ssDNA preparation

After preparative PCR, ssDNA was acquired using agarose-streptavidin (GE Healthcare) binding to a biotin-labelled strand, and FAM-labelled ssDNA was eluted with 0.2 M NaOH (Sigma-Aldrich). Desalting was done using NAP5 gravity flow columns (GE Healthcare), the concentration was determined measuring UV absorbance (NanoQuant Plate, M200 Pro, Tecan), and the samples were concentrated using vacuum centrifugation (Eppendorf).

Monitoring of aptamer binding by flow cytometry

The enriched aptamer pool, randomized starting library and selected lead aptamers were prepared in binding buffer at 1 µM concentrations, heated to 95 °C for 5 min and then put on ice for at least 15 min. RC-124 and RCC-MF cells were washed with PBS two times and dissociated using Versene solution (Gibco). Then, 50 µL of the enriched aptamer library, starting library, lead aptamers or binding buffer were added to 50 µL of the cell suspension (2.5 × 10⁵ cells per sample), followed by the addition of 11 µL of FBS to each sample to a final concentration of 225 nM. The samples were incubated for 35 min on ice. After incubation, the samples were washed two times with 500 µL of binding buffer and resuspended in 500 µL of binding buffer. The samples were passed through a 40 µm cell strainer before flow cytometry analysis. Flow cytometry data were acquired using a Guava EasyCyte 8HT flow cytometer and analysed using the ExpressPro software (Merck Millipore). Flow cytometry data were analysed using FlowJo software, version 10 (FlowJo). 10,000 gated events were acquired for each sample.

Concentration-dependent binding assays for sequences DB-1, DB-2, DB-3, DB-4 and MB-3 were performed by preparing each sequence in binding buffer at 2 µM, heating at 95 °C for 5 min and folding on ice for at least 15 min. Subsequent manipulations were performed the same way as for single concentration monitoring, with the exception of preparing variable final concentrations (15 nM, 31 nM, 62 nM, 125 nM, 250 nM, 500 nM, 1000 nM) of each sequence in the cell suspension. Flow cytometry data were acquired using Amnis® ImageStream®XMark II (Luminex). Up to 5,000 single cell gated events were collected for each sample. Data were acquired using the INSPIRE® software and analysed using the IDEAS® software (Luminex).

Differential binding

Aptamer pools after the 4^th and 11^th selection cycle were prepared in binding buffer, heated and folded as described for the cell-SELEX procedure at a 1 ml volume with a final concentration of 500 nM. 500 µL were added to both the RC-124 cells and RCC-MF cells grown on 60 mm plates in appropriate cell culture media up to 95% confluence. The aptamer pools were added to the RC-124 and RCC-MF cells and incubated for 30 min on ice, then the cells were washed two times and collected using a cell scraper, heated immediately at 95 °C for 10 min, and centrifuged for 5 min at 13,000 g. The supernatants containing the bound sequences from both cell lines were frozen at −20 °C. Sequencing was done to compare the differential binding profiles of the enriched oligonucleotide libraries obtained from both cell lines.

Sequencing

The samples for sequencing were prepared by performing two consecutive overlap PCRs as described in the 16S metagenomic sequencing library preparation protocol³⁸. The first overlap PCR used primers (5′-TCGTCGGCAGCGTCAGATGTGTATAAGAGACAG-ATCCAGAGTGACGCAGCA-3′ and 5′-GTCTCGTGGGCTCGGAGATGTGTATAAGAGACAG-ACTAAGCCACCGTGTCCA-3′) that are complementary to the constant regions of the randomized oligonucleotide library and carry an added overhang that includes an Illumina platform-specific sequence. Conditions for the first overlap PCR included 12 min of initial activation, followed by cycles of 30 sec at 95 °C, 30 sec at 56.3 °C and 3 min at 72 °C. The cycle number was optimized for each sample to reduce non-specific amplification. Afterwards, PCR products from one sample were pooled together, concentrated using the DNA Clean & Concentrator (Zymo Research) and run on a 3% agarose gel at 110 V for 40 min; the band at 143 bp was cut out and purified using the Zymoclean Gel DNA Recovery kit (Zymo Research).

The second overlap PCR used primers that were partly complementary to the previously added overhang and contained the adapters that attach oligonucleotides to the flow cell as well as the i5 and i7 indexes (5′-CAAGCAGAAGACGGCATACGAGAT-[i7 index]-GTCTCGTGGGCTCGG-3′ and 5′-AATGATACGGCGACCACCGAGATCTACAC-[i5 index]-TCGTCGGCAGCGTC-3′). Conditions for the second overlap PCR were 12 min at 95 °C, followed by 5 cycles of denaturation at 98 °C for 10 sec, annealing at 63 °C for 30 sec and elongation at 72 °C for 3 min. PCR products from one sample were then pooled together, concentrated using the DNA Clean & Concentrator (Zymo Research) and run on a 3% agarose gel at 110 V for 45 min; the band at 212 bp was cut out and purified using the Zymoclean Gel DNA Recovery kit (Zymo Research). The concentrations of the final products were determined by qPCR using the NEBNext Library Quant Kit for Illumina (New England BioLabs).
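The expected band sizes of 143 bp and 212 bp can be checked by adding up the primer and library lengths. The Python sketch below does this from the primer sequences given above. Two values are assumptions rather than statements from the protocol: the 40 nt random region (taken from the length filter used in the sequencing data analysis) and an index length of 8 nt.

```python
# Primer sequences as printed in the protocol; the hyphen separates the
# Illumina overhang (left) from the library-binding part (right).
PCR1_FWD = "TCGTCGGCAGCGTCAGATGTGTATAAGAGACAG-ATCCAGAGTGACGCAGCA"
PCR1_REV = "GTCTCGTGGGCTCGGAGATGTGTATAAGAGACAG-ACTAAGCCACCGTGTCCA"
# Second-round primer parts that extend the product. The 3' parts of these
# primers anneal to the existing overhang and add no new length.
PCR2_I7_ADAPTER = "CAAGCAGAAGACGGCATACGAGAT"
PCR2_I5_ADAPTER = "AATGATACGGCGACCACCGAGATCTACAC"

RANDOM_REGION_NT = 40  # assumption: length kept by the read filter
INDEX_NT = 8           # assumption: 8 nt i5 and i7 indexes

def split_primer(primer):
    """Return (overhang length, binding-region length) for a hyphenated primer."""
    overhang, binding = primer.split("-")
    return len(overhang), len(binding)

fwd_overhang, fwd_binding = split_primer(PCR1_FWD)
rev_overhang, rev_binding = split_primer(PCR1_REV)

library_nt = fwd_binding + RANDOM_REGION_NT + rev_binding
pcr1_product = fwd_overhang + library_nt + rev_overhang
pcr2_product = (pcr1_product + len(PCR2_I7_ADAPTER) + len(PCR2_I5_ADAPTER)
                + 2 * INDEX_NT)

print(library_nt, pcr1_product, pcr2_product)  # 76 143 212
```

Under these assumptions the totals match the bands excised after each PCR round.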

Sequencing was done on the Illumina MiSeq platform using MiSeq 150-cycle Reagent Kit v3 in single read mode for 150 cycles. 9% of PhiX was added to the run. Sequencing was done at the Estonian Genome Center, Tartu, Estonia.

Sequencing data analysis

Sequencing reads were filtered and demultiplexed. Constant primer binding regions were removed, and sequences that are longer or shorter than 40 nt were discarded using cutadapt²⁶. Counting of recurring sequences was done using fastaptamer-count, and matching of the sequences found in replicate samples was done using fastaptamer-enrich²⁰.
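As an illustration of what the trimming and counting steps achieve, the following Python sketch removes the constant regions, keeps only 40 nt inserts and reports counts with reads per million. It is a simplified stand-in, not the cutadapt or FASTAptamer implementation. It assumes that each read starts with the forward constant region and that the 3′ constant region appears as the reverse complement of the reverse primer; the function names are hypothetical.

```python
from collections import Counter

FWD_CONST = "ATCCAGAGTGACGCAGCA"   # 5' constant region (forward primer part)
REV_PRIMER = "ACTAAGCCACCGTGTCCA"  # reverse primer, library-binding part

def reverse_complement(seq):
    """Reverse complement of an uppercase DNA sequence."""
    return seq.translate(str.maketrans("ACGT", "TGCA"))[::-1]

# 3' constant region as it would appear in a forward-oriented read (assumption).
REV_CONST = reverse_complement(REV_PRIMER)

def extract_random_region(read, length=40):
    """Return the insert between the constant regions, or None if rejected."""
    if not read.startswith(FWD_CONST):
        return None
    end = read.find(REV_CONST, len(FWD_CONST))
    if end == -1:
        return None
    insert = read[len(FWD_CONST):end]
    return insert if len(insert) == length else None

def count_sequences(reads):
    """Map each retained insert to (read count, reads per million)."""
    inserts = (extract_random_region(r) for r in reads)
    counts = Counter(i for i in inserts if i is not None)
    total = sum(counts.values())
    return {seq: (n, n * 1e6 / total) for seq, n in counts.items()}

# Toy input: three copies of one insert, one copy of another, one 36 nt insert.
insert_a = "ACGT" * 10
insert_b = "TTGCA" * 8
reads = ([FWD_CONST + insert_a + REV_CONST] * 3
         + [FWD_CONST + insert_b + REV_CONST]
         + [FWD_CONST + "ACGT" * 9 + REV_CONST])
table = count_sequences(reads)
print(table[insert_a], table[insert_b])
```

The 36 nt insert is discarded by the length filter, so the reads per million are computed over the four retained reads.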

The differential expression analysis tool edgeR²⁵ was then used to analyse the sequencing data. Replicate sequencing samples (n = 3) from the differential binding cell-SELEX experiments after the 4th and 11th selection cycles were combined, and low-abundance sequences (those that did not exceed 2 reads per million in at least 2 sequencing samples) were filtered out. Normalization was performed based on the number of reads present in each library. Differential binding was estimated using the edgeR function for identifying significantly differentially expressed genes, with the following thresholds: log₂ fold change (log₂FC) > 2 and p-value < 0.0001, adjusted for multiple comparisons using the Benjamini & Hochberg³⁹ method.
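The abundance filter and the multiple-comparison adjustment can be sketched in plain Python as follows. This is not edgeR, which is an R package, and it does not reproduce edgeR's statistical tests; it only illustrates library-size scaling to counts per million, one reading of the abundance filter (more than 2 per million in at least 2 samples, an assumption) and the Benjamini & Hochberg adjustment. All function names are hypothetical.

```python
def counts_per_million(counts_by_sample):
    """{sample: {sequence: raw count}} -> same structure scaled to CPM."""
    cpm = {}
    for sample, counts in counts_by_sample.items():
        library_size = sum(counts.values())
        cpm[sample] = {s: n * 1e6 / library_size for s, n in counts.items()}
    return cpm

def abundant_sequences(cpm, min_cpm=2.0, min_samples=2):
    """Keep sequences exceeding min_cpm in at least min_samples samples."""
    sequences = set().union(*(c.keys() for c in cpm.values()))
    return {s for s in sequences
            if sum(c.get(s, 0.0) > min_cpm for c in cpm.values()) >= min_samples}

def benjamini_hochberg(p_values):
    """Return Benjamini-Hochberg adjusted p-values in the input order."""
    m = len(p_values)
    order = sorted(range(m), key=lambda i: p_values[i])
    adjusted = [0.0] * m
    running_min = 1.0
    # Walk from the largest p-value down, enforcing monotonicity.
    for rank in range(m, 0, -1):
        i = order[rank - 1]
        running_min = min(running_min, p_values[i] * m / rank)
        adjusted[i] = running_min
    return adjusted

# Toy data: three replicate libraries of one million reads each.
raw = {f"rep{k}": {"seqA": 5, "seqB": 1, "other": 999994} for k in (1, 2, 3)}
kept = abundant_sequences(counts_per_million(raw))
adj = benjamini_hochberg([0.01, 0.04, 0.03, 0.005])
print(sorted(kept), adj)
```

In this toy input, seqB stays at 1 per million in every replicate and is therefore dropped.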

Enrichment analysis was done separately, using all reads from the 4th-pool and 11th-pool RCC-MF cell binding experiments. For each sequence, we calculated the log₂ enrichment value (the mean counts per million (CPM) for the sequence at the 11th cycle divided by the mean CPM for the same sequence at the 4th cycle) and kept the sequences with a log₂FC > 6 of enrichment between the 4th and 11th cycles.
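The enrichment metric described here is a log₂ ratio of mean CPM values. A minimal Python sketch is given below, assuming per-sequence lists of replicate CPM values as input. The function names, the data layout and the choice to skip sequences with a zero mean at either cycle are assumptions made for illustration.

```python
import math

def mean(values):
    values = list(values)
    return sum(values) / len(values)

def enriched(cpm_cycle4, cpm_cycle11, threshold=6.0):
    """Return {sequence: log2 enrichment} for sequences above the threshold.

    Both arguments map a sequence to its replicate CPM values. Only sequences
    present at both cycles with a non-zero mean are considered (assumption).
    """
    hits = {}
    for seq in cpm_cycle4.keys() & cpm_cycle11.keys():
        early, late = mean(cpm_cycle4[seq]), mean(cpm_cycle11[seq])
        if early == 0 or late == 0:
            continue  # the log2 ratio is undefined for a zero mean
        value = math.log2(late / early)
        if value > threshold:
            hits[seq] = value
    return hits

# Toy data: s1 rises 128-fold (log2 = 7), s2 rises 4-fold (log2 = 2).
cycle4 = {"s1": [1.0, 1.0, 1.0], "s2": [10.0, 10.0, 10.0]}
cycle11 = {"s1": [128.0, 128.0, 128.0], "s2": [40.0, 40.0, 40.0]}
hits = enriched(cycle4, cycle11)
print(hits)  # {'s1': 7.0}
```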

After these steps, we identified the sequences common to the differential binding results and the sequence enrichment results to identify the most likely lead aptamer sequences. (The RNotebooks used for the 4th cycle differential binding analysis and the 11th cycle differential binding analysis, including enrichment analysis, can be found at https://github.com/KarlisPleiko/apta.)

Accession numbers

Sequencing data are available at SRA under accession number PRJEB28411.

Data Availability

edgeR is available as a Bioconductor package (http://bioconductor.org/packages/edgeR/), FASTAptamer was downloaded from GitHub (https://github.com/FASTAptamer/FASTAptamer) and cutadapt was installed using Bioconda⁴⁰. RNotebooks for data analysis using tidyverse⁴¹ are available at https://github.com/KarlisPleiko/apta.

References

1. Dunn, M. R., Jimenez, R. M. & Chaput, J. C. Analysis of aptamer discovery and technology. Nat. Rev. Chem. 1, 0076 (2017).
2. Pereira, R. L. et al. Aptamers: novelty tools for cancer biology. Oncotarget 9, 26934–26953 (2018).
3. Zhou, J. & Rossi, J. Aptamers as targeted therapeutics: Current potential and challenges. Nat. Rev. Drug Discov. 16, 181–202 (2017).
4. A Safety and Efficacy Study of E10030 (Anti-PDGF Pegylated Aptamer) Plus Lucentis for Neovascular Age-Related Macular Degeneration (NCT01089517).
5. A Phase II Open-label, Multicenter Extension Study to Assess the Long-term Safety and Efficacy of Vamorolone in Boys with Duchenne Muscular Dystrophy (DMD) (EudraCT No: 2016-004263-38).
6. NOX-A12 in Combination With Bendamustine and Rituximab in Relapsed Chronic Lymphocytic Leukemia (CLL) (NCT01486797).
7. Kaur, H., Bruno, J. G., Kumar, A. & Sharma, T. K. Aptamers in the Therapeutics and Diagnostics Pipelines. Theranostics 8, 4016–4032 (2018).
8. Gregory, P. Commercialization of an aptamer-based diagnostic test. IVD Technology 31–37 (2012).
9. Bauer, J. Apollodx Licenses Technology For Food Safety. Available at, http://apollodx.com/apollodx-licenses-technology-food-safety/.
10. Candia, J. et al. Assessment of Variability in the SOMAscan Assay. Sci. Rep. 7, 14248 (2017).
11. Kalra, P., Dhiman, A., Cho, W. C., Bruno, J. G. & Sharma, T. K. Simple Methods and Rational Design for Enhancing Aptamer Sensitivity and Specificity. Front. Mol. Biosci. 5, 41 (2018).
12. Königsbrügge, O. et al. A new measure for in vivo thrombin activity in comparison with in vitro thrombin generation potential in patients with hyper- and hypocoagulability. Clin. Exp. Med. 17, 251–256 (2017).
13. Cell Separation Kit (AptoCyto). Available at, http://www.aptsci.com/product/product_1.html.
14. Aptamer Market (Material - Nucleic acid Aptamer, Peptide Aptamer; Selection Technique - SELEX Technique and Others; Application - Research, Diagnostics, Therapeutics) - Global Industry Analysis, Size, Share, Growth, Trends, and Forecast 2017–2025. Available at, https://www.transparencymarketresearch.com/pressrelease/aptamers-market.htm (2018).
15. Monoclonal Antibodies (mAbs) Market Analysis By Source (Chimeric, Murine, Humanized, Human), By Type of Production, By Indication (Cancer, Autoimmune, Inflammatory, Infectious, Microbial, Viral Diseases), By End-use (Hospitals, Research, Academic Institut (2016).
16. Ellington, A. D. & Szostak, J. W. In vitro selection of RNA molecules that bind specific ligands. Nature 346, 818 (1990).
17. Hicke, B. J. et al. Tenascin-C Aptamers Are Generated Using Tumor Cells and Purified Protein. J. Biol. Chem. 276, 48644–48654 (2001).
18. Zhao, Y., Granas, D. & Stormo, G. D. Inferring binding energies from selected binding sites. PLoS Comput. Biol. 5 (2009).
19. Kahsai, A. W. et al. Conformationally selective RNA aptamers allosterically modulate the β2-Adrenoceptor. Nat. Chem. Biol. 12, 709–716 (2016).
20. Alam, K. K., Chang, J. L. & Burke, D. H. FASTAptamer: A bioinformatic toolkit for high-throughput sequence analysis of combinatorial selections. Mol. Ther. - Nucleic Acids 4, 1–10 (2015).
21. Hoinka, J., Backofen, R. & Przytycka, T. M. AptaSUITE: A Full-Featured Bioinformatics Framework for the Comprehensive Analysis of Aptamers from HT-SELEX Experiments. Mol. Ther. - Nucleic Acids 11, 515–517 (2018).
22. Werner, T. Next generation sequencing in functional genomics. Brief. Bioinform. 11, 499–511 (2010).
23. Alam, K. K. et al. Poly-Target Selection Identifies Broad-Spectrum RNA Aptamers. Mol. Ther. - Nucleic Acids 13, 605–619 (2018).
24. Dupont, D. M., Larsen, N., Jensen, J. K., Andreasen, P. A. & Kjems, J. Characterisation of aptamer-target interactions by branched selection and high-throughput sequencing of SELEX pools. Nucleic Acids Res. 43 (2015).
25. Robinson, M. D., McCarthy, D. J. & Smyth, G. K. edgeR: a Bioconductor package for differential expression analysis of digital gene expression data. Bioinformatics 26, 139–140 (2010).
26. Martin, M. Cutadapt removes adapter sequences from high-throughput sequencing reads. EMBnet.journal 17, 10 (2011).
27. Mayer, G. et al. Fluorescence-activated cell sorting for aptamer SELEX with cell mixtures. Nat. Protoc. 5, 1993–2004 (2010).
28. Thiel, W. H. et al. Cell-Internalization SELEX: Method for Identifying Cell-Internalizing RNA Aptamers for Delivering siRNAs to Target Cells. In Methods in molecular biology (Clifton, N. J.) 1218, 187–199 (2015).
29. Ozer, A., Pagano, J. M. & Lis, J. T. New technologies provide quantum changes in the scale, speed, and success of SELEX methods and aptamer characterization. Mol. Ther. - Nucleic Acids 3, 1–18 (2014).
30. Hoinka, J. et al. Large scale analysis of the mutational landscape in HT-SELEX improves aptamer discovery. Nucleic Acids Res. 43, 5699–5707 (2015).
31. Dhapola, P. & Chowdhury, S. QuadBase2: web server for multiplexed guanine quadruplex mining and visualization. Nucleic Acids Res. 44, W277 (2016).
32. Zuker, M. Mfold web server for nucleic acid folding and hybridization prediction. Nucleic Acids Res. 31, 3406–3415 (2003).
33. Trapnell, C. et al. Differential gene and transcript expression analysis of RNA-seq experiments with TopHat and Cufflinks. Nat. Protoc. 7, 562–578 (2012).
34. Ritchie, M. E. et al. limma powers differential expression analyses for RNA-sequencing and microarray studies. Nucleic Acids Res. 43, e47–e47 (2015).
35. R Core Team. R: A Language and Environment for Statistical Computing (2018).
36. Lobo, N. C. et al. Efficient generation of patient-matched malignant and normal primary cell cultures from clear cell renal cell carcinoma patients: Clinically relevant models for research and personalized medicine. BMC Cancer 16, 1–15 (2016).
37. Sefah, K., Shangguan, D., Xiong, X., O’Donoghue, M. B. & Tan, W. Development of DNA aptamers using cell-SELEX. Nat. Protoc. 5, 1169–1185 (2010).
38. Illumina. 16S Metagenomic Sequencing Library Preparation. Illumina.com (2013).
39. Benjamini, Y. & Hochberg, Y. Controlling The False Discovery Rate - A Practical And Powerful Approach To Multiple Testing. J. R. Stat. Soc., Ser. B 57, 289–300 (1995).
40. Grüning, B. et al. Bioconda: sustainable and comprehensive software distribution for the life sciences. Nat. Methods 15, 475–476 (2018).
41. Wickham, H. tidyverse: Easily Install and Load the ‘Tidyverse’ (2017).

Acknowledgements

K.P., U.R. and E.V. conceived and designed the project. K.P., L.S., V.P. and K.M. carried out the experiments. K.P. and U.R. wrote the paper. All authors read and approved the final manuscript. This work was supported by the University of Latvia Foundation [Grant Number 2182].

Author information

Authors and Affiliations

Faculty of Medicine, University of Latvia, Riga, LV-1004, Latvia
Karlis Pleiko, Liga Saulite, Vadims Parfejevs & Una Riekstina
Pauls Stradins Clinical University Hospital, Riga, LV-1002, Latvia
Karlis Miculis & Egils Vjaters

Corresponding author

Correspondence to Karlis Pleiko.

Ethics declarations

Competing Interests

The authors declare no competing interests.

Additional information

Publisher’s note: Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Supplementary information

Supplementary data

Supplementary Table 1

Supplementary Table 2

Supplementary Table 3

Supplementary Table 4

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons license and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/.

About this article

Cite this article

Pleiko, K., Saulite, L., Parfejevs, V. et al. Differential binding cell-SELEX method to identify cell-specific aptamers using high-throughput sequencing. Sci Rep 9, 8142 (2019). https://doi.org/10.1038/s41598-019-44654-w

Received: 20 December 2018
Accepted: 21 May 2019
Published: 31 May 2019
DOI: https://doi.org/10.1038/s41598-019-44654-w
