« Prev Next »
This page has been archived and is no longer updated
Even though nearly every cell in an organism's body contains the same set of genes, only a fraction of these genes are used in any given cell at any given time. It is this carefully controlled pattern of what is called "gene expression" that makes a liver cell different from a muscle cell, and a healthy cell different from a cancer cell. But how can researchers determine which genes are "turned on" and when?
The quantity of mRNA transcript for a single gene directly reflects how much transcription of that gene has occurred. Tracking of that quantity will therefore indicate how vigorously a gene is transcribed, or expressed. To visualize differences in the quantity of mRNA produced by different groups of cells or at different times, researchers often use the method known as a Northern blot. For this method, researchers must first isolate mRNA from a biological sample by exposing the cells within it to a protease, which is an enzyme that breaks down cell membranes and releases the genetic material in the cells. Next, the mRNA is separated from the DNA, proteins, lipids, and other cellular contents. The different fragments of mRNA are then separated from one another via gel electrophoresis (a technique that separates molecules by passing an electrical current through a gel medium containing the molecules) and transferred to a filter or other solid support using a technique known as blotting. To identify the mRNA transcripts produced by a particular gene, the researchers next incubate the sample with a short piece of single-stranded RNA or DNA (also known as a probe) that is labeled with a radioactive molecule. Designed to be complementary to mRNA from the gene of interest, the probe will bind to this sequence. Later, when the filter is placed against X-ray film, the radioactivity in the probe will expose the film, thereby making marks on it. The intensity of the resulting marks, called bands, tells researchers how much mRNA was in the sample, which is a direct indicator of how strongly the gene of interest is expressed (Figure 2).
Until recently, scientists studied gene expression by looking at only one or very few gene transcripts at a time. Thankfully, new techniques now make large-scale studies of gene expression possible. One such technique is SAGE (serial analysis of gene expression). A method for measuring the expression patterns of many genes at once, SAGE not only allows scientists to analyze thousands of gene transcripts simultaneously, but it also enables them to determine which genes are active in different tissues or at different stages of cellular development.
SAGE identifies and counts the mRNA transcripts in a cell with the help of short snippets of the genetic code, called tags. These tags, which are a maximum of 14 nucleotides long, enable researchers to match an mRNA transcript with the specific gene that produced it. In most cases, each tag contains enough information to uniquely identify a transcript. The name "serial analysis" refers to the fact that tags are read sequentially as a continuous string of information.
The basic steps of the SAGE technique are outlined below.To begin a SAGE analysis, researchers must first separate the mRNA in a sample from the other cellular contents. To do this, they attach long strips of thymine nucleotides to tiny magnetic beads. When researchers flush the contents of a cell over the beads, these thymine strips form complementary base pairs with the poly-A tails of the mRNA molecules. Thus, when the flushing process is complete, the mRNA transcripts from the sample are captured because they are attached to the magnetic beads, while the other contents of the cells flush past the beads and are discarded.
To begin the next portion of SAGE, the researchers use a cutting enzyme to slice off short segments of nucleotides, called tags, at designated positions in each cDNA molecule. Next, two tags from each cDNA are combined into a single unit. These tags then become the representative for the gene they came from, and they act as a unique identifier in the form of a stand-in. Without having to process the entire mRNA sequence thereafter, scientists can use these shorter tag sequences to keep track of whether a specific gene was expressed in mRNA form.
After the different tags have been made from each mRNA sequence, they are next linked together into long chains called concatemers. These concatemers therefore contain representatives of mRNAs from a group of genes. Linking the tags together in a concatemer is important, because it means that researchers will later be able to read thousands of tags at once during the analysis portion of the SAGE procedure.
Although the researchers now possess concatemers representing the genes expressed in the sample, they need multiple copies of these concatemers if they wish to run the molecules through a sequencing machine. Thus, just before sequencing, the concatemers are inserted into bacteria, and through their own replication process these bacteria make millions of copies of each concatemer chain. This step increases the volume of material, and it therefore ensures that there is a baseline amount of material necessary for a sequencing machine. After that, researchers use a sequencing machine to decode and read the long string of nucleotides in each chain.
Finally, a computer processes the data from the sequencing machine and compiles a list of tags. By comparing the tags to a sequence database, the researchers can identify the mRNA (and ultimately the gene) that each tag came from. By subsequently counting the number of times each tag is observed, the researchers can also estimate the degree to which a particular gene is expressed: the more often a tag appears, the greater the level of gene expression.
Compared to other techniques for measuring gene expression, SAGE offers a significant advantage because it measures the expression of both known and unknown genes. Sometimes, when analyzing SAGE data, computers cannot find matches for certain tags in their sequence databases. What does this mean? Interestingly, a lack of matches indicates that the mRNA used to produce these tags is associated with genes that have not been studied before. In this way, SAGE has been used to discover new genes involved in a variety of diseases.
In addition to Northern blot tests and SAGE analyses, there are several other techniques for analyzing gene expression. Most of these techniques, including microarray analysis and reverse transcription polymerase chain reaction (RT-PCR), work by measuring mRNA levels. However, researchers can also analyze gene expression by directly measuring protein levels with a technique known as a Western blot.
Further Exploration
Key Questions
eBooks
This page appears in the following eBook