Once scientists determined that messenger RNA (mRNA) served as a copy of each gene's DNA and specified the sequence of amino acids in proteins, they immediately had many more questions about the process of protein formation. Specifically, these researchers knew that proteins are made from 20 different amino acids. They also knew that mRNA contains only four nucleotides: adenine (A), cytosine (C), guanine (G), and uracil (U). But how exactly could these four nucleotides code for all 20 amino acids? The answer to this question turned out to be simpler than one might expect.
Determining the Number of Nucleotides Per Amino Acid
Right away, researchers knew that the genetic code had to be more complex than one nucleotide per amino acid. After all, if this were the case, a person's DNA could code for only four different amino acids. In fact, even two nucleotides per amino acid (i.e., a doublet code) could not account for 20 amino acids, because such a code provides only 16 possible doublets (four bases at each of two positions = 4 × 4 = 16). A code of three nucleotides per amino acid, by contrast, provides 4 × 4 × 4 = 64 possible triplets, more than enough to specify 20 amino acids.
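The counting argument above can be checked with a short Python sketch. The function names here (codeword_count, shortest_sufficient_length) are illustrative choices, not part of the original article; the code simply counts how many distinct code words a four-letter alphabet yields at each length and finds the shortest length that covers 20 amino acids.

```python
from itertools import product

BASES = "ACGU"      # the four mRNA nucleotides named in the text
AMINO_ACIDS = 20    # number of amino acids the code must specify


def codeword_count(length):
    """Number of distinct code words of the given length (4 ** length)."""
    return len(BASES) ** length


def shortest_sufficient_length(needed=AMINO_ACIDS):
    """Smallest code word length that yields at least `needed` code words."""
    length = 1
    while codeword_count(length) < needed:
        length += 1
    return length


# Enumerate every doublet explicitly to confirm the 4 x 4 count.
doublets = ["".join(pair) for pair in product(BASES, repeat=2)]

for n in (1, 2, 3):
    verdict = "enough" if codeword_count(n) >= AMINO_ACIDS else "too few"
    print(f"{n} nucleotide(s) per amino acid: {codeword_count(n)} code words ({verdict})")
```

Under these assumptions, one and two nucleotides per amino acid fall short of 20, and three is the shortest length that suffices.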
Figure 1: Distinct possibilities: Overlapping or non-overlapping genetic code?
Early researchers studying the genetic code had to determine whether the mRNA was read in a non-overlapping or an overlapping fashion. In a non-overlapping code, each sequential set of three nucleotides encodes one amino acid. In an overlapping code, each three-nucleotide code word begins one nucleotide after the start of the previous one.
© 2008 Nature Education All rights reserved.
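The difference between the two reading schemes in Figure 1 can be illustrated with a minimal Python sketch. The sequence "ACGUACGUA" and both function names are hypothetical, chosen only for illustration; no meaning is assigned to any triplet here.

```python
def non_overlapping_triplets(mrna):
    """Read the sequence in consecutive blocks of three nucleotides."""
    return [mrna[i:i + 3] for i in range(0, len(mrna) - 2, 3)]


def overlapping_triplets(mrna):
    """Read a new triplet starting at every successive nucleotide."""
    return [mrna[i:i + 3] for i in range(len(mrna) - 2)]


sequence = "ACGUACGUA"  # hypothetical nine-nucleotide mRNA fragment

print(non_overlapping_triplets(sequence))
print(overlapping_triplets(sequence))
```

For a nine-nucleotide fragment, the non-overlapping reading yields three triplets, while the overlapping reading yields seven. In the overlapping scheme, each triplet shares two nucleotides with its neighbor, which constrains which triplets can follow one another; this is the kind of difference that gave researchers testable predictions.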
Table 1: Did the code have commas or not?
A non-overlapping code provided scientists with predictions they could test.