Correlation between base composition of deoxyribonucleic acid and amino acid composition of protein. Proc. Natl Acad. Sci. USA
47, 1141–1149 (1961)
Gu, X., Hewett-Emmett, D. & Li, W.-H.
Directional mutational pressure affects the amino acid composition and hydrophobicity of proteins in bacteria. Genetica
103, 383–391 (1998)
Knight, R. D., Freeland, S. J. & Landweber, L. F.
A simple model based on mutation and selection explains trends in codon and amino-acid usage and GC composition within and across genomes. Genome Biol.
4, Research0010.1 (2001)
Wang, H.-C., Singer, G. A. C. & Hickey, D. A.
Mutational bias affects protein evolution in flowering plants. Mol. Biol. Evol.
21, 90–96 (2004)
Trifonov, E. N.
The triplet code from first principles. J. Biomol. Struct. Dyn.
22, 1–11 (2004)
Miller, S. L.
Which organic compounds could have occurred on the prebiotic earth?
Cold Spring Harb. Symp. Quant. Biol.
52, 17–27 (1987)
Cronin, J. R. & Pizzarello, S.
Amino acids in meteorites. Adv. Space Res.
3, 5–18 (1983)
Brooks, D. J., Fresco, J. R., Lesk, A. M. & Singh, M.
Evolution of amino acid frequencies in proteins over deep time: Inferred order of introduction of amino acids into the genetic code. Mol. Biol. Evol.
19, 1645–1655 (2002)
Brooks, D. J. & Fresco, J. R.
Increased frequency of cysteine, tyrosine, and phenylalanine residues since the last universal ancestor. Mol. Cell. Proteom.
1, 125–131 (2002)
Muller, T. & Vingron, M.
Modeling amino acid replacement. J. Comp. Biol.
7, 761–776 (2000)
Goldman, N. & Whelan, S.
A novel use of equilibrium frequencies in models of sequence evolution. Mol. Biol. Evol.
19, 1821–1831 (2002)
Veerassamy, S., Smith, A. & Tillier, E. R. M.
A transition probability model for amino acid substitutions from blocks. J. Comp. Biol.
10, 997–1010 (2003)
Henikoff, S. & Henikoff, J. G.
Amino acid substitution matrices. Adv. Prot. Chem.
54, 73–97 (2000)
A simple sequentially rejective multiple test procedure. Stand. J. Stat.
6, 65–70 (1979)
Feng, D. F. & Doolittle, R. F.
Progressive alignment of amino acid sequences and construction of phylogenetic trees from them. Methods Enzymol.
266, 368–382 (1996)
Problems with parsimony in sequences of biased base composition. J. Mol. Evol.
47, 686–690 (1998)
Prediction of deleterious human alleles. Hum. Mol. Genet.
10, 591–597 (2001)
Tice, M. M. & Lowe, D. R.
Photosynthetic microbial mats in the 3,416-Myr-old ocean. Nature
431, 549–552 (2004)
Rat Genome Sequencing Consortium. Genome sequence of the Brown Norway rat yields insights into mammalian evolution. Nature
428, 493–535 (2004)
Zdobnov, E. M.
Comparative genome and proteome analysis of Anopheles gambiae and Drosophila melanogaster
298, 149–159 (2002)
Sorhannus, U. & Fox, M.
Synonymous and nonsynonymous substitution rates in diatoms: A comparison between chloroplast and nuclear genes. J. Mol. Evol.
48, 209–212 (1999)
Ochman, H. & Wilson, A. C.
Evolution in bacteria—evidence for a universal substitution rate in cellular genomes. J. Mol. Evol.
26, 74–86 (1987)
Clark, M. A., Moran, N. A. & Baumann, P.
Sequence evolution in bacterial endosymbionts having extreme base compositions. Mol. Biol. Evol.
16, 1586–1598 (1999)
Smith, N. G. C. & Eyre-Walker, A.
Adaptive protein evolution in Drosophila
415, 1022–1024 (2002)
Fitch, W. M. & Markowitz, E.
An improved method for determining codon variability in a gene and its application to the rate of fixation of mutations in evolution. Biochem. Genet.
4, 579–593 (1970)
Kondrashov, A. S., Sunyaev, S. & Kondrashov, F. A.
Dobzhansky-Muller incompatibilities in protein evolution. Proc. Natl Acad. Sci. USA
99, 14878–14883 (2002)
Clark, A. G.
Inferring nonneutral evolution from human-chimp-mouse orthologous gene trios. Science
302, 1960–1963 (2003)
Kellis, M., Patterson, N., Endrizzi, M., Birren, B. & Lander, E. S.
Sequencing and comparison of yeast species to identify genes and regulatory elements. Nature
423, 233–234 (2003)
Tatusov, R. L., Koonin, E. V. & Lipman, D. J.
A genomic perspective on protein families. Science
278, 631–637 (1997)
Thompson, J. D., Higgins, D. G. & Gibson, T. J.
CLUSTAL W: improving the sensitivity of progressive multiple sequence alignment through sequence weighting, position-specific gap penalties and weight matrix choice. Nucleic Acids Res.
22, 4673–4680 (1994)