Many sequence alignment programs use the BLOSUM62 score matrix to score pairs of aligned residues. Where did BLOSUM62 come from?
This is a preview of subscription content, access via your institution
Access options
Subscribe to this journal
Receive 12 print issues and online access
$209.00 per year
only $17.42 per issue
Buy this article
- Purchase on Springer Link
- Instant access to full article PDF
Prices may be subject to local taxes which are calculated during checkout
References
Henikoff, J.G. Amino acid substitution matrices from protein blocks. Proc. Natl. Acad. Sci. USA 89, 10915–10919 (1992).
Karlin, S. & Altschul, S.F. Methods for assessing the statistical significance of molecular sequence features by using general scoring schemes. Proc. Natl. Acad. Sci. USA 87, 2264–2268 (1990).
Altschul, S.F. Amino acid substitution matrices from an information theoretic perspective. J. Mol. Biol. 219, 555–565 (1991).
Author information
Authors and Affiliations
Supplementary information
Supplementary Notes
A program for taking a (possibly arbitrary) alignment score matrix and back-calculating the implied target frequencies pab. (DOC 81 kb)
Doing this requires solving for a nonzero lambda in: \sum_ab f_a f_b e{\lambda s_ab} = 1 and this is a good excuse to demo two methods of root-finding: bisection search and the Newton/Raphson method.
The program is ANSI C, and should compile on any machine with a C compiler: % cc -o lambda lambda.c -lm Any questions about this program should be addressed directly to the author.
Further study
Further study
You can download an ANSI C program for calculating the implicit target frequencies pab of a score matrix (see Supplementary Notes). The BLOSUM62 score matrix and its background frequencies are included as an example. The code also contains two basic methods of solving for roots of equations like the one for λ: the bisection method, and the Newton/Raphson method.
Rights and permissions
About this article
Cite this article
Eddy, S. Where did the BLOSUM62 alignment score matrix come from?. Nat Biotechnol 22, 1035–1036 (2004). https://doi.org/10.1038/nbt0804-1035
Issue Date:
DOI: https://doi.org/10.1038/nbt0804-1035
This article is cited by
-
Bridging drug discovery through hierarchical subtractive genomics against asd, trpG, and secY of pneumonia causing MDR Staphylococcus aureus
Molecular Genetics and Genomics (2024)
-
BitterMatch: recommendation systems for matching molecules with bitter taste receptors
Journal of Cheminformatics (2022)
-
Co-optimization of therapeutic antibody affinity and specificity using machine learning models that generalize to novel mutational space
Nature Communications (2022)
-
Proteome-wide prediction of bacterial carbohydrate-binding proteins as a tool for understanding commensal and pathogen colonisation of the vaginal microbiome
npj Biofilms and Microbiomes (2021)
-
Evolution, structure and emerging roles of C1ORF112 in DNA replication, DNA damage responses, and cancer
Cellular and Molecular Life Sciences (2021)