Calculate statistical significance of sequence alignments. Convert between E-values, bit scores, and P-values using Karlin-Altschul statistics.
The E-value (Expectation value) represents the number of alignments with scores at least as good as the observed score that would be expected to occur by chance in a database search. Lower E-values indicate more significant matches.
Common significance thresholds:
Key differences:
Matrix-specific constants (BLOSUM62):
λ (lambda) ≈ 0.267
K ≈ 0.041
H (entropy) ≈ 0.68 bitsDifferent matrices have different parameters.
E-values increase linearly with database size. An alignment with E=0.001 in a database of 1 million sequences would have E=1 in a database of 1 billion sequences, but the bit score remains constant.
Q: Why do E-values change with database size?
A: Larger databases have more chances for random matches, increasing the expected number of hits by chance.
Q: What's a good E-value cutoff?
A: Typically 0.01 or 0.001, but it depends on your analysis goals and tolerance for false positives.