Calculate identity and similarity between aligned DNA or protein sequences with substitution matrix analysis
Identity measures the percentage of exact matches between aligned sequences, while similarity considers biochemically similar amino acids. For proteins, substitution matrices like BLOSUM62 define which amino acids are considered similar based on evolutionary conservation.
Quick analysis in three steps:
This tool is essential for:
Sample aligned sequences:
ATCG-TCGATCG
ATCGATCGA-CG Sequences must be the same length including gaps (-).
Analysis results include:
Identity: 83.3%
Similarity: 91.7%
Matches: 10/12
Gaps: 2 Visual alignment shows matches (*), gaps (-), and mismatches (.).
Q: What's the difference between identity and similarity?
A: Identity counts exact matches only, while similarity includes biochemically similar amino acids.
Q: Do I need pre-aligned sequences?
A: Yes, sequences must be aligned and the same length including gaps.