Remove or preserve gaps, spaces, numbers, and special characters from sequences
Sequence Gap Cleaner is a tool that removes unwanted characters from DNA, RNA, or protein sequences. It can remove spaces, alignment gaps (- and .), numbers, and line breaks while preserving FASTA headers. This is essential when cleaning sequences from alignment files, formatted text, or when preparing sequences for analysis tools that don't accept special characters.
Quick guide to clean your sequences:
This tool is useful when you need to:
Sample sequence with gaps and spaces:
ATC GAT CGA---TCG ATC...GAT
123 456 789Contains spaces, gaps (- and .), and numbers.
Cleaned sequence:
ATCGATCGATCGATCGATAll spaces, gaps, and numbers removed.
Q: Does this preserve FASTA headers?
A: Yes, FASTA headers (lines starting with >) are always preserved.
Q: What characters count as gaps?
A: The tool removes hyphens (-) and periods (.) which are standard gap characters in alignments.
Q: Can I keep line breaks?
A: Yes, uncheck "Remove line breaks" to preserve original formatting.