Filter DNA

Clean DNA sequences by removing non-DNA characters, spaces, and digits. Support for IUPAC codes and custom replacement options.

Allowed Characters

Basic DNA (GATC only)
Basic DNA + N (GATCN)
IUPAC DNA codes

Replace Invalid With

Remove (delete)
Replace with N
Replace with -

Case Conversion

Uppercase
Lowercase
Keep original

Remove Options

Remove spaces
Remove digits
Remove line breaks

What is Filter DNA?

Filter DNA removes non-DNA characters from sequences, ensuring clean data for downstream analysis. Supports basic DNA, IUPAC codes, and custom filtering options.

How to Use This Filter DNA Tool

Clean your sequences instantly:

  1. Paste or upload your DNA sequences
  2. Choose allowed characters (basic, IUPAC)
  3. Select replacement or removal options
  4. Results filter automatically as you type

When to Use

This tool is useful when you need to:

  • Clean raw sequence data from various sources
  • Remove numbers and spaces from sequences
  • Standardize sequences for analysis tools
  • Convert between different DNA notations

Example Input

Messy sequence with numbers and spaces:

1 ATG CCA 123 GTA
45 TGC NNN xyz ATC

Try with the Example button.

Example Output

Cleaned sequence:

ATGCCAGTATGCNNNATC

Non-DNA characters removed or replaced.

FAQ

Q: What are IUPAC DNA codes?
A: Extended codes including R, Y, S, W, K, M, B, D, H, V, N for ambiguous bases.

Q: Does it preserve FASTA headers?
A: Yes, FASTA headers (lines starting with >) are preserved.