Scientific data tools
Deterministic utilities for validating, summarizing, and transforming sequence data. Built for quick inspection and preprocessing, with larger workflows available on demand.
Sequence QC & stats
Inspect structure, counts, and high-level sequence properties.
Sequence stats
StatsExplore general FASTA/FASTQ sequence or read summaries with interactive length plots, filterable per-record tables, and exports.
Assembly FASTA statistics
AssemblyInspect assembly-focused contig or scaffold metrics with cumulative span, Nx curve, length distribution, and GC% plots.
FASTA/FASTQ format validator
ValidateValidate FASTA and FASTQ file structure with strict, first-error checks. Verifies headers, record boundaries, and FASTQ records.
FASTA/FASTQ processing
Prepare, split, filter, extract, and reformat sequence files.
FASTQ → FASTA
ConvertConvert strict FASTQ to FASTA by dropping quality scores while preserving read headers and sequences.
FASTA ID deduplicator
CleanFind duplicated FASTA record IDs, remove duplicates (keep first/longest/shortest), or rename duplicates to unique IDs.
FASTA length filter
FilterFilter FASTA records by sequence length using minimum and/or maximum thresholds. Keeps or removes matching records and returns filtered FASTA plus a summary.
FASTA splitter
SplitSplit FASTA into multiple smaller files by records per file or by target number of output files. Returns a ZIP archive with split FASTA files and a summary report.
FASTA header extractor
ExtractExtract FASTA headers or record IDs as plain text. Returns one header per line with optional deduplication and numbering.
Sequence line wrapper
FormatWrap sequence lines to a fixed width for FASTA or plain sequence input. Preserves FASTA headers and rewrites sequence lines deterministically.
Composition & nucleotide metrics
Calculate GC content, frequencies, and windowed nucleotide metrics.
GC content calculator
CompositionCalculate GC content for nucleotide sequences from FASTA or plain sequence input. Reports GC%, counted bases, and ambiguous characters.
GC sliding window
WindowedCalculate GC% across DNA or RNA sequences using sliding windows. Returns positional GC% rows, per-sequence plots, and exports.
Nucleotide frequency
CompositionCalculate nucleotide frequencies from FASTA or plain sequence input. Reports counts and percentages for A, C, G, T/U and ambiguous characters.
Conversion, translation & ORFs
Transform nucleotide sequences and inspect coding potential.
Reverse complement
TransformCompute reverse-complement for DNA sequences (IUPAC-aware). Supports FASTA or raw sequence.
DNA ↔ RNA converter
ConvertConvert DNA to RNA or RNA to DNA by replacing T ↔ U while preserving FASTA structure.
DNA → Protein translation
TranslateTranslate DNA or RNA sequences to protein using standard or alternative NCBI genetic codes. Supports FASTA input, 6-frame translation, and protein output across selectable reading frames.
ORF finder
InteractiveFind ORFs in DNA or RNA, inspect a strand/frame map, and export ORF tables plus peptide or nucleotide FASTA.