Skip to content

Filtering options

Filtering limits processing to regions or motifs of interest, reducing both runtime and output size.

1. Reference regions

When working with reference-to-signal alignments, bases are processed only if they fall within specified regions.

There are three ways to provide reference regions:

Flag Description
--ref-regions <REF:START-END> Specify regions directly (1-based, inclusive). Multiple regions space-separated.
--bed-file <PATH> Provide regions from a BED file (0-based, start-inclusive, end-exclusive).
--positions-of-interest <REF:POS-HALFWINDOW> Define positions with flanking windows. Coordinates are 1-based. See Positions with windows.

Positions with windows

This mode targets a specific site plus a window around it, e.g.: ChrA:8-4

1-based index:          1 2 3 4 5 6 7 8 9 0 1 2 3 4 5
Ref sequence:           A C G T A G C T A A A G T C T
Region of interest:          |--------X--------|

2. Motifs

Motif filtering works for both query- and reference-to-signal alignments. Only alignment segments exactly matching a provided motif are processed. (Requires fishnet align output level 2 or 3 to include sequence information.)

There are two ways to provide motifs:

Flag Description
--motifs <MOTIF-1> ... Provide motifs directly. Named automatically (motif1, motif2, …).
--motifs-file <FASTA> Provide motifs via a FASTA file. Names are taken from the FASTA headers.

All motif matches within a read are processed.