EMBOSS: dotmatcher

dotmatcher

Function

Description

dotmatcher generates a dotplot from two input sequences. The dotplot is an intuitive graphical representation of the regions of similarity between two sequences. All positions from the first input sequence are compared with all positions from the second input sequence using a specified substitution matrix. The two sequences are the axes of the rectangular dotplot. Wherever there is "similarity" between a position from each sequence a dot is plotted. The threshold conditions for "similarity" are defined by the user.

Algorithm

All positions from the first input sequence are compared with all positions from the second input sequence and scored, using the specified substitution matrix. This produces a matrix of scores from which local regions of similarity (corresponding to diagonals in the dotplot) are identified. A window of user-specified length is moved along all possible diagonals. Each position in the window corresponds to a pair-wise score from the scoring matrix. The score for the entire window is the sum of the scores for individual positions within it. If the window score is above the user-defined threshold, then a line is plotted on the dotplot corresponding to the window.

Usage

Command line arguments

Input file format

Any 2 sequence USAs of the same type (DNA or protein).

Output file format

An image is output to the requested graphics device.

Data files

It uses the specified matrix substitution file to compare the two sequences.

For protein sequences EBLOSUM62 is used for the substitution matrix. For nucleotide sequence, EDNAFULL is used. Others can be specified.

Notes

Where the two sequences have substantial regions of similarity, the dots line up to form diagonal lines. It is possible to see at a glance such local regions of similarity. It is also easy to see other features such as repeats (which form parallel diagonal lines), and insertions or deletions (which form breaks or discontinuities in the diagonal lines).

References

None.

Warnings

None.

Diagnostic Error Messages

None.

Exit status

It always exits with status 0.

Known bugs

None. dottup, by comparison, has no threshold, using a wordmatch-style method. dottup is less sensitive, but substantially faster than dotmatcher.

Function

Description

Algorithm

Usage

Command line arguments

Input file format

Output file format

Data files

Notes

References

Warnings

Diagnostic Error Messages

Exit status

Known bugs

Author(s)

History

Target users

Comments