polydot

Function

Description

polydot generates a dotplot for every pair of sequences in an all-against-all comparison of a set of sequences, and renders the dotplots on a single diagram. A dotplot is an intuitive graphical representation of the regions of similarity between two sequences. For each pair, all positions in the first sequence are compared with all positions in the second sequence, searching for exact matches between sequence regions ("words"). The two sequences form the axes of the rectangular dotplot. Wherever a word from one sequence exactly matches a word from the other, a dot is plotted. The word size is specified by the user. Optionally, the sequence and location of every exact matching region may be written to a feature file.

Algorithm

All words of the specified size from the first sequence are compared with all words from the second sequence, and exact matches are identified. Each exact word match is plotted as a short diagonal line on the dotplot, so any local region of identity appears as a diagonal.
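
The word comparison described above can be sketched in a few lines of Python. This is only an illustration of exact word matching under simple assumptions, not polydot's actual implementation: the function name word_matches and its parameters are hypothetical, positions are 0-based, and the sequences are plain strings.

```python
from collections import defaultdict


def word_matches(seq_a, seq_b, wordsize):
    """Return (i, j) pairs where the word starting at seq_a[i] equals
    the word starting at seq_b[j]. Positions are 0-based (an assumption
    made for this sketch)."""
    if wordsize < 1:
        raise ValueError("wordsize must be at least 1")

    # Index every word of seq_b by its start positions.
    index = defaultdict(list)
    for j in range(len(seq_b) - wordsize + 1):
        index[seq_b[j:j + wordsize]].append(j)

    # Look up every word of seq_a in that index.
    matches = []
    for i in range(len(seq_a) - wordsize + 1):
        word = seq_a[i:i + wordsize]
        for j in index.get(word, ()):
            matches.append((i, j))
    return matches


hits = word_matches("ACGTAC", "CGTA", 3)
print(hits)
```

Each returned pair corresponds to one plotted word match; consecutive pairs such as (1, 0) and (2, 1) lie on the same diagonal.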

Usage

Command line arguments


Input file format

polydot reads in a set of nucleic or protein sequences.

The sequences may or may not be aligned.

Output file format

A graphical image is displayed on the specified graphics device.

Data files

None.

Notes

Where the two sequences have substantial regions of identity, longer diagonal lines appear in the plot, so such local regions of identity can be seen at a glance. Other features are also easy to see, such as repeats (which form parallel diagonal lines) and insertions or deletions (which form breaks or discontinuities in the diagonal lines).
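
To make the point about repeats concrete, the following Python sketch merges exact word hits into maximal diagonal segments. It is an illustration only, under the assumption of 0-based positions and plain string sequences; the helper names exact_hits and diagonal_runs are hypothetical and do not correspond to anything in polydot itself.

```python
def exact_hits(seq_a, seq_b, wordsize):
    """All (i, j) where the words starting at seq_a[i] and seq_b[j] match."""
    return [
        (i, j)
        for i in range(len(seq_a) - wordsize + 1)
        for j in range(len(seq_b) - wordsize + 1)
        if seq_a[i:i + wordsize] == seq_b[j:j + wordsize]
    ]


def diagonal_runs(hits, wordsize):
    """Merge word hits into maximal diagonals.

    Returns a list of (start_i, start_j, length_in_residues)."""
    hit_set = set(hits)
    runs = []
    for i, j in sorted(hit_set):
        if (i - 1, j - 1) in hit_set:
            continue  # this hit continues a run that started earlier
        k = 0
        while (i + k + 1, j + k + 1) in hit_set:
            k += 1
        # k + 1 overlapping words cover k + wordsize residues
        runs.append((i, j, k + wordsize))
    return runs


# "ACGT" occurs twice in the first sequence, once in the second.
runs = diagonal_runs(exact_hits("ACGTACGT", "ACGT", 2), 2)
print(runs)
```

In this example the repeated unit yields two diagonals with the same start on the second sequence but different starts on the first, which is the parallel-line pattern described above.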

References

None.

Warnings

None.

Diagnostic Error Messages

None.

Exit status

0 if successful.

Known bugs

None.

Author(s)

History

Target users

Comments