polydot generates a dotplot for each of an all-against-all comparison of a set of sequences. The dotplots are rendered on a single diagram. The dotplot is an intuitive graphical representation of the regions of similarity between two sequences. All positions from the first input sequence are compared with all positions from the second input sequence searching for exact matches between sequence regions ("words"). The two sequences are the axes of the rectangular dotplot. Wherever there is "similarity" between a word from each sequences a dot is plotted. The wordsize is specified by the user. Optionally, information on the sequence and location of all the exact matching regions may be written to file as a feature file.
All sequence words of the specified size from the first input sequence are compared with all words from the second input sequence and exact matches identified. A line is plotted on the dotplot corresponding to any exact matching words. Thus any local regions of identity correspond to diagonals in the dotplot.
|
The sequences may or may not be aligned.
Where the two sequences have substantial regions of identity, longer diagonal lines appear in the plot. It is possible to see at a glance such local regions of identity. It is also easy to see other features such as repeats (which form parallel diagonal lines), and insertions or deletions (which form breaks or discontinuities in the diagonal lines).