twofeat

Function

Description

twofeat finds neighbouring pairs of specified features in one or more input sequences. Each feature may be specified by type, name, sense, score, tag/value pairs etc. Their relationship, e.g. their sequence separation or overlap, relative sense and order may also be specified. It writes a standard EMBOSS report file with any such feature pairs identified. By default each pair is written as a single feature but (optionally) they are written in their original form.

Algorithm

For each of the pair of features, you can specify its type name, its sense, its score and any tag/value pairs, amongst other things.

You can then specify the type of relationship that the two features should have as follows:

Minimum and maximum distance between each feature.

Distance measurement (measure from the nearest ends of the two features, from the left ends, from right ends, from the furthest ends.)

Type of overlap (any type of overlap or no overlap is allowed, Overlap required, No overlaps are allowed, Overlap required but one feature must not be completely within the other, Feature A must be completely enclosed within feature B, Feature B must be completely enclosed within feature A).

Feature sense (any sense, in the same sense or in opposite senses)

Feature order (any order, Feature A then feature B, Feature B then feature A)

The program proceeds as follows. For eachs sequence: Identify the features that match the criteria for Feature A Identify the features that match the criteria for Feature B Compare all pairs of features If they satisfy the requirements output them to the report file

By default the resulting pairs of features found are then written to a report file as a single feature from the first postion of the left-most feature to the last position of the right-most feature. You can modify the output to report the pairs of features with no changes made to them.

Usage

Command line arguments


Input file format

twofeat reads any normal sequence USAs.

Output file format

twofeat outputs a report format file. The default format is table

Data files

None.

Notes

twofeat is intended for use as a simple data-mining tool to enable you to look for instances of pairs of features that occur near to each other in the same sequence entry.

References

None.

Warnings

None.

Diagnostic Error Messages

None.

Exit status

It always exits with status 0.

Known bugs

There is a slight memory leak that must be fixed at some time. This does no affect the results.

Author(s)

History

Target users

Comments