twofeat finds neighbouring pairs of specified features in one or more input sequences. Each feature may be specified by type, name, sense, score, tag/value pairs etc. Their relationship, e.g. their sequence separation or overlap, relative sense and order may also be specified. It writes a standard EMBOSS report file with any such feature pairs identified. By default each pair is written as a single feature but (optionally) they are written in their original form.
For each of the pair of features, you can specify its type name, its sense, its score and any tag/value pairs, amongst other things.
You can then specify the type of relationship that the two features should have as follows:
Minimum and maximum distance between each feature.
Distance measurement (measure from the nearest ends of the two features, from the left ends, from right ends, from the furthest ends.)
Type of overlap (any type of overlap or no overlap is allowed, Overlap required, No overlaps are allowed, Overlap required but one feature must not be completely within the other, Feature A must be completely enclosed within feature B, Feature B must be completely enclosed within feature A).
Feature sense (any sense, in the same sense or in opposite senses)
Feature order (any order, Feature A then feature B, Feature B then feature A)
The program proceeds as follows. For eachs sequence: Identify the features that match the criteria for Feature A Identify the features that match the criteria for Feature B Compare all pairs of features If they satisfy the requirements output them to the report file
By default the resulting pairs of features found are then written to a report file as a single feature from the first postion of the left-most feature to the last position of the right-most feature. You can modify the output to report the pairs of features with no changes made to them.
|
twofeat is intended for use as a simple data-mining tool to enable you to look for instances of pairs of features that occur near to each other in the same sequence entry.