maskfeat reads a sequence with associated features and writes the same information to file but with features of the specified type omitted (masked). Sequence regions corresponding to the masked-out features are masked such that characters in those region are (optionally) converted to lower case and / or (optionally) replaced with the specified mask character.
|
Formats such as EMBL or SWISSPROT include features in the sequence annotation. Otherwise, features they may be supplied in a separate file and specified with the qualifier -ufo or -fopenfile / -fformat.
The feature type to mask is by default repeat*, i.e. any type whose name starts with repeat will be omitted from the output file. Any other type of feature(s) may be specified with the -type qualifier. To specify more than one type of feature, separate their names with spaces or commas. The names may be wild-carded with asterisks * to find gruops of feature types sharing a common part of their names.
If you are unsure of the names of feature types in use, please consult http://www3.ebi.ac.uk/Services/WebFeat/ (EMBL feature types) and see Appendix A of the Swissprot user manual in http://www.expasy.ch/txt/userman.txt (Swissprot feature types).