notseq writes to file a subset of an input stream of sequences. The list of sequence names or accession numbers to exclude from output is provided as a string. Optionally, the excluded sequences are also written to file.
|
The names (or accession numbers) of the sequences to be excluded can be entered as a file of such names by specifying an '@' followed by the name of the file containing the sequence names. For example: '@names.dat'.
The names or accession numbers of the sequences to be excluded are not standard EMBOSS USAs. Only the ID name or accession number can be specified, you cannot specify the sequences as 'database:ID', 'file:accession', 'format::file', etc.
notseq was written for the case where a file containing several sequences was being used as a small database, but some of the sequences were no longer required. notseq splits the input sequences into those that you wish to keep and those you wish to exclude.
Wildcarded names and accession numbers may be specified in the input string by using *.
The names (or accession numbers) of the sequences to be excluded can be entered as a file of such names by specifying an @ followed by the name of the file containing the sequence names. For example: @names.dat.
The input string does not support the full USA syntax; only the name or accession number should be specified, not the database or file that these entries may occur in.