notseq

Function

Description

notseq writes to file a subset of an input stream of sequences. The list of sequence names or accession numbers to exclude from output is provided as a string. Optionally, the excluded sequences are also written to file.

Usage

Command line arguments


Input file format

notseq reads normal sequence USAs.

The names (or accession numbers) of the sequences to be excluded can be entered as a file of such names by specifying an '@' followed by the name of the file containing the sequence names. For example: '@names.dat'.

The names or accession numbers of the sequences to be excluded are not standard EMBOSS USAs. Only the ID name or accession number can be specified, you cannot specify the sequences as 'database:ID', 'file:accession', 'format::file', etc.

Output file format

notseq writes normal a sequence file.

Data files

None.

Notes

notseq was written for the case where a file containing several sequences was being used as a small database, but some of the sequences were no longer required. notseq splits the input sequences into those that you wish to keep and those you wish to exclude.

Wildcarded names and accession numbers may be specified in the input string by using *.

The names (or accession numbers) of the sequences to be excluded can be entered as a file of such names by specifying an @ followed by the name of the file containing the sequence names. For example: @names.dat.

References

None.

Warnings

The input string does not support the full USA syntax; only the name or accession number should be specified, not the database or file that these entries may occur in.

Diagnostic Error Messages

If no matches are found to any of the specified sequence names, the message "This is a warning: No matches found." is displayed.

Exit status

It exits with a status of 0 unless no matches are found to any of the input sequences name, in which case it exits with a status of -1.

Known bugs

None.

Author(s)

History

Target users

Comments