aaindexextract

Function

Description

aaindexextract extracts amino acid property data from the AAINDEX database (see references 1, 2 and 3). One file for every amino acid property is created in the EMBOSS data directory data/AAINDEX. Each file corresponds to a single entry in the file aindex1 of the AAINDEX database.

The aaindexextract output files files may be used by any program that uses amino acid property data. For example, pepwindow and pepwindowall will take any one of these files if their -data qualifier is set to an appropriate file name. By default however these programs will use the standard EMBOSS data file Enakai.dat.

Usage

Command line arguments


Input file format

The AAINDEX database file 'aaindex1' can be downloaded from the AAINDEX site: ftp://ftp.genome.ad.jp/pub/db/genomenet/aaindex/

Output file format

One file for every entry in the AAINDEX database is created in the EMBOSS standard 'data/AAINDEX' directory. For example, the file 'kytj820101':

Data files

None.

Notes

AAINDEX is a database of biochemical and physicochemical properties of amino acids and pairs of amino acids. aaindex1 for properties of each of the 20 single amino acids, aaindex2 for amino acid mutation matrices and aaindex3 for the amino-acid pair-wise contact potentials. All data are derived from the published literature. The file aaindex1 is required by aaindexextract and can be downloaded from the AAINDEX site: ftp://ftp.genome.ad.jp/pub/db/genomenet/aaindex/aaindex1.

aaindexextract will write to the EMBOSS data directory data/AAINDEX. If you cannot find the output files then it is likely the data directory has been set to somewhere other than expected. To set explicitly the location of the EMBOSS data directory add the following to your emboss.default file:

ENV EMBOSS_DATA /path/to/the/EMBOSS/data/directory

Where /path/to/the/EMBOSS/data/directory is the location of your EMBOSS data as unpacked from the distribution file.

References

  1. Nakai, K., Kidera, A., and Kanehisa, M.; Cluster analysis of amino acid indices for prediction of protein structure and function. Protein Eng. 2, 93-100 (1988). [UI:89221001]
  2. Tomii, K. and Kanehisa, M.; Analysis of amino acid indices and mutation matrices for sequence comparison and structure prediction of proteins. Protein Eng. 9, 27-36 (1996). [UI:96272030]
  3. Kawashima, S., Ogata, H., and Kanehisa, M.; AAindex: amino acid index database. Nucleic Acids Res. 27, 368-369 (1999). [UI:99063742]

Warnings

None.

Diagnostic Error Messages

None.

Exit status

It always exits with status 0.

Known bugs

None.

The programs pepwindow and pepwindowall normally use the standard EMBOSS data file 'Enakai.dat' by default, but you can set their '-data' qualifier to use any of the files produced by aaextractindex.

Author(s)

History

Target users

Comments