Genome Information Research Center, Osaka Univ.

GenomeNet FASTA e-mail server


                FASTA E-Mail Server on GenomeNet


The FASTA program was developed by Bill Pearson and David Lipman for
nucleic acid and protein sequence similarity searching of databases and
made availabe for use on the GenomeNet E-mail server.  If you use FASTA
as a research tool, we ask that the following reference be cited in your
paper:
 
        Pearson, W.R. and Lipman, D.J. 1988.  Improved Tools for 
        Biological Sequence Comparison.  Proc. Natl. Acad. Sci., 
        85: 2444-2448.
  
The GenomeNet FASTA server allows you to send a specially formatted mail
message containing the nucleic acid or protein query sequence to the
FASTA server at the Supercomputer Laboratory, Institute for Chemical
Research, Kyoto University.  A FASTA sequence similarity search is then
performed against the specified database using the FASTA program.  The
results of the FASTA search will be returned to you in a reply mail
message.


Accessing the FASTA program

To access the program, send an electronic mail message containing the 
formatted query sequence (as described below) to the following Internet 
address:

       fasta@genome.ad.jp

If you are not on Internet, you may need to change the format of the 
address.  Consult your systems manager to determine the correct
address format.


Obtaining Help

If you would like to receive instructions on using the FASTA program,
send a mail message to the address above containing the word "help" on
a single line.  This document is then returned to you in a mail message.


Databases for use with FASTA

The following databases are currently available for FASTA searches:

    Designator                  Database
    ----------                  --------
    nr-nt                       Non-redundant nucleic acid sequence
                                database constructed from genbank,
                                embl,genbank-upd

    genbank                     GenBank nucleic acid sequence database
                                (Latest quarterly release)

    genbank-upd                 Cumulative daily updates of GenBank
                                since latest quarterly release

    genbank/primate             GenBank subdivisions
    genbank/rodent              (Latest quarterly release)
    genbank/other_mammalian
    genbank/other_vertebrate
    genbank/invertebrate
    genbank/plant
    genbank/bacterial
    genbank/structural_rna
    genbank/viral
    genbank/phage
    genbank/synthetic
    genbank/unannotated
    genbank/est
    genbank/patent

    embl                        EMBL nucleic acid sequence database
                                (Latest quarterly release)

    embl/bacteriophage          EMBL subdivisions
    embl/fungi                  (Latest quarterly release)
    embl/invertebrate
    embl/organelle
    embl/other_mammalian
    embl/other_vertebrate
    embl/plant
    embl/primate
    embl/prokaryote
    embl/rodent
    embl/synthetic
    embl/unannotated
    embl/viral

    nr-aa                       Non-redundant protein sequence database
				constructed from swissprot,pir,prf,
                                genpept,genpept-upd

    swissprot                   SWISS-PROT protein sequence database

    pir                         PIR protein sequence database

    prf                         PRF protein sequence database

    pdbstr                      Protein Data Bank reorganized as
				a sequence database


Formatting a Query

Queries consist of a mail message with search parameters identifying
the database to be searched, values related to the search and the
query sequence to be used in the search.  The mail message has two
mandatory lines, three optional lines and a line identifying the query
sequence as descibed below.  These lines are typed into the body of
the mail message in the order shown below:

 Search 
Parameter      Mandatory      Explanation
---------      ---------      -----------
PROGRAM          No           Specify 'tfasta' to compare a protein
                              sequence against a nucleic acid sequence
                              database translating in all 6 or 3 frames.
                              If omitted, 'fasta' is assumed.

DATALIB          Yes          This line specifies the database to be 
                              searched (as described above) for the query
                              sequence and must be included in the message.  

KTUP             No           This line identifies the Ktup value which 
                              specifies the sensitivity of the search. 
                              Values range between 3 and 6 for nucleic acid
                              searches and between 1 and 2 for protein 
                              searches. Lower values specify more sensitive 
                              searches but require more time to complete.  
                              The default value for nucleic acids is 6 and
                              2 for proteins.
________________________________________