Relay-Version: version B 2.10 5/3/83; site utzoo.UUCP
Path: utzoo!utgpu!water!watmath!uunet!ig!daemon
From: daemon@ig.UUCP
Newsgroups: bionet.molbio.seqnet
Subject: SEQNET Bulletin
Message-ID: <3985@ig.ig.com>
Date: Mon, 9-Nov-87 18:37:15 EST
Article-I.D.: ig.3985
Posted: Mon Nov  9 18:37:15 1987
Date-Received: Wed, 11-Nov-87 00:55:06 EST
Sender: daemon@presto.ig.com
Lines: 266

From: MJB1@VMS-SUPP.CAM.AC.UK

Bulletin_# 45  From:  MJB1  7 November 1987  Please mail you sequences to DATASU
From: MJB1
Date: 7 November 1987
Subject: Please mail you sequences to DATASUBS%EMBL@UK.AC.RL.EARN

                EMBL/GenBank SEQUENCE DATA SUBMISSION FORM


This form solicits the information needed for a nucleotide and/or amino acid
sequence data bank entry.  It can be filled in using any text editor or printed
and filled in by hand.  By completing and returning it to us promptly you will
help us enter your data in the appropriate database accurately and rapidly.
Where appropriate, the data will be transmitted to the SWISS-PROT and the PIR
protein sequence databases.

Please answer all questions which apply to your data.  If you are reporting two
or more non-contiguous sequences, please copy and fill out this form for each
additional sequence.  Then send us (1) this form, (2) the sequence data, and
(3) a pre- or reprint of your manuscript.  You can send these materials (a)
electronically via computer network, (b) on magnetic tape, or (c) on a floppy
diskette.  Please do NOT send the sequence as a computer printout, as this
creates considerable extra work for us.  Information about formats for
submitted data is included at the end of this form.

   our mailing address:    EMBL Data Library Submissions
                           Postfach 10.2209
                           D-6900 Heidelberg
                           West Germany

   telephone:              (06221) 387 257

   network address:        datasubs@embl.earn  (for data submissions)
                            datalib@embl.earn  (for general inquiries)

Please include in your submission any additional sequence data which may not
reported in your manuscript but which has been reliably determined (for
example, introns or flanking sequences).

When we receive your sequence data and the completed form we will assign the
sequence an accession number, which serves as a reference that permanently
identifies this sequence (or set of sequences) in the database.  We will inform
you what number your data has been given and we recommend that you cite this
number when referring to the corresponding sequence in a publication.  A large
number of journals are actively supporting the practice of citing sequences by
accession number.

If new data become available which would make the database entry more
informative (e.g., function of the gene product or location of important sites
within the sequence), or if you discover errors in the sequence, we urge you to
contact us so that we can update your entry.


===============================================================================
  Your name
 ------------------------------------------------------------------------------
  Organization
 ------------------------------------------------------------------------------
  Address


 ------------------------------------------------------------------------------
  Telephone                     Computer network address
 ==============================================================================
  On what medium and in what format are you sending your sequence data? (see
  end of this form for instructions)
      [ ] magnetic tape: density   [ ] 800  [ ] 1600   [ ] 6250
                         character code  [ ] ASCII  [ ] EBCDIC
                         record length ______________ blocksize ______________
                         label type _________________
      [ ] electronic mail
      [ ] diskette: computer ________________ operating system _______________
      [ ] printed copy
===============================================================================


CITATION INFORMATION

===============================================================================
  These data will be published by
  authors
 ______________________________________________________________________________
  title of paper
 ______________________________________________________________________________
  journal
 ------------------------------------------------------------------------------
  vol, pages,year (if known)
 ==============================================================================
  Does the sequence which you are sending with this form include data that
  does not appear in the above journal article?
     [ ] yes, beginning at base number _____ and ending at base _____   [ ] no
  How should this data be cited in the database?
     authors
 ------------------------------------------------------------------------------
     title of paper (if applicable)
 ------------------------------------------------------------------------------
     journal (if applicable)                       vol,pages,year
 ==============================================================================
  Please list references to papers and/or database entries which report
  sequences overlapping with data submitted here.
 ------------------------------------------------------------------------------
     first author     journal, vol., pages, year OR database, accession number
 ------------------------------------------------------------------------------

 ------------------------------------------------------------------------------

 ==============================================================================


DESCRIPTION OF SEQUENCED SEGMENT

Where appropriate, please use standard nomenclature or conventions. NOT ALL
QUESTIONS ARE RELEVANT TO ALL SEQUENCES.
===============================================================================
 What kind of molecule does this sequence represent?

  [ ] genomic DNA                                      [ ] tRNA
  [ ] organelle DNA (please specify) ______________    [ ] rRNA
  [ ] cDNA                                             [ ] snRNA
  [ ] other nucleic acid __________________________    [ ] scRNA

  [ ] peptide
      sequence assembled by  [ ] overlap of sequenced fragments
                             [ ] homology with related protein sequence
                             [ ] other (please specify) _____________________
-------------------------------------------------------------------------------
 length of sequence          [ ] bp   or   [ ] amino acid residues
-------------------------------------------------------------------------------
 library (type, name)                      clone(s)
-------------------------------------------------------------------------------
 genomic location/map position             gene name(s) (e.g., lacZ)
-------------------------------------------------------------------------------
 gene product name(s) (e.g., beta-D-galactosidase) and Enyme Commission number

-------------------------------------------------------------------------------
 source organism (Latin name) (e.g., Mus musculus)
-------------------------------------------------------------------------------
 strain (e.g., BALB/c)                     haplotype
-------------------------------------------------------------------------------
 tissue/cell line source                   [ ] germ line     [ ] rearranged
===============================================================================


FEATURES OF THE SEQUENCE

Please list below the first and last base/residue numbers of all significant
features identified within the sequence.  In the column marked "id," indicate
the method by which the feature was identified (E = experimentally; S = by
similarity with another sequence; C = match to an established consensus
sequence).  For nucleotide sequences, indicate (by writing an x in the column
marked "comp") if the feature is encoded by the strand complementary to that
reported here.

Some examples of significant features are:

  - regulatory signals (e.g., promoters, attenuators, enhancers)
  - transcribed regions (mRNA, rRNA, tRNA, etc.)
  - regions subject to post-transcriptional modification (e.g., introns,
    modified bases)
  - translated regions
  - extent of signal peptide, prepropeptide, propeptide, mature peptide
  - sites subject to post-translational modification (glycosylation,
    phosporylation)
  - other domains/sites of interest (e.g., extracellular domain, DNA-binding
    domain, active site, inhibitory site)
  - sites involved in bonding (disulfide, thiolester, intrachain, interchain)
  - regions of alpha helix or beta pleated sheet

================================================================================
   Numbering for features on the sequence you are submitting to us
       [ ] starts at 1   OR    [ ] starts at _______
   Does the numbering match that in your manuscript?   [ ] yes   [ ] no
--------------------------------------------------------------------------------
                   feature                    from       to       id     comp
------------------------------------------- --------- ---------  ----- ---------

------------------------------------------- --------- ---------  ----- ---------

------------------------------------------- --------- ---------  ----- ---------

------------------------------------------- --------- ---------  ----- ---------

------------------------------------------- --------- ---------  ----- ---------

------------------------------------------- --------- ---------  ----- ---------

------------------------------------------- --------- ---------  ----- ---------

------------------------------------------- --------- ---------  ----- ---------

------------------------------------------- --------- ---------  ----- ---------

------------------------------------------- --------- ---------  ----- ---------

------------------------------------------- --------- ---------  ----- ---------

================================================================================


KEYWORDS

===============================================================================
Describe the properties of the sequence in terms of its associated
phenotype(s), the biological/enzymatic activity of its product and the general
functional classification of the gene and/or gene product.  Also indicate
macromolecules which the gene product can bind (e.g., DNA; Ca++; other
proteins), subcellular localization of the gene product and any other
information you think is relevant.

EXAMPLE (for the viral erbB gene sequence): transforming capacity;
EGF receptor-related; transmembrane protein; tyrosine kinase; oncogene.
-------------------------------------------------------------------------------


===============================================================================


FORMATS FOR SUBMITTED DATA

We can accept data submitted in any of the following formats, listed in order
of preference.  PLEASE SEND THE SEQUENCE DATA IN COMPUTER-READABLE FORM rather
than as a printout.

(1) Electronic file transfer: files can be sent via computer network to
    DATASUBS@EMBL.EARN.  This address can be reached via various gateways from
    Arpanet, Usenet, JANET, JUNET, etc.  Ask your local network expert how to
    send to BITNET/EARN or phone us for help at (06221) 387 257.

(2) Magnetic tapes: 9-track only (fixed-length records preferred); 800, 1600 or
    6250 bpi (any blocksize); ASCII or EBCDIC character codes; any label type
    or unlabelled.

(3) Floppy disks: we can read 5-1/4" diskettes from CP/M or MS-DOS
    systems, as well as Macintosh diskettes.  Phone us if you have questions
    about your specific format.


Whatever format you choose, we would appreciate receiving the sequence data in
a form which conforms to the following conventions:

    Each distinct sequence should be listed separately using the same number
    of bases/residues per line and its length in bases/residues clearly
    indicated.

    Nucleotide sequences should be shown in the 5' to 3' direction.  If both
    strands are listed, the top strand should be 5' to 3'.

    Enumeration of nucleic acid sequences should begin with a "1" and ascend
    in the direction 5' to 3'.

    Amino acid sequences should be listed using the one-letter code.

The code for representing the sequence characters should conform to the
IUPAC-IUB standards, which are described in the following references.

    for nucleic acids:  Eur. J. Biochem. 150, 1-5, 1985

    for amino acids:    J. Biol. Chem. 243, 3557-3559, 1968
                        Eur. J. Biochem. 5, 151-153, 1968