Relay-Version: version B 2.10 5/3/83; site utzoo.UUCP Path: utzoo!utgpu!water!watmath!uunet!ig!daemon From: daemon@ig.UUCP Newsgroups: bionet.molbio.seqnet Subject: SEQNET Bulletin Message-ID: <3985@ig.ig.com> Date: Mon, 9-Nov-87 18:37:15 EST Article-I.D.: ig.3985 Posted: Mon Nov 9 18:37:15 1987 Date-Received: Wed, 11-Nov-87 00:55:06 EST Sender: daemon@presto.ig.com Lines: 266 From: MJB1@VMS-SUPP.CAM.AC.UK Bulletin_# 45 From: MJB1 7 November 1987 Please mail you sequences to DATASU From: MJB1 Date: 7 November 1987 Subject: Please mail you sequences to DATASUBS%EMBL@UK.AC.RL.EARN EMBL/GenBank SEQUENCE DATA SUBMISSION FORM This form solicits the information needed for a nucleotide and/or amino acid sequence data bank entry. It can be filled in using any text editor or printed and filled in by hand. By completing and returning it to us promptly you will help us enter your data in the appropriate database accurately and rapidly. Where appropriate, the data will be transmitted to the SWISS-PROT and the PIR protein sequence databases. Please answer all questions which apply to your data. If you are reporting two or more non-contiguous sequences, please copy and fill out this form for each additional sequence. Then send us (1) this form, (2) the sequence data, and (3) a pre- or reprint of your manuscript. You can send these materials (a) electronically via computer network, (b) on magnetic tape, or (c) on a floppy diskette. Please do NOT send the sequence as a computer printout, as this creates considerable extra work for us. Information about formats for submitted data is included at the end of this form. our mailing address: EMBL Data Library Submissions Postfach 10.2209 D-6900 Heidelberg West Germany telephone: (06221) 387 257 network address: datasubs@embl.earn (for data submissions) datalib@embl.earn (for general inquiries) Please include in your submission any additional sequence data which may not reported in your manuscript but which has been reliably determined (for example, introns or flanking sequences). When we receive your sequence data and the completed form we will assign the sequence an accession number, which serves as a reference that permanently identifies this sequence (or set of sequences) in the database. We will inform you what number your data has been given and we recommend that you cite this number when referring to the corresponding sequence in a publication. A large number of journals are actively supporting the practice of citing sequences by accession number. If new data become available which would make the database entry more informative (e.g., function of the gene product or location of important sites within the sequence), or if you discover errors in the sequence, we urge you to contact us so that we can update your entry. =============================================================================== Your name ------------------------------------------------------------------------------ Organization ------------------------------------------------------------------------------ Address ------------------------------------------------------------------------------ Telephone Computer network address ============================================================================== On what medium and in what format are you sending your sequence data? (see end of this form for instructions) [ ] magnetic tape: density [ ] 800 [ ] 1600 [ ] 6250 character code [ ] ASCII [ ] EBCDIC record length ______________ blocksize ______________ label type _________________ [ ] electronic mail [ ] diskette: computer ________________ operating system _______________ [ ] printed copy =============================================================================== CITATION INFORMATION =============================================================================== These data will be published by authors ______________________________________________________________________________ title of paper ______________________________________________________________________________ journal ------------------------------------------------------------------------------ vol, pages,year (if known) ============================================================================== Does the sequence which you are sending with this form include data that does not appear in the above journal article? [ ] yes, beginning at base number _____ and ending at base _____ [ ] no How should this data be cited in the database? authors ------------------------------------------------------------------------------ title of paper (if applicable) ------------------------------------------------------------------------------ journal (if applicable) vol,pages,year ============================================================================== Please list references to papers and/or database entries which report sequences overlapping with data submitted here. ------------------------------------------------------------------------------ first author journal, vol., pages, year OR database, accession number ------------------------------------------------------------------------------ ------------------------------------------------------------------------------ ============================================================================== DESCRIPTION OF SEQUENCED SEGMENT Where appropriate, please use standard nomenclature or conventions. NOT ALL QUESTIONS ARE RELEVANT TO ALL SEQUENCES. =============================================================================== What kind of molecule does this sequence represent? [ ] genomic DNA [ ] tRNA [ ] organelle DNA (please specify) ______________ [ ] rRNA [ ] cDNA [ ] snRNA [ ] other nucleic acid __________________________ [ ] scRNA [ ] peptide sequence assembled by [ ] overlap of sequenced fragments [ ] homology with related protein sequence [ ] other (please specify) _____________________ ------------------------------------------------------------------------------- length of sequence [ ] bp or [ ] amino acid residues ------------------------------------------------------------------------------- library (type, name) clone(s) ------------------------------------------------------------------------------- genomic location/map position gene name(s) (e.g., lacZ) ------------------------------------------------------------------------------- gene product name(s) (e.g., beta-D-galactosidase) and Enyme Commission number ------------------------------------------------------------------------------- source organism (Latin name) (e.g., Mus musculus) ------------------------------------------------------------------------------- strain (e.g., BALB/c) haplotype ------------------------------------------------------------------------------- tissue/cell line source [ ] germ line [ ] rearranged =============================================================================== FEATURES OF THE SEQUENCE Please list below the first and last base/residue numbers of all significant features identified within the sequence. In the column marked "id," indicate the method by which the feature was identified (E = experimentally; S = by similarity with another sequence; C = match to an established consensus sequence). For nucleotide sequences, indicate (by writing an x in the column marked "comp") if the feature is encoded by the strand complementary to that reported here. Some examples of significant features are: - regulatory signals (e.g., promoters, attenuators, enhancers) - transcribed regions (mRNA, rRNA, tRNA, etc.) - regions subject to post-transcriptional modification (e.g., introns, modified bases) - translated regions - extent of signal peptide, prepropeptide, propeptide, mature peptide - sites subject to post-translational modification (glycosylation, phosporylation) - other domains/sites of interest (e.g., extracellular domain, DNA-binding domain, active site, inhibitory site) - sites involved in bonding (disulfide, thiolester, intrachain, interchain) - regions of alpha helix or beta pleated sheet ================================================================================ Numbering for features on the sequence you are submitting to us [ ] starts at 1 OR [ ] starts at _______ Does the numbering match that in your manuscript? [ ] yes [ ] no -------------------------------------------------------------------------------- feature from to id comp ------------------------------------------- --------- --------- ----- --------- ------------------------------------------- --------- --------- ----- --------- ------------------------------------------- --------- --------- ----- --------- ------------------------------------------- --------- --------- ----- --------- ------------------------------------------- --------- --------- ----- --------- ------------------------------------------- --------- --------- ----- --------- ------------------------------------------- --------- --------- ----- --------- ------------------------------------------- --------- --------- ----- --------- ------------------------------------------- --------- --------- ----- --------- ------------------------------------------- --------- --------- ----- --------- ------------------------------------------- --------- --------- ----- --------- ================================================================================ KEYWORDS =============================================================================== Describe the properties of the sequence in terms of its associated phenotype(s), the biological/enzymatic activity of its product and the general functional classification of the gene and/or gene product. Also indicate macromolecules which the gene product can bind (e.g., DNA; Ca++; other proteins), subcellular localization of the gene product and any other information you think is relevant. EXAMPLE (for the viral erbB gene sequence): transforming capacity; EGF receptor-related; transmembrane protein; tyrosine kinase; oncogene. ------------------------------------------------------------------------------- =============================================================================== FORMATS FOR SUBMITTED DATA We can accept data submitted in any of the following formats, listed in order of preference. PLEASE SEND THE SEQUENCE DATA IN COMPUTER-READABLE FORM rather than as a printout. (1) Electronic file transfer: files can be sent via computer network to DATASUBS@EMBL.EARN. This address can be reached via various gateways from Arpanet, Usenet, JANET, JUNET, etc. Ask your local network expert how to send to BITNET/EARN or phone us for help at (06221) 387 257. (2) Magnetic tapes: 9-track only (fixed-length records preferred); 800, 1600 or 6250 bpi (any blocksize); ASCII or EBCDIC character codes; any label type or unlabelled. (3) Floppy disks: we can read 5-1/4" diskettes from CP/M or MS-DOS systems, as well as Macintosh diskettes. Phone us if you have questions about your specific format. Whatever format you choose, we would appreciate receiving the sequence data in a form which conforms to the following conventions: Each distinct sequence should be listed separately using the same number of bases/residues per line and its length in bases/residues clearly indicated. Nucleotide sequences should be shown in the 5' to 3' direction. If both strands are listed, the top strand should be 5' to 3'. Enumeration of nucleic acid sequences should begin with a "1" and ascend in the direction 5' to 3'. Amino acid sequences should be listed using the one-letter code. The code for representing the sequence characters should conform to the IUPAC-IUB standards, which are described in the following references. for nucleic acids: Eur. J. Biochem. 150, 1-5, 1985 for amino acids: J. Biol. Chem. 243, 3557-3559, 1968 Eur. J. Biochem. 5, 151-153, 1968