Path: utzoo!attcan!uunet!munnari.oz.au!uokmax!apple!bionet!root
From: GenBank-Updates@genbank.bio.net
Newsgroups: bionet.molbio.genbank.updates
Subject: Database Update
Message-ID:
Date: 19 Jul 90 12:00:08 GMT
Sender: root@genbank.BIO.NET
Distribution: bionet
Lines: 1823
Approved: lear@genbank.bio.net
Checksum: 10687 113

LOCUS       HUMPPPB1A    3215 bp ss-mRNA             PRI       19-JUL-1990
DEFINITION  Human protein phosphotyrosyl phosphatase 1B (PTP1B) mRNA,
            complete cds.
ACCESSION   M33689
KEYWORDS    protein phosphotyrosyl phosphatase.
SOURCE      Human placenta, cDNA to mRNA, (library of Clontech), clone
            lambda-16-1.
  ORGANISM  Homo sapiens
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
            Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE   1  (bases 1 to 3215)
  AUTHORS   Brown-Shimer,S., Johnson,K.A., Lawrence,J.B., Johnson,C.,
            Bruskin,A., Green,N.R. and Hill,D.E.
  TITLE     Molecular cloning and chromosome mapping of the human gene
            encoding protein phosphotyrosyl phosphatase 1B
  JOURNAL   Proc. Natl. Acad. Sci. U.S.A. (1990) In press
  STANDARD  full staff_entry
COMMENT     Draft entry and computer-readable sequence for [1] kindly
            submitted by D.E.Hill, 13-APR-1990.
FEATURES       from  to/span    description
    pept         73     1380    protein phosphotyrosyl phosphatase 1B
                                (EC 3.1.3.48)
BASE COUNT      818 a    828 c    801 g    768 t
ORIGIN      Chromosome 20q13.1-q13.2.
1 gcgcgacgcg gcctagagcg gcagacggcg cagtgggccg agaaggaggc gcagcagccg 61 ccctggcccg tcatggagat ggaaaaggag ttcgagcaga tcgacaagtc cgggagctgg 121 gcggccattt accaggatat ccgacatgaa gccagtgact tcccatgtag agtggccaag 181 cttcctaaga acaaaaaccg aaataggtac agagacgtca gtccctttga ccatagtcgg 241 attaaactac atcaagaaga taatgactat atcaacgcta gtttgataaa aatggaagaa 301 gcccaaagga gttacattct tacccagggc cctttgccta acacatgcgg tcacttttgg 361 gagatggtgt gggagcagaa aagcaggggt gtcgtcatgc tcaacagagt gatggagaaa 421 ggttcgttaa aatgcgcaca atactggcca caaaaagaag aaaaagagat gatctttgaa 481 gacacaaatt tgaaattaac attgatctct gaagatatca agtcatatta tacagtgcga 541 cagctagaat tggaaaacct tacaacccaa gaaactcgag agatcttaca tttccactat 601 accacatggc ctgactttgg agtccctgaa tcaccagcct cattcttgaa ctttcttttc 661 aaagtccgag agtcagggtc actcagcccg gagcacgggc ccgttgtggt gcactgcagt 721 gcaggcatcg gcaggtctgg aaccttctgt ctggctgata cctgcctctt gctgatggac 781 aagaggaaag acccttcttc cgttgatatc aagaaagtgc tgttagaaat gaggaagttt 841 cggatggggc tgatccagac agccgaccag ctgcgcttct cctacctggc tgtgatcgaa 901 ggtgccaaat tcatcatggg ggactcttcc gtgcaggatc agtggaagga gctttcccac 961 gaggacctgg agcccccacc cgagcatatc cccccacctc cccggccacc caaacgaatc 1021 ctggagccac acaatgggaa atgcagggag ttcttcccaa atcaccagtg ggtgaaggaa 1081 gagacccagg aggataaaga ctgccccatc aaggaagaaa aaggaagccc cttaaatgcc 1141 gcaccctacg gcatcgaaag catgagtcaa gacactgaag ttagaagtcg ggtcgtgggg 1201 ggaagtcttc gaggtgccca ggctgcctcc ccagccaaag gggagccgtc actgcccgag 1261 aaggacgagg accatgcact gagttactgg aagcccttcc tggtcaacat gtgcgtggct 1321 acggtcctca cggccggcgc ttacctctgc tacaggttcc tgttcaacag caacacatag 1381 cctgaccctc ctccactcca cctccaccca ctgtccgcct ctgcccgcag agcccacgcc 1441 cgactagcag gcatgccgcg gtaggtaagg gccgccggac cgcgtagaga gccgggcccc 1501 ggacggacgt tggttctgca ctaaaaccca tcttccccgg atgtgtgtct cacccctcat 1561 ccttttactt tttgcccctt ccactttgag taccaaatcc acaagccatt ttttgaggag 1621 agtgaaagag agtaccatgc tggcggcgca gagggaaggg gcctacaccc gtcttggggc 1681 tcgccccacc cagggctccc 
tcctggagca tcccaggcgg gcggcacgcc agacagcccc 1741 ccccttgaat ctgcagggag caactctcca ctccatattt atttaaacaa ttttttcccc 1801 aaaggcatcc atagtgcact agcattttct tgaaccaata atgtattaaa attttttgat 1861 gtcagccttg catcaagggc tttatcaaaa agtacaataa taaatcctca ggtagtactg 1921 ggaatggaag gctttgccat gggcctgctg cgtcagacca gtactgggaa ggaggacggt 1981 tgtaagcagt tgttatttag tgatattgtg ggtaacgtga gaagatagaa caatgctata 2041 atatataatg aacacgtggg tatttaataa gaaacatgat gtgagattac tttgtcccgc 2101 ttattctgct ccctgttatc tgctagatct agttctcaat cactgctccc ccgtgtgtat 2161 tagaatgcat gtaaggtctt cttgtgtcct gatgaaaaat atgtgcttga aatgagaaac 2221 tttgatctct gcttactaat gtgccccatg tccaagtcca acctgcctgt gcatgacctg 2281 atcattacat ggctgtggtt cctaagcctg ttgctgaagt cattgtcgct cagcaatagg 2341 gtgcagtttt ccaggaatag gcatttgcct aattcctggc atgacactct agtgacttcc 2401 tggtgaggcc cagcctgtcc tggtacagca gggtcttgct gtaactcaga cattccaagg 2461 gtatgggaag ccatattcac acctcacgct ctggacatga tttagggaag cagggacacc 2521 ccccgccccc cacctttggg atcagcctcc gccattccaa gtcgacactc ttcttgagca 2581 gaccgtgatt tggaagagag gcacctgctg gaaaccacac ttcttgaaac agcctgggtg 2641 acggtccttt aggcagcctg ccgccgtctc tgtcccggtt caccttgccg agagaggcgc 2701 gtctgcccca ccctcaaacc ctgtggggcc tgatggtgct cacgactctt cctgcaaagg 2761 gaactgaaga cctccacatt aagtggcttt ttaacatgaa aaacacggca gctgtagctc 2821 ccgagctact ctcttgccag cattttcaca ttttgccttt ctcgtggtag aagccagtac 2881 agagaaattc tgtggtggga acattcgagg tgtcaccctg cagagctatg gtgaggtgtg 2941 gataaggctt aggtgccagg ctgtaagcat tctgagctgg cttgttgttt ttaagtcctg 3001 tatatgtatg tagtagtttg ggtgtgtata tatagtagca tttcaaaatg gacgtactgg 3061 tttaacctcc tatccttgga gagcagctgg ctctccacct tgttacacat tatgttagag 3121 aggtagcgag ctgctctgct atgtccttaa gccaatattt actcatcagg tcattatttt 3181 ttacaatggc catggaataa accattttta caaaa // LOCUS HUMPPPB1A1 276 bp ds-DNA PRI 19-JUL-1990 DEFINITION Human protein phosphotyrosyl phosphatase 1B (PTP1B) gene, exon x. ACCESSION M33688 KEYWORDS protein phosphotyrosyl phosphatase. 
SEGMENT 1 of 5 SOURCE Human DNA, (library of Clontech), clone lambda-10-2. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 276) AUTHORS Brown-Shimer,S., Johnson,K.A., Lawrence,J.B., Johnson,C., Bruskin,A., Green,N.R. and Hill,D.E. TITLE Molecular cloning and chromosome mapping of the human gene encoding protein phosphotyrosyl phosphatase 1B JOURNAL Proc. Natl. Acad. Sci. U.S.A. (1990) In press STANDARD full staff_entry COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by D.E.Hill, 13-APR-1990. FEATURES from to/span description pept / 34 + 243 protein phosphotyrosyl phosphatase 1B, exon x (EC 3.1.3.48) (AA at 34) pre-msg < 1 > 276 PTP1B mRNA and introns IVS < 1 33 PTP1B intron x-1 IVS 244 > 276 PTP1B intron x BASE COUNT 56 a 77 c 69 g 74 t ORIGIN Chromosome 20q13.1-q13.2. 1 ctttagaatc tactagatga ttttctcttt cagacccaag aaactcgaga gatcttacat 61 ttccactata ccacatggcc tgactttgga gtccctgaat caccagcctc attcttgaac 121 tttcttttca aagtccgaga gtcagggtca ctcagcccgg agcacgggcc cgttgtggtg 181 cactgcagtg caggcatcgg caggtctgga accttctgtc tggctgatac ctgcctcttg 241 ctggtaagga ggcctcgcgg gtgccctggg gagctc // LOCUS HUMPPPB1A2 453 bp ds-DNA PRI 19-JUL-1990 DEFINITION Human protein phosphotyrosyl phosphatase 1B (PTP1B) gene, exon x+1. ACCESSION M33687 KEYWORDS protein phosphotyrosyl phosphatase. SEGMENT 2 of 5 SOURCE Human DNA, (library of Clontech), clone lambda-10-2. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 453) AUTHORS Brown-Shimer,S., Johnson,K.A., Lawrence,J.B., Johnson,C., Bruskin,A., Green,N.R. and Hill,D.E. TITLE Molecular cloning and chromosome mapping of the human gene encoding protein phosphotyrosyl phosphatase 1B JOURNAL Proc. Natl. Acad. Sci. U.S.A. 
(1990) In press STANDARD full staff_entry COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by D.E.Hill, 13-APR-1990. FEATURES from to/span description pept + 236 + 397 protein phosphotyrosyl phosphatase 1B, exon x+1 (EC 3.1.3.48) pre-msg < 1 > 453 PTP1B mRNA and introns IVS < 1 235 PTP1B intron x IVS 398 > 453 PTP1B intron x+1 BASE COUNT 104 a 118 c 111 g 120 t ORIGIN Chromosome 20q13.1-q13.2. 1 ggggaggtcc cagactctta accagatctc ttgtgaatgc attgcctcag ggaggcacca 61 agcctttcat gaggacctgt ccccctgacc cagacacctc ccacccagcc ccacctccaa 121 cactagggat cacatttcag catgagattg ggaggggaca gacatctaac ggtgttatta 181 acgttgccct tgagaattgg acctggctga cttatatctc ctctctggct ttcagatgga 241 caagaggaaa gacccttctt ccgttgatat caagaaagtg ctgttagaaa tgaggaagtt 301 tcggatgggg ctgatccaga cagccgacca gctgcgcttc tcctacctgg ctgtgatcga 361 aggtgccaaa ttcatcatgg gggactcttc cgtgcaggtc agcattgcct ttgtttgaat 421 ccaggtgtga ccattttaac ttttttgtct ttg // LOCUS HUMPPPB1A3 426 bp ds-DNA PRI 19-JUL-1990 DEFINITION Human protein phosphotyrosyl phosphatase 1B (PTP1B) gene, exon x+2. ACCESSION M33686 KEYWORDS protein phosphotyrosyl phosphatase. SEGMENT 3 of 5 SOURCE Human DNA, (library of Clontech), clone lambda-10-2. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 426) AUTHORS Brown-Shimer,S., Johnson,K.A., Lawrence,J.B., Johnson,C., Bruskin,A., Green,N.R. and Hill,D.E. TITLE Molecular cloning and chromosome mapping of the human gene encoding protein phosphotyrosyl phosphatase 1B JOURNAL Proc. Natl. Acad. Sci. U.S.A. (1990) In press STANDARD full staff_entry COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by D.E.Hill, 13-APR-1990. 
FEATURES from to/span description pept + 46 + 269 protein phosphotyrosyl phosphatase 1B, exon x+2 (EC 3.1.3.48) pre-msg < 1 > 426 PTP1B mRNA and introns IVS < 1 45 PTP1B intron x+1 IVS 270 > 426 PTP1B intron x+2 BASE COUNT 115 a 122 c 109 g 80 t ORIGIN Chromosome 20q13.1-q13.2. 1 gaagtgaaca ctaatagact tccttcctct tgctgctctt tcaaggatca gtggaaggag 61 ctttcccacg aggacctgga gcccccaccc gagcatatcc ccccacctcc ccggccaccc 121 aaacgaatcc tggagccaca caatgggaaa tgcagggagt tcttcccaaa tcaccagtgg 181 gtgaaggaag agacccagga ggataaagac tgccccatca aggaagaaaa aggaagcccc 241 ttaaatgccg caccctacgg catcgaaagg taatatattg ggtccagctt gttggggtga 301 ggggaaatga cttctgttct agaaacacac gctggtactg aaaccctgtg atgcagcctc 361 tgttggcaag cagcgcttcg catccttggg aacagggcgc tggaccaaca cccactccac 421 tggtgg // LOCUS HUMPPPB1A4 732 bp ds-DNA PRI 19-JUL-1990 DEFINITION Human protein phosphotyrosyl phosphatase 1B (PTP1B) gene, exon x+3. ACCESSION M33685 KEYWORDS protein phosphotyrosyl phosphatase. SEGMENT 4 of 5 SOURCE Human DNA, (library of Clontech), clone lambda-10-2. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 732) AUTHORS Brown-Shimer,S., Johnson,K.A., Lawrence,J.B., Johnson,C., Bruskin,A., Green,N.R. and Hill,D.E. TITLE Molecular cloning and chromosome mapping of the human gene encoding protein phosphotyrosyl phosphatase 1B JOURNAL Proc. Natl. Acad. Sci. U.S.A. (1990) In press STANDARD full staff_entry COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by D.E.Hill, 13-APR-1990. FEATURES from to/span description pept + 402 + 597 protein phosphotyrosyl phosphatase 1B, exon x+3 (EC 3.1.3.48) pre-msg < 1 > 732 PTP1B mRNA and introns IVS < 1 401 PTP1B intron x+2 IVS 598 > 732 PTP1B intron x+3 BASE COUNT 164 a 193 c 205 g 170 t ORIGIN Chromosome 20q13.1-q13.2. 
1 tctgtagctc taaagaatga gatctggtgt actgatgtgg ccagacattg caattgcagt 61 acatgagaag gcaaatcata cagtagtgtg tacaccagtg agtcctccag ccagataaat 121 cctcacagtg accagtcgcc caggcacctt gtgaacccta ccctgggtgt gggtgctatc 181 tgaagtacct gggggagggg gtgacaagtg gacttcaggc tgatgtggcc ctggcctggc 241 cctccctcca agcagagggg gctggcacgc tggaaggtta acatcatcca actctgtcta 301 cacgtggctt gttttttcct agaattcctg ccacaatagc agcatccttg ccattcattt 361 tctccaaagt gagtacccat ctctgccctc tgattcctca gcatgagtca agacactgaa 421 gttagaagtc gggtcgtggg gggaagtctt cgaggtgccc aggctgcctc cccagccaaa 481 ggggagccgt cactgcccga gaaggacgag gaccatgcac tgagttactg gaagcccttc 541 ctggtcaaca tgtgcgtggc tacggtcctc acggccggcg cttacctctg ctacagggta 601 tgtttccact gacagacgcg ctgggcagat gctcgtgtgc agagagcact ggccgctagc 661 ccgatggtag gattcagttc tgtggtgcat ctgagccagt ctcagaagaa acagatcaag 721 gttttaagtc tg // LOCUS HUMPPPB1A5 365 bp ds-DNA PRI 19-JUL-1990 DEFINITION Human protein phosphotyrosyl phosphatase 1B (PTP1B) gene, exon x+4. ACCESSION M33684 KEYWORDS protein phosphotyrosyl phosphatase. SEGMENT 5 of 5 SOURCE Human DNA, (library of Clontech), clone lambda-10-2. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 365) AUTHORS Brown-Shimer,S., Johnson,K.A., Lawrence,J.B., Johnson,C., Bruskin,A., Green,N.R. and Hill,D.E. TITLE Molecular cloning and chromosome mapping of the human gene encoding protein phosphotyrosyl phosphatase 1B JOURNAL Proc. Natl. Acad. Sci. U.S.A. (1990) In press STANDARD full staff_entry COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by D.E.Hill, 13-APR-1990. FEATURES from to/span description pept + 266 289 protein phosphotyrosyl phosphatase 1B, exon x+4 (EC 3.1.3.48) pre-msg < 1 > 365 PTP1B mRNA and introns IVS < 1 265 PTP1B intron x+3 BASE COUNT 80 a 101 c 88 g 96 t ORIGIN Chromosome 20q13.1-q13.2. 
        1 tacctcctaa gacttttacg gttttaaata ttttacctct ttccaggtgg catctgagta
       61 catcagatgg ttttgcaaaa tgcaaacaat tttttccttg gggatgattt ttggggagag
      121 ggggctactg taaaaaataa aaccaaaacc ccctttgctc cctcggaggt tgaagttgcc
      181 ggggggtgtg gccggggtca tgcatgaggc gacagcactg caggtgcggg tctgggctca
      241 tctgaactgt ttggtttcat tccagttcct gttcaacagc aacacatagc ctgaccctcc
      301 tccactccac ctccacccac tgtccgcctc tgcccgcaga gcccacgccc gactagcagg
      361 catgc
//
LOCUS       HB3HBLA      1319 bp ds-DNA             PHG       19-JUL-1990
DEFINITION  Bacteriophage HB-3 amidase (hbl) gene, complete cds.
ACCESSION   M34652
KEYWORDS    amidase.
SOURCE      Bacteriophage HB-3 (host Streptococcus pneumoniae) DNA.
  ORGANISM  Bacteriophage HB-3
            Viridae; Nonclassified viruses.
REFERENCE   1  (bases 1 to 1319)
  AUTHORS   Romero,A., Lopez,R. and Garcia,P.
  TITLE     Sequence of the Streptococcus pneumoniae bacteriophage HB-3
            amidase reveals high homology with the major host autolysin
  JOURNAL   Unpublished (1990)
  STANDARD  full staff_entry
COMMENT     Draft entry and computer-readable sequence for [1] kindly
            submitted by A.Romero, 25-MAY-1990.
            Consejo superior de Investigaciones Cientificas
            Centro de Investigaciones Biologicas
            Velazquez 144
            Madrid, 28006 SPAIN
FEATURES       from  to/span    description
    pept        298     1254    amidase (hbl)
    binding     283      289    ribosomal binding site (put.)
site 1285 1305 transcription stop sequence BASE COUNT 412 a 268 c 334 g 305 t ORIGIN 1 aagcttttta acagtagcag taggcggtat tgtaaaagca gtaaaagatt atcttttgcg 61 taaaggcgga gagaaagcgg tgatcatcgc tgaaattcta gctaaaatgc agttcatgcc 121 gttgagcaag tagcttcaga gactggctat aagggcgaag aaaagctgga gcaggctcgt 181 gctaaagtcc gtgctgagct tacaaaatac aatattagca tgactgacaa aaacttagac 241 accttcgtag agtcagcagt gaagcagatg aatgacgcat ggaaaggacg atagggaatg 301 gatatcgata gaaacagact acgtacaggc ttgccccagg ttggggtgca gccttatcga 361 caagtacatg ctcactcaac aggtaaccgc aactcaaccg tacagaatga agcggattat 421 cactggcgga aagacccaga attaggtttt ttctcgcacg ttgttgggaa ctttcgcatc 481 atgcaggtcg gacctgtgaa caacggaagt tgggatgttg ggggcggttg gaatgctgag 541 acctatgcag cggttgaact gattgaaagc cattcaacta aggaagagtt tatggctgac 601 tatcgcctct atatcgaatt gctacgcaat ctagcggacg aagcaggctt gccgaagact 661 cttgatacag acgacttggc aggtatcaag acgcatgaat actgtaccaa taaccaacca 721 aacaaccact cagaccatgt ggatccatat ccatatcttg caagttgggg cattagccgt 781 gaacagttta agcaagacat cgaaaacggc ttgagcgctg caacaggctg gcagaaaaat 841 ggcactggct actggtacgt acattcagac ggctcttatt caaaagataa gtttgagaaa 901 atcaacggta cctggtatta tttcgatggc tcaggctata tgctttcaga ccgctggaag 961 aagcacacag acggtaattg gtactacttt gaccaatcag gcgaaatggc cacaggctgg 1021 aagaaaatcg ctgacaagtg gtactatttt gatgtagaag gtgccatgaa gacaggctgg 1081 gtcaagtaca aggacacttg gtactactta gacgctaaag aaggcgccat ggtatcaaat 1141 gcctttatcc agtcagcgga cggaacaggc tggtactacc tcaaaccaga cggaacactg 1201 gcagacaagc cagagttcac agtagagcca gatggcttga ttacagttaa ataaatagaa 1261 aggaaacttt ctaaattgtt cttcaccgca gctcaggctt acggtttttt tgttttaaa // LOCUS FIBGLUC 1426 bp ds-DNA BCT 19-JUL-1990 DEFINITION F.succinogenes 1,3-1,4-beta-D-glucan 4-glucanohydrolase gene, complete cds. ACCESSION M33676 KEYWORDS 1,3-1,4-beta-D-glucan 4-glucanohydrolase; beta-glucanase. SOURCE F.succinogenes (strain S85) DNA, clone PJI5. 
ORGANISM Fibrobacter succinogenes Prokaryota; Bacteria; Gracilicutes; Scotobacteria; Sulfate- or sulfur-reducing dissimilatory bacteria. REFERENCE 1 (bases 1 to 1426) AUTHORS Teather,R.M. and Erfle,J.D. TITLE DNA sequence of a Fibrobacter succinogenes mixed linkage beta-glucanase (1,3-1,4-beta-D-glucan 4-glucanohydrolase) gene JOURNAL J. Bacteriol. 172, 3837-3841 (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by R.M.Teather, 11-APR-1990. FEATURES from to/span description pept 145 1194 1,3-1,4-beta-D-glucan 4-glucanohydrolase precursor (EC 3.2.1.73) sigp 145 225 1,3-1,4-beta-D-glucan 4-glucanohydrolase signal peptide matp 226 1191 1,3-1,4-beta-D-glucan 4-glucanohydrolase binding 132 137 ribosome binding site signal 62 66 -35 region signal 85 90 -10 region BASE COUNT 371 a 346 c 335 g 374 t ORIGIN 1 ttttcagcac agcacactgc cacaattgat acagttaatc ttttaaatac attctatttt 61 attggttatt taatttcgct aacttatctt tatctttggt taaatgggat tctgttttgt 121 acagaaactt catggagaaa aaatatgaac atcaagaaaa ctgcagtcaa gagcgctctc 181 gccgtagcag ccgcagcagc agccctcacc accaatgtta gcgcaaagga ttttagcggt 241 gccgaactct acacgttaga agaagttcag tacggtaagt ttgaagcccg tatgaagatg 301 gcagccgcat cgggaacagt cagttccatg ttcctctacc agaatggttc cgaaatcgcc 361 gatggaaggc cctgggtaga agtggatatt gaagttctcg gcaagaatcc gggcagtttc 421 cagtccaaca tcattaccgg taaggccggc gcacaaaaga ctagcgaaaa gcaccatgct 481 gttagccccg ccgccgatca ggctttccac acctacggtc tcgaatggac tccgaattac 541 gtccgctgga ctgttgacgg tcaggaagtc cgcaagacgg aaggtggcca ggtttccaac 601 ttgacaggta cacagggact ccgttttaac ctttggtcgt ctgagagtgc ggcttgggtt 661 ggccagttcg atgaatcaaa gcttccgctt ttccagttca tcaactgggt caaggtttat 721 aagtatacgc cgggccaggg cgaaggcggc agcgacttta cgcttgactg gaccgacaat 781 tttgacacgt ttgatggctc ccgctggggc aagggtgact ggacatttga cggtaaccgt 841 gtcgacctca ccgacaagaa catctactcc agagatggca tgttgatcct cgccctcacc 901 cgcaaaggtc aggaaagctt caacggccag gttccgagag atgacgaacc tgctccgcaa 961 
tcttctagca gcgctccggc atcttctagc agtgttccgg caagctcctc tagcgtccct
     1021 gcctcctcga gcagcgcatt tgttccgccg agctcctcga gcgccacaaa cgcaatccac
     1081 ggaatgcgca caactccggc agttgcaaag gaacaccgca atctcgtgaa cgccaagggt
     1141 gccaaggtga acccgaatgg ccacaagcgt tatcgcgtga actttgaaca ctaatcgtgg
     1201 ctgattctct ttataattct ctttatcgca aagaccatgt ggtttactcc acatggtttt
     1261 tcgttaagtc cactaaaatt aggggatttt cgctattttt tttgaatttt gacactaaaa
     1321 tgtcaaatga gtttttgtat ttttgatttc gaaattttta aaaattaaaa taggatagtt
     1381 atatggctta tttgaataag gttatgctca tcggtaatat cggtaa
//
LOCUS       PP1BOFFO      931 bp ds-DNA             PHG       19-JUL-1990
DEFINITION  Bacteriophage P1 regulatory protein (bof) gene, complete cds.
ACCESSION   M33224
KEYWORDS    regulatory protein.
SOURCE      Bacteriophage P1 viral DNA.
  ORGANISM  Bacteriophage P1
            Viridae; ds-DNA nonenveloped viruses; Myoviridae.
REFERENCE   1  (bases 1 to 931)
  AUTHORS   Schaefer,T.S. and Hays,J.B.
  TITLE     The bof gene of bacteriophage P1. DNA sequence and evidence for
            roles in regulation of phage c1 and ref genes
  JOURNAL   J. Bacteriol. 172, 3269-3277 (1990)
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence for [1] kindly
            submitted by J.B.Hays, 26-MAR-1990.
FEATURES       from  to/span    description
    pept      > 544      789    Bof regulatory protein
    site        541      543    potential ttg start codon for Bof
BASE COUNT      260 a    211 c    243 g    217 t
ORIGIN      Map position 9-10.
1 gggtaactgg tggattatcg agacaaaaca caacgtggcg gacgttctgg ccgtcatcca 61 acaatacgca taacaggagc gcccggttcg cgctgcgcat aatatggcca cactatctga 121 tacaataaaa ccgaataaaa catatcttga ggcggtactg cgtacggcat tattaggaaa 181 gacagaagac gaatacgttg atttcttcct gtcagggcta cgcgggcgat tactgaaaaa 241 tccccgcctg taccgcagct atggcccata ctggcggaaa ttaaaaaatt attactggag 301 cgacggttat ggtaatttcg gtcgtctcgt tgaccgtgac gttcgcaaat tttaccgtta 361 tgaccgcccg gcgctaacac tcatagccgc gacgctctac agccatgagc gttttgataa 421 tggtcagata tactcagcct ggcatttact gccagtccct gaagaagttg acgaccagga 481 ctatgagttt gagtcttacg atttggaagt tgaagccttg gcacaggctg gagagaaaac 541 ttgaaaaagc gatactacac agtaaagcat gggacgctac gagcattaca agagtttgct 601 gacaagcata acgttgaggt gcgcagggaa gggggaagta aagctctgcg catgtaccgt 661 ccggacggga aatggcggac ggtcgtcgat ttcaaaacaa acagtgttcc ccagggcgtc 721 cgtgaccggg cattcgaaga atgggagcag atcatcatag ataatgcatt gcttctcaat 781 gcggattaaa cttccccaaa ttagggctgt ttgctcaccg agcatcgctc aaagaagcac 841 gattcttcaa acatatagat agtgatagtg ccacaacttc tggctctaac gggctgggga 901 ggcggcgctt tgttgctaaa tgatctggtt t // LOCUS STRTEE6 2508 bp ds-DNA BCT 19-JUL-1990 DEFINITION S.pyogenes trypsin-resistant surface T6 protein (tee6) gene, complete cds. ACCESSION M32978 KEYWORDS surface protein; trypsin-resistant surface T6 protein. SOURCE S.pyogenes (strain D471, sub-species M-type 6) DNA. ORGANISM Streptococcus pyogenes Prokaryota; Bacteria; Firmicutes; Gram-positive cocci; Streptococcaceae. REFERENCE 1 (bases 1 to 2508) AUTHORS Schneewind,O., Jones,K.F. and Fischetti,V.A. TITLE Sequence and structural characteristics of the trypsin-resistant T6 surface protein of group A streptococci JOURNAL J. Bacteriol. 172, 3310-3317 (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by O.Schneewind, 18-MAR-1990. 
FEATURES from to/span description pept 719 2332 trypsin-resistant surface T6 protein (tee6) precursor sigp 719 784 trypsin-resistant surface T6 protein signal peptide matp 785 2329 trypsin-resistant surface T6 protein binding 706 709 ribosome binding site signal 2353 2382 transcription terminator BASE COUNT 929 a 364 c 480 g 735 t ORIGIN 1 aagcttcaga tgaagcctat gagaagtata aggataacga aggaagatat agcgaaatgg 61 gagattccga tactgattat ggaaccaacc aaactagttc tggaaaaggt ggtttgcctt 121 ctaattcaga tgcttcggtt aattatatgg cagatggtcg tgaacagaaa ttaccttata 181 agcacccagt gattcaggtc aaaacagtac caatcacgtt taccaaagta gatgctgaca 241 acaaccagaa aaaacttgca ggtgttgagt ttgaactccg taaagaggac aagaagatcg 301 tctgggaaaa gggaacaaca ggttcaaatg gccaactcaa ctttaagtac cttcaaaaag 361 gcaaaaccta ttatctgtat gagacgaagg caaaacttgg atacactctt ccagaaaatc 421 catgggaagt tgccgttgct aacaacggtg atataaaagt aaaacacccg attgaaggtg 481 aattgaagtc aaaagatggc tcttacatga ttaaaaatta taagatttat cagttgccat 541 cgtctggggg aagaggaagt caaattttca ttatagttgg tagcatgaca gcaactgtag 601 cattattatt ttatagacgc caacacagga aaaagcaata ttaaattaat gatcatattt 661 attgacaaac aggagagaaa cagtgagaga gaagatatta ataacagcaa aaaaactaat 721 gctagcttgt ttagctatct tagcggtagt agggcttgga atgacaagag tatcagcttt 781 atcaaaagat gatactgcac aactaaagat aacaaatatt gaaggtgggc caacagtaac 841 actttataaa ataggagaag gtgtttacaa cactaatggt gattctttta ttaactttaa 901 atatgctgag ggggtttctt taactgaaac aggacctaca tcacaagaaa ttactactat 961 tgcaaatggt attaatacgg gtaaaataaa gccttttagt actgaaaacg ttagtatttc 1021 taatggaaca gcaacttata atgcgagagg tgcatctgtt tatattgcat tattaacagg 1081 tgcgacagat ggccgtacct acaatcctat tttattagct gcatcttata atggtgaggg 1141 aaatttagtt actaaaaata ttgattccaa atctaattat ttatatggac aaacaagtgt 1201 tgcaaaatca tcattaccat ctattacaaa gaaagtaacc gggacaatag atgacgtgaa 1261 taaaaagact acctcgttag gaagtgtatt gtcttattcg ctgacatttg aattaccaag 1321 ttataccaaa gaagcagtca ataaaacagt atatgtttct gataatatgt cggaaggtct 1381 tacttttaac tttaatagtc ttacagtaga 
atggaaaggt aagatggcta atattactga 1441 agatggttca gtaatggtag aaaatacaaa aatcggaata gctaaggagg ttaataacgg 1501 ttttaattta agttttattt atgatagttt agaatctata tcaccaaata taagttataa 1561 agctgttgta aacaataaag ctattgttgg tgaagagggt aatcctaata aagctgaatt 1621 cttctattca aataatccaa caaaaggtaa tacatacgat aatttagata ggaagcctga 1681 taaagggaat ggtattacat ccaaagaaga ttctaaaatt gtttatactt atcaaatagc 1741 gtttagaaaa gttgatagtg ttagtaagac cccacttatt ggtgcaattt ttggagttta 1801 tgatactagt aataaattaa ttgatattgt tacaaccaat aaaaatggat atgctatttc 1861 aacacaagta tcttcaggaa aatataaaat taaggaatta aaagctccta aaggttattc 1921 attgaataca gaaacttatg aaattacggc aaattgggta actgctacag tcaagacaag 1981 tgctaattca aaaagtacta cttatacatc tgataaaaat aaggcgacag ataattcaga 2041 gcaagtagga tggttaaaaa atggtatatt ctattctata gatagtagac ctacaggaaa 2101 tgatgttaaa gaggcttata ttgaatctac taaggcttta actgatggaa caactttctc 2161 aaaatcgaat gaaggttcag gtacagtatt attagaaact gacatcccta acaccaagct 2221 aggtgaatta ccttcgacag gtagcattgg tacttacctc tttaaagcta ttggttcggc 2281 tgctatgatt ggtgcaattg gtatttatat tgttaaacgt cgtaaagctt aatgctttca 2341 aaagtcgaaa tcaatcgaga ctgtctttat gcggtctcga tttttaatga taaggaactg 2401 ctatgacaga aagactaaaa aatctaggga tactcttatt atttttattg ggaacagcca 2461 tttttcttta ccctacgcta agtagtcagt ggaatgccta tcgtgatc // LOCUS HALHPA 1317 bp ds-DNA BCT 19-JUL-1990 DEFINITION H.volcanii histidinol-phosphate-aminotransferase (hisC) gene, complete cds. ACCESSION M33161 KEYWORDS histidinol-phosphate-aminotransferase. SOURCE H.volcanii (strain DSM 3757) cell line WFD 18 DNA, clone 477. ORGANISM Halobacterium volcanii Prokaryota; Bacteria; Mendosicutes; Archaeobacteria; Halobacteriales; Halobacteriaceae. REFERENCE 1 (bases 1 to 1317) AUTHORS Conover,R.K. and Doolittle,W.F. TITLE Characterization of a gene involved in histidine biosynthesis in Halobacterium (Haloferax) volcanii: Isolation and rapid mapping by transformation of an auxotroph with cosmid DNA JOURNAL J. Bacteriol. 
172, 3244-3249 (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by R.K.Conover, 22-MAR-1990. FEATURES from to/span description pept 121 1206 histidinol-phosphate-aminotransferase (hisC) (EC 2.6.1.9) BASE COUNT 234 a 504 c 410 g 169 t ORIGIN 1 agtcgttcgg gcggcgctcg gctgacggcc gtcggtcgtc gcgtccccaa cccgaccccc 61 taccgccacg tccgacccgg agtacgcacc cttaagaacc gcgacccgca ttttccgacc 121 atgcaaccac gggacctctc cgcgcacgct ccctacgtac ccggccgcgg gacagaggag 181 gtcgcccgcg aactcggaat ggaccccgag gacctgacga aactctcctc gaacgagaac 241 ccccacggcc cgagtccgaa ggcggtcgcc gccatcgaag acgccgcgcc gaccgtgagc 301 gtctacccga agaccgccca cacggacctg accgaacgcc tcgccgacaa gtggggcctc 361 gcacccgaac aggtgtgggt gtctcccggc gcggacggct ctatcgacta cctgacccgc 421 gcggtgctcg aaccggacga ccggattctc gaacccgcgc ccggcttttc gtactactcg 481 atgagcgccc gctaccacca cggcgacgcc gtccagtacg aggtgtcgaa ggacgacgac 541 ttcgaacaga ccgccgacct cgtcctcgac gcctacgacg gcgagcgcat ggtctacctc 601 acaacgccgc acaaccccac cggttccgtg ctcccgcggg aggaactcgt cgaactggcc 661 gagtcggtcg aagagcacac gctcctcgtc gtcgacgagg cctacggcga gttcgccgag 721 gagccgtcgg ccatcgacct cttgtcggag tacgacaacg tcgcggccct gcggacgttc 781 tcgaaggcgt acgggctggc cggcctccgc atcggctacg cctgcgtgcc cgaggcgtgg 841 gccgacgcct acgcccgcgt gaacacgccg ttcgccgcca gcgaggtcgc ctgccgcgcc 901 gcgctcgccg cgctcgacga cgaggaacac gtcgagaaat ccgtcgagtc ggcccggtgg 961 tcccgcgact atctccgcga acacctcgac gcgccgacgt gggaaagcga gggcaacttc 1021 gtcctcgtcg aggtcggcga cgccacggcc gtcaccgagg ccgcccagcg cgagggcgtc 1081 atcgtccgcg actgcgggag cttcggcctg ccggagtgca tccgcgtctc ctgcggcacg 1141 gaaacccaga ccaagcgcgc cgtggacgtg ctcaaccgca tcgtctcgga ggtgccgacg 1201 gcgtgagaga cgacgacacc ggcacgcccg gcaccggaaa gaccacggcg accgagccgg 1261 tcgccgccga cctcgacctc gacgtggtcc acctcaaccg actcgtgaaa gacgagg // LOCUS BOVGOA 472 bp ss-mRNA MAM 19-JUL-1990 DEFINITION B.taurus go-alpha mRNA, 3' end. ACCESSION J02900 KEYWORDS go-alpha. 
SOURCE B.taurus retina, cDNA to mRNA, clone GO3.1. ORGANISM Bos taurus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Artiodactyla; Ruminantia; Pecora; Bovidae. REFERENCE 1 (sites) AUTHORS Price,S.R., Murtagh,J.J.Jr., Tsuchiya,M., Serventi,I.M., Van Meurs,K.M., Angus,C.W., Moss,J. and Vaughan,M. TITLE Multiple forms of go-alpha mRNA: Analysis of the 3'-untranslated regions JOURNAL Biochemistry 29, 5069-5076 (1990) STANDARD full staff_review REFERENCE 2 (bases 1 to 472) AUTHORS Price,S.R., Murtagh,J.J.Jr., Tsuchiya,M., Serventi,I.M., Van Meurs,K.M., Angus,C.W., Moss,J. and Vaughan,M. JOURNAL Unpublished (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [2] kindly submitted by S.R.Price, 12-JUN-1990. FEATURES from to/span description pept < 1 3 go-alpha (AA at 1) BASE COUNT 130 a 133 c 88 g 121 t ORIGIN 1 tgacctcttg tcctgtatag caacctattt ggtaatgatt ccagcactca cagaaaagct 61 tgcacacata cacacacacc ccacccctcc ccactaacaa atgcaagttg gtaaacaaat 121 tccaaaaagg cataacaaac cttatatata tagacaaata tatattaaag ttttttagtc 181 tgtactagaa agagcttcag acagaactga ccaccattcc attgctcatc aatttcctgg 241 gacagcacct gagcgtgcgc ttacgcgcgt acacacacat agacacgcac tgcgatacaa 301 gtcctgattt gggagtccgt ccttttaaaa acagccacat gctttcacgc tctgagaccc 361 acccgtttct gtgagcaggg ggagggcaag gaaagccctg gcctcagtcc agccttttct 421 ctgcttccac ctgctcaggc tgtgtgctct tggttctgtc ctgcacttgt gt // LOCUS CAJCAT 1334 bp ds-DNA BCT 19-JUL-1990 DEFINITION C.coli plasmid C-589 chloramphenicol acetyltransferase (cat) gene, complete cds. ACCESSION M35190 KEYWORDS chloramphenicol acetyltransferase. SOURCE C.coli plasmid C-589 DNA. ORGANISM Campylobacter coli Prokaryota; Bacteria; Gracilicutes; Scotobacteria; Aerobic/microaerophilic, motile, helical/vibrioid bacteria. REFERENCE 1 (bases 1 to 1334) AUTHORS Wang,Y. and Taylor,D.E. 
TITLE Chloramphenicol resistance in Campylobacter coli, nucleotide sequence, expression and cloning vector construction JOURNAL Gene (1990) In press STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by D.E.Taylor, 15-JUN-1990. FEATURES from to/span description pept 309 932 chloramphenicol acetyltransferase (cat) mRNA 277 > 932 chloramphenicol acetyltransferase mRNA signal 242 271 promoter binding 297 301 ribosome binding site signal 960 1006 transcriptional termination signal BASE COUNT 433 a 232 c 282 g 387 t ORIGIN 1 attcccacaa cgccggaaac aagccgtgcc acgagcttat aataaaagag ggaagagaag 61 cgtatttttc ctcacttccg gtgaaggata tcgagaaaaa tctaaatgat aacggaattc 121 cgtcgtcggt atcgtatgga gcggacaacg agtaaaagag tgaccgccga gataacccat 181 tgctcggcgg tgttcctttc caagttaatt gcgtgatata gattgaaaag tggatagatt 241 tatgatatag tggatagatt tatgatataa tgagttatca acaaatcgga atttacggag 301 gataaatgat gcaattcaca aagattgata taaataattg gacacgaaaa gagtatttcg 361 accactattt tggcaatacg ccctgcacat atagtatgac ggtaaaactc gatatttcta 421 agttgaaaaa ggatggaaaa aagttatacc caactctttt atatggagtt acaacgatca 481 tcaatcgaca tgaagagttc aggaccgcat tagatgaaaa cggacaggta ggcgtttttt 541 cagaaatgct gccttgctac acagtttttc ataaggaaac tgaaaccttt tcgagtattt 601 ggactgagtt tacagcagac tatactgagt ttcttcagaa ctatcaaaag gatatagacg 661 cttttggtga acgaatggga atgtccgcaa agcctaatcc tccggaaaac actttccctg 721 tttctatgat accgtggaca agctttgaag gctttaactt aaatctaaaa aaaggatatg 781 actatctact gccgatattt acgtttggga agtattatga ggagggcgga aaatactata 841 ttcccttatc gattcaagtg catcatgccg tttgtgacgg ctttcatgtt tgccgttttt 901 tggatgaatt acaagacttg ctgaataaat aaaatcccag tttgtcgcac tgataaaaac 961 cctttaggaa ctaaagggcg cacttctata ctctctgtcg agagtagtgc gtcctgcgga 1021 gcttcattcc cggtcagcgc gcttatcaat atatctatag aatgggcaaa gcataaaaac 1081 ttgcatggac taatgcttga aacccaggac aataacctta tagcttgtaa attctatcat 1141 aattgtggtt tcaaaatcgg ctccgtcgat actatgttat acgccaactt tgaaaacaac 1201 tttgaaaaag 
ctgttttctg gtatttaagg ttttagaatg caaggaacag tgaattggag 1261 ttcgtcttgt tattaattag cttcttgggg tatctttaaa tactgtagaa agaggaagga 1321 aataataaat ggct // LOCUS CLOCBA 5120 bp ds-DNA BCT 19-JUL-1990 DEFINITION C.acetobutylicum beta-D-galactosidase (cbgA) and beta-D-galactosidase regulatory protein (cbgR) genes, complete cds. ACCESSION M35107 KEYWORDS beta-D-galactosidase; beta-D-galactosidase regulatory protein. SOURCE C.acetobutylicum (strain NCIB2951) DNA. ORGANISM Clostridium acetobutylicum Prokaryota; Bacteria; Firmicutes; Endospore-forming rods and cocci; Bacillaceae. REFERENCE 1 (bases 1260 to 5120) AUTHORS Hancock,K.R., Rockman,E., Pearce,L., Maddox,I.S. and Scott,D.B. TITLE Clostridium acetobutylicum beta-galactosidase gene, cbgA, is positively regulated in Escherichia coli by a novel regulatory gene, cbgR JOURNAL Unpublished (1990) STANDARD full staff_review REFERENCE 2 (bases 1 to 5120) AUTHORS Scott,D.B., Hancock,K.R., Pearce,L. and Maddox,I.S. JOURNAL Unpublished (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [2] kindly submitted by D.B.Scott, 11-JUN-1990. Author address:D.B.Scott: Molecular Genetics Unit Department of Microbiology and Genetics Massey University Palmerston North, New Zealand E-mail:D.B.Scott@massey.ac.nz FEATURES from to/span description pept 1560 4253 beta-D-galactosidase (cbgA) pept 4500 4805 beta-D-galactosidase regulatory protein (cbgR) BASE COUNT 1921 a 683 c 876 g 1640 t ORIGIN 1 bp upstream of EcoRI site. 
1 gaattccttt tcatatatat ctttaatatt tctactggaa tagaagaggt tgctcaatac 61 aaaaaatgct tctttaaaac tatttgaaac tacttctgaa atattttcta gcttactaaa 121 tagagaatta taatttttat catcaaaatt tagaattaca actatgattt cgttttcaat 181 attagcaatt tgtatattat aattgctatt taatccgtct aaagaaaatt ctttgccgat 241 ttctgaaatt gtaaaatcaa taatttcatg gcgtttgcta taattatcat atatttcttt 301 gcgtttaaac caaataagca aaatgattga aaagtaaata tgtatcaaag tagttaaagt 361 caggatcatg tcaaaacctg atataaggcg atttaaggcg ctattagtga gacttaaaga 421 gtttccttct aaagtatttc ttttcatttt tattgaaatc ttttttagag tacttaataa 481 ctcagaagga tttagagaag gttttaaaat ataatcaaca gcaccatttt gaaaagatga 541 tttaacatat tcaaaatcgc tataactact taagatgata attcttatct taggatattt 601 gtcctgcaca aatttagcta attcaacccc atttatttgg ggcattacaa catcagaaat 661 tataatgtca ggaatatcct tttttatcat ttccagagct tcttgaccat tagaagcctg 721 tcctataatt tgaaagcctt ctttttccca atcaatcata tgagttatgc cttgccgcat 781 aataaattca tcatcaacaa ctaaattttt actatattcg ttcaatagta tagcacccct 841 tattctaaaa ttaccacaac atagataaat attgcttaat actattatac cttatagatt 901 tattgtatgt atctgtatac gttacgttaa ttcatctaca aatttatatg agttttggtt 961 gcacttttag agaaaatctt tttgtctatg gtcttattgt cctataatgg tcaaatcatc 1021 tttaccaaag tctcttgatt taaagagata aaaacaccac tgatccatta ttcctcattt 1081 tggtaatgaa cctatgcggt tgaagatatt aatcagatgt ctaaatactt tagaaaaaaa 1141 gacctttact aatatcttca atatttacac ccctattcta aaattaccac aagatagata 1201 aatattgctt aatactgatt ataccttata gattaaaggt tttcaattaa acaataaatt 1261 actttagtaa agtttagtaa aatataattg attttttact aaaaagataa taaaatgaaa 1321 ctataaattt agttaatagc ataaatctaa catcagaaga taggataaat taaagaagta 1381 atgtaattga ttacgaaaca aaatctcata ttaatattag cccataattt ttttattctc 1441 atatatgttt aagtattaat taaatgtgac tttataaaaa ggttgcattt agttaatacg 1501 attaacaact ttaatttaaa aaagcaataa ctctacaaag tgaaagtgag ggggtaagta 1561 tgattaataa taaaccgtca ttagattggc tagaaaatcc ggaaatattt agagttaata 1621 gaatagatgc tcattctgat acttggtttt atgaaaaatt tgaggatgtt aaattagaag 1681 acaccatgcc tcttaagcaa 
aatttaaatg gaaaatggag attttcatat agtgaaaatt 1741 catcattaag aattaaagag ttttataagg atgagtttga cgtaagttgg attgattata 1801 ttgaagttcc aggtcatatt cagcttcaag gatatgataa atgtcaatat attaatacta 1861 tgtatccttg ggaaggtcac gatgaattaa gaccacctca tatttcaaaa acatataatc 1921 cggtgggaag ctatgtaaca ttttttgaag ttaaagatga actcaaaaat aagcagactt 1981 ttatttcttt tcaaggtgtt gaaacagcat tttacgtatg ggtaaatgga gaatttgtag 2041 gatatagcga agatacattt acaccatcag aatttgatat tactgattat ttaagagagg 2101 gagaaaataa acttgcagtt gaggtttata aaaggagtag cgcaagttgg atagaagatc 2161 aagatttctg gagattttca ggcatcttta gagatgtata tttatatgca gttccagaaa 2221 ctcatgtaaa tgatatattt ataaaaacag atttatatga cgatttcaaa aacgcaaagt 2281 taaatgctga acttaaaatg attggaaatt cagaaacaac agttgaaaca tatttagaag 2341 ataaagaagg aaataaaata gctatatctg aaaagattcc gttctctgat gagttgactt 2401 tatatttaga tgcgcaaaat ataaacctat ggagtgcaga agagcctaac ttatatacac 2461 tttatatttt agtgaataaa aaagatggta atttaattga ggttgtaact caaaagatag 2521 ggtttaggca ctttgaaatg aaggataaaa ttatgtgtct aaaatggaaa cgtattatct 2581 ttaaaggcgt aaaccgtcac gaatttagcg caagacgtgg acgctcaatt acgaaagagg 2641 acatgttgtg ggatattaag ttcttgaaac aacacaatat taatgctgtt agaacatcac 2701 attatccaaa tcaaagttta tggtacagac tttgcgatga atacgggatt tatttaatag 2761 atgaaacaaa tttagaaagc catggttcat ggcaaaagat ggggcagatt gaaccatcat 2821 ggaatgtgcc aggaagtctt ccacagtggc aggcagcagt tttagatcga gcatcatcaa 2881 tggttgaaag agataaaaat catccatctg tacttatttg gtcatgtggt aatgaatcct 2941 atgcgggtga agatatttat cagatgtcta aatactttag aaaaaaagat ccttcacgtt 3001 tagtgcacta tgaaggggta actagatgca gagaatttat gacacgacga catgaaagta 3061 gaatgtatgc aaaggcagca gaaatagaag aatatcttaa tgataatccg aagaaacctt 3121 atatacagct gcgatacatg cactcaatgg gtaactcaac tggtggaatg atgaaataca 3181 cagaacttga agataaatat ttgatgtatc aaggtggatt catttgggat tacggcgatc 3241 aggcgttgta tagaaaactt ccagatggaa aagaagttct agcttatgga ggagacttta 3301 cagatcgtcc aacagactat aatttctctg gaaatggttt gatttatgca gatagaacta 3361 tatcacctaa agcacaggaa gttaagtatc 
tatatcaaaa cgtaaaatta gaaccagatg 3421 aaaaaggggt gactattaag aatcaaaatc tttttgttaa tactgataaa tatgatttat 3481 actatatcgt tgaaagagat ggaaaactaa taaaagatgg ttatctaaat gtatctgtag 3541 ctccagatga agaaaaatat atagaacttc caataggaaa ttacaatttt cctgaagaaa 3601 ttgtacttac aacctcatta agattagcac aagctacact ttgggcagaa aaaggatatg 3661 aaatagcatt tggacaaaag gttattaaag aaaaatcaga tatgaataat cataattcag 3721 agtctaaaat gaagatcatt catggagatg taaacatagg ggttcacgga aaagatttca 3781 aggctatatt ctctaaacaa gagggaggaa tcgtatcctt gagatataat aataaggagt 3841 ttataacgag aacgccaaaa actttctatt ggagagcaac aacagataat gatagaggaa 3901 atagacatga atttagatgc agtcaatggc tggctgctac tatggggcag aagtatgtgg 3961 atttttcagt tgaggaattt gatgagaaga ttacattata ttatacttat caattgccaa 4021 cagtgccatc tactaatgtt aagataactt atgaagtatc tggagaagga ataattaaag 4081 taaatgttaa gtataaagga gttagcggat tacctgaatt gcctgtacta ggaatggatt 4141 ttaaattatt agccgaattt aattcattta gctggtatgg aatggggcca gaagaaaact 4201 atatagacag atgtgaaggt gcaaaacttg gaatatatga gagtacacaa tagaaaatct 4261 atcaaggtat ttagtaccac aagaatgtgg taacaggata ggaactagat gggtagtagt 4321 taaaaatcat aagaatgaag gtcttaaatt tacttatgtt aaagttccat ttgaatttag 4381 tgttttacca tacagcagca tggaattaga aaattcactt catatagaag aattaccatc 4441 tgttaatttt acacattgtg aatataatag gtaaacaaat gggtgttggc ggagatgcaa 4501 tgctggggag caccatgata cctaaattct gtatagattc aagtaaggat ttagaatata 4561 gttttataat ttctaaaatt atactacgca catatgggaa ctatagatat ccaaaacaaa 4621 acttagactt atgcaataat ttacgaaagg acaggtactc tgttgtttcg gttactaaga 4681 ataagttgag gctttctaac atcataagtt gcaccatttc agcatgctcc cgagacaagc 4741 tcgtgacaag caaaaatgga acaacttatg atgaagaaat gcctgcaaca tattctttaa 4801 tgtaacactg cacaaaagag tacctgtcct ttctgatata gcagattttt caagctataa 4861 gtatatctca cgaaatcata aatattttga ttccgaaaag ctatgaaaat atcgctgaag 4921 gttctaagca gctggttgtg tgcaccttag catgctccaa ctttcagttt gacaagctaa 4981 aatggaacaa tctacagctc aagaaacttt aacagctcat tttcaaatgt tttctacaca 5041 aatatattta tatttctagt gaagatatga aattaaattt 
ttagcgactt tgtaaatatg
     5101 ttaatctaat atacgaattc
//
LOCUS       ECOPNCB      1490 bp ds-DNA   BCT   19-JUL-1990
DEFINITION  E.coli nicotinic acid phosphoribosyl transferase (pncB) gene,
            complete cds.
ACCESSION   J05568
KEYWORDS    nicotinic acid phosphoribosyl transferase.
SOURCE      E.coli (strain K12) DNA.
  ORGANISM  Escherichia coli
            Prokaryota; Bacteria; Gracilicutes; Scotobacteria; Facultatively
            anaerobic rods; Enterobacteriaceae.
REFERENCE   1 (bases 1 to 1490)
  AUTHORS   Wubbolts,G., Terpstra,P., Van Beilen,J.B., Kingma,J.,
            Meesters,H.A.R. and Witholt,B.
  TITLE     Variation of cofactor levels in Escherichia coli: Sequence analysis
            and expression of the pncB gene encoding nicotinic acid
            phosphoribosyl transferase
  JOURNAL   J. Biol. Chem. (1990) In press
  STANDARD  full staff_review
REFERENCE   2 (bases 1 to 1490; revises [1])
  AUTHORS   Wubbolts,G., Terpstra,P., Van Beilen,J.B., Kingma,J.,
            Meesters,H.A.R. and Witholt,B.
  JOURNAL   Unpublished (1990)
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence for [2] kindly submitted
            by P.Terpstra, 31-MAY-1990.
FEATURES from to/span description pept 216 1418 nicotinic acid phosphoribosyl transferase (pncB) (EC 2.4.2.11) mRNA 158 > 1490 nicotinic acid phosphoribosyl transferase mRNA signal 124 129 -35 region signal 146 151 -10 region rpt 170 185 inverted repeat binding 197 202 ribosome binding site signal 1426 1450 rho-independent transcription termination signal revision 56 57 gc in [2]; cg in [1] revision 191 191 t in [2]; tt in [1] BASE COUNT 348 a 374 c 364 g 404 t ORIGIN 1 tgttgcgtaa tgcgtatgca gaatcttcat cttttcaggt acaaacgcct ttattgctac 61 atttttataa catacagcgc gtaatgccat cgaccagaaa ggtggcatat ggtgtgatcg 121 gggttcaata aattgcgaaa caaggtatac tccagcagtt cctgaagatg tttattgtac 181 taaacgctcc tgtacgagga cgctactgcg cacctatgac acaattcgct tctcctgttc 241 tgcactcgtt gctggataca gatgcttata agttgcatat gcagcaagcc gtgtttcatc 301 actattacga tgtgcatgtc gcggcggagt ttcgttgccg aggtgacgat ctgctgggta 361 tttatgccga tgctattcgt gaacaggttc aggcgatgca gcacctgcgc ctgcaggatg 421 atgaatatca gtggctttct gccctgcctt tctttaaggc cgactatctt aactggttac 481 gcgagttccg ctttaacccg gaacaagtca ccgtgtccaa cgataatggc aagctggata 541 ttcgtttaag cggcccgtgg cgtgaagtca tcctctggga agttcctttg ctggcggtta 601 tcagtgaaat ggtacatcgc tatcgctcac cgcaggccga cgttgcgcaa gccctcgaca 661 cgctggaaag caaattagtc gacttctcgg cgttaaccgc cggtcttgat atgtcgcgct 721 tccatctgat ggattttggc acccgtcgcc gtttttctcg cgaagtacaa gaaaccatcg 781 ttaagcgtct gcaacaggaa tcctggtttg tgggcaccag caactacgat ctggcgcgtc 841 ggctttccct cacgccgatg ggaacacagg cacacgaatg gttccaggca catcagcaaa 901 tcagcccgga tctagccaac agccagcgag ctgcacttgc tgcctggctg gaagagtatc 961 ccgaccaact tggcattgca ttaaccgact gcatcactat ggatgctttc ctgcgtgatt 1021 tcggtgtcga gttcgctagt cggtatcagg gcctgcgtca tgactctggc gacccggttg 1081 aatggggtga aaaagccatt gcacattatg aaaagctggg aattgatcca cagagtaaaa 1141 cgctggtttt ctctgacaat ctggatttac gcaaagcggt tgagctatac cgccacttct 1201 cttcccgcgt gcaattaagt tttggtattg ggactcgcct gacctgcgat atcccccagg 1261 taaaacccct gaatattgtc attaagttgg tagagtgtaa 
cggtaaaccg gtggcgaaac
     1321 tttctgacag ccctggcaaa actatctgcc atgataaagc gtttgttcgg gcgctgcgca
     1381 aagcgttcga ccttccgcat attaaaaaag ccagttaata tcatcaggga gctaatcggc
     1441 tccctttttt tacctttaat tccgaaatct ttcgctgcat ttgcgaattc
//
LOCUS       NEUCCON13    2728 bp ds-DNA   PLN   19-JUL-1990
DEFINITION  N.crassa conidiation-specific protein (con-13) gene, complete cds.
ACCESSION   M35120
KEYWORDS    conidiation-specific protein.
SOURCE      N.crassa (strain 74-OR23-1A) DNA, clone pCon10a.
  ORGANISM  Neurospora crassa
            Eukaryota; Plantae; Thallobionta; Eumycota; Ascomycotina;
            Pyrenomycetes; Sordariales; Sordariaceae.
REFERENCE   1 (bases 1 to 2728)
  AUTHORS   Hager,K.M. and Yanofsky,C.
  TITLE     Genes expressed during conidiation in Neurospora crassa: Molecular
            characterization of con-13
  JOURNAL   Unpublished (1990)
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by K.M.Hager, 12-JUN-1990.
            Author address:K.M.Hager:
            Dept. of Physiology
            UCLA Medical School
            10833 Le Conte Avenue
            Los Angeles, CA 90024-1751
            E-mail:COTRAN%VXBIO.SPAN@STAR.STANFORD.EDU
FEATURES    from  to/span  description
    pept    1009    1275   conidiation-specific protein (con-13), exon 1
            1333    1847   conidiation-specific protein, exon 2
            1910    2150   conidiation-specific protein, exon 3
    pre-msg  922    2367   con-13 mRNA and introns (alt.)
    pre-msg  927    2367   con-13 mRNA and introns (alt.)
    pre-msg  936    2367   con-13 mRNA and introns (alt.)
    pre-msg  946    2367   con-13 mRNA and introns (alt.)
    IVS     1276    1332   con-13 intron A
    IVS     1848    1909   con-13 intron B
    site    2364    2367   polyadenylation site
BASE COUNT  653 a  695 c  720 g  660 t
ORIGIN      Linkage group IV.
1 gatctcatca tctgaaacgc cgcctgagtc aatgactctt ggcaatcggg ctctgcgtcc 61 ggctagatag acagcgtccc actgatacag acttggtaag ctgccacagt tgccaagttt 121 ttatatcgat tattctttga acttccaagg acagtcttca agggcgcttt ctgtctcagc 181 atcgggagat atgacgcccg tggttcgtat accaatggtt cggcactaag gcgctgcatt 241 tgactcggag atattgacgc ctgccccctt ttgagaggag actgagtgag cgaggcccaa 301 tactatcacc acagttgcgg ttagctgccg agacttatcg gtcaacaccg aaatattggc 361 ccagaagggc aacaaaacgg gctgtcgatg gcttgcaacc attgatatcc ctgattgcca 421 ttcctacact accgcccatt cttcattcaa acctgactct cttactccct ttacagtcta 481 gcagatctgg acgtacctgc atgtaatgcg gccaacgggg ctggtaagct gaacacacca 541 ttcggagcgg ctggcaagtc tgtcatgccc gatcgacagc acatgtacta gactatctta 601 agcctagttc cgtgttcaga aacatccggt ttgattgcga atcaacagta cattgatgtt 661 catccaccgg actctaaacc gatcagctaa ttgttggcgg agcggagttc atcgcgggcg 721 taggaaacaa ggttgatgtt acccgtaaat ggaaatcgtg cttcgctcac ggcgttgctc 781 cgaagtaggg tgaagaggtc cgttggctgt gatggtttgc gctggtgtgt gtcaacgctt 841 agtgatgctg gtgatccaac tccgatccaa atgacaaagc aatgcatata agaaggactg 901 ggcatcacca acagcgcaac ggcggcagac acgaagccct agctcgacaa gcagccttca 961 taccccgacc aaaaagtcac acttgtcgta ccgtaacctc gtcgcaagat gccccaggct 1021 catttcttcg cgttgctgct tgcagccgtt gtaccggccg ttttggcgga cggtcccccg 1081 gaatcgatgg gcgagaagtt cagcggcctc aacgttctgg atgggaacgg cggacttcaa 1141 agtttgaccc cgacacccta caccataagt caatggcctt ggggtactgt acccaagctg 1201 tgctatgaca cgtctgtcaa caacaagtac tgcaacccgt acgatctcga agtatacgat 1261 gtcagataca cggatgtagg taaaagactt gcctcggatt cggaacctgt gcttacctta 1321 acttgacaat agtgccccat tcccaccacc gtctgccgat gcaagaactc acctatggcc 1381 atagacacca ttgcgcagcg tgtcggccaa ctccctgtca aggctcgcca gtataatggc 1441 tatgtgtcca gctttgcggg agacatgtgc tcagcctaca gcgatagctt caacaactac 1501 ttctttggcg actgcggcaa ttccgagtcc gtcttcttcc atgagctcag ccacaacctt 1561 gaccgtcacg ttgcaggggc gtccatcaac gattggtact ccctttcgca agactggaag 1621 gataccgttg ccaaggacac ttgcgtcgca gaccactatt ccaaggccag ctggctcgag 1681 gcatatgccc aggtgggagt 
catggctgga tacgatgcta cggtacagtc tatctatacc 1741 caaaatgtcg gctgtatggt caatcaggtc aagaaggtgg ttggacagtt gaacagtgtc 1801 tggcgtaaac agcctgggca gatgtgcgat cgttactgga tcaaggagta agtttctttc 1861 aacaagaccc attttcttga tgaccctgtg ctgaccggaa tgtaaacagc accacggttt 1921 gcatgggacc tgatgcggaa gccagtggcc actgtcaagc atccaaagct gatgtcgcgg 1981 cggagtctgg tggtgtaaac ccagtgttgc cggacgggca gcagaagaag cacgacgcct 2041 tggtcaagga gcttcagcgt cacgccgagg ccgcggccgg catttcttcc ggaaaaccgg 2101 cggccgatag aaagaccaag ggtaagaagg gtaccaaatt cagggtctga agcgggaact 2161 atgatcgatt ccaggtcctg ggctctagct gtgagttcag tcagggtgtt gaggaagttg 2221 cgaggcctca gttgtgagcg acgtcatcaa accgtctcct tttgggataa tgataacctt 2281 ttatttctgg ataactggga caggttaggc tgtctttgtc gatagactag gtacgtaaga 2341 attgatttga tgcttgttcg atgcttttaa gttgttgtcg cttgtggttg cgaggtagtc 2401 ggcaggtttg tttggataga cgggagacgc ccactcgcac ccagggcgat gaataacgaa 2461 ggccgatggc tctttccatg tgggaaatac acaagtctgg cattgtccac ttgtttgtct 2521 tcgagcgggg ttacgatttc tgtcaagccc tttgctcctt tcttccgaga acaaaggaag 2581 ttttcgatcc agatcgccaa catccgaaaa gggaggaata gttcgatcga tgtaccttga 2641 cggctcggcc atcgatctga tctgcatttc ccactctgga ttccagggga agggtcatat 2701 gatggaaacg agatcgaaac ccattgag // LOCUS VVUVVHAB 2237 bp ds-DNA BCT 19-JUL-1990 DEFINITION V.vulnificus cytolysin (vvhA) and vvhB gene (pot.), complete cds. ACCESSION M34670 KEYWORDS cytolysin; cytotoxin; hemolysin; toxin. SOURCE V.vulnificus (strain EDL174) DNA, clone pCVD702. ORGANISM Vibrio vulnificus Prokaryota; Bacteria; Gracilicutes; Scotobacteria; Facultatively anaerobic rods; Vibrionaceae. REFERENCE 1 (bases 1 to 2237) AUTHORS Yamamoto,K., Wright,A.C., Kaper,J.B. and Morris,J.G. TITLE The cytolysin gene of Vibrio vulnificus: Sequence and relationship to Vibrio cholerae El Tor hemolysin JOURNAL Infect. Immun. (1990) In press STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by J.B.Kaper, 29-MAY-1990. 
FEATURES from to/span description pept 745 2160 cytolysin (vvhA) precursor sigp 745 804 cytolysin signal peptide matp 805 2157 cytolysin pept 237 743 pot. cytolysin (vvhB) signal 55 60 -35 region signal 80 85 -10 region signal 87 92 -35 region signal 110 115 -10 region signal 184 189 -35 region signal 206 211 -10 region signal 2185 2219 transcription termination signal binding 54 69 CRP binding site binding 59 74 Fur binding site binding 185 199 Fur binding site binding 226 231 vvhB ribosome binding site binding 730 735 vvhA ribosome binding site BASE COUNT 639 a 498 c 509 g 591 t ORIGIN 1 tatattagat cacttttaaa acaataatag atcagatatt aatctgttga ttttgtgata 61 atgagccaaa aaatactttt attttattta tatgaaatat tttcaggatt attaataaat 121 agccaacagg attttggtgc atatctattc tcaaggacga accaaacaat ctccatacaa 181 atattaatgt tatggagaaa ataacaataa taacccttac tcgtaatgag gaatctatgc 241 ttaataacaa aaatagaaat gtaggacgcc ttaccctact ctgctgtttg tttgcggcga 301 atacttttgc tgatgttcaa attttgggca gcgaaagtga gctttcacaa accattgccg 361 atcagtacca acaaaatgtc acgctgttta acggccagct aaacagtaat gatgtgttgt 421 atgtcaatgt aggaacagca accgatgacg aaatcactca agcaaaaagt catatcatct 481 ccggtagcac cgtggtgatt gatttgactc aaattgctgg tgacgacgca aggcttgatt 541 ggagccaaaa actcactggt ttaggactgt cagcgcctgt tgtggttacg ggggtttatc 601 aaggcgacgc cttagtcaat gcgattgtca gcgatgtcac cgacgagaat gacaacccaa 661 tcaacgatcc ccaagccgag ttagagagcg ttaaactttc tctcactcat gccctagacc 721 gcttccaatc tgagggaaaa taagatgaaa aaaatgactc tgtttaccct ttctctttta 781 cgtaccgcgg tacaggttgg cgcacaagaa tatgtgccga ttgttgagaa acctatttac 841 atcaccagct caaagattaa gtgtgtgttg cacacaagcg gtgatttcaa cgccacacga 901 gactggtgta atgcgggtgc ttccatcgat gttcgcgtca atgtggcaca aatgcgctcg 961 gtacaatcgg caacgtcaga tggttttact cctgacgcca aaattgtccg tttcaccgtc 1021 gatgccgaca agcctggcac gggtattcat ttggttaacg agctacagca agatcacagc 1081 tggttccaga gttgggcaaa ccgccgcact tacattggtc cattcgccag cagttacgac 1141 ctttgggtga aacccgtttc tggttacaca ccgaaaaaag cccgtgacct accgcagaat 1201 
gagaacaaaa actaccaaca ccgcgatact tacggttact ccatcggtat taacggcaaa 1261 gtaggtgcgg aagtgaacaa agacggcccg aaagtgggtg gcgaagtcag tggctcattt 1321 acctacaact actcgaagac cttggtgttt gatacaaaag actatcgcat caacaaccgt 1381 tcatcattga gtgattttga tatttcattc gagcgtgaat ttggggaatg tgatgaactg 1441 cgccgccaag agcttggatg ctatttcacc gccgctcact ggggcagtgg ctgggtattt 1501 gataagacga agttcaaccc tatctcttat tccaacttca aaccgaacta tgacgttttg 1561 tacgaagcgc ccgtgtctga aactggcgta acggattttg agatgggcgt gaaactcaac 1621 tatcgtgcac gctttggtac cgttcttcct tcagcgctgt tttcggttta cggctctgcg 1681 ggctcgtcaa ccaacagcag tactgtgaaa caacgtattc gcatcgactg gaatcaccca 1741 ctgtttgaag cggaacgaca cgttacactg cagtcactga gcaacaacga tctctgcctg 1801 gatgtttatg gtgagaacgg tgacaaaacg gttgcgggtg gttcggttaa cggctggagc 1861 tgtcacggca gttggaacca agtttggggc ctagataaag aagaacgtta tcgtagccga 1921 gtggcatccg atcgttgttt gaccgtaaac gcagacaaaa cgctcacagt cgaacagtgt 1981 ggtgcgaact tagcacagaa atggtattgg gaaggcgata agctcattag ccgctatgtt 2041 gatggcagta atactcgcta ccttctaaac attgttggtg gtcgtaatgt tcaagtaacc 2101 cctgaaaatg aagcaaatca ggcgcgttgg aaacccacat tacaacaagt caaactctag 2161 gctctgttga ccttagcgat atccaaacgc tccctgtata ctagggagcg tttttcttta 2221 ttcgccatct attcgtc // LOCUS CHKMTPEPCK 3571 bp ss-mRNA ORG 19-JUL-1990 DEFINITION Chicken mitochondrial phosphoenolpyruvate carboxykinase (PEPCK-M) mRNA, complete cds. ACCESSION J05419 KEYWORDS phosphoenolpyruvate carboxykinase. SOURCE Chicken 3-day old liver mitochondrion, cDNA to mRNA. ORGANISM Mitochondrion Gallus domesticus Unclassified. REFERENCE 1 (bases 1 to 3571) AUTHORS Weldon,S.L., Rando,A., Matathias,A.S., Hod,Y., Kalonick,P.A., Savon,S., Cook,J.S. and Hanson,R.W. TITLE Mitochondrial phosphoenolpyruvate carboxykinase from the chicken: Comparison of the cDNA and protein sequences with the cytosolic isozyme JOURNAL J. Biol. Chem. 
            265, 7308-7317 (1990)
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable [or printed] sequence for [1]
            kindly submitted by S.L.Weldon, 17-MAY-1990.
FEATURES    from  to/span  description
    pept      28    1950   phosphoenolpyruvate carboxykinase precursor
                           (EC 4.1.1.32)
    sigp      28     126   phosphoenolpyruvate carboxykinase signal peptide
    matp     127    1947   phosphoenolpyruvate carboxykinase
    rpt     2025    2123   large repeat copy A
    rpt     2124    2223   large repeat copy B
    rpt     2224    2316   large repeat copy C
    rpt     2317    2416   large repeat copy D
    rpt     2417    2508   large repeat copy E
    rpt     2519    2543   small repeat copy A
    rpt     2547    2577   small repeat copy B
    rpt     2580    2611   small repeat copy C
    rpt     2745    2777   small repeat copy D
    rpt     2780    2811   small repeat copy E
    rpt     2847    2878   small repeat copy F
    rpt     2040    2050   GCCAAGATGGC 11 bp repeat
    rpt     2105    2115   GCCAAGATGGC 11 bp repeat
    rpt     2205    2215   GCCAAGATGGC 11 bp repeat
    rpt     2298    2308   GCCAAGATGGC 11 bp repeat
    rpt     2072    2082   TCCAAGATGGC 11 bp repeat
    rpt     2139    2149   TCCAAGATGGC 11 bp repeat
    rpt     2265    2275   TCCAAGATGGC 11 bp repeat
    rpt     2332    2342   TCCAAGATGGC 11 bp repeat
    rpt     2424    2434   TCCAAGATGGC 11 bp repeat
    rpt     2524    2534   TCCAAGATGGC 11 bp repeat
    rpt     2558    2568   TCCAAGATGGC 11 bp repeat
    rpt     2657    2667   TCCAAGATGGC 11 bp repeat
    rpt     2691    2701   TCCAAGATGGC 11 bp repeat
    rpt     2724    2734   TCCAAGATGGC 11 bp repeat
    rpt     2758    2768   TCCAAGATGGC 11 bp repeat
    rpt     2792    2802   TCCAAGATGGC 11 bp repeat
    rpt     2825    2835   TCCAAGATGGC 11 bp repeat
    rpt     2859    2869   TCCAAGATGGC 11 bp repeat
    rpt     2926    2936   TCCAAGATGGC 11 bp repeat
    rpt     2983    2993   TCCAAGATGGC 11 bp repeat
    rpt     3023    3033   TCCAAGATGGC 11 bp repeat
    rpt     3057    3067   TCCAAGATGGC 11 bp repeat
    rpt     3114    3124   TCCAAGATGGC 11 bp repeat
    rpt     3234    3244   TCCAAGATGGC 11 bp repeat
BASE COUNT  589 a  1077 c  1197 g  708 t
ORIGIN
        1 tcctcgccta tactgggaca atttataatg ttttggttaa gagggggggc gcagagttgt
       61 aggggggggg aaactgagga cagaatgcag cgcgggatgt ggggcgtggg cctggcccgg
      121 cgcaggctga gcacgtcgct gtcggcgctg ccggcggccg cgcgggattt cgtggaggag
      181 gcggtccggc tgtgcaggcc cagggaggtt ctgctgtgcg atgggtccga
ggaggagggg 241 aaggagctgc tcagagggct gcaggacgac ggggtgctgc atccgctgcc caaatacgac 301 aactgctggt tggctcgcac cgacccccgg gacgtggctc gggtgcaaag caagacggtg 361 ttggtaaccc ccgaacagag cgacgccgtc cccccacccc ccccatccgg gtccccccaa 421 ttggggaact ggatgagccc caatgctttc caggcagctg tgcaggagcg tttccccgga 481 tgcatggcag gccgccccct ctacgtcatc ccattcagca tgggcccccc cacgtccccc 541 ttggccaaac tgggggttca ggtgaccgac tccccctacg tggtgctctc catgcgcatt 601 atgacccgcg tgggccccgc ggtgctgcag cgcctcgacg acgacttcgt ccgctgcctc 661 cactctgtgg ggcggcctct gcccctcacc gagcccctgg tgagctcgtg gccgtgcgac 721 cggtcccgtg tcctggttgc ccacatcccc tcggagcgcc ggatcgtctc cttcggttcg 781 ggatacggcg gcaattcgct gctgggcaag aagtgcttcg cgctggccat cgcgtcccgc 841 atggcccagc agcagggctg gctggccgag cacatgctga ttttgggggt gacgtccccc 901 agcggtgaga agcgttacat ggcggcggcc tttcccagcg cctgcgggaa aaccaacctg 961 gccatgatga cccccagcct gccgggttgg cggatccact gcgttgggga cgacattgcg 1021 tggatgaagt tcgatgatcg agggcgcctc cgcgccatca accccgagcg tggctttttt 1081 ggggtggccc cggggacgtc gtcgcgcacc aaccccaacg ccatggccac catcgcccgc 1141 aacaccatct tcaccaacgt ggggctgcga agcgatggcg gcgtctactg ggacggcctg 1201 gatgagccca cggagcccgg ggtcacctac acctcctggc tgggcaagcc gtggaagcac 1261 ggtgaccccg agccgtgcgc ccaccccaac tcccgtttct gcgccccggc cgatcagtgc 1321 cccattatgg acccgcgttg ggacgacccg gaaggagttc ccatcgacgc catcatcttc 1381 ggggggcgcc gaccccgcgg agtgccgttg gtggtggagg cctttgggtg gcgccacgga 1441 gttttcatgg gcagcgcaat gaggagcgaa gccaccgccg ccgccgagca caaaggcggc 1501 cgtttgatgc acgacccctt cgccatgagc ccctttttcg gctacaacgc ggggcgttac 1561 ctggaacatt ggctgtctac gggtctccgg agcaacgccc gcctcccccg tctgttccac 1621 gttaattggt tcctccgaga taacgaaggt cgcttcgttt ggcccggctt cggtcacaac 1681 gcccgcgtct tggcttggat cttcgggagg atccagggga gggacactgc ccggcccacc 1741 cccatcggtt gggtacccaa agaaggggat ttggacctgg gggggctgcc gggggtcgat 1801 tactcccaac tgttccctat ggagaagggc ttttgggagg aggagtgcag gcagctgagg 1861 gagtattacg gggagaactt cggggccgat ctgcccaggg atgtcatggc ggagctggag 1921 
ggcctggagg agagggtgag gaagatgtga ggggtcgggg tggggctgag ggaaaggatg 1981 gggggaggtt gggggggctg tggggggcga ggtgggggct ggcggtgggg gttggtgagg 2041 ccaagatggc ccatcggtat gggttggccg ttccaagatg gctgccgccg ctatgagttg 2101 gtcagccaag atggccgccg acagtgtggg ttggtgggtc caagatggct gccatcggta 2161 tgggttggcc gttccaagat gctgccgccg ctacgagttg gtcagccaag atggccgccg 2221 acagcgtggg tccaagaagg ccgccatcat tacgggttgg ccgttccaag atggctgccg 2281 ccactacgag ttggtcagcc aagatggcca ccgacagtgt gggttggtgg gtccaagatg 2341 gctgccatca gtatgggttg gccattccaa gatcgtgccg ccgctacgag ttggtcagcc 2401 aagatggctg ccgacagcgt gggtccaaga tggccgccat cattacgggt tggccgttcc 2461 aagatcgtgc cgccactacg agttggtcag ccaagatggc caccgacagc gttggttggt 2521 gggtccaaga tggctgccat cattgtgggt tggccgttcc aagatggccg ccatcactgt 2581 gggttggccg ttccaaggtg gctgccatct ttgtgggtcg gtgggcccat gatggctgcc 2641 atcgtgggtt ggctgttcca agatggctgc cagcagcgtg agatgactgt tccaagatgg 2701 ctgccaccac tatcagttgg ccatccaaga tggccgccaa cagcgtgggt tggtgggtcc 2761 aagatggccg ccatcactgt gggttggccg ttccaagatg gctgccgcca ctatgagttg 2821 gccatccaag atggctgcca gcaggatggg ttggtgggtc caagatggct gccaccataa 2881 tgcattggcc agacaagatg gccaccagca gcatgggatt gccgatccaa gatggccgcc 2941 ctacctggga aggagccccc tgcctgctca ttggctgagc gctccaagat ggctgccatt 3001 ccacgtcctc gttggttgac catccaagat ggctgccacc cccacagagt ggccgatcca 3061 agatggccgc cccgcctggg agggatcctc ctgccctctc attggctgag cgttccaaga 3121 tggctgccat tccacgtcct cattggttga ccatccacga tggctgctgc cttcctctcc 3181 attggctacc catctaagat ggctgctctc ctttgtcctg attggctggc caccccaaga 3241 tggctgctcg tgcccatcct ggctgctcat tggttcctgc agagctgtgg tgcctcccaa 3301 ttggtcgggg ccatttgata gtgggacttc tgggcgccat cttggagtga cgtcacactg 3361 tgagcaacgc tgcgttccta ctggcttgcc gcagcctccc atgaccaatg gctgtgtccg 3421 cttggttgcg aacgccctcg cctaatcaca gcgtcccgtt ggccgagcgg agcgtcctga 3481 ttggccgagc tcttcccctt gtccaaacgg cagcttccca ttggctgtgc tcatctcaat 3541 ggcctatcag agccgcccgt ggacctcaga a // LOCUS HUMPANMU 4139 bp ss-mRNA PRI 
19-JUL-1990
DEFINITION  Human pancreatic mucin mRNA, complete cds.
ACCESSION   J05582
KEYWORDS    pancreatic mucin; tumor-associated antigen.
SOURCE      Human pancreatic tumor cell line HPAF-CD11, cDNA to mRNA.
  ORGANISM  Homo sapiens
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
            Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE   1 (bases 1 to 4139)
  AUTHORS   Lan,M., Batra,S., Qi,W.-N., Metzgar,R. and Hollingsworth,M.
  TITLE     Cloning and sequencing of a human pancreatic tumor mucin
  JOURNAL   J. Biol. Chem. (1990) In press
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by M.A.Hollingsworth, 08-JUN-1990.
FEATURES    from  to/span  description
    pept      74    3841   pancreatic mucin precursor
    sigp      74     136   pancreatic mucin signal peptide
    matp     137    3838   pancreatic mucin
    mRNA   <   1    4139   pancreatic mucin mRNA
    rpt      453    2880   tandem repeat
    rpt      299     452   5' degenerate tandem repeat
    rpt     2881    2957   3' degenerate tandem repeat
    signal  4118    4123   poly-A signal
BASE COUNT  632 a  1910 c  1055 g  542 t
ORIGIN
        1 ccgctccacc tctcaagcag ccagcgcctg cctgaatctg ttctgccccc tccccaccca
       61 tttcaccacc accatgacac cgggcaccca gtctcctttc ttcctgctgc tgctcctcac
      121 agtgcttaca gttgttacag gttctggtca tgcaagctct accccaggtg gagaaaagga
      181 gacttcggct acccagagaa gttcagtgcc cagctctact gagaagaatg ctgtgagtat
      241 gaccagcagc gtactctcca gccacagccc cggttcaggc tcctccacca ctcagggaca
      301 ggatgtcact ctggccccgg ccacggaacc agcttcaggt tcagctgcca cctggggaca
      361 ggatgtcacc tcggtcccag tcaccaggcc agccctgggc tccaccaccc cgccagccca
      421 cgatgtcacc tcagccccgg acaacaagcc agccccgggc tccaccgccc ccccagccca
      481 cggtgtcacc tcggccccgg acaccaggcc ggccccgggc tccaccgccc ccccagccca
      541 cggtgtcacc tcggccccgg acaccaggcc ggccccgggc tccaccgccc ccccagccca
      601 cggtgtcacc tcggccccgg acaccaggcc ggccccgggc tccaccgccc ccccagccca
      661 cggtgtcacc tcggccccgg acaccaggcc ggccccgggc tccaccgccc ccccagccca
      721 cggtgtcacc tcggccccgg acaccaggcc ggccccgggc tccaccgccc ccccagccca
      781 cggtgtcacc tcggccccgg acaccaggcc
ggccccgggc tccaccgccc ccccagccca 841 cggtgtcacc tcggccccgg acaccaggcc ggccccgggc tccaccgccc ccccagccca 901 cggtgtcacc tcggccccgg acaccaggcc ggccccgggc tccaccgccc ccccagccca 961 cggtgtcacc tcggccccgg acaccaggcc ggccccgggc tccaccgccc ccccagccca 1021 cggtgtcacc tcggccccgg acaccaggcc ggccccgggc tccaccgccc ccccagccca 1081 cggtgtcacc tcggccccgg acaccaggcc ggccccgggc tccaccgccc ccccagccca 1141 cggtgtcacc tcggccccgg acaccaggcc ggccccgggc tccaccgccc ccccagccca 1201 cggtgtcacc tcggccccgg acaccaggcc ggccccgggc tccaccgccc ccccagccca 1261 cggtgtcacc tcggccccgg acaccaggcc ggccccgggc tccaccgccc ccccagccca 1321 cggtgtcacc tcggccccgg acaccaggcc ggccccgggc tccaccgccc ccccagccca 1381 cggtgtcacc tcggccccgg acaccaggcc ggccccgggc tccaccgccc ccccagccca 1441 cggtgtcacc tcggccccgg acaccaggcc ggccccgggc tccaccgccc ccccagccca 1501 cggtgtcacc tcggccccgg acaccaggcc ggccccgggc tccaccgccc ccccagccca 1561 cggtgtcacc tcggccccgg acaccaggcc ggccccgggc tccaccgccc ccccagccca 1621 cggtgtcacc tcggccccgg acaccaggcc ggccccgggc tccaccgccc ccccagccca 1681 cggtgtcacc tcggccccgg acaccaggcc ggccccgggc tccaccgccc ccccagccca 1741 cggtgtcacc tcggccccgg acaccaggcc ggccccgggc tccaccgccc ccccagccca 1801 cggtgtcacc tcggccccgg acaccaggcc ggccccgggc tccaccgccc ccccagccca 1861 cggtgtcacc tcggccccgg acaccaggcc ggccccgggc tccaccgccc ccccagccca 1921 cggtgtcacc tcggccccgg acaccaggcc ggccccgggc tccaccgccc ccccagccca 1981 cggtgtcacc tcggccccgg acaccaggcc ggccccgggc tccaccgccc ccccagccca 2041 cggtgtcacc tcggccccgg acaccaggcc ggccccgggc tccaccgccc ccccagccca 2101 cggtgtcacc tcggccccgg acaccaggcc ggccccgggc tccaccgccc ccccagccca 2161 cggtgtcacc tcggccccgg acaccaggcc ggccccgggc tccaccgccc ccccagccca 2221 cggtgtcacc tcggccccgg acaccaggcc ggccccgggc tccaccgccc ccccagccca 2281 cggtgtcacc tcggccccgg acaccaggcc ggccccgggc tccaccgccc ccccagccca 2341 cggtgtcacc tcggccccgg acaccaggcc ggccccgggc tccaccgccc ccccagccca 2401 cggtgtcacc tcggccccgg acaccaggcc ggccccgggc tccaccgccc ccccagccca 2461 cggtgtcacc tcggccccgg acaccaggcc ggccccgggc 
tccaccgccc ccccagccca 2521 cggtgtcacc tcggccccgg acaccaggcc ggccccgggc tccaccgccc ccccagccca 2581 cggtgtcacc tcggccccgg acaccaggcc ggccccgggc tccaccgccc ccccagccca 2641 cggtgtcacc tcggccccgg acaccaggcc ggccccgggc tccaccgccc ccccagccca 2701 cggtgtcacc tcggccccgg acaccaggcc ggccccgggc tccaccgccc ccccagccca 2761 cggtgtcacc tcggccccgg acaccaggcc ggccccgggc tccaccgccc ccccagccca 2821 cggtgtcacc tcggccccgg acaccaggcc ggccccgggc tccaccgccc ccccagccca 2881 tggtgtcacc tcggccccgg acaacaggcc cgccttgggc tccaccgccc ctccagtcca 2941 caatgtcacc tcggcctcag gctctgcatc aggctcagct tctactctgg tgcacaacgg 3001 cacctctgcc agggctacca caaccccagc cagcaagagc actccattct caattcccag 3061 ccaccactct gatactccta ccacccttgc cagccatagc accaagactg atgccagtag 3121 cactcaccat agctcggtac ctcctctcac ctcctccaat cacagcactt ctccccagtt 3181 gtctactggg gtctctttct ttttcctgtc ttttcacatt tcaaacctcc agtttaattc 3241 ctctctggaa gatcccagca ccgactacta ccaagagctg cagagagaca tttctgaaat 3301 gtttttgcag atttataaac aagggggttt tctgggcctc tccaatatta agttcaggcc 3361 aggatctgtg gtggtacaat tgactctggc cttccgagaa ggtaccatca atgtccacga 3421 cgtggagaca cagttcaatc agtataaaac ggaagcagcc tctcgatata acctgacgat 3481 ctcagacgtc agcgtgagtg atgtgccatt tcctttctct gcccagtctg gggctggggt 3541 gccaggctgg ggcatcgcgc tgctggtgct ggtctgtgtt ctggttgcgc tggccattgt 3601 ctatctcatt gccttggctg tctgtcagtg ccgccgaaag aactacgggc agctggacat 3661 ctttccagcc cgggatacct accatcctat gagcgagtac cccacctacc acacccatgg 3721 gcgctatgtg ccccctagca gtaccgatcg tagcccctat gagaaggttt ctgcaggtaa 3781 cggtggcagc agcctctctt acacaaaccc agcagtggca gccgcttctg ccaacttgta 3841 gggcacgtcg ccgctgagct gagtggccag ccagtgccat tccactccac tcaggttctt 3901 caggccagag cccctgcacc ctgtttgggc tggtgagctg ggagttcagg tgggctgctc 3961 acagcctcct tcagaggccc caccaatttc tcggacactt ctcagtgtgt ggaagctcat 4021 gtgggcccct gaggctcatg cctgggaagt gttgtggggg ctcccaggag gactggccca 4081 gagagccctg agatagcggg gatcctgaac tggactgaat aaaacgtggt ctcccactg // LOCUS DOGSRP9A 1271 bp ss-mRNA MAM 19-JUL-1990 DEFINITION 
C.lupus signal recognition particle 9 protein (SRP9) mRNA, complete cds. ACCESSION M34952 KEYWORDS signal recognition particle protein. SOURCE C.lupus (strain Madin-Darby) kidney, cDNA to mRNA. ORGANISM Canis lupus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Carnivora; Caniformia; Canoidea; Canidae. REFERENCE 1 (bases 1 to 1271) AUTHORS Strub,K. and Walter,P. TITLE Assembly of the alu domain of the signal recognition particle (SRP): Dimerization of the two protein components is required for efficient binding to SRP RNA JOURNAL Mol. Cell. Biol. 10, 777-784 (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by K.Strub, 07-JUN-1990. FEATURES from to/span description pept 59 319 signal recognition particle 9 protein BASE COUNT 372 a 232 c 254 g 413 t ORIGIN 1 gcccacctac cacctacctc gggcggccag aaaccgatgc ggggggccca gcggcaagat 61 ggcgcagtac cagacttggg aggagttcag ccgcgcggcc gagaaactct acctcgccga 121 ccctatgaag gcacgtgtgg ttctcaaata taggcattct gatgggagtt tgtgtattaa 181 agtaacagat gatttagttt gtttggtgta tagaacagac caagcccaag atgtaaagaa 241 gattgagaaa ttccacagtc aactaatgcg actcatggta gccaaggaat cccgcagtgt 301 tgccatggaa acggactgac gggtttgaaa tgaagatcct tcatgttctt aggagtaaat 361 atcttttgaa tcagaaaaag tgttgggaaa gaaaatatgt aactaagtgg gctcttcaga 421 agtggggaga tcattttttg tactttgttt tttaatgttt actttagaga gctaggaacg 481 tacatgcttt cggtgaaagc ctttatttat ttttggaaat tcagtaaaag gcagttcttc 541 cttaaattta gttaatctgt ctttaaaaga aaattaaatt taaccatttt gctggattgt 601 tgtatttctt ttggagcata aaatttgtgc tattgatgac caacaaacaa acataaaata 661 tagtaattgg aattacctgt gcacagcagt gtacctatgt ataatatagt aattagtctc 721 agttctatct aaaagtaatc atggaaatga gtatgcttta cctaaaactt ttccaaactt 781 aaactgtatt tttgaatgta aggaatttgt agtatcgtta gcttgttgag cagggacttg 841 ctttaatcta gtttccagtg ctcaaaaaca actgcattta cttgaagtgc atgaacagat 901 gatcactagt ggactgaacc accatattac gcaagtattt gcctgcagat ttcccatcta 961 tattttctca 
gaagggctaa agattatttg aactgttaaa tctttgccat atgtctgtgc 1021 cactcctgcc tgtttctccc tgtacttaac caaggtgttg aacatgactg tcacaactgt 1081 tagttaaatc tttgcatatg tctgtgccac tcctgcctgt ttctccctgt acttaaccaa 1141 ggtgttgaac atgactgtca caactgttat ttttttcatt aagtcagaag gatatcattt 1201 gatatttatc atataattgt aacctcagtt ttaccatctc aatgtaatgt tcacatgttg 1261 ttcctacatt a // LOCUS PCHPMMMSA 6409 bp ss-mRNA INV 19-JUL-1990 DEFINITION P.chabaudi major merozoite surface antigen mRNA, complete cds. ACCESSION M34947 KEYWORDS major merozoite surface antigen; surface antigen. SOURCE P.chabaudi chabaudi (strain IPP-C1), cDNA to mRNA, and DNA, clone IPP-C1/C. ORGANISM Plasmodium chabaudi Eukaryota; Animalia; Protozoa; Microspora; Microsporea; Microsporida; Haemosporina; Plasmodiidae. REFERENCE 1 (bases 1 to 6409) AUTHORS Deleersnijder,W., Hendrix,D., Bendahman,N., Hanegreefs,J., Brijs,L., Hamers-Casterman,C. and Hamers,R. TITLE Molecular cloning and sequence analysis of the gene encoding the major surface antigen of Plasmodium chabaudi chabaudi JOURNAL Mol. Biochem. Parasitol. (1990) In press STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by W.Deleersnijder, 06-JUN-1990. 
FEATURES from to/span description pept 667 6024 major merozoite surface antigen precursor sigp 667 723 major merozoite surface antigen signal peptide matp 724 6021 major merozoite surface antigen rpt 324 365 tandem repeat region BASE COUNT 2681 a 991 c 922 g 1814 t 1 others ORIGIN 1 tctagataat atattttttg tatgcatgct aaaattaatt atacatatat taaatagatt 61 tgtgcgaatc tttatgtgtg caagttattt tttttaataa taattatcca tataccacat 121 tatttatttg tgtaccgtta aatatttatt ttctaagcga tttttctcct taaattatat 181 tttttntgat catttttttt ttttttttgg aaaatcggga gcataaaaaa tatatattac 241 actttataaa ttttttatac acatttgttt attttatttt atatatattt tttaacacat 301 ttttattttg aaatgatatg atcaattata aaaaaacaat aacataataa tagtaataat 361 ttttttttgt acgatatata aaattatgca tttttatttt tatagtaagt taaaaagtgt 421 attatatgta cgtattttgt ttaacagaac ggaaattaga aaaaacacaa taaaacttat 481 atatatatgt gtaattagtg tatgtgtata tatttgtcaa cattataaat gatataattg 541 aacttcaata tttattttta cacaaattag tactaatata aaatgcaaaa gtaatgtacc 601 tttgtgtgta ttaattttag cattataatt tattccactc tgtatattag ttaagtttcg 661 ttgaaaatga aggcgatcgg acttttgttt tctttcgttt tttttgctat atattgcaaa 721 tctgaaacaa taggagttta caatgatctc gttcataagt tagaaaagtt agaagaatta 781 tcagtagaag gattagaact atttcaaaaa agtcaagtaa ttgtaaatgc acaatcacca 841 gaaacacctg ttgatccatt tacaaaccct gaatttgcac aaaagttaca accatttatt 901 ttaaaatttg aagaattagg atttacagaa caaacagagt tagtcaattt aataaaaact 961 ttaggcccaa ataaatatgg actaaaatat ttaattgaaa gtaaagaaga atttaacgaa 1021 ttaatgcacg caataaattt ttactatgac gtgcttagag ataaattgaa tgatatgtgt 1081 gcaaataact attgtgaaat tcctgaacat cttaaaatta atgttgaaga aatcgaaatg 1141 cttaagaaag ttgtcttagg ctatagaaaa ccaattgaaa atattcaaga tgatcttgta 1201 aaattagaag aatatattgc aagaaataaa gcaactgctg aaaccttaaa cactcttatt 1261 actgaagaaa caaaaaaaat aacacctgaa gaagaaacag attgcaacga tactaattgc 1321 gacaatacta aatatggaaa gaaaaaagca atatatcaag ctatgtacaa tgttatattt 1381 tacaaaaagc aattagctga aataaaaaaa gtcatcgaag tcttagaaaa gagagttgct 1441 acattaaaga agaacgaagc 
cataaaacca ttgttacaac aaatcgaagc tatcagaggt 1501 ccacctgctg tcactgaagg acaaatagct acagaaggaa gcagcgaaga aacaaaacaa 1561 aatagtacag aatcatctaa cacaaaaacg actactactg acaaagctgt tacaacccaa 1621 accgctacta aagcaactgg tacagaaaca aatactggta cagaaacaaa tactggcaca 1681 gaaacaaata ctgccacagg aacaactact gccacaggaa caactactgc cacaggaaca 1741 cctactgtca ctgaaccagt tcaagtgcca gccgttcaag ttcttacaga agaagaaaaa 1801 gcaaaaaaaa tagctgaact ttatgctcaa attaaagaaa ttgcaaaaac tataaaattc 1861 aatttagacg gaatatttgt cgatccagtt gaattagaat attacaaaaa agaaaaaaaa 1921 aatgaaagct gccattcaac ttcatcttgc cacaaaaata aaacacctga aactgtaata 1981 ccattaaatg tacgttatcc aaatggtatt agctacccat taactgaaga agttgtttac 2041 agcaaaattg ctcataatgc cgctgaaaca acttatggtg atttaacaaa tgtcgataat 2101 acagccataa cagaagattt aaccacaaat gaacaagcaa gaaaaaattt aattaaagct 2161 attaaaaaga aaatcgaagc agaagaacaa aaattagtag aattaaaaga tgattatgat 2221 actaaacttg cagcatttaa tggacaaaaa actccattca aagaagcagc taaaaaattt 2281 tatgaatcca aatttagaaa taaattgact actgacattt ttgacgattt taaaacaaaa 2341 agaactgaat atatgaacaa gaaagctgca ttagtaggtt gtgaatatgg aaatactcaa 2401 caactcatta ataaattaaa taaacaactt aattatttac aagattatgg attaagaaaa 2461 gaaatagtta acactgaaat tgaatatttt tcaaacaaaa aatcagaatt acaatataat 2521 attaatagat tagcaaatgc tgttcaagca aaacaaaata tattagttgc atcaaaacat 2581 attccacttt caacacttgt tgaattacaa atccaaaaat ctttattaac aaaactaatt 2641 gaacaattaa ataaaactga attttcttta aataaagctc acttaaaaga caagatatac 2701 gttccacaaa catatggtaa agaaggaaaa ccagaaccat actacttaat agctataaaa 2761 aaagaaattg acagacttgc caaatttatt cctaaaattg atgatatgat tgagaaagag 2821 aaacaaaaaa tggaacaaga acatgtagct accggagaat ctgaacaagc ctcttctgcc 2881 tctggtactg gatcatccac agaaaccaca tcacaaacag caccagccgt tccagctgca 2941 cccgcaccag cagaaaaggc aaaagaagga acagaatcaa cagaagaaac cccagcagca 3001 tcaaaaccag ccgaaggtgc agcatccaca ggtgcaacca ccccaacaga acaagaagct 3061 gcaccaacag aacaagaagc acaacctgca gcacctgaaa caccagcaga ggtaccagca 3121 ccaaccacgc ctgcagctcc agcaactcca 
gccgcaccag cagcacccgc aaaaccagtt 3181 atgacaaaat tatattacct tgaaaaatta aagaaatttt tagcattctc atatgcatgc 3241 cataaatatg ttttattaca aaactctacc ataaacaaag atgctttaag caaatatgct 3301 cttacaccag aagaagataa aataagaaca ttaaagagat gcagtgaatt agatgtatta 3361 ttagctattc aaaataatat gcctactatg tattcacttt atgaaaatgt agttgatggt 3421 ttacaaaaca tttacactga attatatgaa aaagaaatga tgtatcatat atataactta 3481 aaagataaaa acccagctgt taaagcttta ttagtaaaag ctggcgtcat tgatccagaa 3541 ccagtagccc caacaccagc agtaccagca ccagaaactg caccagaaac tgcaccagaa 3601 actgcaccag aaacaccagc acaagaagct ccacaacaac cagaatcggc acaagcacca 3661 gaagcagcaa ctgaaacaac aacaccagcc gaatcggcat caacagaacc aacaccaaaa 3721 gcacctacag caacacccac atctgaaaca gtaacacaag aaggaacaac accagcagca 3781 ccaaaagcac aagaaggagc atcatcatca gcaccagcac aaccagcccc agcaaaacca 3841 gcacctgcac aaacagtaac agggcaatca acaaacgttg aaggaagtac tcaagtaaga 3901 gcagaaagtg aagacgaaat gtttgtcgat gattttgaag tagacaattt ttacaaatct 3961 tacttacaac aagttgatgg aaataatact caattcatag attttataaa atctaaaaaa 4021 gaattaatca atgcattgac ccctgaaaaa gttaaccaat tatatcttga tattgcacac 4081 ttaaaggaat tatcagaaca ttactataat cgttattata aatataaatt aaaattagaa 4141 agattatatc aaaaacatga acaaattgaa gcagctaacc aaaaagttaa agaaattagc 4201 gtattaaaat cccgattatt aaaaagaaaa aaatatatta atggtacatt ttatgtatta 4261 tctggttttg caaatttctt taacaagaga agagaagctg aaaagcaata tgtagataac 4321 gcaataaaaa atactgatat gttattgaaa tactacaaag ctcgtagtaa atattttact 4381 tctgaagctg ttcctttaaa aacattaact aaaacatcaa ttgacagaga agccaactac 4441 ttgaaaatcg aaaaattcag agcatacagc cgattagaat taagattaaa gaaaaatatt 4501 aacttaggaa aagaaagaat tacatatgta tctggtggtt tacaccatgt atttgaagaa 4561 tttaaagaac ttttaaaaaa taaaggttat accggaaaaa ctaaccctga aaatgctcct 4621 gaagttatca aggcattcga acaatataaa gaattacttc caaagggagc aacaactcca 4681 gctccagtag ttgcacctgt agttgctcca gccccagcta cagcagcccc agcagctgac 4741 gcaccagtac cagcagccgc agccgcagcc gcatcaggat caggatcagc agccacaaca 4801 gaaggagaag ccgctacaac agtagttgca agcagcgata 
atgatgatga tgacgatgat 4861 gatatggatc aaattgcaaa tgctcaatcc acagacgaag aagtaaaaga tattcttgat 4921 gcatttaaaa gtgaaaatga atatatatac acaaagagct taggtaacac atataaatca 4981 tttaaaaaac acatgttaaa agaattttca atgattaaag aagacataat gactggatta 5041 aactataaat tagaaaaaag aaatgatttc cttgatgtat taagctatga attagcttta 5101 ttcaaagata taaataccaa caaatttgtt gttaaaaacc cataccaatt attagataat 5161 gataagaaag acaaacaaat gataaactta aaatatgcca ttaaaggtgt aactgaagat 5221 atcgaaacag ctactgatgg aattgaattc tttaacaaaa tgattgaatt atacaaacct 5281 caattaaacg cagttaatga acaaattgct gccataggaa cagaacctac cgatgccgaa 5341 aaaaagaaat acgctccaat ctttgaagat cttaaaggat tatatgaaac catattgaac 5401 ggagcagaag aattttcaga attattacaa cacaaacttg aaaactataa aattgaaaaa 5461 gctggatttg acattttaat ggcaaattta gaaacataca taagaattga cgaaaaactt 5521 gaagacttcg tagaaagtgc agaaaaaaat aaacacattg cctcaatagc tttaaataac 5581 ttaaacaaat ctggtttagt aactgaaggt gaatcaaaga aaatattagc aaaaatgctt 5641 aacatggatg ccatggattt attaggtata ggttctaatc atgtatgtat tagtacaagt 5701 actcctgaca atgctggatg ctttagatat gatgatggta cagaagaatg gagatgttta 5761 ttaggtttca aaaaagatga tgatggtaat agatgtgtag cagatgatgc tcctgtttgt 5821 aataacaaca atggtggatg tgataaaaat gctgattgta gagaagtaga aaatacagat 5881 agggatcctt ccaaaaaaat tgtatgtact tgtaaagaac caaacccaaa tgcatattat 5941 gctggtgtat tctgtagttc ttccggattt atgggattat caattttatt gatcatcaca 6001 ttaattgtat ttaatttatt ttaaataaat gattaaaata tttgttgcat tttatatttt 6061 tcctatatat attttaaaag ttgtataata catttgaaat atatattttg gcataaattg 6121 tatatttttt attatataaa aaaatatata tatataattt ttaataaaca tttttaaata 6181 aacgtacatg tgttttagta taggaaattt tgtatgactt taaaatatga tgatactatt 6241 ttttttaaat gtatagtaaa ttaatttatt tttatttttt atacaatata ttgtatgtgt 6301 gttctttatt actattattt tataagtata taaaataaag ctattttttt ttttttttta 6361 acttcaaaca tatttagtaa cttttttatt taaagaatag ccggaattc // LOCUS SHPMHCA 588 bp ss-mRNA MAM 19-JUL-1990 DEFINITION Sheep MHC class I protein gene, 3' end. 
ACCESSION M34672 KEYWORDS cell surface antigen; cell surface glycoprotein; class I gene; integral membrane protein; major histocompatibility complex. SOURCE Sheep 8-week old, cDNA to mRNA, clone SC17. ORGANISM Ovis aries Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Artiodactyla; Ruminantia; Pecora; Bovidae. REFERENCE 1 (bases 1 to 588) AUTHORS Grossberger,D., Hein,W. and Marcuz,A. TITLE Class I major histocompatibility complex cDNA clones from sheep thymus: Alternative splicing could make a long cytoplasmic tail JOURNAL Immunogenetics (1990) In press STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by D.Grossberger, 30-MAY-1990. FEATURES from to/span description pept < 1 287 MHC protein (AA at 3) BASE COUNT 120 a 161 c 157 g 150 t ORIGIN 1 ccaggaagtg ggcggccctg gtggttcctt ctggagagga gcacacatac acgtgccgtg 61 tgcagcacga ggggcttcag gagcctaccc tgagatggga acctcctcag acctccttcc 121 tcaccattgg catcattggt ctggatctcc tcgtggttgc tgtggtggct ggagctgtga 181 gctggatgaa gaagctctca ggtgaaaaaa gacggacgta cacacaggct gcaagcagtg 241 acagtgccca gggctctgat gtgtctctca cggtccctaa agtgtgaaac gctgccttgt 301 gggactgagt gatgctgcat cccgcaatgt gacgtcagat cctggacccc tctttctcgg 361 ctgcatccga atgtgtctgt gctcctagta gcataacatg aggagctggg gagactggtc 421 acccctgccc accacacccc cttctccgct gacctgtgtt ctcctccctg atacactgtc 481 ctgttccagc agagacaggg ctgggccgtg tcatcgctgt ctttgcttca tatgcactta 541 gtaatgatgt cttatttcat ctttgaaaat aaaatctgta tatatatc // LOCUS SHPMHCB 841 bp ss-mRNA MAM 19-JUL-1990 DEFINITION Sheep MHC class I protein gene, 3' end. ACCESSION M34673 KEYWORDS cell surface antigen; cell surface glycoprotein; class I gene; integral membrane protein; major histocompatibility complex. SOURCE Sheep 8-week old, cDNA to mRNA, clone SCI89. ORGANISM Ovis aries Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Artiodactyla; Ruminantia; Pecora; Bovidae. 
REFERENCE 1 (bases 1 to 841) AUTHORS Grossberger,D., Hein,W. and Marcuz,A. TITLE Class I major histocompatibility complex cDNA clones from sheep thymus: Alternative splicing could make a long cytoplasmic tail JOURNAL Immunogenetics (1990) In press STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by D.Grossberger, 30-MAY-1990. FEATURES from to/span description pept < 1 543 MHC protein (AA at 1) BASE COUNT 179 a 235 c 238 g 189 t ORIGIN 1 gaggactacc tggagggccg gtgcgtggag tggctccgca gatacctgga gaccgggaag 61 gacacgctgc tgccgcagac ccttccaaag gcacatgtga cccgacaccc catctctgag 121 cgtgaggtac ccttgaggtg ctgggccctg ggcttctacc ctgaggagat ctcactgacc 181 tggcagcgca atggggagga ccagacccag gacatggagc tcgtggagac caggccttca 241 ggagatggaa ccttccagaa gagggcggcc ctggtggtgc cttctgaaga ggagcagaga 301 tacacgtgcc atgtgcagca cgaggggctt caggagctca ccctgagatg ggaacctcct 361 cagacctcct tcctcaccaa gggcatcatt gttggcctgg ttctcctcgt gctggctgtg 421 gtggctggag ctgtgatctg gaggaagaag tgctcaggtg aaaaaagagg cacctatacc 481 caggcttcaa acaatgacat gtgcccaggc tctgatgtgt ctctcacagt tcctaaagtg 541 tgagacgctg ccttgtggga ctgagtgatg ctgtatccca ctatgtgatg tcagatccct 601 gacccctctt tctgcagctg catctgaacg ttgtctgtgc tccatgtagc ataacgtgag 661 gagctgggga gattggtcac ccctgcccac cacaccccct cccgcctgga cctatgtctc 721 ctccctgata cactgtccta atccagcaga gagggcctgg ctgtctccat ccctgtcttg 781 cttcatgtgc actgagtaat gatgtcttat acccttattg aaaataaaat ctgtatatat 841 g // LOCUS SHPMHCC 995 bp ss-mRNA MAM 19-JUL-1990 DEFINITION Sheep MHC class I protein gene, 3' end. ACCESSION M34674 KEYWORDS cell surface antigen; cell surface glycoprotein; class I gene; integral membrane protein; major histocompatibility complex. SOURCE Sheep 8-week old, cDNA to mRNA, clone PSCI16. ORGANISM Ovis aries Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Artiodactyla; Ruminantia; Pecora; Bovidae. REFERENCE 1 (bases 1 to 995) AUTHORS Grossberger,D., Hein,W. and Marcuz,A. 
TITLE Class I major histocompatibility complex cDNA clones from sheep thymus: Alternative splicing could make a long cytoplasmic tail JOURNAL Immunogenetics (1990) In press STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by D.Grossberger, 30-MAY-1990. FEATURES from to/span description pept < 1 537 MHC protein (AA at 1) site 466 626 unspliced intron BASE COUNT 203 a 248 c 288 g 254 t 2 others ORIGIN 1 gaccctccaa aggcacatgt ggcccatcac cccatctctg accgtgaggt caccctgagg 61 tgctgggccc tgggcttcta ccctgaggag atctcactga cctggcagcg tgacggggag 121 gaccagactc aggacatgga gtttgtggag accaggcctt caggggatgg aaccttccag 181 aagtgggcgg ccctggtggt gccttctgga gaggagcaga gatacacgtg ccgtgtgcag 241 cacgaggggc ttcaggagcc cctcaccctg agatgggaat ctcctcagcc ctccgtcctc 301 accatgggca tcattgttgg cctggttctc ctcgtggtgg ctgtggtggc tggagctgtg 361 atctggatga agaagcgctc aggtgaaaaa ggacggatct acacccaggc tgcaagcatg 421 tacagtgccc agggctctga tgtgtctctc acggttccta aaggtgaggc cctggagtgt 481 ctagattgga aggagcattg gggcagaggg gacacactgg gtggcggggg tctctgagtg 541 ggacatgtga gcatgtcggg ggctgtggag aatatcagcc cttacatgac tgacctgaac 601 tggctcctga ttcttttctc tcacagtgtg agacagctgc cttgtgggga ctgagtgatg 661 cttggtccca ctttgtgatg tcagatcgcc ggacccctct ttcttcagct gcatctgaat 721 gtgtctgtgc tcctattagc ataacatgag aagttgggga gactggtcac ccttgcccac 781 tgtacgctgt ccccaccctg acctgtgttc tcctccctga tccaccatcc tgttcagcga 841 gacgggctgg gccatcttca ttgctatctt tgcttcacat gcactgagta atgatgtctt 901 atttccttat tgaaaataaa ttctgtatat atatgaatct attttttcta attggtgcca 961 tgaaagggnn ttggataata aaatgagaat tcgat // LOCUS SHPMHCD 1050 bp ss-mRNA MAM 19-JUL-1990 DEFINITION Sheep MHC class I protein gene, 3' end. ACCESSION M34675 KEYWORDS cell surface antigen; cell surface glycoprotein; class I gene; integral membrane protein; major histocompatibility complex. SOURCE Sheep 8-week old, cDNA to mRNA, clone PSCI11. 
ORGANISM Ovis aries Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Artiodactyla; Ruminantia; Pecora; Bovidae. REFERENCE 1 (bases 1 to 1050) AUTHORS Grossberger,D., Hein,W. and Marcuz,A. TITLE Class I major histocompatibility complex cDNA clones from sheep thymus: Alternative splicing could make a long cytoplasmic tail JOURNAL Immunogenetics (1990) In press STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by D.Grossberger, 30-MAY-1990. FEATURES from to/span description pept < 1 747 MHC protein (AA at 1) BASE COUNT 216 a 292 c 315 g 227 t ORIGIN 1 ggcgggtctc acaccatcca ggcgatgtac ggctgcgaag tgggacctga cgggcgtctc 61 ctccgcgggt atgagcagtt cgcctacgaa ggcagagatt acctcgccct gaacgaggac 121 ctgcgctcct ggaccgcggc ggacacggcg gctcagatca ccaagcgcaa gtgggaggcg 181 gcaggtgagg cggcgcgtgt gaggatctac ctggagggca cgtgcgtgga gtggctccgc 241 agacacctgg agaccgggaa ggacacgctg ctgcccgcag accctccaaa ggcacatgtg 301 acccaacacc ccatcactga gcgtgaggtc accctgaggt gctgggcctt gggcttctac 361 cctgaggaga tctcactaac ctggcagcac aatgaggagg accagaccca ggacatggag 421 cttgtgaaga ccaggccttc aggggatgga accttccaga agtgggcagc cctggtggtg 481 ccttctggaa aggagcagag atacacgtgc cgtgtgcagc acgaggggct tcaggagccc 541 ctcaccctga gatgggcacc tcctcagacc tccttcctca ccatgggcat cattgttggc 601 ctggttctcc tcgtggtgac tgtggtggct ggagctgtga tctggaggaa gaagcgctca 661 ggtgaaaaaa gacagaccta tacccaggct gcaagcagtg acagtgccca gggctctgat 721 gtgtctctta tggttcctaa agtgtgagac agctgccttg tggggactga gtgatgcttg 781 gtcccattct gtgacatcag atcttgggac ccctctttct gcaggggcat ctgaatgtgt 841 ctgtgctcct attagtataa catgaggagt tggggagact ggtcacccct gcccactgca 901 caccgtcccc accctgacct gtgttctcct tcctgatcca ctgtcctgtt gcagcagaga 961 cgcctgggcc ctctccatca ctgtctttgc ttcatatgca ctgagtaatg atgtgttatt 1021 tcctttttga aaataaaatc tgtatatatg // LOCUS SHPMHCE 1396 bp ss-mRNA MAM 19-JUL-1990 DEFINITION Sheep MHC class I protein gene, complete cds. 
ACCESSION M34676 KEYWORDS cell surface antigen; cell surface glycoprotein; class I gene; integral membrane protein; major histocompatibility complex. SOURCE Sheep 8-week old, cDNA to mRNA, clone PSCI12. ORGANISM Ovis aries Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Artiodactyla; Ruminantia; Pecora; Bovidae. REFERENCE 1 (bases 1 to 1396) AUTHORS Grossberger,D., Hein,W. and Marcuz,A. TITLE Class I major histocompatibility complex cDNA clones from sheep thymus: Alternative splicing could make a long cytoplasmic tail JOURNAL Immunogenetics (1990) In press STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by D.Grossberger, 30-MAY-1990. FEATURES from to/span description pept 4 1110 MHC protein precursor sigp 4 89 MHC protein signal peptide matp 90 1107 MHC protein BASE COUNT 278 a 403 c 434 g 281 t ORIGIN 1 cccatgacca gaggattgcg agtaatgggg ccgcgaaccc tcctgttgct gctctcggga 61 gtcctggtcc tgaccgagat ccgggcgggc ccccactcca tgaggtattt cagcaccgcc 121 gtgtcccgcg ccggcgccgg ggagccccgg tacctggaag tcggctacgt ggacgacacg 181 cagttcgtgc ggttcgacag cgacgccccg gatccgaaga tggagcagag ggagccgtgg 241 atgaagcagg tggggccgga gtattgggat cggaacacgc gaaatcccaa gggcaacgca 301 cagactttcc gagtgggcct gaccatcctg cgcggctact acaaccagag cgagaccggg 361 tctcacacct ggcagtgtat gtacggctgc gacgtggggc cggacgggcg tctcctccgc 421 gggttcatgc agttcggcta cgacggcaga gattacatcg ccctgaacga ggacctgcgc 481 tcctggaccg cggcggacac ggcggctcag gtcacccagc gcaagtggga gaaggaaggt 541 gcggcggacc actacaggaa ctacgtggag ggcacgtgcg tggagtgcgt gcgcagatac 601 ctggagatcg ggaaggaaca gctgcagcga gcagaccctc caaaggcaca tgtgacccat 661 caccccatct ctggccatga tgtcaccctg aggtgctggg ccctgggctt ctaccctgag 721 gagatctcac tgacctggca gcgcaatggg gaggaccagt tgcaggacat ggagcttgtg 781 gagactaggc cttcagggga tggaaccttc cagaagtggg cggcccttgg tggtgcttct 841 ggagaggagc agagatacac gtgccatgtg cagcatgagg ggcttcagga gcccctcacc 901 ctgagatggg aacctcctca gacctccttc ctcacttcct 
caatgggcat cattgttggc 961 ctggttctcc tcgtcatggt ggctgtggtg gctgcagctg tgatctggag gaagaagtgc 1021 tcaggtgaaa aaagagggac ctatacccag gcttcaagca atgacagtgc ccagggttct 1081 gatgtgtctc tcacggttca taaagtgtga gacagtgatg ctgcatcccg ctatgtgcca 1141 tcagatcccc ggacccctct ttctgaagct gcatctgcac gtgtctgtgc tcctagtagc 1201 ataacgtgag gagttgggga gaccgttcac ccctgcccac cgcgccccct cctgccctga 1261 cctgtgttct cctccctgat ccactgtcct gttccagcag cagacagggc tgggccgtct 1321 ccatccctgt ctttgcttcg tatgcactga gtaatgatgt cttatttcct tattgaaaat 1381 aaaatctgta tgtatg // LOCUS YSPNMT1A 3787 bp ds-DNA PLN 19-JUL-1990 DEFINITION S.pombe no message in thiamine protein (nmt1) gene, complete cds. ACCESSION J05493 KEYWORDS . SOURCE S.pombe DNA. ORGANISM Schizosaccharomyces pombe Eukaryota; Plantae; Thallobionta; Eumycota; Hemiascomycetes; Endomycetales; Saccharomycetaceae. REFERENCE 1 (bases 1 to 3787) AUTHORS Maundrell,K. TITLE nmt1 of fission yeast: A highly transcribed gene completely repressed by thiamine JOURNAL J. Biol. Chem. 265, 10857-10864 (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by K.Maundrell, 07-JUN-1990. 
FEATURES from to/span description pept 1499 2539 no message in thiamine protein (nmt1) mRNA 1430 2681 nmt1 mRNA signal 1396 1403 TATA box BASE COUNT 1218 a 657 c 690 g 1222 t ORIGIN 1 ggctcattta taatctagca ctttatacct tttacctgac tgttgggttg tttatctgac 61 ctcataaaag aaagtgtcgt tttggaaaaa ttagcattac attgagggtc ctccgctaat 121 gctcctgcga aaaatgattt taattttgga tgttttttca gaaataaaat gaaaattagc 181 ttgatataat atcaaccggc agcgagtaat agatttaaaa taaatttgat taattaaaaa 241 aatttgttgt tttaagcaag ccattttgct aaaatcaaag gtaatggaag agtatttccg 301 aaaaatctca acacatgtga atgatcagaa aattatcgcc ataaaagaca gaataagtca 361 tcagcggttg tttcatttcc tatatttttt ttttattttt ttatttttta ataagggaaa 421 atttaacgtc taaggataca gaagattgtt agcacattaa agtaataaag gcttaagtag 481 taagtgcctt agcatgttat tgtatttcaa aggacataat ctaaaataat aacaatatca 541 tttctcacaa gttattcaat tttctttttt ttttctaata atatcaagaa tgtattattt 601 gtttgacata agtcaactaa tttatttaat atgctggatt aatcttgcag acatgtaaat 661 taacaagttt tagtcaaata acgttgaagt ttcaatgaac tcaaataatt tctctttttt 721 tttatataac catatgtcta atctgattta tattttccgc aggatcaact gaagttatga 781 catttggatt ggatcactta taaccttggt cgccaaataa tacaaaaatc agcgttataa 841 aacaaagaag gtttttgtta agaaattaat cctctttctt gataagaaag ttgaaccgaa 901 attgcagata ctgatatatg aaaataatac ccacaatttt gggaatagcg caagcctcaa 961 tttaaacaat aggtgaggac acatgataat gacctcaatg attgttagaa gaaaagagcc 1021 tcattacaaa atcgaaaaat gaatggttgg gtacaagttt ccaaaacatg gtaaagtgga 1081 ctttgcgtat gagacgtaaa tagaaaaaaa cacttgttat atgttttcta gaattattgt 1141 tgtctcttta tggttggatg atgcaaaata gtaatttcgg ttagttgctg taaaacacca 1201 cgagacaaat agatatggat atttattaaa tcaggaaaaa cgtaactctc ggctactgga 1261 tggttcagtc acccaacgat tactggggag agaaaacagg gcaaaagcaa agcttaaagg 1321 aatccgattg tcattcggca atgtgcagcg aaactaaaaa ccggataatg gacctgttaa 1381 tcgaaacatt gaagatatat aaaggaagag gaatcctggc atatcatcaa ttgaataagt 1441 tgaattaatt atttcaatct cattctcact ttctgactta tagtcgcttt gttaaatcat 1501 gtctactaac aagatcactt tcctcacaaa ctgggaggcc actccttacc 
atttgcccat 1561 ctttcttgct caaactcgcg gatactatga gcgtgaaggg attgaggttg ctattctcga 1621 gcctaccaac ccttccgacg ttacagcatt gattggttct ggtaaggttg acatgggatt 1681 aaaggccatg atccatactt tagctgctaa ggctcgcgga taccctgtca ccagttttgg 1741 atctttgtta aatgagcctt tcactggctt aattactttg aagggtaatg gcatcaacga 1801 cttcaaggac attaaaggaa agcgtattgg ctacgttggt gagtttggaa agatccaact 1861 cgatgacttg tgcagcaagt tcggtttgtc tccttctgat tatactgcta ttcgctgtgg 1921 tatgaacatt gcccctgcca tcatcaatgg tgaaatcgat ggcggcattg gcattgaatg 1981 catgcaacaa gtcgagcttg agcgctggtg cgtctcccaa ggccgcccaa ggtctgatgt 2041 ccaaatgttg cgtattgatc gattagccaa cttaggttgc tgctgtttct gtaccatttt 2101 gtatattgca catgatgaat tcattgctaa acatcccgac aagatcaagg ccttcttacg 2161 tgctatccat tctgctactt tggatatgct taaagatcct gtccaaacct acaaggagta 2221 cattcacttc aagcgtgaaa tgggatccga acttcatcgg gaacaatttg aacgttgctt 2281 tgcatatttc tcacatgaca tctctaacgt ccccagagat tggaacaagg ttaccaatta 2341 ttccaagcgt ttgggcatca tcccccaaga ttttgagccc aactgtacta acggttactt 2401 gacctgggaa cttgaccccg atgagaagga tcccatgggc aaacaagaag ccattgccga 2461 gatccaagat gaaattaagc aaaagggagg tgtcttcagc ggcaactcac ttcgttatgt 2521 cgagcctgcc aacctttaaa aggaatgtct cccttgccag tactgctagg gtttttcttt 2581 caaactatgg aagcccattc aagctgcata ttacgatttt gtttttcgct tttagaaagt 2641 ggtttagatg agataataga aaaattcttg atctccgaca acgagtactt ttattttttt 2701 tgctaatcac tttactcaat attagctcga aatcgtagaa acgtagacgg gtgcgggata 2761 ccgagtggtg tagttaagaa tttttataaa ccacgtggcc caaaaatatg aacccaaaac 2821 gtttatacat gagtatactt taagaaggct ataccccttc gtgttagatg tagttttagc 2881 tacccaaccc gagtctatga gcttgacttc agatgtagaa ggcattaaat cgttttgaat 2941 attaattaaa aaacgatgaa aattaaatat ttaaaagcaa tcatacgctg aaaatttagt 3001 gctgtggcta atccttcaac atggaaatgc cataaaagtg actttgacaa aaaaaaaagt 3061 atatacaggt agtaaactca tctacttcat tgactttgtt tacagcatgt ggaaggagga 3121 atatttattg ctaaatcgta gtttaacatt caataagtaa tactattgaa attcgacaag 3181 attggccgca tggatgaaaa agaggcattt tgctttggga gaattagttc aaattagaac 
3241 tgaaaaaaaa aactttacga ggcaaaaatg tcggattgag atcgtaaaag ttcgctcgtc 3301 gtcttttgct ttgtgattgt tttcatggat acatcttgct ggatatttaa attttagtac 3361 tatgtataag atattctata aatgttttat cacccaaacc tgttagcgcc ttcttaattc 3421 tattcaatct ggcttttgct ctgagactac ttcttggact ttcactactt gttagttata 3481 cggaatttgt gtaattagaa gtgaaataat cctttctatt agtaatgcaa acaaaaatca 3541 attggaaagc aaatttacac atacttgctg tatcgccttc gactatcttt tcattgcata 3601 ccatgatttt agacgtttat acttaagcaa ttaaaaggtt ttgattcaat cataaacata 3661 attatccttg ataaaaaaag aattatacac attgttctct ttatttgact tcgaactgtt 3721 taacatcgaa acggtcagat gatacaccca ttcctccaat gtaatccctg gcttcttggg 3781 caagctt // LOCUS CP7CPL 1470 bp ds-DNA PHG 19-JUL-1990 DEFINITION Bacteriophage Cp-7 muramidase (cpl7) gene. ACCESSION M34779 KEYWORDS muramidase. SOURCE Bacteriophage Cp-7 [from S.pneumoniae] DNA. ORGANISM Bacteriophage Cp-7 Viridae; Nonclassified viruses. REFERENCE 1 (bases 1 to 1470) AUTHORS Garcia,P., Garcia,J.L., Garcia,E., Sanchez-Puelles,J.M. and Lopez,R. 
TITLE Modular organization of the lytic enzymes of Streptococcus pneumoniae and its bacteriophages JOURNAL Gene 86, 81-88 (1990) STANDARD simple staff_entry FEATURES from to/span description pept 286 1314 muramidase BASE COUNT 483 a 270 c 318 g 399 t ORIGIN 1 cagctggaca ggcttaaaag gagttatcaa acataccctt acattcattt tttactactt 61 tgtagcggta ttcttgacct atattcacgc tatggcagtc ggtcagattt tgctggttat 121 cattaactta tactatgctt tgtcaatcat ggaaaatctt gctgttatgg gtgtatttat 181 tcccaagttt atgacggcaa gggtgcaaga agagttacag aaatacacag cacaactaga 241 cgcagggaaa gacctgctag aagaatttaa aggagaaaag aaataatggt taagaaaaat 301 gatttatttg tagacgttgc aagccatcaa ggctacgaca tttcaggaat tttagaagaa 361 gcagggacaa caaacacaat tattaaagtg tcagaaagta caagctattt aaacccttgc 421 ttgtctgctc aagtgagcca gtcaaatcct atcgggtttt atcattttgc ttgctttggt 481 ggaaatgaag aagaagcaga agcagaagca cgctatttcc ttgataacgt gcctacacaa 541 gttaaatacc ttgtactaga ttatgaagac catgcaagcg caagcgtaca aagaaacact 601 accgcgtgct tacgctttat gcaaatgatc gcagaagctg gatatacacc tatttattat 661 agttacaaac cgtttacgct tgataatgtg gactatcagc agattttagc acagttccct 721 aattctctat ggattgcagg ctatggctta aatgatggta cagctaactt tgaatacttt 781 ccaagcatgg acggtatcag atggtggcaa tattctagta acccgtttga caagaatatt 841 gtactgttag atgatgagaa agaagataat ataaacaatg aaaacactct aaaaagcctt 901 accacagtag ccaacgaggt cattcaggga ctttggggca acggtcaaga acgttatgac 961 agtttagcga atcgagggta tgacccccaa gcggttcaag acaaagtgaa tgaaatctta 1021 aacgctagag aaattgcaga ccttaccaca gtagccaacg aggtcattca gggactttgg 1081 ggcaacggtc aagaacgtta tgacagttta gcgaatcgag ggtatgaccc ccaagcggtt 1141 caagacaaag tgaatgaaat cttaaacgct agagaaattg cagaccttac cacagtagcc 1201 aacgaggtca ttcagggact ttggggcaac ggtcaagaac gttatgacag tttagcgaat 1261 cgagggtatg acccccaagc ggttcaagac aaagtgaatg aattactttc ataacaagta 1321 aaagctagta gaaattttct actagctatt tttatattct gctatgattt tataggcgtc 1381 ctcatctggg ttatccagag caatggagca aatggcagac aggacagctg ttcatctgat 1441 tgtatttctg taaatagtga ttttctagct // LOCUS CP9CPL 
1253 bp ds-DNA PHG 19-JUL-1990 DEFINITION Bacteriophage Cp-9 muramidase (cpl9) gene. ACCESSION M34780 KEYWORDS muramidase. SOURCE Bacteriophage Cp-9 [from S.pneumoniae] DNA. ORGANISM Bacteriophage Cp-9 Viridae; Nonclassified viruses. REFERENCE 1 (bases 1 to 1253) AUTHORS Garcia,P., Garcia,J.L., Garcia,E., Sanchez-Puelles,J.M. and Lopez,R. TITLE Modular organization of the lytic enzymes of Streptococcus pneumoniae and its bacteriophages JOURNAL Gene 86, 81-88 (1990) STANDARD simple staff_entry FEATURES from to/span description pept 52 1071 muramidase BASE COUNT 403 a 213 c 294 g 343 t ORIGIN 1 agtagacgca ggaaaagacc tgctagaaga atttaaagga gaaaagaaat aatggttaag 61 aaaaatgatt tatttataga cgtatcaagc cacaacggtt acgatataac aggaatttta 121 gagcagatgg gaacaacaaa cacgattgtt aaaatctcag aaagtacgac ctatttaaac 181 ccttgcttgt ctgctcaagt ggaacagtct acccctattg gcttttatca cttcgcacgc 241 tttggcggag acgtagcaga agctgaaaga gaagcgcagt ttttccttga caacgtgcct 301 acacaagtta aataccttgt attggactat gaagacgacc caagcggaaa cgcacaagcc 361 aacactaacg catgcttacg ctttatgcag atgattgcag acgctggata tacacctatt 421 tattatagtt ataaaccttt cacgcttgat aatgtggact atcagcagat tttagcacag 481 ttccctaatt ctctctggat tgcagggtat ggcttgaatg atggaaacgc tgattttgaa 541 tattttccat ctatggacgg gataagatgg tggcagtatt ctagtaaccc gtttgacaag 601 aatattgtac tgttagacga tgaagaagac gaaaagccaa agactgctgg aacgtggaaa 661 caagacagta agggctggtg gttcagacgc aataacggta gtttccctta taataaatgg 721 gaaaaaatcg ggggtgtgtg gtactacttc gatagtaaag gatattgctt aacgagcgaa 781 tggctcaaag ataatgaaaa atggtactac ctcaaggaca acggcgctat ggtgactggt 841 tgggtgctag tcgggtcaga gtggtattat atggacgatt caggtgcaat ggttactggt 901 tgggtcaaat acaagaataa ctggtactat atgacaaatg aacgtggtaa catggtttct 961 aatgaattta ttaaatctgg aaaaggctgg tatttcatga acacaaacgg agagcttgca 1021 gacaatccaa gctttacaaa agaaccagac ggacttataa cggtagcata aaaagaaaag 1081 ctagtagaaa ctttctacta gctgttttta tattctgcaa tgattttata agcgtcttcg 1141 tctgggttgt ccagagcgat ggagcagatg gcagacagaa ccgctgttca tctgattgta 
1201 tttctgtagg tagtgatttt ctaggctgtt atgttgctga tgtgctttat acc // LOCUS YSCTY31A 5510 bp ds-DNA PLN 19-JUL-1990 DEFINITION S.cerevisiae Ty3-1 retrotransposon integrase gene, complete cds, and Cys-tRNA gene. ACCESSION M34549 KEYWORDS integrase; transfer RNA-Cys; transposable element; transposon. SOURCE S.cerevisiae (strain AB950) DNA. ORGANISM Saccharomyces cerevisiae Eukaryota; Plantae; Thallobionta; Eumycota; Hemiascomycetes; Endomycetales; Saccharomycetaceae. REFERENCE 1 (bases 1 to 5510) AUTHORS Hansen,L.J. and Sandmeyer,S.B. TITLE Characterization of a transpositionally active Ty3 element and identification of the Ty3 integrase protein JOURNAL J. Virol. 64, 2599-2607 (1990) STANDARD unannotated staff_entry COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by S.B.Sandmeyer, 24-MAY-1990. FEATURES from to/span description pept 536 1408 integrase tRNA 105 31 (c) Cys-tRNA mRNA 343 > 5510 integrase mRNA site 121 462 5' sigma element site 5132 5471 3' sigma element rpt 121 128 5' inverted terminal repeat rpt 455 462 3' inverted terminal repeat rpt 5132 5139 5' inverted terminal repeat rpt 5464 5471 3' inverted terminal repeat rpt 116 120 5' insertion target sequence rpt 463 467 3' insertion target sequence rpt 5127 5131 5' insertion target sequence rpt 5472 5476 3' insertion target sequence BASE COUNT 1955 a 1306 c 919 g 1330 t ORIGIN 1 aactttcatg gaaggaccac ctagttaata aaaagctcgc actcaggatc gaactaagga 61 ccaacagatt tgcaatctgc tgcgctacca ctgcgccata cgagcttgat tttctgaaag 121 tgttgtatct caaaatgaga tatgtcagta tgacaatacg tcaccctgaa cgttcataaa 181 acacatatga aacaacctta taacaaaacg aacaacatga gacaaaaccc gaccttccct 241 agctgaacta cccaaagtat aaatgcctga acaattagtt tagatccgag attccgcgct 301 tccaccactt agtatgattc atattttata taatatataa gataagtaac attccgtgaa 361 ttaatctgat aaactgtttt gacaactggt tacttcccta agactgttta tattaggatt 421 gtcaagacac tccggtatta ctcgagcccg taatacaaca cctggtagcg ttaaaggtta 481 ctaattgttc aaacgaacca tcgaaaagcc gaacctagct acaccacacc 
ccagtatgag 541 ctttatggat caaatcccag gaggaggaaa ttatccaaaa ctcccagtag aatgccttcc 601 taacttcccg atccaaccat ctttgacctt cagaggtaga aatgactcgc ataaactgaa 661 aaactttatc tccgaaataa tgttaaacat gtctatgata tcttggccga atgatgccag 721 tcgtattgtg tactgcagaa gacatttatt aaaccccgct gctcagtggg ctaatgactt 781 tgtacaagaa caaggtatac ttgaaataac attcgacaca ttcatacaag gattatatca 841 gcatttctat aagccaccag atatcaataa aatctttaat gcaatcacgc aactttccga 901 agctaaactt ggtattgagc gtctcaacca acgattcaga aagatttggg acagaatgcc 961 accagacttc atgaccgaaa aagctgccat aatgacatat actaggctat tgacaaagga 1021 aacctataat attgtcagaa tgcacaaacc agagacatta aaagacgcca tggaagaggc 1081 ttaccagaca actgcactaa ctgaaagatt cttcccagga ttcgaacttg atgctgatgg 1141 agacactatc atcggtgcca caacccactt acaagaagaa tacgactctg actatgattc 1201 agaagataat ctgacccaga atggatacgt ccataccgta aggacaagaa gatcttacaa 1261 taaaccaatg tcaaatcatc gaaacaggag aaataacaac ccatctagag aagaatgtat 1321 aaaaaatcgg ctatgcttct attgtaagaa agagggacat cgcctgaacg aatgtagagc 1381 acgtaaggcg agttctaacc gatcttgaac tcgaatcaaa agaccaacaa actcctttta 1441 tcaaaacctt accaattgta cactatatcg ccatccccga gatggacaat accgccgaaa 1501 aaaccataaa aatacaaaac acgaaagtaa aaaccctgtt tgacagtgga tcacccacgt 1561 catttatccg aagagatatt gtagaacttc tcaaatacga aatctacgag acccctccac 1621 tccgttttag aggattcgta gccaccaaat ccgccgttac atccgaagca gtcaccattg 1681 acctcaaaat caatgacctg catataactt tagccgcgta catactggat aacatggact 1741 accaattgtt aattggaaat ccaatcttac gccgctaccc gaaaatcctg cacacagtac 1801 tgaataccag agagagcccc gactccttaa agcccaagac ttatcgctcc gaaaccgtta 1861 ataacgttag aacctactcc gctggtaatc gtggtaaccc cagaaacata aaactgtctt 1921 ttgcccccac cattctcgaa gcaactgacc cgaaatccgc tggtaatcgt ggtgactcca 1981 gaaccaaaac cctgtctctt gcaaccacta ctcctgcagc aattgacccg cttacgaccc 2041 ttgataaccc aggtagtact caaagtacat ttgcgcaatt cccgatacct gaagaagcga 2101 gcatcctaga agaggatgga aaatactcca acgttgtctc aaccattcag agtgtagaac 2161 ctaatgctac tgatcacagc aataaggaca ccttttgcac tttgccagtt tggttacaac 2221 
agaagtatag agagatcata cgtaatgatc tcccaccaag acctgccgac attaataaca 2281 tccccgtaaa acatgatatt gaaattaaac ctggcgcaag actacctcga ctacagccat 2341 accatgttac agaaaagaac gaacaagaaa tcaacaaaat agttcaaaaa ctgctcgata 2401 acaagttcat tgttccctca aagtcgcctt gcagctcccc tgtagtcctc gtcccgaaga 2461 aagacggtac cttccgactc tgcgtcgatt accgcaccct gaacaaagct accatctccg 2521 acccattccc attacccaga atcgacaacc tattgagccg tattggaaat gcccagatat 2581 ttaccacgct agatttgcat agtggttacc accagatccc gatggaaccc aaagaccgct 2641 acaaaaccgc ctttgtcaca ccatccggta agtatgaata taccgtcatg ccatttggct 2701 tagtcaatgc acctagtaca ttcgcaagat acatggctga tacatttaga gacctgagat 2761 tcgtcaatgt ttaccttgat gatatattaa tattctccga atctccagaa gaacattgga 2821 aacatttaga cacggtacta gaaagattaa agaacgagaa cctcattgtt aagaagaaaa 2881 aatgtaaatt tgcatctgaa gaaactgagt ttttaggcta tagtattgga atccagaaaa 2941 tagctccact acagcacaaa tgtgcagcaa tccgagactt tccgacgcct aaaacagtaa 3001 aacaagcaca gagattttta ggaatgatta attactacag acgattcatt ccaaattgct 3061 ccaagattgc acagccaatc caactgttta tttgtgacaa aagtcaatgg acagaaaaac 3121 aagacaaggc aattgataaa ctaaaagacg ccttgtgtaa ctcccccgtc ctagtaccat 3181 tcaacaacaa agcaaactac cgacttacaa cagacgcctc aaaagacggc attggtgctg 3241 ttctagaaga agtcgacaac aagaacaaac ttgttggtgt cgtcggttac ttctctaaat 3301 ccttagagag tgcccagaaa aactatcctg ctggcgaatt agaactactt ggaattatca 3361 aagcactcca ccacttccga tatatgcttc acggaaagca tttcacgtta agaacagacc 3421 acattagttt gttatcatta caaaacaaga acgaacccgc acgacgcgtg caacgctggt 3481 tagatgacct agccacatat gacttcacct tagaatacct agctggaccc aagaacgttg 3541 tcgcagatgc catatcccgt gccgtatata ctataacccc cgaaacatcc cgacctatcg 3601 acacagaaag ctggaaatct tactacaaat cagacccatt atgtagtgct gtcttaattc 3661 atatgaaaga attgacacaa cacaacgtca cacctgaaga tatgtcagcc ttccgtagtt 3721 accagaagaa actcgaacta tcagagacct tccgaaagaa ttattcccta gaagacgaaa 3781 tgatctatta ccaagaccga ctagtagtac caataaaaca acagaacgca gttatgagac 3841 tatatcatga ccatacctta tttggaggac attttggtgt aacagtgacc cttgcgaaaa 3901 tcagcccaat 
ttactattgg ccaaaattac aacattcgat catacaatac atcaggacct 3961 gcgtacaatg tcaactaata aaatcacacc gaccacgctt acatggacta ttacaaccac 4021 tccctatagc agaaggaaga tggcttgata tatcaatgga ttttgtgaca ggattacccc 4081 cgacatcaaa taacttgaat atgatcctcg tcgtagttga tcgtttttcg aaacgcgctc 4141 acttcatagc tacaaggaaa accttagacg caacacaact aatagatcta ctctttcgat 4201 acattttttc atatcatggt tttcccagga caataaccag tgatagagat gtccgtatga 4261 ccgccgacaa atatcaagaa ctcacgaaaa gactaggaat aaaatcgaca atgtcttccg 4321 cgaaccaccc ccaaacagat ggacaatccg aacgaacgat acagacatta aacaggttac 4381 taagagccta tgcttcaacc aatattcaga attggcatgt atatttacca caaatcgaat 4441 ttgtttacaa ttctacacct actagaacac ttggaaaatc accatttgaa attgatttag 4501 gatatttacc gaatacccct gctattaagt cagatgacga agtcaacgca agaagtttta 4561 ctgccgtaga acttgccaaa cacctcaaag cccttaccat ccaaacgaag gaacagctag 4621 aacacgctca aatcgaaatg gaaactaata acaatcaaag acgtaaaccc ttattgttaa 4681 acataggaga tcacgtatta gtgcatagag atgcatactt caagaaaggt gcttatatga 4741 aagtacaaca aatatacgtc ggaccatttc gagttgtcaa gaaaataaac gataacgcct 4801 acgaactaga tttaaactct cacaagaaaa agcacagagt tattaatgta caattcctga 4861 aaaagtttgt ataccgtcca gacgcgtacc caaagaataa accaatcagc tccactgaaa 4921 gaattaagag agcacacgaa gttactgcac tcataggaat agatactaca cacaaaactt 4981 acttatgtca catgcaagat gtagacccaa cactttcagt agaatactca gaagctgaat 5041 tttgccaaat tcccgaaaga acacgaagat caatattagc caactttaga caactctacg 5101 aaacacaaga caaccctgag agagaggaag atgttgtatc tcaaaatgag atatgtcagt 5161 atgacaatac gtcaccctga acgttcataa aacacatatg aaacaacctt ataacaaaac 5221 gaacaacatg agacaaaacc cgaccttccc tagctgaact acccaaagta taaatgcctg 5281 aacaattagt ttagatccga gattccgcgc ttccaccact tagtatgatt catattttat 5341 ataatatata agataagtaa cattccgtga attaatctga taaactgttt tgacaactgg 5401 ttacttccct aagactgttt atattaggat tgtcaagaca ctccggtatt actcgagccc 5461 gtaatacaac agaaagttcc attttggatg ctctatttat gggaatatga //